CN100409241C - Information searching method and system based on searching engine - Google Patents

Information searching method and system based on searching engine Download PDF

Info

Publication number
CN100409241C
CN100409241C CNB2006101132499A CN200610113249A CN100409241C CN 100409241 C CN100409241 C CN 100409241C CN B2006101132499 A CNB2006101132499 A CN B2006101132499A CN 200610113249 A CN200610113249 A CN 200610113249A CN 100409241 C CN100409241 C CN 100409241C
Authority
CN
China
Prior art keywords
search
database
input information
information
entity
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CNB2006101132499A
Other languages
Chinese (zh)
Other versions
CN1936896A (en
Inventor
周枫
庄莉
李伟
李志恒
李魁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NET EASE YOUDAO INFORMATION TECHNOLOGY (BEIJING) Co Ltd
Original Assignee
NET EASE YOUDAO INFORMATION TECHNOLOGY (BEIJING) Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NET EASE YOUDAO INFORMATION TECHNOLOGY (BEIJING) Co Ltd filed Critical NET EASE YOUDAO INFORMATION TECHNOLOGY (BEIJING) Co Ltd
Priority to CNB2006101132499A priority Critical patent/CN100409241C/en
Publication of CN1936896A publication Critical patent/CN1936896A/en
Application granted granted Critical
Publication of CN100409241C publication Critical patent/CN100409241C/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Character of the method includes following points: triggering local program or script program of searching web pages; monitoring information on search bar inputted by user, and sending the information to the search server instantly; carrying out match between the said input information and database of the search server, and returning the matched instant result back to the local program or script program; displaying the instant result on hint bar of the current search page. Using search steps for query determined through comparing simplified results, the invention increases searching speed, and makes more intuitive operations for users. Introducing fuzzy query technique and each prompting message, the invention can carry out search even user is unable to input query completely so as to help user to do query and operation further. The invention is applicable to computers, handsets and information household electric appliances. Features are: low cost and no special secret algorithm.

Description

A kind of information retrieval method and searching system based on search engine
Technical field
The present invention relates to information retrieval field, particularly relate to a kind of information retrieval method and searching system based on search engine.
Background technology
Search engine (Search Engines) is one the information resources on the internet is collected arrangement that supply the system of user inquiring then, the work of search engine generally comprises following three processes: 1, find, collect info web in the internet; 2, information is extracted and tissue is set up index database; 3, again by the key word of the inquiry of searcher, in index database, detect document fast, carry out the degree of correlation evaluation of document and inquiry, the result that will export is sorted, and Query Result is returned to the user according to user's input.It is one for people provide the website of information retrieval service, it uses some program that all information categorizations on the Internet are searched needed information to help people in that boundless and indistinct net is marine.
Early stage search engine is the address collection of the Resource Server in the Internet, is divided into different catalogues by the type difference of its resource that provides, and classifies from level to level again.People will look for the information of oneself wanting to enter from level to level by their classification, just can arrive the destination at last, find the information of oneself wanting.This is the most original mode in fact, only is applicable to when internet information is also few.Along with internet information increases by how much formulas, search engine has truly appearred, these search engines are known the beginning of each page on the website, search for all hyperlinks on the Internet subsequently, and a database put in all vocabulary of representing hyperlink.The present prototype of search engine that Here it is.Present search engine has not been the information of simple search and webpage, their synthesizations more that become, and perfection has been changed.
Though along with the development of search engine technique, the resonable degree of its sort result improves day by day, most of search service is keeping similarly always, needs the mode of operation of a plurality of steps.For example, search " Tsing-Hua University " this vocabulary just has millions of pages result, and the user goes for own needed Search Results often needs to carry out following operation: (a) in search column input " Tsing-Hua University "; (b) confirm input; (c) in searched page, search the Search Results that needs; (d) click this Search Results and obtain relevant information.For the inquiry that some results relatively determine, when only being to search for the website of Tsing-Hua University such as the search purpose of searching for " Tsing-Hua University " user, above-mentioned search procedure is then too loaded down with trivial details and redundant.
In addition; the user runs into the situation that Chinese character can not be imported or the foreign language word can not be spelt through regular meeting, at this moment can't finish input, and perhaps sometimes the user can't describe oneself inquiry accurately; at this time he wishes to obtain prompting, but search engine does not but give a hand.For example, in existing search engine the input "? " perhaps other symbols, then existing search engine is directly ignored it, other input informations is searched for display of search results.
In a word, in the existing information retrieval technique,, simplify the search step in the relatively more definite inquiry of result, improve search speed especially in the networked information retrieval field; And in search engine, introduce Fuzzy Query Technology, all be the technical matters that those skilled in the art press for solution.
Summary of the invention
Technical matters to be solved by this invention provides a kind of information retrieval method based on search engine, under the situation that Search Results is determined, return instant result and information, omit search step, especially under the prerequisite that improves search efficiency, in search engine, introduce Fuzzy Query Technology, make things convenient for the inquiry of user in the time can't finishing input.
Another object of the present invention is that above-mentioned search method is applied in the reality, and a kind of information retrieval system based on search engine is provided, in order to realization and the application that guarantees above-mentioned search method.
For solving the problems of the technologies described above, the invention provides a kind of information retrieval method based on search engine, comprising:
Trigger the shell script of local program or searched page; Monitoring user is sent to search server immediately at the input information of search column; Input information is mated in the database of search server, and the instant result that coupling is obtained returns described local program or shell script; Show instant result in the prompt column on current searched page.
Preferably, described instant result is an entity information, and described entity information comprises entity title and entity network address, object or the notion of described entity for being of practical significance in actual life.
Preferably, when described input information comprised asterisk wildcard, described coupling step comprised:
(1) be that prefix is mated in the everyday words database of search server with the input information before the asterisk wildcard;
(2) be that suffix mates in described everyday words database with the input information behind the asterisk wildcard;
(3) the identical everyday words that coupling in (1), (2) is obtained is returned local program or shell script.
Preferably, described coupling step also comprises: described everyday words is mated in the instant result database of search server, and the instant result that coupling is obtained returns local program or shell script.
Preferably, when described input information was the combination of Chinese and letter, described coupling step comprised: Chinese is converted into phonetic alphabet, forms pinyin string with letter; Described pinyin string is mated in the phonetic-entity data bak of search server; The instant result that coupling in phonetic-entity data bak is obtained returns local program or shell script.
Preferably, when described input information is numeral or phonetic, described coupling step comprises: described numeral or phonetic are mated in the numeral-entity data bak of search server or phonetic-entity data bak, and the instant result that coupling is obtained returns local program or shell script.
Preferably, described method also comprises: be called prefix with the physical name among the instant result and mate in the everyday words database of search server, the everyday words that coupling obtains is returned local program or shell script; Show everyday words in the prompt column on current searched page.
Preferably, described method also comprises: be that prefix is mated in the historical search speech database of search server with the input information, the historical search speech that coupling is obtained returns local program or shell script; Show the historical search speech in the prompt column on current searched page.
Preferably, described method also comprises: input information is mated in user's bookmark database of search server, user's bookmark and classified information thereof that coupling obtains are returned shell script; Explicit user bookmark and classified information thereof in the prompt column on current searched page.
Preferably, described method, also comprise:, then will from the everyday words database of search server, mate the everyday words the most close that obtains and return local program or shell script as error correction term with input information if do not obtain the instant result and the everyday words of any coupling; Show error correction term in the prompt column on current searched page.
When described input information was the Chinese abbreviation, preferred, described coupling step comprises: input information is mated in the abbreviation-entity data bak of search server, and the entity information that coupling is obtained returned described local program or shell script; And/or, input information is mated in the abbreviation-entity namebase of search server, the entity title that coupling obtains is returned described local program or shell script.
Preferably, described method also comprises: preset the popular search key; When input information was described popular search key, the top search term database of match search server returned the top search term in the database to local program or shell script; Show top search term in the prompt column on current searched page.
Preferably, when the user by cell phone keyboard in search column during input information, this method also comprises: the combination of numbers of cell phone keyboard input is converted into monogram; Described monogram is mated in the english database of search server, and the english that coupling is obtained is returned the shell script of the local program or the mobile phone searching page, shows english in the prompt column on the current search page; Perhaps, described monogram is mated in the everyday words database of search server, the monogram commonly used that coupling is obtained returns the shell script of the local program or the mobile phone searching page, shows monogram commonly used in the prompt column on the current search page; Perhaps, described monogram is mated in the phonetic-everyday words database of search server, the Chinese everyday words that coupling is obtained is returned the shell script of the local program or the mobile phone searching page; Chinese display everyday words in the prompt column on the current search page.
The invention also discloses a kind of information retrieval system, comprising based on search engine:
Trigger element is used to trigger the shell script of local program or searched page;
Monitor unit is used for the input information of monitoring user at search column, and is sent to search server immediately;
Search server comprises that interface subelement and instant result mate subelement, and wherein: the interface subelement is used to receive described input information, and returns occurrence to described local program or shell script; Instant result is mated subelement and is used for searching instant result as occurrence according to input information at instant result database;
Display unit, the prompt column that is used on current searched page shows occurrence.
Preferably, described instant result database is entity information database, abbreviation-entity data bak, phonetic-entity data bak or numeral-entity data bak; Described entity information comprises entity title and entity network address, object or the notion of described entity for being of practical significance in actual life.
Preferably, described search server also comprises:
Asterisk wildcard recognin unit is used for discerning the asterisk wildcard of input information, is that the boundary is divided into two parts with the asterisk wildcard with input information;
Fuzzy query coupling subelement is used to finish following action:
(1) be that prefix is mated in the everyday words database of search server with the input information before the asterisk wildcard;
(2) be that suffix mates in described everyday words database with the input information behind the asterisk wildcard;
(3) the identical everyday words that coupling in (1), (2) is obtained is as occurrence.
Preferably, described fuzzy query coupling subelement also is used to finish following action:
Described everyday words is mated in the instant result database of search server, and the instant result that coupling is obtained is as occurrence.
Preferably, described system also comprises: conversion unit is used for when described input information is the combination of Chinese and letter the Chinese in the input information being converted into phonetic alphabet; And/or the combination of numbers that cell phone keyboard is imported is converted into monogram.
Preferably, described occurrence also comprises historical search speech, user's bookmark and classified information, error correction term, top search term or english.
Preferably, described search server also comprises phonetic-everyday words database, historical search speech database and user's bookmark database, top search term database or english database.
Compared with prior art, the present invention has the following advantages:
At first, the present invention is by mating input information in the database of search server, and the instant result that coupling is obtained returns local program or shell script, shows instant result then in the prompt column on current searched page.In this case, by reducing execution in step, improved user's search efficiency.And, can make user's operation more directly perceived because the result shows at current page immediately.
Secondly, the instant result that the present invention returns is the instant result of entity, and instant result is more definite with respect to webpage, thereby under the prerequisite that improves search efficiency, has also guaranteed user search result's accuracy to a certain extent.
Moreover the present invention introduces Fuzzy Query Technology, by the identification to asterisk wildcard, can make the user finishing when input, still can search for, even do not obtain mating as a result the time, the user can also obtain error correcting prompt; And the present invention can also carry out character to input information and transform, and to guarantee system input information is discerned more accurately.
In addition, except that instant result, the present invention can also show all kinds of informations at the current search page, to make things convenient for user further inquiry and operation.
In a word, the present invention can be used for various information tools such as computer, mobile phone, information household appliances, the user only need be according to the existing operation technique operation of search engine, just can obtain instant result of entity and information accurately, and directly on current page, browse or operate and get final product, use directly perceived and hommization; For the service provider, technology realizes simple, and no technology barrier does not have special secret algorithm, and cost is lower.
Description of drawings
Fig. 1 is the process flow diagram of a kind of information retrieval method based on search engine of the present invention;
Fig. 2 is the structured flowchart of a kind of information retrieval system based on search engine of the present invention;
Fig. 3 is the process flow diagram that system shown in Figure 2 realizes the information retrieval step;
Fig. 4 is the process flow diagram of system when input information is the combination of Chinese and letter shown in Figure 2;
Fig. 5 is the synoptic diagram of entity information of the present invention when comprising entity title, entity brief introduction and entity network address;
Fig. 6 is the synoptic diagram that the present invention returns instant result and everyday words.
Embodiment
For above-mentioned purpose of the present invention, feature and advantage can be become apparent more, the present invention is further detailed explanation below in conjunction with the drawings and specific embodiments.
With reference to Fig. 1, be the process flow diagram of a kind of information retrieval method based on search engine of the present invention, may further comprise the steps:
The shell script of step 101, triggering local program or searched page;
Step 102, monitoring user are sent to search server immediately at the input information of search column;
Step 103, input information is mated in the database of search server, the instant result that coupling is obtained returns described local program or shell script;
Show instant result in step 104, the prompt column on current searched page.
When using search engine, the user is by clicking search column or using other triggering modes can trigger the shell script of local program or searched page, in case the user begins input, this local program or shell script will monitoring user input equipment, such as keyboard, handwriting pad etc., as long as monitor the input of non-control character or preset key, this local program or shell script can send to search server with this input information immediately, search server mates this input information in instant result database, if corresponding instant result is arranged then should instant result return to local program or shell script, and show in the prompt column on current page.For example, the user is in search column input " Tsing-Hua University ", the input that the shell script of local program or searched page monitors the user is just mated the search server that input information " Tsing-Hua University " sends to search engine, it is the homepage of Tsing-Hua University that search server matches corresponding instant result at instant result database, just shows the network address of Tsing-Hua University in the prompt column of current page " http://www.tsinghua.edu.cn "If this Search Results meets user's search purpose, if the user in prompt column by select to determine directly to visit the website of Tsing-Hua University by enter key, click or alternate manner; If this Search Results does not meet user's search purpose, the user then not be used in and selects in the prompt column to determine, selects search operation can obtain other Search Results after input is finished.Certainly, for how carrying out search operation under the situation that does not meet the user search purpose in mentioned above searching results, get final product according to the search operation that generally uses now, the present invention does not need this to be limited.
Need to prove that preferred, the instant result who shows in prompt column is meant entity information, entity information comprises entity title and entity network address, and entity is meant object or the notion that is of practical significance in actual life.Search server mates at instant result database according to user's input information, often can match mass data, search server is according to predetermined conditions,, the frequency of occurrences the most high conditional filtering the highest such as clicking rate goes out entity information, and it is returned local program or shell script as the instant result corresponding with this input information.
Wherein, screening conditions can be provided with according to demand voluntarily by the service provider, such as the search service provider that the bid ranking service is provided, screening conditions can be made as the highest entity information of bid, in addition, can also the screening control be set in client and according to demand screening conditions be set voluntarily by the user.Certainly, for how screening conditions are set, can adopt the method for prior art to realize that the present invention does not need this to be limited.
The preferred embodiment of the present invention selects the reason of entity information to be: at first, by the statistical query to a large amount of searching record of search engine, find wherein to exist a large amount of inquiries all about entity information; Secondly, entity information has stronger determinacy as a result, because entity information is object or notion about being of practical significance in actual life, has more certain corresponding relation, and, the user does not need to open this webpage and further browses, and just can determine the result whether this entity information is inquired about.Therefore, the preferred embodiment of the present invention selects entity information as instant result, both can simplify the step of user search, can guarantee that again instant result's accuracy is higher.
Be that example illustrates instant result of the present invention to return the highest entity information of clicking rate below, so that those skilled in the art understand better.Such as, input " Netease ", search server filters out the highest entity information of clicking rate in instant result database be the homepage of company of Netease, so show " Netease in the prompt column of current page in the prompt column of current page Www.163.com".Preferably, entity information can also comprise the entity brief introduction, and then when input " Netease ", the instant result who returns as shown in Figure 5.Certainly, described entity brief introduction also can appear at hereinafter in all entity informations.
Under the situation that Search Results is relatively determined, search method of the present invention has been omitted unnecessary search step, thereby has improved user's search efficiency.And, can also make user's operation more directly perceived because the result shows at current page immediately.
Generally comprise Chinese, numeral or alphabetical in user's the input information.When input information was Chinese, if search server matches an entity information in the just in time corresponding instant result database of this Chinese, then returning this entity information was instant result.Possible is, input information is the Chinese abbreviation, and may be a plurality of abbreviations of an entity correspondence, such as input " Beijing University " (abbreviation of Peking University), perhaps " National People's Congress ", " People's University " (abbreviation of the Renmin University of China), in this case, can also dispose entity title abbreviation-entity data bak in the search server, in this database, the abbreviation of entity title forms corresponding relation with entity, search server mates in described entity title abbreviation-entity data bak according to this abbreviation, if can match a corresponding entity information, then returning this entity information is instant result.Such as, the instant result that " Beijing University " returns is " a Peking University Http:// www.pku.edu.cn"; The instant result that " National People's Congress ", " People's University " return is " the Renmin University of China Http:// www.ruc.edu.cn".
Preferably, server end can also dispose one by the abridge database of complete entity title of entity title, when the user imports abbreviation, returns the complete name of this abbreviation corresponding entity.For example: when the user is input as " Beijing University ", return complete query word " Peking University ", " Peking University " shown in search column as an everyday words.
When input information is numeral, also dispose numeral-entity data bak in the search server, in this database, digital or specific combination of numbers and entity form man-to-man corresponding relation, search server mates in numeral-entity data bak according to the numeral of input, if can match a corresponding entity information, then returning this entity information is instant result.For example, input " 163 " is returned instant result and is " Netease Www.163.com".
When input information was letter or monogram, if match a entity information in the just in time corresponding instant result database of this letter or monogram, then returning this entity information was instant result.For example, input " QQ " is returned instant result for " to rise fast net Www.qq.com".
When input information is phonetic, also dispose phonetic-entity data bak in the search server, in this database, one or a spelling sound and entity form man-to-man corresponding relation, and described phonetic comprises spelling, first letter of pinyin (simplicity) or the combination of spelling simplicity.Search server mates in described phonetic-entity data bak, if can match a corresponding entity information, then returning this entity information is instant result.For example, input " wangyi " or " wy " is returned instant result and is " Netease Www.163.com".In some cases, if the instant result of phonetic correspondence conflicts mutually with alphabetical corresponding instant result, then preferentially return the corresponding instant result of letter.
The user runs into through regular meeting when input information can not import the situation that input can't be finished in Chinese character etc.; for example; the user wants to visit the homepage of Netease; but can not import " easily " word; so the user is at search column input " net yi "; yet the mode that existing search engine is handled this combination is to be that key word is searched for Chinese " net " and monogram " yi " respectively, all offers the user searching for the result who obtains respectively.This is just far from each other with user's search purpose.And, also be unfavorable for searching and inquiry further of user.When the present invention is the combination of Chinese and letter at this input information of processing, earlier Chinese is converted into phonetic alphabet, with letter composition pinyin string; Wherein, described pinyin string comprises the combination of spelling, the combination of first letter of pinyin (simplicity) or the combination of spelling simplicity.Disposal route when being phonetic according to input information is then mated described pinyin string in the phonetic-entity data bak of search server; If can match a corresponding entity information, then returning this entity information is instant result.For example, input " net yi ", system is translated into " wangyi " earlier, and the disposal route when being phonetic according to input information is then returned instant result and is " Netease Www.163.com".Therefore, the present invention for input information for the processing of combination of Chinese and letter made things convenient for the user in the time can not importing Chinese character inquiry and saved query steps, under the prerequisite that Search Results is determined, improved the accuracy of Search Results greatly.
When the user imports, might not know also how certain word or word are spelt, do not know in English word which letter this uses, what partial content in the perhaps uncertain input is, in this case, the present invention can make the user that ignorant part is replaced carrying out fuzzy query with asterisk wildcard by introducing Fuzzy Query Technology.The user imports asterisk wildcard in search column, in a single day the shell script of local program or searched page monitors in user's input and contains asterisk wildcard, search server will mate in the everyday words database according to input information, and matching process may further comprise the steps: (1) is that prefix is mated in the everyday words database with the input information before the asterisk wildcard; (2) be that suffix mates in the everyday words database with the input information behind the asterisk wildcard; The everyday words that coupling in (1), (2) is obtained or the instant result of described everyday words correspondence return local program or shell script then.Usually, asterisk wildcard can comprise two kinds: a kind of is to replace zero or a plurality of non-control character with an asterisk wildcard; Another kind is to replace a non-control character with an asterisk wildcard.Asterisk wildcard can be set to various characters as required.
Suppose to replace the asterisk wildcard of zero or a plurality of non-control characters to represent with " * " with an asterisk wildcard.If the user wants to inquire about Pekinese university, can import " Beijing * university " at search column, search server can be that prefix is mated in the everyday words database with " * " preceding " Beijing ", the result that may match is " Peking University ", " tourism of Beijing ", " Tian An-men, Beijing " waits all is the everyday words of prefix with " Beijing ", simultaneously, search server is that suffix mates in the everyday words database with " university " also, the result that may match is " Tsing-Hua University ", " Peking University ", " Nanjing University " waits all is the everyday words of suffix with " university ", then, search server is with the common factor of above-mentioned matching result, the identical everyday words that promptly matches, as " Peking University ", " University of Science ﹠ Technology, Beijing ", " Beijing Institute of Technology " etc. returns the shell script of local program or searched page, and shows this everyday words at the prompt column of the current search page.
The asterisk wildcard of supposing to replace a non-control character with asterisk wildcard with "? " expression.If the user wants to inquire about a name, the name of only knowing this people has three words, first word be " Lee ", the 3rd word is " encouraging ", do not know but what second word be, can import at search column so " Lee? encourage ", search server can be that prefix and suffix mate in the everyday words database with " Lee " and " encouraging " word respectively, obtain the common factor of matching result, the identical everyday words that promptly matches.In identical everyday words, the shell script of the everyday words of 3 characters to local program or searched page only returned in search service, is not the everyday words of 3 characters and do not return other.The everyday words of returning in this example is " Li Dazhao ".
Preferably, the present invention can also be mated in instant result database returning everyday words, an if entity information in the just in time corresponding instant result database of the everyday words of returning, then returning this entity information is instant result, such as, an entity information in the just in time corresponding instant result database of the everyday words of returning " Li Dazhao " then returns instant result for " Lee encourages memorial museum greatly at prompt column Http:// lidazhao.com/", similarly, the user is as long as determine directly to visit this website by selecting in prompt column.In this case, the present invention can also may further comprise the steps: be called prefix with the physical name among the instant result and mate in the everyday words database of search server, the everyday words that coupling obtains is returned local program or shell script; Show everyday words in the prompt column on current searched page.The everyday words that instant result that this example is returned and coupling obtain as shown in Figure 6.Preferably, instant result has precedence over the everyday words demonstration.Certainly, obtain in aforementioned arbitrary embodiment all to make the everyday words that further obtains coupling in this way on instant result's the basis, be not described in detail in this.
The present invention can help the user still can obtain the information of oneself wanting directly, exactly in the time can't finishing input by introducing fuzzy query.And this method not occupying system resources, realize simple, do not have special secret algorithm, cost is lower.
If do not obtain the instant result and the everyday words of any coupling by search method of the present invention, then will from the everyday words database of search server, mate the everyday words the most close that obtains and return local program or shell script as error correction term with input information; Show error correction term in the prompt column on current searched page then.When the user can't describe the inquiry of oneself exactly,, can help the user directly to obtain input information and Search Results accurately by the information of error correction term.
In practice, the user often can repeat inquiry to some search words, and therefore wishing in input process can be with reference to the historical search record.In order to address this problem, the present invention can also may further comprise the steps: be that prefix is mated in the historical search speech database of search server with the input information, the historical search speech that coupling is obtained returns local program or shell script; Show the historical search speech in the prompt column on current searched page.Wherein, historical search speech database can exist local side also can have server end.When the user triggers local program or shell script and begins to import, just can obtain the information of historical search speech, in case definite historical search speech is arranged, the user can select to determine to return the instant result and the everyday words of this historical search speech correspondence in prompt column.Preferably, instant result has precedence over the historical search speech and shows, the historical search speech has precedence over everyday words and shows.
If the user is accustomed to using local collection or network bookmark, the present invention can also be mated input information in local collection, and user's bookmark and classified information thereof that coupling obtains are returned local program; Perhaps, input information is mated in user's bookmark database of search server, user's bookmark and classified information thereof that coupling obtains are returned shell script; Explicit user bookmark and classified information thereof in the prompt column on current searched page then.Similarly, the user can select to determine to return the instant result and the everyday words of this user's bookmark correspondence in prompt column.In this case, same input information may corresponding a plurality of user's bookmarks, and a plurality of bookmarks can be shown to the user simultaneously, but each fixed number (such as 8 or 10) that shows, show that number can select as required.In addition, if the user does not classify to bookmark, displaying contents does not then comprise the classified information of bookmark so.
The user often wants to understand the popular information of each side such as nearest current events are dynamic, news when searching for, in order to meet consumers' demand more fully, to make and search for hommization more, and the present invention can also may further comprise the steps:
Preset the popular search key, such as, presetting space bar is the popular search key;
When input information was described popular search key, the top search term database of match search server returned the top search term in the database to local program or shell script;
Show top search term in the prompt column on current searched page.For example " Paralympic Games ", " Liu Xiang ", " NBA " or the like.
By above-mentioned everyday words, error correction term, historical search speech, top search term and user's bookmark are set, the user can carry out the input of " fool " formula fully, just can not carry out the user of any input operation at last, all can realize its search purpose.Certainly, the service supplier can also be provided with more information as required voluntarily with user-friendly, and the present invention does not need this to limit.
At any time nowadays surfing Internet with cell phone is more and more universal, and more user uses search engine by mobile phone.The present invention is applied to the surfing Internet with cell phone field with its core idea, the search operation when also having further facilitated surfing Internet with cell phone.Because cell phone keyboard is generally less, the corresponding button of a plurality of English alphabets meetings, and a button also is a numerical key usually.So, when the user during input information, can also may further comprise the steps in search column by cell phone keyboard:
The combination of numbers of cell phone keyboard input is converted into monogram;
Described monogram is mated in the english database of search server, and the english that coupling is obtained is returned the shell script of the local program or the mobile phone searching page, shows english in the prompt column on the current search page; Described english comprises English word, english abbreviation or the english of certain sense is arranged, such as " QQ ";
Perhaps, described monogram is mated in the everyday words database of search server, the monogram commonly used that coupling is obtained returns the shell script of the local program or the mobile phone searching page, shows monogram commonly used in the prompt column on the current search page; Such as, input digit " 426 " shows that letter commonly used is combined as " IBM ", " HBO " etc.;
Perhaps, described monogram is mated in the phonetic-everyday words database of search server, the Chinese everyday words that coupling is obtained is returned the shell script of the local program or the mobile phone searching page; Chinese display everyday words in the prompt column on the current search page.Described phonetic comprises the combination of spelling, initial phonetic (simplicity) and spelling simplicity.Such as, input digit " 99 " can be converted into monogram " WY ", " XW " etc., and the Chinese display everyday words is " Netease ", " news " etc.Certainly, as another embodiment, if an entity information in the corresponding instant result database of described english or monogram,
Then also returning this entity information is instant result, is not described in detail in this.
With reference to Fig. 2, be the structured flowchart of a kind of information retrieval system based on search engine of the present invention, comprise with lower member:
Trigger element 201 is used to trigger the shell script of local program or searched page;
Monitor unit 202 is used for the input information of monitoring user at search column, and is sent to search server 203 immediately;
Search server 203 comprises that interface subelement 2031 and instant result mate subelement 2032, and wherein: interface subelement 2031 is used to receive described input information, and returns occurrence to local program or shell script; Instant result is mated subelement 2032 and is used for searching instant result as occurrence according to input information at instant result database;
Display unit 204, the prompt column that is used on current searched page shows occurrence.
Wherein, described instant result database is entity information database, phonetic-entity data bak or numeral-entity data bak; Described entity information comprises entity title and entity network address, object or the notion of described entity for being of practical significance in actual life.
With reference to Fig. 3, be based on the flow chart of steps that system shown in Figure 2 realizes information retrieval, may further comprise the steps:
Step 301, trigger element 201 are clicked search column by the user or are used other triggering modes to trigger the shell script of local program or searched page;
Step 302, monitor unit 202 monitoring users be at the input information of search column, and be sent to search server 203 immediately.Described input information comprises Chinese, numeral, letter or monogram, phonetic or presets the popular search key.
The instant result of step 303, search server 203 is mated subelement 2032 described input information is being mated in instant result database, if corresponding instant result is arranged then should instant result return to local program or shell script;
The interface subelement 2031 of step 304, search server 203 receives described input information, mates in other database of search server 203, and other occurrence is back to local program or shell script;
Step 305, display unit 204 show described instant result and occurrence in the prompt column on current searched page.
Preferably, when the present invention used asterisk wildcard to carry out fuzzy query, described search server 203 also comprised:
Asterisk wildcard recognin unit 2033 is used for discerning the asterisk wildcard of input information, is that the boundary is divided into two parts with the asterisk wildcard with input information;
Fuzzy query coupling subelement 2034 is used to finish following action:
(1) be that prefix is mated in the everyday words database with the input information before the asterisk wildcard;
(2) be that suffix mates in the everyday words database with the input information behind the asterisk wildcard;
(3) the instant result of everyday words that coupling in (1), (2) is obtained or described everyday words correspondence is as occurrence.
System of the present invention preferably also can comprise:
Conversion unit 205 is used for when described input information is the combination of Chinese and letter the Chinese in the input information being converted into phonetic alphabet; And/or the combination of numbers that cell phone keyboard is imported is converted into monogram.
As shown in Figure 4, when described input information was the combination of Chinese and letter, monitor unit 202 has monitored the Chinese that is input as at search column the user, and was further comprising the steps of:
Step 401, conversion unit 205 are converted into phonetic alphabet with Chinese, form pinyin string with letter;
Step 402, search server 203 mate described pinyin string in the phonetic-entity data bak of search server;
Step 403, search server 203 return instant result and the occurrence that coupling in phonetic-entity data bak obtains to local program or shell script;
Step 404, display unit 204 show described instant result and occurrence in the prompt column on current searched page.
Preferably, described search server 203 also comprises phonetic-everyday words database, historical search speech database and user's bookmark database, top search term database or english database.Certainly, as another embodiment, described database also may reside in local side, and the present invention is not described in detail in this.Preferably, described occurrence also comprises historical search speech, user's bookmark and classified information, error correction term, top search term or english.
Above-mentioned about not detailed part in the description of system of the present invention, can be referring to the aforementioned relevant portion of this instructions.
More than to a kind of information retrieval method and searching system provided by the present invention based on search engine, be described in detail, used specific case herein principle of the present invention and embodiment are set forth, the explanation of above embodiment just is used for helping to understand method of the present invention and core concept thereof; Simultaneously, for one of ordinary skill in the art, according to thought of the present invention, the part that all can change in specific embodiments and applications, in sum, this description should not be construed as limitation of the present invention.

Claims (18)

1. the information retrieval method based on search engine is characterized in that, comprising:
Trigger the shell script of local program or searched page;
Monitoring user is sent to search server immediately at the input information of search column;
Input information is mated in the database of search server, and the instant result that coupling is obtained returns described local program or shell script;
Show instant result in the prompt column on current searched page, described instant result is an entity information, and described entity information comprises entity title and entity network address, object or the notion of described entity for being of practical significance in actual life.
2. the method for claim 1 is characterized in that, described input information comprises asterisk wildcard, and described coupling step comprises:
1) be that prefix is mated in the everyday words database of search server with the input information before the asterisk wildcard;
2) be that suffix mates in described everyday words database with the input information behind the asterisk wildcard;
3) with 1), 2) in the identical everyday words that obtains of coupling return local program or shell script.
3. method as claimed in claim 2 is characterized in that, described coupling step also comprises:
Described everyday words is mated in the instant result database of search server, and the instant result that coupling is obtained returns local program or shell script.
4. the method for claim 1 is characterized in that, when described input information was the combination of Chinese and letter, described coupling step comprised:
Chinese is converted into phonetic alphabet, forms pinyin string with letter;
Described pinyin string is mated in the phonetic-entity data bak of search server;
The instant result that coupling in phonetic-entity data bak is obtained returns local program or shell script.
5. the method for claim 1 is characterized in that, when described input information was numeral or phonetic, described coupling step comprised:
Described numeral or phonetic are mated in the numeral-entity data bak of search server or phonetic-entity data bak, and the instant result that coupling is obtained returns local program or shell script.
6. the method for claim 1 is characterized in that, also comprises:
Be called prefix with the physical name among the instant result and in the everyday words database of search server, mate, the everyday words that coupling obtains is returned local program or shell script;
Show everyday words in the prompt column on current searched page.
7. method as claimed in claim 6 is characterized in that, also comprises:
With the input information is that prefix is mated in the historical search speech database of search server, and the historical search speech that coupling is obtained returns local program or shell script;
Show the historical search speech in the prompt column on current searched page.
8. method as claimed in claim 6 is characterized in that, also comprises:
Input information is mated in user's bookmark database of search server, user's bookmark and classified information thereof that coupling obtains are returned shell script;
Explicit user bookmark and classified information thereof in the prompt column on current searched page.
9. method as claimed in claim 3 is characterized in that, also comprises:
If do not obtain the instant result and the everyday words of any coupling, then will from the everyday words database of search server, mate the everyday words the most close that obtains and return local program or shell script as error correction term with input information;
Show error correction term in the prompt column on current searched page.
10. the method for claim 1 is characterized in that, when described input information was the Chinese abbreviation, described coupling step comprised:
Input information is mated in the abbreviation-entity data bak of search server, and the entity information that coupling is obtained returns described local program or shell script;
And/or, input information is mated in the abbreviation-entity namebase of search server, the entity title that coupling obtains is returned described local program or shell script.
11. the method for claim 1 is characterized in that, also comprises:
Preset the popular search key;
When input information was described popular search key, the top search term database of match search server returned the top search term in the database to local program or shell script;
Show top search term in the prompt column on current searched page.
12. the method for claim 1 is characterized in that, when the user by cell phone keyboard in search column during input information, this method also comprises:
The combination of numbers of cell phone keyboard input is converted into monogram;
Described monogram is mated in the english database of search server, and the english that coupling is obtained is returned the shell script of the local program or the mobile phone searching page, shows english in the prompt column on the current search page;
Perhaps, described monogram is mated in the everyday words database of search server, the monogram commonly used that coupling is obtained returns the shell script of the local program or the mobile phone searching page, shows monogram commonly used in the prompt column on the current search page;
Perhaps, described monogram is mated in the phonetic-everyday words database of search server, the Chinese everyday words that coupling is obtained is returned the shell script of the local program or the mobile phone searching page; Chinese display everyday words in the prompt column on the current search page.
13. the information retrieval system based on search engine is characterized in that, comprising:
Trigger element is used to trigger the shell script of local program or searched page;
Monitor unit is used for the input information of monitoring user at search column, and is sent to search server immediately;
Search server comprises that interface subelement and instant result mate subelement, and wherein: the interface subelement is used to receive described input information, and returns occurrence to local program or shell script; Instant result is mated subelement and is used for searching instant result as occurrence according to input information at instant result database; Described instant result database is entity information database, abbreviation-entity data bak, phonetic-entity data bak or numeral-entity data bak; Described entity information comprises entity title and entity network address, object or the notion of described entity for being of practical significance in actual life;
Display unit, the prompt column that is used on current searched page shows occurrence.
14. system as claimed in claim 13 is characterized in that,
Described search server also comprises:
Asterisk wildcard recognin unit is used for discerning the asterisk wildcard of input information, is that the boundary is divided into two parts with the asterisk wildcard with input information;
Fuzzy query coupling subelement is used to finish following action:
1) be that prefix is mated in the everyday words database of search server with the input information before the asterisk wildcard;
2) be that suffix mates in described everyday words database with the input information behind the asterisk wildcard;
3) with 1), 2) in the identical everyday words that obtains of coupling as occurrence.
15. system as claimed in claim 14 is characterized in that, described fuzzy query coupling subelement also is used to finish following action:
Described everyday words is mated in the instant result database of search server, and the instant result that coupling is obtained is as occurrence.
16. as claim 13,14 or 15 described systems, it is characterized in that, also comprise:
Conversion unit is used for when described input information is the combination of Chinese and letter the Chinese in the input information being converted into phonetic alphabet; And/or the combination of numbers that cell phone keyboard is imported is converted into monogram.
17. system as claimed in claim 16 is characterized in that, described occurrence also comprises historical search speech, user's bookmark and classified information, error correction term, top search term or english.
18. system as claimed in claim 16 is characterized in that, described search server also comprises phonetic-everyday words database, historical search speech database and user's bookmark database, top search term database or english database.
CNB2006101132499A 2006-09-20 2006-09-20 Information searching method and system based on searching engine Active CN100409241C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2006101132499A CN100409241C (en) 2006-09-20 2006-09-20 Information searching method and system based on searching engine

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2006101132499A CN100409241C (en) 2006-09-20 2006-09-20 Information searching method and system based on searching engine

Publications (2)

Publication Number Publication Date
CN1936896A CN1936896A (en) 2007-03-28
CN100409241C true CN100409241C (en) 2008-08-06

Family

ID=37954400

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2006101132499A Active CN100409241C (en) 2006-09-20 2006-09-20 Information searching method and system based on searching engine

Country Status (1)

Country Link
CN (1) CN100409241C (en)

Families Citing this family (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2370911A1 (en) * 2008-12-02 2011-10-05 Telefonaktiebolaget LM Ericsson (publ) System and method for matching entities
CN101996030A (en) * 2009-08-14 2011-03-30 深圳富泰宏精密工业有限公司 Mobile device and common text inserting method thereof
WO2011153708A1 (en) * 2010-06-11 2011-12-15 上海坦瑞信息技术有限公司 Information searching method base on domain concept
CN102156724A (en) * 2011-03-31 2011-08-17 北京百度网讯科技有限公司 Method and device for matching suffix of inquiry segment
US9087058B2 (en) * 2011-08-03 2015-07-21 Google Inc. Method and apparatus for enabling a searchable history of real-world user experiences
CN103425643B (en) * 2012-05-14 2018-07-31 深圳市世纪光速信息技术有限公司 A kind of relevant search query string recommendation method and system
JP2013246673A (en) * 2012-05-28 2013-12-09 Oki Electric Ind Co Ltd Inquiry system, inquiry terminal and program
CN102982117B (en) * 2012-11-09 2016-11-02 北京奇虎科技有限公司 Information search method and device
EP2932404A4 (en) 2012-12-12 2016-08-10 Google Inc Providing search results based on a compositional query
CN103870501A (en) * 2012-12-14 2014-06-18 联想(北京)有限公司 Automatic matching method and device
CN103020277B (en) * 2012-12-27 2019-04-26 北京百度网讯科技有限公司 A kind of search terms suggestions method and apparatus
CN103631884B (en) * 2013-11-14 2017-12-12 奇智软件(北京)有限公司 The method and apparatus that searching request is initiated in a kind of browser side
CN104965831B (en) * 2014-06-11 2018-09-07 腾讯科技(深圳)有限公司 A kind of network address error correction method, server, terminal and system
CN104484417B (en) * 2014-12-16 2018-05-04 北京奇虎科技有限公司 A kind of generation method and device of collection information
CN104462557B (en) * 2014-12-25 2018-04-17 北京奇虎科技有限公司 Instant search method and device based on search history record
CN106897317A (en) * 2015-12-21 2017-06-27 北京奇虎科技有限公司 Based on the method and apparatus that keyword scans for recommending
CN105677709A (en) * 2015-12-28 2016-06-15 北京搜狗科技发展有限公司 Information processing method and apparatus, and device for processing information
CN105607757A (en) * 2015-12-28 2016-05-25 北京搜狗科技发展有限公司 Input method and device and device used for input
CN107193818A (en) * 2016-03-14 2017-09-22 百度在线网络技术(北京)有限公司 A kind of searching method, device and terminal device
CN107545130A (en) * 2016-06-28 2018-01-05 学透通医疗科技(上海)有限公司 A kind of hospital's dialysis information overall situation searches for method generally
CN106446122B (en) * 2016-09-19 2020-03-10 华为技术有限公司 Information retrieval method and device and computing equipment
CN108874888A (en) * 2017-05-15 2018-11-23 李建文 Data searching method
CN108037837A (en) * 2017-11-07 2018-05-15 朗坤智慧科技股份有限公司 A kind of intelligent prompt method of search term
TWI638271B (en) * 2017-11-08 2018-10-11 國立成功大學 Cloud server system with encrypted file keyword fuzzy search function
CN110110078B (en) * 2018-01-11 2024-04-30 北京搜狗科技发展有限公司 Data processing method and device for data processing
CN109582878A (en) * 2018-11-05 2019-04-05 咪咕文化科技有限公司 Method and device for realizing search prompt and computer readable storage medium
CN111240496A (en) * 2018-11-28 2020-06-05 深圳市帝迈生物技术有限公司 Terminal device, mobile terminal, information input method and computer storage medium
CN109711125A (en) * 2018-12-28 2019-05-03 中国科学院文献情报中心 A kind of unique identities identification and device
CN111859091B (en) * 2020-07-21 2021-06-04 山东省科院易达科技咨询有限公司 Search result aggregation method and device based on artificial intelligence
CN112131461A (en) * 2020-09-09 2020-12-25 重庆易宠科技有限公司 Commodity searching method, system, terminal and computer readable storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050177567A1 (en) * 2003-03-19 2005-08-11 International Business Machines Corporation Search for specific files from the run menu
WO2005103959A2 (en) * 2004-04-19 2005-11-03 Yahoo! Inc. Conducting internet search from an instant messenging applicaiton
CN1808437A (en) * 2006-02-17 2006-07-26 北京金山软件有限公司 Instant webpage key word search method
CN1811767A (en) * 2005-01-27 2006-08-02 微软公司 Systems and methods for providing a user interface with an automatic search menu

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050177567A1 (en) * 2003-03-19 2005-08-11 International Business Machines Corporation Search for specific files from the run menu
WO2005103959A2 (en) * 2004-04-19 2005-11-03 Yahoo! Inc. Conducting internet search from an instant messenging applicaiton
CN1811767A (en) * 2005-01-27 2006-08-02 微软公司 Systems and methods for providing a user interface with an automatic search menu
CN1808437A (en) * 2006-02-17 2006-07-26 北京金山软件有限公司 Instant webpage key word search method

Also Published As

Publication number Publication date
CN1936896A (en) 2007-03-28

Similar Documents

Publication Publication Date Title
CN100409241C (en) Information searching method and system based on searching engine
US7840579B2 (en) Mobile device retrieval and navigation
US7966003B2 (en) Disambiguating ambiguous characters
CN102368262B (en) Method and equipment for providing searching suggestions corresponding to query sequence
JP5133984B2 (en) Input candidate providing device, input candidate providing system, input candidate providing method, and input candidate providing program
US20080319952A1 (en) Dynamic menus for multi-prefix interactive mobile searches
CN101505323B (en) Domain name parsing redirection method on the basis of content analysis under massive data
KR20010103670A (en) Method and system for accessing information on a network using message aliasing functions having shadow callback functions
CN101408879A (en) Method and system for searching product based on search engine
WO2009043175A1 (en) Inquiry-oriented user input apparatus and method
CN101375279A (en) Multi-word word wheeling
CN102063194A (en) Method, equipment, server and system for inputting characters by user
CN105528338A (en) Input method and system with intelligent prediction
JP2007072596A (en) Information sharing system and information sharing method
WO2007046445A1 (en) Search device and search method
CN100456293C (en) Information fast searching device, client end, system and method
US20080312901A1 (en) Character input assist method, character input assist system, character input assist program, user terminal, character conversion method and character conversion program
WO2008091941A2 (en) Method and system for incrementally selecting and providing relevant search engines in response to a user query
JP2011002982A (en) Content providing device, content providing method and content providing program
CN102314462A (en) Method and system for obtaining navigation result on input method platform
CN102567121B (en) Realize the method and apparatus of converged communication
JP2004246422A (en) Information retrieval support device
JP6618103B1 (en) Sentence generating apparatus, sentence generating method, and sentence generating program
CN101923548A (en) Method for searching Internet information and search engine
KR101705556B1 (en) Method and apparatus for providing associated note using degree of association

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C41 Transfer of patent application or patent right or utility model
TA01 Transfer of patent application right

Effective date of registration: 20070413

Address after: Room 3, building 1, Tsinghua Science Park, No. 206, Zhongguancun East Road, Beijing, Haidian District

Applicant after: Beijing Interconnect Technology Co., Ltd.

Address before: Floor 26, building D, science and technology building, Tsinghua Science Park, No. 1 Zhongguancun East Road, Haidian District, Beijing

Applicant before: Wanzhiyi Information Technology (Beijing) Co., Ltd.

C14 Grant of patent or utility model
GR01 Patent grant