The application requires in the right of priority of 35 U.S.C § 119 (e) of the associating provisional application sequence number 60/657,371 co-pending of " search engine that merges user's input " by name of submission on February 28th, 2005, and it is incorporated in here as a reference.
Summary of the invention
The present invention relates to provides the system and method for better Search Results for the input that receives from the user of search engine for the user of system.According to the present invention, user community can management database, can generate Search Results and the out of Memory relevant with search from this database.Especially, the user can the voting result list in the correlativity of element, add side information, such as the link to relevant website, and the search terms that generates with computing machine of user's input moves other search.By this way, Search Results can provide more relevant information for the user.
In a first aspect of the present invention, a kind of method comprises in response to a plurality of users to be inputted management database and shows Search Results from database in response to the first search inquiry.Preferably, Search Results comprises the results list and the supplementary data relevant with the first search inquiry.Management database comprises the combination in any of following operation, the element in the results list of namely again classifying, the storage information relevant with the correlativity of element in the results list, blocks and links, stores linking of the document relevant with the first search inquiry in the results list.
According to the present invention, they think relevant website and Search Results can be classified in response to user's mark, though the website outside their access search context, even or they with the item beyond search inquiry in the first search come mark it.
In one embodiment, supplementary data comprises the description of first concept relevant with the first search inquiry.In one embodiment, management database comprises interpolation, edits and deletes any one operation in the description of the first concept.In other embodiments, the description of the first concept comprises the linking of description of second concept relevant with the first concept.In another embodiment, supplementary data comprise with one of the first concept and second concept or list of concepts that both are relevant in index.The second concept is that subclass, the first concept same existing of the first concept and its appear at any one in the upper concept relevant with the appearance of the first concept of statistics.One of kind by selecting preassigned, user's input and statistical relationship are determined the relation between the first concept and the second concept.
In another embodiment, supplementary data comprises be used to automatically performing linking of second search inquiry relevant with the first search inquiry.The item of the second search inquiry is by user's input or definite by computing machine.The item that computing machine is determined be in the document of its same existing, the item that comprising the first search inquiry the document of the item that comprises the first search inquiry it the position and comprising any one derivation in the density that in the document of item of the first search inquiry, its occurs.In one embodiment, the method also comprises with abort criterion and ends the ability that the user provides user input data.
In a second aspect of the present invention, a kind of method comprises to the search engine submit Query, generates search result list, retrieves the side information relevant with inquiry and show the results page that comprises search result list and side information.User's input is used for revising at least one of search result list and side information.
In a third aspect of the present invention, a kind of demonstration and method from the relevant Search Results of the first user in a plurality of users and the second user's input, comprise generate the first Search Results in response to the first inquiry, receive input from first user, in response to the Update Table storehouse, receiving second inquiry relevant with the first inquiry and the second Search Results to second user's demonstration from the database generation from the input of first user.
In a fourth aspect of the present invention, a kind of system comprises Web server, and it is configured to show Search Results and the side information relevant with the item of the first search inquiry; Search engine is used for Query Database and provides Search Results in response to user's inquiry; And content manager, be used for the management side information in response to a plurality of user's inputs.Preferably, system also comprises the first data storage device that is coupled to content manager, is used for the storage side information.
In one embodiment, content manager is configured to receive a plurality of user's inputs for upgrading side information.Preferably, system also comprises for the viewing area that shows side information.Side information comprises the description of first concept relevant with the first search inquiry.Side information comprises the link of the description of the second concept.In one embodiment, content manager is configured to receive a plurality of user's inputs for interpolation, editor or Remove Links.In another embodiment, content manager is configured to receive a plurality of user's inputs for the description of adding, edit or delete the first concept.
In another embodiment, this system also comprises the zone that links that is used for automatically performing second search inquiry relevant with the first search inquiry for showing.Preferably, this system also comprises for input to organize the search engine of Search Results based on a plurality of users.This search engine is configured to by the Search Results graduation is inputted to organize Search Results based on a plurality of users.This system also comprises for the subscriber equipment of carrying out Web server.Subscriber equipment is personal computer, portable phone and one of personal digital assistant and is configured to use in HTML (Hypertext Markup Language) and WAP (wireless application protocol) any one or a plurality ofly communicates by letter with Web server.
Embodiment
Embodiments of the invention, different from traditional search engine, utilize side information that the more relevant information user to searching for Internet is provided, especially in the situation that this side information is the user inputs.For example, according to the present invention, the first user of carrying out search can add the information relevant with carrying out search of user's input, with the search information relevant with the concept of being quoted by inquiry.First user can be inputted the description of (1) concept relevant with inquiry, (2) be used for carrying out the suggestion of the search relevant with concept, (3) " seeing also " hyperlink of the query term related with Conceptions, (4) related or query term suggestion, (5) to the feedback of the correlativity of result and his search, (6) any out of Memory.In addition, some or all of these information can be generated by computerized algorithm, Web crawl device or other technology.The second user who carries out similar or relevant search then can check this additional information except the results list that is provided by search engine, more may maximally related Search Results with him thereby obtain.This second user can add the information of user's input.Two users can both share the information relevant with search for.Therefore user community can be shared and help the user to estimate or use more accurately and provide fast the information of Search Results.
Database comprises the combination in any for information collected works, search index itself and the following content of replenishing search result list: the record of the data that the user of search finds useful data, inputted by the user of search, and such as by preserving, classify, block, write, edit or delete data.Database is expanded in one or more data storage devices and system.And also as described below, database can be managed in response to user's input.
According to other embodiments of the invention, Search Results also comprises for the options that shows, include but not limited to, (1) be used for providing the mechanism of the feedback of the correlativity that the results list is linked, (2) be used for to preserve the link that can show or the mechanism that votes for peer link on the personal search page, and (3) are used for " obstructions " and Search Results has nothing to do or the linking of undesirable in fact webpage.Other embodiment comprises demonstration and link and the investment advertisement link of relevant search terms.
In whole instructions subsequently, term " search engine " refers to inquiry is used as the equipment (or operate on multi-purpose computer program) of the results list of input and the hyperlink of return electron document or webpage.Search engine comprises the index of document in its collected works, determines code and the algorithm of each document relevance and sends the results list to user's graphical user interface.
In whole instructions subsequently, term " inquiry " refers to submit to the set of the item of search engine, be no matter typewrite, speak, submit to by " link " that embedded collection of search terms, or by other interface submission.Inquiry can comprise word, a plurality of word or phrase.Inquiry can be expressed as problem (for example, " natural language " inquiry) by phrase, freely gather (loose set), structurized boolean's expression.In fact, inquiry can comprise symbol and any other other character, and described symbol and any other other character are used for searching for by search engine and comprise searching character or electronic document or the webpage relevant with searching character.
In whole instructions subsequently, term " website " refers to link together and the set of available webpage on WWW.Term " webpage " refers to the addressable publication on WWW from a plurality of main frames, and includes but not limited to text, video, image, music and figure.
in whole instructions subsequently, term " the results list " refers to quote other related information of hyperlink list and each link of document or webpage, these documents or webpage are to use Hypertext Transmission Protocol (HTTP) or any agreement that other is used for accession page or other electronic document agreement to visit, described other related information includes but not limited to the title of document, the summary of document, the link of the cached copies of document, document is by last index or last date of revising, with document associations or be positioned at wherein image, with the information of extracting from document.
As used herein, term " document " broadly defines, and also comprises computer documents and webpage except its general sense, and no matter whether these pages are really in response to the request that shows is dynamically stored or generated.Term " document " is not limited to comprise the computer documents of text, but also comprises the computer documents that comprises figure, audio frequency, video and other multi-medium data.
As below in greater detail, search engine obtains by the inquiry of user's input and uses various correlation calculations that the index of search terms and webpage is mated, and its objective is those the relevant webpages of information most probable that will identify and be searched by the user.Search engine then returns to the hyperlink ranked list of these webpages, is considered to the top of the more close list of maximally related link.According to the present invention, search engine returns results list based on user input, and the user has input message to the ability of system, in order to for example affect the document listed or the order of link in the results list.
According to the present invention, when the user had been sent the webpage that comprises the results list, he can select to add additional information to the page, and this is for visit subsequently other users visible of search engine by inputting identical or similar inquiry.
Fig. 1 returns to schematic illustration for the typical graphics user interface (GUI) that shows results page 100 according to of the present invention for showing in response to inquiry.GUI allows user add, edits and checks the description about the one or more concepts relevant to query term, and adds, edit and check the suggestion of how to search for the information relevant with concept.
Results page 100 comprises for the frame 110 that inserts query term, be used for showing the description of the concept relevant with query term zone 120, comprise the description of the different concepts relevant with query term zone 130, comprise zone 140 that the concept relevant with other query term " seeing also " link and the zone 180 of the zone 150 that comprises the lists of links that causes that relevant query term is performed and advertisement link.Results page also 100 also comprises the zone 160 that comprises the results list that is returned by search engine.Zone 160 also comprises for the mechanism 170 of the input user feedback mechanism that links 190 related with each result of being returned by search engine with being used for preservation.As described in more detail below, in a preferred embodiment, zone 120,130,140 and 150 can be edited by the user, add or revise, in order to show the information that offers other user who carries out identical or similar inquiry.
As shown in the example of Fig. 1, when input inquiry item " U2 " was also asked search in frame 110 as the user, results page 100 was returned to him.Zone 120 shows the description of a concept relevant with query term " U2 ", and what input as the user here is the description of band " U2 ".The description that zone 130 illustrates the different concepts of inquiry " U2 ", what input as the user here is the U2 reconnaissance plane.Zone 150 shows that users also may be interested in to allow the query term of the relevant search that search engine carries out, such as user's input or by " U2 concert admission ticket " or " U2iPod " of algorithm derivation.Zone 140 comprises as " seeing also " hyperlink user input or the concept relevant with other query term that derived by algorithm, such as " the Dragon Lady " of " Bono " or " U2 reconnaissance plane " concept of " U2 band " concept.
Zone 160 comprises Search Results and user feedback mechanisms 170.User's feedback mechanism 170, the user can grade, and how good want that the content matching of searching must have for corresponding webpage and he.In other words, if the first webpage of listing in 160 in the zone comprises the relevant information relevant with the rock band U2 of user search, the user can user's feedback mechanism 170 with high mark (such as 5 stars) this link of grading.Be exclusively used in second webpage of title of the clothes line that is called " U2 " and concept that the user searches irrelevant but list in zone 160, this second webpage can be graded with lower mark (such as 1 star).According to the present invention, when after a while user is also interested in the search of band " U2 " with " U2 " inquiry, the search listing that returns to him comprises the top of the more close the results list of the first webpage (classifying with 5 stars) and the bottom of the more close list of the second webpage (classifying with 1 star), or does not even list.By this way, the user has been presented the results list with correlated results of only at first listing.Accessing subsequently in the results list the user of website has larger chance and checks the maximally related website of the concept of just searching with him.Except metadata and the out of Memory without user input, the order of the results list middle term is therefore based on user's feedback.
The user can also add the description 120 about the concept relevant to query term, and some background informations relevant with the concept of being quoted by inquiry or suggestion are provided, and described inquiry or suggestion relate to relevant with the described concept information of how searching for.The user can also revise, strengthens or remove the description about the concept relevant to query term, before this is described by itself or other user add or modification.
The user can add the description of the additional concepts relevant to search terms, even inputted other concept.For example, for query term " star wars ", the description of the concept of film " Star Wars " can be added, and comprises the information such as the story of a play or opera, performer and producer.Subsequently, the user can clickthrough 130, and this allows them to add the description relevant with same queries item " starwars ", has described different concepts, for example " strategic defensive initiatively or SDI ".
In interchangeable embodiment, add according to the present invention, revise or the concept of deletion be each other subclass (for example, sub-topics), same existing in document or with the appearance of statistical dependence mode.For example, concept " operating system " and " Linux " are theme and relevant sub-topics.And, in interchangeable embodiment, according to kind and the statistical computation (for example, concept has multifrequency together in document numerous) of preassigned, user's input, determine that concept is relevant.
The user can add to be linked to from hyperlink or " seeing also " of different query term related notions and quotes 140.As an example, " seeing also " of the concept of user add Star Wars film partly, the hyperlink of the concept of George Lucas, the author/producer of query term " GeorgeLucas ".The user can revise, adds or delete " seeing also " and quote.
The user can be that concept adds the inquiry of suggestion, and described concept makes inquiry be submitted to search engine when clicked, and this search engine returns to the results page 100 that comprises with side information 120,140 and 150 related the results lists 160.
Search engine can also generate the query term of advising with computerized algorithm.For example, the item of such computerized algorithm search document to determine in identical document (with existing), often to occur in preset distance each other or in predetermined density (pre-determined number at least namely occurring).Therefore algorithm determines that these are relevant, and search engine provides query term as suggestion.Replacedly, computerized algorithm keeps the list of query term, such as synonym or the word modification of also advising to the user.
The user can add or preserve and is considered to and the linking of the document of concept height correlation.This can input link or complete by the icon 190 of clicking hyperlink or being labeled as " preservations " by craft, or is pointed out by other item such as " bookmark ", " label " or " adding collection to ".Because different user will have relevant different ideas the most relevant from website, therefore algorithm according to the present invention is determined the order of listed website.In one embodiment, algorithm uses " inside " to process, and makes maximum " ballot " (for example by maximum user " preservation the ") documents of reception place highlyer in the results list.
If in the results list that being generated by search engine also appears in the link of institute's " preservation " document, it is also one that has been voted for by the user that this icon 165 can be used for illustrating this link.And, below each Search Results is " By " entry 167, and it shows the user's who adds link title, and a part that makes it can be used as the results list is returned, and " Tags " entry 168, it has listed user and the item that links all marks or generated by prior searches.
According to the present invention, the link of website can with by listing in two ways together with the linking of the user of iconic marker input as mentioned above, as two independent lists: the link of (1) the results list (algorithm) and user's input or, or (2) merge in a list.
Two or more people can revise any information described here.As an example, first user writes out and the second user revises the work of first user.First user can " reduce " or update the second user's work.Should input what information if two or more people disagree with, they can be by alternate manner communication (for example, forum, Email, immediate information software) in order to manage conflict and should say that to entry what reaches an agreement.
If any two or more users can not solve their the relevant different opinions of should inputting and so on, they can take their difference to " editor " that can solve these different opinions." editor " is responsible for a large amount of motif areas and has the authority of calming down controversial issue, interpolation or deletion information, also finally removing the user of refusal cooperation.
If the user inputs the information that other people often restore, can suppose that this user does not input people and wants the information of putting up.For example, the user may uglify or destroy the information in motif area.A rule can be implemented, and the user who has made entry restore pre-determined number in section between it requires is at a time ended a certain predetermined amount of time.This rule is intended to reduce the amount of destruction.
Except the information of any particular type of suggestion here, the user can input the information of any kind.As an example, for all performers, their webpage the Internet film database (
Www.imdb.com) link be transfused to.Perhaps for the city, the link that the Weather.com page of Current Temperatures and weather condition is shown is transfused to.Perhaps, for a first song, the link of selling other song of song, the lyrics, artist or even performing the website of some or all of songs is transfused to.
Will be understood that, can make a lot of modifications according to the present invention.For example, can directly read from the input of terminal the feedback that the user generates from file rather than user.And, although results page 100 shows the information such as " seeing also " link 140, will be understood that according to the present invention, the results page that comprises the information of user input can illustrate together with the combination in any in a plurality of zones, described zone comprise those zones shown in Figure 1 or except the zone.This information be used for making Search Results more comprehensively, accurate and meaningful.
Fig. 2 is the process flow diagram that the operation of internet hunt application 200 has been described according to an embodiment of the invention.Internet hunt uses 200 for the user provides to the ability of system's input message, and the input based on this user allows other user to receive more significant Search Results thus.This information is perhaps added for user's Useful Information of carrying out identical or similar search for the document graduation (result of for example search engine initially being returned is classified again) of the results list that will generate in response to ad hoc inquiry.This results list therefore can be in response to user feedback by " transformation " with the more significant result of backspace, and return to additional information with the Topic relative of inquiring about.
In step 205, the user is to the search engine submit Query.Then this process proceeds to the step 210 and 220 that can be performed simultaneously.In step 210, calculate search result list, and in step 220, retrieval side information (for example zone in Fig. 1 120,130,140 and 150).Step 210 and 220 all proceeds to 230, and wherein results page (for example 100 in Fig. 1) is sent to the user.Step 230 proceeds to any one in step 240,250,260 and 270.
In step 240, the user is allowed to add or editor's additional information (for example zone in Fig. 1 120,130,140 and 150).Replacedly, in step 250, the user can click the search link (for example zone in Fig. 1 130) of suggestion; Or in step 260, click " seeing also " link (for example zone in Fig. 1 140); In step 270, access websites one of (for example follow in the zone 150 in Fig. 1 link after).Step 240 is circulated back to step 230, and step 250,260 and optional 270 all loops back 280 to step 205.Replacedly, from step 270, the user can proceed to step 290, and wherein inquiry is completed.
Fig. 3 explanation is according to each parts of system 300 of the present invention.System 300 comprises the subscription client 305 that is connected with Web server 310.Web server 310 is coupled to content manager 320 and search engine 340.Content manager 320 is coupled to the data repository 330 for the storage supplemental content.Search engine 340 is coupled to the data repository 350 that comprises document index, itself then be coupled to index 360.Index 360 is coupled to web content database 370, and it is coupled to Web crawl device 380.Web crawl device 380 is coupled to one or more websites 399 by internet 390.
In operation, Web crawl device 380 navigates on internet 390, and access websites 399 is also filled web content database 370.Index 360 use web content databases 370 create document index 350.When the user generated inquiry to subscriber's main station 305, Web server 310 sent to search engine 340 with searching request.Search engine 340 determines which webpage is probably the most relevant to this inquiry, and the feedback that generates with above-mentioned user creates the results list.Search engine 340 grade of user's generation as mentioned above comes the ranking results list, and the results list is fed back to the user for demonstration.
Also in response to inquiry, content manager 320 comprises conceptual description, other conceptual description, " seeing also " link and relevant query term from the data repository 330 retrievals side information relevant with inquiry.This information for example shows respectively in the zone 120,130,140 and 150 of Fig. 1.Content manager 320 also allows user add, edits or removes side information.Website 310 will be from the result of search engine 340 and information combination from content manager 320, and this combination is returned to the user.Content manager 320 determines whether the user changes side information, and if change, it is stored in data repository 330.For after move new the user of identical or similar search or information that upgrade is now available.
Fig. 4 is each hardware component that has illustrated according to the internet hunt application system 400 for user 405 of the present invention.System 400 comprises the client device 410 that is coupled to Web server 430 by internet 420.Client device 410 can be for access Web server 430 and any equipment of being configured to use the Internet protocol such as, but not limited to http (HTML (Hypertext Markup Language)) and WAP (WAP (wireless application protocol)) to communicate.Preferably, client device 410 is personal computers.Replacedly, client device 410 is another equipment that includes but not limited to handheld device, such as cell phone or personal digital assistant (PDA), can use such as the standard of HTML (HTML (Hypertext Markup Language)), HDML (handheld device markup language), WML (WAP Markup Language) etc. and come presentation information.
Web server 430 is coupled to content server 440 and search server 460.Content server 440 is coupled to data storage device 450 and search server 460 is coupled to data storage device 470.
Easily be understood that for those skilled in the art, can make other modification in the situation that do not depart from the spirit and scope of the present invention of claims restriction to embodiment.