CN101986293A - Method and equipment for displaying search answer information on search interface - Google Patents

Method and equipment for displaying search answer information on search interface Download PDF

Info

Publication number
CN101986293A
CN101986293A CN 201010271796 CN201010271796A CN101986293A CN 101986293 A CN101986293 A CN 101986293A CN 201010271796 CN201010271796 CN 201010271796 CN 201010271796 A CN201010271796 A CN 201010271796A CN 101986293 A CN101986293 A CN 101986293A
Authority
CN
China
Prior art keywords
answer
question
information
search information
search
Prior art date
Application number
CN 201010271796
Other languages
Chinese (zh)
Other versions
CN101986293B (en
Inventor
戴帅湘
徐犇
Original Assignee
百度在线网络技术(北京)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 百度在线网络技术(北京)有限公司 filed Critical 百度在线网络技术(北京)有限公司
Priority to CN201010271796.6A priority Critical patent/CN101986293B/en
Publication of CN101986293A publication Critical patent/CN101986293A/en
Application granted granted Critical
Publication of CN101986293B publication Critical patent/CN101986293B/en

Links

Abstract

The invention provides a method and equipment for displaying search answer information on a search interface. In the invention, by acquiring the search information of a user, the search information is matched with prestored question-answer pairs and one or more question-answer pairs matched with the whole or partial content of the search information are acquired; and answer information corresponding to the search information is provided for the user according to the one or more question-answer pairs. The method and the equipment have the advantages of: 1) directly providing a deterministic answer based on the search information for the user on the search interface according to the search information input by the user, and providing a search result based on the search information for the user at the same time; 2) generating a question-answer pair library by analyzing webpage content and encyclopedic knowledge, and providing comprehensive and real-time answers for the user; and 3) improving answer accuracy by using related information of the user and/or further interaction with the user, and providing better personalized experiences for the user.

Description

Be used for presenting the method and apparatus of searching for answer information at search interface
Technical field
The present invention relates to computer realm, relate in particular to and be used for presenting the method, apparatus and system of searching for answer information at search interface.
Background technology
In the prior art,, tend in search engine or special information bank, search for if the user wishes to obtain some information.Wherein, may obtain the information of determinacy answer for some, search engine only provides link according to user's search information to the user, and the user also needs to select and search answer from link; And special information bank mostly is the special storehouse of certain aspect, and being difficult to provides comprehensive information to the user, and when the user furnishes an answer information, can't provide other search content to the user.
Therefore, how can provide the answer of certain problem simultaneously all sidedly to the user, can provide Search Results according to user's search information again, become the problem that those skilled in the art need solve.
Summary of the invention
The purpose of this invention is to provide a kind of being used for presents the method, apparatus and system of searching for answer information at search interface.
According to an aspect of the present invention, provide a kind of being used for to present the method for searching for answer information at search interface, this method may further comprise the steps:
A obtains the search information from the user;
To mating, it is right to obtain one or more question and answer that all or part of content with described search information is complementary with described search information and each question and answer of prestoring for b;
C is right according to described one or more question and answer, provides the answer information corresponding with this search information to the user.
According to a further aspect in the invention, also provide a kind of being used for to present the equipment of searching for answer information at search interface, wherein, this equipment comprises:
First deriving means, be used to obtain search information from the user;
Coalignment, be used for described search information and each question and answer of prestoring mating, it is right to obtain one or more question and answer that all or part of content with described search information is complementary;
Generator, be used for according to described one or more question and answer rightly, provide the answer information corresponding with this search information to the user.
Compared with prior art, the present invention has the following advantages: 1) can be according to the search information of user's input, directly in search interface, provide determinacy answer based on this search information to the user, and, can also be simultaneously provide Search Results based on this search information to the user; 2) by analysis to web page contents and encyclopaedic knowledge, generate question and answer to the storehouse, provide comprehensive, real-time answer to the user; 3) relevant information by the user and/or further mutual with the user have improved the accuracy of answer, give the user better individualized experience.
Description of drawings
By reading the detailed description of doing with reference to the following drawings that non-limiting example is done, it is more obvious that other features, objects and advantages of the present invention will become:
Fig. 1 for one aspect of the invention be used for present the method flow diagram of searching for answer information at search interface;
Fig. 2 for a preferred embodiment of the invention be used for present the method flow diagram of searching for answer information at search interface;
Fig. 3 for another preferred embodiment of the present invention be used for present the method flow diagram of searching for answer information at search interface;
Fig. 4 is that the network equipment generates the right method flow diagram of question and answer according to the question and answer content from webpage;
Fig. 5 is that the network equipment generates the right method flow diagram of question and answer according to the data from the encyclopaedia webpage.
Fig. 6 presents the system architecture synoptic diagram of searching for answer information for one aspect of the invention in search interface;
Fig. 7 is the structural representation of the coalignment of a preferred embodiment of the invention;
Fig. 8 is the structural representation of the coalignment of another preferred embodiment of the present invention;
Fig. 9 is that the network equipment is according to generating the right apparatus structure synoptic diagram of question and answer from the question and answer content of webpage and the data of encyclopaedia webpage.
Same or analogous Reference numeral is represented same or analogous parts in the accompanying drawing.
Embodiment
Below in conjunction with accompanying drawing the present invention is described in further detail.
Fig. 1 illustrates according to one aspect of the invention and present the method flow diagram of searching for answer information in search interface.It illustrates the network equipment 2 and obtains the search information by subscriber equipment 1 input from the user via network, with search information and question and answer to mating the process of obtaining answer information and in search interface, presenting to the user.
Wherein, network includes but not limited to internet, wide area network, Metropolitan Area Network (MAN), LAN (Local Area Network), VPN network, wireless self-organization network (Ad Hoc network) etc.Subscriber equipment 1 includes but not limited to any electronic product that carries out man-machine interaction by keyboard, telepilot, touch pad or voice-operated device with the user, for example computing machine, smart mobile phone, PDA, game machine or IPTV etc.The network equipment 2 includes but not limited to group of server that single network server, a plurality of webserver are formed or based on the cloud that is made of a large amount of computing machines or the webserver of cloud computing (Cloud Computing), wherein, cloud computing is a kind of of Distributed Calculation, a super virtual machine of being made up of the loosely-coupled computing machine collection of a group.Wherein, the network equipment 2 is preserved question and answer to storehouse and vocabulary typelib.Question and answer centering comprises problem and the corresponding answer of this problem, and question and answer are to comprise the right set of a large amount of question and answer to the storehouse.Write down the corresponding relation of the type that the combination of the combination of vocabulary or vocabulary and this vocabulary or vocabulary may describe in the vocabulary typelib.
When the user wishes to search for, the input mode inputted search information that provides by subscriber equipment.Wherein, this input mode includes but not limited to: 1) literal input; 2) phonetic entry; 3) handwriting input.Wherein, the position of inputted search information includes but not limited in aforesaid way: the 1) search column of the page that provides of search engine; 2) searched page that provides of client; 3) search column in embedded web page or the client etc.
Particularly, in step s101, subscriber equipment 1 obtains the search information of user's input by the interactive device that any and user carry out man-machine interaction.This interactive device can be keyboard, telepilot, touch pad or voice-operated device etc.Then, in step s102, subscriber equipment 1 is sent to the network equipment 2 with the search information of described user's input.In step s103, the network equipment 2 obtains above-mentioned search information.
Then, in step s104, the network equipment 2 with the search information that receives and local question and answer to each question and answer of prestoring in the storehouse to mating, promptly each question and answer centering search with search information in same or analogous lexical information, the one or more question and answer that are complementary with all or part of content that obtains with described search information are right.
Particularly, the network equipment 2 mates the problem of search information and question and answer centering, according to search information, the situation of the right problem of one or more question and answer and this search information coupling can occur.
When search information only implied single problem, the network equipment 2 can obtain the result that the full content in this search information can be complementary with the problem of one or more question and answer centerings.
For example, when search information is " Mao Zedong's birthplace is at which ", above-mentioned search information implies single problem.The network equipment 2 mates in question and answer this search information in to the storehouse, obtain a right problem of question and answer and be " Mao Zedong birthplace ", the problem that these question and answer are right and the full content of search information are complementary, and then the network equipment 2 has obtained question and answer to " Mao Zedong birthplace, Shaoshan, Hunan ".
When the implicit a plurality of problem of search information, the network equipment 2 can mate that to obtain a plurality of question and answer right, and the partial content in the problem that each question and answer is right and this search information is complementary.
For example, when search information is " Mao Zedong's date of birth and place ", above-mentioned search information implies two problems.The network equipment 2 mates in question and answer this search information in to the storehouse, obtain question and answer to " Mao Zedong birthplace, Shaoshan, Hunan " and question and answer to " Mao Zedong's date of birth, on Dec 26th, 1893 ".Wherein, the problem that previous question and answer are right is " Mao Zedong birthplace ", " Mao Zedong place of birth " is complementary in this problem and the search information, and the right problem of back question and answer is " Mao Zedong's date of birth ", in this problem and the search information " Mao Zedong's date of birth " be complementary.
When question and answer exist in to the storehouse relative search information more refinement question and answer to the time, it is right that the network equipment 2 can match a plurality of question and answer.
For example, when search information is No. two, a subway " line last buses constantly ", because each No. two line final vehicle hours of city underground difference, question and answer are right to the question and answer of No. two line last buses of subway that tend to have each different cities in the storehouse.The network equipment 2 with above-mentioned search information and each question and answer to after mating, it is right to obtain a plurality of question and answer: " No. two line last buses of Shanghai Underground constantly; 23:00 ", " No. two line last buses of Beijing Metro constantly; 23:15 ", " No. two line last buses of Guangzhou Underground constantly; 23:30 " etc., the problem that above-mentioned question and answer are right all can be mated with the full content of search information.Need to prove that above-mentioned example is only for illustrating the solution of the present invention better, but the present invention is not as limit, those skilled in the art should understand that, any according to search information, obtain the right scheme of one or more question and answer, all should be included in the scope of the present invention.
In step s105, the network equipment 2 according to the question and answer that obtain among the step s104 to generating answer information.
When with the question and answer of search information coupling when only having one, the network equipment 2 obtains the answer of above-mentioned question and answer centering, in conjunction with the right problem of these question and answer as answer information.
When, judging whether and the answer that each question and answer are right to integrate when a plurality of having with the question and answer of search information coupling.If above-mentioned each question and answer can be integrated semantically being associated, the network equipment 2 is integrated above-mentioned each question and answer to using the mode that meets the natural language custom, and generates answer information.For example, for the question and answer that are complementary with search information " Mao Zedong's date of birth and place " to " Mao Zedong's date of birth; on Dec 26th, 1893 " and " Mao Zedong birthplace; Shaoshan, Hunan ", the main body of its problem is identical, the network equipment 2 can be integrated it and generate an answer information " Mao Zedong is born on Dec 26th, 1893, Shaoshan, Hunan " that semantically links up; If each question and answer are to integrating, 2 right according to question and answer respectively answer and right problems of this question and answer of the network equipment, generate a plurality of answers unit as answer information, for example, the question and answer that search information No. two, subway " line last buses constantly " is complementary to " No. two line last buses of Shanghai Underground constantly; 23:00 ", " No. two line last buses of Beijing Metro constantly; 23:15 ", " No. two line last buses of Guangzhou Underground constantly, 23:30 " generate such as " No. two line last buses of Shanghai Underground are 23:00 constantly; No. two line last buses of Beijing Metro are 23:15 constantly; No. two line last buses of Guangzhou Underground are 23:30 constantly " answer information.
What need further specify is, the network equipment 2 can comprise the right generation of compound question and answer in the process that generates answer information, it is right to be that the network equipment 2 can be combined a plurality of question and answer that obtain according to search information to generate new compound question and answer, for example, can generate question and answer to " Mao Zedong's date and place of birth, Shaoshan, Hunan on the 26th Dec in 1893 " to " Mao Zedong birthplace; Shaoshan, Hunan " and question and answer to " Mao Zedong's date of birth, on Dec 26th, 1893 " according to the question and answer that obtain.
Subsequently, in step s107, the network equipment 2 sends to subscriber equipment 1 with described answer information.At last, in step s108, subscriber equipment 1 is according to the described answer information that receives, and upgrades the page, presents to the user with the page that described answer information is incorporated after the renewal.
Preferably, between step s105 and step s107, also comprise step s106 (figure does not show).In step s106, the network equipment 2 upgrades searched page, the answer information that generates among the step s105 is dissolved in the searched page of renewal.
Wherein, the position that presents in the page of answer information includes but not limited to following at least one:
-Search Results article one for example is presented on the result article one in the link that obtains according to search information;
-search suggestion for example is presented on the result in the position of the search key that the user offers suggestions;
-input method candidate bar for example, is presented on the result in the option of user's input method;
-search column candidate item hurdle for example, is presented on the result in the drop-down hurdle that the search candidate item is provided that search column lists;
Candidate item hurdle under the-WEB input field for example, is presented on the result on the drop-down hurdle that is used for listing candidate item in the WEB input field.
Need to prove that above-mentioned example is only for illustrating the solution of the present invention better, but the present invention is not as limit, those skilled in the art should understand that, any searched page is handled,, all should be included in the scope of the present invention answer information is included in the scheme in the searched page.
Accordingly, in step s107, the page of the network equipment 2 after with the described renewal that comprises answer information sends to subscriber equipment 1.In step s108, subscriber equipment 1 is presented to the user after receiving the page that comprises answer information after the described renewal.
Preferably, in step s104, each question and answer is to being with the storage of the frame mode of a four-tuple in the network equipment 2.Question and answer comprise question and answer classification, entity, substance feature description, the tetrameric four-tuple of answer to being expressed as.
At this, " question and answer classification " represents the classification of the contained problem of this question and answer centering, includes but not limited to: time, place, product performance etc.; The object that " entity " expression question and answer centering is putd question to includes but not limited to: name, place name, product, incident, proper noun etc.; The content of the described object of understanding is wished in " substance feature description " expression; " answer " is the right answer of these question and answer.For example, question and answer are to " Shaoshan, Hunan, Mao Zedong birthplace, place ", and wherein " place " is the right question and answer classifications of these question and answer, and " Mao Zedong " is the right entities of these question and answer, " birthplace " is that the right substance feature of these question and answer is described, and " Shaoshan, Hunan " is the right answers of these question and answer., be convenient to store to can be with four-tuple storage question and answer with relational database etc. with question and answer to the data structuring.Therefore, the network equipment 2 with search information and question and answer to coupling further tool turn to search information and described each question and answer that prestore mated the entity and the substance feature description that are comprised, obtaining the description of described entity and described substance feature all can be right with one or more question and answer of all or part of content match of described search information.
For example, when search information is " Mao Zedong's birthplace is at which ", the network equipment 2 is described coupling with above-mentioned search information and described each question and answer that prestore to the entity and the substance feature that are comprised, obtain question and answer to " Shaoshan, Hunan, Mao Zedong birthplace, place ", the full content coupling of its entity " Mao Zedong ", feature description " birthplace " and described search information.It is right so just to have obtained question and answer being complementary with the search information full content.
And for example, when search information is " Mao Zedong's date of birth and place ", above-mentioned search information implies two problems, the network equipment 2 is described above-mentioned search information in to the storehouse in question and answer and is mated with right entity of question and answer and substance feature, obtain question and answer to " Dec 26 1893 time Mao Zedong date of birth " and question and answer to " Shaoshan, Hunan, Mao Zedong birthplace, place ", wherein, above-mentioned two question and answer are to entity " Mao Zedong ", feature description " date of birth " reach " birthplace " respectively with the partial content " Mao Zedong's date of birth " of described search information and Mao Zedong's birthplace " be complementary.
Preferably, in step s104, the network equipment 2 to before mating, also will be judged the question and answer classification that described search information comprises with search information and each question and answer.The question and answer classification is first in the above-mentioned four-tuple, and the question and answer classification is represented the classification of question and answer centering problem.For example " Dec 26 1893 time Mao Zedong date of birth " belong to the question and answer classification of time character; " Shaoshan, Hunan, Mao Zedong birthplace, place " belongs to the question and answer classification of place character.For example, when search information is " Mao Zedong's date of birth and place ", the network equipment 2 mates above-mentioned search information in the vocabulary typelib, type-" place " that type-" time " that this vocabulary combination that acquisition search information partial content " goes out the birthday " is described and this vocabulary combination of search information partial content " birthplace " are described.Then judge the question and answer classification of above-mentioned two types of respectively corresponding time character and the question and answer classification of place character, the network equipment 2 is interrogated the question and answer centering that two question and answer classifications answering classification are comprised in the question and answer classification of its above-mentioned time character that comprises and location respectively with described search information and is mated then, obtains location and interrogates the question and answer answered in the classification " Shaoshan, Hunan, Mao Zedong birthplace, place " and timeliness interrogation are answered question and answer in the classification to " Dec 26 1893 time Mao Zedong date of birth ".Wherein, the problem that previous question and answer are right is " Mao Zedong birthplace ", " Mao Zedong place of birth " is complementary in this problem and the search information, and the right problem of back question and answer is " Mao Zedong's date of birth ", in this problem and the search information " Mao Zedong's date of birth " be complementary.
Preferably, in step s104, the network equipment 2 judge whether earlier from described search information, to extract comprise the entity of problem and substance feature is described, describe if can from described search information, extract entity and substance feature, described entity that can extract and substance feature are described and entity and substance feature description the carry out matching inquiry of described each question and answer to being comprised.For example, when search information is " Mao Zedong's birthplace is at which ", the network equipment 2 is judged according to Entity recognition technology and proper noun recognition technology can extract from described search information that institute comprises the entity of problem and problem in the described search information was described and discerned to substance feature entity is " Mao Zedong ", judges that the substance feature of this entity is described as " birthplace ".Wherein, the Entity recognition technology is a kind of identification content of text institute's description object or theme, and preferred, the technology that described description object or theme are sorted out; The proper noun recognition technology is a kind of proprietary name and significant quantity phrase that occurs in the text of discerning, and preferred, the technology that described proprietary name and significant quantity phrase are sorted out.Then the network equipment 2 mates with each question and answer the entity " Mao Zedong " and the substance feature description " birthplace " that extract to the entity and the substance feature description that are comprised, and the question and answer that obtain to be complementary are to " Shaoshan, Hunan, Mao Zedong birthplace, place ".
Preferably, in step s104, the network equipment 2 judge whether earlier from described search information, to extract comprise the entity of problem and substance feature is described, and whether can judge the classification of problem that this search information comprises.If can extract entity and substance feature from described search information describes, and can judge the classification of problem that this search information comprises, the network equipment 2 is described classification, entity and the substance feature of problem that this search information comprises with classification, entity and the substance feature of each question and answer centering and is described coupling respectively.For example, when search information is " Mao Zedong's date of birth ", the network equipment 2 is judged according to Entity recognition technology and proper noun recognition technology and can be extracted classification, entity and substance feature that institute comprises problem from described search information to describe and discern the classification of problem in the described search information be " time ", entity is " Mao Zedong ", and judges that the substance feature of this entity is described as " date of birth ".Then the network equipment 2 is described with each question and answer the problem category " time ", entity " Mao Zedong " and the substance feature description " date of birth " that extract and is mated to the problem category, entity and the substance feature that are comprised, and the question and answer that obtain to be complementary are to " Shaoshan, time Mao Zedong date of birth Hunan ".
Fig. 2 presents the method flow diagram of searching for answer information for being used for according to one preferred embodiment of the present invention at search interface.
The network equipment 2 at first obtains the search information by subscriber equipment 1 input from the user, when question and answer exist in to the storehouse relative search information more refinement question and answer to the time, it is right to match a plurality of question and answer, as question and answer to candidate item.So it is right that the network equipment 2 will further be chosen one or more question and answer in above-mentioned a plurality of question and answer in to candidate item according to user related information, further obtains answer information then and present to the user in search interface.Wherein user related information includes but not limited to: 1) individual subscriber attribute (including but not limited to: IP address, subscriber equipment classification, user's sex age etc.); 2) user preference setting; 3) user search historical record etc.
Particularly, step s201 to s204 is same or similar with reference to the described step of Fig. 1 s101 to s104 with the front, comprises by reference at this, repeats no more.
In step s205, the network equipment 2 obtains the user related information from subscriber equipment 1.Wherein, described relevant information obtain the following mode that includes but not limited to:
1) directly obtains the user related information that subscriber equipment 1 sends;
2) obtain the user's that subscriber equipment 1 sends identity or identifying information, the network equipment 2 obtains the relevant information that is recorded in this user in the network equipment 2 according to this identity or identifying information;
3) network equipment 2 is according to setting up the identification information of the subscriber equipment 1 that obtains when communicate by letter with described subscriber equipment 1, such as the cell-phone number of the subscriber equipment of acquisition or hardware sequence number etc., judges user's identity, and obtains this user's relevant information according to this identity.
In step s206, the network equipment 2 obtains a plurality of question and answer according to the user related information that obtains among the step s205 in step 204 right to further choosing one or more question and answer in the candidate item.Wherein, describedly choose one or more question and answer right method comprise following at least a:
1) particularly, the network equipment 2 is compared the question and answer that obtain to each problem of candidate item, extract incoherent vocabulary, and described incoherent vocabulary searched in described vocabulary typelib, obtain the type of described uncorrelated vocabulary, and extract or obtain corresponding user related information according to the type, in to the question and answer candidate item, choose.
For example, if search information is " No. two line last buses of subway constantly ", the question and answer that obtain are to being " No. two line last buses of Shanghai Underground constantly; 23:00 ", " No. two line last buses of Beijing Metro constantly; 23:00 ", " No. two line last buses of Guangzhou Underground constantly; 23:00 " etc., above-mentioned question and answer are compared to the problem of candidate item, extract incoherent vocabulary and be " Shanghai ", " Beijing ", " Guangzhou " etc., in the vocabulary typelib, search and obtain above-mentioned uncorrelated vocabulary type and be " place ", and " place " pairing user related information is an IP address, the network equipment 2 is according to IP address, judge that the user location is Shanghai, and then select question and answer " No. two line last buses moment of Shanghai Underground, 23:00 ".
Wherein, if question and answer are to being to store in the mode of four-tuple, then described comparison can further be limited to the comparison to entity and substance feature description.
For example, search information is " No. two line last buses of subway constantly ", the network equipment 2 mates this search information and each question and answer to the entity and the substance feature description that are comprised, obtain " No. two line last buses of time Shanghai Underground are 23:00 constantly ", " No. two line last buses of time Beijing Metro are 23:15 constantly ", a plurality of question and answer such as " No. two line last buses of time Guangzhou Underground are 23:30 constantly " are right, then the network equipment 2 these question and answer of analysis are right, extract the difference " Shanghai " of these question and answer to " entity " item, " Beijing ", " Guangzhou " etc., coupling obtains its type for " place " in the vocabulary typelib, " place " pairing user related information is an IP address, according to the user equipment (UE) IP position, the network equipment 2 is further chosen question and answer to " No. two line last buses of time Shanghai Underground are 23:00 constantly " above-mentioned question and answer centering then.
2), judge how to choose question and answer to candidate item according to the user preference setting that obtains.
For example, in being provided with, user preference is set in search interface when only presenting the answer information of some, if the network equipment 2 mates the right number of coupling question and answer that obtains and surpasses this setting quantity in that search information and each question and answer are described the entity that comprised and substance feature, then to leave out unnecessary question and answer right for the network equipment 2, and to set the question and answer of quantity in being provided with right and only keep user preference.
And for example, can set during user preference is provided with when obtaining a plurality of question and answer to candidate item, it is right to choose the maximum question and answer of answer.
For another example, if question and answer also can be provided with the priority of each question and answer classification to quadruple form storage in the user preference, as the question and answer of preferentially choosing which classification to candidate item etc.
3), judge how to choose question and answer to candidate item according to the user search historical record that obtains.
Particularly, the network equipment 2 is compared each the right problem of question and answer that obtains, extract incoherent vocabulary, and described incoherent vocabulary and user's search information is mated, the question and answer of selecting the vocabulary place that matching degree is the highest in the user search information are to candidate item.Wherein, the factor of judgment of described matching degree includes but not limited to: the search time of the quantity of the vocabulary that is complementary, the vocabulary that is complementary, matched frequency etc.
For example, if user's search information be " the niciest river system ", the question and answer that coupling obtains are to reaching " the niciest river is the restaurant, YYY " etc., the network equipment 2 extraction incoherent vocabulary " dish ", " restaurant " etc. for " the niciest river is a dish, XXX ".Follow the search history record of the network equipment 2 analysis user, obtain the quantitatively more vocabulary that comparatively mate in " dining room ", " restaurant ", " hotel " etc. and " restaurant ", then the network equipment 2 select to comprise " restaurant " question and answer to " the niciest river is the restaurant, YYY ".
Step s207 to s210 comprises at this by reference with step s105 to s108 is same or similar as described in Figure 1, repeats no more.
Fig. 3 for according to the present invention another preferred embodiment be used for present the method flow diagram of searching for answer information at search interface.It illustrates the network equipment 2 and at first obtains the search information by subscriber equipment 1 input from the user, when question and answer exist in to the storehouse relative search information more refinement question and answer to the time, can match a plurality of question and answer to as question and answer to candidate item.So the network equipment 2 will according to the user further further to choose one or more question and answer above-mentioned a plurality of question and answer centerings alternately right, obtain answer information at last and in search interface, present to the user.
Step s301 to s304 comprises at this by reference with step s101 to s104 is same or similar as described in Figure 1, repeats no more.
In step s305, the network equipment 2 to candidate item, obtains corresponding options according to the question and answer that obtain.
Particularly, described options can be according to each question and answer to the problem of candidate item or be used to the entity of the problem of describing and substance feature is described and generated, or directly with each question and answer to the problem of candidate item options as correspondence.
For example, when inputted search information is " Mao Zedong's birth ", the network equipment 2 is described described search information and each question and answer will obtain question and answer to candidate item " Dec 26 1893 time Mao Zedong date of birth ", " Shaoshan, Hunan, Mao Zedong birthplace, place " to the entity that comprised and substance feature after mating, and the network equipment 2 extracts above-mentioned question and answer respectively and the entity of candidate item and substance feature are described is combined into options " Mao Zedong's date of birth ", " Mao Zedong's place of birth ".
In step s306, the network equipment 2 sends to subscriber equipment 1 with above-mentioned two options.Then, in step s307, subscriber equipment 1 is presented to the user with above-mentioned options.In step s308, subscriber equipment 1 obtains user-selected options.Subsequently, in step s309, subscriber equipment 1 sends to the network equipment 2 with user-selected options.At last, in step s310, the network equipment 2 selects its corresponding question and answer right according to this options.
Step s311 to s314 and the described step of Fig. 1 s207 to s210 are same or similar, comprise by reference at this, repeat no more.
Preferably, in step s304, the network equipment 2 judge whether earlier from described search information, to extract comprise the entity of problem and substance feature is described, describe if can from described search information, extract entity and substance feature, described entity that can extract and substance feature description are mated the entity and the substance feature description that are comprised with described each question and answer; If the network equipment 2 can not extract the entity and the substance feature that comprise problem and describe these two, and can only extract one of above-mentioned two, the network equipment 2 will be earlier described in two wherein definite question and answer centering coupling at entity or substance feature, obtain a plurality of coupling question and answer to as question and answer to candidate item.
For example, when search information is " Mao Zedong's birth ", the network equipment 2 according to Entity recognition technology and proper noun recognition technology can only from described search information, extract comprise problem entity be " Mao Zedong ", describe and can not extract its substance feature.The network equipment 2 be that the substance feature of the question and answer centering of " Mao Zedong " is described coupling with " birth " and entity, obtains substance feature and describes two of comprising " birth " and mate question and answer to " Dec 26 1893 time Mao Zedong date of birth " and " Shaoshan, Hunan, Mao Zedong birthplace, place ".
Fig. 4 illustrates the above-mentioned network equipment 2 and generates the right method flow diagram of question and answer according to the question and answer content from webpage.
Particularly, in step s401, the network equipment 2 obtains the website that may contain the question and answer content information from default storehouse, website, knows, searches and ask etc. as Baidu.Then, in step s402, the network equipment 2 can adopt modes such as Web Spider, web crawlers, grasp the web page contents that may contain the question and answer content information in this website, and the above-mentioned web page contents that may contain the question and answer content information analyzed, according to the position of web page code decision problem, and extract problem.Mark such as code " title " can appear in the html format sources code of webpage for example on its question text one hurdle, the network equipment 2 obtains the question text information on the above-mentioned relevant position, has promptly obtained the problem of the question and answer content of this webpage.
Then, the network equipment 2 obtains the answer of this question and answer content.The method of obtaining answer includes but not limited to following at least one:
1) in this question and answer content whether the optimum answer that is identified is arranged, have then with of the answer of this optimum answer as this question and answer content;
For example, whether analyzing web page has " optimum answer " this hurdle in the question and answer content during the Baidu that judgement gets access to is known." optimum answer " represented the pairing answer of this problem.
2) with clicking rate or positive rating are the highest in all answers of this question and answer content answer answer as this question and answer content.
For example, analyze and to obtain certain answer and be subjected to online friend " top " and " favorable comment " at most, judge that then this answer is the answer of this problem.
Subsequently, in step s403, whether the question and answer content of obtaining among the network equipment 2 determining step s402 is determinacy question and answer contents, and "Yes" is then carried out step s405; "No" is carried out step s404, promptly gives up this question and answer content.
Judge that whether the question and answer content is that the method for determinacy question and answer content includes but not limited to:
1) at first by in the vocabulary typelib, searching the classification of this question and answer content of matching judgment, when the classification that prestores in the vocabulary typelib can be with its coupling, again according to Entity recognition technology and proper noun recognition technology discern problem in the described question and answer content entity, judge that the substance feature of this entity describes.When the classification of described question and answer content, entity and substance feature are described and after answer determined by above-mentioned steps, shown that promptly described question and answer content is determinacy question and answer contents;
2) by proper noun recognition technology and Entity recognition technology, judge whether to identify the substance feature description of entity and this entity, if can, judge that then this question and answer content is determinacy question and answer contents, and further judge the classification of this question and answer content.
Need to prove, those skilled in the art should understand that, the method for the determinacy question and answer of judging whether is not with the above-mentioned limit that is exemplified as, in fact, any basis whether can extract entity and substance feature is described, or can not extract the description of entity and substance feature and judge the question and answer classification, judge whether the question and answer content is the method for determinacy question and answer content, all should be within the scope of the present invention.
What need further specify is, those skilled in the art should understand that, judge when whether the question and answer content is determinacy question and answer content, can only judge that therefore, the order of step s403 can be before step s402 by the problem in the question and answer content, accordingly, in step s402, only, just extract answer judging that this question and answer content is under the situation of determinacy question and answer content.
At last, in step s405, the network equipment 2 is described according to classification, entity and the substance feature of the problems referred to above that obtain in the above-mentioned steps, and in conjunction with the answer information in the above-mentioned question and answer content information, the question and answer that generate four-tuple structure (" the class instance substance feature is described answer ") are right.
Fig. 5 illustrates the above-mentioned network equipment 2 and generates the right method flow diagram of question and answer according to the data from the encyclopaedia webpage.
In step s501, the network equipment 2 can obtain the web page address of network encyclopaedia character by the internet, as Baidu's encyclopaedia, wikipedia etc., thereby obtains encyclopaedia data in such website.The network equipment 2 also can be uploaded or the mode of network is obtained the encyclopaedia data by this locality.
Then, in step s502, the network equipment 2 can be judged the entry and the entry explanation of encyclopaedia data by the network encyclopaedia webpage that obtains among the step s501 is analyzed by web page code.For example in the relevant position of webpage html source code, obtain the theme entry of these encyclopaedia data and the entry explanation of described entry etc.; Also can obtain the theme entry of encyclopaedia data and the entry explanation of described entry etc. by parsing (template analysis judgment etc.) to the encyclopaedia data uploaded.
Then, in step s504, the network equipment 2 as entity, and explains that according to entry and entry generating substance feature describes with entry.According to described entity and substance feature the generation problem is described then.For example for " Mao Zedong " this entry, the network equipment 2 will generate its corresponding substance feature and be described as " life ", and generation problem " Mao Zedong's life "; For " favourable trade balance " this entry, the network equipment 2 will generate substance feature and be described as " implication " etc., generation problem " what implication of favourable trade balance is " etc.
At last, in step s505, the network equipment 2 is according to step s503 and the resultant problem of s504, and is interpreted as answer with the entry that is obtained among the step s502, and the question and answer that generate this encyclopaedia data correspondence are right.
Preferably, between described step s502 and the step s504, also comprise step s503 (figure does not show).In step s503, the network equipment 2 is explained according to the entry of above-mentioned entry and described entry and search coupling in the vocabulary typelib, explains its corresponding class with the entry of determining above-mentioned entry and described entry.For example for " Mao Zedong " these encyclopaedia data, the network equipment 2 is searched " Mao Zedong " this entry in the vocabulary typelib, and judges that this entry and entry explain that corresponding class should be " personage ".Accordingly, in step s504, the network equipment 2 as entity, and explains that according to entry and entry corresponding class generates substance feature and describes with entry.For example for " Mao Zedong " this entry, the network equipment 2 will generate its corresponding substance feature and be described as " life "; For " favourable trade balance " this entry, the network equipment 2 will generate substance feature and be described as " implication " etc.In step s505, the network equipment 2 is described according to step s503 and the resulting entry classification of s504, entity and substance feature, and the entry that is obtained among the integrating step s502 is interpreted as answer, and the question and answer that generate this encyclopaedia data correspondence are right.
Need to prove,, in the process of question and answer, can carry out normalized entity and substance feature description to generation or coupling as one of preferred version of the present invention.
Wherein, described normalized includes but not limited to:
1) will belong to a plurality of entities in the same synonym phrase or substance feature and describe to describe with one of them entity or substance feature and explain, wherein, described synonym phrase is stored in the thesaurus;
For example, in the process of question and answer to generation, obtain the right entity of these question and answer and be " Chairman Mao ", substance feature is " date of birth ", and question and answer to the process that generates in or after this process, the network equipment 2 is searched in thesaurus and is obtained " Chairman Mao " and be included in the synonym phrase, this synonym phrase all with " Mao Zedong " as unified description, then in the right generative process of question and answer, entity " Chairman Mao " is normalized to entity " Mao Zedong "; In matching process or after the coupling, the network equipment 2 is searched in thesaurus and is obtained " date of birth " and be included in the synonym phrase, and this synonym phrase all with " birthday " as unified description, then in the right generative process of question and answer, substance feature description " date of birth " is normalized to substance feature describes " birthday ".
Again for example, in the process that search information is mated, the entity that obtains this search information is " Chairman Mao ", substance feature is " date of birth ", and in the process of coupling or after this process, the network equipment 2 is searched in thesaurus and is obtained " Chairman Mao " and be included in the synonym phrase, this synonym phrase all with " Mao Zedong " as unified description, then with search information and question and answer to the process of mating in, entity " Chairman Mao " is normalized to entity " Mao Zedong " mates; In matching process or after the coupling, the network equipment 2 is searched in thesaurus and is obtained " date of birth " and be included in the synonym phrase, and this synonym phrase all with " birthday " as unified description, then with search information and question and answer to the process of mating in, substance feature is described " date of birth " is normalized to substance feature and describes " birthday " and mate.
2) similarity is surpassed the entity of predetermined threshold is unified to be identical entity, the substance feature that similarity is surpassed predetermined threshold is described and is unifiedly described for identical substance feature.
The similarity that it will be understood by those skilled in the art that the description of entity or substance feature can be calculated in several ways, and for example, by similar part proportion, or the default pairing numerical value of the different range to aforementioned similar part proportion is determined.In addition, those skilled in the art should be rule of thumb or actual demand determine aforementioned predetermined threshold, do not do at this and give unnecessary details.
Need to prove, as one of preferred version of the present invention, comprise that also the question and answer that 2 pairs of couplings of the network equipment obtain analyze the answer information that is comprised, judge whether described answer information comprises answer and obtain information, if described answer information comprises answer and obtains information, the network equipment 2 obtains information according to this answer, calls the step that obtains corresponding answer by api interface.Wherein, above-mentioned answer is obtained information and is included but not limited to 1) webpage url link and the particular location of answer in this webpage; 2) customizing messages that obtains from special interface.Information is obtained in above-mentioned answer can be by artificial default.
For example, certain user is in inputted search information on August 31st, 2010 " Shanghai weather condition ", it is right that the network equipment 2 obtains corresponding question and answer according to described search information coupling, comprise the network address of certain webpage and the positional information of content in this webpage that hope is obtained in the right answer information of these question and answer, this positional information includes but not limited to position range or the of living in module of described content in this webpage, then the network equipment 2 judges that these question and answer comprise answer to the answer information that is comprised and obtain information, then, the network equipment 2 calls corresponding webpage by api interface, and on this webpage according to described positional information, grasp and wish that the content of obtaining is " Shanghai; on August 31st, 2010; 25~29 ℃; drizzle to moderate rain; southeaster 4-5 level ", and it is presented to the user as answer, the appearance form of this answer includes but not limited to comprise the textual form or the picture form of above-mentioned answer.
According to a further aspect in the invention, if subscriber equipment 1 is downloaded to this locality according to the question and answer of Fig. 4 and generation embodiment illustrated in fig. 5 to storehouse and default vocabulary typelib with the network equipment 2, subscriber equipment 1 can independently be finished the function as Fig. 1, Fig. 2 and embodiment shown in Figure 3.
Particularly, be with the difference of embodiment shown in Figure 1, in the present embodiment, after subscriber equipment 1 obtains the search information of user's input, need not to send to the network equipment 2, but directly carry out with described step s104, described step s105 and described step s106 in the performed same or analogous step of the network equipment 2, after obtaining to have comprised the searched page of answer information, directly present to the user.Wherein, what the network equipment 2 performed all operations all can be same among the s104 of step described in Fig. 1, described step s105 and the described step s106 is finished by subscriber equipment, comprises in this mode with usefulness, repeats no more.
Be with the difference of embodiment shown in Figure 2, in the present embodiment, after subscriber equipment 1 obtains the search information of user's input, need not to send to the network equipment 2, but directly carry out with described step s204 in the performed same or analogous step of the network equipment 2, obtain question and answer to candidate item, subsequently, subscriber equipment 1 directly obtains user related information from this locality, carry out and described step s206 again, the same or analogous step that the network equipment 2 is done among described step s207 and the described step s208, after acquisition has comprised the searched page of answer information, directly present to the user.Wherein, what the network equipment 2 performed all operations all can be same among the s204 of step described in Fig. 2, described step s206, described step s207 and the described step s208 is finished by subscriber equipment 1, comprises in this mode with usefulness, repeats no more.
Be with the difference of embodiment shown in Figure 3, in the present embodiment, after subscriber equipment 1 obtains the search information of user's input, need not to send to the network equipment 2, but directly carry out with described step s304 and described step s305 in the performed same or analogous step of the network equipment 2, obtain options, subsequently, subscriber equipment 1 is directly presented to the user with described options, and after obtaining user's selection, directly carry out and described step s310, the same or analogous step that the network equipment 2 is done among described step s311 and the described step s312, obtained to comprise the searched page of answer information after, directly present to the user.Wherein, what the network equipment 2 performed all operations all can be same among the s304 of step described in Fig. 3, described step s305, described step s310, described step 311 and the described step s312 is finished by subscriber equipment, comprises in this mode with usefulness, repeats no more.
Fig. 6 illustrates and presents the system construction drawing of searching for answer information according to one aspect of the invention in search interface.
Wherein, subscriber equipment 1 is connected with the network equipment 2 via network, and described network includes but not limited to internet, wide area network, Metropolitan Area Network (MAN), LAN (Local Area Network), VPN network, wireless self-organization network (Ad Hoc network) etc.
Subscriber equipment 1 includes but not limited to any electronic product that carries out man-machine interaction by keyboard, telepilot, touch pad or voice-operated device with the user, for example computing machine, smart mobile phone, PDA, game machine or IPTV etc.Subscriber equipment 1 comprises first dispensing device 11, first receiving device 12 and the 4th deriving means 13.
The network equipment 2 includes but not limited to group of server that single network server, a plurality of webserver are formed or based on the cloud that is made of a large amount of computing machines or the webserver of cloud computing (Cloud Computing).Wherein, cloud computing is a kind of of Distributed Calculation, a super virtual machine of being made up of the loosely-coupled computing machine collection of a group.Wherein, the network equipment 2 comprises first deriving means 21, coalignment 22, generator 23.Described question and answer can be included in the network equipment 2 storehouse 26 and vocabulary typelib 25, also can be with the network equipment 2 physical separation but communicate to connect.Wherein, question and answer centering comprises problem and the corresponding answer of this problem, and question and answer are to comprise the right set of a large amount of question and answer to storehouse 26.Write down the corresponding relation of the type that the combination of the combination of vocabulary or vocabulary and this vocabulary or vocabulary may describe in the vocabulary typelib 25.
When the user wishes to search for, the input mode inputted search information that provides by subscriber equipment.Wherein, this input mode includes but not limited to: 1) literal input; 2) phonetic entry; 3) handwriting input.Wherein, the position of inputted search information includes but not limited in aforesaid way: the 1) search column of the page that provides of search engine; 2) searched page that provides of client; 3) search column in embedded web page or the client etc.
Particularly, the 4th deriving means 13 in the subscriber equipment 1 obtains the search information of user's input by the interactive device that any and user carry out man-machine interaction.This interactive device can be keyboard, telepilot, touch pad or voice-operated device etc.Then, first dispensing device 11 is sent to the network equipment 2 with the search information of described user's input by the internet.
Then, first deriving means 21 in the network equipment 2 obtains above-mentioned user search information, coalignment 22 with the search information that gets access to and question and answer to each question and answer of prestoring in the storehouse 26 to mating, promptly each question and answer centering search with search information in same or analogous lexical information, the one or more question and answer that are complementary with all or part of content that obtains with described search information are right.
Particularly, coalignment 22 mates the problem of search information and question and answer centering.According to search information, the situation of the right problem of one or more question and answer and this search information coupling can appear.
When search information only implied single problem, coalignment 22 can obtain the result that the full content in this search information can be complementary with the problem of one or more question and answer centerings.
For example, when search information is " Mao Zedong's birthplace is at which ", above-mentioned search information implies single problem.Coalignment 22 mates in question and answer this search information in to storehouse 26, and obtaining question and answer is " Mao Zedong birthplace " to the problem in " Mao Zedong birthplace, Shaoshan, Hunan ", and the problem that these question and answer are right and the full content of search information are complementary.
When the implicit a plurality of problem of search information, coalignment 22 can mate that to obtain a plurality of question and answer right, and the partial content in the problem that each question and answer is right and this search information is complementary.
For example, when search information is " Mao Zedong's date of birth and place ", above-mentioned search information implies two problems.Coalignment 22 mates in question and answer this search information in to storehouse 26, obtains question and answer to " Mao Zedong birthplace, Shaoshan, Hunan " and " Mao Zedong's date of birth, on Dec 26th, 1893 ".Wherein, the problem that previous question and answer are right is " Mao Zedong birthplace ", " Mao Zedong place of birth " is complementary in this problem and the search information, and the right problem of back question and answer is " Mao Zedong's date of birth ", in this problem and the search information " Mao Zedong's date of birth " be complementary.
When question and answer exist in to storehouse 26 relative search information more refinement question and answer to the time, it is right that coalignment 22 can match a plurality of question and answer.
For example, when search information is No. two, a subway " line last buses constantly ", because each No. two line final vehicle hours of city underground difference, question and answer are right to the question and answer of No. two line last buses of subway that tend to have each different cities in the storehouse 26.Particularly, coalignment 22 with above-mentioned search information and each question and answer to after mating, it is right to obtain a plurality of question and answer: " No. two line last buses of Shanghai Underground constantly; 23:00 ", " No. two line last buses of Beijing Metro constantly; 23:15 ", " No. two line last buses of Guangzhou Underground constantly; 23:30 " etc., the problem that above-mentioned question and answer are right all can be mated with the full content of search information.
Need to prove that above-mentioned example is only for illustrating the solution of the present invention better, but the present invention is not as limit, those skilled in the art should understand that, any according to search information, obtain the right scheme of one or more question and answer, all should be included in the scope of the present invention.
Then, generator 23 according to above-mentioned one or more coupling question and answer to extracting answer information.
When with the question and answer of search information coupling when only having one, generator 23 obtains the answer of above-mentioned question and answer centering, in conjunction with the right problem of these question and answer as answer information.
When, judging whether and the answer that each question and answer are right to integrate when a plurality of having with the question and answer of search information coupling.Generator 23 also comprises integrating apparatus (figure does not show), and to integrating semantically being associated, described integrating apparatus is integrated above-mentioned each question and answer to using the mode that meets the natural language custom, and generates answer information as if above-mentioned each question and answer.For example, for the question and answer that are complementary with search information " Mao Zedong's date of birth and place " to " Mao Zedong's date of birth; on Dec 26th, 1893 " and " Mao Zedong birthplace; Shaoshan, Hunan ", the main body of its problem is identical, integrating apparatus can be integrated it and generate an answer information " Mao Zedong is born on Dec 26th, 1893, Shaoshan, Hunan " that semantically links up; If each question and answer are to integrating, answer and the right problem of this question and answer that integrating apparatus is then right according to question and answer respectively, generate a plurality of answers unit as answer information, for example, the question and answer that search information No. two, subway " line last buses constantly " is complementary to " No. two line last buses of Shanghai Underground constantly; 23:00 ", " No. two line last buses of Beijing Metro constantly, 23:15 ", " No. two line last buses of Guangzhou Underground constantly, 23:30 " generate such as " No. two line last buses of Shanghai Underground are 23:00 constantly; No. two line last buses of Beijing Metro are 23:15 constantly; No. two line last buses of Guangzhou Underground are 23:30 constantly " answer information.
What need further specify is, the network equipment 2 can comprise the right generation of compound question and answer in the process that generates answer information, be also to comprise integrating apparatus (figure does not show) in the generator 23, it is right that described integrating apparatus can be combined a plurality of question and answer that obtain according to search information to generate new compound question and answer, for example, can generate question and answer to " Mao Zedong's date and place of birth, Shaoshan, Hunan on the 26th Dec in 1893 " to " Mao Zedong birthplace; Shaoshan, Hunan " and question and answer to " Mao Zedong's date of birth; on Dec 26th, 1893 " according to the question and answer that obtain.
Then, generator 23 sends to subscriber equipment 1 with described answer information.First receiving device 12 upgrades the page according to the described answer information that receives, and presents to the user with the page that described answer information is incorporated after the renewal.
Preferably, the network equipment 2 also comprises page refreshment device (figure does not show), and the answer information that described page refreshment device provides according to described generator 23 is upgraded searched page, so that answer information is dissolved in the searched page of renewal.
Wherein the position that presents in the page of answer information includes but not limited to following at least one:
-Search Results article one for example is presented on the result article one in the link that obtains according to search information;
-search suggestion for example is presented on the result in the position of the search key that the user offers suggestions;
-input method candidate bar for example, is presented on the result in the option of user's input method;
-search column candidate item hurdle for example, is presented on the result in the drop-down hurdle that the search candidate item is provided that search column lists;
Candidate item hurdle under the-WEB input field for example, is presented on the result on the drop-down hurdle that is used for listing candidate item in the WEB input field.
Need to prove that above-mentioned example is only for illustrating the solution of the present invention better, but the present invention is not as limit, those skilled in the art should understand that, any searched page is handled,, all should be included in the scope of the present invention answer information is included in the scheme in the searched page.
Accordingly, the page of page refreshment device after with the described renewal that comprises answer information sends to subscriber equipment 1.First receiving device 12 is presented to the user after receiving the page that comprises answer information after the described renewal.
Preferably, each question and answer is to being with the frame mode storage of a four-tuple in to storehouse 26 in question and answer.Question and answer comprise question and answer classification, entity, substance feature description, the tetrameric four-tuple of answer to being expressed as.Wherein, the question and answer classification represents that classification (including but not limited to: time, place, the product performance etc.) entity of this question and answer centering problem represents the object (including but not limited to: name, place name, product, incident, proper noun etc.) that question and answer centering is putd question to; Substance feature is described the content that the described object of understanding is wished in expression; Answer is the right answer of these question and answer.For example question and answer are to " Shaoshan, Hunan, Mao Zedong birthplace, place ", wherein " place " is the right question and answer classifications of these question and answer, " Mao Zedong " is the right entities of these question and answer, and " birthplace " is that the right substance feature of these question and answer is described, and " Shaoshan, Hunan " is the right answers of these question and answer., be convenient to store to can be with four-tuple storage question and answer with relational database etc. with question and answer to the data structuring.Therefore, coalignment 22 with search information and each question and answer of prestoring to coupling further tool turn to search information and described each question and answer that prestore mated the entity and the substance feature description that are comprised, obtaining the description of described entity and described substance feature all can be right with one or more question and answer of all or part of content match of described search information.
For example, when search information is " Mao Zedong's birthplace is at which ", coalignment 22 is described coupling with above-mentioned search information and described each question and answer that prestore to the entity and the substance feature that are comprised, obtain question and answer to " Shaoshan, Hunan, Mao Zedong birthplace, place ", the full content coupling of its entity " Mao Zedong ", feature description " birthplace " and described search information.It is right so just to have obtained question and answer being complementary with the search information full content.
And for example, when search information is " Mao Zedong's date of birth and place ", above-mentioned search information implies two problems, coalignment 22 is described above-mentioned search information in to storehouse 26 in question and answer and is mated with right entity of question and answer and substance feature, obtain question and answer to " Dec 26 1893 time Mao Zedong date of birth " and question and answer to " Shaoshan, Hunan, Mao Zedong birthplace, place ", wherein, above-mentioned two question and answer are to entity " Mao Zedong ", feature description " date of birth " reach " birthplace " respectively with the partial content " Mao Zedong's date of birth " of described search information and Mao Zedong's birthplace " be complementary.
Fig. 7 illustrates the synoptic diagram according to the coalignment 22 of a preferred embodiment of the network equipment 2 of the present invention.Wherein coalignment 22 also comprises first judgment means, 221, the first sub-coalignment 222, second judgment means, 227, the second sub-coalignment 223, the 3rd judgment means the 228, the 3rd sub-coalignment 225.Wherein first judgment means 221 and the first sub-coalignment 222, second judgment means 227 and the second sub-coalignment 223, the 3rd judgment means 228 and the 3rd sub-coalignment 225 are combined as three covering devices respectively.
Preferably, to before mating, first judgment means 221 also will be judged the question and answer classification that described search information comprises in the coalignment 22 with search information and each question and answer.The question and answer classification is first in the above-mentioned four-tuple, and the question and answer classification is represented the classification of question and answer centering problem.For example " Dec 26 1893 time Mao Zedong date of birth " belong to the question and answer classification of time character; " Shaoshan, Hunan, Mao Zedong birthplace, place " belongs to the question and answer classification of place character.Particularly, when search information is " Mao Zedong's date of birth and place ", first judgment means 221 is mated above-mentioned search information in vocabulary typelib 25, type-" place " that type-" time " that this vocabulary combination that acquisition search information partial content " goes out the birthday " is described and this vocabulary combination of search information partial content " birthplace " are described.Then first judgment means 221 is judged the question and answer classification of above-mentioned two types of respectively corresponding time character and the question and answer classification of place character.The first sub-coalignment 222 utilizes question and answer to storehouse 26 then, described search information is interrogated the question and answer centering that two question and answer classifications answering classification are comprised in the question and answer classification of its above-mentioned time character that comprises and location respectively mate, obtain location and interrogate the question and answer answered in the classification " Shaoshan, Hunan, Mao Zedong birthplace, place " and timeliness interrogation are answered question and answer in the classification to " Dec 26 1893 time Mao Zedong date of birth ".Wherein, the problem that previous question and answer are right is " Mao Zedong birthplace ", " Mao Zedong place of birth " is complementary in this problem and the search information, and the right problem of back question and answer is " Mao Zedong's date of birth ", in this problem and the search information " Mao Zedong's date of birth " be complementary.
Preferably, with search information and each question and answer to before mating, second judgment means 227 judge whether earlier from described search information, to extract comprise the entity of problem and substance feature is described, describe if can extract entity and substance feature from described search information, second judgment means 227 is with the described entity that can extract and substance feature is described and described each question and answer are mated the entity and the substance feature description that are comprised.For example, when search information is " Mao Zedong's birthplace is at which ", second judgment means 227 judges that according to Entity recognition technology and proper noun recognition technology can extract entity and the substance feature that institute comprises problem from described search information describes, and the entity of discerning problem in the described search information is that " Mao Zedong ", substance feature are described as " birthplace ".Then, the second sub-coalignment 223 mates with each question and answer the entity " Mao Zedong " and the substance feature description " birthplace " that extract to the entity and the substance feature description that are comprised, the question and answer that obtain to be complementary are to " Shaoshan, Hunan, Mao Zedong birthplace, place ".
Preferably, with search information and each question and answer to before mating, in the coalignment 22 the 3rd judgment means 228 judge whether earlier from described search information, to extract comprise the entity of problem and substance feature is described, and whether can judge the classification of problem that this search information comprises.If can extract entity and substance feature from described search information describes, and can judge the classification of problem that this search information comprises, the 3rd sub-coalignment 225 classifications with problem that this search information comprises, entity and substance feature are described with classification, entity and the substance feature of each question and answer centering and are described coupling respectively.For example, when search information is " Mao Zedong's date of birth ", the 3rd judgment means 228 is judged according to Entity recognition technology and proper noun recognition technology and can be extracted classification, entity and substance feature that institute comprises problem from described search information to describe and discern the classification of problem in the described search information be " time ", entity is " Mao Zedong ", and judges that the substance feature of this entity is described as " date of birth ".Then, the problem category " time ", entity " Mao Zedong " and the substance feature that extract are described " date of birth " with the 3rd sub-coalignment 225 and each question and answer are mated the problem category, entity and the substance feature description that are comprised, and the question and answer that obtain to be complementary are to " Shaoshan, time Mao Zedong date of birth Hunan ".
Fig. 8 illustrates the synoptic diagram according to the coalignment 22 of another preferred embodiment of the network equipment 2 of the present invention.Wherein coalignment 22 also comprises the 4th sub-coalignment 223, selecting arrangement the 224, the 5th sub-coalignment 225 and interactive device 226.
Preferably, in the coalignment 22 the 4th sub-coalignment 229 with described search information and each question and answer of prestoring to mating, when question and answer exist in to storehouse 26 relative search information more refinement question and answer to the time, can match a plurality of question and answer to as question and answer to candidate item.Then further to choose one or more question and answer in above-mentioned a plurality of question and answer in to candidate item according to the user related information that gets access to right for selecting arrangement 224.Wherein user related information includes but not limited to: 1) individual subscriber attribute (including but not limited to: IP address, subscriber equipment classification, user's sex age etc.); 2) user preference setting; 3) user search historical record etc.
Wherein, described relevant information obtain the following mode that includes but not limited to:
1) network equipment 2 directly obtains the user related information of subscriber equipment 1 transmission and offers selecting arrangement 224;
2) network equipment 2 obtains the user's that subscriber equipment 1 sends identity or identifying information and offers selecting arrangement 224, and selecting arrangement 224 obtains the relevant information that is recorded in this user in the network equipment 2 according to this identity or identifying information;
3) selecting arrangement 224 is set up the information (as the cell-phone number of the subscriber equipment of acquisition or hardware sequence number etc.) of the subscriber equipment 1 that obtains when communicate by letter according to the network equipment 2 and described subscriber equipment 1, judge user's identity, and obtain this user's relevant information according to this identity.
Then, further to choose one or more question and answer in to candidate item right for a plurality of question and answer of obtaining in the 4th sub-coalignment 229 according to the user related information that obtains of selecting arrangement 224.Wherein, describedly choose one or more question and answer right method comprise following at least a:
1) particularly, selecting arrangement 224 is compared the question and answer that obtain to each problem of candidate item, extract incoherent vocabulary, and described incoherent vocabulary searched in described vocabulary typelib, obtain the type of described uncorrelated vocabulary, and extract or obtain corresponding user related information according to the type, in to the question and answer candidate item, choose.
For example, if search information is " No. two line last buses of subway constantly ", the question and answer that obtain are " No. two line last buses of Shanghai Underground constantly; 23:00 " to candidate item, " No. two line last buses of Beijing Metro constantly; 23:00 ", " No. two line last buses of Guangzhou Underground constantly; 23:00 " etc., selecting arrangement 224 is compared above-mentioned question and answer to the problem of candidate item, extract incoherent vocabulary and be " Shanghai ", " Beijing ", " Guangzhou " etc., in vocabulary typelib 25, search then and obtain above-mentioned uncorrelated vocabulary type and be " place ", and " place " pairing user related information is an IP address, selecting arrangement 224 is according to IP address, judge that the user location is Shanghai, and then select question and answer " No. two line last buses moment of Shanghai Underground, 23:00 ".
Wherein, if question and answer are to being to store in the mode of four-tuple, then described comparison can further be limited to the comparison to entity and substance feature description.
For example, search information is " No. two line last buses of subway constantly ", the 4th sub-coalignment 229 mates this search information and each question and answer to the entity and the substance feature description that are comprised, obtain " No. two line last buses of time Shanghai Underground are 23:00 constantly ", " No. two line last buses of time Beijing Metro are 23:15 constantly ", a plurality of question and answer such as " No. two line last buses of time Guangzhou Underground are 23:30 constantly " are right, then selecting arrangement 224 these question and answer of analysis are right, extract the difference " Shanghai " of these question and answer to " entity " item, " Beijing ", " Guangzhou " etc., and coupling obtains its type for " place " in vocabulary typelib 25, " place " pairing user related information is an IP address, according to the user equipment (UE) IP position, selecting arrangement 224 is further chosen question and answer to " No. two line last buses of time Shanghai Underground are 23:00 constantly " above-mentioned question and answer centering then.
2), judge how to choose question and answer to candidate item according to the user preference setting that obtains.
For example, in being provided with, user preference is set in search interface when only presenting the answer information of some, if the 4th sub-coalignment 229 mates the right number of coupling question and answer that obtains and surpasses this setting quantity in that search information and each question and answer are described the entity that comprised and substance feature, then to leave out unnecessary question and answer right for selecting arrangement 224, and to set the question and answer of quantity in being provided with right and only keep user preference.
And for example, can set during user preference is provided with when obtaining a plurality of question and answer to candidate item, it is right to choose the maximum question and answer of answer.
For another example, if question and answer also can be provided with the priority of each question and answer classification to quadruple form storage in the user preference, as the question and answer of preferentially choosing which classification to candidate item etc.
3), judge how to choose question and answer to candidate item according to the user search historical record that obtains.
Particularly, selecting arrangement 224 is compared each the right problem of question and answer that obtains, extract incoherent vocabulary, and described incoherent vocabulary and user's search information is mated, the question and answer of selecting the vocabulary place that matching degree is the highest in the user search information are to candidate item.Wherein, the factor of judgment of described matching degree includes but not limited to: the search time of the quantity of the vocabulary that is complementary, the vocabulary that is complementary, matched frequency etc.
For example, if user's search information be " the niciest river system ", the question and answer that coupling obtains are to reaching " the niciest river is the restaurant, YYY " etc., selecting arrangement 224 extraction incoherent vocabulary " dish ", " restaurant " etc. for " the niciest river is a dish, XXX ".Then selecting arrangement 224 obtains the quantitatively more vocabulary that comparatively mate in " dining room ", " restaurant ", " hotel " etc. and " restaurant " according to user's search history record, therefore selecting arrangement 224 is selected to comprise the question and answer of vocabulary " restaurant " to " the niciest river is the restaurant, YYY ".
Preferably, in the coalignment 22 the 5th sub-coalignment 230 with described search information and each question and answer of prestoring to mating, when question and answer exist in to storehouse 26 relative search information more refinement question and answer to the time, can match a plurality of question and answer to as question and answer to candidate item.So interactive device 226 will with the user further and further to choose one or more question and answer according to user's feedback information above-mentioned a plurality of question and answer centerings right.
Particularly, the question and answer of interactive device 226 basis acquisition in the 5th sub-coalignment 230 are earlier obtained corresponding options to candidate item.Wherein, described options can be according to each question and answer to the problem of candidate item or be used to the entity of the problem of describing and substance feature is described and generated, or directly with each question and answer to the problem of candidate item options as correspondence.
For example, when inputted search information is " Mao Zedong's birth ", the 5th sub-coalignment 230 is described with each question and answer described search information in to storehouse 26 in question and answer will obtain question and answer to candidate item " Dec 26 1893 time Mao Zedong date of birth ", " Shaoshan, Hunan, Mao Zedong birthplace, place " to the entity that comprised and substance feature after mating, and interactive device 226 extracts above-mentioned question and answer respectively and the entity of candidate item and substance feature are described is combined into options " Mao Zedong's date of birth ", " Mao Zedong's place of birth ".
Then, interactive device 226 sends to subscriber equipment 1 with above-mentioned two options, subscriber equipment 1 is presented to the user for its selection with options, after the user has selected one of them options, subscriber equipment 1 sends to interactive device 226 by the internet with this options, then, interactive device 226 obtains above-mentioned user-selected options, and selects its corresponding question and answer right according to this options.
It should be noted that coalignment 22 can at least one carries out combination in any among both by selecting arrangement 224 among at least one covering device and Fig. 8 in three covering devices shown in Fig. 7, interactive device 226, to realize further function.For example, coalignment 22 is constituted with interactive device 226 by second judgment means 227 and the second sub-coalignment 223; Describe when judge entity and substance feature by second judgment means 227, and by the second sub-coalignment, 223 couplings obtain corresponding a plurality of question and answer to the time, interactive device 226 obtains with these a plurality of question and answer corresponding options is offered the user, and it is right to obtain the corresponding question and answer of selecting with the user of options.Those skilled in the art are to be understood that, the present invention is not with the above-mentioned limit that is exemplified as, in fact, any with at least one cover combination at least one cover in described three covering devices and selecting arrangement 224 and the interactive device 226 to select the optimum right scheme of question and answer, all within the scope of the present invention.
Fig. 9 illustrates the network equipment 2 according to generating the right apparatus structure synoptic diagram of question and answer from the question and answer content of webpage and the data of encyclopaedia webpage.Wherein, second deriving means 27 also comprises the 4th judgment means (figure does not show), and first generating apparatus 28 also comprises the first sub-generating apparatus (figure does not show).
Particularly, it is right that the network equipment 2 can generate question and answer according to the content from webpage, and the process that these generation question and answer are right relates to second deriving means 27 and first generating apparatus 28.Second deriving means 27 obtains the website that may contain the question and answer content information from default storehouse, website, know, search and ask etc. as Baidu.Then, second deriving means 27 can adopt modes such as Web Spider, web crawlers, grasp the web page contents that may contain the question and answer content information in this website, and the above-mentioned web page contents that may contain the question and answer content information analyzed, according to the position of web page code decision problem, and extract problem.Mark such as code " title " can appear in the html format sources code of webpage for example on its question text one hurdle, second deriving means 27 obtains the question text information on the above-mentioned relevant position, has promptly obtained the problem of the question and answer content of this webpage.
Then second deriving means 27 obtains answer from described question and answer content.The method of obtaining answer includes but not limited to following at least one:
1) in this question and answer content whether the optimum answer that is identified is arranged, have then with of the answer of this optimum answer as this question and answer content;
For example, whether analyzing web page has " optimum answer " this hurdle in the question and answer content during the Baidu that judgement gets access to is known." optimum answer " represented the pairing answer of this problem.
2) in all answers of this question and answer content, clicking rate or the highest answer of positive rating.
For example, analyze and to obtain certain answer and be subjected to online friend " top " and " favorable comment " at most, judge that then this answer is the answer of this problem.
Subsequently, the 4th judgment means judges whether the question and answer content of obtaining is determinacy question and answer contents, and the first sub-generating apparatus that "Yes" then offers this question and answer content in first generating apparatus 28 is right to generate question and answer; "No" is then given up this question and answer content.
The 4th judgment means judges that whether the question and answer content is that the method for determinacy question and answer content includes but not limited to:
1) at first by in the vocabulary typelib, searching the classification of this question and answer content of matching judgment, when the classification that prestores in the vocabulary typelib can be with its coupling, again according to Entity recognition technology and proper noun recognition technology discern problem in the described question and answer content entity, judge that the substance feature of this entity describes.When the classification of described question and answer content, entity and substance feature are described and after answer determined by above-mentioned steps, shown that promptly described question and answer content is determinacy question and answer contents;
2) by proper noun recognition technology and Entity recognition technology, judge whether to identify the substance feature description of entity and this entity, if can, judge that then this question and answer content is determinacy question and answer contents, and further judge the classification of this question and answer content.
Need to prove, those skilled in the art should understand that, the method for the determinacy question and answer of judging whether is not with the above-mentioned limit that is exemplified as, in fact, any basis whether can extract entity and substance feature is described, or can not extract the description of entity and substance feature and judge the question and answer classification, judge whether the question and answer content is the method for determinacy question and answer content, all should be within the scope of the present invention.
What need further specify is, those skilled in the art should understand that, judge when whether the question and answer content is determinacy question and answer content, can only judge by the problem in the question and answer content, therefore, the 4th judgment means can judge promptly before second deriving means 27 obtains answer whether question and answer are the determinacy question and answer, and is corresponding, second deriving means 27 only judges that in the 4th judgment means this question and answer content is under the situation of determinacy question and answer content, just extracts answer.
At last, first generating apparatus 28 is described according to classification, entity and the substance feature of the problems referred to above that obtain in the above-mentioned steps, in conjunction with the answer information in the above-mentioned question and answer content information, the question and answer that generate four-tuple structure (" the class instance substance feature is described answer ") are right.
Particularly, it is right that the network equipment 2 can generate question and answer according to the data from the encyclopaedia webpage, and the process that these generation question and answer are right relates to the 3rd deriving means 29 and second generating apparatus 30.The 3rd deriving means 29 can obtain the web page address of network encyclopaedia character by the internet, as Baidu's encyclopaedia, wikipedia etc., thereby obtains encyclopaedia data in such website.The 3rd deriving means 29 also can be uploaded or the mode of network is obtained the encyclopaedia data by this locality.
Second generating apparatus 30 can be judged the entry and the entry explanation of encyclopaedia data by the above-mentioned network encyclopaedia webpage that obtains is analyzed by web page code.For example in the relevant position of webpage html source code, obtain the theme entry of these encyclopaedia data and the entry explanation of described entry etc.; Also can obtain the theme entry of encyclopaedia data and the entry explanation of described entry etc. by parsing (template analysis judgment etc.) to the encyclopaedia data uploaded.
Second generating apparatus 30 also comprises second judgment means (figure does not show), the 3rd sub-generating apparatus (figure does not show) and the 4th sub-generating apparatus (figure does not show).The 3rd sub-generating apparatus as entity, and explains that according to entry and entry generating substance feature describes with entry.According to described entity and substance feature the generation problem is described then.For example for " Mao Zedong " this entry, the 3rd sub-generating apparatus will generate its corresponding substance feature and be described as " life ", and generation problem " Mao Zedong's life "; For " favourable trade balance " this entry, the 3rd sub-generating apparatus will generate substance feature and be described as " implication " etc., and generation problem " what implication of favourable trade balance is " etc.
At last, the 4th sub-generating apparatus is according to above-mentioned resulting problem, and is interpreted as answer with the entry that obtains in the 3rd deriving means 29, and the question and answer that generate this encyclopaedia data correspondence are right.
Preferably, before the 3rd sub-generating apparatus generation problem, second judgment means can be explained according to the entry of entry that obtains in the 3rd deriving means 29 and described entry and search coupling in vocabulary typelib 25, explain its corresponding class with the entry of determining above-mentioned entry and described entry.For example for " Mao Zedong " these encyclopaedia data, second judgment means is searched " Mao Zedong " this entry in the vocabulary typelib, and judges that this entry and entry explanation corresponding class should be " personage ".Accordingly, the 3rd sub-generating apparatus as entity, and explains that according to entry and entry corresponding class generates substance feature and describes with entry.For example for " Mao Zedong " this entry, the 3rd sub-generating apparatus will generate its corresponding substance feature and be described as " life "; For " favourable trade balance " this entry, the 3rd sub-generating apparatus will generate substance feature and be described as " implication " etc.At last, the 4th sub-generating apparatus is described according to the above-mentioned entry classification that obtains of step, entity and substance feature, is combined in the entry that is obtained in the 3rd deriving means 29 and is interpreted as answer, and the question and answer that generate this encyclopaedia data correspondence are right.
For for purpose of brevity, omitted the device that has nothing to do with the embodiment shown in this figure among Fig. 6 to Fig. 9, it should be appreciated by those skilled in the art that the network equipment 2 can comprise all devices described in Fig. 6 to Fig. 9 or the combination in any of all devices.
Need to prove, as one of preferred version of the present invention, question and answer to the process that generates in, can describe entity and substance feature and carry out normalized, it is right to generate corresponding question and answer in conjunction with the problem category in the corresponding question and answer content and answer again.
Wherein, described normalized includes but not limited to:
1) will belong to a plurality of entities in the same synonym phrase or substance feature and describe to describe with one of them entity or substance feature and explain, wherein, described synonym phrase is stored in the thesaurus;
For example, in the process of question and answer to generation, obtain the right entity of these question and answer and be " Chairman Mao ", substance feature is " date of birth ", and question and answer to the process that generates in or after this process, the network equipment 2 is searched in thesaurus and is obtained " Chairman Mao " and be included in the synonym phrase, this synonym phrase all with " Mao Zedong " as unified description, then in the right generative process of question and answer, entity " Chairman Mao " is normalized to entity " Mao Zedong "; In matching process or after the coupling, the network equipment 2 is searched in thesaurus and is obtained " date of birth " and be included in the synonym phrase, and this synonym phrase all with " birthday " as unified description, then in the right generative process of question and answer, substance feature description " date of birth " is normalized to substance feature describes " birthday ".
Again for example, in the process that search information is mated, the entity that obtains this search information is " Chairman Mao ", substance feature is " date of birth ", and in the process of coupling or after this process, the network equipment 2 is searched in thesaurus and is obtained " Chairman Mao " and be included in the synonym phrase, this synonym phrase all with " Mao Zedong " as unified description, then with search information and question and answer to the process of mating in, entity " Chairman Mao " is normalized to entity " Mao Zedong " mates; In matching process or after the coupling, the network equipment 2 is searched in thesaurus and is obtained " date of birth " and be included in the synonym phrase, and this synonym phrase all with " birthday " as unified description, then with search information and question and answer to the process of mating in, substance feature is described " date of birth " is normalized to substance feature and describes " birthday " and mate.
2) similarity is surpassed the entity of predetermined threshold is unified to be identical entity, the substance feature that similarity is surpassed predetermined threshold is described and is unifiedly described for identical substance feature.
The similarity that it will be understood by those skilled in the art that the description of entity or substance feature can be calculated in several ways, and for example, by similar part proportion, or the default pairing numerical value of the different range to aforementioned similar part proportion is determined.In addition, those skilled in the art should be rule of thumb or actual demand determine aforementioned corresponding predetermined threshold, do not do at this and give unnecessary details.
Need to prove that as one of preferred version of the present invention, the network equipment 2 also comprises the 6th judgment means (figure does not show) and calling device (figure does not show).The 6th judgment means is to analyzing the answer information of obtaining according to the coupling question and answer, judge whether described answer information comprises answer and obtain information, if described answer information comprises answer and obtains information, calling device obtains information according to this answer, calls by api interface and obtains corresponding answer.Wherein, above-mentioned answer is obtained information and is included but not limited to: 1) webpage url link and the particular location of answer in this webpage; 2) customizing messages that obtains from special interface.Information is obtained in above-mentioned answer can be by manually presetting.
For example, certain user is in inputted search information on August 31st, 2010 " Shanghai weather condition ", it is right that the network equipment 2 obtains corresponding question and answer according to described search information coupling, comprise the network address of certain webpage and the positional information of content in this webpage that hope is obtained in the right answer information of these question and answer, this positional information includes but not limited to position range or the of living in module of described content in this webpage, then the 6th judgment means judges that these question and answer comprise answer to the answer information that is comprised and obtain information, then, calling device calls corresponding webpage by api interface, and on this webpage according to described positional information, grasp and wish that the content of obtaining is " Shanghai; on August 31st, 2010; 25~29 ℃; drizzle to moderate rain; southeaster 4-5 level ", and it is presented to the user as answer, the appearance form of this answer includes but not limited to comprise the textual form or the picture form of above-mentioned answer.
According to a further aspect in the invention, if subscriber equipment 1 is downloaded to this locality according to the question and answer of generation embodiment illustrated in fig. 9 to storehouse 26 and default vocabulary typelib 25 with the network equipment 2, subscriber equipment 1 can independently be finished the function as Fig. 6, Fig. 7 and embodiment shown in Figure 8.
Particularly, be with embodiment difference shown in Figure 6, in the present embodiment, comprise the 4th deriving means 13 in the subscriber equipment 1, and further comprise with as shown in Figure 1 embodiment in the network equipment 2 first deriving means 21, coalignment 22, the same or analogous device of generator 23 functions that are comprised, but do not comprise first dispensing device 11 and first receiving device 12.After subscriber equipment 1 obtains the search information of user's input by the 4th deriving means 13, need not to send to the network equipment 2, but directly by described and first deriving means 21, coalignment 22, the same or analogous device of generator 23 functions, after acquisition has comprised the searched page of answer information, directly present to the user.
In the present embodiment, subscriber equipment 1 can comprise each device that is comprised in the coalignment 22, embodiment difference right and shown in Figure 8 is, because selecting arrangement 224 and interactive device 226 are arranged in the subscriber equipment 1, therefore selecting arrangement 224 directly obtains user related information from subscriber equipment 1, and the interactive device (as: display that interactive device 226 is directly possessed by subscriber equipment 1, touch-screen, mouse, keyboard, felt pen etc.) carry out alternately with the user, but selecting arrangement 224 how according to user related information select one or to individual question and answer to and interactive device 226 how to obtain question and answer according to user's selection right, same or similar with the embodiment shown in Fig. 8, comprise by reference at this, repeat no more; The mode of function that first judgment means, 221, the first sub-coalignment 222, second judgment means, 227, the second sub-coalignment 223, the 3rd judgment means 228 and the 3rd sub-coalignment 225 are realized in subscriber equipment 1 and realization function and embodiment shown in Figure 7 are same or similar, comprise by reference at this, repeat no more.
To those skilled in the art, obviously the invention is not restricted to the details of above-mentioned one exemplary embodiment, and under the situation that does not deviate from spirit of the present invention or essential characteristic, can realize the present invention with other concrete form.Therefore, no matter from which point, all should regard embodiment as exemplary, and be nonrestrictive, scope of the present invention is limited by claims rather than above-mentioned explanation, therefore is intended to be included in the present invention dropping on the implication that is equal to important document of claim and all changes in the scope.Any Reference numeral in the claim should be considered as limit related claim.In addition, obviously other unit or step do not got rid of in " comprising " speech, and odd number is not got rid of plural number.A plurality of unit of stating in system's claim or device also can be realized by software or hardware by a unit or device.The first, the second word such as grade is used for representing title, and does not represent any specific order.

Claims (36)

1. one kind is used for presenting the method for searching for answer information at search interface, and this method may further comprise the steps:
A obtains the search information from the user;
To mating, it is right to obtain one or more question and answer that all or part of content with described search information is complementary with described search information and each question and answer of prestoring for b;
C is right according to described one or more question and answer, provides the answer information corresponding with this search information to the user.
2. method according to claim 1 wherein, also comprises:
-the page that the user is imported described search information upgrades processing, it is updated to the renewal page that comprises described answer information.
3. method according to claim 1 and 2, wherein, described step b is further comprising the steps of:
-described search information and described each question and answer that prestore are described the entity that comprised and substance feature mate, obtaining that described entity and described substance feature describe all can be right with one or more question and answer of all or part of content match of described search information.
4. according to each described method in the claim 1 to 3, wherein, described step b is further comprising the steps of:
The question and answer classification that the described search information of-judgement comprises;
-described search information to be mated in the question and answer information that the question and answer classification that it comprises is comprised, one or more question and answer that all or part of content of acquisition and described search information is complementary are right.
5. according to claim 3 or 4 described methods, wherein, described step b is further comprising the steps of:
-judge whether from described search information, to extract entity and substance feature to describe;
-Ruo can extract entity from described search information and substance feature is described, described entity that can extract and substance feature description are mated the entity and the substance feature description that are comprised with described each question and answer, and one or more question and answer that acquisition is complementary are right.
6. method according to claim 5, wherein, described step b is further comprising the steps of:
-judge whether from described search information, to extract entity and substance feature description, and whether can judge the classification that this search information comprises;
-Ruo can extract entity from described search information and substance feature is described, and can judge the classification that this search information comprises, the described entity that can extract, substance feature description and the described classification of judging and described each question and answer are mated the question and answer classification, entity and the substance feature description that are comprised, and one or more question and answer that acquisition is complementary are right.
7. according to each described method in the claim 1 to 4, wherein, described step c is further comprising the steps of:
The question and answer that-Ruo need present are to for a plurality of, then integrate this a plurality of question and answer to the generation answer information.
8. according to each described method in the claim 1 to 7, wherein, described answer information can be presented in following at least one position:
-Search Results article one;
-search suggestion;
-input method candidate bar;
-search column candidate item hurdle;
Candidate item hurdle under the-WEB input field.
9. according to each described method in the claim 1 to 8, wherein, this method is further comprising the steps of:
E obtains the question and answer content from webpage, judges whether described question and answer content is determinacy question and answer contents;
It is right that f generates question and answer according to the content that is judged as the determinacy question and answer.
10. method according to claim 9, wherein, described step e is further comprising the steps of:
-can extract entity and substance feature description by the problem of judging described question and answer content, judge whether described question and answer content is determinacy question and answer contents;
Described step f is further comprising the steps of:
-describe according to described entity and substance feature, in conjunction with the answer of described question and answer content, it is right to generate question and answer.
11. according to each described method in the claim 1 to 8, wherein, this method is further comprising the steps of:
G obtains the encyclopaedia data;
H explains that according to corresponding entries and entry in the described encyclopaedia data generation question and answer are right.
12. method according to claim 11, wherein, described step h is further comprising the steps of:
-explain the right classification of question and answer that judgement is to be generated according to described entry and entry;
-with described entry as entity, and explain that according to described entry generating substance feature describes;
-describe and the explanation of described entry in conjunction with the right classification of described question and answer, described entity, described substance feature, it is right to generate question and answer.
13. according to each described method in the claim 1 to 12, wherein, described step b is further comprising the steps of:
-with described search information and each question and answer of prestoring to mating, obtain a plurality of question and answer to candidate item;
-according to user related information, right from described a plurality of question and answer to choosing one or more question and answer the candidate item.
14. method according to claim 13, wherein, described user related information comprises following at least one:
-individual subscriber attribute;
The setting of-user preference;
-user search historical record.
15. according to each described method in the claim 1 to 14, wherein, described step b is further comprising the steps of:
-with described search information and each question and answer of prestoring to mating, obtain a plurality of question and answer to candidate item;
-basis is further mutual with the user's, and is right to obtaining one or more question and answer the candidate item from described a plurality of question and answer.
16. according to each described method in the claim 1 to 15, wherein, this method is further comprising the steps of:
-judge whether the right answer information of question and answer that obtains comprises answer and obtain information;
-when comprising answer, the right answer information of question and answer that obtains obtains information, then obtain information, by the corresponding answer of API Calls according to this answer.
17. according to each described method in the claim 1 to 16, wherein, this method is finished by the network equipment.
18. according to each described method in claim 1 to 8 and the claim 13 to 15, wherein, this method is finished by subscriber equipment.
19. one kind is used for presenting the equipment of searching for answer information at search interface, wherein, this equipment comprises:
First deriving means, be used to obtain search information from the user;
Coalignment, be used for described search information and each question and answer of prestoring mating, it is right to obtain one or more question and answer that all or part of content with described search information is complementary;
Generator, be used for according to described one or more question and answer rightly, provide the answer information corresponding with this search information to the user.
20. equipment according to claim 19, wherein, this equipment also comprises:
Page refreshment device, the page that is used for the user is imported described search information upgrade processing, it is updated to the renewal page that comprises described answer information.
21. according to claim 19 or 20 described equipment, wherein, described coalignment also is used for:
Described search information and described each question and answer that prestore are described the entity that comprised and substance feature mate, obtaining that described entity and described substance feature describe all can be right with one or more question and answer of all or part of content match of described search information.
22. according to each described equipment in the claim 19 to 21, wherein, described coalignment also comprises:
First judgment means, be used to the question and answer classification of judging that described search information comprises;
The first sub-coalignment, be used for described search information is mated in the question and answer information that its question and answer classification that comprises is comprised, it is right to obtain one or more question and answer that all or part of content with described search information is complementary.
23. according to each described equipment in the claim 19 to 21, wherein, described coalignment also comprises:
Second judgment means, be used for judging whether that can extract entity and substance feature from described search information describes;
The second sub-coalignment, be used for describing if can extract entity and substance feature from described search information, described entity that can extract and substance feature description are mated the entity and the substance feature description that are comprised with described each question and answer, and one or more question and answer that acquisition is complementary are right.
24. according to each described equipment in the claim 19 to 21, wherein, described coalignment also comprises:
The 3rd judgment means, be used for judging whether that can extract entity and substance feature from described search information describes, and whether can judge the classification that this search information comprises;
The 3rd sub-coalignment, be used for describing if can extract entity and substance feature from described search information, and can judge the classification that this search information comprises, the described entity that can extract, substance feature description and the described classification of judging and described each question and answer are mated the question and answer classification, entity and the substance feature description that are comprised, and one or more question and answer that acquisition is complementary are right.
25. according to each described equipment in the claim 19 to 24, wherein, described generator also comprises:
Integrating apparatus, be used for if the question and answer that need present, are then integrated this a plurality of question and answer to for a plurality of to the generation answer information.
26. according to each described equipment in the claim 19 to 25, wherein, described answer information can be presented in following at least one position:
-Search Results article one;
-search suggestion;
-input method candidate bar;
-search column candidate item hurdle;
Candidate item hurdle under the-WEB input field.
27. according to each described equipment in the claim 19 to 26, wherein, this equipment also comprises:
Second deriving means, be used to obtain question and answer content, judge whether described question and answer content is determinacy question and answer contents from webpage;
First generating apparatus, to be used for generating question and answer according to the content that is judged as the determinacy question and answer right.
28. equipment according to claim 27, wherein, described second deriving means also comprises:
The 4th judgment means, be used for to extract entity and substance feature and describe, judge whether described question and answer content is determinacy question and answer contents by the problem of judging described question and answer content;
Described first generating apparatus also comprises:
The first sub-generating apparatus, be used for describing according to described entity and substance feature, in conjunction with the answer of described question and answer content, it is right to generate question and answer.
29. according to each described equipment in the claim 19 to 26, wherein, this equipment also comprises:
The 3rd deriving means, be used to obtain the encyclopaedia data;
Second generating apparatus, be used for explaining that according to described encyclopaedia data corresponding entries and entry to generate question and answer right.
30. equipment according to claim 29, wherein, described second generating apparatus comprises:
The 5th judgment means, be used for explaining and judge the right classification of question and answer to be generated according to described entry and entry;
The 3rd sub-generating apparatus, be used for described entry, and explain that according to described entry generating substance feature describes as entity;
The 4th sub-generating apparatus, be used for describing and described entry is explained in conjunction with the right classification of described question and answer, described entity, described substance feature, it is right to generate question and answer.
31. according to each described equipment in the claim 19 to 30, wherein, described coalignment also comprises:
The 4th sub-coalignment, be used for described search information and each question and answer of prestoring obtaining a plurality of question and answer to candidate item to mating;
Selecting arrangement, be used for according to user related information, right from described a plurality of question and answer to choosing one or more question and answer the candidate item.
32. equipment according to claim 31, wherein, described user related information comprises following at least one:
-individual subscriber attribute;
The setting of-user preference;
-user search historical record.
33. according to each described equipment in the claim 19 to 32, wherein, described coalignment also comprises:
The 5th sub-coalignment, be used for described search information and each question and answer of prestoring obtaining a plurality of question and answer to candidate item to mating;
Interactive device, be used for further mutual according to the user, right from described a plurality of question and answer to obtaining one or more question and answer the candidate item.
34. according to each described method in the claim 19 to 33, wherein, this equipment also comprises:
Whether-Di six judgment means, the right answer information of question and answer that is used to judge acquisition comprise answer and obtain information;
-calling device, obtain information, then obtain information, by the corresponding answer of API Calls according to this answer when the right answer information of question and answer that obtains comprises answer.
35. according to each described equipment in the claim 19 to 34, wherein, this equipment is the network equipment.
36. according to each described equipment in claim 19 to 24 and the claim 31 to 34, wherein, this equipment is subscriber equipment.
CN201010271796.6A 2010-09-03 2010-09-03 For presenting the method and apparatus of search answer information in search interface CN101986293B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201010271796.6A CN101986293B (en) 2010-09-03 2010-09-03 For presenting the method and apparatus of search answer information in search interface

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201010271796.6A CN101986293B (en) 2010-09-03 2010-09-03 For presenting the method and apparatus of search answer information in search interface

Publications (2)

Publication Number Publication Date
CN101986293A true CN101986293A (en) 2011-03-16
CN101986293B CN101986293B (en) 2016-08-24

Family

ID=43710640

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201010271796.6A CN101986293B (en) 2010-09-03 2010-09-03 For presenting the method and apparatus of search answer information in search interface

Country Status (1)

Country Link
CN (1) CN101986293B (en)

Cited By (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102214209A (en) * 2011-04-27 2011-10-12 百度在线网络技术(北京)有限公司 Method and equipment for identifying homonymous information entities
CN103186643A (en) * 2011-12-30 2013-07-03 安凯(广州)微电子技术有限公司 Autonomous learning method for realizing association of teaching contents, terminal and system
CN103699590A (en) * 2013-12-09 2014-04-02 北京奇虎科技有限公司 Method and server for providing graphic tutorial problem solution
CN103760991A (en) * 2014-01-13 2014-04-30 北京搜狗科技发展有限公司 Physical input method and physical input device
CN103838554A (en) * 2012-11-21 2014-06-04 腾讯科技(北京)有限公司 Method and device for generating interactive activities
WO2014161292A1 (en) * 2013-08-14 2014-10-09 中兴通讯股份有限公司 Method, device and terminal for starting application program
CN104331441A (en) * 2014-10-24 2015-02-04 北京奇虎科技有限公司 Method and device for providing answers to questions based on search engine
CN104331440A (en) * 2014-10-24 2015-02-04 北京奇虎科技有限公司 Instant messaging method and client for providing query results based on search engine
CN104376046A (en) * 2014-10-24 2015-02-25 北京奇虎科技有限公司 Browsing method based on query result provided by search engine and browser client-side
WO2015058604A1 (en) * 2013-10-21 2015-04-30 北京奇虎科技有限公司 Apparatus and method for obtaining degree of association of question and answer pair and for search ranking optimization
CN105117398A (en) * 2015-06-25 2015-12-02 扬州大学 Software development problem automatic answering method based on crowdsourcing
CN105740362A (en) * 2016-01-26 2016-07-06 百度在线网络技术(北京)有限公司 Information display method and display apparatus
CN105786874A (en) * 2014-12-23 2016-07-20 北京奇虎科技有限公司 Method and device for constructing question-answer knowledge base data items based on encyclopedic entries
CN105786872A (en) * 2014-12-23 2016-07-20 北京奇虎科技有限公司 Method and device for providing question-answer onebox based on user searches
CN105786869A (en) * 2014-12-23 2016-07-20 北京奇虎科技有限公司 Search-based method and device for acquisition of special question-answer data
CN105786851A (en) * 2014-12-23 2016-07-20 北京奇虎科技有限公司 Question and answer knowledge base construction method as well as search provision method and apparatus
CN105786871A (en) * 2014-12-23 2016-07-20 北京奇虎科技有限公司 Question-answer search result display method and device based on search terms
CN106168962A (en) * 2016-06-30 2016-11-30 北京奇虎科技有限公司 Searching method and the device of accurate viewpoint are provided based on natural Search Results
WO2017016104A1 (en) * 2015-07-28 2017-02-02 百度在线网络技术(北京)有限公司 Question-answer information processing method and apparatus, storage medium, and device
CN106776797A (en) * 2016-11-22 2017-05-31 中国人名解放军理工大学 A kind of knowledge Q-A system and its method of work based on ontology inference
CN106919589A (en) * 2015-12-24 2017-07-04 北京奇虎科技有限公司 Customer problem analysis method and device
CN103853842B (en) * 2014-03-20 2017-07-18 百度在线网络技术(北京)有限公司 A kind of automatic question-answering method and system
CN107590252A (en) * 2017-09-19 2018-01-16 百度在线网络技术(北京)有限公司 Method and device for information exchange
CN108959559A (en) * 2018-06-29 2018-12-07 北京百度网讯科技有限公司 Question and answer are to generation method and device
CN109191940A (en) * 2018-08-31 2019-01-11 广东小天才科技有限公司 A kind of exchange method and smart machine based on smart machine
CN109635214A (en) * 2018-12-20 2019-04-16 广东小天才科技有限公司 A kind of method for pushing and electronic equipment of education resource
CN109710747A (en) * 2019-01-16 2019-05-03 北京猎户星空科技有限公司 Information processing method, device and electronic equipment
CN110246493A (en) * 2019-05-06 2019-09-17 百度在线网络技术(北京)有限公司 Address book contact lookup method, device and storage medium
CN109710747B (en) * 2019-01-16 2021-04-06 北京猎户星空科技有限公司 Information processing method and device and electronic equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1821991A (en) * 2005-02-18 2006-08-23 上海赢思软件技术有限公司 Knowledge question-and-answer quick processing system based on artificial intelligence
CN1928864A (en) * 2006-09-22 2007-03-14 浙江大学 FAQ based Chinese natural language ask and answer method
CN101118554A (en) * 2007-09-14 2008-02-06 中兴通讯股份有限公司 Intelligent interactive request-answering system and processing method thereof

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1821991A (en) * 2005-02-18 2006-08-23 上海赢思软件技术有限公司 Knowledge question-and-answer quick processing system based on artificial intelligence
CN1928864A (en) * 2006-09-22 2007-03-14 浙江大学 FAQ based Chinese natural language ask and answer method
CN101118554A (en) * 2007-09-14 2008-02-06 中兴通讯股份有限公司 Intelligent interactive request-answering system and processing method thereof

Cited By (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102214209A (en) * 2011-04-27 2011-10-12 百度在线网络技术(北京)有限公司 Method and equipment for identifying homonymous information entities
CN103186643A (en) * 2011-12-30 2013-07-03 安凯(广州)微电子技术有限公司 Autonomous learning method for realizing association of teaching contents, terminal and system
CN103838554B (en) * 2012-11-21 2017-12-12 腾讯科技(北京)有限公司 The generation method and device of a kind of interactive event
US10120546B2 (en) 2012-11-21 2018-11-06 Tencent Technology (Shenzhen) Company Limited Interactive activity generating method and apparatus and computer storage medium
CN103838554A (en) * 2012-11-21 2014-06-04 腾讯科技(北京)有限公司 Method and device for generating interactive activities
WO2014161292A1 (en) * 2013-08-14 2014-10-09 中兴通讯股份有限公司 Method, device and terminal for starting application program
WO2015058604A1 (en) * 2013-10-21 2015-04-30 北京奇虎科技有限公司 Apparatus and method for obtaining degree of association of question and answer pair and for search ranking optimization
CN103699590A (en) * 2013-12-09 2014-04-02 北京奇虎科技有限公司 Method and server for providing graphic tutorial problem solution
CN103760991B (en) * 2014-01-13 2017-02-15 北京搜狗科技发展有限公司 Physical input method and physical input device
CN103760991A (en) * 2014-01-13 2014-04-30 北京搜狗科技发展有限公司 Physical input method and physical input device
CN103853842B (en) * 2014-03-20 2017-07-18 百度在线网络技术(北京)有限公司 A kind of automatic question-answering method and system
CN104376046A (en) * 2014-10-24 2015-02-25 北京奇虎科技有限公司 Browsing method based on query result provided by search engine and browser client-side
CN104331440A (en) * 2014-10-24 2015-02-04 北京奇虎科技有限公司 Instant messaging method and client for providing query results based on search engine
CN104331441A (en) * 2014-10-24 2015-02-04 北京奇虎科技有限公司 Method and device for providing answers to questions based on search engine
CN105786869A (en) * 2014-12-23 2016-07-20 北京奇虎科技有限公司 Search-based method and device for acquisition of special question-answer data
CN105786851A (en) * 2014-12-23 2016-07-20 北京奇虎科技有限公司 Question and answer knowledge base construction method as well as search provision method and apparatus
CN105786871A (en) * 2014-12-23 2016-07-20 北京奇虎科技有限公司 Question-answer search result display method and device based on search terms
CN105786874A (en) * 2014-12-23 2016-07-20 北京奇虎科技有限公司 Method and device for constructing question-answer knowledge base data items based on encyclopedic entries
CN105786872A (en) * 2014-12-23 2016-07-20 北京奇虎科技有限公司 Method and device for providing question-answer onebox based on user searches
CN105786871B (en) * 2014-12-23 2019-03-19 北京奇虎科技有限公司 Question and answer class search result rendering method and device based on search term
CN105117398A (en) * 2015-06-25 2015-12-02 扬州大学 Software development problem automatic answering method based on crowdsourcing
CN105117398B (en) * 2015-06-25 2018-10-26 扬州大学 A kind of software development problem auto-answer method based on crowdsourcing
WO2017016104A1 (en) * 2015-07-28 2017-02-02 百度在线网络技术(北京)有限公司 Question-answer information processing method and apparatus, storage medium, and device
CN106919589A (en) * 2015-12-24 2017-07-04 北京奇虎科技有限公司 Customer problem analysis method and device
CN105740362A (en) * 2016-01-26 2016-07-06 百度在线网络技术(北京)有限公司 Information display method and display apparatus
CN106168962A (en) * 2016-06-30 2016-11-30 北京奇虎科技有限公司 Searching method and the device of accurate viewpoint are provided based on natural Search Results
CN106168962B (en) * 2016-06-30 2020-02-21 北京奇虎科技有限公司 Search method and device for providing accurate viewpoint based on natural search result
CN106776797A (en) * 2016-11-22 2017-05-31 中国人名解放军理工大学 A kind of knowledge Q-A system and its method of work based on ontology inference
CN107590252A (en) * 2017-09-19 2018-01-16 百度在线网络技术(北京)有限公司 Method and device for information exchange
CN108959559B (en) * 2018-06-29 2021-02-26 北京百度网讯科技有限公司 Question and answer pair generation method and device
CN108959559A (en) * 2018-06-29 2018-12-07 北京百度网讯科技有限公司 Question and answer are to generation method and device
CN109191940A (en) * 2018-08-31 2019-01-11 广东小天才科技有限公司 A kind of exchange method and smart machine based on smart machine
CN109635214A (en) * 2018-12-20 2019-04-16 广东小天才科技有限公司 A kind of method for pushing and electronic equipment of education resource
CN109710747A (en) * 2019-01-16 2019-05-03 北京猎户星空科技有限公司 Information processing method, device and electronic equipment
CN109710747B (en) * 2019-01-16 2021-04-06 北京猎户星空科技有限公司 Information processing method and device and electronic equipment
CN110246493A (en) * 2019-05-06 2019-09-17 百度在线网络技术(北京)有限公司 Address book contact lookup method, device and storage medium

Also Published As

Publication number Publication date
CN101986293B (en) 2016-08-24

Similar Documents

Publication Publication Date Title
US9703882B2 (en) Generating search results containing state links to applications
US20190311025A1 (en) Methods and systems for modeling complex taxonomies with natural language understanding
CN105068661B (en) Man-machine interaction method based on artificial intelligence and system
CN104412265B (en) Update for promoting the search of application searches to index
US9613166B2 (en) Search suggestions of related entities based on co-occurrence and/or fuzzy-score matching
AU2019203718A1 (en) Method of and system for inferring user intent in search input in a conversational interaction system
US7917514B2 (en) Visual and multi-dimensional search
CN104899273B (en) A kind of Web Personalization method based on topic and relative entropy
US8745051B2 (en) Resource locator suggestions from input character sequence
CN101267518B (en) Method and system for extracting relevant information from content metadata
US20130254199A1 (en) Providing knowledge content to users
CN102073725B (en) Method for searching structured data and search engine system for implementing same
CN102096717B (en) Search method and search engine
CN101124576B (en) Search system and methods with integration of user annotations from a trust network
US8001135B2 (en) Search support apparatus, computer program product, and search support system
CN103443786B (en) The machine learning method of the independent task of the parallel layout in identification web browser
CN102880649B (en) A kind of customized information disposal route and system
CN101416179B (en) System and method for providing regulated recommended word to every subscriber
US20150178273A1 (en) Unsupervised Relation Detection Model Training
JP6007088B2 (en) Question answering program, server and method using a large amount of comment text
CN1934569B (en) Search systems and methods with integration of user annotations
US10896212B2 (en) System and methods for automating trademark and service mark searches
US10878044B2 (en) System and method for providing content recommendation service
KR100816912B1 (en) System and method for searching documents
CN105760495B (en) A kind of knowledge based map carries out exploratory searching method for bug problem

Legal Events

Date Code Title Description
PB01 Publication
C06 Publication
SE01 Entry into force of request for substantive examination
C10 Entry into substantive examination
GR01 Patent grant
C14 Grant of patent or utility model