CN107305550A - A kind of intelligent answer method and device - Google Patents

A kind of intelligent answer method and device Download PDF

Info

Publication number
CN107305550A
CN107305550A CN201610244036.3A CN201610244036A CN107305550A CN 107305550 A CN107305550 A CN 107305550A CN 201610244036 A CN201610244036 A CN 201610244036A CN 107305550 A CN107305550 A CN 107305550A
Authority
CN
China
Prior art keywords
question
current
omission
current question
complete
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610244036.3A
Other languages
Chinese (zh)
Inventor
李春霞
鲍焱
冯军
郭斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ZTE Corp
Original Assignee
ZTE Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZTE Corp filed Critical ZTE Corp
Priority to CN201610244036.3A priority Critical patent/CN107305550A/en
Priority to PCT/CN2017/078844 priority patent/WO2017181834A1/en
Publication of CN107305550A publication Critical patent/CN107305550A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/3332Query translation
    • G06F16/3335Syntactic pre-processing, e.g. stopword elimination, stemming
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/211Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Databases & Information Systems (AREA)
  • Mathematical Physics (AREA)
  • Artificial Intelligence (AREA)
  • Human Computer Interaction (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a kind of intelligent answer method and device.This method includes:Current question sentence is obtained, the current question sentence includes the question sentence that user currently inputs;Whether according to the current question sentence, it is incomplete question sentence to determine the current question sentence, and the incomplete question sentence, which exists, omits word, and the omission word includes lacking the keyword for retrieving the current question sentence answer;If it is determined that the current question sentence is incomplete question sentence, then the current question sentence is subjected to integrality recovery operation, to obtain complete question sentence, the complete question sentence includes the question sentence for omitting word is not present;According to the complete question sentence, the answer of the complete question sentence is retrieved in database.The integrality recovery to incomplete question sentence is realized, so as to retrieve accurate answer in database.

Description

Intelligent question and answer method and device
Technical Field
The invention relates to computer technology, in particular to an intelligent question answering method and device.
Background
With the continuous development of computer technology, users can intelligently ask and answer with a computer, that is, in a ask-answer form, the computer can accurately position answers of questions asked by the users.
In the existing intelligent question-answering system, a user presents a question to a computer, the computer can correctly select an answer to the question presented by the user from a knowledge base, then the user presents a new question to the computer, and the computer continues to correctly select an answer to the new question presented by the user from the knowledge base.
However, the new question posed by the user may be an incomplete question, e.g., what is the 4008 service? The second problem is: how to do it? The computer will not be able to process the incomplete question, resulting in a failure to provide the user with an accurate answer.
Disclosure of Invention
In order to solve the technical problems, the invention provides an intelligent question answering method and device, which are used for solving the problem that a computer cannot provide accurate answers for users.
In order to achieve the purpose of the invention, the invention provides an intelligent question answering method, which comprises the following steps:
acquiring a current question, wherein the current question comprises a question currently input by a user;
determining whether the current question is an incomplete question or not according to the current question, wherein the incomplete question has omission words which comprise key words lacking answers for retrieving the current question;
if the current question is determined to be an incomplete question, performing integrity recovery operation on the current question to obtain an integral question, wherein the integral question comprises a question without omitted words;
and searching answers of the complete question in a database according to the complete question.
Further, before the performing an integrity recovery operation on the current question to obtain a complete question, the method further includes:
acquiring at least one omission judgment result feature vector of the omitted words in the current question, wherein the omission judgment result feature vector comprises a feature vector obtained according to any one of the following information or a combination thereof: an omission type, a pronoun type, an omission component position, an omission component grammar role.
Further, before the performing an integrity recovery operation on the current question to obtain a complete question, the method further includes:
acquiring at least one first keyword of a previous question, wherein the previous question comprises a question acquired before the current question and input by the same user identifier as the current question;
according to the first keyword, obtaining a omitting candidate word characteristic value vector, wherein the omitting candidate word characteristic value vector comprises a characteristic vector obtained according to any one or a combination of the following information: the entity type, the grammatical role of the entity in the previous question, and the interval distance, wherein the interval distance comprises the distance between the previous question where the first keyword is located and the current question.
Further, the performing integrity recovery operation on the current question to obtain a complete question further includes:
multiplying the omission candidate word feature value vector by the omission judgment result feature vector to obtain a multiplication result;
according to the multiplication result, determining a score of the multiplication result corresponding to a first list;
sorting all scores in a descending order, and determining keywords corresponding to the first N scores as candidate keywords;
supplementing the candidate keywords to the current question to obtain a recovery question;
carrying out syntactic analysis on the recovery question sentence to obtain a scoring result of the syntactic analysis;
and according to the scoring result, taking the recovery question with the highest scoring result as the complete question of the current question.
Further, before the obtaining of the at least one first keyword of the previous question sentence, the method further includes:
determining whether the previous question and the current question have relevance or not;
if the correlation exists, executing at least one first keyword of the question sentence before obtaining;
and if the correlation does not exist, searching the answer of the current question in the database.
The invention also provides an intelligent question answering device, which comprises:
the system comprises an acquisition module, a display module and a display module, wherein the acquisition module is used for acquiring a current question, and the current question comprises a question currently input by a user;
a determining module, configured to determine whether the current question is an incomplete question according to the current question, where the incomplete question includes an omitted word, and the omitted word includes a keyword lacking an answer for retrieving the current question;
the integrity recovery module is used for carrying out integrity recovery operation on the current question to obtain a complete question if the current question is determined to be an incomplete question, wherein the complete question comprises a question without omitted words;
and the retrieval module is used for retrieving answers of the complete question in a database according to the complete question.
Further, the obtaining module is specifically configured to obtain at least one omission judgment result feature vector of the omitted word in the current question, where the omission judgment result feature vector includes a feature vector obtained according to any one of the following information or a combination thereof: an omission type, a pronoun type, an omission component position, an omission component grammar role.
Further, the obtaining module is further configured to obtain at least one first keyword of a previous question, where the previous question includes a question obtained before the current question and input by the same user identifier as the current question; according to the first keyword, obtaining a omitting candidate word characteristic value vector, wherein the omitting candidate word characteristic value vector comprises a characteristic vector obtained according to any one or a combination of the following information: the entity type, the grammatical role of the entity in the previous question, and the interval distance, wherein the interval distance comprises the distance between the previous question where the first keyword is located and the current question.
Further, the integrity recovery module is further configured to multiply the omitted candidate word feature value vector with the omitted determination result feature vector to obtain a multiplication result; according to the multiplication result, determining a score of the multiplication result corresponding to a first list; sorting all scores in a descending order, and determining keywords corresponding to the first N scores as candidate keywords; supplementing the candidate keywords to the current question to obtain a recovery question; carrying out syntactic analysis on the recovery question sentence to obtain a scoring result of the syntactic analysis; and according to the scoring result, taking the recovery question with the highest scoring result as the complete question of the current question.
Further, the method also comprises the following steps: a processing module;
the processing module is used for determining whether the correlation exists between the previous question and the current question; if the correlation exists, executing at least one first keyword of the question sentence before obtaining; and if the correlation does not exist, searching the answer of the current question in the database.
Compared with the prior art, the invention comprises the following steps: acquiring a current question, wherein the current question comprises a question currently input by a user; determining whether the current question is an incomplete question or not according to the current question, wherein the incomplete question has omission words which comprise key words lacking answers for retrieving the current question; if the current question is determined to be an incomplete question, performing integrity recovery operation on the current question to obtain an integral question, wherein the integral question comprises a question without omitted words; and searching answers of the complete question in a database according to the complete question. The integrity recovery of the incomplete question is realized, so that an accurate answer can be retrieved in a database.
Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
Drawings
The accompanying drawings are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the example serve to explain the principles of the invention and not to limit the invention.
FIG. 1 is a schematic flow chart diagram of an embodiment of an intelligent question answering method according to the present invention;
FIG. 2 is a schematic structural diagram of an embodiment of an intelligent question answering device according to the present invention;
fig. 3 is a schematic structural diagram of a second embodiment of the intelligent question answering device according to the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, embodiments of the present invention will be described in detail below with reference to the accompanying drawings. It should be noted that the embodiments and features of the embodiments in the present application may be arbitrarily combined with each other without conflict.
The intelligent question answering method provided by the embodiment of the invention can be particularly applied to the intelligent answer provided by a computer. The intelligent question-answering method provided by the embodiment can be specifically executed through an intelligent question-answering device, the intelligent question-answering device can be integrated in a mobile terminal, a computer or be separately arranged, and the intelligent question-answering device can be realized in a software and/or hardware mode. The following describes the intelligent question answering method and apparatus provided in this embodiment in detail.
FIG. 1 is a schematic flow chart diagram of an embodiment of an intelligent question answering method according to the present invention; as shown in fig. 1, the executing subject in this embodiment may be an intelligent question answering device, and the intelligent question answering method provided by the present invention includes:
step 101, obtaining a current question.
In this embodiment, the current question includes a question currently input by the user.
And step 102, determining whether the current question is an incomplete question or not according to the current question.
In this embodiment, the incomplete question has an omission word, and the omission word includes a keyword lacking an answer for retrieving the current question.
Specifically, whether the current question has the omission phenomenon is judged, and if the current question has the omission phenomenon, the omission component is recovered according to the content of the current question and the entity recognition result. Omission herein means omission of key components or the use of pronouns for key components. In the omission phenomenon judgment work, linguistic data are labeled firstly, effective features are extracted, an omission judgment model is trained, whether the omission phenomenon exists in words in a sentence is judged by using the omission judgment model, and information such as the position of an omitted component and a grammatical role is judged under the condition that the omission phenomenon exists.
For example, the current question may be determined whether the current question is an incomplete question by omitting the decision model. Wherein, the omitting judgment model training mode comprises the following steps: the multiple question sentences are preprocessed, that is, the parts of speech, the sequence of speech, the omission characteristics, and the like of the words in the question sentences are labeled, for example, the omission characteristics may be the omission subject, the verb, and the like. Then, reading the preprocessed corpus, extracting characteristic values, and performing offline training of the model; the characteristic value comprises pronoun components, omission characteristics, semantic characteristics, entity word information and the like. Then, off-line training is performed to generate an omitted recovery model.
Step 103, if the current question is determined to be an incomplete question, performing integrity recovery operation on the current question to obtain an integral question.
In the present embodiment, the complete question includes a question in which the omitted word does not exist.
Specifically, after the omission component position is determined, it is the most important to determine the restoration content next.
And recovering the candidate words into the first few question sentences of the same user, and extracting one or more entity words through entity recognition. And selecting the best candidate word as the omitted recovery word through algorithm calculation and syntactic analysis.
And step 104, searching answers of the complete question in a database according to the complete question.
It should be noted that the database may be a conventional relational database, or alternatively, an indexed database, and is not limited herein.
In the embodiment, a current question is obtained, wherein the current question comprises a question currently input by a user; determining whether the current question is an incomplete question or not according to the current question, wherein the incomplete question has omission words which comprise key words lacking answers for retrieving the current question; if the current question is determined to be an incomplete question, performing integrity recovery operation on the current question to obtain an integral question, wherein the integral question comprises a question without omitted words; and searching answers of the complete question in a database according to the complete question. The integrity recovery of the incomplete question is realized, so that an accurate answer can be retrieved in a database.
On the basis of the above embodiment, the determining, according to the current question, whether the current question is an incomplete question further includes:
and if the current question is determined not to be an incomplete question, searching the answer of the current question in a database.
On the basis of the foregoing embodiment, before performing an integrity recovery operation on the current question to obtain a complete question, the method further includes:
acquiring at least one omission judgment result feature vector of the omitted words in the current question, wherein the omission judgment result feature vector comprises a feature vector obtained according to any one of the following information or a combination thereof: an omission type, a pronoun type, an omission component position, an omission component grammar role.
On the basis of the foregoing embodiment, before performing an integrity recovery operation on the current question to obtain a complete question, the method further includes:
acquiring at least one first keyword of a previous question, wherein the previous question comprises a question acquired before the current question and input by the same user identifier as the current question;
according to the first keyword, obtaining a omitting candidate word characteristic value vector, wherein the omitting candidate word characteristic value vector comprises a characteristic vector obtained according to any one or a combination of the following information: the entity type, the grammatical role of the entity in the previous question, and the interval distance, wherein the interval distance comprises the distance between the previous question where the first keyword is located and the current question.
Further, on the basis of the foregoing embodiment, the performing integrity recovery operation on the current question to obtain a complete question further includes:
multiplying the omission candidate word feature value vector by the omission judgment result feature vector to obtain a multiplication result;
according to the multiplication result, determining a score of the multiplication result corresponding to a first list;
sorting all scores in a descending order, and determining keywords corresponding to the first N scores as candidate keywords;
supplementing the candidate keywords to the current question to obtain a recovery question;
carrying out syntactic analysis on the recovery question sentence to obtain a scoring result of the syntactic analysis;
and according to the scoring result, taking the recovery question with the highest scoring result as the complete question of the current question.
It should be noted that, before acquiring at least one first keyword of a preceding question, the method further includes:
determining whether the previous question and the current question have relevance or not;
if the correlation exists, executing at least one first keyword of the question sentence before obtaining;
and if the correlation does not exist, searching the answer of the current question in the database.
For example, the User: what are the 4G set of meals enjoyed? Intelligent customer service IQA: the enjoy 4G package is a certain package in China telecom. Next, the User: how to do it? Before answering, the intelligent customer service IQA can do the following operations:
step 1, preparing and marking domain linguistic data, and training a model in an off-line mode. Respectively training a context recognition model, an entity recognition model and an omission judgment model;
step 2, judging whether the user has cache, if so, continuing to execute context recognition in the step 3, otherwise, executing search in the step 7; the present example has a cache;
step 3, calling a context recognition model to judge whether the two contexts are the same, if so, continuing to execute step 4 to omit judgment, otherwise, executing step 7 to search; this example is the same context;
and 4, calling an omission judgment model to judge whether omission exists or not, if so, determining the omission position and the omission components, obtaining the omission judgment result feature vector A, and continuing to execute the entity identification in the step 5. Otherwise, executing step 7 search; in this example, there is an omission, and there is a missing subject, the omission is located before "what" and
step 5, calling an entity recognition model to recognize an entity which can be used for omitting recovery; continuing to execute step 6 to omit recovery; the entity recognition model of the embodiment recognizes the previous question entity 'Lexiang 4G package'
Step 6, extracting a group of vectors of feature values of omitted candidate words for each entity, wherein the vectors comprise entity categories, grammatical roles of the entities in original sentences, interval distances and the like, the interval distances refer to distances between the entities and current question sentences, and the question sentences are used as distance units; and calculating the score of the candidate entity word by omitting the candidate word feature value vector, and selecting the recovery word according to the score. The omission judgment result feature vector A obtained in the step 4 is a weight coefficient of the candidate word feature vector; the entity identified in this example is the subject in the previous question, and is spaced from the current question by a distance of 1, and is calculated as the best recovery word; after recovery, the current question is: how do a 4G set of fun?
Step 7, searching the corpus, performing result post-processing, screening query results (if a plurality of query results are hit, selecting the best answer by using similarity calculation, wherein the part is not in the scope of the invention) and caching the query conditions of the user;
and 8, returning the final result to the user.
For another example, the embodiment is applicable to a life service APP. User: how do the weather in Beijing? Intelligent customer service IQA: weather conditions in Beijing: cloudy turns fine. User: nanjing woolen? Before answering, the intelligent customer service IQA can do the following operations:
step 1, preparing and marking domain linguistic data, and training a model in an off-line mode. Respectively training a context recognition model, an entity recognition model and an omission judgment model;
step 2, judging whether the user has cache, if so, continuing to execute context recognition in the step 3, otherwise, executing search in the step 7; the present example has a cache;
step 3, calling a context recognition model to judge whether the two contexts are the same, if so, continuing to execute step 4 to omit judgment, otherwise, executing step 7 to search; this example is the same context;
and 4, calling an omission judgment model to judge whether omission exists or not, if so, determining the omission position and the omission components, obtaining the omission judgment result feature vector A, and continuing to execute the entity identification in the step 5. Otherwise, executing step 7 search; the example has omission, and has regional word subject "Nanjing", no object, and omission position behind the subject
Step 5, calling an entity recognition model to recognize an entity which can be used for omitting recovery; continuing to execute step 6 to omit recovery; the entity recognition model of the embodiment recognizes that the entities of the previous question have Nanjing and weather "
Step 6, extracting a group of vectors of feature values of omitted candidate words for each entity, wherein the vectors comprise entity types, grammatical roles of the entities in original sentences, interval distances (the distance between the entities and the current question is taken as a distance unit), and the like; and calculating the score of the candidate entity word by omitting the candidate word feature value vector, and selecting the recovery word according to the score. The omission judgment result feature vector A obtained in the step 4 is a weight coefficient of the candidate word feature vector; the entity identified in the example is the subject and the object in the previous question, the interval distance between the entity and the current question is 1, the current question lacks the object, and the weather is calculated as the best recovery word; after recovery, the current question is: nanjing weather woollen?
Step 7, searching the corpus, performing result post-processing, screening query results (if a plurality of query results are hit, selecting the best answer by using similarity calculation, wherein the part is not in the scope of the invention) and caching the query conditions of the user;
and 8, returning the final result to the user.
FIG. 2 is a schematic structural diagram of an embodiment of an intelligent question answering device according to the present invention; as shown in fig. 2, the intelligent question answering device provided by the present invention includes: an acquisition module 21, a determination module 22, an integrity restoration module 23 and a retrieval module 24. Wherein,
an obtaining module 21, configured to obtain a current question, where the current question includes a question currently input by a user;
a determining module 22, configured to determine, according to the current question, whether the current question is an incomplete question, where an omitted word exists in the incomplete question, and the omitted word includes a keyword lacking an answer for retrieving the current question;
the integrity recovery module 23 is configured to, if it is determined that the current question is an incomplete question, perform integrity recovery operation on the current question to obtain an complete question, where the complete question includes a question without an omitted word;
and the retrieval module 24 is used for retrieving answers of the complete question from a database according to the complete question.
In the embodiment, a current question is obtained, wherein the current question comprises a question currently input by a user; determining whether the current question is an incomplete question or not according to the current question, wherein the incomplete question has omission words which comprise key words lacking answers for retrieving the current question; if the current question is determined to be an incomplete question, performing integrity recovery operation on the current question to obtain an integral question, wherein the integral question comprises a question without omitted words; and searching answers of the complete question in a database according to the complete question. The integrity recovery of the incomplete question is realized, so that an accurate answer can be retrieved in a database.
Further, on the basis of the foregoing embodiment, the obtaining module 21 is specifically configured to obtain at least one omission judgment result feature vector of the omitted word in the current question, where the omission judgment result feature vector includes a feature vector obtained according to any one of the following information or a combination thereof: an omission type, a pronoun type, an omission component position, an omission component grammar role.
Further, on the basis of the foregoing embodiment, the obtaining module 21 is further configured to obtain at least one first keyword of a previous question, where the previous question includes a question that is obtained before the current question and is input by the same user identifier as the current question; according to the first keyword, obtaining a omitting candidate word characteristic value vector, wherein the omitting candidate word characteristic value vector comprises a characteristic vector obtained according to any one or a combination of the following information: the entity type, the grammatical role of the entity in the previous question, and the interval distance, wherein the interval distance comprises the distance between the previous question where the first keyword is located and the current question.
Further, on the basis of the foregoing embodiment, the integrity recovering module 23 is further configured to multiply the omitted candidate word feature value vector by the omitted determination result feature vector to obtain a multiplication result; according to the multiplication result, determining a score of the multiplication result corresponding to a first list; sorting all scores in a descending order, and determining keywords corresponding to the first N scores as candidate keywords; supplementing the candidate keywords to the current question to obtain a recovery question; carrying out syntactic analysis on the recovery question sentence to obtain a scoring result of the syntactic analysis; and according to the scoring result, taking the recovery question with the highest scoring result as the complete question of the current question.
FIG. 3 is a schematic structural diagram of a second embodiment of the intelligent question answering device according to the present invention; as shown in fig. 3, on the basis of the above embodiment, the intelligent question answering device provided by the present invention may further include: a processing module 25;
the processing module 25 is configured to determine whether there is a correlation between the previous question and the current question; if the correlation exists, executing at least one first keyword of the question sentence before obtaining; and if the correlation does not exist, searching the answer of the current question in the database.
In the embodiment, the completeness recovery of the incomplete question is realized, and the entity composed of multiple words is obtained, so that an accurate answer can be retrieved from the database, and the accuracy of the answer is improved. Although the embodiments of the present invention have been described above, the above description is only for the convenience of understanding the present invention, and is not intended to limit the present invention. It will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.

Claims (10)

1. An intelligent question answering method is characterized by comprising the following steps:
acquiring a current question, wherein the current question comprises a question currently input by a user;
determining whether the current question is an incomplete question or not according to the current question, wherein the incomplete question has omission words which comprise key words lacking answers for retrieving the current question;
if the current question is determined to be an incomplete question, performing integrity recovery operation on the current question to obtain an integral question, wherein the integral question comprises a question without omitted words;
and searching answers of the complete question in a database according to the complete question.
2. The method of claim 1, wherein before performing the integrity recovery operation on the current question to obtain the complete question, further comprising:
acquiring at least one omission judgment result feature vector of the omitted words in the current question, wherein the omission judgment result feature vector comprises a feature vector obtained according to any one of the following information or a combination thereof: an omission type, a pronoun type, an omission component position, an omission component grammar role.
3. The method according to claim 1 or 2, wherein before performing the integrity recovery operation on the current question to obtain a complete question, the method further comprises:
acquiring at least one first keyword of a previous question, wherein the previous question comprises a question acquired before the current question and input by the same user identifier as the current question;
according to the first keyword, obtaining a omitting candidate word characteristic value vector, wherein the omitting candidate word characteristic value vector comprises a characteristic vector obtained according to any one or a combination of the following information: the entity type, the grammatical role of the entity in the previous question, and the interval distance, wherein the interval distance comprises the distance between the previous question where the first keyword is located and the current question.
4. The method according to any one of claims 1 to 3, wherein the performing an integrity recovery operation on the current question to obtain a complete question further comprises:
multiplying the omission candidate word feature value vector by the omission judgment result feature vector to obtain a multiplication result;
according to the multiplication result, determining a score of the multiplication result corresponding to a first list;
sorting all scores in a descending order, and determining keywords corresponding to the first N scores as candidate keywords;
supplementing the candidate keywords to the current question to obtain a recovery question;
carrying out syntactic analysis on the recovery question sentence to obtain a scoring result of the syntactic analysis;
and according to the scoring result, taking the recovery question with the highest scoring result as the complete question of the current question.
5. The method according to claims 1-4, wherein said obtaining at least one first keyword of a previous question further comprises:
determining whether the previous question and the current question have relevance or not;
if the correlation exists, executing at least one first keyword of the question sentence before obtaining;
and if the correlation does not exist, searching the answer of the current question in the database.
6. An intelligent question answering device, comprising:
the system comprises an acquisition module, a display module and a display module, wherein the acquisition module is used for acquiring a current question, and the current question comprises a question currently input by a user;
a determining module, configured to determine whether the current question is an incomplete question according to the current question, where the incomplete question includes an omitted word, and the omitted word includes a keyword lacking an answer for retrieving the current question;
the integrity recovery module is used for carrying out integrity recovery operation on the current question to obtain a complete question if the current question is determined to be an incomplete question, wherein the complete question comprises a question without omitted words;
and the retrieval module is used for retrieving answers of the complete question in a database according to the complete question.
7. The apparatus according to claim 6, wherein the obtaining module is specifically configured to obtain at least one omission decision result feature vector of the omitted words in the current question sentence, where the omission decision result feature vector includes a feature vector obtained according to any one or a combination of the following information: an omission type, a pronoun type, an omission component position, an omission component grammar role.
8. The apparatus according to claim 6 or 7, wherein the obtaining module is further configured to obtain at least one first keyword of a previous question, where the previous question includes a question obtained before the current question and input by the same user identifier as the current question; according to the first keyword, obtaining a omitting candidate word characteristic value vector, wherein the omitting candidate word characteristic value vector comprises a characteristic vector obtained according to any one or a combination of the following information: the entity type, the grammatical role of the entity in the previous question, and the interval distance, wherein the interval distance comprises the distance between the previous question where the first keyword is located and the current question.
9. The apparatus according to any one of claims 6 to 8, wherein the integrity recovery module is further configured to multiply the omission candidate word feature value vector by the omission decision result feature vector to obtain a multiplication result; according to the multiplication result, determining a score of the multiplication result corresponding to a first list; sorting all scores in a descending order, and determining keywords corresponding to the first N scores as candidate keywords; supplementing the candidate keywords to the current question to obtain a recovery question; carrying out syntactic analysis on the recovery question sentence to obtain a scoring result of the syntactic analysis; and according to the scoring result, taking the recovery question with the highest scoring result as the complete question of the current question.
10. The apparatus of claims 6-9, further comprising: a processing module;
the processing module is used for determining whether the correlation exists between the previous question and the current question; if the correlation exists, executing at least one first keyword of the question sentence before obtaining; and if the correlation does not exist, searching the answer of the current question in the database.
CN201610244036.3A 2016-04-19 2016-04-19 A kind of intelligent answer method and device Pending CN107305550A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201610244036.3A CN107305550A (en) 2016-04-19 2016-04-19 A kind of intelligent answer method and device
PCT/CN2017/078844 WO2017181834A1 (en) 2016-04-19 2017-03-30 Intelligent question and answer method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610244036.3A CN107305550A (en) 2016-04-19 2016-04-19 A kind of intelligent answer method and device

Publications (1)

Publication Number Publication Date
CN107305550A true CN107305550A (en) 2017-10-31

Family

ID=60115675

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610244036.3A Pending CN107305550A (en) 2016-04-19 2016-04-19 A kind of intelligent answer method and device

Country Status (2)

Country Link
CN (1) CN107305550A (en)
WO (1) WO2017181834A1 (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108170792A (en) * 2017-12-27 2018-06-15 北京百度网讯科技有限公司 Question and answer bootstrap technique, device and computer equipment based on artificial intelligence
CN108763356A (en) * 2018-05-16 2018-11-06 深圳市三宝创新智能有限公司 A kind of intelligent robot chat system and method based on the search of similar sentence
CN109284503A (en) * 2018-10-22 2019-01-29 传神语联网网络科技股份有限公司 Translate Statement Completion judgment method and system
CN109344238A (en) * 2018-09-18 2019-02-15 阿里巴巴集团控股有限公司 The benefit word method and apparatus of user's question sentence
CN110334268A (en) * 2019-07-05 2019-10-15 北京国创动力文化传媒有限公司 A kind of block chain project hot word generation method and device
CN111507088A (en) * 2020-04-15 2020-08-07 深圳前海微众银行股份有限公司 Statement completion method, device and readable storage medium
CN111984766A (en) * 2019-05-21 2020-11-24 华为技术有限公司 Missing semantic completion method and device
WO2021012772A1 (en) * 2019-07-22 2021-01-28 中兴通讯股份有限公司 Speech information processing method and device, storage medium, and electronic device
CN112328762A (en) * 2020-11-04 2021-02-05 平安科技(深圳)有限公司 Question and answer corpus generation method and device based on text generation model

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109933779A (en) * 2017-12-18 2019-06-25 苏宁云商集团股份有限公司 User's intension recognizing method and system
US11423068B2 (en) * 2018-11-27 2022-08-23 Google Llc Canonicalizing search queries to natural language questions
CN110377803B (en) * 2019-06-14 2023-04-07 创新先进技术有限公司 Information processing method and device
CN112988987B (en) * 2019-12-16 2024-06-28 科沃斯商用机器人有限公司 Man-machine interaction method and device, intelligent robot and storage medium
CN111563147B (en) * 2020-04-03 2023-09-22 北京明略软件系统有限公司 Entity linking method and device in knowledge question-answering system
CN111813916B (en) * 2020-07-21 2024-03-05 华润数字科技有限公司 Intelligent question-answering method, device, computer equipment and medium
CN111984763B (en) * 2020-08-28 2023-09-19 海信电子科技(武汉)有限公司 Question answering processing method and intelligent device
CN112380836A (en) * 2020-11-12 2021-02-19 华侨大学 Intelligent Chinese message question generating method
CN114064874A (en) * 2021-11-19 2022-02-18 浙江百应科技有限公司 Knowledge base problem adding method and device based on vector search engine

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9042540B2 (en) * 2012-10-30 2015-05-26 Teletech Holdings, Inc. Method for providing support using answer engine and dialog rules
CN103995870A (en) * 2014-05-21 2014-08-20 百度在线网络技术(北京)有限公司 Interactive searching method and device
CN105373527B (en) * 2014-08-27 2020-11-27 中兴通讯股份有限公司 An omission recovery method and question answering system
CN106294340A (en) * 2015-05-12 2017-01-04 中兴通讯股份有限公司 A kind of intelligent customer service processing method and device thereof
CN105159996B (en) * 2015-09-07 2018-09-07 百度在线网络技术(北京)有限公司 Depth question and answer service providing method based on artificial intelligence and device
CN105589844B (en) * 2015-12-18 2017-08-08 北京中科汇联科技股份有限公司 It is a kind of to be used to take turns the method for lacking semantic supplement in question answering system more

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108170792B (en) * 2017-12-27 2021-12-28 北京百度网讯科技有限公司 Question and answer guiding method and device based on artificial intelligence and computer equipment
CN108170792A (en) * 2017-12-27 2018-06-15 北京百度网讯科技有限公司 Question and answer bootstrap technique, device and computer equipment based on artificial intelligence
CN108763356A (en) * 2018-05-16 2018-11-06 深圳市三宝创新智能有限公司 A kind of intelligent robot chat system and method based on the search of similar sentence
CN109344238A (en) * 2018-09-18 2019-02-15 阿里巴巴集团控股有限公司 The benefit word method and apparatus of user's question sentence
CN109284503A (en) * 2018-10-22 2019-01-29 传神语联网网络科技股份有限公司 Translate Statement Completion judgment method and system
CN109284503B (en) * 2018-10-22 2023-08-18 传神语联网网络科技股份有限公司 Translation statement ending judgment method and system
US12135941B2 (en) 2019-05-21 2024-11-05 Huawei Technologies Co., Ltd. Missing semantics complementing method and apparatus
CN111984766A (en) * 2019-05-21 2020-11-24 华为技术有限公司 Missing semantic completion method and device
WO2020233380A1 (en) * 2019-05-21 2020-11-26 华为技术有限公司 Missing semantic completion method and apparatus
CN111984766B (en) * 2019-05-21 2023-02-24 华为技术有限公司 Missing semantic completion method and device
CN110334268A (en) * 2019-07-05 2019-10-15 北京国创动力文化传媒有限公司 A kind of block chain project hot word generation method and device
CN110334268B (en) * 2019-07-05 2022-01-14 李晨 Block chain project hot word generation method and device
WO2021012772A1 (en) * 2019-07-22 2021-01-28 中兴通讯股份有限公司 Speech information processing method and device, storage medium, and electronic device
CN111507088A (en) * 2020-04-15 2020-08-07 深圳前海微众银行股份有限公司 Statement completion method, device and readable storage medium
CN112328762A (en) * 2020-11-04 2021-02-05 平安科技(深圳)有限公司 Question and answer corpus generation method and device based on text generation model
CN112328762B (en) * 2020-11-04 2023-12-19 平安科技(深圳)有限公司 Question-answer corpus generation method and device based on text generation model

Also Published As

Publication number Publication date
WO2017181834A1 (en) 2017-10-26

Similar Documents

Publication Publication Date Title
CN107305550A (en) A kind of intelligent answer method and device
CN110196901B (en) Method and device for constructing dialog system, computer equipment and storage medium
CN107729468B (en) Answer extraction method and system based on deep learning
CN107480143B (en) Method and system for dialogue topic segmentation based on contextual relevance
CN108304372B (en) Entity extraction method and device, computer equipment and storage medium
US10831769B2 (en) Search method and device for asking type query based on deep question and answer
EP2833271A1 (en) Multimedia question and answer system and method
CN105930452A (en) Smart answering method capable of identifying natural language
CN104050256A (en) Initiative study-based questioning and answering method and questioning and answering system adopting initiative study-based questioning and answering method
CN106844571B (en) Method and device for identifying synonyms and computing equipment
CN109492081B (en) Text information searching and information interaction method, device, equipment and storage medium
CN109710732B (en) Information query method, device, storage medium and electronic equipment
CN107679070B (en) Intelligent reading recommendation method and device and electronic equipment
CN105956053A (en) Network information-based search method and apparatus
CN109522397B (en) Information processing method and device
CN105354199A (en) Scene information based entity meaning identification method and system
CN108345612A (en) A kind of question processing method and device, a kind of device for issue handling
CN112612875A (en) Method, device and equipment for automatically expanding query words and storage medium
CN113326702A (en) Semantic recognition method and device, electronic equipment and storage medium
CN109492085B (en) Answer determination method, device, terminal and storage medium based on data processing
CN113886544A (en) Text matching method, device, storage medium and computer equipment
CN118115781A (en) Label identification method, system, equipment and storage medium based on multi-mode model
CN107562774A (en) Generation method, system and the answering method and system of rare foreign languages word incorporation model
CN113177061A (en) Searching method and device and electronic equipment
CN109684357B (en) Information processing method and device, storage medium and terminal

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20171031