CN111177331B - Dialog intention recognition method and device - Google Patents

Dialog intention recognition method and device Download PDF

Info

Publication number
CN111177331B
CN111177331B CN201911168331.5A CN201911168331A CN111177331B CN 111177331 B CN111177331 B CN 111177331B CN 201911168331 A CN201911168331 A CN 201911168331A CN 111177331 B CN111177331 B CN 111177331B
Authority
CN
China
Prior art keywords
intention
historical
probability
user
input text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201911168331.5A
Other languages
Chinese (zh)
Other versions
CN111177331A (en
Inventor
曾祥荣
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Unisound Intelligent Technology Co Ltd
Original Assignee
Unisound Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Unisound Intelligent Technology Co Ltd filed Critical Unisound Intelligent Technology Co Ltd
Priority to CN201911168331.5A priority Critical patent/CN111177331B/en
Publication of CN111177331A publication Critical patent/CN111177331A/en
Application granted granted Critical
Publication of CN111177331B publication Critical patent/CN111177331B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation or dialogue systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3346Query execution using probabilistic model
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Artificial Intelligence (AREA)
  • Probability & Statistics with Applications (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a method and a device for recognizing a conversation intention, which comprise the following steps: counting the probability of the intention of each historical user in the historical dialogue data to serve as a priori knowledge sequence; predicting that the currently input text belongs to the user intention a according to a preset classification model i The probability of (a) q1; calculating the current input text belonging to the user intention a by using a priori knowledge sequence i The probability of (q 2); and selecting the intention with the highest probability as the current intention of the currently input text according to the probability q1 and the probability q 2. Aiming at the existing conversation intention recognition method, the prior knowledge sequence is inserted into the current input text to predict the current intention of the user on the basis of predicting the probability that the currently input text belongs to the intention of a certain user in the training model, so that the problem that the intention of the user cannot be accurately judged only by using the text information input by the user at the current moment in the prior art is effectively solved, and the experience of the user is enhanced.

Description

Dialog intention recognition method and device
Technical Field
The invention relates to the technical field of artificial intelligence, in particular to a conversation intention identification method and device
Background
With the development of search engine technology, modern search engines, question-answering systems and dialogue robots need not to simply retrieve relevant information, but can deeply understand the information requirements of users so as to accurately provide the users with required services, and correctly recognizing the intentions of the users is a key step for achieving the aim.
The prior art dialog intention recognition methods mainly include methods based on keywords, template matching and machine learning models. However, in these methods, the intention of the user is determined only by using the text information input by the user at the current time in the conversation, and the current conversation intention of the user cannot be accurately determined.
Disclosure of Invention
In response to the displayed question, the method identifies the current dialog intent of the user based on predicting the user intent in the current input text using the probability of the user intent in the historical dialog data.
A dialog intention recognition method, comprising the steps of:
counting the probability of the intention of each historical user in the historical dialogue data to serve as a priori knowledge sequence;
predicting that the currently input text belongs to the user intention a according to a preset classification model i The probability q1 of (a);
calculating the user intention a of the currently input text by using the prior knowledge sequence i The probability of (q 2);
and selecting the intention with the maximum probability as the current intention of the currently input text according to the probability q1 and the probability q 2.
Preferably, the statistical method includes, as the prior knowledge sequence, counting the probability of occurrence of each historical user intention in the historical dialogue data, including:
collecting historical conversation data by using a conversation log, and acquiring keywords in the historical conversation data;
determining each historical user intention according to the keywords;
labeling the dialogue sentences containing the intentions of all historical users in the historical dialogue data;
the user intention a in the history dialogue data is calculated by using the following formula i Probability of (c):
Figure BDA0002288053280000021
wherein, a i I-th user intention, m, for each historical user intention i For the user's intention a i Number of occurrences in historical dialogue data, p (a) i ) For the user's intention a i Probability of occurrence in historical dialogue data.
Preferably, the statistics of the probability of the occurrence of each historical user intention in the historical dialogue data as the prior knowledge sequence further includes:
the occurrence of the user intention a under the first specific condition in the history dialogue data is calculated according to the following formula i Probability of (c):
Figure BDA0002288053280000022
wherein,a j For the jth user intention, m, of the historical user intentions ij Is a i At a j Number of subsequent occurrences in the historical dialogue data, m j Is a j Number of occurrences in historical session data;
the occurrence of the user intention a under the second specific condition in the history dialogue data is calculated according to the following formula i Probability of (c):
Figure BDA0002288053280000023
wherein, a k For the kth user intention, m, among the historical user intentions ijk Is a i At a j And a k Number of subsequent occurrences in the historical dialogue data, m jk Is a j And a k As well as the number of occurrences in the historical dialog data.
Preferably, the user intention a in the currently input text is calculated by using the prior knowledge sequence i Includes:
predicting that the currently input text belongs to the user intention a according to the prior knowledge sequence i The probability of (q 2);
if the sequence length is larger than or equal to a first preset threshold value, q2= p (a) i |b 1 ,b 2 );
If the sequence length is equal to the second predetermined threshold, q2= p (a) i |b 1 );
If the sequence length is equal to a third predetermined threshold, q2= p (a) i );
Wherein b1 and b2 are historical intentions to which the currently input text belongs.
Preferably, according to the probability q1 and the probability q2, selecting the intention with the highest probability as the current intention of the currently input text comprises:
calculating the user intention a of the currently input text by using the following formula i Total probability of (q):
q=q1*q2;
and outputting the user intention with the maximum total probability q as the current intention of the currently input text.
A dialog intention recognition apparatus, the apparatus comprising:
the statistical module is used for counting the probability of the intention of each historical user in the historical dialogue data to serve as a priori knowledge sequence;
a prediction module for predicting the current input text belonging to the user intention a according to a preset classification model i The probability q1 of (a);
a calculation module for calculating the user intention a of the currently input text by using the prior knowledge sequence i The probability of (q 2);
and the selection module is used for selecting the intention with the maximum probability as the current intention of the currently input text according to the probability q1 and the probability q 2.
Preferably, the statistical module includes:
the acquisition submodule is used for collecting historical dialogue data by using the dialogue logs and acquiring keywords in the historical dialogue data;
the determining submodule is used for determining each historical user intention according to the keywords;
the marking submodule is used for marking the dialogue sentences containing the intentions of all historical users in the historical dialogue data;
a first calculation submodule for calculating a user intention a in the history dialogue data by using the following formula i Probability of (c):
Figure BDA0002288053280000041
wherein, a i I-th user intention, m, for each historical user intention i For the user's intention a i Number of occurrences in historical dialogue data, p (a) i ) For the user's intention a i Probability of occurrence in the historical dialogue data.
Preferably, the statistical module further includes:
a second calculation submodule for calculating the first specific condition of the historical dialogue data according to the following formulaThe user intention a appears i Probability of (c):
Figure BDA0002288053280000042
wherein, a j For the jth user intention, m, of the historical user intentions ij Is a i At a j Number of subsequent occurrences in the historical dialogue data, m j Is a j Number of occurrences in historical dialogue data;
a third calculation submodule for calculating the user intention a appearing in the historical dialogue data under a second specific condition according to the following formula i Probability of (c):
Figure BDA0002288053280000043
wherein, a k For the kth user intention, m, among the historical user intents ijk Is a i At a j And a k Number of later occurrences in the historical dialogue data, m jk Is a j And a k As well as the number of occurrences in the historical dialog data.
Preferably, the calculation module includes:
a prediction submodule for predicting that the currently input text belongs to the user intention a according to the prior knowledge sequence i The probability of (q 2);
a first output submodule for outputting q2= p (a) if the sequence length is greater than or equal to a first preset threshold value i |b 1 ,b 2 ) If the sequence length is equal to a second predetermined threshold, the output q2= p (a) i |b 1 ) If the sequence length is equal to a third predetermined threshold, the output q2= p (a) i );
Wherein b1 and b2 are historical intentions to which the currently input text belongs.
Preferably, the selection module includes:
a fourth calculation submodule for calculating the attribute of the currently inputted text by using the following formulaIntention of the user a i Total probability of (q):
q=q1*q2;
and the second output submodule is used for outputting the user intention with the maximum total probability q as the current intention of the currently input text.
Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
The technical solution of the present invention is further described in detail by the accompanying drawings and embodiments.
Drawings
FIG. 1 is a flowchart illustrating a dialog intention recognition method according to the present invention;
FIG. 2 is another flowchart illustrating a dialog intention recognition method according to the present invention;
FIG. 3 is a block diagram of a dialog intention recognition device according to the present invention;
fig. 4 is another structural diagram of a dialog intention recognition device according to the present invention.
Detailed Description
Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The implementations described in the exemplary embodiments below are not intended to represent all implementations consistent with the present disclosure. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the present disclosure, as detailed in the appended claims.
With the development of search engine technology, modern search engines, question-answering systems and dialogue robots need not to simply retrieve relevant information, but can deeply understand the information requirements of users so as to accurately provide the users with required services, and correctly recognizing the intentions of the users is a key step for achieving the aim.
The prior art dialog intention recognition methods mainly include methods based on keywords, template matching and machine learning models. However, in these methods, the intention of the user is determined only by using the text information input by the user at the current time in the conversation, and the current conversation intention of the user cannot be accurately determined. In order to solve the above-described problems, the present embodiment provides a method of identifying a current dialog intention of a user based on predicting the user intention in a currently input text using a probability of the user intention in historical dialog data.
A dialog intention recognition method, as shown in fig. 1, comprising the steps of:
s101, counting the probability of the intention of each historical user in historical dialogue data to serve as a priori knowledge sequence;
step S102, predicting that the currently input text belongs to the user intention a according to a preset classification model i The probability of (a) q1;
step S103, calculating the user intention a of the currently input text by using the prior knowledge sequence i The probability of (q 2);
and step S104, selecting the intention with the maximum probability as the current intention of the currently input text according to the probability q1 and the probability q 2.
The working principle of the technical scheme is as follows: predicting that the currently input text belongs to the user intention a according to a training model by counting the probability of the occurrence of each historical intention in the historical dialogue data of the user and taking the probability as a priori knowledge sequence i The probability of (a) q1; then, the prior knowledge sequence is utilized to calculate that the currently input text belongs to the user intention a i The probability of (q 2); and selecting the intention with the highest probability as the current intention of the currently input text according to the probability q1 and the probability q 2.
The beneficial effects of the above technical scheme are: aiming at the existing conversation intention recognition method, the prior knowledge sequence is inserted into the current input text to predict the current intention of the user on the basis of predicting the probability that the currently input text belongs to the intention of a certain user in the training model, so that the problem that the intention of the user cannot be accurately judged only by using the text information input by the user at the current moment in the prior art is effectively solved, and the experience of the user is enhanced.
In one embodiment, as shown in fig. 2, counting the probability of occurrence of each historical user intention in the historical dialogue data as a priori knowledge sequence includes:
step S201, collecting historical dialogue data by using a dialogue log, and acquiring keywords in the historical dialogue data;
step S202, determining each historical user intention according to the keywords;
step S203, labeling the dialogue sentences containing the intentions of the historical dialogue data;
step S204, calculating the user intention a in the historical dialogue data by using the following formula i Probability of (c):
Figure BDA0002288053280000071
wherein, a i I-th user intention, m, for each historical user intention i For the user's intention a i Number of occurrences in historical dialogue data, p (a) i ) For the user's intention a i Probability of occurrence in historical dialogue data.
The beneficial effects of the above technical scheme are: the method comprises the steps of accurately collecting each historical user intention in user historical dialogue data, calculating the probability of each user intention according to a formula, and predicting the user intention probability in the current input text of a user by using the calculated probability as a priori knowledge sequence.
In one embodiment, counting the probability of the occurrence of each historical user intention in the historical dialogue data as a priori knowledge sequence further comprises:
the occurrence of the user intention a under the first specific condition in the history dialogue data is calculated according to the following formula i Probability of (2):
Figure BDA0002288053280000081
Wherein, a j For the jth user intention, m, of the historical user intentions ij Is a i At a j Number of subsequent occurrences in the historical dialogue data, m j Is a j Number of occurrences in historical dialogue data;
the occurrence of the user intention a under the second specific condition in the history dialogue data is calculated according to the following formula i Probability of (c):
Figure BDA0002288053280000082
wherein, a k For the kth user intention, m, among the historical user intents ijk Is a i At a j And a k Number of subsequent occurrences in the historical dialogue data, m jk Is a j And a k Number of occurrences in historical dialogue data at the same time;
the first specific condition mentioned above may be in particular given the user's intention a j The above-mentioned second specific condition may be that the user's intention a is given j And user intention a k Under the conditions of (a).
The beneficial effects of the above technical scheme are: calculating a under various conditions i The occurrence probability enriches the types of the priori knowledge sequences, accurately predicts the user intentions according to the number of the user intentions of the current input text of the user, reduces the prediction range to a certain extent, increases the prediction types and improves the prediction accuracy.
In one embodiment, the user intention a in the currently input text is calculated by using a priori knowledge sequence i Includes:
predicting that the currently input text belongs to the user intention a according to the prior knowledge sequence i The probability of (q 2);
if the sequence length is larger than or equal to a first preset threshold value, q2= p (a) i |b 1 ,b 2 );
If the sequence length is equal to a second predetermined threshold, q2= p (a) i |b 1 );
If the sequence length is equal to a third predetermined threshold, q2= p (a) i );
Wherein b1 and b2 are historical intentions to which the currently input text belongs.
Specifically, the first preset threshold may be 2, the second preset threshold may be 1, and the third preset threshold may be 0.
The beneficial effects of the above technical scheme are: the current intention of the user is predicted according to the situation of the combination of the current input text and the prior knowledge sequence, the user intention in the current input text can be predicted more effectively, and compared with the prior art, the prediction result is more accurate.
In one embodiment, selecting the most probable intent as the current intent of the currently input text based on the probabilities q1 and q2 comprises:
calculating the user intention a of the currently input text by using the following formula i Total probability of (q):
q=q1*q2;
and outputting the user intention with the maximum total probability q as the current intention of the currently input text.
The beneficial effects of the above technical scheme are: the predicted probability in the training model and the predicted probability of the prior knowledge sequence are combined to more accurately predict the user intention of the current input text of the user.
In one embodiment, the method comprises the following steps:
step 1: and collecting and labeling data.
Step 1.1: a large amount of dialogue data is collected using information such as dialogue logs.
Step 1.2: user intentions are defined that the current domain will involve. As in the air ticket booking task, the user intentions are: greetings, fares, flights, time, price, etc. Assume a total of n different intents, of which the ith oneIs intended to be denoted as a i
Step 1.3: for part of the dialogue data, each sentence in each dialogue is manually marked to indicate which of the intentions defined in the step 1.2 belongs to.
Step 2: and acquiring prior knowledge.
Step 2.1: counting intention a according to the marked data in the step 1 i The number of occurrences is recorded as m i Thereby calculating the probability of each intention occurrence
Figure BDA0002288053280000091
Step 2.2: according to the marked data in the step 1, counting intentions a i In intention a j The number of subsequent occurrences, denoted m ij Thereby calculating the intention a i In a given intention a j Is a probability of occurrence under the condition of
Figure BDA0002288053280000092
Step 2.3: according to the marked data in the step 1, counting intentions a i In intention a j And intention a k The number of subsequent occurrences, denoted m ijk Thereby calculating the intention a i In a given intention a j And intention a k Probability of occurrence under the condition of (2)
Figure BDA0002288053280000101
And 3, step 3: a priori knowledge is fused in dialog intent recognition.
Step 3.1: predicting the user's intention a by using the classification model in the machine learning method according to the text input by the user i Probability of (a) q' (a) i )。
Step 3.2: predicting that the current moment of the user belongs to the intention a according to the intention sequence in the historical information of the current conversation i Probability of (a) q ″ i ). Assume that the historical intent sequence of this dialog is [ b ] 1 ,b 2 ,…]Wherein the intention of the last moment is denoted b 1 . If it is pairedIf the length of the historical intention sequence is greater than or equal to 2, q' (a) i )=p(a i |b 1 ,b 2 ). If the dialog history intention sequence length is 1, q' (a) i )=p(a i |b 1 ). If the dialog history intention sequence length is 0, then q' (a) i )=p(a i )。
Step 3.3: predicting the belongingness a of a user i Probability of (a) q (a) i )=q′(a i )×q″(a i )。
Step 3.4: and selecting the intention with the highest probability as the intention of the user.
The working principle and the beneficial effects of the technical scheme are as follows: the method provides a dialogue intention recognition method fusing priori knowledge, and the intention recognition is regarded as a step Markov process, namely when the intention of a current user is predicted, the previous intention is required to be considered to enable the predicted result to be more accurate. Compared with the method for inputting the current text and utilizing the training model to perform the user intention in the prior art, the method disclosed by the invention is more integrated with the prior knowledge.
The present embodiment also provides a dialog intention recognition apparatus, as shown in fig. 3, the apparatus including:
a statistical module 301, configured to count probabilities of occurrence of each historical user intention in the historical dialog data as a priori knowledge sequence;
a prediction module 302 for predicting that the currently inputted text belongs to the user intention a according to a preset classification model i The probability of (a) q1;
a calculating module 303, configured to calculate that the currently input text belongs to the user intention a by using the priori knowledge sequence i The probability of (q 2);
and a selecting module 304, configured to select, according to the probability q1 and the probability q2, the intention with the highest probability as the current intention of the currently input text.
In one embodiment, as shown in fig. 4, the statistics module includes:
the obtaining sub-module 401 is configured to collect historical dialogue data by using the dialogue log, and obtain a keyword in the historical dialogue data;
a determining submodule 402 configured to determine each historical user intention according to the keyword;
a labeling submodule 403, configured to label a dialog statement that includes an intention of each historical user in historical dialog data;
a first calculation submodule 404 for calculating the user intention a in the history dialogue data by using the following formula i Probability of (c):
Figure BDA0002288053280000111
wherein, a i I-th user intention, m, for each historical user intention i For the user's intention a i Number of occurrences in historical dialogue data, p (a) i ) For the user's intention a i Probability of occurrence in the historical dialogue data.
In one embodiment, the statistics module further comprises:
a second calculation submodule for calculating the occurrence of the user intention a under the first specific condition in the history dialogue data according to the following formula i Probability of (c):
Figure BDA0002288053280000112
wherein, a j For the jth user intention, m, of the historical user intentions ij Is a i At a j Number of subsequent occurrences in the historical dialogue data, m j Is a j Number of occurrences in historical dialogue data;
a third calculation submodule for calculating the user intention a appearing in the historical dialogue data under the second specific condition according to the following formula i Probability of (c):
Figure BDA0002288053280000113
wherein, a k For each historical user intention(k) th user intention, m ijk Is a i At a j And a k Number of subsequent occurrences in the historical dialogue data, m jk Is a j And a k As well as the number of occurrences in the historical dialog data.
In one embodiment, a computing module, comprising:
a prediction submodule for predicting that the currently input text belongs to the user intention a according to the prior knowledge sequence i The probability of (q 2);
a first output submodule for outputting q2= p (a) if the sequence length is greater than or equal to a first preset threshold value i |b 1 ,b 2 ) If the sequence length is equal to a second predetermined threshold, output q2= p (a) i |b 1 ) If the sequence length is equal to a third predetermined threshold, the output q2= p (a) i );
Wherein b1 and b2 are historical intentions to which the currently input text belongs.
In one embodiment, the selection module includes:
a fourth calculation sub-module for calculating that the currently inputted text belongs to the user intention a using the following formula i Total probability of (q):
q=q1*q2;
and the second output sub-module is used for outputting the user intention with the maximum total probability q as the current intention of the currently input text.
Other embodiments of the disclosure will be apparent to those skilled in the art from consideration of the specification and practice of the disclosure disclosed herein. This application is intended to cover any variations, uses, or adaptations of the disclosure following, in general, the principles of the disclosure and including such departures from the present disclosure as come within known or customary practice in the art to which the disclosure pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the disclosure being indicated by the following claims.
It will be understood that the present disclosure is not limited to the precise arrangements described above and shown in the drawings and that various modifications and changes may be made without departing from the scope thereof. The scope of the present disclosure is limited only by the appended claims.

Claims (4)

1. A dialog intention recognition method, comprising the steps of:
counting the probability of the intention of each historical user in the historical dialogue data to serve as a priori knowledge sequence;
predicting that the currently input text belongs to the user intention a according to a preset classification model i The probability of (a) q1;
calculating the current input text belonging to the user intention a by using the prior knowledge sequence i The probability of (q 2);
according to the probability q1 and the probability q2, selecting an intention with the highest probability as a current intention of the currently input text;
wherein, the statistical historical dialogue data with the probability of each historical user intention as a priori knowledge sequence comprises:
collecting the historical dialogue data by using a dialogue log to obtain key words in the historical dialogue data;
determining the intentions of the historical users according to the keywords;
labeling the dialogue sentences containing the intentions of the historical dialogue data;
calculating the user intention a in the historical dialogue data by using the following formula i Probability of (c):
Figure QLYQS_1
wherein, the a i For the ith user intention of the historical user intentions, the m i For the user intention a i The number of occurrences in the historical dialogue data, p (a) i ) For the user intention a i A probability of occurrence in the historical dialogue data;
wherein, the statistical historical dialogue data with the probability of each historical user intention as a priori knowledge sequence, further comprises:
calculating the user intention a appearing in the historical dialogue data under a first specific condition according to the following formula i Probability of (c):
Figure QLYQS_2
wherein, the a j For the jth user intention in the historical user intentions, the m ij Is a said i At the a j Number of subsequent occurrences in the historical dialogue data, the m j Is a said j A number of occurrences in the historical dialog data;
calculating the user intention a appearing in the historical dialogue data under a second specific condition according to the following formula i Probability of (c):
Figure QLYQS_3
wherein, the a k For the kth user intention in the historical user intents, the m ijk Is a said i At the a j And said a k Number of subsequent occurrences in the historical dialog data, the m jk Is a said j And a k The number of occurrences in the historical dialog data at the same time;
wherein, the prior knowledge sequence is used for calculating the intention a of the user in the currently input text i Includes:
predicting that the currently input text belongs to the user intention a according to the prior knowledge sequence i The probability of (q 2);
if the sequence length is greater than or equal to a first preset threshold,
Figure QLYQS_4
if the sequence length is equal to a second predetermined lengthThe threshold value is set to a value that is,
Figure QLYQS_5
if the sequence length is equal to a third predetermined threshold,
Figure QLYQS_6
wherein, the b1 and the b2 are historical intentions to which the currently input text belongs.
2. The dialog intention recognition method of claim 1 wherein selecting the most probable intention as the current intention of the currently input text based on the probability q1 and the probability q2 comprises:
calculating that the currently inputted text belongs to the user intention a by using the following formula i Total probability of (q):
Figure QLYQS_7
and outputting the user intention with the maximum total probability q as the current intention of the currently input text.
3. A dialog intention recognition apparatus, characterized in that the apparatus comprises:
the statistical module is used for counting the probability of the intention of each historical user in the historical dialogue data to serve as a priori knowledge sequence;
a prediction module for predicting the currently input text belonging to the user intention a according to a preset classification model i The probability of (a) q1;
a calculation module for calculating the current input text belonging to the user intention a by using the priori knowledge sequence i The probability of (q 2);
the selection module is used for selecting the intention with the maximum probability as the current intention of the currently input text according to the probability q1 and the probability q2;
wherein, the statistic module comprises:
the acquisition sub-module is used for collecting the historical dialogue data by using a dialogue log and acquiring keywords in the historical dialogue data;
the determining submodule is used for determining the intentions of the historical users according to the keywords;
the marking submodule is used for marking the dialogue sentences containing the intentions of the historical user in the historical dialogue data;
a first calculation submodule for calculating a user intention a in the history dialogue data using the following formula i Probability of (c):
Figure QLYQS_8
wherein, the a i The ith user intention being the historical user intents, the m i For the user intention a i The number of occurrences in the historical dialogue data, p (a) i ) For the user intention a i A probability of occurrence in the historical dialogue data;
wherein, the statistic module further comprises:
a second calculation submodule for calculating the occurrence of the user intention a under a first specific condition in the history dialogue data according to the following formula i Probability of (c):
Figure QLYQS_9
wherein, the a j For the jth user intention in the historical user intentions, the m ij Is a said i At the a j Number of subsequent occurrences in the historical dialog data, the m j Is a said j A number of occurrences in the historical dialog data;
a third calculation submodule for calculating the user intention a appearing in the historical dialogue data under a second specific condition according to the following formula i Probability of (c):
Figure QLYQS_10
wherein, the a k For the kth user intention in the historical user intentions, the m ijk Is a said i At the a j And said a k Number of subsequent occurrences in the historical dialog data, the m jk Is a said j And a k The number of occurrences in the historical dialog data at the same time;
wherein the computing module comprises:
a prediction sub-module for predicting that the currently inputted text belongs to the user intention a according to the prior knowledge sequence i The probability of (q 2);
a first output submodule for outputting if the sequence length is greater than or equal to a first preset threshold value
Figure QLYQS_11
If the sequence length equals a second predetermined threshold value, output->
Figure QLYQS_12
If the sequence length equals a third predetermined threshold value, an output->
Figure QLYQS_13
Wherein, the b1 and the b2 are historical intentions to which the currently input text belongs.
4. The dialog intent recognition device of claim 3 wherein the selection module comprises:
a fourth calculation sub-module for calculating that the currently inputted text belongs to the user intention a using the following formula i Total probability of (q):
Figure QLYQS_14
and the second output submodule is used for outputting the user intention with the maximum total probability q as the current intention of the currently input text.
CN201911168331.5A 2019-11-25 2019-11-25 Dialog intention recognition method and device Active CN111177331B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911168331.5A CN111177331B (en) 2019-11-25 2019-11-25 Dialog intention recognition method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911168331.5A CN111177331B (en) 2019-11-25 2019-11-25 Dialog intention recognition method and device

Publications (2)

Publication Number Publication Date
CN111177331A CN111177331A (en) 2020-05-19
CN111177331B true CN111177331B (en) 2023-04-18

Family

ID=70653709

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911168331.5A Active CN111177331B (en) 2019-11-25 2019-11-25 Dialog intention recognition method and device

Country Status (1)

Country Link
CN (1) CN111177331B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111858936A (en) * 2020-07-21 2020-10-30 龙马智芯(珠海横琴)科技有限公司 Intention identification method and device, identification equipment and readable storage medium
CN112035648B (en) * 2020-09-02 2024-02-23 中国科学技术大学 User data processing method and device and electronic equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH10198395A (en) * 1997-01-10 1998-07-31 Atr Onsei Honyaku Tsushin Kenkyusho:Kk Statistical language model generator and voice recognition device
CN104699784A (en) * 2015-03-13 2015-06-10 苏州思必驰信息科技有限公司 Data searching method and device based on interactive input
CN110162633A (en) * 2019-05-21 2019-08-23 深圳市珍爱云信息技术有限公司 Voice data is intended to determine method, apparatus, computer equipment and storage medium

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101709187B1 (en) * 2012-11-14 2017-02-23 한국전자통신연구원 Spoken Dialog Management System Based on Dual Dialog Management using Hierarchical Dialog Task Library

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH10198395A (en) * 1997-01-10 1998-07-31 Atr Onsei Honyaku Tsushin Kenkyusho:Kk Statistical language model generator and voice recognition device
CN104699784A (en) * 2015-03-13 2015-06-10 苏州思必驰信息科技有限公司 Data searching method and device based on interactive input
CN110162633A (en) * 2019-05-21 2019-08-23 深圳市珍爱云信息技术有限公司 Voice data is intended to determine method, apparatus, computer equipment and storage medium

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
彭武 ; 姚淑萍 ; .基于概率推理的入侵意图识别研究.计算机科学.2010,(01),全文. *
杨成彪 ; 吕荣荣 ; 吴刚 ; .一种基于记忆网络的多轮对话下的意图识别方法.电子技术与软件工程.2018,(10),全文. *

Also Published As

Publication number Publication date
CN111177331A (en) 2020-05-19

Similar Documents

Publication Publication Date Title
US11693894B2 (en) Conversation oriented machine-user interaction
CN112069298A (en) Human-computer interaction method, device and medium based on semantic web and intention recognition
CN107045496A (en) The error correction method and error correction device of text after speech recognition
CN107704512A (en) Financial product based on social data recommends method, electronic installation and medium
CN112395421B (en) Course label generation method and device, computer equipment and medium
CN112163424A (en) Data labeling method, device, equipment and medium
CN114416942A (en) Automatic question-answering method based on deep learning
CN110580308A (en) information auditing method and device, electronic equipment and storage medium
CN105930432A (en) Training method and apparatus for sequence labeling tool
CN111177331B (en) Dialog intention recognition method and device
CN117435716B (en) Data processing method and system of power grid man-machine interaction terminal
CN114416979A (en) Text query method, text query equipment and storage medium
CN113742733A (en) Reading understanding vulnerability event trigger word extraction and vulnerability type identification method and device
CN112417132A (en) New intention recognition method for screening negative samples by utilizing predicate guest information
CN112699671B (en) Language labeling method, device, computer equipment and storage medium
CN114266252A (en) Named entity recognition method, device, equipment and storage medium
CN110362828B (en) Network information risk identification method and system
AU2019290658B2 (en) Systems and methods for identifying and linking events in structured proceedings
CN117216275A (en) Text processing method, device, equipment and storage medium
CN116303983A (en) Keyword recommendation method and device and electronic equipment
CN110543636B (en) Training data selection method for dialogue system
CN113935309A (en) Skill optimization processing method and system based on semantic platform
CN113673237A (en) Model training method, intent recognition method, device, electronic equipment and storage medium
CN113011162A (en) Reference resolution method, device, electronic equipment and medium
CN112925910A (en) Method, device and equipment for assisting corpus labeling and computer storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant