CN104166643A - Dialogue act analyzing method in intelligent question-answering system - Google Patents

Dialogue act analyzing method in intelligent question-answering system Download PDF

Info

Publication number
CN104166643A
CN104166643A CN201410410275.2A CN201410410275A CN104166643A CN 104166643 A CN104166643 A CN 104166643A CN 201410410275 A CN201410410275 A CN 201410410275A CN 104166643 A CN104166643 A CN 104166643A
Authority
CN
China
Prior art keywords
label
dialogue
dialogue behavior
behavior unit
classification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410410275.2A
Other languages
Chinese (zh)
Inventor
吴云芳
王异秀
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NANJING JINWAWA SOFTWARE TECHNOLOGY Co Ltd
Original Assignee
NANJING JINWAWA SOFTWARE TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NANJING JINWAWA SOFTWARE TECHNOLOGY Co Ltd filed Critical NANJING JINWAWA SOFTWARE TECHNOLOGY Co Ltd
Priority to CN201410410275.2A priority Critical patent/CN104166643A/en
Publication of CN104166643A publication Critical patent/CN104166643A/en
Pending legal-status Critical Current

Links

Landscapes

  • Machine Translation (AREA)

Abstract

The invention discloses a dialogue act analyzing method in an intelligent question-answering system. Firstly, a dialogue act label set suitable for spoken Chinese is sorted out; meanwhile, a hierarchical classification device used for analyzing dialogue act units is built; then each sentence of a user is segmented into the dialogue act units; the dialogue act units are classified through the hierarchical classification device to obtain corresponding classes. By means of the technical scheme, automatic analysis of dialogue acts can be achieved effectively at a high speed, and a foundation is laid for intelligent dialogues.

Description

Dialogue behavior analysis method in a kind of Intelligent Answer System
Technical field
The present invention relates to a kind of automatic analysis method of talking with behavior, be specifically related to a kind of sequence labelling method of behavior unit and machine learning method of level of talking with.
Background technology
Dialogue behavior, main research dialogue is beyond semanteme, and the information reflecting, as dialogue intention, session structure information etc.In English, dialogue behavior is a time-honored research, comprises complete tag system and automatic analysis method.And in Chinese, this research does not launch.Below briefly introduce the progress of dialogue behavioral study in English.
2000, Stollcke proposes to use the feature of grammer and morphology, utilize the complete dialogue behavior unit of mark, dialogue is regarded as to independently sentence stream, set up the task that Hidden Markov Model (HMM) completes dialogue behavior classification, in conjunction with n-gram algorithm, decision tree and neural network, generate validity feature.But the method does not complete the automatic segmentation for dialogue behavior unit.
2005, Ang proposed to utilize decision-tree model to complete the cutting of dialogue behavior unit, and has used the inactive distance feature in same teller, and for dialogue behavior unit, cutting reaches 44% accuracy.Yet this accuracy rate is still lower in dialogue behavioural analysis.
Summary of the invention
Goal of the invention: for above-mentioned prior art, the invention provides the dialogue behavior analysis method in a kind of Intelligent Answer System, solve the automatic segmentation accuracy rate of the dialogue behavior under Chinese spoken environment at present low and then cause talking with the low technical matters of automatic discrimination of behavior.
Technical scheme: for solving the problems of the technologies described above, the invention provides following technical scheme:
First arrange out the dialogue behavior tally set being applicable in Chinese characters spoken language, then by every words cutting of user, be dialogue behavior unit, then creation analysis is talked with the hierarchical classifier of behavior unit, each dialogue behavior unit, through the classification of hierarchical classifier, is finally classified in each label in dialogue behavior tally set; Wherein, while arranging out the dialogue behavior tally set that is applicable to Chinese characters spoken language, with reference to the DAMSL broad sense tally set in English language, and the characteristic of oral language of combined with intelligent question answering system, in the present invention, described tally set comprises 4 kinds of large classes of label, and comprises multiple label in the large class of each label, is specifically respectively:
The large class of the first label: reaction teller's intention; Concrete label classification and meaning tag are as follows:
<S>: state a conviction or event, referred to as declarative sentence;
<Q>: can not be with being or the open question sentence of no answer, referred to as open question sentence;
<QYN>: yes-no question;
<R>:: expect that hearer makes the question sentence of an action or response, referred to as demand question sentence;
<TH>: the term of courtesy such as express thanks, referred to as thanking to sentence;
The large class of the second label: reaction teller is for reaction above; Concrete label classification and meaning tag are as follows:
<SA>:: a upper declarative sentence is made to response;
<RA>: a upper demand question sentence is made to response;
<AY>: the affirmative acknowledgement (ACK) of yes-no question;
<AN>: the negative acknowledge of yes-no question;
<AQ>: the answer of open question sentence;
<D>: to thanking the courtesy that sentence is made to respond;
The large class of the third label: session structure information, the effect of response utterance to session structure; Concrete label classification and meaning tag are as follows:
<CO>: start a dialogue;
<CC>: finish a dialogue;
<CT>:: continue a dialogue;
The 4th kind of large class of label: except other situations three kinds of large classes of label above; Concrete label classification and meaning tag are as follows:
<U>: represent uncertain information.
Owing to considering in a sentence, can contain a plurality of dialogue behaviors, first, we adopt following algorithm is dialogue behavior unit sentence cutting:
Step 1.1, language be take to comma, space as separator is broken into elementary cell, extract the text feature of each elementary cell;
Step 1.2, utilize sequence labelling algorithm, judge whether each elementary cell is the beginning of a new dialogue behavior unit;
Step 1.3, according to the judged result of step 1.2, the elementary unit groups that will be under the jurisdiction of same dialogue behavior unit becomes dialogue behavior unit, talks with and between behavior unit, forms dialogue behavior cellular chain.
Further, in the present invention, the text feature of extraction comprises following 10 kinds:
The length of elementary cell;
Whether contain verb;
Whether contain conjunction;
Whether contain number;
Whether contain pronoun;
Whether contain high frequency stop word;
Whether contain the high frequency words that independently forms a complete sentence;
With this elementary cell before the cell pairs that forms of elementary cell, whether contain conjunction pair;
With this elementary cell before the cell pairs that forms of elementary cell, whether contain identical word;
With this elementary cell before the cell pairs that forms of elementary cell, whether contain similar word.
As preferably, in the present invention, described sequence labelling algorithm is selected linear chain condition random field algorithm.This algorithm is widely used in sequence labelling problem.
Because the judgement of subdialogue behavior need to depend on hierarchical classifier above, and the purposes of subdialogue behavior is comparatively single, and the form of expression is similar.Therefore in the present invention, the These characteristics having with reference to dialogue behavior builds hierarchical classifier, and the sorting technique of hierarchical classifier comprises the steps:
Whether step 2.1, judgement dialogue behavior unit can sort out a kind of in <CO>, <CC>, <CT>, <TH>, <D>, <AN>5 kind tag class; If it is exit hierarchical classifier, the classification information of label under returning, finishes classification; If, do not continue step 2.2;
Whether step 2.2, judgement dialogue behavior unit can sort out a kind of in <SA>, <RA>, <AY>3 kind tag class, if it is skip to step 2.4; Otherwise skip to step 2.3;
Whether step 2.3, judgement dialogue behavior unit can sort out a kind of in <Q>, <QYN>, <R>3 kind tag class; If judge dialogue behavior unit, can sort out specifically a certain in <Q>, <QYN>, <R>, exit hierarchical classifier, the classification information of label under returning, finishes classification; If judge dialogue behavior unit, all can not sort out to these 3 kinds of tag class, skip to step 2.5;
The dialogue behavior of having decided in step 2.4, utilization dialogue above, judgement dialogue behavior unit belongs to any in <SA>, <RA>, <AY>, then exit hierarchical classifier, the classification information of label under returning, finishes classification;
Step 2.5, structure svm classifier device, utilize svm classifier device judgement dialogue behavior unit can sort out to label <S> or label <AQ>, then exit hierarchical classifier, the classification information of label under returning, finishes classification.
In the present invention, <CO> in step 2.1, <CC> and <CT> all represent session structure information, together with <TH>, <D> and these 6 of <AN>, be all the classification the most easily judging, so first judge; Next label <SA>, the <RA> in step 2.2 and <AY> all belong to a kind of positive response, judged this step, remaining judgement direction can be divided into the concrete positive response type of judgement and determine whether 2 kinds of interrogative sentences, wherein interrogative sentence comprises <Q>, <QYN>, <R>3 kind; Whether following step 2.3 is again for being that interrogative sentence judges and draws thus two kinds of results, if interrogative sentence continues to judge the type of concrete interrogative sentence, be any in <Q>, <QYN>, <R> on earth, if not interrogative sentence, judge and can sort out to label <S> or label <AQ>.Above process is known, sets artificially the judgement sequencing of hierarchical classifier, by easy first and difficult later mode, is judged and is classified, and has improved efficiency and the accuracy rate of operation.
Further, in the present invention, in step 2.3, it is as follows whether judgement dialogue behavior unit can be sorted out to a kind of method in <Q>, <QYN>, <R>: first whether judgement dialogue behavior unit can be sorted out to label <QYN>; If can not sort out to label <QYN>, utilize frequent sequence signature and word bag model judgement dialogue behavior unit whether can sort out to label <R>; If can not sort out to label <R>, judge that this dialogue behavior unit sorts out to label <Q>.
Further, in the present invention, in step 2.5, the structure of svm classifier device utilizes following feature: the classification results of the dialogue behavior of above having decided, contained verb and noun in the dialogue behavior unit that belongs to label <Q> in this dialogue behavior unit and 5 dialogues above, distance between this dialogue behavior unit and the nearest dialogue behavior unit that belongs to label <Q>, in the dialogue behavior unit that belongs to label <Q> in this dialogue behavior unit and 5 dialogues above, whether contain the word pair repeating.
Beneficial effect:
The inventive method is first according to talking with behavior label standard in English, arrange out the applicable dialogue behavior tally set of Chinese characters spoken language, and a kind of automatic analysis method of talking with behavior is provided, for solving for the automatic segmentation of dialogue behavior unit and the automatic identification of dialogue behavior.
Principle of the present invention is, first extracts text feature, uses sequence labelling algorithm to be syncopated as dialogue behavior unit, recycles frequent arrangement set feature, and the component level sorter such as non-text feature, automatic analysis dialogue behavior unit.
In Chinese, for dialogue behavior research seldom, the feature that the present invention is directed to dialogue behavior is effectively analyzed, for next needing the work that relies on dialogue behavior to lay the foundation;
Compare with the dialogue behavior in English, the present invention is directed to Intelligent Answer System, tally set in English has been carried out improving obtaining being applicable to the tally set of Chinese, and taked sorting technique efficient, level, it can be applied in practical Intelligent Answer System.
Utilize technical scheme provided by the invention, the automatic analysis of the behavior that can engage in the dialogue effectively, at high speed, for reaching of Intelligent dialogue lays the foundation.
Accompanying drawing explanation
Fig. 1 dialogue behavior unit of the present invention cutting flow process;
The dialogue behavior autoanalyzer flow process of Fig. 2 level of the present invention.
Embodiment
Below in conjunction with accompanying drawing, the present invention is further described.
Dialogue behavior analysis method in a kind of Intelligent Answer System of the present invention, first arranges out the as shown in table 1 below dialogue behavior tally set in Chinese characters spoken language that is applicable to.
Table 1
As can be seen from Table 1, tally set comprises 4 kinds of label classifications, is respectively:
The first, reaction teller's intention;
The second, reaction teller are for reaction above;
The third, the effect of response utterance to session structure;
The 4th kind, except other situations three kinds above.
For every kind of label classification, give more careful several label groups, the implication of every kind of label group representative is all different, there is no each other cross one another part.Before three kinds of label classifications substantially contained the common dialogue behavior of language representative, also have the atypical dialogue behavior of small part to be all referred in the third.
In the process that label is marked, notice that a lot of sentences contain a plurality of dialogue behaviors.Therefore, first we will be dialogue behavior unit sentence cutting, then carry out next step dialogue behavior analytical work.
In practical operation, by a word cutting of user, be dialogue behavior unit, comprise execution following steps:
Step 1.1, user's language be take to comma, space as separator is broken into elementary cell, extracts the text feature of each elementary cell, comprise following 10 kinds:
The length of elementary cell;
Whether contain verb;
Whether contain conjunction;
Whether contain number;
Whether contain pronoun;
Whether contain high frequency stop word;
Whether contain the high frequency words that independently forms a complete sentence;
With this elementary cell before the cell pairs that forms of elementary cell, whether contain conjunction pair;
With this elementary cell before the cell pairs that forms of elementary cell, whether contain identical word;
With this elementary cell before the cell pairs that forms of elementary cell, whether contain similar word.
By feature numeral, represent respectively above-mentioned text feature, the length of elementary cell wherein represents by three-dimensional, all the other 9 kinds of text features are all used one-dimensional representation, concrete, if the answer of the problem of paying close attention in text feature is for negating, do not occur, text feature represents with 0, if the answer of the problem of paying close attention in text feature is sure, occur, with 1, represent, also has a kind of situation, if text feature is that invalid feature is (as first elementary cell of this elementary cell formula, cannot try to achieve with this elementary cell before the unit that forms of elementary cell to), the text feature of this situation represents with 2.Become characteristic number word string in order to represent elementary cell above-mentioned feature numeral der group.
Step 1.2, utilize the sequence labelling algorithm of linear chain random field algorithm, judge whether each elementary cell is the beginning of a new dialogue behavior unit;
Step 1.3, according to the judged result of step 1.2, the elementary unit groups that will be under the jurisdiction of same dialogue behavior unit becomes dialogue behavior unit, talks with and between behavior unit, forms dialogue behavior cellular chain.
Engage in the dialogue after the cutting of behavior unit, by according to the feature of the dialogue behavior summing up, the hierarchical classifier of creation analysis dialogue behavior unit, the sorting technique of hierarchical classifier comprises the steps:
Whether step 2.1, judgement dialogue behavior unit can sort out a kind of in <CO>, <CC>, <CT>, <TH>, <D>, <AN>5 kind tag class; If it is exit hierarchical classifier, the classification information of label under returning, finishes classification; If, do not continue step 2.2;
Whether step 2.2, judgement dialogue behavior unit can sort out a kind of in <SA>, <RA>, <AY>3 kind tag class, if it is skip to step 2.4; Otherwise skip to step 2.3;
Whether step 2.3, judgement dialogue behavior unit can sort out a kind of in <Q>, <QYN>, <R>3 kind tag class; If judge dialogue behavior unit, can sort out a certain in <Q>, <QYN>, <R>, the category that belongs to interrogative sentence, then specifically judge and belong to any, first whether judgement dialogue behavior unit can be sorted out to label <QYN>; If can not sort out to label <QYN>, utilize frequent sequence signature and word bag model judgement dialogue behavior unit whether can sort out to label <R>; If can not sort out to label <R>, judge that this dialogue behavior unit sorts out to label <Q>, judge backed off after random hierarchical classifier, the classification information of label under returning, finishes classification; If judge dialogue behavior unit, all can not sort out to these 3 kinds of tag class, skip to step 2.5;
The dialogue behavior of having decided in step 2.4, utilization dialogue above, judgement dialogue behavior unit belongs to any in <SA>, <RA>, <AY>, then exit hierarchical classifier, the classification information of label under returning, finishes classification;
Step 2.5, utilize the classification results of the dialogue behavior of having decided above, contained verb and noun in the dialogue behavior unit that belongs to label <Q> in this dialogue behavior unit and 5 dialogues above, distance between this dialogue behavior unit and the nearest dialogue behavior unit that belongs to label <Q>, in the dialogue behavior unit that belongs to label <Q> in this dialogue behavior unit and 5 dialogues above, whether contain the word pair repeating, build svm classifier device, utilize svm classifier device judgement dialogue behavior unit can sort out to label <S> or label <AQ>, then exit hierarchical classifier, the classification information of label under returning, finish classification.
Below by a specific embodiment, the present invention will be further described.
Suppose that in one section of dialogue advancing, active user's language is: our use, will the picture of RM be charged according to purposes, is it right?
And the information above of knowing the current language of user is following 3 dialogues:
The 1st
A customer service: you are good!
The 2nd
Client: I want to buy a pictures.
The 3rd
Customer service: are you what purposes of current picture?
Wherein the 1st word are automatically recognized as this dialogue behavior is to open dialogue, and the 2nd word are automatically recognized as this dialogue behavior is declarative sentence, the 3rd word be automatically recognized for this dialogue behavior be interrogative sentence.
Current language is analyzed, first, the cutting of the behavior unit that engages in the dialogue, flow process as shown in Figure 1, comprises the steps:
1) user's language short by comma be two elementary cells, the set expression of the elementary cell of the words language is { (we use ourselves) so, (picture of RM will be charged according to purposes, right?), and the text feature of choosing according to the present invention, each elementary cell is expressed as to the characteristic number word string { (0 1002210220 1), (0 0100000010 1) } that string number forms;
2) utilize linear chain condition random field algorithm to judge whether these 2 elementary cells are the beginning of a new dialogue behavior unit, input using characteristic number word string as linear chain condition random field algorithm, the judged result obtaining is as output, obtaining (we use ourselves) is the beginning of a dialogue behavior unit, (picture of RM will be charged according to purposes, right?) be also the beginning of a dialogue behavior unit;
3) according to the result of previous step, language is divided into 2 dialogue behavior units, will be respectively (we oneself with) and (picture of RM be charged according to purposes, right?), thereby acquisition dialogue behavior cellular chain (we use ourselves), (picture of RM will be charged according to purposes, right?).
Secondly, take that to talk with behavior unit (we own with) be example, the engage in the dialogue automatic identification of behavior of use hierarchical classifier, comprises the following steps:
1) judgement (we use) does not belong to a kind of in <CO>, <CC>, <CT>, <TH>, <D>, <AN>6 kind tag class by own;
2) judgement (we use) does not belong to a kind of in <SA>, <RA>, <AY>3 kind tag class by own;
3) judgement (we use) does not belong to a kind of in <Q>, <QYN>, <R>3 kind tag class by own;
4) basis dialog information above, calculating (we use ourselves) is 1 (distance that represents two sentences is 1) with the distance feature of the open question sentence (your what purposes of current picture) above occurring, and contain (you, I), (purposes, with) two pairs of words pair that repeat, by svm classifier device, judge that it belongs to for the answer of open question sentence is above label <AQ>, return to this classification information, exit hierarchical classifier, and continue the differentiation of next dialogue behavior unit.
Finally, obtain the net result of judgement: (we use ourselves) is the answer of open question sentence; (picture of RM will be charged according to purposes, right?) be yes-no question.
The above is only the preferred embodiment of the present invention; be noted that for those skilled in the art; under the premise without departing from the principles of the invention, can also make some improvements and modifications, these improvements and modifications also should be considered as protection scope of the present invention.

Claims (7)

1. the dialogue behavior analysis method in an Intelligent Answer System, it is characterized in that: first arrange out the dialogue behavior tally set being applicable in Chinese characters spoken language, then by every words cutting of user, be dialogue behavior unit, then creation analysis is talked with the hierarchical classifier of behavior unit, each dialogue behavior unit, through the classification of hierarchical classifier, is finally classified in each label in dialogue behavior tally set; Wherein, while arranging out the dialogue behavior tally set that is applicable to Chinese characters spoken language, build tally set and be characterised in that: described tally set comprises 4 kinds of large classes of label, and comprises multiple label in the large class of each label, is specifically respectively:
The large class of the first label: reaction teller's intention; Concrete label classification and meaning tag are as follows:
<S>: state a conviction or event, referred to as declarative sentence;
<Q>: can not be with being or the open question sentence of no answer, referred to as open question sentence;
<QYN>: yes-no question;
<R>: expect that hearer makes the question sentence of an action or response, referred to as demand question sentence;
<TH>: the term of courtesy such as express thanks, referred to as thanking to sentence;
The large class of the second label: reaction teller is for reaction above; Concrete label classification and meaning tag are as follows:
<SA>:: a upper declarative sentence is made to response;
<RA>: a upper demand question sentence is made to response;
<AY>: the affirmative acknowledgement (ACK) of yes-no question;
<AN>: the negative acknowledge of yes-no question;
<AQ>: the answer of open question sentence;
<D>: to thanking the courtesy that sentence is made to respond;
The large class of the third label: session structure information, the effect of response utterance to session structure; Concrete label classification and meaning tag are as follows:
<CO>: start a dialogue;
<CC>: finish a dialogue;
<CT>:: continue a dialogue;
The 4th kind of large class of label: except other situations three kinds of large classes of label above; Concrete label classification and meaning tag are as follows:
<U>: represent uncertain information.
2. the cutting method of dialogue behavior unit according to claim 1, is characterized in that: during cutting dialogue behavior unit, order is carried out following steps:
Step 1.1, language be take to comma, space as separator is broken into elementary cell, extract the text feature of each elementary cell;
Step 1.2, utilize sequence labelling algorithm, judge whether each elementary cell is the beginning of a new dialogue behavior unit;
Step 1.3, according to the judged result of step 1.2, the elementary unit groups that will be under the jurisdiction of same dialogue behavior unit becomes dialogue behavior unit, talks with and between behavior unit, forms dialogue behavior cellular chain.
3. the dialogue behavior analysis method in Intelligent Answer System according to claim 1, is characterized in that: the text feature of extraction comprises following 10 kinds:
The length of elementary cell;
Whether contain verb;
Whether contain conjunction;
Whether contain number;
Whether contain pronoun;
Whether contain high frequency stop word;
Whether contain the high frequency words that independently forms a complete sentence;
With this elementary cell before the cell pairs that forms of elementary cell, whether contain conjunction pair;
With this elementary cell before the cell pairs that forms of elementary cell, whether contain identical word;
With this elementary cell before the cell pairs that forms of elementary cell, whether contain similar word.
4. the dialogue behavior analysis method in Intelligent Answer System according to claim 1, is characterized in that: described sequence labelling algorithm is selected linear chain condition random field algorithm.
5. the dialogue behavior analysis method in Intelligent Answer System according to claim 1, is characterized in that: the sorting technique of hierarchical classifier comprises the steps:
Whether step 2.1, judgement dialogue behavior unit can sort out a kind of in <CO>, <CC>, <CT>, <TH>, <D>, <AN>6 kind tag class; If it is exit hierarchical classifier, the classification information of label under returning, finishes classification; If, do not continue step 2.2;
Whether step 2.2, judgement dialogue behavior unit can sort out a kind of in <SA>, <RA>, <AY>3 kind tag class, if it is skip to step 2.4; Otherwise skip to step 2.3;
Whether step 2.3, judgement dialogue behavior unit can sort out a kind of in <Q>, <QYN>, <R>3 kind tag class; If judge dialogue behavior unit, can sort out specifically a certain in <Q>, <QYN>, <R>, exit hierarchical classifier, the classification information of label under returning, finishes classification; If judge dialogue behavior unit, all can not sort out to these 3 kinds of tag class, skip to step 2.5;
The dialogue behavior of having decided in step 2.4, utilization dialogue above, judgement dialogue behavior unit belongs to any in <SA>, <RA>, <AY>, then exit hierarchical classifier, the classification information of label under returning, finishes classification;
Step 2.5, structure svm classifier device, utilize svm classifier device judgement dialogue behavior unit can sort out to label <S> or label <AQ>, then exit hierarchical classifier, the classification information of label under returning, finishes classification.
6. the dialogue behavior analysis method in Intelligent Answer System according to claim 5, it is characterized in that: in step 2.3, it is as follows whether judgement dialogue behavior unit can be sorted out to a kind of method in <Q>, <QYN>, <R>: first whether judgement dialogue behavior unit can be sorted out to label <QYN>; If can not sort out to label <QYN>, utilize frequent sequence signature and word bag model judgement dialogue behavior unit whether can sort out to label <R>; If can not sort out to label <R>, judge that this dialogue behavior unit sorts out to label <Q>.
7. the dialogue behavior analysis method in Intelligent Answer System according to claim 5, it is characterized in that: in step 2.5, the structure of svm classifier device utilizes following feature: the classification results of the dialogue behavior of above having decided, contained verb and noun in the dialogue behavior unit that belongs to label <Q> in this dialogue behavior unit and 5 dialogues above, distance between this dialogue behavior unit and the nearest dialogue behavior unit that belongs to label <Q>, in the dialogue behavior unit that belongs to label <Q> in this dialogue behavior unit and 5 dialogues above, whether contain the word pair repeating.
CN201410410275.2A 2014-08-19 2014-08-19 Dialogue act analyzing method in intelligent question-answering system Pending CN104166643A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410410275.2A CN104166643A (en) 2014-08-19 2014-08-19 Dialogue act analyzing method in intelligent question-answering system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410410275.2A CN104166643A (en) 2014-08-19 2014-08-19 Dialogue act analyzing method in intelligent question-answering system

Publications (1)

Publication Number Publication Date
CN104166643A true CN104166643A (en) 2014-11-26

Family

ID=51910463

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410410275.2A Pending CN104166643A (en) 2014-08-19 2014-08-19 Dialogue act analyzing method in intelligent question-answering system

Country Status (1)

Country Link
CN (1) CN104166643A (en)

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105389296A (en) * 2015-12-11 2016-03-09 小米科技有限责任公司 Information partitioning method and apparatus
CN106708983A (en) * 2016-12-09 2017-05-24 竹间智能科技(上海)有限公司 Dialogue interactive information-based user portrait construction system and method
CN106844506A (en) * 2016-12-27 2017-06-13 竹间智能科技(上海)有限公司 The knowledge retrieval method and the automatic improving method of knowledge base of a kind of artificial intelligence dialogue
CN107679042A (en) * 2017-11-15 2018-02-09 北京灵伴即时智能科技有限公司 A kind of multi-layer dialog analysis method towards Intelligent voice dialog system
CN107832293A (en) * 2017-11-07 2018-03-23 北京灵伴即时智能科技有限公司 A kind of conversation activity analysis method towards non-free talk formula Chinese characters spoken language
CN107924680A (en) * 2015-08-17 2018-04-17 三菱电机株式会社 Speech understanding system
WO2018145436A1 (en) * 2017-02-13 2018-08-16 长沙军鸽软件有限公司 Method for extracting conversation pair from conversation content
CN108763510A (en) * 2018-05-30 2018-11-06 北京五八信息技术有限公司 Intension recognizing method, device, equipment and storage medium
CN108959447A (en) * 2018-06-13 2018-12-07 北京信息科技大学 The psychological distance analysis method of interlocutor in an a kind of pair of language piece
WO2019053629A1 (en) * 2017-09-15 2019-03-21 International Business Machines Corporation Training data update
CN109754793A (en) * 2017-11-07 2019-05-14 现代自动车株式会社 Device and method for recommending the function of vehicle
CN110019704A (en) * 2017-10-17 2019-07-16 阿里巴巴集团控股有限公司 Sentence analysis and man-machine interaction method and device calculate equipment, storage medium
CN110188190A (en) * 2019-04-03 2019-08-30 阿里巴巴集团控股有限公司 Talk with analytic method, device, server and readable storage medium storing program for executing

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030091163A1 (en) * 1999-12-20 2003-05-15 Attwater David J Learning of dialogue states and language model of spoken information system
CN1484217A (en) * 2003-07-11 2004-03-24 中国科学院声学研究所 Hierarchical classifying and logic combined natural colloquial conversation dessription method
CN102360383A (en) * 2011-10-15 2012-02-22 西安交通大学 Method for extracting text-oriented field term and term relationship
CN103309926A (en) * 2013-03-12 2013-09-18 中国科学院声学研究所 Chinese and English-named entity identification method and system based on conditional random field (CRF)
CN103823844A (en) * 2014-01-26 2014-05-28 北京邮电大学 Question forwarding system and question forwarding method on the basis of subjective and objective context and in community question-and-answer service

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030091163A1 (en) * 1999-12-20 2003-05-15 Attwater David J Learning of dialogue states and language model of spoken information system
CN1484217A (en) * 2003-07-11 2004-03-24 中国科学院声学研究所 Hierarchical classifying and logic combined natural colloquial conversation dessription method
CN102360383A (en) * 2011-10-15 2012-02-22 西安交通大学 Method for extracting text-oriented field term and term relationship
CN103309926A (en) * 2013-03-12 2013-09-18 中国科学院声学研究所 Chinese and English-named entity identification method and system based on conditional random field (CRF)
CN103823844A (en) * 2014-01-26 2014-05-28 北京邮电大学 Question forwarding system and question forwarding method on the basis of subjective and objective context and in community question-and-answer service

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
杨平: "面向自然语言处理的会话行为分类研究", 《语言文字应用》 *

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107924680A (en) * 2015-08-17 2018-04-17 三菱电机株式会社 Speech understanding system
CN105389296A (en) * 2015-12-11 2016-03-09 小米科技有限责任公司 Information partitioning method and apparatus
CN106708983A (en) * 2016-12-09 2017-05-24 竹间智能科技(上海)有限公司 Dialogue interactive information-based user portrait construction system and method
CN106844506A (en) * 2016-12-27 2017-06-13 竹间智能科技(上海)有限公司 The knowledge retrieval method and the automatic improving method of knowledge base of a kind of artificial intelligence dialogue
WO2018145436A1 (en) * 2017-02-13 2018-08-16 长沙军鸽软件有限公司 Method for extracting conversation pair from conversation content
GB2580805A (en) * 2017-09-15 2020-07-29 Ibm Training data update
US10372826B2 (en) 2017-09-15 2019-08-06 International Business Machines Corporation Training data update
US10621284B2 (en) 2017-09-15 2020-04-14 International Business Machines Corporation Training data update
WO2019053629A1 (en) * 2017-09-15 2019-03-21 International Business Machines Corporation Training data update
CN110019704A (en) * 2017-10-17 2019-07-16 阿里巴巴集团控股有限公司 Sentence analysis and man-machine interaction method and device calculate equipment, storage medium
CN110019704B (en) * 2017-10-17 2023-01-17 阿里巴巴集团控股有限公司 Statement analysis and human-computer interaction method and device, computing device and storage medium
CN109754793A (en) * 2017-11-07 2019-05-14 现代自动车株式会社 Device and method for recommending the function of vehicle
CN107832293A (en) * 2017-11-07 2018-03-23 北京灵伴即时智能科技有限公司 A kind of conversation activity analysis method towards non-free talk formula Chinese characters spoken language
CN107832293B (en) * 2017-11-07 2021-04-09 北京灵伴即时智能科技有限公司 Conversation behavior analysis method for non-free talking Chinese spoken language
CN109754793B (en) * 2017-11-07 2024-03-19 现代自动车株式会社 Device and method for recommending functions of vehicle
CN107679042A (en) * 2017-11-15 2018-02-09 北京灵伴即时智能科技有限公司 A kind of multi-layer dialog analysis method towards Intelligent voice dialog system
CN107679042B (en) * 2017-11-15 2021-02-05 北京灵伴即时智能科技有限公司 Multi-level dialogue analysis method for intelligent voice dialogue system
CN108763510A (en) * 2018-05-30 2018-11-06 北京五八信息技术有限公司 Intension recognizing method, device, equipment and storage medium
CN108959447A (en) * 2018-06-13 2018-12-07 北京信息科技大学 The psychological distance analysis method of interlocutor in an a kind of pair of language piece
CN110188190A (en) * 2019-04-03 2019-08-30 阿里巴巴集团控股有限公司 Talk with analytic method, device, server and readable storage medium storing program for executing

Similar Documents

Publication Publication Date Title
CN104166643A (en) Dialogue act analyzing method in intelligent question-answering system
CN101599071B (en) Automatic extraction method of conversation text topic
Boltužić et al. Back up your stance: Recognizing arguments in online discussions
Bramsen et al. Extracting social power relationships from natural language
CN104572892B (en) A kind of file classification method based on cyclic convolution network
CN104408093B (en) A kind of media event key element abstracting method and device
CN103729474B (en) Method and system for recognizing forum user vest account
CN111125354A (en) Text classification method and device
Sanchez et al. ICDAR 2015 competition HTRtS: Handwritten Text Recognition on the tranScriptorium dataset
CN106202032A (en) A kind of sentiment analysis method towards microblogging short text and system thereof
CN101320374A (en) Field question classification method combining syntax structural relationship and field characteristic
CN104063399B (en) Method and system for automatically identifying emotional probability borne by texts
CN104331506A (en) Multiclass emotion analyzing method and system facing bilingual microblog text
CN108804608A (en) A kind of microblogging rumour position detection method based on level attention
CN106776574A (en) User comment text method for digging and device
CN111209363B (en) Corpus data processing method, corpus data processing device, server and storage medium
CN103474061A (en) Automatic distinguishing method based on integration of classifier for Chinese dialects
CN113505200B (en) Sentence-level Chinese event detection method combined with document key information
Brocardo et al. Verifying online user identity using stylometric analysis for short messages
Rentoumi et al. United we stand: Improving sentiment analysis by joining machine learning and rule based methods
CN110610003B (en) Method and system for assisting text annotation
CN111274390A (en) Emotional reason determining method and device based on dialogue data
CN106294315B (en) The natural language predicate verb recognition methods merged based on syntactic property with statistics
CN111382366B (en) Social network user identification method and device based on language and non-language features
CN112949284B (en) Text semantic similarity prediction method based on Transformer model

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20141126