JP2008083753A

JP2008083753A - Dialogue support device and its method

Info

Publication number: JP2008083753A
Application number: JP2006259754A
Authority: JP
Inventors: Hisayoshi Nagae; 尚義永江
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2006-09-25
Filing date: 2006-09-25
Publication date: 2008-04-10

Abstract

<P>PROBLEM TO BE SOLVED: To provide a dialogue support device for allowing accurate and smooth communications by detecting miscommunications in a dialogue between users. <P>SOLUTION: The dialogue support device has a voice recognition part 102, an ambiguous expression detecting part 103, an ambiguous expression subject detecting part 104, a DB speaker specifying part 105, a cognition range estimating part 106, a cognition range difference detecting part 107, an ambiguous expression word list 120, a speaker information DB 121, and a cognition range DB 122. The voice of a speaker is recognized, and if there is any ambiguous expression in the content of the speech, the cognition range DB 122 is searched for the cognition ranges of the speaker and the listener; if there is a difference between the cognition ranges of the speaker and the listener, a warning is displayed to notify the speaker and the listener of the possibility of miscommunications. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

本発明は、話し手と聞き手間のコミュニケーションにおける曖昧な表現における誤解や意識のずれを検出する対話支援装置及びその方法に関する。 The present invention relates to a dialogue support apparatus and method for detecting misunderstandings and consciousness shifts in ambiguous expressions in communication between a speaker and a listener.

近年、企業活動のスピードアップやグローバル化の要請を背景にＴＶ会議などの遠隔コミュニケーションの重要性が増している。 In recent years, the importance of remote communication such as video conferencing has been increasing against the background of speeding up of corporate activities and globalization.

また、今後はＶｏＩＰの広がりなどから遠隔コミュニケーションの端末にＰＣ（パーソナルコンピュータ）が使用されるようになることが見込まれる。ＰＣが使用されるとこれまでのＴＶ会議用の端末のように単にユーザ間の音声や画像を相互転送するだけでなく、音声認識（例えば、特許文献１参照）や画像処理などの高度な処理を実現することが可能になる。
「確率モデルによる音声認識」中川聖一著電子情報通信学会 ISBN4−88552−072−X In the future, it is expected that PCs (personal computers) will be used as remote communication terminals due to the spread of VoIP. When a PC is used, advanced processing such as voice recognition (see, for example, Patent Document 1) and image processing as well as mutual transfer of voices and images between users as in conventional TV conference terminals. Can be realized.
"Speech recognition using probabilistic models" by Seiichi Nakagawa IEICE ISBN4-88552-072-X

遠隔コミュニケーションにおいて話し手と聞き手が異なる人間である以上ミスコミュニケーションは避けられない。文化、生活、育ち、バックグラウンドの違いにより、物事の感覚が異なることがよくある。しかも今後は海外とのコミュニケーションの機会が増加することが見込まれ、ミスコミュニケーションの発生する可能性は増加することが予想される。言語のニュアンスに対する感覚が異なる人間同士が会話を行うとこれまで以上に重大な意識・認識ずれが生じやすくなる可能性がある。 Miscommunication is inevitable as long as the speaker and listener are different in remote communication. The sense of things is often different due to differences in culture, life, growth and background. Moreover, it is expected that opportunities for communication with foreign countries will increase in the future, and the possibility of miscommunication is expected to increase. When people with different senses of language nuances have conversations, there is a possibility that more serious consciousness / recognition shifts may occur.

ＴＶ会議の議論において、このような誤解や意識ずれを放置したままにするとその後の業務に支障が出ることは容易に想像される。しかし、意識ずれが生じそうな曖昧な表現が会話に現れるたびに細かく確認していては遅々として会話が進まず、効率が悪く、円滑なコミュニケーションが取れない。 It is easily imagined that if such misunderstandings and misconceptions are left unattended in the discussion of the TV conference, the subsequent work will be hindered. However, if an ambiguous expression that is likely to cause consciousness dissatisfaction is confirmed every time it appears in the conversation, the conversation does not proceed slowly, and the efficiency is low and smooth communication cannot be achieved.

そこで、本発明は上記問題点に鑑み、話し手の表現に曖昧さがあっても、その曖昧さによる話し手と聞き手の差異を求めることができ対話支援装置及びその方法を提供する。 In view of the above problems, the present invention provides a dialogue support apparatus and method that can determine the difference between a speaker and a listener due to the ambiguity even if the expression of the speaker is ambiguity.

本発明は、話し手と聞き手間のコミュニケーションを仲介する対話支援装置において、前記話し手の音声を認識する音声認識部と、前記話し手と前記聞き手との間に誤解や意識のずれを発生させるような曖昧表現単語のリストを格納した曖昧表現単語リスト記憶部と、前記曖昧表現単語リストと前記音声認識結果とを比較して、前記音声認識結果から曖昧表現単語を含む曖昧表現句を検出する曖昧表現検出部と、前記曖昧表現句によって前記聞き手の解釈が曖昧となる曖昧表現対象を前記音声認識結果から検出する曖昧表現対象検出部と、前記話し手の人物情報と前記聞き手の人物情報を特定する話者特定部と、前記曖昧表現句と前記曖昧表現対象の組合せに対して、前記人物情報毎に前記曖昧な表現の認知範囲を格納する認知範囲記憶部と、前記認知範囲記憶部に格納した前記話し手の人物情報に関する前記曖昧表現句と前記曖昧表現対象に対する前記話し手の認知範囲を求め、また、前記認知範囲記憶部に格納した前記聞き手の人物情報に関する前記曖昧表現句と前記曖昧表現対象に対する前記聞き手の認知範囲を求める認知範囲推定部と、前記話し手の認知範囲と前記聞き手の認知範囲とを比較して、前記話し手と前記聞き手の認知範囲の差異を求める認知範囲相違検出部と、を具備したことを特徴とする対話支援装置である。 The present invention relates to a dialogue support apparatus that mediates communication between a speaker and a listener, and an ambiguous that causes misunderstandings and deviations of consciousness between the speaker and the listener, and a speech recognition unit that recognizes the speech of the speaker. An ambiguous expression word list storage unit that stores a list of expression words, and an ambiguous expression detection that compares the ambiguous expression word list with the speech recognition result and detects an ambiguous expression phrase including the ambiguous expression word from the speech recognition result. An ambiguous expression target detection unit that detects from the speech recognition result an ambiguous expression target whose interpretation of the listener is ambiguous by the ambiguous expression phrase, and a speaker that identifies the person information of the speaker and the person information of the listener A recognizing range storage unit that stores a recognizing range of the ambiguous expression for each person information with respect to a combination of the specifying unit and the ambiguous expression phrase and the ambiguous expression target; The ambiguous expression phrase related to the person information of the speaker stored in the recognition range storage unit and the recognition range of the speaker for the object of the ambiguous expression are obtained, and the ambiguous information about the person information of the listener stored in the recognition range storage unit A recognition range estimation unit for obtaining a recognition range of the listener with respect to an expression phrase and the ambiguous expression target is compared with a recognition range of the speaker and the listener to obtain a difference between the recognition range of the speaker and the listener. And a recognition range difference detection unit.

本発明によれば、話し手の表現に曖昧さがあっても、その曖昧さによる話し手と聞き手の差異を求めることができ、誤解、意識ずれを防止できる。 According to the present invention, even if there is ambiguity in the expression of the speaker, the difference between the speaker and the listener due to the ambiguity can be obtained, and misunderstanding and misconception can be prevented.

以下、本発明の一実施形態の音声認識装置について図面に基づいて説明する。 Hereinafter, a speech recognition apparatus according to an embodiment of the present invention will be described with reference to the drawings.

（第１の実施形態）
以下、図１で示した構成図と図３を参照しながら第１の実施形態の対話支援装置について説明する。なお、本実施形態の対話支援装置は、音声認識装置を内蔵したものである。 (First embodiment)
The dialogue support apparatus according to the first embodiment will be described below with reference to the configuration diagram shown in FIG. 1 and FIG. Note that the dialogue support apparatus of the present embodiment incorporates a voice recognition apparatus.

（１）対話支援装置の構成
対話支援装置は、図１に示すように、音声認識部１０２、曖昧表現検出部１０３、曖昧表現対象検出部１０４、ＤＢ（データーベース）話者特定部１０５、認知範囲推定部１０６、認知範囲相違検出部１０７、曖昧表現単語リスト１２０、話者情報ＤＢ１２１、認知範囲ＤＢ１２２を有している。これら各部１０２、１０３、１０４、１０５、１０６、１０７、１０８の各機能は、コンピュータに記憶されたプログラムによって実現できる。 (1) Configuration of Dialogue Support Device As shown in FIG. 1, the dialogue support device includes a speech recognition unit 102, an ambiguous expression detection unit 103, an ambiguous expression target detection unit 104, a DB (data base) speaker identification unit 105, a recognition It has a range estimation unit 106, a recognition range difference detection unit 107, an ambiguous expression word list 120, a speaker information DB 121, and a recognition range DB 122. Each function of these units 102, 103, 104, 105, 106, 107, 108 can be realized by a program stored in a computer.

（１−１）音声認識部１０２
音声認識部１０２は、話し手１００が発声した発話音声１３０を音声認識する。また、音声認識部１０２は、この認識した話し手の音声１３０に対する認識結果１３１を曖昧表現検出部１０３に通知する。 (1-1) Voice recognition unit 102
The speech recognition unit 102 recognizes speech speech 130 uttered by the speaker 100. Further, the speech recognition unit 102 notifies the ambiguous expression detection unit 103 of the recognition result 131 for the recognized speech 130 of the speaker.

（１−２）曖昧表現検出部１０３
曖昧表現検出部１０３はまず認識結果１３１中に曖昧表現単語リスト１２０の単語が存在するかどうかを検査する。見つからなければ発言中に曖昧な表現がないとして何もしない。一方、曖昧表現単語が見つかった場合にはその曖昧表現単語を含む曖昧表現句１３２を検出する。 (1-2) Ambiguous Expression Detection Unit 103
The ambiguous expression detection unit 103 first checks whether or not the word in the ambiguous expression word list 120 exists in the recognition result 131. If it is not found, nothing is done if there is no ambiguous expression in the statement. On the other hand, when an ambiguous expression word is found, an ambiguous expression phrase 132 including the ambiguous expression word is detected.

ここで、曖昧表現単語リストは、例えば「やや、若干、もう少し、あまり、くらい」等のような程度を表す副詞や形容詞等の単語リストとして曖昧表現検出部１０３のＤＢに記憶されている。曖昧表現単語は程度を表す単語以外にも「英国人のsecond floor」と「米国人のsecond floor」のように同じ表層で異なる概念を表す語のリストでもよい。また、曖昧表現検出部１０３は、音声認識結果中に曖昧表現が含まれている場合にだけ曖昧表現を含む句１３２を曖昧表現対象検出部１０４へ通知する。 Here, the ambiguous expression word list is stored in the DB of the ambiguous expression detection unit 103 as a word list of adverbs, adjectives, and the like indicating a degree such as “slightly, slightly, a little more, too much”, for example. The ambiguous expression word may be a list of words representing different concepts on the same surface layer, such as “English second floor” and “American second floor”, in addition to the degree word. Further, the ambiguous expression detection unit 103 notifies the ambiguous expression target detection unit 104 of the phrase 132 including the ambiguous expression only when the ambiguous expression is included in the speech recognition result.

また、その曖昧表現単語を含む曖昧表現句は、例えば形容詞句、副詞句となる。 The ambiguous phrase including the ambiguous expression word is, for example, an adjective phrase or an adverb phrase.

（１−３）曖昧表現対象検出部１０４
曖昧表現対象検出部１０４は曖昧表現句が検出した曖昧表現を含む曖昧表現句１３２の対象を検出する。曖昧表現対象検出部１０４は、受け取った曖昧表現句１３２とその句の対象の組１３３を認知範囲推定部１０６へ通知する。 (1-3) Ambiguous Expression Object Detection Unit 104
The ambiguous expression target detection unit 104 detects an object of the ambiguous expression phrase 132 including the ambiguous expression detected by the ambiguous expression phrase. The ambiguous expression target detection unit 104 notifies the recognition range estimation unit 106 of the received ambiguous expression phrase 132 and a set 133 of the target of the phrase.

（１−４）ＤＢ話者特定部１０５
ＤＢ話者特定部１０５は話し手、聞き手の情報を特定する対話者特定情報１３４を受け取り、話者情報ＤＢ１２１を参照して認知範囲ＤＢ１２２中でのどの人間に該当するかを検索し、その話し手情報１３５、聞き手情報１３６を認知範囲推定部１０６へ通知する。 (1-4) DB speaker specifying unit 105
The DB speaker specifying unit 105 receives the speaker specifying information 134 for specifying the information of the speaker and the listener, searches the speaker information DB 121 to find out which person in the recognition range DB 122 corresponds to the speaker information, and the speaker information. 135, the listener information 136 is notified to the recognition range estimation unit 106.

対話者特定情報１３４には人名やＩＤ番号等がある。話者情報ＤＢ１２１は対話者特定情報と認知範囲ＤＢ１２２中の人物情報との対応表を保持する。認知範囲ＤＢ１２２で個人レベルまで細かい人物別情報が管理されている場合は、人名などの対話者特定情報からその人物を特定する。一方、認知範囲ＤＢ１２２で人種や国別といった大雑把な単位で人物別情報が管理されている場合には、対話者特定情報から人種や出身国を特定する。 The conversation person specifying information 134 includes a person name and an ID number. The speaker information DB 121 holds a correspondence table between the conversation person specifying information and the person information in the recognition range DB 122. When detailed personal information is managed to the individual level in the recognition range DB 122, the person is specified from the interlocutor specifying information such as the name of the person. On the other hand, in the case where information by person is managed in rough units such as race or country in the recognition range DB 122, the race or country of origin is specified from the interlocutor specifying information.

（１−５）認知範囲推定部１０６
認知範囲推定部１０６では、曖昧表現対象部１０４が出力する曖昧表現句とその句の対象の組１３３及びＤＢ話者特定部１０５が出力する対話者が認知範囲ＤＢ１２２中のどの人物かを特定する人物特定情報１３５、１３６から話し手と聞き手の曖昧表現に対するそれぞれの認知範囲１３７、１３８を計算し、認知範囲相違検出部１０７へ通知する。 (1-5) Recognition range estimation unit 106
In the recognition range estimation unit 106, the ambiguous expression phrase output from the ambiguous expression target unit 104, the target group 133 of the phrase, and the conversation person output from the DB speaker specifying unit 105 identifies which person in the recognition range DB 122. Recognizing ranges 137 and 138 for the vague expressions of the speaker and the listener are calculated from the person specifying information 135 and 136, and notified to the recognized range difference detecting unit 107.

（１−６）認知範囲相違検出部１０７
認知範囲相違検出部１０７は、話し手と聞き手の曖昧表現に対するそれぞれの認知範囲１３７、１３８を受け取り、認知範囲に差異があるか判定する。認知範囲相違検出部１０７で使用する比較方法は、従来の情報の比較方法のいずれで構わない。例えば認知範囲が数値で表される時にはその２つの値の差が一定の閾値以内かどうかや認知範囲が統計分布で与えられる時には平均と分散から分布の重なり度合いを計算して、相違があるかどうかという方法で差異を判定する。 (1-6) Recognition range difference detection unit 107
The recognition range difference detection unit 107 receives the respective recognition ranges 137 and 138 for the vague expression of the speaker and the listener, and determines whether there is a difference in the recognition range. The comparison method used in the recognition range difference detection unit 107 may be any conventional information comparison method. For example, when the recognition range is expressed as a numerical value, whether the difference between the two values is within a certain threshold, or when the recognition range is given as a statistical distribution, is the degree of overlap between the distributions calculated from the mean and variance? The difference is judged by the method of how.

認知範囲相違検出部１０７で話し手と聞き手の認知範囲１３７、１３８に差異があると判定された場合には警告を発する。警告をユーザに伝える方法は従来の警告通知方法のいずれで構わない。一方、差異がないと判定された場合には何もしない。 When the recognition range difference detection unit 107 determines that there is a difference between the recognition ranges 137 and 138 of the speaker and the listener, a warning is issued. Any conventional warning notification method may be used as a method of notifying the user of the warning. On the other hand, if it is determined that there is no difference, nothing is done.

（２）対話支援装置の動作内容
以下、図３の対話例を参照しながら、曖昧表現検出部１０３、曖昧表現対象検出部１０４、認知範囲推定部１０６、認知範囲相違検出部１０７について説明する。 (2) Operation Contents of Dialogue Supporting Device Hereinafter, the ambiguous expression detection unit 103, the ambiguous expression target detection unit 104, the recognition range estimation unit 106, and the recognition range difference detection unit 107 will be described with reference to the example of dialogue in FIG.

図３は人物Ａさんと人物Ｂさんの対話例を示している。図３中の第１カラムは対話番号、第２カラムは会話、第３カラムは曖昧表現句とその句の対象、第４カラムは第３カラムの同一行の曖昧表現、対象組に対する人物Ａさんの認知範囲、第５カラムは人物Ｂさんの認知範囲を表す。 FIG. 3 shows an example of dialogue between the person A and the person B. The first column in FIG. 3 is the dialogue number, the second column is the conversation, the third column is the ambiguous phrase and the target of the phrase, the fourth column is the ambiguous expression of the same line in the third column, and the person A for the target set The fifth column represents the recognition range of the person B.

対話００１の時、曖昧表現検出部１０３は、まず話し手Ａさんの音声認識結果から曖昧表現単語リストの語である「若干」「少し」を検出する（ステップ６０１）。 At the time of the dialogue 001, the ambiguous expression detection unit 103 first detects “slightly” and “a little” which are words in the ambiguous expression word list from the speech recognition result of the speaker A (step 601).

そして、曖昧表現単語の句をそれぞれ「若干変更して」「少し遅らせ」と検出する。曖昧表現検出部１０３は、この曖昧表現句を曖昧表現対象検出部１０４へ通知する（ステップ６０２）。 Then, the phrase of the ambiguous expression word is detected as “slightly changed” and “slightly delayed”, respectively. The ambiguous expression detection unit 103 notifies the ambiguous expression target detection unit 104 of the ambiguous expression phrase (step 602).

曖昧表現対象検出部１０４では曖昧表現句の対象の名詞句を検出する。対象を検出する最も一般的な方法は係り受け解析による句の係り受け先単語である。図３の対話００１に対して係り受け解析を行うと「若干変更して」→「仕様」、「少し遅らせ」→「スケジュール」という係り受け関係が検出される。曖昧表現対象検出部１０４はこの曖昧表現句と曖昧表現対象の組１３３を認知範囲推定部１０６へ通知する（ステップ６０３）。 The ambiguous expression target detection unit 104 detects the target noun phrase of the ambiguous expression phrase. The most common method of detecting an object is a phrase dependency word by dependency analysis. When dependency analysis is performed on the dialogue 001 in FIG. 3, dependency relationships of “change slightly” → “specification”, “slightly delay” → “schedule” are detected. The ambiguous expression target detection unit 104 notifies the recognition range estimation unit 106 of the set 133 of the ambiguous expression phrase and the ambiguous expression target (step 603).

ここで、曖昧表現対象検出部１０４で対象を検出する際に使用する方法と認知範囲ＤＢ１２２に格納される「人物、曖昧表現句、曖昧表現対象、認知範囲」における曖昧表現対象とは同じ方法で検出されたものである必要がある。すなわち、曖昧表現対象検出部１０４で係り受け解析を使用する場合は、認知範囲ＤＢ１２２における「曖昧表現対象」も係り受け解析によって検出した単語を使用する必要がある。 Here, the method used when detecting an object by the ambiguous expression object detection unit 104 and the ambiguous expression object in the “person, ambiguous expression phrase, ambiguous expression object, and recognition area” stored in the recognition area DB 122 are the same method. It must be detected. That is, when the dependency analysis is used in the ambiguous expression target detection unit 104, it is necessary to use the word detected by the dependency analysis in the “ambiguous expression target” in the recognition range DB 122.

一方、ＤＢ話者特定部１０５は対話者特定情報１３４に基づいて話し手、聞き手がＤＢ中のどの人物かを特定する。対話者を特定する情報には氏名やＩＤ番号などがある。対話者特定情報は対話者の認知範囲ＤＢ１２２中の人物のどれにマッチするかさえ特定できれば、どのような認証情報であっても構わない。ユーザが明示的にシステム開始時に入力してもよいし、カメラと顔認識を使用して自動的に認証してもよい。また、ここで特定するレベルは個人名まで特定してもよいし、国、出身地、所属会社等の一般化したレベルであっても構わない。認知範囲ＤＢ１２２中のデータにおける人物に最もマッチする人物に対応付ければよい。ＤＢ話者特定部１０５が話し手を特定した話し手情報１３５、聞き手を特定した聞き手情報１３６は認知範囲推定部１０６に通知される。 On the other hand, the DB speaker specifying unit 105 specifies which person in the DB the speaker and listener are based on the conversation person specifying information 134. The information specifying the dialogue person includes a name and an ID number. The authentication information may be any kind of authentication information as long as it can specify which person in the recognition range DB 122 of the interaction person matches. The user may explicitly input when starting the system, or may automatically authenticate using a camera and face recognition. Further, the level specified here may be specified up to a personal name, or may be a generalized level such as country, hometown, affiliated company, and the like. What is necessary is just to match | combine with the person who most matches the person in the data in recognition range DB122. The speaker information 135 specifying the speaker and the listener information 136 specifying the listener are notified to the recognition range estimation unit 106.

認知範囲推定部１０６は曖昧表現句と曖昧表現対象の組１３３と話し手情報１３５、聞き手情報１３６とを組み合わせて認知範囲ＤＢ１２２を検索する。図３の例ではまず「若干変更して、仕様」という組に対してはそれぞれ認知範囲は＜若干変更して、仕様、Ａ＞は「１０〜２５％」１３６、＜若干変更して、仕様、Ｂ＞は「１０〜３０％」１３７という結果が得られる。同様に「少し遅らせ、スケジュール」に対しては＜少し遅らせ、スケジュール、Ａ＞は「１〜４週」、＜少し遅らせ、スケジュール、Ｂ＞は「１〜２週」という結果になる。認知範囲推定部１０６はこの結果を認知範囲相違比較部１０７に通知する（ステップ６０４）。 The recognition range estimation unit 106 searches the recognition range DB 122 by combining the ambiguous expression phrase and ambiguous expression target set 133 with the speaker information 135 and the listener information 136. In the example of FIG. 3, first, for the group “slightly changed, specifications”, the recognition range is <slightly changed, specifications, A> is “10-25%” 136, <slightly changed, specifications , B> results in “10-30%” 137. Similarly, “slightly delayed, schedule” results in <slightly delayed, schedule A> is “1-4 weeks”, <slightly delayed, schedule, B> is “1-2 weeks”. The recognition range estimation unit 106 notifies the recognition range difference comparison unit 107 of this result (step 604).

認知範囲相違比較部１０７では、認知範囲推定部１０６が推定した話し手の認知範囲１３７と聞き手の認知範囲１３８の比較を行う。例えば、「若干変更して、仕様、Ａ−Ｂ、最大５％差」、「少し遅らせ、スケジュール、Ａ−Ｂ、最大２週」という比較結果になる。例では比較結果を認知範囲の数値の差で表しているが、この方法に限定するものではない。例えば、認知範囲が統計情報のような形で与えられる場合には確率分布の重なり度合や平均値や分散の差の形で表現してもよい。これについては後から説明する。 The recognition range difference comparison unit 107 compares the speaker's recognition range 137 estimated by the recognition range estimation unit 106 with the listener's recognition range 138. For example, the comparison results are “slightly change, specification, AB, maximum 5% difference”, “slightly delay, schedule, AB, maximum 2 weeks”. In the example, the comparison result is expressed by the difference in the numerical value of the recognition range, but it is not limited to this method. For example, when the recognition range is given in the form of statistical information, it may be expressed in the form of an overlap degree of probability distributions, an average value, or a variance difference. This will be described later.

認知範囲相違検出部１０７では判定された話し手、聞き手の曖昧表現に対する認知範囲の比較結果に基づいて差異がある場合に検出する（ステップ６０５，６０６）。いまここで、差異があると判定する基準として「１５％以上、１週以上の期間差、１００００円以上の金額の差」を想定する。この場合、図３の例の対話００１−ａ「若干変更して、仕様、Ａ−Ｂ、最大５％差」は「１５％未満」であるため「対話者Ａ、Ｂの間で誤解を生じない」と判定する。一方、対話００１−ｂ「少し遅らせ、スケジュール、Ａ−Ｂ、最大２週」は「１週以上」であるため「対話者Ａ、Ｂの間で誤解を生じる」と判定する。認知範囲相違検出部１０７が「誤解を生じる」可能性を検出した時に対話者に通知する警告は公知のユーザインタフェースのいずれの方法で構わない。例えば、警告メッセージの表示や警告音声の再生などユーザに注意を喚起できれば方法は問わない。 The recognition range difference detection unit 107 detects the case where there is a difference based on the comparison result of the recognition range with respect to the determined expression of the speaker and listener indefinitely (steps 605 and 606). Here, “15% or more, period difference of 1 week or more, difference of money of 10,000 yen or more” is assumed as a criterion for determining that there is a difference. In this case, since the dialogue 001-a in the example of FIG. 3 “changes slightly, the specification, AB, maximum 5% difference” is “less than 15%”, “the misunderstanding occurs between the dialogues A and B. It is determined that no. On the other hand, since the dialogue 001-b “slightly delayed, schedule, AB, maximum 2 weeks” is “1 week or more”, it is determined that “misunderstanding occurs between the dialogues A and B”. Any known user interface may be used as a warning to be notified to the conversation person when the recognition range difference detection unit 107 detects the possibility of “misunderstanding”. For example, any method can be used as long as the user can be alerted by displaying a warning message or playing a warning voice.

対話者Ａさん、Ｂさんはそれぞれ対話支援装置が警告を表示した場合にだけ発言内容に対する双方の認知にずれがないかを確認すればよい。例えば図３の対話の時、対話支援装置は対話００１−ｂ、００６−ｃ、０１３−ｆの３箇所についてだけ警告を表示するため、図３の無駄な確認作業を削減することができ、結果として図４のような円滑な対話を行うことができる。 Each of the conversation persons A and B only needs to confirm whether or not there is a deviation in the recognition of the utterance contents only when the conversation support apparatus displays a warning. For example, during the dialogue shown in FIG. 3, the dialogue support apparatus displays warnings only for the three locations 001-b, 006-c, and 013-f, so that the wasteful confirmation work shown in FIG. 3 can be reduced. As shown in FIG. 4, a smooth dialogue can be performed.

（第２の実施形態）
以下、図２、図５を参照しながら第２の実施形態の対話支援装置について説明する。なお、本実施形態の対話支援装置は、音声翻訳装置を内蔵したものである。なお、図１と同一部分には同一符号を付して説明を省略する。 (Second Embodiment)
The dialogue support apparatus according to the second embodiment will be described below with reference to FIGS. Note that the dialogue support apparatus of the present embodiment incorporates a speech translation apparatus. Note that the same parts as those in FIG.

まず、話し手の使用している言語とシステムの曖昧表現単語リスト１２０，認知範囲ＤＢ１２２，認知範囲調整ＤＢ１２３で使用される言語が異なる場合、音声認識部１０２では話し手の音声を認識した上で聞き手の言語への翻訳処理も行う。 First, when the language used by the speaker and the language used in the system ambiguous expression word list 120, the recognition range DB 122, and the recognition range adjustment DB 123 are different, the speech recognition unit 102 recognizes the speaker's voice and then listens to the listener. It also performs language translation processing.

認知範囲調整部１０９は認知範囲推定部１０６が推定した話し手、聞き手の認知範囲１３７、１３８を比較する際に、認知範囲調整ＤＢ１２３を参照して、認知範囲を調整する。調整後の認知範囲情報を認知範囲相違検出部１０７へ通知する。 The recognition range adjustment unit 109 adjusts the recognition range by referring to the recognition range adjustment DB 123 when comparing the recognition ranges 137 and 138 of the speaker and the listener estimated by the recognition range estimation unit 106. The cognitive range difference detection unit 107 is notified of the adjusted cognitive range information.

図５を使って認知範囲調整部１０９の動作を説明する。 The operation of the recognition range adjustment unit 109 will be described with reference to FIG.

図５は韓国人と日本人の対話例を示している。第１カラムは対話番号、第２カラムは会話、第３カラムは曖昧表現句とその句の対象、第４カラムは第３カラムの同一行の曖昧表現、対象組に対する韓国人の認知範囲、第５カラムは日本人の認知範囲を表す。 Fig. 5 shows an example of dialogue between Korean and Japanese. The first column is the conversation number, the second column is the conversation, the third column is the ambiguous phrase and the target of the phrase, the fourth column is the ambiguous expression of the same line in the third column, the Korean recognition range for the target group, the first The 5th column represents the recognition range of the Japanese.

ここで、対話００６で出現する金額に関する認知範囲は認知範囲ＤＢ１２２において、韓国人は「ウォン（Ｗ）」、日本人は「円」という単位で管理される。これは、各国民の認知範囲を正確に表現するためである。どちらか一方の単位（例えば「円」）に統一して認知範囲ＤＢ１２２に格納することも考えられるが、為替レートが変わった場合に正しく追従できない不都合がある。そのため、認知範囲ＤＢ１２２にはそれぞれ＜もう少し安い、服、韓国人、−１００００Ｗ＞、＜もう少し安い、服、日本人、−３０００円＞が格納されるのが望ましい。しかし、この場合、金額の単位が異なるためそのままでは対話００６の認知範囲の差異を単純に比較できない。 Here, the recognition range regarding the amount of money that appears in the dialogue 006 is managed in the recognition range DB 122 in units of “won (W)” for Koreans and “yen” for Japanese. This is to accurately represent the recognition range of each citizen. Although it is conceivable that the unit is stored in the recognition range DB 122 in a unit of one of the units (for example, “yen”), there is an inconvenience that it cannot be correctly followed when the exchange rate changes. Therefore, it is desirable to store <a little cheaper clothes, Korean, -10000W> and <a little cheaper clothes, Japanese, -3000 yen> in the recognition range DB 122, respectively. However, in this case, since the unit of the amount is different, the difference in the recognition range of the dialogue 006 cannot be simply compared as it is.

ここで、認知範囲調整ＤＢ１２３において「韓国人、日本人、０．１２Ｗ＝１円」という認知範囲の調整式が与えられると、＜もう少し安い、服、韓国人、−１００００Ｗ＞は＜もう少し安い、服、韓国人、−１２００円＞に変換できる。これにより、対話００６の会話で韓国人、日本人の間で誤解が生じる可能性があることが正しく検出できる。 Here, in the cognitive range adjustment DB 123, if a cognitive range adjustment formula of “Korean, Japanese, 0.12W = 1 yen” is given, <a little cheaper clothes, Korean, -10000W> is <a little cheaper, Clothes, Korean, -1200 yen>. Thereby, it can be correctly detected that there is a possibility of misunderstandings between Koreans and Japanese in the conversation 006.

（変更例）
本発明は上記各実施形態に限らず、その主旨を逸脱しない限り種々に変更することができる。 (Example of change)
The present invention is not limited to the above-described embodiments, and various modifications can be made without departing from the gist thereof.

（第１の変更例）
曖昧表現対象の検出方法は、上記実施形態の係り受け以外にも曖昧表現単語の前後のＮ個単語列（Ｎは１以上の自然数）とする方法がある。この方法は係り受け解析のような複雑な解析処理をせずに簡便に曖昧表現の対象を検出することができる利点を持つ。 (First change example)
As a method for detecting an ambiguous expression target, there is a method in which N word strings before and after an ambiguous expression word (N is a natural number of 1 or more) other than the dependency in the above embodiment. This method has an advantage that an object of ambiguous expression can be easily detected without performing complicated analysis processing such as dependency analysis.

但し、この場合は、認知範囲ＤＢ１２２における「曖昧表現対象」も曖昧表現単語の前後のＮ個単語列を使用する。 However, in this case, N word strings before and after the ambiguous expression word are also used as the “ambiguous expression object” in the recognition range DB 122.

（第２の変更例）
図９に基づいて、＜若干変更、スケジュール＞表現に対するＡさん、Ｂさんの認知範囲を統計的に表現した変更例を説明する。 (Second modification)
Based on FIG. 9, a description will be given of a modification example in which the recognition ranges of Mr. A and Mr. B with respect to the <slightly changed, schedule> expression are statistically expressed.

単にイメージする範囲であればＡさん、Ｂさん両者とも１〜５週をイメージする。 If it is just the image range, both Mr. A and Mr. B image 1 to 5 weeks.

そのため、認知範囲を単純な数値で表現する場合は「１〜５週」と表現され、差異がないと判定される。 Therefore, when the recognition range is expressed by a simple numerical value, it is expressed as “1 to 5 weeks”, and it is determined that there is no difference.

統計的に表現する場合は、図９に示すように、Ａさん、Ｂさん両者が＜若干変更、スケジュール＞という表現に対して何回、ｘ週をイメージするか集計し、その頻度に基づいて統計分布を作成する。そして、統計分布の差異比較は重なり面積Ｓで判定できる。 In the case of statistical expression, as shown in FIG. 9, A and B both count how many times x weeks are imaged for the expression <slightly changed, schedule>, and based on the frequency Create a statistical distribution. The statistical distribution difference comparison can be determined by the overlapping area S.

そのため、両者が強くイメージするｘ週が異なれば差異があると判定される。 Therefore, it is determined that there is a difference if the x weeks that both images strongly differ.

本発明の第１の実施形態に係わる対話支援装置を示すブロック図である。It is a block diagram which shows the dialog assistance apparatus concerning the 1st Embodiment of this invention. 第２の実施形態に係わる対話支援装置を示すブロック図である。It is a block diagram which shows the dialog assistance apparatus concerning 2nd Embodiment. 第１の実施形態の動作を説明する動作シチュエーション例である。It is the example of an operation situation explaining operation of a 1st embodiment. 第１の実施形態により効率化された動作シチュエーション例である。It is the example of the operation | movement situation improved by 1st Embodiment. 第２の実施形態の動作を説明する動作シチュエーション例である。It is an operation | movement situation example explaining the operation | movement of 2nd Embodiment. 曖昧表現検出部１０３、曖昧表現対象検出部１０４、ＤＢ話者特定部１０５、認知範囲推定部１０６、認知範囲相違検出部１０７の動作手順を説明するフローチャートである。10 is a flowchart for explaining the operation procedure of an ambiguous expression detection unit 103, an ambiguous expression target detection unit 104, a DB speaker identification unit 105, a recognition range estimation unit 106, and a recognition range difference detection unit 107. 曖昧表現単語リスト１２０の曖昧単語、品詞の組データの例を示す表の図である。It is a table | surface which shows the example of the group data of the ambiguous word of the ambiguous expression word list | wrist 120, and a part of speech. 認知範囲ＤＢ１２２の人物、曖昧表現句、曖昧表現対象、認知範囲の4つ組データの例を示す表の図である。It is a figure of the table | surface which shows the example of quaternary data of the person of the recognition range DB122, an ambiguous expression phrase, an ambiguous expression object, and a recognition range. 統計情報で表現される認知範囲の図と統計分布の比較方法の例を示す図である。It is a figure which shows the example of the comparison method of the figure of the recognition range expressed with statistical information, and statistical distribution.

Explanation of symbols

１００話し手ユーザ
１０１聞き手ユーザ
１０２音声認識部
１０３曖昧表現検出部
１０４曖昧表現対象検出部
１０５ＤＢ話者特定部
１０６認知範囲推定部
１０７認知範囲相違検出部
１０９認知範囲調整部
１２０曖昧表現単語リスト
１２１話者情報ＤＢ
１２２認知範囲ＤＢ１２２
１２３認知範囲調整ＤＢ１２３ DESCRIPTION OF SYMBOLS 100 Speaker user 101 Listener user 102 Speech recognition part 103 Ambiguous expression detection part 104 Ambiguous expression object detection part 105 DB speaker identification part 106 Recognition range estimation part 107 Recognition range difference detection part 109 Recognition range adjustment part 120 Ambiguous expression word list 121 Talk Information DB
122 Recognition range DB122
123 Recognition range adjustment DB123

Claims

In a dialogue support device that mediates communication between the speaker and the listener,
A voice recognition unit for recognizing the voice of the speaker;
An ambiguous expression word list storage unit that stores a list of ambiguous expression words that cause misunderstandings and misconceptions between the speaker and the listener;
An ambiguous expression detection unit that compares the ambiguous expression word list with the speech recognition result and detects an ambiguous expression phrase including the ambiguous expression word from the speech recognition result;
An ambiguous expression target detection unit for detecting an ambiguous expression target whose interpretation of the listener is ambiguous by the ambiguous expression phrase from the speech recognition result;
A speaker identification unit that identifies the person information of the speaker and the person information of the listener;
A recognition range storage unit that stores a recognition range of the ambiguous expression for each person information with respect to the combination of the ambiguous expression phrase and the ambiguous expression target;
The ambiguous expression phrase related to the person information of the speaker stored in the recognition range storage unit and the recognition range of the speaker with respect to the object of the ambiguous expression are obtained, and the ambiguous information about the person information of the listener stored in the recognition range storage unit A recognition range estimation unit for obtaining a recognition range of the listener for the expression phrase and the ambiguous expression target;
A recognition range difference detection unit that compares the speaker's recognition range with the listener's recognition range to determine a difference between the speaker and the listener's recognition range;
A dialogue support apparatus characterized by comprising:

The dialogue support apparatus according to claim 1, wherein the ambiguous expression phrase is an adjective phrase or an adverb phrase.

The dialogue support apparatus according to claim 1, wherein the ambiguous expression target is a dependency destination.

The dialogue support apparatus according to claim 1, wherein the ambiguous expression target is a noun phrase.

The dialogue support apparatus according to claim 1, wherein the object of ambiguous expression is a word string before and after an ambiguous expression phrase.

The dialogue support apparatus according to claim 1, wherein the recognition range is represented by a numerical value.

The dialogue support apparatus according to claim 1, wherein the recognition range is represented by statistical information.

The dialogue support apparatus according to claim 1, further comprising a recognition range adjustment unit that adjusts the recognition range of the speaker and the recognition range of the listener based on one of the standards.

The dialogue support apparatus according to claim 1, wherein the speaker or the listener is warned when the difference in the recognition range is not less than an arbitrary standard.

In a dialogue support method that mediates communication between the speaker and the listener,
Recognizing the voice of the speaker,
Storing a list of ambiguous words that cause misunderstandings and misconceptions between the speaker and the listener;
Comparing the stored ambiguous expression word list with the speech recognition result to detect an ambiguous expression phrase including the ambiguous expression word from the speech recognition result;
Detecting an ambiguous expression target in which the interpretation of the listener is ambiguous by the ambiguous expression phrase from the speech recognition result;
Identifying the person information of the speaker and the person information of the listener,
For the combination of the ambiguous expression phrase and the ambiguous expression object, store the recognition range of the ambiguous expression for each person information,
The ambiguous expression phrase related to the stored person information of the speaker and the recognition range of the speaker for the object of the ambiguous expression are obtained, and the ambiguous expression phrase related to the person information of the listener stored in the recognition range storage unit and the ambiguous expression Obtaining the listener's perception range for the expression object;
A dialogue support method, comprising: comparing a recognition range of the speaker with a recognition range of the listener to obtain a difference between the recognition range of the speaker and the listener.

In a dialogue support program that mediates communication between the speaker and the listener,
A voice recognition function for recognizing the voice of the speaker;
An ambiguous expression word list storage function that stores a list of ambiguous expression words that cause misunderstandings and misconceptions between the speaker and the listener;
An ambiguous expression detection function for comparing the ambiguous expression word list with the speech recognition result and detecting an ambiguous expression phrase including the ambiguous expression word from the speech recognition result;
An ambiguous expression object detection function for detecting an ambiguous expression object whose interpretation of the listener is ambiguous by the ambiguous expression phrase from the speech recognition result;
A speaker identification function for identifying the person information of the speaker and the person information of the listener;
A recognition range storage function for storing a recognition range of the ambiguous expression for each person information for the combination of the ambiguous expression phrase and the ambiguous expression target;
The ambiguous expression phrase related to the person information of the speaker stored in the recognition range storage function and the recognition range of the speaker for the object of the ambiguous expression are obtained, and the ambiguous information about the person information of the listener stored in the recognition range storage function A recognition range estimation function for obtaining a recognition range of the listener for the expression phrase and the ambiguous expression target;
A recognition range difference detection function for comparing the speaker's recognition range and the listener's recognition range to obtain a difference between the speaker and the listener's recognition range;
A dialogue support program characterized by