JP2020170310A

JP2020170310A - Conversation analysis device, conversation analysis method and conversation analysis program

Info

Publication number: JP2020170310A
Application number: JP2019070723A
Authority: JP
Inventors: 原田　将治; Masaharu Harada; 将治原田
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2019-04-02
Filing date: 2019-04-02
Publication date: 2020-10-15
Anticipated expiration: 2039-04-02
Also published as: JP7293816B2

Abstract

To perform risk determination as an object of a conference based on an utterance of the conference.SOLUTION: A conversation analysis device 100 includes a first determination unit 110, a voice recognition unit 120, a second determination unit 130, and an evaluation unit 140. The first determination unit 110 analyzes a conversation voice, and determines a state of the conversation. The voice recognition unit 120 performs voice recognition of the conversation voice, and extracts words included in the conversation voice. The second determination unit 130 determines the abstraction degree of conversation contents in the conversation voice on the basis of the words included in the conversation voice. The evaluation unit 140 evaluates the conversation contents on the basis of the state and the abstraction degree of the conversation.SELECTED DRAWING: Figure 2

Description

本発明は、会話分析装置等に関する。 The present invention relates to a conversation analyzer and the like.

コールセンターにおけるオペレータと顧客との会話、または、会議の参加者による会話等において、「会話対象（会話内容）」を評価することが求められている。たとえば、オペレータと顧客との会話の対象は、商品に関する問合せや、クレーム対応等となる。会議の参加者による会話の対象は、プロジェクト等となる。 It is required to evaluate the "conversation target (conversation content)" in the conversation between the operator and the customer in the call center, or in the conversation between the participants of the conference. For example, the target of conversation between the operator and the customer is inquiries about products, complaint handling, and the like. The target of conversation by the participants of the conference is the project.

会議中の会話を録音した音声データを基にして、プロジェクトを評価する技術として、以下に説明するような従来技術がある。この従来技術は、設定ワードおよび設定ワードに類似する類似ワードを予め定義する。図１８は、設定ワードおよび類似ワードの一例を示す図である。たとえば、設定ワード「遅れている」に対応する類似ワードは「遅延、遅い、・・・」となる。以下の説明では、設定ワードおよび設定ワードに類似する類似ワードをまとめて「設定ワード」と表記する。 As a technique for evaluating a project based on voice data recorded from a conversation during a meeting, there is a conventional technique as described below. This prior art predefines a set word and a similar word similar to the set word. FIG. 18 is a diagram showing an example of a setting word and a similar word. For example, a similar word corresponding to the set word "delayed" is "delayed, slow, ...". In the following description, the setting word and similar words similar to the setting word are collectively referred to as "setting word".

従来技術は、音声データに対して音声認識を行い、音声データに含まれる設定ワードの出現回数をカウントする。従来技術は、設定ワードの出現回数に応じて、リスク評価値を特定し、会話の対象を評価する。たとえば、従来技術では、設定ワードの出現回数が多いほど、リスク評価値を大きくし、このリスク評価値が大きいほど、会話対象（プロジェクト）のリスクが高いと評価する。また、この従来技術では、会議時間に対する最大の発話時間の比率を算出し、比率が大きいほど、リスク評価値を大きくしている。 In the prior art, voice recognition is performed on voice data, and the number of appearances of a set word included in the voice data is counted. In the prior art, the risk evaluation value is specified and the conversation target is evaluated according to the number of occurrences of the set word. For example, in the prior art, the higher the number of occurrences of the set word, the larger the risk evaluation value, and the larger the risk evaluation value, the higher the risk of the conversation target (project). Further, in this conventional technique, the ratio of the maximum utterance time to the meeting time is calculated, and the larger the ratio, the larger the risk evaluation value.

特開２００９−１４６１２１号公報JP-A-2009-146121 特開２０１７−２７１０２号公報JP-A-2017-27102 特開２０１１−５５１６０号公報Japanese Unexamined Patent Publication No. 2011-55160 特開２０１８−３６８６８号公報Japanese Unexamined Patent Publication No. 2018-36868

会議の会話において、要求仕様の整理が「遅れている」、「できていない」と発話された場合のリスクと、出力メッセージの決定が「遅れている」、「できていない」と発話された場合のリスクとは、同程度のリスクではない。プロジェクト全体として「要求仕様の整理」に関する作業は数日から数週間かかるものであり、要求仕様の整理が遅れている場合には、大きなリスクである。これに対して、プロジェクト全体として「出力メッセージ」に関する作業は、１時間で解決できるものであり、出力メッセージが送れていても、リスクは小さいといえる。 In the conversation of the meeting, the risk of being told that the requirement specifications were "delayed" or "not completed" and the decision of the output message was "delayed" or "not completed" were spoken. The risk of the case is not the same risk. The work related to "organizing the required specifications" for the entire project takes days to weeks, and if the required specifications are not organized, it is a big risk. On the other hand, the work related to the "output message" can be solved in one hour for the entire project, and even if the output message is sent, the risk is small.

しかしながら、従来技術では、要求仕様の整理に関して、「遅れている」、「できていない」と発話された回数と、出力メッセージに関して、「遅れている」、「できていない」と発話された回数とが同数の場合には、各リスクは同じリスクと評価してしまう。このため、従来技術では、リスク有無を評価できても、会話の対象としてのリスクの程度を適切に評価することができていない。 However, in the prior art, the number of times that "delayed" or "not completed" was spoken regarding the arrangement of required specifications, and the number of times that "delayed" or "not completed" was spoken regarding the output message. If the numbers are the same, each risk is evaluated as the same risk. Therefore, in the prior art, even if the presence or absence of risk can be evaluated, the degree of risk as a conversation target cannot be appropriately evaluated.

１つの側面では、本発明は、会議の発話に基づいた会議の対象としてのリスク判定をおこなうことができる会話分析装置、会話分析方法および会話分析プログラムを提供することを目的とする。 In one aspect, it is an object of the present invention to provide a conversation analyzer, a conversation analysis method, and a conversation analysis program capable of making a risk determination as a subject of a conference based on the utterance of the conference.

第１の案では、会話分析装置は、第１判定部と、音声認識部と、第２判定部と、評価部とを有する。第１判定部は、会話音声を分析して、会話の状態を判定する。音声認識部は、会話音声に対して音声認識を行い、会話音声に含まれる単語を抽出する。第２判定部は、会話音声に含まれる単語を基にして、会話音声における会話内容の抽象度を判定する。評価部は、会話の状態と抽象度とを基にして、会話対象を評価する。 In the first plan, the conversation analyzer has a first determination unit, a voice recognition unit, a second determination unit, and an evaluation unit. The first determination unit analyzes the conversation voice and determines the state of the conversation. The voice recognition unit performs voice recognition on the conversational voice and extracts words included in the conversational voice. The second determination unit determines the degree of abstraction of the conversation content in the conversation voice based on the words included in the conversation voice. The evaluation unit evaluates the conversation target based on the state of conversation and the degree of abstraction.

会議の発話に基づいた会議の対象としてのリスク判定をおこなうことができる。 It is possible to make a risk judgment as a target of a meeting based on the utterance of the meeting.

図１は、本実施例１に係る会話分析装置の処理を説明するための図である。FIG. 1 is a diagram for explaining the processing of the conversation analyzer according to the first embodiment. 図２は、本実施例１に係る会話分析装置の構成を示す機能ブロック図である。FIG. 2 is a functional block diagram showing the configuration of the conversation analyzer according to the first embodiment. 図３は、本実施例１に係るリスク評価値テーブルのデータ構造の一例を示す図である。FIG. 3 is a diagram showing an example of the data structure of the risk evaluation value table according to the first embodiment. 図４は、本実施例１に係る抽象度判定テーブルのデータ構造の一例を示す図である。FIG. 4 is a diagram showing an example of the data structure of the abstraction degree determination table according to the first embodiment. 図５は、概念ＤＢの一例を示す図である。FIG. 5 is a diagram showing an example of the concept DB. 図６は、本実施例１に係る第２判定部の処理を説明するための図である。FIG. 6 is a diagram for explaining the processing of the second determination unit according to the first embodiment. 図７は、本実施例１に係る会話分析装置の処理手順を示すフローチャートである。FIG. 7 is a flowchart showing a processing procedure of the conversation analyzer according to the first embodiment. 図８は、発明の効果を補足するための図である。FIG. 8 is a diagram for supplementing the effect of the invention. 図９は、本実施例２に係る会話分析装置の構成を示す機能ブロック図である。FIG. 9 is a functional block diagram showing the configuration of the conversation analyzer according to the second embodiment. 図１０は、本実施例２に係る設定キーワードテーブルのデータ構造の一例を示す図である。FIG. 10 is a diagram showing an example of the data structure of the setting keyword table according to the second embodiment. 図１１は、本実施例２に係るリスク評価値テーブルのデータ構造の一例を示す図である。FIG. 11 is a diagram showing an example of the data structure of the risk evaluation value table according to the second embodiment. 図１２は、本実施例２に係る生成部の処理を説明するための図である。FIG. 12 is a diagram for explaining the processing of the generation unit according to the second embodiment. 図１３は、本実施例２に係る会話分析装置の処理手順を示すフローチャートである。FIG. 13 is a flowchart showing a processing procedure of the conversation analyzer according to the second embodiment. 図１４は、本実施例３に係る会話分析装置の構成の一例を示す機能ブロック図である。FIG. 14 is a functional block diagram showing an example of the configuration of the conversation analyzer according to the third embodiment. 図１５は、本実施例３に係る抽出部の処理を説明するための図である。FIG. 15 is a diagram for explaining the processing of the extraction unit according to the third embodiment. 図１６は、本実施例３に係る会話分析装置の処理手順を示すフローチャートである。FIG. 16 is a flowchart showing a processing procedure of the conversation analyzer according to the third embodiment. 図１７は、本実施例に係る会話分析装置と同様の機能を実現するコンピュータのハードウェア構成の一例を示す図である。FIG. 17 is a diagram showing an example of a computer hardware configuration that realizes the same functions as the conversation analyzer according to the present embodiment. 図１８は、設定ワードおよび類似ワードの一例を示す図である。FIG. 18 is a diagram showing an example of a setting word and a similar word.

以下に、本願の開示する会話分析装置、会話分析方法および会話分析プログラムの実施例を図面に基づいて詳細に説明する。なお、この実施例によりこの発明が限定されるものではない。 Hereinafter, examples of the conversation analyzer, the conversation analysis method, and the conversation analysis program disclosed in the present application will be described in detail with reference to the drawings. The present invention is not limited to this embodiment.

図１は、本実施例１に係る会話分析装置の処理を説明するための図である。まず、会話分析装置は、テーブル１０Ａを基にして、会議Ａの会話対象のリスクを評価する場合について説明する。テーブル１０Ａは、会議Ａで発声された単語と、回数との関係を示すテーブルである。 FIG. 1 is a diagram for explaining the processing of the conversation analyzer according to the first embodiment. First, the conversation analyzer will explain the case of evaluating the risk of the conversation target of the conference A based on the table 10A. Table 10A is a table showing the relationship between the words uttered in the conference A and the number of times.

会話分析装置は、テーブル１０Ａの各単語のうち、設定キーワードに対応する単語を特定し、特定した設定キーワードに対応する単語の出現回数を基にして、「会議の状態」の良し悪しを判定する。たとえば、会話分析装置は、設定キーワードに対応する単語の出現回数が閾値以上である場合、会議の状態が「悪い」と判定し、出現回数が閾値未満である場合、会議の状態が「良い」と判定する。 The conversation analyzer identifies the word corresponding to the set keyword from each word in the table 10A, and determines whether the "meeting state" is good or bad based on the number of occurrences of the word corresponding to the specified set keyword. .. For example, the conversation analyzer determines that the state of the conference is "bad" when the number of occurrences of the word corresponding to the set keyword is equal to or more than the threshold value, and the state of the conference is "good" when the number of occurrences is less than the threshold value. Is determined.

たとえば、設定キーワードを「遅れている」、「できていない」とし、出現回数の閾値を「１０」とする。そうすると、会話分析装置は、テーブル１０Ａのうち、設定キーワードに対応する単語の出現回数「１６」が閾値「１０」以上となるため、会議Ａの状態が悪いと判定する。 For example, the setting keywords are "delayed" and "not completed", and the threshold value of the number of occurrences is "10". Then, the conversation analyzer determines that the state of the conference A is bad because the number of occurrences "16" of the word corresponding to the set keyword in the table 10A becomes the threshold value "10" or more.

続いて、会話分析装置は、テーブル１０Ａの各単語の抽象度を基にして「会話内容の抽象度」を判定する。各単語の抽象度は、言語コーパスにおける出現頻度によって決定され、出現頻度が高いものほど、抽象度が高くなる。 Subsequently, the conversation analyzer determines the "abstraction level of the conversation content" based on the abstraction level of each word in the table 10A. The degree of abstraction of each word is determined by the frequency of occurrence in the language corpus, and the higher the frequency of occurrence, the higher the degree of abstraction.

たとえば、テーブル１０Ａでは、抽象度の高い「仕様書」が多く出現している。このため、会話分析装置は、会議Ａの会話内容の抽象度が高いと判定する。「会話内容の抽象度が高い」ということは、会議で具体的な点が議論されておらず、抽象的な議論しかなされていないことを意味する。 For example, in Table 10A, many "specifications" with a high degree of abstraction appear. Therefore, the conversation analyzer determines that the conversation content of the conference A has a high degree of abstraction. "Highly abstract conversation content" means that no specific points have been discussed at the meeting, only abstract discussions have been made.

会話分析装置は、会話の状態の判定結果と、会話内容の抽象度の判定結果とを基にして、会議Ａの会話対象のリスクを評価する。上記のように、会話の状態が悪く、かつ、会話内容の抽象度が高いため、会議Ａにおける会話対象に関しては、意識合わせが求められ、解決しにくいと言える。このため、会話分析装置は、会議Ａの会話対象のリスクが「大」であると評価する。 The conversation analyzer evaluates the risk of the conversation target of the conversation A based on the judgment result of the conversation state and the judgment result of the abstraction degree of the conversation content. As described above, since the state of conversation is poor and the degree of abstraction of the conversation content is high, it can be said that it is difficult to solve the conversation target in the conference A because it is required to match the consciousness. Therefore, the conversation analyzer evaluates that the risk of the conversation target of the conference A is “high”.

次に、会話分析装置は、テーブル１０Ｂを基にして、会議Ｂの会話対象のリスクを評価する場合について説明する。テーブル１０Ｂは、会議Ｂで発声された単語と、回数との関係を示すテーブルである。 Next, the case where the conversation analyzer evaluates the risk of the conversation target of the conference B based on the table 10B will be described. Table 10B is a table showing the relationship between the words uttered in the conference B and the number of times.

会話分析装置は、テーブル１０Ｂの各単語のうち、設定キーワードに対応する単語を特定し、特定した設定キーワードに対応する単語の出現回数を基にして、「会議の状態」の良し悪しを判定する。たとえば、会話分析装置は、テーブル１０Ｂのうち、設定キーワードに対応する単語の出現回数「１６」が閾値「１０」以上となるため、会議Ｂの状態が悪いと判定する。 The conversation analyzer identifies the word corresponding to the set keyword from each word in the table 10B, and determines the quality of the "meeting state" based on the number of occurrences of the word corresponding to the specified set keyword. .. For example, the conversation analyzer determines that the state of the conference B is bad because the number of occurrences "16" of the word corresponding to the set keyword in the table 10B is equal to or more than the threshold value "10".

続いて、会話分析装置は、テーブル１０Ｂの各単語の抽象度を基にして「会話内容の抽象度」を判定する。たとえば、テーブル１０Ｂでは、抽象度の低い「文字コード」が多く出現している。このため、会話分析装置は、会議Ｂの会話内容の抽象度が低いと判定する。「会話内容の抽象度が低い」ということは、会議で具体的な点が議論されていることを意味する。 Subsequently, the conversation analyzer determines the "abstraction level of the conversation content" based on the abstraction level of each word in the table 10B. For example, in Table 10B, many "character codes" having a low degree of abstraction appear. Therefore, the conversation analyzer determines that the degree of abstraction of the conversation content of the conference B is low. "Low abstraction of conversation content" means that specific points are being discussed at the meeting.

会話分析装置は、会話の状態の判定結果と、会話内容の抽象度の判定結果とを基にして、会議Ｂの会話対象のリスクを評価する。上記のように、会話の状態は悪いが、会話内容の抽象度が低いため、会議Ｂにおける会話対象に関しては、論点が明確で意思疎通しやすく解決に結びつきやすいといえる。このため、会話分析装置は、会議Ｂの会話対象のリスクが「小」であると評価する。 The conversation analyzer evaluates the risk of the conversation target of the conversation B based on the judgment result of the conversation state and the judgment result of the abstraction degree of the conversation content. As described above, although the state of conversation is poor, the degree of abstraction of the conversation content is low, so it can be said that the points of discussion are clear, easy to communicate, and easy to solve with respect to the conversation target in the conference B. Therefore, the conversation analyzer evaluates that the risk of the conversation target of the conference B is “small”.

上記のように、本実施例１に係る会話分析装置は、設定キーワードの出現回数に基づく会議の状態の良し悪しに加えて、会議内容の抽象度を用いて、会話対象のリスクを評価している。たとえば、設定キーワードの出現回数だけでは、会議Ａ、会議Ｂともにリスクが大であると判定してしまう場合があるが、抽象度に鑑みると、会議Ｂは抽象度が低いため、具体的な議論が行われており、リスクは小さいと判定することができる。すなわち、会話の対象としてのリスクの程度を適切に評価することができる。 As described above, the conversation analyzer according to the first embodiment evaluates the risk of the conversation target by using the abstraction level of the meeting content in addition to the good or bad state of the meeting based on the number of occurrences of the set keyword. There is. For example, it may be judged that the risk is high in both conference A and conference B only by the number of occurrences of the set keyword. However, considering the degree of abstraction, conference B has a low degree of abstraction, so a concrete discussion It can be judged that the risk is small. That is, the degree of risk as a conversation target can be appropriately evaluated.

次に、本実施例１に係る会話分析装置の構成の一例について説明する。図２は、本実施例１に係る会話分析装置の構成を示す機能ブロック図である。図２に示すように、この会話分析装置１００は、第１判定部１１０と、音声認識部１２０と、第２判定部１３０と、評価部１４０とを有する。 Next, an example of the configuration of the conversation analyzer according to the first embodiment will be described. FIG. 2 is a functional block diagram showing the configuration of the conversation analyzer according to the first embodiment. As shown in FIG. 2, the conversation analyzer 100 includes a first determination unit 110, a voice recognition unit 120, a second determination unit 130, and an evaluation unit 140.

記憶部１０５は、リスク評価値テーブル１０５ａと、抽象度判定テーブル１０５ｂとを有する。記憶部１０５は、ＲＡＭ（Random Access Memory）、ＲＯＭ（Read Only Memory）、フラッシュメモリ（Flash Memory）などの半導体メモリ素子や、ＨＤＤ（Hard Disk Drive）などの記憶装置に対応する。 The storage unit 105 has a risk evaluation value table 105a and an abstraction degree determination table 105b. The storage unit 105 corresponds to semiconductor memory elements such as RAM (Random Access Memory), ROM (Read Only Memory), and flash memory (Flash Memory), and storage devices such as HDD (Hard Disk Drive).

図３は、本実施例１に係るリスク評価値テーブルのデータ構造の一例を示す図である。図３に示すように、このリスク評価値テーブル１０５ａは、比率と、リスク評価値とを対応付ける。比率は、後述する第１判定部１１０により算出されるものである。リスク評価値は、会議の状態の悪さの程度を示す値であり、リスク評価値が大きいほど、会議の状態が悪いことを示す。 FIG. 3 is a diagram showing an example of the data structure of the risk evaluation value table according to the first embodiment. As shown in FIG. 3, the risk evaluation value table 105a associates the ratio with the risk evaluation value. The ratio is calculated by the first determination unit 110, which will be described later. The risk evaluation value is a value indicating the degree of poor state of the meeting, and the larger the risk evaluation value, the worse the state of the meeting.

図４は、本実施例１に係る抽象度判定テーブルのデータ構造の一例を示す図である。図４に示すように、抽象度判定テーブル１０５ｂは、単語と、抽象度とを対応付ける。抽象度は、単語の抽象度の程度を示す値であり、抽象度が大きいほど、単語がより抽象的であることを示す。たとえば、単語の抽象度は、概念ＤＢ（Data Base）に基づいて決定される。 FIG. 4 is a diagram showing an example of the data structure of the abstraction degree determination table according to the first embodiment. As shown in FIG. 4, the abstraction degree determination table 105b associates a word with an abstraction degree. The degree of abstraction is a value indicating the degree of abstraction of a word, and the higher the degree of abstraction, the more abstract the word is. For example, the degree of abstraction of a word is determined based on a concept DB (Data Base).

図５は、概念ＤＢの一例を示す図である。概念ＤＢ５０において、各単語が概念木構造で定義され、概念階層が高いほど、抽象度が高くなる。概念階層「１」が最も高い階層であり、概念階層２、３、４、・・・、９の順に、階層は低くなる。 FIG. 5 is a diagram showing an example of the concept DB. In the concept DB 50, each word is defined by a concept tree structure, and the higher the concept hierarchy, the higher the degree of abstraction. The concept hierarchy "1" is the highest hierarchy, and the hierarchy becomes lower in the order of concept hierarchy 2, 3, 4, ..., 9.

図５において、単語「事象」、「行為」の概念階層は「１」であり、抽象度は「９」となる。単語「要求仕様」の概念階層は「５」であり、抽象度は「５」となる。単語「会員」の概念階層は「６」であり、抽象度は「４」となる。単語「ＩＤ」の概念階層は「７」であり、抽象度は「３」となる。単語「ユニバーサルデザイン」の階層は「９」であり、抽象度は「１」となる。 In FIG. 5, the conceptual hierarchy of the words “event” and “action” is “1”, and the degree of abstraction is “9”. The concept hierarchy of the word "requirement specification" is "5", and the degree of abstraction is "5". The conceptual hierarchy of the word "member" is "6" and the abstraction level is "4". The conceptual hierarchy of the word "ID" is "7" and the abstraction level is "3". The hierarchy of the word "universal design" is "9" and the abstraction level is "1".

たとえば、単語「要求仕様」は、概念ＤＢ５０の「要求仕様」にヒットするため、単語「要求仕様」の抽象度は「５」となる。単語「ユニバーサルデザイン」は、概念ＤＢ５０の「ユニバーサルデザイン」にヒットするため、単語「ユニバーサルデザイン」の抽象度は「１」となる。 For example, since the word "requirement specification" hits the "requirement specification" of the concept DB 50, the abstraction degree of the word "requirement specification" is "5". Since the word "universal design" hits the concept DB50 "universal design", the degree of abstraction of the word "universal design" is "1".

なお、単語が複合語の場合には、複合語に含まれる複数の単語のうち、概念階層の最も低い単語を特定し、特定した単語の概念階層に１を加算した概念階層を、複合語の概念階層とする。たとえば、単語（複合語）「会員ＩＤ」は、概念ＤＢ５０の「会員」と「ＩＤ」とにヒットする。概念ＤＢの「会員」の概念階層「６」と「ＩＤ」の概念階層「７」のうち、低い方の概念階層「７」に１を加算した概念階層「８」を、単語「会員ＩＤ」の概念階層として特定し、特定した概念階層「８」の抽象度「２」を、単語「会員ＩＤ」の抽象度として特定する。 When the word is a compound word, the word having the lowest conceptual hierarchy among the plurality of words included in the compound word is specified, and the conceptual hierarchy obtained by adding 1 to the conceptual hierarchy of the specified word is the compound word. It is a conceptual hierarchy. For example, the word (compound word) "member ID" hits "member" and "ID" in the concept DB 50. Of the concept hierarchy "6" of "member" and the concept hierarchy "7" of "ID" in the concept DB, the concept hierarchy "8" obtained by adding 1 to the lower concept hierarchy "7" is the word "member ID". The concept hierarchy of the above is specified, and the abstraction degree "2" of the specified concept hierarchy "8" is specified as the abstraction degree of the word "member ID".

図２の説明に戻る。図２の各処理部１１０，１２０，１３０，１４０は、ＣＰＵ（Central Processing Unit）やＭＰＵ（Micro Processing Unit）などによって実現できる。また、各処理部１１０〜１４０は、ＡＳＩＣ（Application Specific Integrated Circuit）やＦＰＧＡ（Field Programmable Gate Array）などのハードワイヤードロジックによっても実現できる。 Returning to the description of FIG. Each of the processing units 110, 120, 130, 140 in FIG. 2 can be realized by a CPU (Central Processing Unit), an MPU (Micro Processing Unit), or the like. Further, each processing unit 110-140 can also be realized by hard-wired logic such as ASIC (Application Specific Integrated Circuit) or FPGA (Field Programmable Gate Array).

会話分析装置１００は、会話音声データ２０を図示しない外部装置から取得する。会話音声データ２０は、会議の会話を録音した音声データであり、たとえば、時間と音声のパワーとを対応付ける。 The conversation analyzer 100 acquires the conversation voice data 20 from an external device (not shown). The conversation voice data 20 is voice data recorded from a conference conversation, for example, associating time with voice power.

第１判定部１１０は、会話音声データ２０を分析して、会話の状態を判定する処理部である。たとえば、第１判定部１１０は、会話音声データ２０の音声のパワーを走査して、音声のパワーが閾値Ｔｈｐ以上となる区間を、それぞれ発話区間として特定する。 The first determination unit 110 is a processing unit that analyzes the conversation voice data 20 and determines the state of the conversation. For example, the first determination unit 110 scans the voice power of the conversation voice data 20 and specifies the sections in which the voice power is equal to or higher than the threshold value Thp as the utterance sections.

第１判定部１１０は、会話音声データ２０から特定する複数の発話区間のうち、最初の発話区間の開始時刻から、最後の発話区間の終了時刻までの時間を「会議時間」として算出する。 The first determination unit 110 calculates the time from the start time of the first utterance section to the end time of the last utterance section as the "meeting time" among the plurality of utterance sections specified from the conversation voice data 20.

第１判定部１１０は、複数の発話区間の長さを比較して、最長の発話区間を特定する。第１判定部１１０は、式（１）を基にして、会議時間に対する最長の発話区間の比率を算出する。 The first determination unit 110 compares the lengths of the plurality of utterance sections and identifies the longest utterance section. The first determination unit 110 calculates the ratio of the longest utterance section to the conference time based on the equation (1).

比率＝最長の発話区間／会議時間×１００（％）・・・（１） Ratio = longest utterance section / meeting time x 100 (%) ... (1)

第１判定部１１０は、式（１）に基づいて算出した比率と、リスク評価値テーブル１０５ａとを比較して、リスク評価値を判定する。リスク評価値は、会話の状態の一例である。第１判定部１１０は、リスク評価値の判定結果を、評価部１４０に出力する。 The first determination unit 110 determines the risk evaluation value by comparing the ratio calculated based on the equation (1) with the risk evaluation value table 105a. The risk assessment value is an example of the state of conversation. The first determination unit 110 outputs the determination result of the risk evaluation value to the evaluation unit 140.

なお、本実施例１で説明する第１判定部１１０は、式（１）を基にして、リスク評価値を判定する場合について説明したが、図１で説明したように、設定キーワードの出現回数に基づいて、会話の状態を判定してもよい。たとえば、第１判定部１１０は、音声認識部１２０の音声認識結果に含まれる全単語の出現回数の総数に対する設定キーワードの出現回数の比率を算出し、算出した比率と、リスク評価値テーブル１０５ａとを基にして、リスク評価値を判定してもよい。 The first determination unit 110 described in the first embodiment has described the case where the risk evaluation value is determined based on the equation (1), but as described in FIG. 1, the number of occurrences of the set keyword has been described. The state of the conversation may be determined based on. For example, the first determination unit 110 calculates the ratio of the number of appearances of the set keyword to the total number of appearances of all the words included in the voice recognition result of the voice recognition unit 120, and the calculated ratio and the risk evaluation value table 105a The risk evaluation value may be determined based on.

音声認識部１２０は、会話音声データ２０を音声解析し、発声された単語を抽出する処理部である。音声認識部１２０は、抽出した各単語の情報を、第２判定部１３０に出力する。音声認識部１２０は、どのような音声認識技術を利用してもよい。たとえば、会話音声データ２０に含まれる声の特徴を基にして、話し言葉を文字列に変換し、単語を定義した辞書情報を基にして、単語を抽出する。 The voice recognition unit 120 is a processing unit that analyzes the conversation voice data 20 and extracts the spoken word. The voice recognition unit 120 outputs the information of each extracted word to the second determination unit 130. The voice recognition unit 120 may use any voice recognition technology. For example, the spoken word is converted into a character string based on the characteristics of the voice included in the conversation voice data 20, and the word is extracted based on the dictionary information that defines the word.

第２判定部１３０は、音声認識部１２０の音声認識結果と、抽象度判定テーブル１０５ｂとを基にして、会話内容の抽象度を判定する処理部である。第２判定部１３０は、判定した会話内容の抽象度の情報を、評価部１４０に出力する。 The second determination unit 130 is a processing unit that determines the degree of abstraction of the conversation content based on the voice recognition result of the voice recognition unit 120 and the abstraction degree determination table 105b. The second determination unit 130 outputs information on the degree of abstraction of the determined conversation content to the evaluation unit 140.

たとえば、第２判定部１３０は、音声認識結果に含まれる複数の単語と、抽象度判定テーブル１０５ｂとを比較して、抽象度判定テーブル１０５ｂに登録された各単語について、単語毎の出現回数をカウントする。 For example, the second determination unit 130 compares a plurality of words included in the speech recognition result with the abstraction degree determination table 105b, and determines the number of occurrences of each word for each word registered in the abstraction degree determination table 105b. Count.

図６は、本実施例１に係る第２判定部の処理を説明するための図である。図６に示す例では、単語「要求仕様」の出現回数が「６」、単語「会員ＩＤ」の出現回数が「１」、単語「出力メッセージ」の出現回数が「１」である。単語「要求仕様」の抽象度が「１０」、単語「会員ＩＤ」の抽象度が「２」、単語「出力メッセージ」の抽象度が「４」であるため、抽象度の合計は「６６」となる。また、各単語の抽象度の平均は「８．２５」となる。第２判定部１３０は、この抽象度の平均を、判定結果の抽象度の情報として、評価部１４０に出力する。 FIG. 6 is a diagram for explaining the processing of the second determination unit according to the first embodiment. In the example shown in FIG. 6, the number of appearances of the word "requirement specification" is "6", the number of appearances of the word "member ID" is "1", and the number of appearances of the word "output message" is "1". Since the abstraction level of the word "requirement specification" is "10", the abstraction level of the word "member ID" is "2", and the abstraction level of the word "output message" is "4", the total abstraction level is "66". It becomes. The average degree of abstraction of each word is "8.25". The second determination unit 130 outputs the average of the abstraction degree to the evaluation unit 140 as information on the abstraction degree of the determination result.

評価部１４０は、第１判定部１１０から出力される会話の状態（リスク評価値）と、第２判定部１３０から出力される抽象度とを基にして、会話音声データ２０に含まれる会話対象のリスクを評価する処理部である。 The evaluation unit 140 is a conversation target included in the conversation voice data 20 based on the conversation state (risk evaluation value) output from the first determination unit 110 and the abstraction degree output from the second determination unit 130. It is a processing department that evaluates the risk of.

たとえば、評価部１４０は、リスク評価値と、抽象度とを乗算した値が、基準評価値以上である場合、会話対象のリスクが「リスク大」であると評価する。一方、評価部１４０は、リスク評価値と、抽象度とを乗算した値が、基準評価値未満である場合、会話対象のリスクが「リスク小」であると評価する。 For example, the evaluation unit 140 evaluates that the risk of the conversation target is "high risk" when the value obtained by multiplying the risk evaluation value and the degree of abstraction is equal to or greater than the reference evaluation value. On the other hand, when the value obtained by multiplying the risk evaluation value by the degree of abstraction is less than the standard evaluation value, the evaluation unit 140 evaluates that the risk of the conversation target is "small risk".

評価部１４０は、評価結果を図示しない表示装置に出力して表示させてもよいし、ネットワークを介して図示しない外部装置に評価結果を通知してもよい。 The evaluation unit 140 may output the evaluation result to a display device (not shown) and display the evaluation result, or may notify the evaluation result to an external device (not shown) via the network.

次に、本実施例１に係る会話分析装置１００の処理手順の一例について説明する。図７は、本実施例１に係る会話分析装置の処理手順を示すフローチャートである。図７に示すように、会話分析装置１００は、会話音声データ２０を取得する（ステップＳ１０１）。 Next, an example of the processing procedure of the conversation analyzer 100 according to the first embodiment will be described. FIG. 7 is a flowchart showing a processing procedure of the conversation analyzer according to the first embodiment. As shown in FIG. 7, the conversation analyzer 100 acquires the conversation voice data 20 (step S101).

会話分析装置１００の第１判定部１１０は、会議時間に対する最長の発話時間の比率と、リスク評価値テーブル１０５ａとを基にして、会話の状態（リスク評価値）を判定する（ステップＳ１０２）。 The first determination unit 110 of the conversation analyzer 100 determines the conversation state (risk evaluation value) based on the ratio of the longest utterance time to the conference time and the risk evaluation value table 105a (step S102).

会話分析装置１００の音声認識部１２０は、会話音声データ２０に対して音声認識を実行し、発声された単語を抽出する（ステップＳ１０３）。会話分析装置１００の第２判定部１３０は、発声された単語と、抽象度判定テーブル１０５ｂとを基にして、会話内容の抽象度を判定する（ステップＳ１０４）。 The voice recognition unit 120 of the conversation analyzer 100 executes voice recognition on the conversation voice data 20 and extracts the spoken word (step S103). The second determination unit 130 of the conversation analyzer 100 determines the degree of abstraction of the conversation content based on the uttered word and the abstraction degree determination table 105b (step S104).

会話分析装置１００の評価部１４０は、会話の状態（リスク評価値）および会話内容の抽象度を基にして、会話対象のリスクを総合的に評価する（ステップＳ１０５）。評価部１４０は、評価結果を通知する（ステップＳ１０６）。 The evaluation unit 140 of the conversation analyzer 100 comprehensively evaluates the risk of the conversation target based on the conversation state (risk evaluation value) and the degree of abstraction of the conversation content (step S105). The evaluation unit 140 notifies the evaluation result (step S106).

次に、本実施例１に係る会話分析装置１００の効果について説明する。会話分析装置１００は、第１判定部１１０によって判定される会議の状態に加えて、会議内容の抽象度を用いて、会話対象のリスクを評価している。これによって、会話の対象としてのリスクの程度を適切に評価することができる。 Next, the effect of the conversation analyzer 100 according to the first embodiment will be described. The conversation analyzer 100 evaluates the risk of the conversation target by using the abstraction level of the conference content in addition to the state of the conference determined by the first determination unit 110. This makes it possible to appropriately evaluate the degree of risk as a conversation target.

ここで、発明の効果の補足として、リスクの大小と、単語の出現頻度との関係について説明する。図８は、発明の効果を補足するための図である。たとえば、単語「中止処理」は、関連文書３０ａ，３０ｂ，３０ｃの様々な箇所で出現しており、出現頻度が高く、抽象度が高いと言える。抽象度が高いと、改善すべき点の範囲が広く、リスクは大きいと考えられる。抽象度が高いと、具体性にかけ、「中止処理が具体的に何の中止処理になのか」誤解のリスクもある。 Here, as a supplement to the effect of the invention, the relationship between the magnitude of risk and the frequency of occurrence of words will be described. FIG. 8 is a diagram for supplementing the effect of the invention. For example, the word "stop processing" appears in various places in the related documents 30a, 30b, and 30c, and it can be said that the frequency of appearance is high and the degree of abstraction is high. The higher the level of abstraction, the wider the range of points to be improved and the greater the risk. If the degree of abstraction is high, there is a risk of misunderstanding "what kind of cancellation process is the cancellation process" in terms of concreteness.

単語「文字コード」は、関連文書３０ａ，３０ｂ，３０ｃのうち、特定の関連文書３０ｃにのみ出現しており、出現頻度が低く、抽象度が低いと言える。抽象度が低いと、改善すべき点の範囲が限定的であり、リスクは小さいと考えられる。 It can be said that the word "character code" appears only in a specific related document 30c among the related documents 30a, 30b, and 30c, and the frequency of appearance is low and the degree of abstraction is low. If the level of abstraction is low, the range of points to be improved is limited, and the risk is considered to be small.

図８で説明したように、リスク評価値の大きい（会話の状態が悪い）会話の内容の議論の中心が、抽象度の高い単語であれば、リスクの程度も大きいと推定できる。 As explained in FIG. 8, if the center of the discussion of the content of the conversation with a large risk evaluation value (the state of the conversation is bad) is a word with a high degree of abstraction, it can be estimated that the degree of risk is also large.

ところで、本実施例１に係る会話分析装置１００の第２判定部１３０は、概念ＤＢ５０を基に生成された抽象度判定テーブル１０５ｂを基にして、単語の抽象度を特定していたがこれに限定されるものではない。第２判定部１３０は、単語（あるいは複合語）と、概念ＤＢ５０とを直接比較して、単語（あるいは複合語）に対応する抽象度を特定してもよい。 By the way, the second determination unit 130 of the conversation analyzer 100 according to the first embodiment specifies the abstraction degree of the word based on the abstraction degree determination table 105b generated based on the concept DB 50. It is not limited. The second determination unit 130 may directly compare the word (or compound word) with the concept DB 50 to specify the degree of abstraction corresponding to the word (or compound word).

図９は、本実施例２に係る会話分析装置の構成を示す機能ブロック図である。図９に示すように、会話分析装置２００は、記憶部２０５と、音声認識部２１０と、第１判定部２２０と、第２判定部２３０と、評価部２４０と、生成部２５０とを有する。 FIG. 9 is a functional block diagram showing the configuration of the conversation analyzer according to the second embodiment. As shown in FIG. 9, the conversation analyzer 200 includes a storage unit 205, a voice recognition unit 210, a first determination unit 220, a second determination unit 230, an evaluation unit 240, and a generation unit 250.

記憶部２０５は、設定キーワードテーブル２０５ａと、リスク評価値テーブル２０５ｂと、抽象度判定テーブル２０５ｃとを有する。記憶部２０５は、ＲＡＭ、ＲＯＭ、フラッシュメモリなどの半導体メモリ素子や、ＨＤＤなどの記憶装置に対応する。 The storage unit 205 has a setting keyword table 205a, a risk evaluation value table 205b, and an abstraction degree determination table 205c. The storage unit 205 corresponds to semiconductor memory elements such as RAM, ROM, and flash memory, and storage devices such as HDD.

設定キーワードテーブル２０５ａは、会話の状態を判定するための抽出対象となる設定キーワードを定義するテーブルである。図１０は、本実施例２に係る設定キーワードテーブルのデータ構造の一例を示す図である。図１０に示すように、この設定キーワードテーブル２０５ａには、各種の設定キーワードが登録される。 The setting keyword table 205a is a table that defines setting keywords to be extracted for determining the state of conversation. FIG. 10 is a diagram showing an example of the data structure of the setting keyword table according to the second embodiment. As shown in FIG. 10, various setting keywords are registered in the setting keyword table 205a.

図１１は、本実施例２に係るリスク評価値テーブルのデータ構造の一例を示す図である。図１１に示すように、このリスク評価値テーブル２０５ｂは、出現回数と、リスク評価値とを対応付ける。出現回数は、会話音声データ２０の音声認識結果に含まれる単語のうち、設定キーワードの出現回数を示すものである。リスク評価値は、会議の状態の悪さの程度を示す値であり、リスク評価値が大きいほど、会議の状態が悪いことを示す。 FIG. 11 is a diagram showing an example of the data structure of the risk evaluation value table according to the second embodiment. As shown in FIG. 11, the risk evaluation value table 205b associates the number of occurrences with the risk evaluation value. The number of occurrences indicates the number of appearances of the set keyword among the words included in the voice recognition result of the conversation voice data 20. The risk evaluation value is a value indicating the degree of poor state of the meeting, and the larger the risk evaluation value, the worse the state of the meeting.

たとえば、出現回数が「Ｎａ以上、Ｎｂ未満」の場合、リスク評価値は「０」となる。出現回数が「Ｎｂ以上、Ｎｃ未満」の場合、リスク評価値は「１」となる。出現回数が「Ｎｃ以上」の場合、リスク評価値は「２」となる。ここで、Ｎａ、Ｎｂ、Ｎｃの値は、予め設定される値であり、大小関係をＮａ＜Ｎｂ＜Ｎｃとする。なお、２つの閾値Ｎａ、Ｎｂを用いて、リスク評価値を特定してもよい。たとえば、Ｎａ未満の場合に、リスク評価値を「０」とし、Ｎａ以上、Ｎｂ未満の場合に、リスク評価値を「１」とし、Ｎｂ以上の場合にリスク評価値「２」としてもよい。 For example, when the number of occurrences is "Na or more and less than Nb", the risk evaluation value is "0". When the number of appearances is "Nb or more and less than Nc", the risk evaluation value is "1". When the number of appearances is "Nc or more", the risk evaluation value is "2". Here, the values of Na, Nb, and Nc are preset values, and the magnitude relationship is Na <Nb <Nc. The risk evaluation value may be specified using the two threshold values Na and Nb. For example, if it is less than Na, the risk evaluation value may be "0", if it is Na or more and less than Nb, the risk evaluation value may be "1", and if it is Nb or more, the risk evaluation value may be "2".

抽象度判定テーブル２０５ｃは、単語と抽象度とを対応付けるテーブルである。抽象度判定テーブル２０５ｃのデータ構造は、図４で説明した抽象度判定テーブル１０５ｂのデータ構造と同様である。 The abstraction degree determination table 205c is a table that associates words with the abstraction degree. The data structure of the abstraction degree determination table 205c is the same as the data structure of the abstraction degree determination table 105b described with reference to FIG.

図９の説明に戻る。図９の各処理部２１０，２２０，２３０，２４０は、ＣＰＵやＭＰＵなどによって実現できる。また、各処理部２１０〜２４０は、ＡＳＩＣやＦＰＧＡなどのハードワイヤードロジックによっても実現できる。 Returning to the description of FIG. Each of the processing units 210, 220, 230, 240 in FIG. 9 can be realized by a CPU, MPU, or the like. Further, each processing unit 210-240 can be realized by hard-wired logic such as ASIC or FPGA.

会話分析装置２００は、会話音声データ２０を図示しない外部装置から取得する。会話音声データ２０は、会議の会話を録音した音声データであり、たとえば、時間と音声のパワーとを対応付ける。 The conversation analyzer 200 acquires the conversation voice data 20 from an external device (not shown). The conversation voice data 20 is voice data recorded from a conference conversation, for example, associating time with voice power.

音声認識部２１０は、会話音声データ２０を音声解析し、発声された単語を抽出する処理部である。音声認識部２１０は、抽出した各単語の情報を、第１判定部２２０および第２判定部２３０に出力する。音声認識部２１０は、どのような音声認識技術を利用してもよい。たとえば、会話音声データ２０に含まれる声の特徴を基にして、話し言葉を文字列に変換し、単語を定義した辞書情報を基にして、単語を抽出する。 The voice recognition unit 210 is a processing unit that analyzes the conversation voice data 20 and extracts the spoken word. The voice recognition unit 210 outputs the information of each extracted word to the first determination unit 220 and the second determination unit 230. The voice recognition unit 210 may use any voice recognition technology. For example, the spoken word is converted into a character string based on the characteristics of the voice included in the conversation voice data 20, and the word is extracted based on the dictionary information that defines the word.

第１判定部２２０は、会話音声データ２０の音声認識結果を基にして、会話の状態を判定する処理部である。たとえば、第１判定部２２０は、音声認識部２１０の音声認識結果と、設定キーワードテーブル２０５ａとを基にして、音声認識結果の各単語のうち、設定キーワードの出現回数をカウントする。第１判定部２２０は、カウントした出現回数と、リスク評価値テーブル２０５ｂとを比較して、リスク評価値（会話の状態）を判定する。第１判定部２２０は、判定結果となる会話の状態を、評価部２４０に出力する。 The first determination unit 220 is a processing unit that determines the state of conversation based on the voice recognition result of the conversation voice data 20. For example, the first determination unit 220 counts the number of occurrences of the set keyword in each word of the voice recognition result based on the voice recognition result of the voice recognition unit 210 and the set keyword table 205a. The first determination unit 220 compares the counted number of occurrences with the risk evaluation value table 205b to determine the risk evaluation value (conversation state). The first determination unit 220 outputs the state of conversation that is the determination result to the evaluation unit 240.

第２判定部２３０は、音声認識部２１０の音声認識結果と、抽象度判定テーブル２０５ｃとを基にして、会話内容の抽象度を判定する処理部である。第２判定部２３０は、判定した会話内容の抽象度の情報を、評価部２４０に出力する。第２判定部２３０が、会話内容の抽象度を判定する処理は、実施例１で説明した第２判定部１３０が、会話内容の抽象度を判定する処理と同様である。 The second determination unit 230 is a processing unit that determines the degree of abstraction of the conversation content based on the voice recognition result of the voice recognition unit 210 and the abstraction degree determination table 205c. The second determination unit 230 outputs information on the degree of abstraction of the determined conversation content to the evaluation unit 240. The process in which the second determination unit 230 determines the abstraction degree of the conversation content is the same as the process in which the second determination unit 130 described in the first embodiment determines the abstraction degree of the conversation content.

評価部２４０は、第１判定部２２０から出力される会話の状態（リスク評価値）と、第２判定部２３０から出力される抽象度とを基にして、会話音声データ２０に含まれる会話対象のリスクを評価する処理部である。評価部２４０がリスクを評価する処理は、評価部１４０がリスクを評価する処理と同様である。 The evaluation unit 240 is a conversation target included in the conversation voice data 20 based on the conversation state (risk evaluation value) output from the first judgment unit 220 and the abstraction degree output from the second judgment unit 230. It is a processing department that evaluates the risk of. The process in which the evaluation unit 240 evaluates the risk is the same as the process in which the evaluation unit 140 evaluates the risk.

生成部２５０は、言語コーパス２５を取得し、抽象度判定テーブル２０５ｃを生成する処理部である。言語コーパス２５は、会話対象に関する言語コーパスとする。たとえば、会話対象が会議に関するものであれば、言語コーパス２５は、会議に関する言語コーパスとなる。 The generation unit 250 is a processing unit that acquires the language corpus 25 and generates the abstraction degree determination table 205c. The language corpus 25 is a language corpus related to the conversation target. For example, if the conversation target is related to a conference, the language corpus 25 is a language corpus related to the conference.

生成部２５０は、言語コーパス２５を形態素解析し、各形態素（以下、単語）の出現頻度を集計する。生成部２５０は、単語の出現頻度を基にして、抽象度判定テーブル２０５ｃの各単語に対する抽象度を算出する。 The generation unit 250 analyzes the language corpus 25 for morphological elements and totals the frequency of appearance of each morpheme (hereinafter, word). The generation unit 250 calculates the abstraction degree for each word in the abstraction degree determination table 205c based on the frequency of occurrence of the words.

図１２は、本実施例２に係る生成部の処理を説明するための図である。図１２のテーブル１０Ｃは、単語と、言語コーパス２５での単語の出現頻度とを対応付けるテーブルである。生成部２５０は、式（２）を基にして、各単語の抽象度を算出してもよいし、出現頻度が低い順の順位を抽象度として算出してもよい。 FIG. 12 is a diagram for explaining the processing of the generation unit according to the second embodiment. Table 10C of FIG. 12 is a table for associating words with the frequency of occurrence of words in the language corpus 25. The generation unit 250 may calculate the abstraction degree of each word based on the equation (2), or may calculate the rank in ascending order of appearance frequency as the abstraction degree.

抽象度＝ｌｏｇ（出現頻度）×２・・・（２） Abstraction = log (frequency of appearance) x 2 ... (2)

たとえば、式（２）を基にして算出した各単語の抽象度を、第１抽象度と表記する。出現頻度の順位を基にして算出した各単語の抽象度を、第２抽象度と表記する。生成部２５０は、第１抽象度または第２抽象度のいずれか一方の抽象度を、抽象度判定テーブル２０５ｃに登録する。 For example, the degree of abstraction of each word calculated based on the equation (2) is expressed as the first degree of abstraction. The degree of abstraction of each word calculated based on the order of appearance frequency is referred to as the second degree of abstraction. The generation unit 250 registers the abstraction degree of either the first abstraction degree or the second abstraction degree in the abstraction degree determination table 205c.

次に、本実施例２に係る会話分析装置２００の処理手順の一例について説明する。図１３は、本実施例２に係る会話分析装置の処理手順を示すフローチャートである。図１３に示すように、会話分析装置２００は、会話音声データ２０を取得する（ステップＳ２０１）。会話分析装置２００は、会話音声データ２０に対して音声認識を実行し、発声された単語を抽出する（ステップＳ２０２）。 Next, an example of the processing procedure of the conversation analyzer 200 according to the second embodiment will be described. FIG. 13 is a flowchart showing a processing procedure of the conversation analyzer according to the second embodiment. As shown in FIG. 13, the conversation analyzer 200 acquires the conversation voice data 20 (step S201). The conversation analyzer 200 executes voice recognition on the conversation voice data 20 and extracts the spoken word (step S202).

会話分析装置２００の第１判定部２２０は、発声された単語を基にして、設定キーワードの出現回数をカウントし、音声の状態（リスク評価値）を判定する（ステップＳ２０３）。 The first determination unit 220 of the conversation analyzer 200 counts the number of occurrences of the set keyword based on the spoken word, and determines the voice state (risk evaluation value) (step S203).

会話分析装置２００の第２判定部２３０は、発声された単語と、抽象度判定テーブル２０５ｃとを基にして、会話内容の抽象度を判定する（ステップＳ２０４）。 The second determination unit 230 of the conversation analyzer 200 determines the degree of abstraction of the conversation content based on the uttered word and the abstraction degree determination table 205c (step S204).

会話分析装置２００の評価部２４０は、会話の状態（リスク評価値）および会話内容の抽象度を基にして、会話対象のリスクを総合的に評価する（ステップＳ２０５）。評価部２４０は、評価結果を通知する（ステップＳ２０６）。 The evaluation unit 240 of the conversation analyzer 200 comprehensively evaluates the risk of the conversation target based on the conversation state (risk evaluation value) and the abstraction level of the conversation content (step S205). The evaluation unit 240 notifies the evaluation result (step S206).

次に、本実施例２に係る会話分析装置２００の効果について説明する。会話分析装置１００は、第１判定部１１０によって判定される、設定キーワードの出現回数に基づく会議の状態に加えて、会議内容の抽象度を用いて、会話対象のリスクを評価している。これによって、会話の対象としてのリスクの程度を適切に評価することができる。 Next, the effect of the conversation analyzer 200 according to the second embodiment will be described. The conversation analyzer 100 evaluates the risk of the conversation target by using the abstraction degree of the conference content in addition to the state of the conference based on the number of occurrences of the set keyword determined by the first determination unit 110. This makes it possible to appropriately evaluate the degree of risk as a conversation target.

図１４は、本実施例３に係る会話分析装置の構成の一例を示す機能ブロック図である。図１４に示すように、この会話分析装置３００は、記憶部３０５と、第１判定部３１０と、抽出部３２０と、音声認識部３３０と、第２判定部３４０と、評価部３５０とを有する。 FIG. 14 is a functional block diagram showing an example of the configuration of the conversation analyzer according to the third embodiment. As shown in FIG. 14, the conversation analyzer 300 includes a storage unit 305, a first determination unit 310, an extraction unit 320, a voice recognition unit 330, a second determination unit 340, and an evaluation unit 350. ..

記憶部３０５は、リスク評価値テーブル３０５ａと、抽象度判定テーブル３０５ｂとを有する。記憶部３０５は、ＲＡＭ、ＲＯＭ、フラッシュメモリなどの半導体メモリ素子や、ＨＤＤなどの記憶装置に対応する。 The storage unit 305 has a risk evaluation value table 305a and an abstraction degree determination table 305b. The storage unit 305 corresponds to semiconductor memory elements such as RAM, ROM, and flash memory, and storage devices such as HDD.

リスク評価値テーブル３０５ａは、比率と、リスク評価値とを対応付けるテーブルである。比率およびリスク評価値に関する説明は、実施例１のリスク評価値テーブル１０５ａで行った説明と同様である。リスク評価値テーブル３０５ａのデータ構造は、図３で説明したリスク評価値テーブル１０５ａのデータ構造と同様である。 The risk evaluation value table 305a is a table that associates the ratio with the risk evaluation value. The description of the ratio and the risk evaluation value is the same as the description given in the risk evaluation value table 105a of Example 1. The data structure of the risk evaluation value table 305a is the same as the data structure of the risk evaluation value table 105a described with reference to FIG.

抽象度判定テーブル３０５ｂは、単語と、抽象度とを対応付けるテーブルである。抽象度に関する説明は、実施例１の抽象度判定テーブル１０５ｂで行った説明と同様である。抽象度判定テーブル３０５ｂのデータ構造は、図４で説明した抽象度判定テーブル３０５ｂのデータ構造と同様である。 The abstraction degree determination table 305b is a table that associates words with the abstraction degree. The description of the degree of abstraction is the same as the description given in the level of abstraction determination table 105b of the first embodiment. The data structure of the abstraction degree determination table 305b is the same as the data structure of the abstraction degree determination table 305b described with reference to FIG.

会話分析装置３００は、会話音声データ２０を図示しない外部装置から取得する。会話音声データ２０は、会議の会話を録音した音声データであり、たとえば、時間と音声のパワーとを対応付ける。 The conversation analyzer 300 acquires the conversation voice data 20 from an external device (not shown). The conversation voice data 20 is voice data recorded from a conference conversation, for example, associating time with voice power.

第１判定部３１０は、会話音声データ２０を分析して、会話の状態を判定する処理部である。第１判定部３１０は、第１判定部１１０と同様にして、会議時間に対する最長の発話区間の比率を算出する。第１判定部３１０は、算出した比率と、リスク評価値テーブル３０５ａとを比較して、リスク評価値を判定する。第１判定部１１０は、リスク評価値（会話の状態）の判定結果を、評価部３５０に出力する。 The first determination unit 310 is a processing unit that analyzes the conversation voice data 20 and determines the state of the conversation. The first determination unit 310 calculates the ratio of the longest utterance section to the conference time in the same manner as the first determination unit 110. The first determination unit 310 compares the calculated ratio with the risk evaluation value table 305a to determine the risk evaluation value. The first determination unit 110 outputs the determination result of the risk evaluation value (conversation state) to the evaluation unit 350.

また、第１判定部３１０は、比率を算出する場合に利用した、「最長の発話区間」の情報を、抽出部３２０に出力する。 Further, the first determination unit 310 outputs the information of the "longest utterance section" used when calculating the ratio to the extraction unit 320.

抽出部３２０は、第１判定部３１０による会話の状態の判定に寄与した区間を「着目区間」として抽出する処理部である。図１５は、本実施例３に係る抽出部の処理を説明するための図である。たとえば、最長の発話区間を、時刻ｔ_１〜時刻ｔ_２とすると、抽出部３２０は、時刻ｔ_１ａ〜時刻ｔ_２ａを着目区間として抽出する。時刻ｔ_１ａは、ｔ_１よりも所定時間（たとえば、５分間）前の時刻である。時刻ｔ_２ａは、ｔ_２よりも所定時間（たとえば、５分間）後の時刻である。抽出部３２０は、着目区間の情報を、第２判定部３４０に出力する。 The extraction unit 320 is a processing unit that extracts a section that contributes to the determination of the state of conversation by the first determination unit 310 as a “section of interest”. FIG. 15 is a diagram for explaining the processing of the extraction unit according to the third embodiment. For example, assuming that the longest utterance section is time t ₁ to time t ₂ , the extraction unit 320 extracts time t _1a to time t _2a as the section of interest. The time t _1a is a time before a predetermined time (for example, 5 minutes) from t ₁ . The time t _2a is a time after a predetermined time (for example, 5 minutes) after t ₂ . The extraction unit 320 outputs the information of the section of interest to the second determination unit 340.

音声認識部３３０は、会話音声データ２０を音声解析し、発声された単語を抽出する処理部である。音声認識部３３０は、抽出した各単語の情報を、第２判定部３４０に出力する。音声認識部３３０は、どのような音声認識技術を利用してもよい。たとえば、会話音声データ２０に含まれる声の特徴を基にして、話し言葉を文字列に変換し、単語を定義した辞書情報を基にして、単語を抽出する。 The voice recognition unit 330 is a processing unit that analyzes the conversation voice data 20 and extracts the spoken word. The voice recognition unit 330 outputs the information of each extracted word to the second determination unit 340. The voice recognition unit 330 may use any voice recognition technology. For example, the spoken word is converted into a character string based on the characteristics of the voice included in the conversation voice data 20, and the word is extracted based on the dictionary information that defines the word.

第２判定部３４０は、音声認識部３３０の音声認識結果の単語のうち、着目区間に発声された単語と、抽象度判定テーブル３０５ｂとを基にして、会話内容の抽象度を判定する処理部である。第２判定部３４０は、判定した会話内容の抽象度の情報を、評価部３５０に出力する。 The second determination unit 340 is a processing unit that determines the degree of abstraction of the conversation content based on the words uttered in the section of interest and the abstraction degree determination table 305b among the words of the voice recognition result of the voice recognition unit 330. Is. The second determination unit 340 outputs information on the degree of abstraction of the determined conversation content to the evaluation unit 350.

たとえば、第２判定部３４０は、着目区間に含まれる複数の単語と、抽象度判定テーブル３０５ｂとを比較して、抽象度判定テーブル３０５ｂに登録された各単語について、単語毎の出現回数をカウントする。第２判定部３４０は、カウントした単語毎の出現回数を基にして、抽象度を判定する。カウントした単語毎の出現回数を基にして、抽象度を判定する処理は、実施例１で説明した第２判定部１３０の処理と同様である。 For example, the second determination unit 340 compares a plurality of words included in the section of interest with the abstraction degree determination table 305b, and counts the number of occurrences of each word for each word registered in the abstraction degree determination table 305b. To do. The second determination unit 340 determines the degree of abstraction based on the number of occurrences of each counted word. The process of determining the degree of abstraction based on the number of occurrences of each counted word is the same as the process of the second determination unit 130 described in the first embodiment.

評価部３５０は、第１判定部３１０から出力される会話の状態（リスク評価値）と、第２判定部３４０から出力される抽象度とを基にして、会話音声データ２０に含まれる会話対象のリスクを評価する処理部である。評価部３５０がリスクを評価する処理は、評価部１４０がリスクを評価する処理と同様である。 The evaluation unit 350 is a conversation target included in the conversation voice data 20 based on the conversation state (risk evaluation value) output from the first judgment unit 310 and the abstraction degree output from the second judgment unit 340. It is a processing department that evaluates the risk of. The process in which the evaluation unit 350 evaluates the risk is the same as the process in which the evaluation unit 140 evaluates the risk.

次に、本実施例３に係る会話分析装置３００の処理手順の一例について説明する。図１６は、本実施例３に係る会話分析装置の処理手順を示すフローチャートである。図１６に示すように、会話分析装置３００は、会話音声データ２０を取得する（ステップＳ３０１）。 Next, an example of the processing procedure of the conversation analyzer 300 according to the third embodiment will be described. FIG. 16 is a flowchart showing a processing procedure of the conversation analyzer according to the third embodiment. As shown in FIG. 16, the conversation analyzer 300 acquires the conversation voice data 20 (step S301).

会話分析装置３００の第１判定部１１０は、会議時間に対する最長の発話時間の比率と、リスク評価値テーブル３０５ａとを基にして、会話の状態（リスク評価値）を判定する（ステップＳ３０２）。会話分析装置３００の抽出部３２０は、会話の状態の判定に寄与した着目区間を抽出する（ステップＳ３０３）。 The first determination unit 110 of the conversation analyzer 300 determines the conversation state (risk evaluation value) based on the ratio of the longest utterance time to the conference time and the risk evaluation value table 305a (step S302). The extraction unit 320 of the conversation analyzer 300 extracts the section of interest that contributed to the determination of the conversation state (step S303).

会話分析装置３００の音声認識部３３０は、会話音声データ２０に対して音声認識を実行し、発声された単語を抽出する（ステップＳ３０４）。会話分析装置３００の第２判定部３４０は、着目区間中に発声された単語と、抽象度判定テーブル３０５ｂとを基にして、会話内容の抽象度を判定する（ステップＳ３０５）。 The voice recognition unit 330 of the conversation analyzer 300 executes voice recognition on the conversation voice data 20 and extracts the spoken word (step S304). The second determination unit 340 of the conversation analyzer 300 determines the degree of abstraction of the conversation content based on the words uttered during the section of interest and the abstraction degree determination table 305b (step S305).

会話分析装置３００の評価部３５０は、会話の状態（リスク評価値）および会話内容の抽象度を基にして、会話対象のリスクを総合的に評価する（ステップＳ３０６）。評価部３５０は、評価結果を通知する（ステップＳ３０７）。 The evaluation unit 350 of the conversation analyzer 300 comprehensively evaluates the risk of the conversation target based on the conversation state (risk evaluation value) and the abstraction level of the conversation content (step S306). The evaluation unit 350 notifies the evaluation result (step S307).

次に、本実施例３に係る会話分析装置３００の効果について説明する。会話分析装置３００は、会議内容の抽象度を判定する場合に、会議の状態の判定に寄与した着目区間で発声された単語を用いて、抽象度を判定し、会話対象のリスク評価に用いる。これにより、会議の状態に密接に関係する区間の抽象度を用いて、リスクの大小を評価できる。 Next, the effect of the conversation analyzer 300 according to the third embodiment will be described. When determining the degree of abstraction of the content of the conference, the conversation analyzer 300 determines the degree of abstraction using the words uttered in the section of interest that contributed to the determination of the state of the conference, and uses it for risk evaluation of the conversation target. This makes it possible to evaluate the magnitude of risk using the level of abstraction of the section closely related to the state of the meeting.

ところで、本実施例３に係る会話分析装置３００は、会話の状態の判定に寄与した着目区間を、最長の発話区間を用いて抽出していたがこれに限定されるものではない。たとえば、会話分析装置３００の抽出部３２０は、設定キーワードが含まれる発話区間を基にして、着目区間を抽出してもよい。この場合には、第１判定部３１０は、音声認識部３３０から、音声認識結果を取得し、発話区間に設定キーワードが含まれるか否かを判定し、設定キーワードの含まれる発話区間の情報を、抽出部３２０に出力する。 By the way, the conversation analyzer 300 according to the third embodiment has extracted the section of interest that contributed to the determination of the state of conversation by using the longest utterance section, but the present invention is not limited to this. For example, the extraction unit 320 of the conversation analyzer 300 may extract the section of interest based on the utterance section including the set keyword. In this case, the first determination unit 310 acquires the voice recognition result from the voice recognition unit 330, determines whether or not the set keyword is included in the utterance section, and obtains information on the utterance section including the set keyword. , Output to the extraction unit 320.

次に、本実施例に示した会話分析装置１００（２００，３００）と同様の機能を実現するコンピュータのハードウェア構成の一例について説明する。図１７は、本実施例に係る会話分析装置と同様の機能を実現するコンピュータのハードウェア構成の一例を示す図である。 Next, an example of a computer hardware configuration that realizes the same functions as the conversation analyzer 100 (200, 300) shown in this embodiment will be described. FIG. 17 is a diagram showing an example of a computer hardware configuration that realizes the same functions as the conversation analyzer according to the present embodiment.

図１７に示すように、コンピュータ５００は、各種演算処理を実行するＣＰＵ５０１と、ユーザからのデータの入力を受け付ける入力装置５０２と、ディスプレイ５０３とを有する。また、コンピュータ５００は、記憶媒体からプログラム等を読み取る読み取り装置５０４と、有線または無線ネットワークを介して、外部装置等との間でデータの授受を行うインタフェース装置５０５とを有する。コンピュータ５００は、各種情報を一時記憶するＲＡＭ５０６と、ハードディスク装置５０７とを有する。そして、各装置５０１〜５０７は、バス５０８に接続される。 As shown in FIG. 17, the computer 500 includes a CPU 501 that executes various arithmetic processes, an input device 502 that receives data input from a user, and a display 503. Further, the computer 500 has a reading device 504 that reads a program or the like from a storage medium, and an interface device 505 that exchanges data with an external device or the like via a wired or wireless network. The computer 500 has a RAM 506 for temporarily storing various information and a hard disk device 507. Then, each device 501 to 507 is connected to the bus 508.

ハードディスク装置５０７は、音声認識プログラム５０７ａ、第１判定プログラム５０７ｂ、抽出プログラム５０７ｃ、第２判定プログラム５０７ｄ、評価プログラム５０７ｅを有する。ＣＰＵ５０１は、音声認識プログラム５０７ａ、第１判定プログラム５０７ｂ、抽出プログラム５０７ｃ、第２判定プログラム５０７ｄ、評価プログラム５０７ｅを読み出してＲＡＭ５０６に展開する。 The hard disk device 507 includes a voice recognition program 507a, a first determination program 507b, an extraction program 507c, a second determination program 507d, and an evaluation program 507e. The CPU 501 reads out the voice recognition program 507a, the first determination program 507b, the extraction program 507c, the second determination program 507d, and the evaluation program 507e and deploys them in the RAM 506.

音声認識プログラム５０７ａは、音声認識プロセス５０６ａとして機能する。第１判定プログラム５０７ｂは、第１判定プロセス５０６ｂとして機能する。抽出プログラム５０７ｃは、抽出プロセス５０６ｃとして機能する。第２判定プログラム５０７ｄは、第２判定プロセス５０６ｄとして機能する。評価プログラム５０７ｅは、評価プロセス５０６ｅとして機能する。 The voice recognition program 507a functions as a voice recognition process 506a. The first determination program 507b functions as the first determination process 506b. The extraction program 507c functions as the extraction process 506c. The second determination program 507d functions as the second determination process 506d. The evaluation program 507e functions as the evaluation process 506e.

音声認識プロセス５０６ａの処理は、音声認識部１２０，２１０，３３０の処理に対応する。第１判定プロセス５０６ｂの処理は、第１判定部１１０，２２０，３１０の処理に対応する。抽出プロセス５０６ｃの処理は、抽出部３２０の処理に対応する。第２判定プロセス５０６ｄの処理は、第２判定部１３０，２３０，３４０の処理に対応する。評価プロセス５０６ｅの処理は、評価部１４０，２４０，３５０の処理に対応する。 The processing of the voice recognition process 506a corresponds to the processing of the voice recognition units 120, 210, 330. The processing of the first determination process 506b corresponds to the processing of the first determination units 110, 220, 310. The processing of the extraction process 506c corresponds to the processing of the extraction unit 320. The processing of the second determination process 506d corresponds to the processing of the second determination units 130, 230, 340. The processing of the evaluation process 506e corresponds to the processing of the evaluation units 140, 240, 350.

なお、各プログラム５０７ａ〜５０７ｅついては、必ずしも最初からハードディスク装置５０７に記憶させておかなくてもよい。例えば、コンピュータ５００に挿入されるフレキシブルディスク（ＦＤ）、ＣＤ−ＲＯＭ、ＤＶＤディスク、光磁気ディスク、ＩＣカードなどの「可搬用の物理媒体」に各プログラムを記憶させておく。そして、コンピュータ５００が各プログラム５０７ａ〜５０７ｅを読み出して実行するようにしてもよい。 It should be noted that the programs 507a to 507e do not necessarily have to be stored in the hard disk device 507 from the beginning. For example, each program is stored in a "portable physical medium" such as a flexible disk (FD), a CD-ROM, a DVD disk, a magneto-optical disk, or an IC card inserted into the computer 500. Then, the computer 500 may read and execute each of the programs 507a to 507e.

以上の各実施例を含む実施形態に関し、さらに以下の付記を開示する。 The following additional notes will be further disclosed with respect to the embodiments including each of the above embodiments.

（付記１）会話音声を分析して、会話の状態を判定する第１判定部と、
前記会話音声に対して音声認識を行い、前記会話音声に含まれる単語を抽出する音声認識部と、
前記会話音声に含まれる単語を基にして、前記会話音声における会話内容の抽象度を判定する第２判定部と、
前記会話の状態と前記抽象度とを基にして、前記会話内容を評価する評価部と
を有することを特徴とする会話分析装置。 (Appendix 1) The first judgment unit that analyzes the conversation voice and judges the state of the conversation,
A voice recognition unit that performs voice recognition on the conversation voice and extracts words included in the conversation voice.
A second determination unit that determines the degree of abstraction of the conversation content in the conversation voice based on the words included in the conversation voice.
A conversation analyzer characterized by having an evaluation unit that evaluates the content of the conversation based on the state of the conversation and the degree of abstraction.

（付記２）前記第１判定部は、前記会話音声に含まれる所定の単語の出現回数、または、前記会話音声に含まれる複数の発話時間のうち、最長の発話時間を基にして、前記会話内容のリスクの程度を示す評価値を、前記会話の状態として判定することを特徴とする付記１に記載の会話分析装置。 (Appendix 2) The first determination unit is based on the number of occurrences of a predetermined word included in the conversation voice or the longest utterance time among a plurality of utterance times included in the conversation voice. The conversation analyzer according to Appendix 1, wherein an evaluation value indicating the degree of risk of the content is determined as the state of the conversation.

（付記３）前記第２判定部は、言語コーパスに含まれる単語の出現頻度を基にして前記抽象度を判定することを特徴とする付記１または２に記載の会話分析装置。 (Appendix 3) The conversation analyzer according to Appendix 1 or 2, wherein the second determination unit determines the degree of abstraction based on the frequency of appearance of words included in the language corpus.

（付記４）前記第２判定部は、より抽象的な単語ほど上位の層に位置する概念構造情報と、前記会話音声に含まれる単語とを基にして、前記抽象度を判定することを特徴とする付記２に記載の会話分析装置。 (Appendix 4) The second determination unit is characterized in that the abstraction degree is determined based on the conceptual structure information located in the upper layer as the more abstract words are and the words included in the conversation voice. The conversation analyzer according to Appendix 2.

（付記５）前記会話音声の区間のうち、前記会話の状態の判定に寄与した区間を着目区間として抽出する抽出部を更に有し、前記第２判定部は、前記着目区間に含まれる単語を基にして、前記会話内容の抽象度を判定することを特徴とする付記１〜４のいずれか一つに記載の会話分析装置。 (Appendix 5) Among the conversation voice sections, the second determination unit further includes an extraction unit that extracts the section that contributed to the determination of the conversation state as the interest section, and the second determination unit extracts words included in the interest section. The conversation analyzer according to any one of Supplementary notes 1 to 4, wherein the degree of abstraction of the conversation content is determined based on the conversation content.

（付記６）コンピュータが実行する会話分析方法であって、
会話音声を分析して、会話の状態を判定し、
前記会話音声に対して音声認識を行い、前記会話音声に含まれる単語を抽出し、
前記会話音声に含まれる単語を基にして、前記会話音声における会話内容の抽象度を判定し、
前記会話の状態と前記抽象度とを基にして、前記会話内容を評価する
処理を実行することを特徴とする会話分析方法。 (Appendix 6) A conversation analysis method executed by a computer.
Analyze the conversation voice to determine the state of the conversation,
Voice recognition is performed on the conversation voice, and words included in the conversation voice are extracted.
Based on the words contained in the conversation voice, the degree of abstraction of the conversation content in the conversation voice is determined.
A conversation analysis method characterized by executing a process of evaluating the conversation content based on the state of the conversation and the degree of abstraction.

（付記７）前記会話の状態を判定する処理は、前記会話音声に含まれる所定の単語の出現回数、または、前記会話音声に含まれる複数の発話時間のうち、最長の発話時間を基にして、前記会話内容のリスクの程度を示す評価値を、前記会話の状態として判定することを特徴とする付記６に記載の会話分析方法。 (Appendix 7) The process of determining the state of the conversation is based on the number of occurrences of a predetermined word included in the conversation voice or the longest utterance time among a plurality of utterance times included in the conversation voice. The conversation analysis method according to Appendix 6, wherein an evaluation value indicating the degree of risk of the conversation content is determined as the state of the conversation.

（付記８）前記抽象度を判定する処理は、言語コーパスに含まれる単語の出現頻度を基にして前記抽象度を判定することを特徴とする付記６または７に記載の会話分析方法。 (Appendix 8) The conversation analysis method according to Appendix 6 or 7, wherein the process of determining the degree of abstraction determines the degree of abstraction based on the frequency of appearance of words included in the language corpus.

（付記９）前記抽象度を判定する処理は、より抽象的な単語ほど上位の層に位置する概念構造情報と、前記会話音声に含まれる単語とを基にして、前記抽象度を判定することを特徴とする付記７に記載の会話分析方法。 (Appendix 9) In the process of determining the degree of abstraction, the degree of abstraction is determined based on the conceptual structure information located in the higher layer of the more abstract words and the words included in the conversation voice. The conversation analysis method according to Appendix 7, characterized by the above.

（付記１０）前記会話音声の区間のうち、前記会話の状態の判定に寄与した区間を着目区間として抽出する処理を更に実行し、前記抽象度を判定する処理は、前記着目区間に含まれる単語を基にして、前記会話内容の抽象度を判定することを特徴とする付記６〜９のいずれか一つに記載の会話分析方法。 (Appendix 10) Of the conversation voice sections, the process of extracting the section that contributed to the determination of the conversation state as the section of interest is further executed, and the process of determining the degree of abstraction is a word included in the section of interest. The conversation analysis method according to any one of Appendix 6 to 9, wherein the degree of abstraction of the conversation content is determined based on the above.

（付記１１）コンピュータに、
会話音声を分析して、会話の状態を判定し、
前記会話音声に対して音声認識を行い、前記会話音声に含まれる単語を抽出し、
前記会話音声に含まれる単語を基にして、前記会話音声における会話内容の抽象度を判定し、
前記会話の状態と前記抽象度とを基にして、前記会話内容を評価する
処理を実行させることを特徴とする会話分析プログラム。 (Appendix 11) To the computer
Analyze the conversation voice to determine the state of the conversation,
Voice recognition is performed on the conversation voice, and words included in the conversation voice are extracted.
Based on the words contained in the conversation voice, the degree of abstraction of the conversation content in the conversation voice is determined.
A conversation analysis program characterized by executing a process of evaluating the conversation content based on the state of the conversation and the degree of abstraction.

（付記１２）前記会話の状態を判定する処理は、前記会話音声に含まれる所定の単語の出現回数、または、前記会話音声に含まれる複数の発話時間のうち、最長の発話時間を基にして、前記会話内容のリスクの程度を示す評価値を、前記会話の状態として判定することを特徴とする付記１１に記載の会話分析プログラム。 (Appendix 12) The process of determining the state of the conversation is based on the number of occurrences of a predetermined word included in the conversation voice or the longest utterance time among a plurality of utterance times included in the conversation voice. The conversation analysis program according to Appendix 11, wherein an evaluation value indicating the degree of risk of the conversation content is determined as the state of the conversation.

（付記１３）前記抽象度を判定する処理は、言語コーパスに含まれる単語の出現頻度を基にして前記抽象度を判定することを特徴とする付記１１または１２に記載の会話分析プログラム。 (Appendix 13) The conversation analysis program according to Appendix 11 or 12, wherein the process of determining the degree of abstraction determines the degree of abstraction based on the frequency of appearance of words included in the language corpus.

（付記１４）前記抽象度を判定する処理は、より抽象的な単語ほど上位の層に位置する概念構造情報と、前記会話音声に含まれる単語とを基にして、前記抽象度を判定することを特徴とする付記１２に記載の会話分析プログラム。 (Appendix 14) In the process of determining the degree of abstraction, the degree of abstraction is determined based on the conceptual structure information located in the higher layer of the more abstract words and the words included in the conversation voice. The conversation analysis program according to Appendix 12, which comprises the above.

（付記１５）前記会話音声の区間のうち、前記会話の状態の判定に寄与した区間を着目区間として抽出する処理を更にコンピュータに実行させ、前記抽象度を判定する処理は、前記着目区間に含まれる単語を基にして、前記会話内容の抽象度を判定することを特徴とする付記１１〜１４のいずれか一つに記載の会話分析プログラム。 (Appendix 15) Of the conversation voice sections, a process of further executing a process of extracting a section contributing to the determination of the conversation state as a section of interest and determining the degree of abstraction is included in the section of interest. The conversation analysis program according to any one of Supplementary note 11 to 14, wherein the degree of abstraction of the conversation content is determined based on the words.

１００，２００，３００会話分析装置
１０５，２０５，３０５記憶部
１０５ａ，２０５ｂ，３０５ａリスク評価値テーブル
１０５ｂ，２０５ｃ，３０５ｂ抽象度判定テーブル
１１０，２２０，３１０第１判定部
１２０，２１０，３３０音声認識部
１３０，２３０，３４０第２判定部
１４０，２４０，３５０評価部
２０５ａ設定キーワードテーブル
３２０抽出部 100, 200, 300 Conversation analyzer 105, 205, 305 Storage unit 105a, 205b, 305a Risk evaluation value table 105b, 205c, 305b Abstraction degree judgment table 110, 220, 310 First judgment unit 120, 210, 330 Speech recognition unit 130, 230, 340 Second judgment unit 140, 240, 350 Evaluation unit 205a Setting keyword table 320 Extraction unit

Claims

The first judgment unit that analyzes the conversation voice and judges the state of the conversation,
A voice recognition unit that performs voice recognition on the conversation voice and extracts words included in the conversation voice.
A second determination unit that determines the degree of abstraction of the conversation content in the conversation voice based on the words included in the conversation voice.
A conversation analyzer characterized by having an evaluation unit that evaluates the content of the conversation based on the state of the conversation and the degree of abstraction.

The first determination unit determines the risk of the conversation content based on the number of occurrences of a predetermined word included in the conversation voice or the longest utterance time among a plurality of utterance times included in the conversation voice. The conversation analyzer according to claim 1, wherein an evaluation value indicating the degree is determined as the state of the conversation.

The conversation analyzer according to claim 1 or 2, wherein the second determination unit determines the degree of abstraction based on the frequency of appearance of words included in the language corpus.

The second determination unit is characterized in that the degree of abstraction is determined based on the conceptual structure information located in the upper layer as the more abstract words are and the words included in the conversation voice. 2. The conversation analyzer according to 2.

The second determination unit further includes an extraction unit that extracts a section of the conversation voice section that has contributed to the determination of the conversation state as a section of interest, and the second determination unit is based on a word included in the section of interest. The conversation analyzer according to any one of claims 1 to 4, wherein the degree of abstraction of the conversation content is determined.

A computer-executed conversation analysis method
Analyze the conversation voice to determine the state of the conversation,
Voice recognition is performed on the conversation voice, and words included in the conversation voice are extracted.
Based on the words contained in the conversation voice, the degree of abstraction of the conversation content in the conversation voice is determined.
A conversation analysis method characterized by executing a process of evaluating the conversation content based on the state of the conversation and the degree of abstraction.

On the computer
Analyze the conversation voice to determine the state of the conversation,
Voice recognition is performed on the conversation voice, and words included in the conversation voice are extracted.
Based on the words contained in the conversation voice, the degree of abstraction of the conversation content in the conversation voice is determined.
A conversation analysis program characterized by executing a process of evaluating the conversation content based on the state of the conversation and the degree of abstraction.