JP5674689B2

JP5674689B2 - Knowledge amount estimation information generation device, knowledge amount estimation device, method, and program

Info

Publication number: JP5674689B2
Application number: JP2012031020A
Authority: JP
Inventors: 千明宮崎; 東中　竜一郎; 竜一郎東中; 牧野　俊朗; 俊朗牧野; 松尾　義博; 義博松尾
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2012-02-15
Filing date: 2012-02-15
Publication date: 2015-02-25
Anticipated expiration: 2032-02-15
Also published as: JP2013167765A

Description

本発明は、知識量推定情報生成装置、知識量推定装置、方法、及びプログラムに係り、特に、問合せ者と回答者との対話における発話列のテキストデータから問合せ者の当該対話の内容に関する知識量を精度良く推定するのに好適な知識量推定情報生成装置、知識量推定装置、方法、及びプログラムに関する。 The present invention relates to a knowledge amount estimation information generation device, a knowledge amount estimation device, a method, and a program, and in particular, a knowledge amount related to the content of an inquirer's dialogue from text data of an utterance string in the dialogue between the inquirer and the respondent. The present invention relates to a knowledge amount estimation information generation device, a knowledge amount estimation device, a method, and a program suitable for accurately estimating the amount of information.

プログラムされたコンピュータにより人の知識量を推定する従来技術としては、例えば、特許文献１，２に記載のものがある。これらの特許文献１，２においては、文書検索を行うユーザの分野ごとの知識量や知識の深さといった背景を推定する技術が記載されている。 For example, Patent Documents 1 and 2 disclose conventional techniques for estimating the amount of human knowledge using a programmed computer. In these Patent Documents 1 and 2, a technique for estimating a background such as a knowledge amount and a knowledge depth for each field of a user who performs a document search is described.

文書検索では、ユーザが自分の探している情報を得るために、情報に関連しそうなキーワードを指定して検索を実施する。検索の精度を向上させるためには、検索を行うユーザの背景を知るための情報の獲得が重要であることが知られており、このため、例えば特許文献３では、ユーザの指定したキーワード履歴や、閲覧した電子文書の履歴から、どの分野に関して関心があるかといった嗜好分野を推定する技術が記載されている。 In document search, in order to obtain information that the user is searching for, a search is performed by specifying a keyword that is likely to be related to the information. In order to improve the accuracy of the search, it is known that acquisition of information for knowing the background of the user who performs the search is important. For this reason, in Patent Document 3, for example, the keyword history specified by the user or In addition, a technique for estimating a favorite field such as which field is interested from the history of a browsed electronic document is described.

しかしながら、特許文献３に記載の嗜好推定技術では、ユーザのよく調べる情報の分野を推定できても、ユーザがその分野において、どの程度精通しているのかといったことは推定できない。 However, with the preference estimation technique described in Patent Literature 3, even if the field of information that the user frequently examines can be estimated, it is not possible to estimate how familiar the user is in that field.

これに対して、特許文献１，２においては、検索システムのユーザログの一部であるクエリログを利用し、クエリログのクエリから分野毎にクエリの専門性度合いを算出し、ユーザの知識量に関する背景を推定することにより、ユーザがその分野において、どの程度精通しているのかといったことが推定できる。 On the other hand, in Patent Documents 1 and 2, a query log that is a part of a user log of a search system is used, a degree of expertise of a query is calculated for each field from a query of the query log, and a background relating to a user's knowledge amount It is possible to estimate how familiar the user is in the field.

また、非特許文献１においては、上述の特許文献１，２と同様の技術が開示されている。すなわち、使用語彙の専門性に着目し、各単語の専門性をその希少さなどから算出しておき、あるユーザが入力した検索クエリの履歴から、クエリとして使われた単語の専門性の平均値を計算し、当該ユーザの知識量を推定する技術が記載されている。 Further, Non-Patent Document 1 discloses a technique similar to that of Patent Documents 1 and 2 described above. That is, paying attention to the expertise of the vocabulary used, the expertise of each word is calculated from its rarity, etc., and the average value of the expertise of the word used as the query from the history of the search query entered by a user Is described, and a technique for estimating the knowledge amount of the user is described.

また、非特許文献２においては、バス運行情報案内システムのユーザを対象として、音声対話システムにおいてユーザに協調的な対話を行うために、ユーザの知識量を推定する技術が記載されている。ここでは、あるユーザが検索対象としたバス停の履歴から、バス停の属性（市民のみが利用するバス停／それ以外）の割合を求め、ユーザの当該地域に関する知識量を推定する技術が記載されている。 Non-Patent Document 2 describes a technique for estimating the amount of knowledge of a user in order to perform a collaborative dialogue with the user in a voice dialogue system for the user of the bus operation information guidance system. Here, a technique is described in which the ratio of bus stop attributes (bus stops used only by citizens / others) is determined from the history of bus stops searched by a certain user, and the amount of knowledge about the area of the user is estimated. .

特開２０１１−１７０６９９号公報JP 2011-170699 A 特開２０１１−２２１８７２号公報JP 2011-221872 A 特開２０００−１４８７７３号公報JP 2000-148773 A

佐藤大祐, 安田宜仁, 望月崇由, 鈴木智也, 松浦由美子, 片岡良治, “検索システムユーザの分野別の知識推定”, DEIM Forum 2010 I-3 （2010）Daisuke Sato, Yoshihito Yasuda, Takayoshi Mochizuki, Tomoya Suzuki, Yumiko Matsuura, Ryoji Kataoka, “Knowledge Estimation by Search System User Field”, DEIM Forum 2010 I-3 (2010) 上野晋一, 駒谷和範, 河原達也, 奥乃博, “バス運行情報案内システムにおけるユーザモデルを用いた適応的応答の生成”, 音声言語情報処理 42-2 2002.7.12, vol.65, ｐｐ.5-10 （2002）Junichi Ueno, Kazunori Komatani, Tatsuya Kawahara, Hiroshi Okuno, “Generating Adaptive Responses Using User Models in Bus Operation Information Guidance Systems”, Spoken Language Information Processing 42-2 2002.7.12, vol.65, pp.5- 10 (2002)

しかしながら、上述の特許文献１，２、及び非特許文献１に記載の知識量推定技術では、人間同士の対話における話者を対象として当該話者の知識量を推定することはできない。 However, with the knowledge amount estimation techniques described in Patent Documents 1 and 2 and Non-Patent Document 1, it is not possible to estimate the speaker's knowledge amount for speakers in a dialogue between humans.

対話の話者に対しても知識量を推定できるようにするためには、音声認識結果を知識量推定の入力として扱えるようにすることが必要である。しかしながら、上述した従来の語彙の専門性を用いてユーザの知識量を推定する技術において音声認識技術を用いる場合には、音声認識結果に含まれる誤認識に弱いという問題がある。 In order to be able to estimate the amount of knowledge even for a conversation speaker, it is necessary to be able to handle the speech recognition result as an input for knowledge amount estimation. However, when the speech recognition technology is used in the technology for estimating the amount of knowledge of the user using the above-described vocabulary expertise, there is a problem that it is vulnerable to misrecognition included in the speech recognition result.

具体的に起こる問題としては、ある話者の発話の音声認識結果に、専門性の高い単語が誤っていくつか出現した場合、その話者の知識量は誤って高いと推定されてしまう恐れがある。 As a specific problem, if some highly specialized words appear in the speech recognition result of a speaker's utterance, the knowledge amount of the speaker may be presumed to be erroneously high. is there.

このような対話の話者に対して知識量推定技術を使用する場面の例としてコンタクトセンタの通話分析や音声対話システムが考えられるが、どちらの場合においても、発話を音声認識することは不可欠であるため、このような場面で使用する知識量推定技術には、音声認識結果に含まれる誤認識に頑健であることが求められる。 Examples of situations where knowledge amount estimation technology is used for such conversational speakers include contact center call analysis and voice dialogue systems. In either case, speech recognition is indispensable. Therefore, the knowledge amount estimation technique used in such a scene is required to be robust against misrecognition included in the speech recognition result.

上述の非特許文献２においては、バス運行情報案内システムにおいてユーザの知識量を推定する際に、あるエリアに居るユーザが電話等で入力する音声情報を音声対話システムにより認識して、当該エリアに関しての当該ユーザの知識量を推定する技術が記載されている。 In the above-mentioned Non-Patent Document 2, when estimating the amount of knowledge of the user in the bus operation information guidance system, the voice information input by the user in a certain area by telephone or the like is recognized by the voice dialogue system, and A technique for estimating the amount of knowledge of the user is described.

しかしながら、この技術においても、ユーザが電話等で入力する音声情報は、バス停、乗車場所、及び降車場所等の特定の情報であり、さらに、これらの特定情報の属性、すなわち、市民のみが利用するバス停とその他のバス停、正式なバス停名、最寄りの施設名でのバス停指定など、上述の特許文献１，２、及び非特許文献１と同様に、当該エリアに関してのユーザの専門的な単語を用いて、当該エリアにおけるユーザの知識量を推定するものであり、ユーザの発話の音声認識結果に、専門性の高い単語が誤っていくつか出現した場合には、そのユーザの知識量は誤って高いと推定されてしまう恐れがある。 However, also in this technology, the voice information input by the user by telephone or the like is specific information such as a bus stop, a boarding place, and a getting-off place, and further, attributes of these specific information, that is, only citizens use it. Similar to the above-mentioned Patent Documents 1 and 2 and Non-Patent Document 1, such as bus stops and other bus stops, formal bus stop names, bus stop designations with the nearest facility names, etc. The amount of knowledge of the user in the area is estimated, and if some highly specialized words appear in error in the speech recognition result of the user's utterance, the amount of knowledge of the user is erroneously high. It may be estimated that.

本発明で解決しようとする問題点は、音声による対話の話者に対する知識量の推定に、従来のユーザの使用語彙の専門性に着目して当該ユーザの知識量を推定する技術を用いた場合には、音声認識結果に含まれる誤認識により、当該話者の知識量を正しく推定することができない恐れがある点である。 The problem to be solved by the present invention is that, when the technology for estimating the knowledge amount of the user is used for estimating the knowledge amount for the speaker of the dialogue by voice, focusing on the expertise of the conventional user's vocabulary There is a possibility that the knowledge amount of the speaker cannot be estimated correctly due to misrecognition included in the speech recognition result.

本発明は、上記問題点を解決するためになされたものであり、対話結果に基づき、対話の対象となった事項に対する問合せ者の知識量の推定を精度良く行うことが可能となる知識量推定情報生成装置、知識量推定装置、方法、及びプログラムを提供することを目的とする。 The present invention has been made to solve the above-mentioned problems, and based on the result of dialogue, it is possible to estimate the knowledge amount of the inquirer with respect to the subject matter of the dialogue with high accuracy. An object is to provide an information generation device, a knowledge amount estimation device, a method, and a program.

上記目的を達成するために、請求項１に記載の知識量推定情報生成装置は、問合せ者と回答者との対話における発話列のテキストデータから、相手への質問、説明に該当する発話意図を示す発話を判別する発話意図判別手段と、前記発話意図判別手段で判別された前記発話が該当する前記発話意図に基づき、前記テキストデータにおける前記発話意図の発生状態に関する特徴量を示す意図特徴量を抽出すると共に、前記テキストデータから前記問合せ者が使用した単語の種類の総数を使用語彙特徴量として抽出する特徴量抽出手段と、前記特徴量抽出手段で抽出された意図特徴量と使用語彙特徴量、及び当該意図特徴量と当該使用語彙特徴量の抽出の対象とされた前記テキストデータに対して当該テキストデータにより示される対話から想定される前記問合せ者の知識量を示すものとして予め付与された知識量情報を学習データとして用いることで、前記テキストデータに対応する前記知識量の推定に用いる推定情報を生成する知識量推定情報生成手段と、を備えている。 In order to achieve the above object, the knowledge amount estimation information generating apparatus according to claim 1 generates an utterance intention corresponding to a question and explanation to the other party from text data of an utterance string in a dialogue between an inquirer and a respondent. An intention feature amount indicating a feature amount related to the occurrence state of the speech intention in the text data, based on the speech intention corresponding to the speech determined by the speech intention determination unit; And extracting the total number of types of words used by the inquirer from the text data as a used vocabulary feature amount, and an intention feature amount and a used vocabulary feature amount extracted by the feature amount extraction unit. Assumed from the dialogue indicated by the text data with respect to the text data targeted for extraction of the intended feature quantity and the used vocabulary feature quantity Knowledge amount estimation information generating means for generating estimated information used for estimating the knowledge amount corresponding to the text data by using knowledge amount information given in advance as the learning data to indicate the knowledge amount of the inquirer And.

請求項１に記載の知識量推定情報生成装置によれば、発話意図判別手段により、問合せ者と回答者との対話における発話列のテキストデータから、各発話が、相手への質問、説明に該当する発話意図を示す発話が判別され、特徴量抽出手段により、前記発話意図判別手段で判別された前記発話が該当する前記発話意図に基づき、前記テキストデータにおける前記発話意図の発生状態に関する特徴量を示す意図特徴量が抽出されると共に、前記テキストデータから前記問合せ者が使用した単語の種類の総数を使用語彙特徴量として抽出され、知識量推定情報生成手段により、前記特徴量抽出手段で抽出された意図特徴量と使用語彙特徴量、及び当該意図特徴量と当該使用語彙特徴量の抽出の対象とされた前記テキストデータに対して当該テキストデータにより示される対話から想定される前記問合せ者の知識量を示すものとして予め付与された知識量情報を学習データとして用いることで、前記テキストデータに対応する前記知識量の推定に用いる推定情報が生成される。 According to the knowledge amount estimation information generating device according to claim 1, each utterance corresponds to a question and explanation to the other party from the text data of the utterance string in the dialogue between the inquirer and the respondent by the utterance intention determination means. An utterance indicating the utterance intention to be performed is determined, and a feature amount relating to the state of occurrence of the utterance intention in the text data is determined by the feature amount extraction unit based on the utterance intention to which the utterance determined by the utterance intention determination unit corresponds. The intention feature amount to be shown is extracted, and the total number of types of words used by the inquirer is extracted from the text data as the used vocabulary feature amount, and is extracted by the feature amount extraction unit by the knowledge amount estimation information generation unit. Intent text and used vocabulary feature quantity, and the text data that is the target of extraction of the intended feature quantity and used vocabulary feature quantity By using, as learning data, knowledge amount information given in advance as indicating the knowledge amount of the inquirer assumed from the dialogue indicated by the data, the estimation information used for estimating the knowledge amount corresponding to the text data is Generated.

すなわち、本発明では、問合せ者の知識量の推定に、対話における問合せ者と回答者とでやりとりされる質問、説明に該当する発話者の意図を示す情報（発話意図）と、テキストデータにおける問合せ者の使用単語の種類の総数である使用語彙特徴量とを適用しており、これによって、従来の使用語彙の専門性に着目して問合せ者の知識量を推定する技術における誤認識に伴う不具合を解決することができるようにしている。 That is, in the present invention, the amount of knowledge of the inquirer, the question exchanged between the inquirer and the respondent in the dialogue, information indicating the intention of the speaker corresponding to the explanation (utterance intention), and the inquiry in the text data Vocabulary features, which is the total number of types of words used by the user, and this results in a problem associated with misrecognition in the technique for estimating the knowledge amount of the inquirer by focusing on the expertise of the conventional vocabulary To be able to solve.

なお、前記学習データには、前記意図特徴量及び使用語彙特徴量の抽出の対象とされた問合せ者と回答者との対話を書き起こしたテキストデータや、問合せ者と回答者との対話の音声データから音声認識処理を行って作成したテキストデータ等が含まれる。知識量情報は、例えば、テキストデータ及び問合せ者と回答者との対話の音声データの少なくとも何れか一方を用いて事前に人手で付与しておく。 The learning data includes text data that transcribes the dialogue between the inquirer and the respondent who are the target of extraction of the intended feature amount and the used vocabulary feature amount, and voice of the dialogue between the inquirer and the respondent. Text data created by performing speech recognition processing from data is included. For example, the knowledge amount information is manually given in advance using at least one of text data and voice data of dialogue between the inquirer and the respondent.

このように、請求項１に記載の知識量推定情報生成装置によれば、問合せ者と回答者との対話における発話列のテキストデータから、相手への質問、説明に該当する発話意図を示す発話を判別し、判別した発話が該当する発話意図に基づき、前記テキストデータにおける当該発話意図の発生状態に関する特徴量を示す意図特徴量を抽出すると共に、テキストデータから問合せ者が使用した単語の種類の総数を使用語彙特徴量として抽出し、抽出した意図特徴量及び使用語彙特徴量と、当該意図特徴量の抽出の対象とされたテキストデータに対して当該テキストデータにより示される対話から想定される前記問合せ者の知識量を示すものとして予め付与された知識量情報とを学習データとして用いることで、前記テキストデータに対応する知識量の推定に用いる推定情報を生成しているので、この推定情報を用いることで、対話の対象となった事項に関する問合せ者の知識量の推定を精度良く行うことができる。 Thus, according to the knowledge amount estimation information generating apparatus according to claim 1, the utterance indicating the utterance intention corresponding to the question and explanation to the other party from the text data of the utterance string in the dialogue between the inquirer and the respondent. Based on the utterance intention to which the determined utterance corresponds, an intention feature amount indicating a feature amount related to the occurrence state of the utterance intention in the text data is extracted, and the type of the word used by the inquirer from the text data is extracted. The total number is extracted as the used vocabulary feature amount, and the extracted intention feature amount and the used vocabulary feature amount, and the dialogue assumed by the text data targeted for the extraction of the intended feature amount are assumed from the dialog Knowledge corresponding to the text data by using, as learning data, knowledge amount information given in advance to indicate the knowledge amount of the inquirer Since generates estimated information used for the estimation, by using the estimated information may be queried's knowledge of the estimated related matters were subject to dialogue accurately.

また、本発明は、請求項２に記載した発明のように、前記発話意図判別手段は、前記テキストデータから、前記相手への質問、説明に加えて、相槌に該当する発話意図を示す発話を判別するものとしても良い。 Further, according to the present invention, as in the invention described in claim 2, the utterance intention determination means generates an utterance indicating the utterance intention corresponding to the conflict in addition to the question and explanation to the other party from the text data. It may be determined.

また、本発明は、請求項３に記載した発明のように、前記意図特徴量は、前記問合せ者の質問回数、前記回答者の質問回数、前記問合せ者の説明回数、前記回答者の説明回数を含むものとしても良い。 Further, according to the present invention, as in the invention described in claim 3, the intention feature amount includes the number of questions of the inquirer, the number of questions of the respondent, the number of explanations of the inquirer, and the number of explanations of the respondent. May be included.

また、本発明は、請求項４に記載した発明のように、前記意図特徴量は、さらに前記問合せ者の相槌回数、及び前記回答者の相槌回数の少なくとも１つを含むものとしても良い。 Further, according to the present invention, as in the invention described in claim 4, the intention feature amount may further include at least one of the inquiry number of the inquirer and the inquiry number of the respondent.

また、本発明は、請求項５に記載した発明のように、前記特徴量抽出手段は、前記質問に該当する発話に疑問詞が含まれる発話を疑問詞疑問文として抽出し、前記意図特徴量は、前記問合せ者の質問回数、前記回答者の質問回数、前記問合せ者の説明回数、前記回答者の説明回数を含むと共に、前記問合せ者の相槌回数、前記回答者の相槌回数、前記問合せ者の疑問詞疑問文の回数、及び前記回答者の疑問詞疑問文の回数の少なくとも１つを含むものとしても良い。 Further, according to the present invention, as in the invention described in claim 5, the feature amount extraction unit extracts an utterance including a questionable word in an utterance corresponding to the question as a questionable question sentence, and the intended feature amount Includes the number of questions of the inquirer, the number of questions of the respondent, the number of explanations of the inquirer, the number of explanations of the respondent, and the number of inquiries of the inquirer, the number of inquiries of the respondent, the inquirer It is also possible to include at least one of the number of interrogative question sentences and the number of interrogative question sentences of the respondent.

前記問合せ者の質問回数、前記回答者の質問回数、前記問合せ者の説明回数、前記回答者の説明回数、前記問合せ者の相槌回数、前記回答者の相槌回数、前記問合せ者の疑問詞疑問文の回数、及び前記回答者の疑問詞疑問文の回数などの意図特徴量は、何れも比較的容易にかつ高精度で抽出することができるため、他の特徴量を適用する場合に比較して、より簡易かつ高精度で、対話の対象となった事項に対する問合せ者の知識量の推定情報を生成することができる。 The number of questions of the inquirer, the number of questions of the respondent, the number of times of explanation of the inquirer, the number of times of explanation of the respondent, the number of times of inquiry of the inquirer, the number of times of inquiry of the respondent, the questioning question of the inquirer Intention feature quantities such as the number of times and the number of interrogative question sentences of the respondent can be extracted relatively easily and with high precision, compared to the case where other feature quantities are applied. Thus, it is possible to generate the estimated information of the inquirer's knowledge amount with respect to the matter that is the object of the dialogue, with simpler and higher accuracy.

一方、上記目的を達成するために、請求項６に記載の知識量推定装置は、請求項１から請求項５のいずれか１項に記載の知識量推定情報生成装置で生成された前記推定情報を予め記憶した記憶手段と、知識量の推定対象とする問合せ者と回答者との対話における発話列のテキストデータから、相手への質問、説明に該当する発話意図を示す発話を判別する推定対象発話意図判別手段と、前記推定対象発話意図判別手段で判別された前記発話が該当する前記発話意図に基づき、前記テキストデータにおける前記発話意図の発生状態に関する特徴量を示す推知対象意図特徴量を抽出すると共に、前記テキストデータから前記問合せ者が使用した単語の種類の総数を使用語彙特徴量として抽出する推定対象特徴量抽出手段と、前記推定対象特徴量抽出手段で抽出された推定対象意図特徴量及び推定対象使用語彙特徴量と、前記記憶手段に記憶された前記推定情報とを用いて、推定対象とする前記テキストデータに対応する前記知識量を推定する知識量推定手段と、を備えている。 On the other hand, in order to achieve the above object, the knowledge amount estimation device according to claim 6 is the estimation information generated by the knowledge amount estimation information generation device according to any one of claims 1 to 5. Predictive object that distinguishes utterances that indicate utterance intentions that correspond to questions and explanations to the other party from the text data of the utterance string in the dialogue between the inquirer and the respondent as the subject of estimation of the knowledge amount Based on the utterance intention to which the utterance determined by the utterance intention determination unit and the estimation target utterance intention determination unit corresponds, an extraction target intention feature amount indicating a feature amount related to the generation state of the utterance intention in the text data is extracted. And an estimation target feature amount extraction means for extracting the total number of types of words used by the inquirer from the text data as a used vocabulary feature amount, and the estimation target feature amount extraction The knowledge amount corresponding to the text data to be estimated is estimated using the estimation target intention feature value and the estimation target use vocabulary feature value extracted by the means and the estimation information stored in the storage means. Knowledge amount estimation means.

請求項６に記載の知識量推定装置によれば、記憶手段により、請求項１から請求項５のいずれか１項に記載の知識量推定情報生成装置で生成された推定情報が予め記憶され、推定対象発話意図判別手段により、知識量の推定対象とする問合せ者と回答者との対話における発話列のテキストデータから、相手への質問、説明に該当する発話意図を示す発話が判別され、推定対象特徴量抽出手段により、前記推定対象発話意図判別手段で判別された前記発話が該当する前記発話意図に基づき、前記テキストデータにおける前記発話意図の発生状態に関する特徴量を示す推定対象意図特徴量が抽出されると共に、前記テキストデータから前記問合せ者が使用した単語の種類の総数が推定対象使用語彙特徴量として抽出され、推定対象知識量推定手段により、前記推定対象特徴量抽出手段で抽出された推定対象意図特徴量及び推定対象使用語彙特徴量と、前記記憶手段に記憶された前記推定情報とを用いて、推定対象とする前記テキストデータに対応する前記知識量が推定される。 According to the knowledge amount estimation device according to claim 6, the storage unit stores in advance the estimation information generated by the knowledge amount estimation information generation device according to any one of claims 1 to 5, The estimation target utterance intention discriminating means discriminates the utterance indicating the utterance intention corresponding to the question and explanation to the other party from the text data of the utterance string in the dialogue between the inquirer and the respondent whose knowledge is to be estimated. Based on the utterance intention corresponding to the utterance determined by the estimation target utterance intention determination unit by the target feature amount extraction unit, an estimation target intention feature amount indicating a feature amount related to the generation state of the utterance intention in the text data is obtained. A total number of types of words used by the inquirer is extracted from the text data as an estimation target use vocabulary feature amount, and an estimation target knowledge amount estimation means From the estimation target intention feature quantity and the estimation target used vocabulary feature quantity extracted by the estimation target feature quantity extraction unit, and the estimation information stored in the storage unit, the text data to be estimated The corresponding knowledge amount is estimated.

このように、請求項６に記載の知識量推定装置によれば、請求項１から請求項５のいずれか１項に記載の知識量推定情報生成装置で生成された推定情報を予め記憶し、知識量の推定対象とする問合せ者と回答者との対話における発話列のテキストデータから、相手への質問、説明に該当する発話意図を示す発話を判別し、判別した発話が該当する前記発話意図に基づき、前記テキストデータにおける前記発話意図の発生状態に関する特徴量を示す推定対象意図特徴量を抽出すると共に、当該テキストデータから問合せ者が使用した単語の種類の総数を推定対象使用語彙特徴量として抽出し、抽出された推定対象意図特徴量及び推定対象使用語彙特徴量と、記憶手段に記憶した推定情報とを用いて、推定対象とするテキストデータに対応する知識量を推定しているので、対話の対象となった事項に関する問合せ者の知識量の推定を精度良く行うことができる。 Thus, according to the knowledge amount estimation device according to claim 6, the estimation information generated by the knowledge amount estimation information generation device according to any one of claims 1 to 5 is stored in advance, The utterance intention indicating the utterance intention corresponding to the question and explanation to the other party is determined from the text data of the utterance string in the dialogue between the inquirer and the respondent as the knowledge amount estimation target, and the utterance intention corresponding to the determined utterance Based on the above, the estimation target intention feature amount indicating the feature amount regarding the occurrence state of the utterance intention in the text data is extracted, and the total number of types of words used by the inquirer from the text data is used as the estimation target use vocabulary feature amount Using the extracted and extracted estimation target intention feature quantity and estimation target use vocabulary feature quantity and the estimation information stored in the storage means, the knowledge corresponding to the text data to be estimated is extracted. Since the estimated amount can be accurately query's knowledge of the estimated regarding matters subject to interaction.

また、本発明は、請求項７に記載した発明のように、前記推定対象発話意図判別手段は、前記テキストデータから、前記相手への質問、説明に加えて、相槌に該当する発話意図を示す発話を判別するものとしても良い。 Further, according to the present invention, as in the invention described in claim 7, the estimation target utterance intention determination means indicates the utterance intention corresponding to the conflict in addition to the question and explanation to the other party from the text data. It is good also as what distinguishes utterance.

また、本発明は、請求項８に記載した発明のように、前記推定対象意図特徴量は、前記問合せ者の質問回数、前記回答者の質問回数、前記問合せ者の説明回数、前記回答者の説明回数を含むものとしても良い。 Further, according to the present invention, as in the invention described in claim 8, the estimation target intention feature amount includes the number of questions of the inquirer, the number of questions of the respondent, the number of explanations of the inquirer, The number of times of explanation may be included.

また、本発明は、請求項９に記載した発明のように、前記推定対象意図特徴量は、さらに前記問合せ者の相槌回数、及び前記回答者の相槌回数の少なくとも１つを含むものとしても良い。 Further, according to the present invention, as in the invention described in claim 9, the estimation target intention feature amount may further include at least one of the number of inquiries of the inquirer and the number of inquiries of the respondent. .

また、本発明は、請求項１０に記載した発明のように、前記推定対象特徴量抽出手段は、前記質問に該当する発話に疑問詞が含まれる発話を疑問詞疑問文として抽出し、前記意図特徴量は、前記問合せ者の質問回数、前記回答者の質問回数、前記問合せ者の説明回数、前記回答者の説明回数を含むと共に、前記問合せ者の相槌回数、前記回答者の相槌回数、前記問合せ者の疑問詞疑問文の回数、及び前記回答者の疑問詞疑問文の回数の少なくとも１つを含むものとしても良い。 Further, according to the present invention, as in the invention described in claim 10, the estimation target feature amount extraction unit extracts an utterance including a questionable word in an utterance corresponding to the question as a questionable question sentence, and the intention The feature amount includes the number of questions of the inquirer, the number of questions of the respondent, the number of times of explanation of the inquirer, the number of explanations of the respondent, and the number of times of inquiry of the inquirer, the number of times of inquiry of the respondent, It may include at least one of the number of interrogator question sentences of the inquirer and the number of questionant question sentences of the respondent.

なお、請求項６から請求項１０のいずれか１項に記載の知識量推定装置における推定対象発話意図判別手段と推定対象特徴量抽出手段は、請求項１から請求項５のいずれか１項に記載の知識量推定情報生成装置における発話意図判別手段と特徴量抽出手段を共有して用いることができる。 In addition, the estimation target utterance intention determination unit and the estimation target feature amount extraction unit in the knowledge amount estimation device according to any one of claims 6 to 10 are described in any one of claims 1 to 5. The utterance intention determination unit and the feature amount extraction unit in the described knowledge amount estimation information generation device can be used in common.

前記問合せ者の質問回数、前記回答者の質問回数、前記問合せ者の説明回数、前記回答者の説明回数、前記問合せ者の相槌回数、前記回答者の相槌回数、前記問合せ者の疑問詞疑問文の回数、及び前記回答者の疑問詞疑問文の回数などの意図特徴量は、何れも比較的容易にかつ高精度で抽出することができるため、他の特徴量を適用する場合に比較して、より簡易かつ高精度で、対話の対象となった事項に対する問合せ者の知識量の推定を行うことができる。 The number of questions of the inquirer, the number of questions of the respondent, the number of times of explanation of the inquirer, the number of times of explanation of the respondent, the number of times of inquiry of the inquirer, the number of times of inquiry of the respondent, the questioning question of the inquirer Intention feature quantities such as the number of times and the number of interrogative question sentences of the respondent can be extracted relatively easily and with high precision, compared to the case where other feature quantities are applied. Therefore, it is possible to estimate the amount of knowledge of the inquirer for the subject matter that is the object of the dialogue more simply and with high accuracy.

一方、上記目的を達成するために、請求項１１に記載の知識量推定情報生成方法は、発話意図判別手段、特徴量抽出手段、及び知識量推定情報生成手段を含む知識量推定情報生成装置における知識量推定情報生成方法であって、前記発話意図判別手段によって、問合せ者と回答者との対話における発話列のテキストデータから、相手への質問、説明に該当する発話の意図を示す発話を判別する発話意図判別ステップと、前記特徴量抽出手段によって、前記発話意図判別ステップで判別された前記発話の、前記テキストデータにおける発生状態に関する特徴量を示す意図特徴量を抽出すると共に、前記テキストデータから前記問合せ者が使用した単語の種類の総数を使用語彙特徴量として抽出する特徴量抽出ステップと、前記知識量推定情報生成手段によって、前記特徴量抽出ステップで抽出された意図特徴量と使用語彙特徴量、及び当該意図特徴量と当該使用語彙特徴量の抽出の対象とされた前記テキストデータに対して当該テキストデータにより示される対話から想定される前記問合せ者の知識量を示すものとして予め付与された知識量情報を学習データとして用いることで、前記テキストデータに対応する前記知識量の推定に用いる推定情報を生成する知識量推定情報生成ステップと、を備えている。 On the other hand, in order to achieve the above object, a knowledge amount estimation information generation method according to claim 11 is a knowledge amount estimation information generation apparatus including an utterance intention determination unit, a feature amount extraction unit, and a knowledge amount estimation information generation unit. A knowledge amount estimation information generating method, wherein the utterance intention discriminating means discriminates an utterance indicating intention of an utterance corresponding to a question to a partner and an explanation from text data of an utterance string in a dialogue between an inquirer and a respondent. An intention feature amount indicating a feature amount related to an occurrence state in the text data of the utterance determined in the utterance intention determination step by the feature amount extraction unit and the feature amount extraction means, and from the text data a feature amount extraction step of extracting a total number of word types that the querying party is used as the use lexical feature quantity, the amount of knowledge estimation information producing By means indicated, the feature quantity extraction intended feature amount extracted at step uses lexical feature amounts, and by the text data to target have been the text data extraction of the intended features and the use lexical feature quantity Knowledge that generates estimated information used to estimate the knowledge amount corresponding to the text data by using, as learning data, knowledge amount information given in advance as an indication of the knowledge amount of the inquirer assumed from the dialogue A quantity estimation information generation step.

従って、請求項１１に記載の知識量推定情報生成方法によれば、請求項１に記載の発明と同様に作用するので、請求項１に記載の発明と同様に、生成した推定情報を用いることで、対話の対象となった事項に対する問合せ者の知識量の推定を精度良く行うことができる。 Therefore, according to the knowledge amount estimation information generation method described in claim 11, the method operates in the same manner as in the invention described in claim 1, so that the generated estimation information is used as in the invention described in claim 1. Thus, it is possible to accurately estimate the amount of knowledge of the inquirer with respect to the matter that is the subject of the dialogue.

一方、上記目的を達成するために、請求項１２に記載の知識量推定方法は、推定対象発話意図判別手段、推定対象特徴量抽出手段、及び知識量推定手段を含む知識量推定装置における知識量推定方法であって、請求項１１記載の知識量推定情報生成方法で生成された前記推定情報を予め記憶装置に記憶する記憶ステップと、前記推定対象発話意図判別手段によって、知識量の推定対象とする問合せ者と回答者との対話における発話列のテキストデータから、相手への質問、説明に該当する発話の意図を示す発話を判別する推定対象発話意図判別ステップと、前記推定対象特徴量抽出手段によって、前記推定対象発話意図判別ステップで判別された前記発話の、前記テキストデータにおける発生状態に関する特徴量を示す推定対象意図特徴量を抽出すると共に、前記テキストデータから前記問合せ者が使用した単語の種類の総数を推定対象使用語彙特徴量として抽出する推定対象特徴量抽出ステップと、前記知識量推定手段によって、前記推定対象特徴量抽出ステップで抽出された推定対象意図特徴量及び推定対象使用語彙特徴量と、前記記憶装置に記憶された前記推定情報とを用いて、推定対象とする前記テキストデータに対応する前記知識量を推定する知識量推定ステップと、を備えている。 On the other hand, in order to achieve the above object, a knowledge amount estimation method according to claim 12 is the knowledge amount in the knowledge amount estimation device including the estimation target utterance intention determination means, the estimation target feature amount extraction means, and the knowledge amount estimation means. a method for estimating a storing step of storing in advance in the storage device the estimated information generated by claim 11, wherein the knowledge estimation information generation method, by the estimated target speech intention determining means, the estimated target knowledgeable An estimation target utterance intention determination step for determining an utterance indicating the intention of the utterance corresponding to the question and explanation to the other party, from the text data of the utterance sequence in the dialogue between the inquirer and the respondent, and the estimation target feature quantity extraction means Accordingly, the estimation target speech intention determination the speech is determined in step, the estimation target intent feature quantity indicating a feature amount relating to occurrence of said text data While leaving the estimation target feature quantity extracting the total number of word types that the querying party from the text data is used as the estimated target using lexical feature quantity, by the knowledge amount estimating means, the estimation target feature quantity extraction The knowledge amount corresponding to the text data to be estimated is estimated using the estimation target intention feature amount and the estimation target use vocabulary feature amount extracted in the step and the estimation information stored in the storage device. A knowledge amount estimation step.

従って、請求項１２に記載の知識量推定方法によれば、請求項６に記載の発明と同様に作用するので、請求項６に記載の発明と同様に、対話の対象となった事項に対する問合せ者の知識量の推定を精度良く行うことができる。 Therefore, according to the knowledge amount estimation method described in claim 12, since it operates in the same manner as the invention described in claim 6, as in the invention described in claim 6, inquiries regarding matters subject to dialogue are obtained. The amount of knowledge of the person can be estimated with high accuracy.

一方、上記目的を達成するために、請求項１３に記載のプログラムによれば、コンピュータを請求項１から請求項５のいずれか１項に記載の知識量推定情報生成装置または請求項６から請求項１０のいずれか１項に記載の知識量推定装置と同様に作用させることができるので、当該知識量推定情報生成装置または当該知識量推定装置と同様に、対話の対象となった事項に対する問合せ者の知識量の推定を精度良く行うことができる。 On the other hand, in order to achieve the above object, according to the program according to claim 13, the computer according to claim 1, or the knowledge amount estimation information generating apparatus according to claim 1 or claim 6. Since it can be operated in the same manner as the knowledge amount estimation device according to any one of items 10, an inquiry about an item that has been a target of dialogue, similarly to the knowledge amount estimation information generation device or the knowledge amount estimation device. The amount of knowledge of the person can be estimated with high accuracy.

問合せ者と回答者との対話における発話意図の発生状態に関する特徴量を用いて該対話の対象となった事項に関する問合せ者の知識量を推定することで、使用語彙や会話内容の分野によらず、問合せ者の知識量を高精度に推定することができる。また、使用語彙に依存しない質問の発生状態に関する特徴量を用いるため、音声認識の誤りによる影響を受けにくい。使用語彙や会話内容の分野ごとに学習データを用意する必要もない。 Regardless of the vocabulary used or the content of the conversation, by estimating the amount of knowledge of the inquirer regarding the subject matter of the conversation using the characteristic amount related to the state of the intention of utterance in the conversation between the inquirer and the respondent The amount of knowledge of the inquirer can be estimated with high accuracy. In addition, since the feature amount related to the occurrence state of the question that does not depend on the vocabulary used is used, the feature amount is not easily affected by a speech recognition error. There is no need to prepare learning data for each field of vocabulary or conversation content.

実施の形態に係る知識量推定装置の機能的な構成例を示すブロック図（一部流れ図）である。It is a block diagram (partial flowchart) which shows the functional structural example of the knowledge amount estimation apparatus which concerns on embodiment. 実施の形態に係る知識量推定装置のコンピュータ構成例を示すブロック図である。It is a block diagram which shows the computer structural example of the knowledge amount estimation apparatus which concerns on embodiment. 実施の形態に係る知識量推定装置により実行される学習処理プログラムの処理の流れを示すフローチャートである。It is a flowchart which shows the flow of a process of the learning process program performed by the knowledge amount estimation apparatus which concerns on embodiment. 実施の形態に係る知識量推定装置により実行される推定処理プログラムの処理の流れを示すフローチャートである。It is a flowchart which shows the flow of a process of the estimation process program performed by the knowledge amount estimation apparatus which concerns on embodiment. 実施の形態に係る知識量推定装置で用いられる質問パターン情報の一例を示す模式図である。It is a schematic diagram which shows an example of the question pattern information used with the knowledge amount estimation apparatus which concerns on embodiment. 実施の形態に係る知識量推定装置で用いられる説明パターン情報の一例を示す模式図である。It is a schematic diagram which shows an example of the description pattern information used with the knowledge amount estimation apparatus which concerns on embodiment. 実施の形態に係る知識量推定装置で用いられる相槌パターン情報の一例を示す模式図である。It is a schematic diagram which shows an example of the conflict pattern information used with the knowledge amount estimation apparatus which concerns on embodiment. 実施の形態に係る知識量推定装置で用いられる疑問詞パターン情報の一例を示す模式図である。It is a schematic diagram which shows an example of the interrogative pattern information used with the knowledge amount estimation apparatus which concerns on embodiment. 実施の形態に係る知識量推定装置の動作の説明に供する図であり、発話列からなるテキストデータの一例を示す模式図である。It is a figure with which it uses for description of operation | movement of the knowledge amount estimation apparatus which concerns on embodiment, and is a schematic diagram which shows an example of the text data which consists of an utterance row | line | column. 実施の形態に係る知識量推定装置の動作の説明に供する図であり、判定基準情報の一例を示す模式図である。It is a figure with which it uses for description of operation | movement of the knowledge amount estimation apparatus which concerns on embodiment, and is a schematic diagram which shows an example of criteria information. 実施の形態に係る知識量推定装置の動作の説明に供する図であり、評価結果情報の一例を示す模式図である。It is a figure with which it uses for description of operation | movement of the knowledge amount estimation apparatus which concerns on embodiment, and is a schematic diagram which shows an example of evaluation result information. 実施の形態に係る知識量推定装置の動作の説明に供する図であり、評価結果情報の他の例を示す模式図である。It is a figure with which it uses for description of operation | movement of the knowledge amount estimation apparatus which concerns on embodiment, and is a schematic diagram which shows the other example of evaluation result information.

以下、図面を参照して、本発明の実施の形態を詳細に説明する。なお、本例では、コンタクトセンタにおけるオペレータとユーザとの対話（以下コンタクトセンタ通話という）の音声認識結果を入力として、ユーザの知識量を推定することを例として説明を行う。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. In this example, an explanation will be given of an example in which the amount of knowledge of a user is estimated by using a voice recognition result of an interaction between an operator and a user in a contact center (hereinafter referred to as a contact center call) as an input.

本実施の形態に係る知識量推定装置１０は、図１に示される機能的な構成を備えており、図２に示されるコンピュータ構成を有している。そこで、まず、図２を参照してコンピュータの構成を説明する。 The knowledge amount estimation apparatus 10 according to the present embodiment has the functional configuration shown in FIG. 1, and has the computer configuration shown in FIG. First, the configuration of the computer will be described with reference to FIG.

図２に示すように、本実施の形態に係る知識量推定装置１０は、知識量推定装置１０全体の動作を司るＣＰＵ（Central Processing Unit；中央処理装置）２２と、ＣＰＵ２２による各種プログラムの実行時のワークエリア等として用いられるＲＡＭ（Random Access Memory）２４と、各種制御プログラムや各種パラメータ等が予め記憶されたＲＯＭ（Read Only Memory）２６と、各種情報を記憶するために用いられるハードディスク２８（図中、「ＨＤＤ」２８と記載）と、キーボード１４、マウス１６、及びディスプレイ１８と、外部に接続された装置との間の各種情報の授受を司る外部インタフェース３０（図中、「外部Ｉ／Ｆ」３０と記載）と、を備えており、これら等がシステムバスＢＵＳにより相互に接続されて構成されている。なお、外部インタフェース３０には例えば対話でやりとりされる音声データを入出力するヘッドセット等の通話装置５０が接続されている。 As shown in FIG. 2, the knowledge amount estimation device 10 according to the present embodiment includes a CPU (Central Processing Unit) 22 that controls the operation of the entire knowledge amount estimation device 10, and when the CPU 22 executes various programs. RAM (Random Access Memory) 24 used as a work area, a ROM (Read Only Memory) 26 in which various control programs and various parameters are stored in advance, and a hard disk 28 (see FIG. The external interface 30 (referred to as “External I / F” in the figure) is responsible for exchanging various types of information among the “HDD” 28), the keyboard 14, mouse 16, display 18, and externally connected devices. These are connected to each other by a system bus BUS. Note that the external interface 30 is connected to a communication device 50 such as a headset for inputting / outputting voice data exchanged through dialogue, for example.

ＣＰＵ２２は、ＲＡＭ２４、ＲＯＭ２６、及びハードディスク２８に対するアクセス、キーボード１４及びマウス１６を介した各種情報の取得、ディスプレイ１８に対する各種情報の表示、及び外部インタフェース３０に接続された通話装置５０を介してやりとりされる音声データからなる対話情報の入出力等を、各々行うことができる。 The CPU 22 accesses the RAM 24, the ROM 26 and the hard disk 28, acquires various information via the keyboard 14 and the mouse 16, displays various information on the display 18, and exchanges via the call device 50 connected to the external interface 30. Input / output of dialogue information consisting of voice data can be performed.

ＣＰＵ２２が、ハードディスク２８に記憶された本発明に係る知識量推定装置としての処理を制御するプログラムを、ＲＡＭ２４に読み込み実行することにより、図１に示す本発明に係る知識量推定装置１０における各処理部の機能が実行される。 Each process in the knowledge amount estimation device 10 according to the present invention shown in FIG. 1 is executed by the CPU 22 reading a program for controlling the processing as the knowledge amount estimation device according to the present invention stored in the hard disk 28 into the RAM 24 and executing it. Part functions are executed.

図１に示すように、本実施の形態に係る知識量推定装置１０においては、発話列抽出部１、発話意図判別部２、特徴量抽出部３、推定情報生成部４、知識量推定部５が、プログラムされたコンピュータ処理により実現される機能として設けられている。 As shown in FIG. 1, in the knowledge amount estimation device 10 according to the present embodiment, an utterance sequence extraction unit 1, an utterance intention determination unit 2, a feature amount extraction unit 3, an estimated information generation unit 4, and a knowledge amount estimation unit 5 Are provided as functions realized by programmed computer processing.

このような構成からなる本実施の形態に係る知識量推定装置１０における処理部は、大きく分けて「学習部」と「推定部」からなる。 The processing unit in the knowledge amount estimation apparatus 10 according to the present embodiment configured as described above is roughly divided into a “learning unit” and an “estimation unit”.

学習部は、発話列抽出部１、発話意図判別部２、特徴量抽出部３、推定情報生成部４を含み、予め学習用に用意された、問合せ者（以下、ユーザとも記載する）と回答者（以下、オペレータとも記載する）との対話に対する音声認識結果（学習用通話）７の対話情報が入力されると、まず、発話列抽出部１において、発話列からなるテキストデータを抽出する。 The learning unit includes an utterance string extraction unit 1, an utterance intention determination unit 2, a feature amount extraction unit 3, and an estimation information generation unit 4, and is prepared as an inquirer (hereinafter also referred to as a user) and a reply. When dialogue information of a speech recognition result (learning call) 7 for dialogue with a person (hereinafter also referred to as an operator) is input, first, the utterance string extraction unit 1 extracts text data composed of the utterance string.

次に、発話意図判別部２において、発話列抽出部１により抽出された問合せ者と回答者との対話における発話列のテキストデータについて、各発話毎に、詳細を後述する「質問」、「説明」、「相槌」に該当する発話の意図を示す発話（以下、発話意図とも記載する）を判別する。ここで判別された発話意図は、判別対象の発話と対応付けられて出力される。 Next, in the utterance intention discriminating section 2, the text data of the utterance string in the dialogue between the inquirer and the respondent extracted by the utterance string extracting section 1 is described in detail for each utterance "question" and "explanation". ”,“ Utterance ”, and an utterance indicating the intention of the utterance (hereinafter also referred to as the utterance intention) are determined. The speech intention determined here is output in association with the speech to be determined.

さらに、特徴量抽出部３において、詳細を後述する「問合せ者の使用語彙の異なり語数」を算出すると共に、発話意図判別部２で判別された「質問」、「説明」、「相槌」それぞれの発話意図の出現回数を算出する。さらに、「質問」発話のうち、疑問詞を含む発話を「疑問詞疑問文」として抽出し、出現回数を算出する。算出された「問合せ者の使用語彙の異なり語数」、発話意図判別部２で判別された「質問」、「説明」、「相槌」それぞれの発話意図の出現回数、「疑問詞疑問文」の出現回数を当該テキストデータにおける発話意図の発生状態に関する特徴量を示す意図特徴量とする。なお、このような複数の特徴量は特徴量ベクトルと呼ばれる。特徴量抽出部３で出力される特徴量ベクトルの要素は、質問回数、説明回数、相槌回数、疑問詞疑問文の回数、及び、使用語彙の異なり語数となる。 Further, the feature amount extraction unit 3 calculates “the number of different words in the inquirer's vocabulary”, which will be described in detail later, and each of the “question”, “explanation”, and “consideration” determined by the utterance intention determination unit 2. The number of appearances of the utterance intention is calculated. Furthermore, among “question” utterances, utterances including question words are extracted as “question word question sentences”, and the number of appearances is calculated. The calculated “number of different vocabulary words of the inquirer”, the number of occurrences of the “question”, “explanation”, and “interaction” utterance intentions determined by the utterance intention determination unit 2, and the appearance of “question word question sentence” The number of times is an intention feature amount indicating a feature amount related to the state of occurrence of the utterance intention in the text data. Such a plurality of feature amounts are called feature amount vectors. The elements of the feature quantity vector output by the feature quantity extraction unit 3 are the number of questions, the number of explanations, the number of questions, the number of interrogative questions, and the number of words used in different vocabularies.

そして、推定情報生成部４において、特徴量抽出部３で抽出された意図特徴量４ａ（図中、「特徴量４ａ」と記載）、及び当該意図特徴量４ａの抽出の対象とされたテキストデータに対して当該テキストデータにより示される対話から想定される問合せ者の知識量を示すものとして予め付与された知識量情報４ｂ（図中、「知識量ラベル４ｂ」と記載）を学習データとして用いることで、モデル生成機能４ｃ（図中、「知識量と特徴量を対応付けるモデル生成４ｃ」と記載）により、テキストデータに対応する知識量の推定に用いる推定情報を生成する。 Then, in the estimation information generation unit 4, the intention feature amount 4a extracted by the feature amount extraction unit 3 (described as “feature amount 4a” in the figure) and the text data that is the target of extraction of the intention feature amount 4a Use knowledge amount information 4b (denoted as “knowledge amount label 4b” in the figure) given in advance as an indication of the knowledge amount of the inquirer assumed from the dialogue indicated by the text data as learning data Thus, estimation information used to estimate the knowledge amount corresponding to the text data is generated by the model generation function 4c (described as “model generation 4c for associating the knowledge amount with the feature amount” in the drawing).

このように、本例の学習部は、本発明に係る知識量推定情報生成装置に相当する。そして、学習部において、このような処理を、比較的多数の予め用意された対話に対して行うことにより、推定情報をモデル情報５ａ（図中、「モデル５ａ」と記載）として生成し、ハードディスク２８等の記憶装置に記憶する。 Thus, the learning unit of this example corresponds to the knowledge amount estimation information generation device according to the present invention. Then, the learning unit performs such processing on a relatively large number of previously prepared dialogs, thereby generating estimation information as model information 5a (denoted as “model 5a” in the figure), and a hard disk It memorize | stores in memory | storage devices, such as 28.

一方、推定部は、学習部と共有する発話列抽出部１、発話意図判別部２、及び特徴量抽出部３と共に、知識量推定部５を含み、学習部（知識量推定情報生成装置）において生成され、ハードディスク２８等の記憶装置に記憶されたモデル情報５ａを用いて、推定の対象となる個別の対話における問合せ者の当該対話の事項に関する知識量を推定する。 On the other hand, the estimation unit includes a knowledge amount estimation unit 5 together with an utterance sequence extraction unit 1, an utterance intention determination unit 2, and a feature amount extraction unit 3 shared with the learning unit, and in the learning unit (knowledge amount estimation information generation device). Using the model information 5a that is generated and stored in a storage device such as the hard disk 28, the amount of knowledge regarding the matter of the inquirer in the individual dialogue to be estimated is estimated.

すなわち、推定の対象となる個別の対話である通話ｉの音声認識結果（推定対象）６が入力されると、まず、発話列抽出部１において、発話列からなるテキストデータを抽出する。なお、通話ｉの音声認識結果（推定対象）６は、認識精度が低すぎるものについては、用いなくても良い。 That is, when a speech recognition result (estimation target) 6 of a call i, which is an individual conversation to be estimated, is input, first, the utterance string extraction unit 1 extracts text data including the utterance string. Note that the speech recognition result (estimation target) 6 of the call i may not be used if the recognition accuracy is too low.

次に、発話意図判別部２において、発話列抽出部１により抽出された推定対象とする問合せ者と回答者との対話における発話列のテキストデータについて、各発話毎に、発話者（問合せ者、回答者）の「質問」、「説明」、「相槌」に該当する発話の意図を示す発話（推定対象発話意図）を判別する。ここで判別された発話意図は、判別対象の発話と対応付けられて出力される。 Next, in the utterance intention discriminating unit 2, the utterance string text data in the dialogue between the inquirer to be estimated and the respondent extracted by the utterance string extracting unit 1 is set for each utterance. An utterance (estimation target utterance intention) indicating the intention of the utterance corresponding to “question”, “explanation”, and “conflict” of the respondent) is determined. The speech intention determined here is output in association with the speech to be determined.

さらに、特徴量抽出部３において、当該テキストデータから問合せ者が使用した単語の種類の総数を推定対象使用語彙特徴量として抽出すると共に、発話意図判別部２で判別された、推定対象発話意図に含まれる「質問」、「説明」、「相槌」それぞれの出現回数を算出する。さらに、「質問」発話のうち、疑問詞を含む発話を「疑問詞疑問文」として抽出し、出現回数を算出する。算出された「問合せ者の使用語彙の異なり語数」、発話意図判別部２で判別された「質問」、「説明」、「相槌」それぞれの推定対象発話意図の出現回数、「疑問詞疑問文」の出現回数を、当該テキストデータにおける発話意図の発生状態に関する特徴量を示す推定対象意図特徴量とする。特徴量抽出部３で出力される特徴量ベクトルの要素は、質問回数、説明回数、相槌回数、疑問詞疑問文の回数、及び、使用語彙の異なり語数となる。 Further, the feature amount extraction unit 3 extracts the total number of types of words used by the inquirer from the text data as the estimation target used vocabulary feature amount, and also determines the estimation target utterance intention determined by the utterance intention determination unit 2. The number of appearances of each “question”, “explanation”, and “consideration” included is calculated. Furthermore, among “question” utterances, utterances including question words are extracted as “question word question sentences”, and the number of appearances is calculated. The calculated “number of different words in the inquirer's vocabulary”, the number of appearances of the estimated utterance intentions of “question”, “explanation”, and “consideration” determined by the utterance intention determination unit 2, “question question sentence” Is the estimation target intention feature value indicating the feature value related to the utterance intention occurrence state in the text data. The elements of the feature quantity vector output by the feature quantity extraction unit 3 are the number of questions, the number of explanations, the number of questions, the number of interrogative questions, and the number of words used in different vocabularies.

そして、知識量推定部５においては、照合機能５ｃ（図中、「モデルと照合して知識量を推定５ｃ」と記載）により、特徴量抽出部３で抽出された推定対象意図特徴量及び推定対象使用語彙特徴量と、学習部の推定情報生成部４により生成され、ハードディスク２８等の記憶装置にモデル情報５ａとして記憶された推定情報とを用いて、推定対象とするテキストデータに対応する知識量を推定する。 Then, in the knowledge amount estimation unit 5, the estimation target intended feature amount and the estimation extracted by the feature amount extraction unit 3 by the collation function 5 c (in the drawing, described as “estimate the knowledge amount by collating with the model 5 c”). Knowledge corresponding to text data to be estimated using the target vocabulary feature amount and the estimation information generated by the estimation information generation unit 4 of the learning unit and stored as model information 5a in the storage device such as the hard disk 28 Estimate the amount.

このようにして推定したテキストデータに対応する知識量が、推定対象として入力された通話ｉの話者（ユーザ）の知識量８（図中、「出力：通話ｉの知識量ラベル８」と記載）として出力される。 The knowledge amount corresponding to the text data estimated in this way is described as the knowledge amount 8 of the speaker (user) of the call i inputted as the estimation target (in the figure, “output: knowledge amount label 8 of the call i”). ) Is output.

なお、本例では、学習部と推定部において、共通な発話列抽出部１、発話意図判別部２、特徴量抽出部３については同一のものを使用しているが、学習部と推定部のそれぞれで異なる発話列抽出部１、発話意図判別部２、特徴量抽出部３を用いる構成としても良い。 In this example, the learning unit and the estimation unit use the same utterance sequence extraction unit 1, utterance intention determination unit 2, and feature quantity extraction unit 3, but the learning unit and the estimation unit are identical. It is good also as a structure which uses the utterance sequence extraction part 1, the utterance intention discrimination | determination part 2, and the feature-value extraction part 3 which are different in each.

また、本実施の形態に係る知識量推定装置１０では、図２に示す通話装置５０で入出力される音声データに対して、ＣＰＵ２２のプログラムされた処理により、音声認識処理を行いテキストファイルに変換し、通話ｉの音声認識結果（推定対象）６及び音声認識結果（学習用通話）７を生成する。 Further, in the knowledge amount estimation apparatus 10 according to the present embodiment, voice recognition processing is performed on voice data input / output by the call device 50 shown in FIG. Then, the speech recognition result (estimation target) 6 and the speech recognition result (learning call) 7 of the call i are generated.

そして、発話列抽出部１においては、テキストファイルである音声認識結果（学習用通話）７から、発話に該当する部分のみを取り出す。通常、音声認識結果は、時間情報や認識精度を示す情報などが含まれている場合が多いため、この処理が必要となる。 Then, the utterance string extraction unit 1 extracts only the portion corresponding to the utterance from the speech recognition result (learning call) 7 which is a text file. Usually, the speech recognition result often includes time information, information indicating recognition accuracy, and the like, so this processing is necessary.

もし、発話列抽出部１に入力されるデータが、音声認識結果を示すテキストファイルではなく、発話列からなるテキストデータが入力される場合には、発話列抽出部１での処理は不要である。 If the data input to the utterance string extraction unit 1 is not a text file indicating a speech recognition result, but text data including an utterance string is input, processing in the utterance string extraction unit 1 is not necessary. .

また、本例では、発話意図判別部２においては、対話におけるオペレータとユーザの全発話に対し、その発話が「質問」、「説明」、「相槌」であるかどうかを判別する。 Further, in this example, the utterance intention determination unit 2 determines whether the utterances are “question”, “explanation”, and “compatibility” for all utterances of the operator and the user in the dialogue.

また、特徴量抽出部３においては、発話意図判別部２で抽出された「質問」、「説明」、「相槌」それぞれの出現回数を算出し、さらに、「質問」発話のうち疑問詞を含む発話を「疑問詞疑問文」として抽出し、出現回数を算出して、学習部の場合には意図特徴量（推定部の場合であれば推定対象意図特徴量）として抽出すると共に、当該テキストデータから問合せ者が使用した単語の種類の総数を使用語彙特徴量及び推定対象使用語彙特徴量として抽出する。特徴量抽出部３で出力される特徴量ベクトルの要素は、質問回数、説明回数、相槌回数、疑問詞疑問文の回数、及び、使用語彙の異なり語数となる。 In addition, the feature amount extraction unit 3 calculates the number of appearances of each of “question”, “explanation”, and “consideration” extracted by the utterance intention determination unit 2, and further includes a question word in the “question” utterance. The utterance is extracted as “question word question sentence”, the number of appearances is calculated, and in the case of the learning unit, it is extracted as the intention feature amount (in the case of the estimation unit, the estimation target intention feature amount), and the text data The total number of types of words used by the inquirer is extracted as the used vocabulary feature amount and the estimation target used vocabulary feature amount. The elements of the feature quantity vector output by the feature quantity extraction unit 3 are the number of questions, the number of explanations, the number of questions, the number of interrogative questions, and the number of words used in different vocabularies.

また、推定情報生成部４における意図特徴量と知識量との対応付けのモデル化、及び知識量推定部５での、問合せ（ユーザ）側の話者の知識量の自動推定は、例えば、公知技術である「教師あり学習（Supervised learning）」の識別手法の１つである「Support Vector Machine （SVM）」などを用いる。 In addition, modeling of the association between the intention feature quantity and the knowledge quantity in the estimation information generation unit 4 and automatic estimation of the knowledge quantity of the speaker on the inquiry (user) side in the knowledge quantity estimation unit 5 are, for example, publicly known For example, “Support Vector Machine (SVM)” which is one of identification techniques of “Supervised learning” is used.

以下、図３と図４を用いて、本実施の形態に係る知識量推定装置１０の、発話列抽出部１、発話意図判別部２、特徴量抽出部３、推定情報生成部４、及び知識量推定部５による処理内容の詳細な説明を行う。 Hereinafter, using FIG. 3 and FIG. 4, the utterance string extraction unit 1, the utterance intention determination unit 2, the feature amount extraction unit 3, the estimated information generation unit 4, and the knowledge of the knowledge amount estimation device 10 according to the present embodiment. The details of the processing performed by the quantity estimation unit 5 will be described.

図３においては、本実施の形態に係る知識量推定装置１０の学習部における処理内容を示しており、まず、予め定められた所定量の音声データが入力されたか否かを判別し（ステップＳ１０１）、入力されると当該音声データを再生すると共に（ステップＳ１０２）、音声認識処理を行って音声データをテキストデータに書き起こし（ステップＳ１０３）、発話列抽出部１により、音声認識結果を書き起こしたテキストデータから、発話に該当する発話列のテキストデータを抽出する（ステップＳ１０４）。 FIG. 3 shows the processing contents in the learning unit of the knowledge amount estimation apparatus 10 according to the present embodiment. First, it is determined whether or not a predetermined amount of speech data has been input (step S101). When input, the voice data is reproduced (step S102), voice recognition processing is performed to transcribe the voice data into text data (step S103), and the speech string extraction unit 1 transcribes the voice recognition result. The text data of the speech string corresponding to the speech is extracted from the text data (step S104).

例えば、音声認識結果がxmlファイルであり、発話に該当する部分が<TEXT>と</TEXT>で囲んで示されている場合は、正規表現を用いたパターンマッチングで<TEXT>と</TEXT>で囲まれている部分を特定すれば発話を抽出することができる。 For example, if the speech recognition result is an xml file, and the part corresponding to the utterance is enclosed in <TEXT> and </ TEXT>, <TEXT> and </ TEXT If the part surrounded by> is specified, the utterance can be extracted.

次に、発話意図判別部２において、発話列抽出部１によって抽出された発話列からなるテキストデータに対して、入力された音声データにより示される対話で交わされるユーザとオペレータ間でやりとりされた各発話の発話意図が「質問」、「説明」、「相槌」であるか否かの判別処理を行う（ステップＳ１０５）。 Next, in the utterance intention determination unit 2, each of the text data composed of the utterance sequence extracted by the utterance sequence extraction unit 1 is exchanged between the user and the operator exchanged in the dialogue indicated by the input voice data. A determination process is performed to determine whether the utterance intention of the utterance is “question”, “explanation”, or “consideration” (step S105).

ここでの対話で交わされる各発話が「質問」、「説明」、「相槌」であるかどうかの判別は、例えば、「今村賢治、泉朋子、菊井玄一郎、佐藤理史、“述部機能表現の意味ラベルタガー”、言語処理学会第１７回年次大会発表論文集（２０１１年３月）」等に記載の公知の技術を用いて、統計的手法によって自動的に付与された機能表現の意味ラベルを利用して行っても良いし、予め用意された「質問」、「説明」、「相槌」の各発話意図を表す言語表現とのパターンマッチング技術によって行っても良い。 To determine whether each utterance exchanged in this dialogue is “question”, “explanation”, or “commitment”, for example, “Kenji Imamura, Atsuko Izumi, Genichiro Kikui, Satoshi Sato,“ Predicate Functional Expression Semantic label tagger, using the well-known techniques described in the 17th Annual Conference of the Language Processing Society of Japan (March 2011), etc. It may be performed by using a pattern matching technique with a language expression representing each utterance intention of “question”, “explanation”, and “consideration” prepared in advance.

本例では後者のパターンマッチング技術を採用し、詳細を後述する図５に示す質問パターン情報、図６に示す説明パターン情報、図７に示す相槌パターン情報、及び図８に示す疑問詞情報を用いて発話意図を判別する。 In this example, the latter pattern matching technique is adopted, and the question pattern information shown in FIG. 5, which will be described in detail later, the explanation pattern information shown in FIG. 6, the conflict pattern information shown in FIG. 7, and the question information shown in FIG. To determine the utterance intention.

例えば、図５の質問パターン情報として例示するように、「ですか。」「ますか。」などのパターンを用いて質問発話を判別する。なお、図５に示す質問パターン情報、図６に示す説明パターン情報、図７に示す相槌パターン情報、及び図８に示す疑問詞情報は、図２におけるハードディスク２８等に予め記憶され、ＲＡＭ２４に読み出されて用いられるものとする。 For example, as illustrated as the question pattern information in FIG. 5, the question utterance is determined using a pattern such as “? The question pattern information shown in FIG. 5, the explanation pattern information shown in FIG. 6, the conflict pattern information shown in FIG. 7, and the question information shown in FIG. 8 are stored in advance in the hard disk 28 in FIG. It shall be issued and used.

次に、特徴量抽出部３において、発話意図判別部２によって判別された各意図発話列から、ユーザとオペレータのそれぞれの発話列における「質問」、「説明」、「相槌」の出現回数を求め、さらに、発話意図が「質問」の発話中に、図８に示す疑問詞が出現する場合、その発話を「疑問詞疑問文」として判定し、その出現回数を求め、意図特徴量とすると共に、当該テキストデータから問合せ者が使用した単語の種類の総数を使用語彙特徴量として抽出する。意図特徴量及び使用語彙特徴量は、意図特徴量及び使用語彙特徴量を要素とする特徴量ベクトルとする（ステップＳ１０６）。特徴量抽出部３で出力される特徴量ベクトルの要素は、質問回数、説明回数、相槌回数、疑問詞疑問文の回数、及び、使用語彙の異なり語数となる。なお、このような複数の特徴量は特徴量ベクトルと呼ばれる。 Next, the feature quantity extraction unit 3 obtains the number of occurrences of “question”, “explanation”, and “compatibility” in each utterance sequence of the user and the operator from each intention utterance sequence determined by the utterance intention determination unit 2. Furthermore, when the interrogative shown in FIG. 8 appears during the utterance of “question” as the utterance intention, the utterance is determined as “interrogative question sentence”, the number of appearances is obtained, and the intention feature amount is obtained. The total number of types of words used by the inquirer is extracted from the text data as the used vocabulary feature amount. The intention feature quantity and the used vocabulary feature quantity are set as feature quantity vectors having the intention feature quantity and the used vocabulary feature quantity as elements (step S106). The elements of the feature quantity vector output by the feature quantity extraction unit 3 are the number of questions, the number of explanations, the number of questions, the number of interrogative questions, and the number of words used in different vocabularies. Such a plurality of feature amounts are called feature amount vectors.

そして、推定情報生成部４において、ステップＳ１０２の処理で再生された音声データに応じて、当該音声データにより示される対話に対して正解ラベルとしての知識量情報（知識量ラベル４ｂ）が入力されると（ステップＳ１０７）、特徴量抽出部３により抽出された意図特徴量及び使用語彙特徴量の特徴量ベクトルと、入力された知識量情報（知識量ラベル４ｂ）を１対１の組にして学習データとし、この学習データを入力として、公知のパターン識別手法である「Support Vector Machine （SVM）」を用いてモデルの学習を行うことで、特徴量から知識量の推定に用いるモデル情報５ａ（推定情報）を生成し（ステップＳ１０８）、その後に本処理を終了する。 Then, in the estimated information generation unit 4, knowledge amount information (knowledge amount label 4b) as a correct answer label is input to the dialogue indicated by the sound data in accordance with the sound data reproduced in the process of step S102. (Step S107), the feature quantity vector of the intention feature quantity and used vocabulary feature quantity extracted by the feature quantity extraction unit 3 and the input knowledge quantity information (knowledge quantity label 4b) are learned as a one-to-one pair. The model information 5a (estimation used for estimating the knowledge amount from the feature amount is obtained by using the learning data as input and learning the model using the “Support Vector Machine (SVM)” which is a known pattern identification method. Information) is generated (step S108), and then this processing is terminated.

「Support Vector Machine （SVM）」は２値の判別手法である。「Support Vector Machine （SVM）」を用いて知識量を２段階で推定する場合には、「知識量が小さい」に該当するか、それ以外かを判別するモデルを作成する。モデル作成の際には、例えば、「知識量が小さい」に該当する学習データを正の学習データとし、それ以外を負の学習データ（「知識量が大きい」に該当する学習データ）として学習を行う。 “Support Vector Machine (SVM)” is a binary discrimination method. When the knowledge amount is estimated in two stages using the “Support Vector Machine (SVM)”, a model for determining whether the knowledge amount is “small” or not is created. When creating a model, for example, learning data corresponding to “small knowledge amount” is set as positive learning data, and other learning data is negative learning data (learning data corresponding to “high knowledge amount”). Do.

「Support Vector Machine （SVM）」を用いて知識量を３段階で推定する場合には、学習により段階別に３つのモデルを作成する。「知識量が大きい」に該当するか、それ以外かを判別するモデル１、「知識量が中程度」に該当するか、それ以外かを判別するモデル２、「知識量が小さい」に該当するか、それ以外かを判別するモデル３、の３つのモデルである。例えば、モデル１では、「知識量が大きい」に該当する学習データを正の学習データとし、それ以外を負の学習データとして学習を行う。モデル２では、「知識量が中程度」に該当する学習データを正の学習データとし、それ以外を負の学習データとして学習を行う。モデル３では、「知識量が小さい」に該当する学習データを正の学習データとし、それ以外を負の学習データとして学習を行う。 When the knowledge amount is estimated in three stages using "Support Vector Machine (SVM)", three models are created for each stage by learning. Model 1 that discriminates whether “knowledge amount is large” or not, Model 2 that discriminates whether “knowledge amount is medium” or other, and “knowledge amount is small” Or model 3 for discriminating whether it is other than that. For example, in the model 1, learning is performed with learning data corresponding to “a large amount of knowledge” as positive learning data, and other learning data as negative learning data. In the model 2, learning is performed with learning data corresponding to “medium amount of knowledge” as positive learning data and other learning data as negative learning data. In the model 3, learning is performed with learning data corresponding to “small knowledge amount” as positive learning data and other learning data as negative learning data.

なお、本例では、パターン識別に「Support Vector Machine （SVM）」を用いてモデルの学習を行っているが、これに限定されるものではなく、例えば、バックプロパゲーション学習等によるニューラルネットワークを用いた学習を行うことでも良い。また、特徴量対知識量のテーブルを作成し、当該テーブルに基づいて知識量を判別する方法を用いても良い。なお、生成されたモデル情報５ａは、図２におけるハードディスク２８等の記憶装置に記憶される。 In this example, model learning is performed using “Support Vector Machine (SVM)” for pattern identification. However, the present invention is not limited to this. For example, a neural network using back propagation learning is used. It is also possible to do learning. Alternatively, a method of creating a feature amount vs. knowledge amount table and determining the knowledge amount based on the table may be used. The generated model information 5a is stored in a storage device such as the hard disk 28 in FIG.

また、本例では、「質問」、「説明」、「相槌」、及び「疑問詞疑問文」を発話意図として説明を行ったが、「質問」及び「説明」のみを発話意図として、実施しても良い。その場合、学習部の発話意図判別部２、特徴量抽出部３での処理、及び推定部の発話意図判別部２、特徴量抽出部３での処理が上記の例と異なる。 In this example, “question”, “explanation”, “conformity”, and “question word question sentence” are explained as utterance intentions, but only “question” and “explanation” are implemented as utterance intentions. May be. In that case, the processing in the utterance intention determination unit 2 and the feature amount extraction unit 3 of the learning unit and the processing in the utterance intention determination unit 2 and the feature amount extraction unit 3 of the estimation unit are different from the above example.

学習部では、発話意図判別部２において、発話列抽出部１により抽出された問合せ者と回答者との対話における発話列のテキストデータについて、各発話毎に、「質問」、「説明」に該当する発話意図を示す発話を判別して、判別された発話が該当する発話意図を判別対象の発話と対応付けて出力する。さらに、特徴量抽出部３において、「問合せ者の使用語彙の異なり語数」を算出すると共に、発話意図判別部２で判別された「質問」、「説明」それぞれの発話意図の出現回数を算出する。算出された「問合せ者の使用語彙の異なり語数」、発話意図判別部２で判別された「質問」、「説明」それぞれの発話意図の出現回数を当該テキストデータにおける発話意図の発生状態に関する特徴量を示す意図特徴量とする。特徴量抽出部３で出力される特徴量ベクトルの要素は、質問回数、説明回数、及び、使用語彙の異なり語数となる。 In the learning unit, the utterance intention determination unit 2 corresponds to “question” and “explanation” for each utterance for the text data of the utterance sequence in the dialogue between the inquirer and the respondent extracted by the utterance sequence extraction unit 1. The utterance indicating the utterance intention to be determined is determined, and the utterance intention corresponding to the determined utterance is output in association with the utterance to be determined. Further, the feature quantity extraction unit 3 calculates “the number of different words of the inquirer's vocabulary” and calculates the number of appearances of each utterance intention of “question” and “explanation” determined by the utterance intention determination unit 2. . The calculated “number of different words of the inquirer's vocabulary”, the number of appearances of the “question” and “explanation” of the utterance intention determined by the utterance intention determination unit 2, and the feature amount relating to the occurrence state of the utterance intention in the text data Is an intention feature amount. The elements of the feature quantity vector output by the feature quantity extraction unit 3 are the number of questions, the number of explanations, and the number of words different in the used vocabulary.

推定部では、発話意図判別部２において、発話列抽出部１により抽出された推定対象とする問合せ者と回答者との対話における発話列のテキストデータについて、各発話毎に、発話者（問合せ者、回答者）の「質問」、「説明」に該当する発話の意図を示す発話を判別する。判別された発話は、当該発話が該当する発話意図に対応付けられて出力される。さらに、特徴量抽出部３において、当該テキストデータから問合せ者が使用した単語の種類の総数を推定対象使用語彙特徴量として抽出すると共に、発話意図判別部２で判別された、推定対象発話意図に含まれる「質問」、「説明」それぞれの発話意図（推定対象発話意図）の出現回数を算出する。算出された「問合せ者の使用語彙の異なり語数」、発話意図判別部２で判別された「質問」、「説明」それぞれの推定対象発話意図の出現回数を、当該テキストデータにおける発話意図の発生状態に関する特徴量を示す推定対象意図特徴量とする。特徴量抽出部３で出力される特徴量ベクトルの要素は、質問回数、説明回数、及び、使用語彙の異なり語数となる。 In the estimator, the utterance intention discriminator 2 uses the utterer (inquirer) for each utterance for the text data of the utterance string in the dialogue between the inquirer to be estimated and the respondent extracted by the utterance string extractor 1. The utterance indicating the intention of the utterance corresponding to the “question” and “explanation” of the respondent) is determined. The determined utterance is output in association with the utterance intention corresponding to the utterance. Further, the feature amount extraction unit 3 extracts the total number of types of words used by the inquirer from the text data as the estimation target used vocabulary feature amount, and also determines the estimation target utterance intention determined by the utterance intention determination unit 2. The number of appearances of the utterance intention (estimation target utterance intention) of each of the “question” and “explanation” included is calculated. The number of occurrences of the estimated utterance intentions of the calculated “number of different words in the vocabulary used by the inquirer”, the “question” and “explanation” determined by the utterance intention determination unit 2, and the occurrence state of the utterance intention in the text data It is assumed that the estimation target intention feature amount indicates the feature amount. The elements of the feature quantity vector output by the feature quantity extraction unit 3 are the number of questions, the number of explanations, and the number of words different in the used vocabulary.

以下、本実施の形態に係る知識量推定装置１０で用いられる、上述の「質問」、「説明」、「相槌」、及び「疑問詞疑問文」について、図５〜図８を用いて説明する。 Hereinafter, the above-mentioned “question”, “explanation”, “conflict” and “question question” used in the knowledge amount estimation apparatus 10 according to the present embodiment will be described with reference to FIGS. .

まず、図５に示す質問パターン情報について説明する。質問を表す発話列として、「ですかね。」、「でしたかね。」、「でしょうかね。」、「ますかね。」、「ましたかね。」、「ましょうかね。」、「ませんかね。」、「かしらね。」、「かな。」、及び「っけ。」、が例示されている。なお、「ね」は、なくてもパターンにマッチするものとする。 First, the question pattern information shown in FIG. 5 will be described. As the utterance string that represents the question, “Is it?”, “Is it?”, “Is it?”, “Is it?”, “Is it?”, “Is it?” “Kane.”, “Kashirane.”, “Kana.”, And “Koke.” Are illustrated. It should be noted that “ne” matches the pattern even if it does not exist.

次に、図６に示す説明パターン情報について詳細を説明する。図６における説明の種類のうち、「記述文」及び「同定文」は、例えば、「坂原茂、“役割，ガ・ハ，ウナギ文” 認知科学の発展第３巻、日本認知科学会、ｐｐ．２９−６６講談社（１９９０年５月２０日）」で定義されている。 Next, the details of the explanation pattern information shown in FIG. 6 will be described. Among the types of explanation in FIG. 6, “description sentence” and “identification sentence” are, for example, “Shigeru Sakahara,“ Role, Ga Ha, Unagi sentence ”, Cognitive Science Development Vol. 3, Japan Cognitive Science Society, pp. .29-66 Kodansha (May 20, 1990) ".

すなわち、「記述文」とは、ある事物の属性を記述する文のことで、「同定文」とは、ある役割に値を割り当てる文のことである。例えば、「日本は日本列島と周辺の島々から成る国です。」は日本の属性を記述する記述文であり、「日本の首相は野田氏です。」は「日本の首相」という役割に「野田氏」という値を割り当てる同定文である。 That is, the “description sentence” is a sentence that describes the attribute of a certain thing, and the “identification sentence” is a sentence that assigns a value to a certain role. For example, “Japan is a country made up of the Japanese archipelago and surrounding islands” is a descriptive statement that describes the attributes of Japan, and “Japan's prime minister is Mr. Noda.” This is an identification sentence to which the value “Mr.” is assigned.

これらの記述文、及び同定文の抽出には、構文解析によって判別した係助詞の「は」を用いるなどしても良いが、本例では、構文解析なしに容易に抽出可能な発話末の「です」等のパターンを用いる。 For the extraction of these descriptive sentences and identification sentences, the particle “ha” determined by the syntax analysis may be used. However, in this example, “ Use a pattern such as

次の「理由の説明」とは、自身の先行する発話に対し、発話内容（命題）が成立する理由を付け加えるための発話とする。ここでは、発話末が「ので」、または「から」で終わっているものを「理由の説明」として定義する。 The next “explanation of reason” is an utterance for adding a reason why the utterance content (proposition) is established with respect to the preceding utterance. Here, the utterance ends with “So” or “From” is defined as “Explanation of Reason”.

最後に、「聞き手の意思決定に資する情報・条件の提示」とは、聞き手が何かをするための参考となるよう、提示される情報のことを指す。これは、例えば、『白川博之、“「ケド」で言い終わる文” 広島大学日本語学科紀要６号、ｐｐ．９−１７（１９９６）』において定義する、言い終わりの「けど」の持つ終助詞的機能であり、ここでは、これを説明の一部として加えることとする。 Finally, “presentation of information / conditions that contribute to the decision-making of the listener” refers to the information that is presented so that the listener can use it to do something. This is defined as, for example, “Hiroyuki Shirakawa,“ Sentence Finished with “Ked” ”Bulletin of Hiroshima University Japanese Language Course No. 6, pp. 9-17 (1996)”. This is a function, which is added here as part of the description.

本例では、上記３種類の定義に合致する発話を説明発話として抽出する。 In this example, utterances that match the above three types of definitions are extracted as explanatory utterances.

次に、図７に示す相槌パターン情報について説明する。 Next, the conflict pattern information shown in FIG. 7 will be described.

対話において聞き手が打つ相槌の果たす役割は、例えば、「Senko K. Maynard, Japanese Communication: Language and Thought in Context, University of Hawai‘i Press, Honolulu (1997)」に記載のように、相手の発話権を認め、発話の継続を促すと同時に、相手の発話内容を理解していることを示すことである。 The role played by the listener in the dialogue is, for example, as described in `` Senko K. Maynard, Japanese Communication: Language and Thought in Context, University of Hawai'i Press, Honolulu (1997) ''. Is to encourage the continuation of the utterance and at the same time indicate that the other person's utterance is understood.

従って、相槌を打つ回数と知識量の間に何らかの関連性があると考えられる。本例では、図７に示すように、発話末に「はい」、「ええ」、及び「うん」が現れている発話を相槌発話とし、その出現回数を特徴量として利用する。 Therefore, it can be considered that there is some relationship between the number of times of hitting and the amount of knowledge. In this example, as shown in FIG. 7, an utterance in which “Yes”, “Yes”, and “Ye” appear at the end of the utterance is used as a combined utterance, and the number of appearances is used as a feature amount.

次に、図８に示す疑問詞情報について説明する。 Next, the interrogative information shown in FIG. 8 will be described.

質問の形式をとる発話の中には、例えば、「徳久雅人、前田浩佑、村上仁一、池原悟、“心的状態を表す対話行為タグ付きテキスト対話コーパスの構築” 電子情報通信学会技術研究報告、思考と言語、ＴＬ２００７−４５、ｐｐ．２５−３０、（２００７）」に記載のように、仮説が述べられている疑問文とそうでないものがある。 Among the utterances that take the form of questions, for example, “Masato Tokuhisa, Hiroaki Maeda, Jinichi Murakami, Satoru Ikehara,“ Construction of a text dialogue corpus with dialogue action tags representing mental states ”IEICE technical report , Thought and Language, TL 2007-45, pp. 25-30, (2007) ", there are question sentences in which hypotheses are stated and others that are not.

前者は「光回線って高いんですか。」のように、自身の想定・仮説の検証を目的とした、言わば「確認」の機能を果たすと考えられ、また、後者は「光回線って何ですか。」のように、純粋に未知の情報を要求する機能を果たすと考えられる。 The former is considered to fulfill the function of “confirmation” for the purpose of verifying its own assumptions and hypotheses, such as “Is optical line expensive?”, And the latter is “What is optical line?” It is thought that it fulfills the function of requesting purely unknown information.

話者の知識量を推定するうえでは、後者の、未知の情報を要求する質問を抽出することが重要であると考えられるため、オペレータとユーザそれぞれが発話した「疑問詞疑問文」の回数も抽出し、特徴量として用いる。 In estimating the amount of knowledge of a speaker, it is considered important to extract the latter question that requires unknown information. Therefore, the number of “question word questions” spoken by the operator and the user is also determined. Extracted and used as a feature value.

本例において、疑問詞疑問文は、発話意図ラベルが「質問」である発話のうち、図８に示される疑問詞が含まれるものとする。 In this example, the interrogative question sentence includes the interrogative shown in FIG. 8 among the utterances whose utterance intention label is “question”.

図３のステップＳ１０５の処理において、発話意図判別部２は、以上の図５に示す質問パターン情報、図６に示す説明パターン情報、図７に示す相槌パターン情報、及び図８に示す疑問詞情報を用いて、パターンマッチングにより、当該意図に該当する発話を判別する。 In the process of step S105 in FIG. 3, the speech intention determination unit 2 performs the above question pattern information shown in FIG. 5, the explanation pattern information shown in FIG. 6, the conflict pattern information shown in FIG. 7, and the question information shown in FIG. Is used to determine the utterance corresponding to the intention by pattern matching.

次に、図４を用いて、本実施の形態に係る知識量推定装置１０の推定部における処理内容を説明する。 Next, processing contents in the estimation unit of the knowledge amount estimation apparatus 10 according to the present embodiment will be described with reference to FIG.

図４においては、本実施の形態に係る知識量推定装置１０の推定部における処理内容を示しており、推定対象となるコンタクトセンタでのユーザとオペレータとの間の対話（通話）（以下、「推定対象対話」という。）の音声データが入力されると（ステップＳ２０１）、当該音声データに対する音声認識処理を行った後（ステップＳ２０３）、図３のステップＳ１０４〜Ｓ１０６と同様、発話列抽出部１、発話意図判別部２、及び特徴量抽出部３の各処理（ステップＳ２０４，Ｓ２０５，Ｓ２０６）を行い、その後、知識量推定部５において、推定情報生成部４と同様に「Support Vector Machine （SVM）」を用いて、ステップＳ２０６の処理により得られた推定対象意図特徴量と推定対象使用語彙特徴量（特徴量ベクトル）を、前述した学習部による処理（図３）により生成されたモデル情報５ａと照合することにより、推定対象対話が、何れの知識量に該当するものであるかを特定することで、推定対象対話を行っている当該ユーザ（話者）の知識量を推定する（ステップＳ２０９）。 In FIG. 4, the processing content in the estimation part of the knowledge amount estimation apparatus 10 according to the present embodiment is shown, and dialogue (call) between the user and the operator at the contact center to be estimated (hereinafter referred to as “ When speech data of “estimation target dialogue” is input (step S201), speech recognition processing is performed on the speech data (step S203), and then the utterance string extraction unit is performed as in steps S104 to S106 of FIG. 1, each process of the speech intention determination unit 2 and the feature amount extraction unit 3 (steps S204, S205, and S206) is performed, and thereafter, in the knowledge amount estimation unit 5, the “Support Vector Machine ( SVM) ”is used to calculate the estimation target intention feature amount and the estimation target use vocabulary feature amount (feature vector) obtained by the processing in step S206, as described above. The user who is performing the estimation target dialogue by identifying which knowledge amount the estimation target dialogue corresponds to by comparing with the model information 5a generated by the process (FIG. 3) The knowledge amount of (speaker) is estimated (step S209).

なお、知識量推定部５に関しても、推定情報生成部４と同様、「Support Vector Machine （SVM）」を用いて知識量の推定を行うものとするが、ニューラルネットワークを用いた推定等、他のどのような推定手法を用いても構わない。「Support Vector Machine （SVM）」を用いて、例えば上記のモデル１〜３により知識量の推定を行う場合には、モデル１〜３それぞれを用いて判別を行い、最も適合したモデルの正のデータに対応する知識量を話者の知識量とする。 As for the knowledge amount estimation unit 5, similarly to the estimated information generation unit 4, the knowledge amount is estimated using “Support Vector Machine (SVM)”. Any estimation method may be used. When using “Support Vector Machine (SVM)” to estimate the amount of knowledge using, for example, the above models 1 to 3, discrimination is performed using each of the models 1 to 3, and the positive data of the most suitable model is used. Let the amount of knowledge corresponding to be the speaker's amount of knowledge.

次に、特徴量抽出部３が抽出する、４種類の意図特徴量、すなわち、「１．ユーザとオペレータのそれぞれの質問回数」、「２．ユーザとオペレータの疑問詞疑問文の回数」、「３．ユーザとオペレータの説明回数」及び「４．ユーザとオペレータの相槌回数」、及び、テキストデータから抽出する使用語彙特徴量からなる特徴量ベクトルについて詳細に説明する。 Next, four types of intention feature amounts extracted by the feature amount extraction unit 3, that is, “1. The number of questions for each of the user and the operator”, “2. The number of question sentences for the user and the operator”, “ 3. “User and Operator Description Frequency” and “4. User and Operator Conflict Count”, and feature quantity vectors composed of vocabulary feature quantities used extracted from text data will be described in detail.

まず、「１．ユーザとオペレータのそれぞれの質問回数」について以下に説明する。 First, “1. The number of questions for each of the user and the operator” will be described below.

コンタクトセンタ等での対話（通話）において質問が果たす役割のうち、最も基本的なものは、相手に対して情報の提供を要求することである。具体的には、ユーザが発する質問は、そのユーザに欠けている知識をオペレータに提供してもらうためのものであり、また、オペレータが発する質問は、ユーザの用件や置かれた状況について情報を提供してもらうためのものである。 Of the roles played by questions in dialogue (call) at a contact center or the like, the most basic one is to request information from the other party. Specifically, a question that a user issues is for the operator to provide knowledge that the user lacks, and a question that the operator issues is information about the user's requirements and situation. It is for having you offer.

つまり、ユーザの質問回数は、ユーザに足りない知識の量を反映しており、また、オペレータの質問回数は、ユーザの用件・状況把握にかかった労力を反映していると言える。ここで、オペレータがユーザの用件・状況を把握しようとする際は、知識量の少ないユーザとの対話において、より多くの労力を要すると考えられるため、ユーザとオペレータの質問回数を用いれば、ユーザの知識量を推定することができる。 In other words, it can be said that the user's number of questions reflects the amount of knowledge that the user lacks, and the operator's number of questions reflects the effort required to grasp the user's requirements and situation. Here, when the operator tries to grasp the user's requirement / situation, it is considered that more labor is required in the dialogue with the user having a small amount of knowledge, so if the number of questions of the user and the operator is used, The amount of knowledge of the user can be estimated.

本例では、ユーザの質問回数とオペレータの質問回数のそのままの値を特徴量として用いることとする。例えば、図９の発話列からなるテキストデータ５００で例示する対話内容の場合、アンダーラインが付与された部分（「でしょうか。」、「ますか。」）を含む発話が「質問」であり、図９の例では、オペレータとユーザ共に、質問回数は２回ずつということになる。 In this example, the values of the user's question count and the operator's question count are used as feature values. For example, in the case of the dialogue content illustrated in the text data 500 including the utterance string in FIG. 9, the utterance including the underlined part (“Is it?”, “Is it?”) Is “Question”, In the example of FIG. 9, the number of questions is two times for both the operator and the user.

従って、ここでは、特徴量として使う値は、オペレータによる質問回数、ユーザによる質問回数は共に「２」となる。なお、この図９に示す発話列からなるテキストデータ５００は、ＲＡＭ２４、ハードディスク２８等の記憶装置に記憶される。 Accordingly, here, the value used as the feature amount is “2” for both the number of questions by the operator and the number of questions by the user. 9 is stored in a storage device such as the RAM 24 and the hard disk 28.

次に、「２．ユーザとオペレータの疑問詞疑問文の回数」について以下に説明する。 Next, “2. The number of questionable question sentences of the user and the operator” will be described below.

図８で説明したように、話者の知識量を推定するうえでは、未知の情報を要求する質問を抽出することが重要であると考えられるため、オペレータとユーザそれぞれが発話した「疑問詞疑問文」の回数も抽出し、特徴量として用いる。 As explained in FIG. 8, it is considered important to extract questions that require unknown information in estimating the amount of knowledge of the speaker. The number of “sentences” is also extracted and used as a feature quantity.

図９の発話列からなるテキストデータ５００で例示する対話内容の場合、図８に示される疑問詞が含まれるものとして、波線のアンダーラインで示す部分（「どう」、「何」）が疑問詞に該当し、図９の例では、オペレータの疑問詞疑問文の回数が「１」、ユーザの疑問詞疑問文の回数が「２」となり、特徴量として使う値は、オペレータの疑問詞疑問文の回数は「１」、ユーザの疑問詞疑問文の回数は「２」となる。 In the case of the dialogue content exemplified by the text data 500 including the utterance string of FIG. 9, the question lines shown in FIG. 8 are included, and the portions indicated by the underline of the wavy lines (“how”, “what”) are the question words. In the example of FIG. 9, the number of questionable question sentences of the operator is “1”, the number of questionable question sentences of the user is “2”, and the value used as the feature amount is the questionable question text of the operator Is “1”, and the number of questionable question sentences of the user is “2”.

次に、「３．ユーザとオペレータの説明回数」について以下に説明する。 Next, “3. Number of times of explanation by user and operator” will be described below.

本例において、説明とは、相手に情報を提供する行為であると定義する。具体的には、図６に示される３種類の発話（「記述文・同定文」、「理由の説明」、「聞き手の意思決定に資する情報・条件の提示」）を説明とみなす。 In this example, the description is defined as an act of providing information to the other party. Specifically, the three types of utterances shown in FIG. 6 ("description sentence / identification sentence", "explanation of reason", and "presentation of information / conditions that contribute to the listener's decision making") are regarded as explanations.

図９の例では、二重下線で示してある部分（「ます」、「ですが」、「けど」、「です」）が説明に該当し、具体的に特徴量として使う値は、オペレータによる説明回数が「１」、ユーザによる説明回数が「３」となる．
次に、「４．ユーザとオペレータの相槌回数」について以下に説明する。 In the example of FIG. 9, the parts indicated by double underscores (“Masu”, “Issuga”, “Bur” ”,“ Iss ”) correspond to the description, and the values used as the feature values are specifically determined by the operator. The number of explanations is “1”, and the number of explanations by the user is “3”.
Next, “4. The number of conflicts between the user and the operator” will be described below.

本例においては、図７で示した発話末に「はい」、「ええ」、「うん」が現れている発話を相槌発話とし、その回数を特徴量として利用する。 In this example, an utterance in which “Yes”, “Yes”, and “Ye” appear at the end of the utterance shown in FIG. 7 is used as a mutual utterance, and the number of times is used as a feature amount.

図９の例では、破線で示してある部分（「はい」）が相槌に該当し、具体的に特徴量として使う値は、オペレータによる相槌回数が「１」、ユーザによる相槌回数は「０」となる。 In the example of FIG. 9, a portion (“Yes”) indicated by a broken line corresponds to the conflict, and the value used as the feature amount is specifically “1” for the number of conflicts by the operator and “0” for the number of conflicts by the user. It becomes.

次に、使用語彙特徴量について以下に説明する。ここでは、使用語彙特徴量を「ユーザの使用語彙の異なり語数」として説明する。 Next, the vocabulary feature amount used will be described below. Here, the used vocabulary feature amount will be described as “the number of words different in the user's used vocabulary”.

異なり語数とは、通話において何種類の単語を使用したかを指す。同じ用件の通話を終えるのに、あるユーザが他のユーザより多くの種類の単語を使う必要があったということは、そのユーザの意思疎通が非効率であったことを示している。 The number of words is different from the number of words used in a call. The fact that a user needed to use more types of words than other users to finish a call on the same message indicates that the user's communication was inefficient.

知識量の少ないユーザは、自分の状態を的確に伝えることが不得意であると考えられるため、本実施例では、知識量が小であるユーザを特徴づけるために、ユーザの使用語彙の異なり語数を用いることとする。 Since it is considered that a user with a small amount of knowledge is not good at accurately communicating his / her state, in this embodiment, in order to characterize a user with a small amount of knowledge, the number of words used in the user's vocabulary differs. Will be used.

本実施例では、異なり語数を数えるために、まず、発話列を入力として形態素解析を行い、発話列を単語に分割し、各単語に品詞を付与する。そして、名詞と未知語のみを対象として単語の種類を数える。 In this embodiment, in order to count the number of different words, first, morphological analysis is performed using an utterance string as input, the utterance string is divided into words, and parts of speech are given to each word. Then, word types are counted only for nouns and unknown words.

本実施例では、形態素解析には、例えば、『松本裕治、 “形態素解析システム「茶筌」”、情報処理４１巻１１号２０００年１１月、 pp.1208-1214 (2000)』に記載の形態素解析システム「茶筌」を用いるが、形態素解析ツールであれば何を用いても構わない。また、音声認識結果が、単語区切りで出力され、品詞が付与されている場合は、出力された単語区切りと品詞をそのまま利用してもよい。 In this embodiment, the morpheme analysis is described in, for example, “Yuji Matsumoto,“ Morphological Analysis System “Chaya” ”, Information Processing Vol. 41, No. 11, November 2000, pp. 1208-1214 (2000)”. The system “tea bowl” is used, but any morphological analysis tool may be used. If the speech recognition result is output at word breaks and part of speech is given, the output word breaks and part of speech may be used as they are.

例えば、図９の発話列からなるテキストデータ５００で例示する対話内容の場合、ユーザ役が使用した名詞、及び未知語は斜体で示されている「パソコン」、「インターネット」、及び「回線」の３種類である。よって、使用語彙特徴量として使用する値は「３」となる。 For example, in the case of the dialog content illustrated by the text data 500 including the utterance string in FIG. 9, the nouns and unknown words used by the user role are “PC”, “Internet”, and “line” shown in italics. There are three types. Therefore, the value used as the used vocabulary feature amount is “3”.

以上によって抽出された各特徴量を特徴量ベクトルとして出力する。すなわち、本実施例で使用する特徴量ベクトルは、「１：オペレータ役の質問回数、２：ユーザ役の質問回数、３：オペレータ役の説明回数、４：ユーザ役の説明回数、５：オペレータ役の相槌回数、６：ユーザ役の相槌回数、７：オペレータ役の疑問詞疑問文の回数、８：ユーザ役の疑問詞疑問文の回数、９：ユーザ役の異なり語数」という構成になっている。 Each feature quantity extracted as described above is output as a feature quantity vector. That is, the feature quantity vector used in this embodiment is “1: operator role question count, 2: user role question count, 3: operator role description count, 4: user role description count, 5: operator role count. 6: Number of questions of user role, 7: Number of questions of questioning question of operator role, 8: Number of questions of questioning question of user role, 9: Number of words differing in user role .

以下、本実施の形態に係る知識量推定装置１０に対する評価実験例について説明する。ここでは、コンタクトセンタでの通話における各ユーザ役を対象として知識量の推定を行い、その評価実験を行った。 Hereinafter, an evaluation experiment example for the knowledge amount estimation apparatus 10 according to the present embodiment will be described. Here, the amount of knowledge was estimated for each user role in a call at a contact center, and an evaluation experiment was performed.

ここで、本実施の形態に係る知識量推定装置１０への入力としては、各コンタクトセンタでの通話のオペレータとユーザの両方の音声認識結果を用い、音声認識結果として得られた発話列からなるテキストデータを用いて上述した各種の意図特徴量と使用語彙特徴量を抽出する。 Here, as the input to the knowledge amount estimation apparatus 10 according to the present embodiment, the speech recognition result of both the operator and the user of the call at each contact center is used, and the speech sequence obtained as the speech recognition result is used. The above-described various intention feature amounts and used vocabulary feature amounts are extracted using text data.

また、各通話に対する知識量の推定結果に対する評価に用いる正解ラベル（知識量情報）としては、人間の主観評価で、各通話における各ユーザが「知識量小」、「知識量大」のどちらに該当するかを、図１０において判定基準情報として示される判定基準に従って判定させ、各通話に対して知識量情報（知識量ラベル）を付与した。なお、図１０の判定基準情報は、予めハードディスク２８に記憶されており、当該ハードディスク２８から読み込まれてＲＡＭ２４に記憶されて用いられる。 In addition, as a correct label (knowledge amount information) used for evaluation of the estimation result of the knowledge amount for each call, each user in each call is classified as “small knowledge amount” or “large knowledge amount” in human subjective evaluation. Whether it is applicable or not is determined according to the criterion shown as criterion information in FIG. 10, and knowledge amount information (knowledge amount label) is assigned to each call. 10 is stored in advance in the hard disk 28, and is read from the hard disk 28 and stored in the RAM 24 for use.

この評価実験例では、オペレータとユーザの音声認識結果と知識量ラベルの対は、１８０通話分用意した。そのうち、９４通話に「知識量小」のラベルが付与され、８６通話に「知識量大」のラベルが付与されている。 In this evaluation experiment example, a pair of operator and user voice recognition results and knowledge amount labels are prepared for 180 calls. Among them, the 94 call has the “small knowledge” label, and the 86 call has the “large knowledge” label.

各通話に付与された知識量ラベルと、各通話から抽出した意図特徴量及び使用語彙特徴量を対応付けるモデルの学習、及び知識量の推定の評価は、データを１０分割したうちの９つを学習データとし、残り１つを評価データとすることで１０パターンのデータセットを用意する１０分割交差検定によって実施した。 Learning the model that associates the knowledge amount label given to each call with the intention feature amount and vocabulary feature amount extracted from each call, and the evaluation of the estimation of the knowledge amount learns 9 out of 10 divided data This was performed by 10-fold cross-validation in which a data set of 10 patterns was prepared by using the remaining one as evaluation data.

今回の例で、知識量の推定に使用した意図特徴量は、（ｉ）オペレータ役の質問回数、（ｉｉ）ユーザ役の質問回数、（ｉｉｉ）オペレータ役の説明回数、（ｉｖ）ユーザ役の説明回数、（ｖ）オペレータ役の相槌回数、（ｖｉ）ユーザ役の相槌回数、（ｖｉｉ）オペレータ役の疑問詞疑問文の回数、（ｖｉｉｉ）ユーザ役の疑問詞疑問文の回数、及び（ｉｘ）ユーザ役の異なり語数の９種類とした。また、各通話から抽出した意図特徴量及び使用語彙特徴量と知識量正解ラベルとの対応付けの学習、及び推定対象意図特徴量と推定対象使用語彙特徴量に基づく知識量推定には、「Support Vector Machine （SVM）」を用いた。 In this example, the intention feature used to estimate the knowledge amount is (i) the number of questions for the operator role, (ii) the number of questions for the user role, (iii) the number of times the operator role is explained, (iv) The number of explanations, (v) the number of competing operator roles, (vi) the number of competing user roles, (vii) the number of questionable question sentences for operator roles, (viii) the number of questionable question sentences for user roles, and (ix ) Nine types of words with different user roles. In addition, learning of the correspondence between the intention feature amount and used vocabulary feature amount extracted from each call and the knowledge amount correct label, and the knowledge amount estimation based on the estimation target intention feature amount and the estimation target used vocabulary feature amount, Vector Machine (SVM) ”was used.

一方、評価指標として、本例の精度評価では、正確性の指標である「適合率（precision）」を利用する。この指標は、推定した結果の中に正解の知識量ラベルと一致するものがどのくらいあるかを示すものである。 On the other hand, in the accuracy evaluation of this example, “precision” that is an accuracy index is used as an evaluation index. This index indicates how many of the estimated results coincide with the correct knowledge amount label.

このように、適合率を重視するのには理由がある。すなわち、本例で知識量の推定の対象としたコンタクトセンタでの通話は、データ数が膨大であるのが一番の特徴である。その膨大な通話データから、分析すべき対象を正確に絞り込むのが本発明を利用する大きな目的であると考えると、適合率が重要な意味を持つといえる。 In this way, there is a reason to place importance on the precision. That is, the most characteristic feature of the call at the contact center, which is the target of knowledge amount estimation in this example, is that the number of data is enormous. Considering that it is a major purpose of using the present invention to accurately narrow down the objects to be analyzed from the enormous amount of call data, it can be said that the relevance rate has an important meaning.

本評価例では、「知識量小」を推定する問題と捉え、以下の式を用いて適合率を算出した。 In this evaluation example, it was regarded as a problem of estimating “small amount of knowledge”, and the precision was calculated using the following formula.

適合率＝正しく推定できた「知識量小」の数÷「知識量小」として推定した総数
なお、本実験例では、この適合率に加え、網羅性の指標である「再現率（recall）」も同時に評価し、参考にした。再現率は以下の式を用いて算出した。 Relevance rate = number of “small amount of knowledge” that can be correctly estimated ÷ total number estimated as “small amount of knowledge” In this experimental example, in addition to this relevance rate, “recall rate”, which is an index of completeness Was also evaluated and used as a reference. The recall was calculated using the following formula.

再現率＝正しく推定できた「知識量小」の数÷全データ中の「知識量小」の数
本例の１０分割交差検定では、全１８０通話分のデータから１０個の学習データ・評価データの対を作って学習と精度評価を行っている。今回の実験例では、１０対のデータそれぞれの推定結果から算出した適合率のマイクロ平均によって評価を行う。 Recall rate = number of “small amount of knowledge” that can be correctly estimated ÷ number of “small amount of knowledge” in all data In the 10-fold cross-validation of this example, 10 learning data / evaluation data from the data for all 180 calls The pair is made and learning and accuracy evaluation are performed. In this experimental example, the evaluation is performed by the micro average of the relevance ratios calculated from the estimation results of the 10 pairs of data.

本例の評価では、上述の非特許文献１，２として示したユーザの使用語彙の専門性を用いた手法、及び、ユーザがどの単語を何回使用したかという単語特徴量（bag of words）をベースラインとして、本実施例との比較をする。 In the evaluation of this example, the technique using the expertise of the user's vocabulary shown as Non-Patent Documents 1 and 2 above, and the word feature (bag of words) indicating how many times the user has used As a baseline, a comparison with this example will be made.

非特許文献１に倣い、ユーザの使用語彙の専門性を特徴量化する際、本実施例では、そのユーザが使用する単語の珍しさの平均値を用いる。単語の珍しさとしては、「Inverse Document Frequency (IDF)」を用いる。 According to Non-Patent Document 1, when specializing a user's vocabulary to be featured, in this embodiment, an average value of the rarity of words used by the user is used. “Inverse Document Frequency (IDF)” is used as an unusual word.

総通話数がN、単語tを含む通話数がdf(t)のとき、単語tのIDF値は以下の式（１）を用いて算出する。 When the total number of calls is N and the number of calls including the word t is df (t), the IDF value of the word t is calculated using the following equation (1).

idf(t)＝log(N/df(t))・・・式（１）
なお、本実施例では、ユーザの使用語彙の平均IDFの算出には、名詞と未知語のみを利用した。 idf (t) = log (N / df (t)) ... Formula (1)
In this embodiment, only nouns and unknown words are used to calculate the average IDF of the user's vocabulary.

このような評価の結果、本実施例により求めた特徴量を用いた場合、適合率０．８、再現率０．４３で、知識量小ユーザの通話を抽出できることが確認できた。 As a result of such evaluation, it was confirmed that when the feature amount obtained by the present embodiment was used, it was possible to extract a call of a user with a small knowledge amount with a precision of 0.8 and a recall of 0.43.

図１１に示されるとおり、再現率をそろえた場合、本実施例が最も高い適合率で知識量小ユーザの通話を抽出できる。つまり、対象データ中の知識量小ユーザの通話を同じくらい網羅して抽出できるとき、抽出した通話に含まれる知識量小ユーザの通話（正解）の割合が最も高くなるのは本実施例によるものである。 As shown in FIG. 11, when recall rates are uniform, this embodiment can extract a call of a user with a small knowledge amount with the highest relevance rate. In other words, when calls of small users with a small amount of knowledge in the target data can be extracted as much as possible, the percentage of calls (correct answers) of small users with a small amount of knowledge included in the extracted calls is the highest according to this embodiment. It is.

さらに、図１２に示されるとおり、適合率をそろえた場合も、本実施例が最も高い再現率で知識量小ユーザの通話を抽出できる。つまり、同じくらい正確に知識量小ユーザの通話を抽出できるとき、最も網羅的に知識量小ユーザの通話を抽出できるのは本実施例によるものでる．以上により、本実施例で求める特徴量は、話者の知識量の推定において有効であると言える。 Furthermore, as shown in FIG. 12, even when the matching ratios are uniform, this embodiment can extract a call of a user with a small knowledge amount with the highest recall rate. In other words, when it is possible to extract the calls of small-knowledge users as accurately as possible, it is according to this embodiment that the calls of small-knowledge users can be extracted most comprehensively. From the above, it can be said that the feature amount obtained in this embodiment is effective in estimating the speaker's knowledge amount.

以上説明したように、本実施の形態に係る知識量推定装置１０では、コンタクトセンタ等におけるオペレータとユーザとの間の対話に表れる発話意図の特徴量、及び、使用語彙の特徴量を利用して、話者（ユーザ）の知識量を推定する。 As described above, in the knowledge amount estimation apparatus 10 according to the present embodiment, the feature amount of the utterance intention and the feature amount of the used vocabulary that appear in the dialogue between the operator and the user in the contact center or the like are used. The amount of knowledge of the speaker (user) is estimated.

具体的には、プログラムされたコンピュータ処理により実装される機能として、少なくとも発話意図判別部２、特徴量抽出部３、推定情報生成部４、及び知識量推定部５を備え、発話意図判別部２は、問合せ者と回答者との対話における発話列のテキストデータから、相手への質問、説明、及び相槌に該当する発話の意図を示す発話を判別し、特徴量抽出部３は、発話意図判別部２で判別された発話の、当該テキストデータにおける発生状態に関する特徴量を示す意図特徴量を抽出すると共に、当該テキストデータから問合せ者が使用した単語の種類の総数を使用語彙特徴量として抽出し、推定情報生成部４は、特徴量抽出部３で抽出された意図特徴量と使用語彙特徴量、及び当該意図特徴量と使用語彙特徴量の抽出の対象とされたテキストデータに対して当該テキストデータにより示される対話から想定される問合せ者の知識量を示すものとして予め付与された知識量情報を学習データとして用いることで、テキストデータに対応する知識量の推定に用いる推定情報を生成する。 Specifically, as a function implemented by programmed computer processing, at least an utterance intention determination unit 2, a feature amount extraction unit 3, an estimation information generation unit 4, and a knowledge amount estimation unit 5 are provided, and the utterance intention determination unit 2 Determines the utterance indicating the intention of the utterance corresponding to the question, the explanation, and the companion from the text data of the utterance string in the dialogue between the inquirer and the respondent, and the feature amount extraction unit 3 determines the utterance intention The intention feature amount indicating the feature amount related to the occurrence state of the utterance determined in the section 2 in the text data is extracted, and the total number of word types used by the inquirer is extracted from the text data as the used vocabulary feature amount. The estimated information generation unit 4 includes the intention feature amount and the used vocabulary feature amount extracted by the feature amount extraction unit 3, and the text that is the target of extraction of the intended feature amount and the used vocabulary feature amount. The amount of knowledge corresponding to the text data can be estimated by using, as learning data, knowledge amount information given in advance to indicate the knowledge amount of the inquirer assumed from the dialogue indicated by the text data. Generate estimated information to use.

このようにして推定情報生成部４で生成された推定情報は、ハードディスク２８等の記憶装置に記憶される。 The estimation information generated by the estimation information generation unit 4 in this way is stored in a storage device such as the hard disk 28.

また、発話意図判別部２は、知識量の推定対象とする問合せ者と回答者との対話における発話列のテキストデータから、相手への質問、説明、相槌などに該当する発話の意図を示す発話を判別し、特徴量抽出部３は、発話意図判別部２で判別された推定対象の発話の、テキストデータにおける発生状態に関する特徴量を示す推定対象意図特徴量を抽出すると共に、当該テキストデータから問合せ者が使用した単語の種類の総数を推定対象使用語彙特徴量として抽出し、さらに、相手への質問に該当する発話に疑問詞が含まれる発話を疑問詞疑問文として抽出して、当該疑問詞の当該テキストデータにおける発生状態に関する特徴量も上記推定対象意図特徴量として抽出し、知識量推定部５は、特徴量抽出部３で抽出された推定対象意図特徴量及び推定対象使用語彙特徴量と、予め記憶装置に記憶された推定情報とを用いて、推定対象とするテキストデータに対応する知識量を推定する。 The utterance intention discriminating unit 2 also utters an utterance indicating the intention of the utterance corresponding to the question, explanation, conflict, etc. to the other party from the text data of the utterance string in the dialogue between the inquirer and the respondent whose knowledge is to be estimated. The feature amount extraction unit 3 extracts an estimation target intention feature amount indicating a feature amount related to the occurrence state of the utterance of the estimation target determined by the utterance intention determination unit 2 in the text data, and from the text data The total number of types of words used by the inquirer is extracted as the estimated target vocabulary feature quantity, and further, the utterances that contain interrogative words in the utterance corresponding to the question to the other party are extracted as interrogative question sentences. The feature quantity regarding the occurrence state of the lyrics in the text data is also extracted as the estimation target intention feature quantity, and the knowledge quantity estimation unit 5 extracts the estimation target intention feature quantity extracted by the feature quantity extraction unit 3. And fine estimation target using lexical feature quantity, by using the estimated information stored in advance in the storage device, and estimates the amount of knowledge corresponding to the text data to be estimated.

ここでは、意図特徴量としては、問合せ者の質問回数、回答者の質問回数、問合せ者の説明回数、回答者の説明回数、問合せ者の相槌回数、回答者の相槌回数、問合せ者の疑問詞疑問文の回数、及び回答者の疑問詞疑問文の回数を含むが、相槌回数と疑問詞疑問文の回数に関しては必要に応じて用いることで良い。 Here, the intention feature amount includes the number of questions of the inquirer, the number of questions of the respondent, the number of explanations of the inquirer, the number of explanations of the respondent, the number of inquiries of the inquirer, the number of inquiries of the respondent, the questioner of the inquirer This includes the number of questions and the number of questions of the respondent's question, but the number of questions and the number of questions may be used as necessary.

このように、本例では、対話の中の質問、説明、相槌、及び疑問詞疑問文に該当する発話を用いた特徴量を利用して、対話から話者の知識量を推定しており、従来技術で音声認識結果を知識量の推定の入力として扱う場合における問題点を回避することができる。 In this way, in this example, the amount of knowledge of the speaker is estimated from the dialogue using the features using the utterances corresponding to the questions, explanations, companions, and interrogative questions in the dialogue, It is possible to avoid problems in the case where the speech recognition result is handled as an input for estimating the knowledge amount in the prior art.

すなわち、従来の語彙の専門性を用いた音声認識結果を知識量の推定の入力として扱う場合、ある話者の発話の音声認識結果に、専門性の高い単語が誤っていくつか出現した場合には、その話者の知識量は誤って高いと推定されてしまう恐れがあるが、本例では、対話の中の質問、説明、相槌、及び疑問詞疑問文に該当する発話を用いた特徴量利用して、対話から当該話者の知識量を推定しており、このような問題点を回避することができる。 In other words, when speech recognition results using conventional vocabulary expertise are treated as input for estimation of the amount of knowledge, when some highly specialized words appear erroneously in the speech recognition results of a speaker's utterance May be presumed that the speaker's knowledge amount is erroneously high, but in this example, the feature amount using utterances corresponding to questions, explanations, companions, and interrogative questions in the dialogue By utilizing this, the knowledge amount of the speaker is estimated from the dialogue, and such problems can be avoided.

これにより、本実施の形態に係る知識量推定装置１０を用いることで、対話結果に基づき、対話の対象となった事項に対する当該話者（問合せ者）の知識量の推定を精度良く行うことができる。 Thereby, by using the knowledge amount estimation apparatus 10 according to the present embodiment, it is possible to accurately estimate the knowledge amount of the speaker (inquirer) for the item that is the subject of the dialogue based on the dialogue result. it can.

なお、本発明は、上述した例に限定されるものではなく、この発明の要旨を逸脱しない範囲内で様々な変形や応用が可能である。 In addition, this invention is not limited to the example mentioned above, A various deformation | transformation and application are possible within the range which does not deviate from the summary of this invention.

例えば、本例では、処理対象とする対話としてコンタクトセンタにおけるユーザとオペレータ間の対話（通話）を例に説明したが、これに限定されず、例えば、上記非特許文献２のバス運行情報案内システムにおけるユーザからの問合せに対する応答者との対話等、他の音声認識を用いた対話におけるユーザの知識量を推定するシステムに適用可能である。 For example, in this example, the dialogue (call) between the user and the operator in the contact center has been described as an example of the dialogue to be processed. However, the present invention is not limited to this, for example, the bus operation information guidance system of Non-Patent Document 2 above. The present invention is applicable to a system that estimates the amount of knowledge of a user in a dialog using other speech recognition, such as a dialog with a responder to an inquiry from the user.

また、上記対話とは、システム（回答者）対人間（問合せ者）でも、人間（回答者）対人間（問合せ者）でも、２者の発話のやりとりであれば何れでも良い。 The dialogue may be any system (respondent) versus human (inquirer), human (respondent) versus human (inquirer), or any conversation between two parties.

また、上記対話のテキストデータとは、対話の音声認識結果や書き起こし、テキストチャットなど、文字化（テキスト化）された対話であれば何でもよい。 The text data of the dialog may be anything as long as it is a text-converted dialog, such as a speech recognition result of the dialog, a transcription, or a text chat.

また、図２に示したコンピュータ構成において、本発明に係る各処理部の機能を実現するためのプログラムをコンピュータ読み取り可能な記録媒体に記録して、この記録媒体に記録されたプログラムをコンピュータシステムに読み込ませ、実行することにより、各構成による処理が実行されてもよいし、図示されていない通信機能を用いて、当該プログラムを読み込ませることでもよい。 In the computer configuration shown in FIG. 2, a program for realizing the function of each processing unit according to the present invention is recorded on a computer-readable recording medium, and the program recorded on the recording medium is stored in a computer system. By reading and executing, the processing by each configuration may be executed, or the program may be read by using a communication function not shown.

なお、コンピュータ読み取り可能な記録媒体とは、フレキシブルディスク、光磁気ディスク、ＲＯＭ、ＣＤ−ＲＯＭ等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶装置のことをいう。 The computer-readable recording medium refers to a portable medium such as a flexible disk, a magneto-optical disk, a ROM, and a CD-ROM, and a storage device such as a hard disk built in the computer system.

また、上記プログラムは、このプログラムを記憶装置等に格納したコンピュータシステムから、伝送媒体を介して、あるいは、伝送媒体中の伝送波により他のコンピュータシステムに伝送されてもよい。ここで、プログラムを伝送する「伝送媒体」は、インターネット等のネットワーク（通信網）や電話回線等の通信回線（通信線）のように情報を伝送する機能を有する媒体のことをいう。 The program may be transmitted from a computer system storing the program in a storage device or the like to another computer system via a transmission medium or by a transmission wave in the transmission medium. Here, the “transmission medium” for transmitting the program refers to a medium having a function of transmitting information, such as a network (communication network) such as the Internet or a communication line (communication line) such as a telephone line.

また、上記プログラムは、前述した機能の一部を実現するためのものであってもよい。さらに、前述した機能を、コンピュータシステムにすでに記録されているプログラムとの組み合わせで実現できるもの、いわゆる差分ファイル（差分プログラム）であってもよい。 The program may be for realizing a part of the functions described above. Furthermore, what can implement | achieve the function mentioned above in combination with the program already recorded on the computer system, what is called a difference file (difference program) may be sufficient.

このように、本発明を実施する形態例を、図面を参照して詳述してきたが、具体的な構成はこの実施の形態例に限られるものではなく、この発明の要旨を逸脱しない範囲の設計等も含まれる。 As described above, the embodiment for carrying out the present invention has been described in detail with reference to the drawings. However, the specific configuration is not limited to this embodiment, and the scope of the present invention is not deviated. Design etc. are also included.

１発話列抽出部
２発話意図判別部
３特徴量抽出部
４推定情報生成部
４ａ意図特徴量
４ｂ知識量情報（知識量ラベル）
４ｃモデル生成機能
５知識量推定部
５ａモデル情報
５ｂ通話ｉの特徴量
５ｃ照合機能
６通話ｉの音声認識結果（推定対象）
７音声認識結果（学習用通話）
８通話ｉの話者（ユーザ）の知識量
１０知識量推定装置
１４キーボード
１６マウス
１８ディスプレイ
２２ＣＰＵ
２４ＲＡＭ
２６ＲＯＭ
２８ハードディスク
３０外部Ｉ／Ｆ（インタフェース）
５０通話装置 DESCRIPTION OF SYMBOLS 1 Speech sequence extraction part 2 Speech intention discrimination | determination part 3 Feature-value extraction part 4 Estimated information generation part 4a Intention feature-value 4b Knowledge-amount information (knowledge-amount label)
4c Model generation function 5 Knowledge amount estimation unit 5a Model information 5b Feature amount 5c of call i Collation function 6 Speech recognition result of call i (estimation target)
7 Speech recognition results (call for learning)
8 Knowledge amount of speaker (user) of call i 10 Knowledge amount estimation device 14 Keyboard 16 Mouse 18 Display 22 CPU
24 RAM
26 ROM
28 Hard Disk 30 External I / F (Interface)
50 Communication equipment

Claims

An utterance intention discriminating means for discriminating an utterance indicating an utterance intention corresponding to a question and explanation to the other party from text data of an utterance sequence in the dialogue between the inquirer and the respondent;
Based on the utterance intention to which the utterance determined by the utterance intention determination means corresponds, an intention feature amount indicating a feature amount regarding the occurrence state of the utterance intention in the text data is extracted, and the inquirer is extracted from the text data. Feature quantity extraction means for extracting the total number of types of words used by as vocabulary feature quantities used;
From the dialogue indicated by the text data for the intended feature quantity and the used vocabulary feature quantity extracted by the feature quantity extracting means, and the text data as the extraction target of the intended feature quantity and the used vocabulary feature quantity Knowledge amount estimation information for generating estimation information used for estimation of the knowledge amount corresponding to the text data by using knowledge amount information given in advance as the learning data indicating the knowledge amount of the inquirer assumed Generating means;
A knowledge amount estimation information generating apparatus comprising:

The knowledge amount estimation information generating apparatus according to claim 1, wherein the utterance intention determination unit determines, from the text data, an utterance indicating an utterance intention corresponding to a conflict in addition to the question and explanation to the other party.

The knowledge amount estimation information generation according to claim 1, wherein the intention feature amount includes the number of questions of the inquirer, the number of questions of the respondent, the number of times of explanation of the inquirer, and the number of times of explanation of the respondent. apparatus.

The knowledge amount estimation information generation device according to claim 3, wherein the intention feature amount further includes at least one of the inquiry number of the inquirer and the inquiry number of the respondent.

The feature amount extraction means extracts an utterance including a questionable word in an utterance corresponding to the question as a questionable question sentence, and the intention feature amount includes the number of questions of the inquirer, the number of questions of the respondent, Including the number of times the inquirer explained, the number of times the answerer explained, the number of inquiries from the inquirer, the number of inquiries from the respondent, the number of questions from the inquirer, and the question from the respondent The knowledge amount estimation information generation device according to claim 2, comprising at least one of the number of times.

Storage means for storing in advance the estimation information generated by the knowledge amount estimation information generation device according to any one of claims 1 to 5,
An estimation target utterance intention discriminating means for discriminating an utterance indicating a utterance intention corresponding to a question to a partner, an explanation, from text data of an utterance sequence in a dialogue between an inquirer and a respondent as an estimation target of a knowledge amount;
Together with the utterance is determined by the estimated target speech intention determining means on the basis of the utterance intention appropriate, to extract the estimated target intended feature quantity indicating a feature amount relating to occurrence of the utterance intention of the text data, the text An estimation target feature amount extraction means for extracting a total number of types of words used by the inquirer from data as a used vocabulary feature amount;
Corresponding to the text data to be estimated using the estimation target intention feature quantity and the estimation target used vocabulary feature quantity extracted by the estimation target feature quantity extraction unit and the estimation information stored in the storage unit Knowledge amount estimating means for estimating the knowledge amount;
A knowledge amount estimation device.

The knowledge amount estimation apparatus according to claim 6, wherein the estimation target utterance intention determination unit determines, from the text data, an utterance indicating an utterance intention corresponding to a conflict in addition to the question and explanation to the other party.

The knowledge amount estimation according to claim 6 or 7, wherein the estimation target intention feature amount includes the number of questions of the inquirer, the number of questions of the respondent, the number of times of explanation of the inquirer, and the number of times of explanation of the respondent. apparatus.

The knowledge amount estimation device according to claim 8, wherein the estimation target intention feature amount further includes at least one of the inquiry number of the inquirer and the inquiry number of the respondent.

The estimation target feature amount extraction unit extracts an utterance including a questionable word in an utterance corresponding to the question as a question word question sentence, and the estimation target intention feature amount includes the number of questions of the inquirer, the answerer's question number, Including the number of questions, the number of explanations of the inquirer, the number of explanations of the respondent, the number of questions of the inquirer, the number of questions of the answerer, the number of questions of the questioner of the inquirer, and the number of questions of the respondent The knowledge amount estimation device according to claim 7, comprising at least one of the number of interrogative question sentences.

A knowledge amount estimation information generation method in a knowledge amount estimation information generation device including an utterance intention determination unit, a feature amount extraction unit, and a knowledge amount estimation information generation unit,
An utterance intention determination step of determining an utterance indicating intention of an utterance corresponding to a question to a partner, an explanation, from text data of an utterance string in a dialogue between an inquirer and an answerer by the utterance intention determination means ,
The feature amount extraction means extracts an intention feature amount indicating a feature amount related to the occurrence state of the utterance determined in the utterance intention determination step in the text data, and a word used by the inquirer from the text data. A feature amount extraction step of extracting the total number of types as vocabulary feature amounts used;
The intention feature amount and the used vocabulary feature amount extracted in the feature amount extraction step by the knowledge amount estimation information generating means , and the text data that is the target of extraction of the intention feature amount and the used vocabulary feature amount. The knowledge amount information given in advance as the knowledge amount indicating the knowledge amount of the inquirer assumed from the dialogue indicated by the text data is used as learning data, so that the knowledge amount corresponding to the text data is used for estimation. A knowledge amount estimation information generation step for generating estimation information;
A knowledge amount estimation information generation method comprising:

A knowledge amount estimation method in a knowledge amount estimation device including an estimation target utterance intention determination unit, an estimation target feature amount extraction unit, and a knowledge amount estimation unit,
A storage step of storing in advance in a storage device the estimation information generated by the knowledge amount estimation information generation method according to claim 11;
The estimation target utterance intention discrimination means discriminates an utterance indicating the intention of the utterance corresponding to the question and explanation to the other party from the text data of the utterance string in the dialogue between the inquirer and the respondent as the estimation target of the knowledge amount. An estimation target utterance intention determination step;
The estimation target feature amount extraction unit extracts an estimation target intention feature amount indicating a feature amount related to a generation state in the text data of the utterance determined in the estimation target utterance intention determination step. An estimation target feature amount extraction step for extracting the total number of types of words used by the inquirer as an estimation target use vocabulary feature amount; and
By using the estimation target intention feature quantity and the estimation target used vocabulary feature quantity extracted in the estimation target feature quantity extraction step by the knowledge quantity estimation means, and the estimation information stored in the storage device , A knowledge amount estimation step for estimating the knowledge amount corresponding to the text data;
A knowledge amount estimation method comprising:

A computer for causing the computer to function as the knowledge amount estimation information generation device according to any one of claims 1 to 5 or the knowledge amount estimation device according to any one of claims 6 to 10. program.