JP5574842B2

JP5574842B2 - FAQ candidate extraction system and FAQ candidate extraction program

Info

Publication number: JP5574842B2
Application number: JP2010140675A
Authority: JP
Inventors: 一彰竹原
Original assignee: Nomura Research Institute Ltd
Current assignee: Nomura Research Institute Ltd
Priority date: 2010-06-21
Filing date: 2010-06-21
Publication date: 2014-08-20
Anticipated expiration: 2030-06-21
Also published as: JP2012003704A

Description

本発明は、自然言語処理の技術に関し、特に、音声認識処理により得られた談話データの構造を解析した結果からＦＡＱ（Frequently Asked Questions）の候補を抽出するＦＡＱ候補抽出システムおよびＦＡＱ候補抽出プログラムに適用して有効な技術に関するものである。 The present invention relates to a natural language processing technique, and more particularly, to a FAQ candidate extraction system and a FAQ candidate extraction program for extracting FAQ (Frequently Asked Questions) candidates from the result of analyzing the structure of discourse data obtained by speech recognition processing. It is related to effective technology when applied.

例えば、コールセンター業務において、オペレータやコミュニケータは、電話応対後に応対記録を作成して応対管理システム等へ保存するという一連の業務を行う必要がある。このとき、オペレータ等は、自身の記憶を頼りにして応対記録を作成するため作成に時間がかかるという問題がある。 For example, in a call center operation, an operator or a communicator needs to perform a series of operations such as creating a response record and storing it in a response management system after receiving a call. At this time, there is a problem that it takes time for the operator or the like to create a response record based on his / her memory.

また、各オペレータ等のスキルに応じて品質（内容、統一性、網羅性、簡潔性等）にバラツキが生じる。このように応対記録の品質にバラツキがあったり品質が低かったりすることにより、顧客の声を正確に把握することができなかったり、応対記録の蓄積からテキストマイニング等の技術を利用して新たな知見を得るというようなことができなかったりなどの問題も生じる。 Further, quality (content, uniformity, completeness, conciseness, etc.) varies according to the skill of each operator. In this way, the quality of the response records varies or the quality is low, so it is not possible to accurately grasp the customer's voice, or from the accumulation of response records, new technologies such as text mining are used. Problems such as inability to obtain knowledge also occur.

このような問題を解決するため、音声認識の技術を活用した解決策が検討されている。音声認識の技術によって応対の音声情報をテキスト化して保存することにより、応対記録の網羅性を確保することができる。また、テキスト化したデータに対して自然言語処理の技術を適用して内容を要約することで応対記録を自動作成することも可能であり、応対記録作成にかかるコストを削減して、統一性、簡潔性を確保することで応対記録の活用性を向上させることも可能と考えられる。 In order to solve such a problem, a solution using a speech recognition technology is being studied. By storing the voice information of the response as text using the speech recognition technology, it is possible to ensure the completeness of the response record. It is also possible to automatically create response records by applying natural language processing technology to the text data and summarizing the contents, reducing the cost of response record creation, uniformity, It is possible to improve the usability of response records by ensuring conciseness.

このような技術として、例えば、非特許文献１には、コールセンターにおける音声対話において、音声データをテキスト化し、過去の対話内容とその営業日報から帰納的学習により再帰的に規則獲得を行い、それらの規則を用いて未知の対話内容に対して重要箇所を決定・抽出し、文体変換を行うことで営業日報を自動生成する技術が記載されている。 As such a technique, for example, in Non-Patent Document 1, in voice conversation at a call center, voice data is converted into text, and rules are recursively obtained by inductive learning from past conversation contents and business daily reports. It describes a technology that automatically generates daily business reports by determining and extracting important points for unknown dialogue contents using rules and performing style conversion.

一方で、テキスト化した音声データの蓄積からテキストマイニング等による分析などにより新たな知見を得る技術としては、音声データの対話の種類により種々のものが想定されるが、例えば、上述したコールセンターにおける電話応対の内容からは、よく行われる質問を自動的に抽出し、これに対する回答を用意することでＦＡＱ（Frequently Asked Questions）の作成を効率的・効果的に行うという技術が想定される。 On the other hand, there are various technologies that can be used to obtain new knowledge through analysis by text mining or the like from the accumulation of voice data that has been converted into text, depending on the type of voice data interaction. From the contents of the response, it is assumed that frequently asked questions are automatically extracted, and answers (FAQs) are prepared to prepare FAQ (Frequently Asked Questions) efficiently and effectively.

従来では、予め質問と回答に分類されて蓄積されたデータの集合からＦＡＱとすべき質問−回答対を抽出することは行われていた。また、応対記録のテキストデータなどから、特定のパターンや抽出したパターンとのマッチングと統計的な処理によって、よく行われる質問や回答を抽出するということも行われている。例えば、特開２００１−１３４５７５号公報（特許文献１）には、大量のテキストデータについて、これらを構文解析したデータ構造の集合から出現頻度が高い重要なパターンを検出することでＦＡＱの自動作成を可能とする技術が記載されている。 Conventionally, a question-answer pair to be used as a FAQ has been extracted from a set of data classified and stored in advance as questions and answers. In addition, frequently asked questions and answers are extracted from text data of response records by matching with specific patterns or extracted patterns and statistical processing. For example, in Japanese Patent Laid-Open No. 2001-134575 (Patent Document 1), for a large amount of text data, an FAQ is automatically created by detecting an important pattern having a high appearance frequency from a set of data structures obtained by parsing the text data. The technology to be enabled is described.

さらに、例えば非特許文献２には、近年増えてきたクローズドキャプション付きのテレビ番組を対象として、テレビ番組内のクローズドキャプションの中から、ＣＲＦ（Conditional Random Fields）を用いて質問文を同定し、これに対応する解答文の中心を推定し、さらにＣＲＦを用いて解答文の導入・補足となる情報も合わせて同定することにより、質問−解答対を抽出する技術が記載されている。 Further, for example, in Non-Patent Document 2, for a television program with closed captions that has been increasing in recent years, a question sentence is identified using CRF (Conditional Random Fields) from among closed captions in the television program. Describes a technique for extracting a question-answer pair by estimating the center of an answer sentence corresponding to, and further identifying information for introducing and supplementing the answer sentence using CRF.

特開２００１−１３４５７５号公報JP 2001-134575 A

矢野純司、荒木健治、“コールセンターにおける音声対話を対象とした帰納的学習を用いた営業日報生成手法の性能評価”、情報処理学会研究報告２００７−ＮＬ−１７８、２００７年３月２８日、ｐ．２１−２８Junji Yano, Kenji Araki, “Performance evaluation of daily report generation method using inductive learning for voice conversation in call center”, Information Processing Society of Japan Research Report 2007-NL-178, March 28, 2007, p. 21-28 河野将弘、他６名、“ＣＲＦを用いたテレビ番組クローズドキャプションからの質問−解答対自動抽出”、言語処理学会第１４回年次大会発表論文集、２００８年３月２０日、ｐ．９１５−９１８Masahiro Kawano and 6 others, “Question-answer pair automatic extraction from TV program closed caption using CRF”, Proc. Of the 14th Annual Conference of the Language Processing Society, March 20, 2008, p. 915-918

人の談話を音声認識によりテキスト化したデータ（以下では「談話データ」と記載する場合がある）を活用する場合、談話データ内の文章は通常は話し言葉で記録されているため、正しい文法に則った日本語文や、書き言葉を前提とした既存の自然言語処理の技術ではうまく対応できない場合がある。また、現状の音声認識エンジンの精度では認識ミスを排除することはできず、誤認識による誤字・脱字などのノイズが多い。従って、談話データについて自然言語処理を利用して解析し、有効活用するのは困難である。 When using data obtained by converting a person's discourse into text by speech recognition (hereinafter sometimes referred to as “discourse data”), the sentences in the discourse data are usually recorded in spoken language, so follow the correct grammar. Japanese language sentences and existing natural language processing technology based on written language may not be able to cope well. Moreover, the accuracy of the current speech recognition engine cannot eliminate recognition errors, and there are many noises such as typographical errors and omissions due to erroneous recognition. Therefore, it is difficult to analyze and effectively use discourse data using natural language processing.

一方、テキスト化した音声データの蓄積からＦＡＱの候補となるような質問文といった新たな知見を得るためには、例えば、特許文献１等の技術のような、パターンの抽出とマッチングおよび統計的な処理によって、頻出する質問や回答を抽出するということが考えられる。しかし、これらの技術は、単語や構文パターン等の出現頻度などの情報に基づくものであって、談話データの各文章の意味内容（単に特定の表現パターンを有するというだけではなく、過去の文章からの流れ・展開も考慮した上で判断される意味）を考慮したものではない。従って、データの内容によっては、例えば質問に相当する文章のみを精度よく効率的に抽出するというようなことは十分に行えない場合も想定される。 On the other hand, in order to obtain new knowledge such as a question sentence that is a candidate for FAQ from the accumulation of voice data that has been converted into text, for example, pattern extraction and matching as well as statistical techniques such as the technique of Patent Document 1 and the like are used. It may be possible to extract frequently asked questions and answers by processing. However, these technologies are based on information such as the frequency of appearance of words and syntax patterns, and the meaning content of each sentence of discourse data (not only having a specific expression pattern, but also from past sentences). This does not take into account the meaning that is determined in consideration of the flow and development of Therefore, depending on the content of the data, for example, it may be assumed that it is not possible to sufficiently extract only the sentence corresponding to the question with high accuracy and efficiency.

また、上述の非特許文献２の技術では、テレビ番組のクローズドキャプションから質問−回答対を抽出することができるが、ここでは、対象のクローズドキャプションの内容はスキーマにアノテーションされる、すなわち、あらかじめ規定された文章構造の枠組みに当てはめられ、これに基づいて質問と回答の抽出を行うものである。しかしながら、例えば、コールセンターにおける電話応対をテキスト化したデータなど、決められた文章構造の枠組みやスキーマに当てはめることができない、もしくは困難であるような談話データから質問文を特定して抽出することは、非特許文献２のような技術では困難である。 In the technique of Non-Patent Document 2 described above, a question-answer pair can be extracted from a closed caption of a television program. Here, the content of the target closed caption is annotated in the schema, that is, specified in advance. It is applied to the framework of the sentence structure, and questions and answers are extracted based on this. However, for example, it is possible to identify and extract a question sentence from discourse data that cannot be applied to the framework or schema of a predetermined sentence structure, such as text data of telephone reception at a call center. It is difficult with the technique as described in Non-Patent Document 2.

そこで本発明の目的は、話し言葉やノイズといった談話データの特性に強く、談話の文章構造の枠組みを規定せずに、談話データの構造を解析した結果から得られる意味内容に係る情報に基づいて、談話データからＦＡＱの候補となる質問文を抽出するＦＡＱ候補抽出システムおよびＦＡＱ候補抽出プログラムを提供することにある。本発明の前記ならびにその他の目的と新規な特徴は、本明細書の記述および添付図面から明らかになるであろう。 Therefore, the object of the present invention is strong in the characteristics of discourse data such as spoken words and noise, and based on the information on the semantic content obtained from the result of analyzing the structure of discourse data without prescribing the framework of the discourse sentence structure, An object of the present invention is to provide a FAQ candidate extraction system and a FAQ candidate extraction program for extracting question sentences as FAQ candidates from discourse data. The above and other objects and novel features of the present invention will be apparent from the description of this specification and the accompanying drawings.

本願において開示される発明のうち、代表的なものの概要を簡単に説明すれば、以下のとおりである。 Of the inventions disclosed in this application, the outline of typical ones will be briefly described as follows.

本発明の代表的な実施の形態によるＦＡＱ候補抽出システムは、音声認識処理によって談話の内容がテキスト化された談話データ、および前記談話データにおける前記談話の構造の解析結果である、前記談話全体の意味内容を把握するための情報である談話セマンティクスを入力とし、前記談話データからＦＡＱ候補となる質問文を抽出して出力するＦＡＱ候補抽出システムであって、以下の特徴を有するものである。 The FAQ candidate extraction system according to a representative embodiment of the present invention is a discourse data in which the content of a discourse is converted into text by speech recognition processing, and an analysis result of the discourse structure in the discourse data. A FAQ candidate extraction system that inputs discourse semantics, which is information for grasping meaning content, and extracts and outputs a question sentence as a FAQ candidate from the discourse data, and has the following characteristics.

すなわち、ＦＡＱ候補抽出システムにおいて、前記談話セマンティクスは、前記談話データにおける各ステートメントの意味内容を示すフローの情報を含むフロー情報を含み、ＦＡＱ候補抽出システムは、前記談話データから、顧客によって発話され、前記談話セマンティクスの前記フロー情報において質問文もしくは要求文であることを示すフローが設定された質問・要求ステートメントを抽出する質問抽出部と、前記質問抽出部によって抽出された前記質問・要求ステートメントから、指定されたキーワードを含むものを抽出する候補抽出部と、前記候補抽出部によって抽出された前記質問・要求ステートメントについて、内容が同様のものにクラスタリングし、各クラスタの代表となる前記質問・要求ステートメントをＦＡＱ候補として出力するクラスタリング部とを有することを特徴とするものである。 That is, in the FAQ candidate extraction system, the discourse semantics includes flow information including flow information indicating the semantic content of each statement in the discourse data, and the FAQ candidate extraction system is uttered by the customer from the discourse data. A question extraction unit that extracts a question / request statement in which a flow indicating that it is a question sentence or a request sentence in the flow information of the discourse semantics, and the question / request statement extracted by the question extraction unit, A candidate extraction unit that extracts a keyword including the specified keyword, and the question / request statement that is representative of each cluster by clustering the question / request statements extracted by the candidate extraction unit into the same contents As a FAQ candidate It is characterized in that it has a clustering unit for outputting.

また、本発明は、コンピュータを上記のようなＦＡＱ候補抽出システムとして機能させるプログラムにも適用することができる。 The present invention can also be applied to a program that causes a computer to function as the FAQ candidate extraction system as described above.

本願において開示される発明のうち、代表的なものによって得られる効果を簡単に説明すれば以下のとおりである。 Among the inventions disclosed in the present application, effects obtained by typical ones will be briefly described as follows.

本発明の代表的な実施の形態によれば、話し言葉やノイズといった談話データの特性に強く、談話の文章構造の枠組みを規定せずに、談話データの構造を解析した結果から得られる意味内容に係る情報に基づいて、談話データからＦＡＱの候補となる質問文を抽出することが可能となり、効率的・効果的なＦＡＱの作成を支援することが可能となる。 According to the exemplary embodiment of the present invention, the semantic content is obtained from the result of analyzing the structure of discourse data, without having to define the framework of the discourse sentence structure, strong in the characteristics of discourse data such as spoken words and noise. Based on such information, it is possible to extract a question sentence as a FAQ candidate from the discourse data, and it is possible to support the creation of an efficient and effective FAQ.

本発明の一実施の形態であるＦＡＱ候補抽出システムの構成例の概要について示したブロック図である。It is the block diagram shown about the outline | summary of the structural example of the FAQ candidate extraction system which is one embodiment of this invention. 本発明の一実施の形態における談話構造解析システムの構成例の概要について示したブロック図である。It is the block diagram shown about the outline | summary of the structural example of the discourse structure analysis system in one embodiment of this invention. 本発明の一実施の形態における談話の例および談話セマンティクスの概念について示した図である。It is the figure shown about the example of the discourse in one embodiment of this invention, and the concept of discourse semantics. 本発明の一実施の形態における談話データのフローとコンテキストの例について示した図である。It is the figure shown about the flow of discourse data and the example of a context in one embodiment of this invention. 本発明の一実施の形態における談話構造解析システムのフロー解析ルールの例について示した図である。It is the figure shown about the example of the flow analysis rule of the discourse structure analysis system in one embodiment of this invention. 本発明の一実施の形態における談話構造解析システムのフロー解析部でのフロー解析処理の流れの例を示したフローチャートである。It is the flowchart which showed the example of the flow of the flow analysis process in the flow analysis part of the discourse structure analysis system in one embodiment of this invention. 本発明の一実施の形態における談話構造解析システムの固有表現解析ルールの例について示した図である。It is the figure shown about the example of the specific expression analysis rule of the discourse structure analysis system in one embodiment of this invention. 本発明の一実施の形態における談話構造解析システムの固有表現解析部での固有表現解析処理の流れの例を示したフローチャートである。It is the flowchart which showed the example of the flow of the specific expression analysis process in the specific expression analysis part of the discourse structure analysis system in one embodiment of this invention. 本発明の一実施の形態における談話構造解析システムでの結束性解析の例について示した図である。It is the figure shown about the example of cohesion analysis in the discourse structure analysis system in one embodiment of this invention. 本発明の一実施の形態における談話構造解析システムの結束性解析部での結束性解析処理の流れの例を示したフローチャートである。It is the flowchart which showed the example of the flow of the cohesiveness analysis process in the cohesiveness analysis part of the discourse structure analysis system in one embodiment of this invention. 本発明の一実施の形態における抽出したＦＡＱ候補を表示する画面の例について示した図である。It is the figure shown about the example of the screen which displays the extracted FAQ candidate in one embodiment of this invention. 本発明の一実施の形態におけるＦＡＱ候補抽出システムでのＦＡＱ候補抽出処理の流れの例を示したフローチャートである。It is the flowchart which showed the example of the flow of the FAQ candidate extraction process in the FAQ candidate extraction system in one embodiment of this invention.

以下、本発明の実施の形態を図面に基づいて詳細に説明する。なお、実施の形態を説明するための全図において、同一部には原則として同一の符号を付し、その繰り返しの説明は省略する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. Note that components having the same function are denoted by the same reference symbols throughout the drawings for describing the embodiment, and the repetitive description thereof will be omitted.

本発明の一実施の形態であるＦＡＱ候補抽出システムは、話し言葉を含む談話データについての談話構造の解析結果である談話セマンティクス（談話全体の意味内容を把握するための談話構造に係る情報）に基づいて、談話データから顧客による質問に相当する文章を抽出し、単語の出現頻度に基づく重要度を算出する。さらに、既存のＦＡＱにおける質問文を含めた集合から、指定されたキーワードを含む質問を抽出し、抽出された質問を同様のものにクラスタリングすることで、回答作成の優先度（重要度）が高い順に、回答の整備済／未済の情報を含んだＦＡＱ候補として出力するシステムである。 The FAQ candidate extraction system according to one embodiment of the present invention is based on discourse semantics (information related to discourse structure for grasping the semantic content of the entire discourse), which is the analysis result of discourse structure for discourse data including spoken words. Then, the sentence corresponding to the question from the customer is extracted from the discourse data, and the importance based on the appearance frequency of the word is calculated. Furthermore, by extracting questions that include a specified keyword from a set including question sentences in an existing FAQ, and clustering the extracted questions into similar ones, the priority (importance) of answer creation is high The system sequentially outputs the FAQ candidates including the information on the completed / unanswered answers.

ここで、談話セマンティクスとは、例えば、談話データにおける文字列や形態素から意味内容を推測して、談話データ全体について後述するようなフロー、さらには結束性（談話のセグメント）、固有表現などの談話構造の解析を行って得られた解析結果である。この談話セマンティクスは、例えば、後述する談話構造解析システムを用いることによって談話データから生成してもよいし、他の自然言語処理を利用した文章構造の解析システム等を利用して生成してもよい。また、人手によって談話データにタグ付け等を行って作成することも可能である。本実施の形態では、後述する談話構造解析システムを用いて生成するものとして説明する。 Here, discourse semantics means, for example, a flow as described below for the entire discourse data by inferring the semantic content from character strings and morphemes in the discourse data, and also a discourse such as cohesiveness (discourse segment), proper expression, etc. It is the analysis result obtained by analyzing the structure. This discourse semantics may be generated from discourse data by using a discourse structure analysis system, which will be described later, or may be generated using a sentence structure analysis system using other natural language processing, etc. . It is also possible to manually create a tag by tagging the discourse data. In the present embodiment, description will be made assuming that the generation is performed using a discourse structure analysis system described later.

本実施の形態における談話セマンティクスは、日本語の文章としての内容や構造の厳密な解析結果である必要はなく、例えば、コールセンターにおける応対記録の作成などの実用上支障のないレベルの精度のものであればよい。なお、本明細書において、「談話」とは１人以上の話者による発話行為を指し、例えば、１人で話す講演会でのスピーチ等や、上述したようなコールセンターのオペレータ等と顧客とのやり取り、複数人での電話会議やテレビ会議、その他の複数人での会話・対話が含まれる。 The discourse semantics in this embodiment do not have to be the result of strict analysis of the content and structure of Japanese sentences. For example, the discourse semantics have a level of accuracy that does not impede practical use, such as creating a response record at a call center. I just need it. In this specification, “discussion” refers to an utterance action by one or more speakers, for example, a speech at a lecture where one person speaks, a call center operator as described above, and a customer. Includes communication, multi-person conference calls and video conferencing, and other multi-person conversations and dialogues.

図３は、本実施の形態における談話の例および談話セマンティクスの概念について示した図である。図３の例では、コールセンターのオペレータと顧客とのやり取りからなる談話の例を示している。左側の発話はオペレータの発話であり、右側は顧客の発話を示している。本実施の形態では、各話者の発話１文を「ステートメント」と呼ぶものとする。また、話者毎の連続したステートメントのまとまりを「ブロック」と呼ぶものとする（同一話者の連続したステートメントでも時間的に間隔が空いた場合は別のブロックとなる）。 FIG. 3 is a diagram showing an example of discourse and the concept of discourse semantics in the present embodiment. The example of FIG. 3 shows an example of a discourse consisting of interaction between a call center operator and a customer. The utterance on the left side is the utterance of the operator, and the right side shows the utterance of the customer. In the present embodiment, one sentence of each speaker is called a “statement”. In addition, a group of consecutive statements for each speaker is referred to as a “block” (even if consecutive statements of the same speaker are separated in time, they become different blocks).

図３において、「フロー」とは、各ステートメントの意味内容を端的に示す情報である。この情報をトレースすることによって談話の流れを把握することができる。例えば、図３のオペレータのステートメントにおいて、最初の“はい。”はフローが“挨拶”になっているのに対し、次の“はい。”ではフローが“相槌”となっている。このように、談話の状況（コンテキスト）に応じて、同じ文言のステートメントであってもフローが異なるものとなる場合がある。 In FIG. 3, “flow” is information that simply indicates the semantic content of each statement. By tracing this information, the flow of discourse can be grasped. For example, in the operator statement of FIG. 3, the first “Yes.” Indicates that the flow is “greeting”, while the next “Yes.” Indicates that the flow is “confusion”. In this way, depending on the discourse situation (context), the flow may be different even for statements of the same wording.

また、図３において、「固有表現」とは、談話の内容を個別に特徴付ける表現であり、例えば、図３のコールセンターでのやり取りの談話では、話者のＩＤや人名、企業名、電話番号、所在地などの固有名詞等が該当する。また、「セグメント」とは、ブロック毎の談話の内容が共通する（結束する）ものをまとめてセグメント化したものである。上記の「フロー」、「固有表現」、「セグメント」についての情報は、談話構造の解析結果として「談話セマンティクス」の構成要素となる。 In FIG. 3, “proprietary expression” is an expression that individually characterizes the content of the discourse. For example, in the discourse of exchange at the call center in FIG. 3, the speaker ID, person name, company name, telephone number, Applicable to proper nouns such as location. Further, the “segment” is a segmented set of common (united) contents of the discourse for each block. The information about the above “flow”, “specific expression”, and “segment” is a constituent element of “Discourse Semantics” as the analysis result of the discourse structure.

［システム構成（談話構造解析システム）］
図２は、本発明の一実施の形態における談話構造解析システムの構成例の概要について示したブロック図である。談話構造解析システム２は、サーバやＰＣ（Personal Computer）等のコンピュータシステムによって構成され、例えば、ソフトウェアプログラムによって実装される前処理部１０、フロー解析部２０、形態素解析部３０、固有表現解析部４０、結束性解析部５０の各部と、フロー解析ルール２２、および固有表現解析ルール４２の各テーブルを有し、音声認識エンジン３によって談話の内容がテキスト化された談話データ１００を入力として、談話構造の解析を行って談話セマンティクス２００を出力する。 [System configuration (Discourse structure analysis system)]
FIG. 2 is a block diagram showing an outline of a configuration example of the discourse structure analyzing system according to the embodiment of the present invention. The discourse structure analysis system 2 is configured by a computer system such as a server or a PC (Personal Computer). For example, the preprocessing unit 10, the flow analysis unit 20, the morpheme analysis unit 30, and the proper expression analysis unit 40 are implemented by software programs. The discourse structure having each unit of the cohesiveness analysis unit 50, each table of the flow analysis rule 22 and the specific expression analysis rule 42, and the discourse data 100 in which the content of the discourse is converted into text by the speech recognition engine 3 is input. And the discourse semantics 200 is output.

なお、音声認識エンジン３には既存のものを利用することができる。本実施の形態の談話構造解析システム２は、上述したように、談話データ１００のノイズ（音声認識エンジン３による認識ミス等）に強いという特徴を有するが、音声認識エンジン３の認識率が高いほうがより精度の高い談話構造の解析を行うことが可能であることはいうまでもない。 Note that the existing voice recognition engine 3 can be used. As described above, the discourse structure analysis system 2 according to the present embodiment is characterized by being resistant to noise (such as recognition errors by the speech recognition engine 3) of the discourse data 100, but the recognition rate of the speech recognition engine 3 is higher. Needless to say, it is possible to analyze the discourse structure with higher accuracy.

談話構造解析システム２の前処理部１０は、談話データ１００の入力を受けて、必要に応じて、他の各部での処理が可能なようにデータのレイアウト変換などの前処理を行う機能を有する。ここでは、例えば、話者毎に連続したステートメントをブロックとしてまとめて、各ブロックおよび各ステートメントに順序を示すシーケンス番号（ＩＤ）を割り振ったり、各ブロックの話者のデータを抽出・設定したりなどの処理を行う。なお、談話データ１００は、例えば、ＸＭＬ（eXtended Markup Language）などを利用したデータ構造として表される。 The preprocessing unit 10 of the discourse structure analysis system 2 has a function of receiving input of the discourse data 100 and performing preprocessing such as data layout conversion so that processing can be performed in other units as necessary. . Here, for example, continuous statements for each speaker are grouped into blocks, sequence numbers (IDs) indicating the order are assigned to each block and each statement, and speaker data of each block is extracted and set. Perform the process. The discourse data 100 is represented as a data structure using, for example, XML (eXtended Markup Language).

フロー解析部２０は、前処理部１０による前処理後の談話データ１００に対して、フロー解析ルール２２に基づくルールベースでのフローの解析を行い、解析結果としてフロー情報２１を出力する機能を有する。フロー情報２１は、談話データ１００内の各ステートメントのＩＤとこれに対応するフローの情報を保持し、例えば、ＸＭＬなどを利用したデータ構造として表される。 The flow analysis unit 20 has a function of analyzing a rule-based flow based on the flow analysis rule 22 on the discourse data 100 after the preprocessing by the preprocessing unit 10 and outputting the flow information 21 as an analysis result. . The flow information 21 holds the ID of each statement in the discourse data 100 and the flow information corresponding thereto, and is represented as a data structure using XML or the like, for example.

このフロー情報２１は、談話構造解析システム２の出力として談話セマンティクス２００に含まれ、必須の構成要素となる。すなわち、談話セマンティクス２００には、少なくとも談話データ１００についてのフローの解析結果としてフロー情報２１が含まれる。なお、フロー解析ルール２２の内容、およびフロー解析部２０での処理の詳細については後述する。 This flow information 21 is included in the discourse semantics 200 as an output of the discourse structure analysis system 2 and is an essential component. That is, the discourse semantics 200 includes flow information 21 as a flow analysis result for at least the discourse data 100. Details of the flow analysis rule 22 and details of processing in the flow analysis unit 20 will be described later.

形態素解析部３０は、前処理部１０による前処理後の談話データ１００に対して、形態素（言語における意味を持つ最小の単位）の列に分割して品詞を判別するいわゆる形態素解析を行い、解析結果として形態素情報３１を出力する機能を有する。形態素情報３１は、談話データ１００内の各ステートメントのＩＤとこれに対応する形態素列の情報を保持し、例えば、ＸＭＬなどを利用したデータ構造として表される。 The morpheme analysis unit 30 performs a so-called morpheme analysis on the discourse data 100 after the preprocessing by the preprocessing unit 10 and divides the discourse data into columns of morphemes (the smallest unit having meaning in the language) to determine the part of speech. As a result, the morpheme information 31 is output. The morpheme information 31 holds the ID of each statement in the discourse data 100 and morpheme string information corresponding thereto, and is represented as a data structure using, for example, XML.

この形態素情報３１は、フロー情報２１と同様に、談話構造解析システム２の出力として談話セマンティクス２００に含まれるとともに、後述する固有表現解析部４０や結束性解析部５０に対する入力となる。すなわち、固有表現解析部４０による固有表現の解析や、結束性解析部５０による結束性の解析（セグメント化）を行わない場合は、形態素解析部３０による形態素解析は不要な場合がある。本実施の形態では、後述するＦＡＱ候補抽出システムにおいて形態素情報３１を利用するため、形態素解析部３０による形態素解析は必須である。なお、形態素解析部３０には既存の形態素解析エンジン（例えば、ＭｅＣａｂ（和布蕪）やＣｈａＳｅｎ（茶筌）など）を利用することができる。 Similar to the flow information 21, the morpheme information 31 is included in the discourse semantics 200 as an output of the discourse structure analysis system 2, and becomes an input to the specific expression analysis unit 40 and the cohesiveness analysis unit 50 described later. In other words, when the specific expression analysis by the specific expression analysis unit 40 and the cohesiveness analysis (segmentation) by the cohesiveness analysis unit 50 are not performed, the morpheme analysis by the morpheme analysis unit 30 may be unnecessary. In the present embodiment, since the morpheme information 31 is used in the FAQ candidate extraction system described later, the morpheme analysis by the morpheme analysis unit 30 is essential. Note that an existing morpheme analysis engine (for example, MeCab (Japanese cloth candy) or ChaSen (tea candy)) can be used for the morpheme analysis unit 30.

固有表現解析部４０は、前処理部１０による前処理後の談話データ１００と、形態素解析部３０から出力された形態素情報３１とに基づいて、固有表現解析ルール４２に基づくルールベースでの固有表現の解析を行い、解析結果として固有表現情報４１を出力する機能を有する。すなわち、固有表現解析部４０での解析処理は、少なくとも形態素解析部３０による形態素解析が行われていることが前提となる。固有表現情報４１は、談話データ１００内の各ステートメントＩＤとこれに対応する固有表現のリストの情報を保持し、例えば、ＸＭＬなどを利用したデータ構造として表される。 The specific expression analysis unit 40 is a rule-based specific expression based on the specific expression analysis rule 42 based on the discourse data 100 after the preprocessing by the preprocessing unit 10 and the morpheme information 31 output from the morpheme analysis unit 30. And the specific expression information 41 is output as an analysis result. That is, the analysis process in the specific expression analysis unit 40 is based on the premise that at least morpheme analysis by the morpheme analysis unit 30 is performed. The specific expression information 41 holds information of each statement ID in the discourse data 100 and a list of specific expressions corresponding to the statement ID, and is represented as a data structure using, for example, XML.

この固有表現情報４１は、フロー情報２１と同様に、談話構造解析システム２の出力として談話セマンティクス２００に含まれるが、その後の談話セマンティクス２００を利用するアプリケーション等による利用形態や、求められる談話セマンティクスの精度などによって不要な場合もあり得る。従って、固有表現情報４１の出力は必須ではなく、出力の有無をユーザが選択可能としてもよい。なお、固有表現解析ルール４２の内容、および固有表現解析部４０での処理の詳細については後述する。 The specific expression information 41 is included in the discourse semantics 200 as an output of the discourse structure analysis system 2, as with the flow information 21, but the usage form by the application that uses the discourse semantics 200 thereafter and the required discourse semantics are included. It may be unnecessary depending on accuracy. Therefore, the output of the specific expression information 41 is not essential, and the user may be able to select whether or not to output. Details of the specific expression analysis rule 42 and details of the processing in the specific expression analysis unit 40 will be described later.

結束性解析部５０は、前処理部１０による前処理後の談話データ１００と、フロー解析部２０から出力されたフロー情報２１および形態素解析部３０から出力された形態素情報３１に基づいて、各ブロックの結束性の解析を行ってセグメント化し、解析結果として結束性情報５１を出力する機能を有する。すなわち、固有表現解析部４０での解析処理は、少なくとも形態素解析部３０による形態素解析、およびフロー解析部２０によるフローの解析が行われていることが前提となる。結束性情報５１は、談話データ１００内の各ブロックＩＤと対応するセグメントＩＤの情報を保持し、例えば、ＸＭＬなどを利用したデータ構造として表される。 The cohesiveness analysis unit 50 determines each block based on the discourse data 100 after the preprocessing by the preprocessing unit 10, the flow information 21 output from the flow analysis unit 20, and the morpheme information 31 output from the morpheme analysis unit 30. The cohesiveness is analyzed and segmented, and the cohesiveness information 51 is output as an analysis result. That is, the analysis processing in the specific expression analysis unit 40 is premised on that at least morpheme analysis by the morpheme analysis unit 30 and flow analysis by the flow analysis unit 20 are performed. The cohesiveness information 51 holds segment ID information corresponding to each block ID in the discourse data 100, and is represented as a data structure using XML or the like, for example.

この結束性情報５１は、固有表現情報４１と同様に、談話構造解析システム２の出力として談話セマンティクス２００に含まれるが、その後の談話セマンティクス２００を利用するアプリケーション等による利用形態や、求められる談話セマンティクスの精度などによっては不要な場合もあり得る。従って、結束性情報５１の出力は必須ではなく、出力の有無をユーザが選択可能としてもよい。なお、結束性解析部５０での処理の詳細については後述する。 The cohesiveness information 51 is included in the discourse semantics 200 as an output of the discourse structure analysis system 2 as in the case of the specific expression information 41. However, the usage form by the application using the discourse semantics 200 after that and the required discourse semantics are included. Depending on the accuracy, etc., it may be unnecessary. Therefore, the output of the cohesiveness information 51 is not essential, and the user may be able to select whether or not to output. Details of processing in the cohesiveness analysis unit 50 will be described later.

談話構造解析システム２による解析結果として出力される談話セマンティクス２００は、上述したように、フロー情報２１や、形態素情報３１、固有表現情報４１、結束性情報５１を含んで構成される。このとき、単にこれらのデータの集合を談話セマンティクス２００として取り扱ってもよいし、例えば、これらのデータをＸＭＬを利用した１つのデータ構造に結合してもよい。 As described above, the discourse semantics 200 output as the analysis result by the discourse structure analysis system 2 includes the flow information 21, the morpheme information 31, the unique expression information 41, and the cohesiveness information 51. At this time, a set of these data may simply be handled as the discourse semantics 200, or these data may be combined into one data structure using XML, for example.

［システム構成（ＦＡＱ候補抽出システム）］
図１は、本発明の一実施の形態であるＦＡＱ候補抽出システムの構成例の概要について示したブロック図である。ＦＡＱ候補抽出システム１は、談話構造解析システム２と同様に、サーバやＰＣ等のコンピュータシステムによって構成され、例えば、ソフトウェアプログラムによって実装される質問抽出部６０、スコアリング部７０、候補抽出部８０およびクラスタリング部９０の各部と、既存ＦＡＱ８２のテーブルを有し、談話の内容がテキスト化された談話データ１０１と、談話データ１０１についての談話構造の解析結果である談話セマンティクス２００の全部または一部（特にフロー情報２１、および形態素情報３１）とを入力として、談話データ１０１からＦＡＱの候補となる質問文を抽出してＦＡＱ候補３００として出力するシステムである。 [System configuration (FAQ candidate extraction system)]
FIG. 1 is a block diagram showing an outline of a configuration example of a FAQ candidate extraction system according to an embodiment of the present invention. The FAQ candidate extraction system 1 is configured by a computer system such as a server or a PC, similar to the discourse structure analysis system 2, and includes, for example, a question extraction unit 60, a scoring unit 70, a candidate extraction unit 80, and a software program. Each part of the clustering unit 90 and an existing FAQ 82 table, the discourse data 101 in which the content of the discourse is converted into text, and all or part of the discourse semantics 200 that is the analysis result of the discourse structure of the discourse data 101 (particularly This is a system that receives flow information 21 and morpheme information 31), extracts a question sentence that is a FAQ candidate from the discourse data 101, and outputs it as a FAQ candidate 300.

なお、談話データ１０１は、上述した談話構造解析システム２の前処理部１０によって前処理されたものと同等のもの、すなわち、例えば、話者毎に連続したステートメントがブロックとしてまとめられ、各ブロックおよび各ステートメントに順序を示すシーケンス番号（ＩＤ）が割り振られ、各ブロックの話者のデータが設定されている状態のものである。また、談話セマンティクス２００は、例えば、上述した談話構造解析システム２等によって、談話データ１０１について談話構造（フロー、固有表現、結束性、および形態素）の解析が行われた結果として出力されたものである。 Note that the discourse data 101 is equivalent to that preprocessed by the preprocessing unit 10 of the discourse structure analysis system 2 described above, that is, for example, continuous statements for each speaker are collected as blocks. A sequence number (ID) indicating an order is assigned to each statement, and the speaker data of each block is set. The discourse semantics 200 is output as a result of analyzing the discourse structure (flow, specific expression, cohesiveness, and morpheme) of the discourse data 101 by the discourse structure analysis system 2 described above, for example. is there.

なお、本実施の形態では、談話構造解析システム２によって出力された談話構造の解析結果をＦＡＱ候補抽出システム１に入力する構成としているが、例えば、これらのシステムを１つのコンピュータシステム上に構成し、談話構造の解析処理からＦＡＱ候補３００の抽出までの処理を連続的に実行するようにしてもよい。 In the present embodiment, the analysis result of the discourse structure output by the discourse structure analysis system 2 is input to the FAQ candidate extraction system 1. For example, these systems are configured on one computer system. The processing from the discourse structure analysis processing to the FAQ candidate 300 extraction may be executed continuously.

ＦＡＱ候補抽出システム１の質問抽出部６０は、談話データ１０１と、談話セマンティクス２００のフロー情報２１とに基づいて、談話データ１０１から顧客による質問のステートメントの抽出を行う機能を有する。スコアリング部７０は、質問抽出部６０によって抽出された各ステートメントについて、例えば、含まれる単語の出現頻度の情報に基づいて重要度を算出し、質問情報７１を出力する機能を有する。質問情報７１は、顧客による質問のステートメントもしくはステートメントＩＤと重要度とのリストの情報を保持する。 The question extraction unit 60 of the FAQ candidate extraction system 1 has a function of extracting a question statement by a customer from the discourse data 101 based on the discourse data 101 and the flow information 21 of the discourse semantics 200. The scoring unit 70 has a function of calculating the degree of importance for each statement extracted by the question extraction unit 60 based on the appearance frequency information of the included words and outputting the question information 71, for example. The question information 71 holds a statement of a question by a customer or a list of statement IDs and importance lists.

候補抽出部８０は、質問情報７１に含まれる質問のステートメントに、既存ＦＡＱ８２に保持されている既存のＦＡＱの質問文を含めた集合から、キーワード８１を含む質問のステートメントもしくは質問文を抽出する機能を有する。ここで、キーワード８１は、ユーザからの入力によって指定されてもよいし、例えばＦＡＱ候補抽出システム１や他のシステム等によって抽出された直近の急騰キーワード等を取得して指定するものでもよい。クラスタリング部９０は、候補抽出部８０によって抽出された質問のステートメントおよび質問文を同様のものにクラスタリングしてまとめ、各クラスタの代表となる質問のステートメント（既存のＦＡＱの質問文ではないもの）について、スコアリング部７０で算出した重要度の順にＦＡＱ候補３００として出力する機能を有する。 The candidate extraction unit 80 has a function of extracting a question statement or a question sentence including the keyword 81 from a set including a question sentence of the existing FAQ held in the existing FAQ 82 in the question statement included in the question information 71 Have Here, the keyword 81 may be specified by an input from the user, or may be acquired and specified, for example, the latest rapid keyword extracted by the FAQ candidate extraction system 1 or another system. The clustering unit 90 clusters the question statements and the question sentences extracted by the candidate extraction unit 80 into the same ones, and collects the question statements that are representative of each cluster (not the existing FAQ question sentences). , And a function of outputting the FAQ candidate 300 in the order of importance calculated by the scoring unit 70.

本実施の形態では、上述したように、重要度が高いＦＡＱ候補ほど回答を作成して整備すべき優先度が高いものとしている。また、ＦＡＱ候補が属するクラスタに、既存ＦＡＱ８２に保持された質問文が存在する場合には、既存ＦＡＱ８２に既に対応する回答文が登録されているため、当該ＦＡＱ候補については回答が整備済であるとして取り扱う。なお、ＦＡＱ候補抽出システム１の各部による処理の詳細については後述する。 In the present embodiment, as described above, it is assumed that the FAQ candidates with higher importance have higher priority to be prepared and maintained. If there is a question sentence held in the existing FAQ 82 in the cluster to which the FAQ candidate belongs, the answer sentence corresponding to the existing FAQ 82 has already been registered, and thus the answer has been prepared for the FAQ candidate. Treat as. Details of processing by each unit of the FAQ candidate extraction system 1 will be described later.

［フロー解析］
以下では、図２に示した談話構造解析システム２のフロー解析部２０によるフロー解析の処理の詳細について説明する。図４は、談話データ１００におけるフローとコンテキストの例について示した図である。本実施の形態では、コンテキストは談話の各ブロックに対して設定される。図４の例では、最初のブロック（話者：オペレータ）のコンテキストは“開始”であり、次のブロック（話者：顧客）およびその次のブロック（話者：オペレータ）はともに“陳述”となっている。 [Flow analysis]
Below, the detail of the process of the flow analysis by the flow analysis part 20 of the discourse structure analysis system 2 shown in FIG. 2 is demonstrated. FIG. 4 is a diagram illustrating an example of a flow and context in the discourse data 100. In the present embodiment, a context is set for each block of discourse. In the example of FIG. 4, the context of the first block (speaker: operator) is “start”, and the next block (speaker: customer) and the next block (speaker: operator) are both “declaration”. It has become.

図４の例に示すように、本実施の形態では、原則として、対象のブロックのコンテキストとして、異なる話者の直前のブロックの最後のステートメントに設定されたフローを設定する。なお、談話データ１００の先頭のブロックについては直前のブロックがないため、コンテキストとして常に“開始”を設定するものとする。このようにコンテキストを判断することによって、同じ文言のステートメントであっても談話のコンテキストに沿ったフローの解析が可能となる。 As shown in the example of FIG. 4, in this embodiment, in principle, the flow set in the last statement of the block immediately before a different speaker is set as the context of the target block. Since there is no immediately preceding block for the first block of the discourse data 100, “start” is always set as the context. By determining the context in this way, it is possible to analyze the flow along the discourse context even if the statements have the same wording.

図５は、談話構造解析システム２のフロー解析ルール２２の例について示した図である。フロー解析ルール２２は、ファイルやデータベース等に保持されており、ステートメント中におけるフローを判断するための表現（以下では「フロー表現」と記載する場合がある）と、当該フロー表現とマッチする表現を有するステートメントに対してそのコンテキストに応じて設定するフローのパターン（以下では「フロー定義」と記載する場合がある）との対応について予め指定したリストを保持するテーブルである。 FIG. 5 is a diagram illustrating an example of the flow analysis rule 22 of the discourse structure analysis system 2. The flow analysis rule 22 is held in a file, database, or the like, and includes an expression for determining a flow in a statement (hereinafter sometimes referred to as “flow expression”) and an expression that matches the flow expression. It is a table that holds a list designated in advance with respect to correspondence with a flow pattern (hereinafter, sometimes referred to as “flow definition”) set in accordance with the context of a statement it has.

図５の例に示すように、フロー表現は、フローを判断するための手掛かりとなる文字列の正規表現（例えば、“/はい。/”や“/ですか。/”）によって指定する。従って、フロー解析部２０もしくは談話構造解析システム２は、例えば、既存の正規表現ライブラリなどを導入することにより正規表現に必要な処理を行うことができるようにしておく必要がある。 As shown in the example of FIG. 5, the flow expression is specified by a regular expression of a character string (for example, “/Yes./” or “/ ?? /”) that serves as a clue for determining the flow. Accordingly, the flow analysis unit 20 or the discourse structure analysis system 2 needs to be able to perform processing necessary for regular expressions by introducing, for example, an existing regular expression library.

なお、フロー表現として指定する文字列は、例えば、話し言葉において談話の流れ上特徴的となる文頭の挨拶、返事や、助詞、接続詞、文末表現などを指定するのが望ましく、これらを適用の優先度が高い順に指定する。優先度については、例えば、所定の条件により正規表現の複雑度を計算し、複雑度の降順で優先度を設定するようにしてもよい。なお、指定するフロー表現のパターンは、談話構造解析システム２が対象とする談話の種類（例えば、コールセンターでのやり取りや、企業における定例会議など）に応じて異なる場合がある。 Note that the character string specified as the flow expression should preferably specify, for example, a greeting at the beginning of a sentence, a reply, a particle, a conjunction, a sentence ending expression, etc. Specify in descending order. As for the priority, for example, the complexity of the regular expression may be calculated according to a predetermined condition, and the priority may be set in descending order of the complexity. Note that the flow expression pattern to be specified may differ depending on the type of discourse targeted by the discourse structure analysis system 2 (for example, exchange at a call center or regular meeting in a company).

各フロー表現に対しては、例えば“flow”のラベルによって、フロー定義を指定する。フロー定義では、対象のステートメントのコンテキストと、当該ステートメントに設定するフロー（例えば、“/はい。/”のフロー表現とマッチするステートメントにおいて、コンテキストが“開始”の場合はフローとして“挨拶”を設定する）を指定する。“flow”のラベルによるフロー定義は、各フロー表現において想定されるコンテキストの種類に応じて複数指定することができる。 For each flow expression, for example, a flow definition is specified by a label “flow”. In the flow definition, set “greeting” as the flow if the context is “start” in the statement that matches the context of the target statement and the flow that is set in the statement (for example, “/Yes./”) Specify). A plurality of flow definitions by the label “flow” can be specified according to the type of context assumed in each flow expression.

また、例えば“default”のラベルによって、対象のステートメントのコンテキストがいずれの“flow”のラベルにも該当しなかった場合に設定するフローであるデフォルトフロー（例えば、“/はい。/”のフロー表現とマッチするステートメントにおいて、当該ステートメントのコンテキストがいずれの“flow”のラベルにもマッチしない場合はフローとして“相槌”を設定する）を指定する。なお、デフォルトフローは、各フロー表現において必ず１つ設定するものとする。 In addition, for example, a default flow (for example, a flow expression of “/Yes./”, which is a flow to be set when the context of the target statement does not correspond to any “flow” label by the label “default”. In the statement that matches, if the context of the statement does not match any of the “flow” labels, the flow is set to “contrast”. Note that one default flow is always set in each flow expression.

フロー定義の指定は、談話構造解析システム２が対象とする談話の種類に応じて、一般的な談話の流れのパターンとして主要なものを指定するのが望ましい。例えば、コールセンターでのやり取りにおいては、顧客が“苦情”を発した場合はオペレータは“謝罪”する、などのパターンに基づいて、フロー定義を指定することができる。 For the flow definition, it is desirable to specify a main pattern as a general discourse flow pattern according to the type of discourse targeted by the discourse structure analysis system 2. For example, in the exchange at the call center, the flow definition can be specified based on a pattern such as “apology” if the customer makes a “complaint”.

フロー解析ルール２２では、上述のように、指定するフロー表現や対応するフロー定義の内容が談話構造解析システム２が対象とする談話の種類に応じて異なる場合があるため、談話の種類に応じて適用するフロー解析ルール２２を複数保持して、ユーザが切り換えられるようにするなど、談話の特性に応じて柔軟に対応することも可能である。 In the flow analysis rule 22, as described above, the flow expression to be specified and the content of the corresponding flow definition may differ depending on the type of discourse targeted by the discourse structure analysis system 2. A plurality of flow analysis rules 22 to be applied can be held so that the user can be switched, so that it is possible to flexibly cope with the characteristics of the discourse.

なお、上述のように、本実施の形態では、フロー解析ルール２２において品詞の指定は行っていない。すなわち、フロー解析部２０においては品詞の判断を行わないため、フロー解析部２０での解析処理に際して、事前に形態素解析部３０による形態素解析を行う必要はない。なお、より精度を上げるために形態素情報３１を利用して、例えば、フロー解析ルール２２のフロー表現における正規表現を、後述するように形態素情報３１の形態素列（品詞の情報も含む）に対して適用するようにして、より詳細な条件でマッチングを行うようすることも可能である。 As described above, in this embodiment, the part of speech is not specified in the flow analysis rule 22. That is, since the flow analysis unit 20 does not determine the part of speech, it is not necessary to perform the morpheme analysis by the morpheme analysis unit 30 in advance in the analysis process in the flow analysis unit 20. In order to improve accuracy, the morpheme information 31 is used, for example, a regular expression in the flow expression of the flow analysis rule 22 is applied to a morpheme sequence (including part-of-speech information) of the morpheme information 31 as described later. It is also possible to perform matching under more detailed conditions as applied.

図６は、談話構造解析システム２のフロー解析部２０におけるフロー解析処理の流れの例を示したフローチャートである。フロー解析部２０は、フロー解析処理を開始すると、まず、前処理部１０による前処理後の談話データ１００の各ステートメントについて処理を繰り返すループ処理を開始する（Ｓ１０１）。各ステートメントについてのループ処理を開始すると、フロー解析ルール２２の各フロー表現について処理を繰り返すループ処理を開始する（Ｓ１０２）。 FIG. 6 is a flowchart showing an example of the flow of flow analysis processing in the flow analysis unit 20 of the discourse structure analysis system 2. When starting the flow analysis process, the flow analysis unit 20 first starts a loop process that repeats the process for each statement of the discourse data 100 after the preprocessing by the preprocessing unit 10 (S101). When loop processing for each statement is started, loop processing for repeating processing for each flow expression of the flow analysis rule 22 is started (S102).

各フロー表現についてのループ処理を開始すると、対象のステートメントと対象のフロー表現とがマッチするか否かを判定する（Ｓ１０３）。マッチしない場合は次のフロー表現の処理に移る（Ｓ１０８、Ｓ１０２）。一方、マッチする場合は、まず、対象のステートメントについて、当該ステートメントが属するブロックの直前のブロックにおける最終ステートメントに設定されているフローをコンテキストとして設定する（Ｓ１０４）。このとき、図４で示したように、当該ステートメントが属するブロックが談話データ１００の最初のブロックである場合には、コンテキストに“開始”を設定する。 When the loop processing for each flow expression is started, it is determined whether or not the target statement matches the target flow expression (S103). If there is no match, the process proceeds to the next flow expression process (S108, S102). On the other hand, if there is a match, first, for the target statement, the flow set in the last statement in the block immediately before the block to which the statement belongs is set as a context (S104). At this time, as shown in FIG. 4, when the block to which the statement belongs is the first block of the discourse data 100, “start” is set in the context.

次に、ステップＳ１０４で設定したコンテキストについて、対象のフロー表現におけるフロー定義の”flow”ラベルで指定されたコンテキストとマッチするものがあるか否かを判定する（Ｓ１０５）。マッチするものがある場合には、マッチするフロー定義におけるコンテキストに対応するフローを、対象のステートメントのフローとして設定する（Ｓ１０６）。一方、マッチするものがない場合は、デフォルトフローを対象のステートメントのフローとして設定する（Ｓ１０７）。その後、各フロー表現についてのループ処理を抜ける。 Next, with respect to the context set in step S104, it is determined whether or not there is a context that matches the context specified by the “flow” label of the flow definition in the target flow expression (S105). If there is a match, the flow corresponding to the context in the matching flow definition is set as the flow of the target statement (S106). On the other hand, if there is no match, the default flow is set as the flow of the target statement (S107). Thereafter, the loop processing for each flow expression is exited.

各フロー表現についてのループ処理が終了すると、対象のステートメントにフローが設定されているか否かを判定する（Ｓ１０９）。フローが設定されている場合は何も行わず、また、フローが設定されていない場合は、対象のステートメントに“陳述”のフローを設定して（Ｓ１１０）、次のステートメントの処理に移る（Ｓ１１１、Ｓ１０１）。 When the loop processing for each flow expression is completed, it is determined whether or not a flow is set in the target statement (S109). If the flow is set, nothing is performed. If the flow is not set, the “statement” flow is set in the target statement (S110), and the process proceeds to the next statement (S111). , S101).

ここで、“陳述”のフローは、対象のステートメントについて、フロー解析ルール２２に指定されたフロー表現にマッチする文言を含まない場合に設定されるものである。すなわち、談話の枠組みの中における所定のパターンや定型的なパターン等に該当するステートメント（フロー表現を含むステートメント）ではなく、一般的に談話に固有の事項について具体的な内容を述べているステートメントであると考えられる。 Here, the “statement” flow is set when the target statement does not include a word that matches the flow expression specified in the flow analysis rule 22. In other words, it is not a statement (statement that includes a flow expression) that corresponds to a predetermined pattern or a typical pattern in the framework of discourse, but generally a statement that describes specific details about matters specific to discourse. It is believed that there is.

各ステートメントについてのループ処理が全て終了した場合は、各ステートメントのＩＤとこれに対応するフローの情報をフロー情報２１として出力してフロー解析処理を終了する。 When all the loop processes for each statement are completed, the ID of each statement and the flow information corresponding thereto are output as the flow information 21 and the flow analysis process is terminated.

このように、話し言葉を考慮したフロー表現によって各ステートメントとマッチングし、マッチしたステートメントのコンテキストに応じて適切なフローを設定することで、談話データ１００の全体について話し言葉やノイズに影響されにくいフロー解析を行うことができる。 In this way, flow analysis that considers spoken language is used to match each statement, and by setting an appropriate flow according to the context of the matched statement, flow analysis that is less susceptible to spoken language and noise for the entire discourse data 100 is performed. It can be carried out.

［固有表現解析］
以下では、図２に示した談話構造解析システム２の固有表現解析部４０による固有表現解析の処理の詳細について説明する。図７は、談話構造解析システム２の固有表現解析ルール４２の例について示した図である。固有表現解析ルール４２は、ファイルやデータベース等に保持されており、ステートメント中の固有表現を特定するための正規表現と、当該正規表現とマッチする部分から固有表現を特定して抽出するための規則（以下では「固有表現規則」と記載する場合がある）との対応について予め指定したリストを保持するテーブルである。 [Specific expression analysis]
Hereinafter, the details of the specific expression analysis processing by the specific expression analysis unit 40 of the discourse structure analysis system 2 shown in FIG. 2 will be described. FIG. 7 is a diagram showing an example of the specific expression analysis rule 42 of the discourse structure analysis system 2. The specific expression analysis rule 42 is held in a file, a database, or the like, and a regular expression for specifying a specific expression in a statement and a rule for specifying and extracting a specific expression from a portion that matches the regular expression It is a table that holds a list designated in advance for correspondence with (hereinafter may be described as “specific expression rules”).

図７の例に示すように、ステートメント中の固有表現は正規表現によって特定する。従って、固有表現解析部４０もしくは談話構造解析システム２は、例えば、既存の正規表現ライブラリなどを導入することにより正規表現に必要な処理を行うことができるようにしておく必要がある。なお、固有表現解析においては、通常の文字列に対する正規表現だけではなく、形態素情報３１に保持された各ステートメントの形態素列の表現についても判断する必要があるため、正規表現の処理機能を形態素列の情報を判断することができるよう拡張しておく。 As shown in the example of FIG. 7, the specific expression in the statement is specified by a regular expression. Therefore, the specific expression analysis unit 40 or the discourse structure analysis system 2 needs to be able to perform processing necessary for regular expressions by introducing, for example, an existing regular expression library. In the specific expression analysis, it is necessary to determine not only the regular expression for the normal character string but also the expression of the morpheme string of each statement held in the morpheme information 31. It is expanded so that the information can be judged.

例えば、図７の例では、各正規表現中で形態素列の範囲を“（）”で指定しており、この部分については文字列ではなく形態素列の情報とマッチングするようにする。１つ目の正規表現の例では、“（＜ｆ：組織＞＋）”の部分は、品詞（ｆ）が“組織”であると解析された１つ以上の形態素列を示しており、“（？：の）？”の部分は、表層形が“の”であると解析された０以上１つ以下の品詞を示している。また、“（＜ｆ：人名，姓＞）”の部分は、品詞が“人名，姓”であると解析された１つの形態素列を示している。この正規表現は、例えば、“ＡＡＡ商事の鈴木”や、“ＡＡＡ商事鈴木”などの表現とマッチする。なお、形態素列の表現については、形態素解析部３０における形態素解析エンジン等の出力仕様によって異なる場合がある。 For example, in the example of FIG. 7, the range of the morpheme string is specified by “()” in each regular expression, and this part is matched with the information of the morpheme string instead of the character string. In the first regular expression example, the part of “(<f: organization> +)” indicates one or more morpheme sequences analyzed that the part of speech (f) is “organization”. (?: Of)? "Indicates a part of speech of 0 or more and 1 or less analyzed that the surface shape is" no ". The part of “(<f: person name, surname>)” indicates one morpheme string that has been analyzed with the part of speech being “person name, surname”. This regular expression matches expressions such as “AAA Shoji Suzuki” and “AAA Shoji Suzuki”. Note that the representation of the morpheme string may vary depending on the output specifications of the morpheme analysis engine or the like in the morpheme analysis unit 30.

このとき、対応する固有表現規則において、当該形態素列における１番目の要素（品詞が“組織”の形態素列、例えば“ＡＡＡ商事”）を“企業名”という属性を有する固有表現として特定し、形態素列における２番目の要素（品詞が“人名，姓”の形態素列、例えば“鈴木”）を“人名”という属性を有する固有表現として特定するよう指定している。なお、“（？：）”で囲われた部分（例えば、“の”）については対象の形態素列の要素に含まれないものとして取り扱う。 At this time, in the corresponding specific expression rule, the first element in the morpheme string (the morpheme string whose part of speech is “organization”, for example, “AAA Shoji”) is specified as a specific expression having the attribute “company name”. The second element in the column (the morpheme string whose part of speech is “person name, surname”, for example “Suzuki”) is specified to be specified as a specific expression having the attribute “person name”. It should be noted that a portion (for example, “no”) enclosed by “(?:)” Is treated as not included in the element of the target morpheme string.

図７の２つ目の正規表現の例では、品詞は特に考慮せずに通常の正規表現によって指定しており、“０３−１２３４−５６７８”などの電話番号の表現にマッチする。このとき、対応する固有表現規則において、当該形態素列における１番目の要素（この例で要素は“０３−１２３４−５６７８”等の１つしかない）を“電話番号”という属性を有する固有表現として特定するよう指定している。 In the example of the second regular expression of FIG. 7, the part of speech is specified by a normal regular expression without particular consideration, and matches the expression of a telephone number such as “03-1234-5678”. At this time, in the corresponding specific expression rule, the first element in the morpheme string (in this example, there is only one element such as “03-1234-5678”) is defined as a specific expression having the attribute “phone number”. It is specified to be specified.

なお、正規表現によって特定される固有表現の文字列は、例えば、談話の内容を個別に特徴付ける固有名詞や、数値等からなる文字列などを指定するのが望ましい。また、例えば、話者（顧客等）の感情を示す表現（例えば、苦情の表現や感謝の表現など）からなる文字列を指定して、対応する感情の属性を有する固有表現として特定するようにしてもよい。なお、解析結果として、ある文字列が複数の固有表現に含まれるという場合もあり得る。 It is desirable that the character string of the unique expression specified by the regular expression is, for example, a proper noun that individually characterizes the content of the discourse or a character string made up of numerical values. In addition, for example, a character string composed of expressions (for example, complaint expression or gratitude expression) indicating the emotion of a speaker (customer, etc.) is specified and specified as a specific expression having a corresponding emotion attribute. May be. In addition, as a result of analysis, a certain character string may be included in a plurality of unique expressions.

固有表現解析ルール４２には、上述のように、固有表現を特定するための正規表現と固有表現規則との対応のリストを予め指定しておくが、フロー解析ルール２２と同様に、指定する正規表現や対応する固有表現規則については、談話構造解析システム２が対象とする談話の種類に応じて異なるものを設定したり、談話の種類に応じて適用する固有表現解析ルール４２をユーザが切り換えたりなど、談話の特性に応じて柔軟に対応することが可能である。 In the specific expression analysis rule 42, as described above, a list of correspondence between the regular expression for specifying the specific expression and the specific expression rule is specified in advance. As with the flow analysis rule 22, the normal expression to be specified is specified. For the expression and the corresponding specific expression rule, different ones are set according to the type of discourse targeted by the discourse structure analysis system 2, or the user switches the specific expression analysis rule 42 to be applied according to the type of discourse. It is possible to respond flexibly according to the characteristics of discourse.

図８は、談話構造解析システム２の固有表現解析部４０における固有表現解析処理の流れの例を示したフローチャートである。固有表現解析部４０は、固有表現解析処理を開始すると、まず、談話データ１００の各ステートメントについて処理を繰り返すループ処理を開始する（Ｓ２０１）。各ステートメントについてのループ処理を開始すると、固有表現解析ルール４２の固有表現を特定するための各正規表現について処理を繰り返すループ処理を開始する（Ｓ２０２）。 FIG. 8 is a flowchart showing an example of the flow of the specific expression analysis process in the specific expression analysis unit 40 of the discourse structure analysis system 2. When the specific expression analysis unit 40 starts the specific expression analysis process, first, the specific expression analysis unit 40 starts a loop process that repeats the process for each statement of the discourse data 100 (S201). When the loop processing for each statement is started, loop processing for repeating the processing for each regular expression for specifying the specific expression of the specific expression analysis rule 42 is started (S202).

各正規表現についてのループ処理を開始すると、対象のステートメントに対応する形態素情報３１内の各形態素列と、対象の正規表現とがマッチするか否かを判定する（Ｓ２０３）。マッチしない場合は次の正規表現の処理に移る（Ｓ２０５、Ｓ２０２）。一方、マッチする場合は、対象の正規表現に対応する固有表現規則を適用して固有表現を特定して抽出し（Ｓ２０４）、次の正規表現の処理に移る（Ｓ２０５、Ｓ２０２）。 When the loop processing for each regular expression is started, it is determined whether or not each morpheme sequence in the morpheme information 31 corresponding to the target statement matches the target regular expression (S203). If there is no match, the processing proceeds to the next regular expression (S205, S202). On the other hand, if a match is found, a specific expression is identified and extracted by applying a specific expression rule corresponding to the target regular expression (S204), and the process proceeds to the next regular expression process (S205, S202).

各正規表現についてのループ処理が全て終了した場合は、次のステートメントの処理に移る（Ｓ２０６、Ｓ２０１）。各ステートメントについてのループ処理が全て終了した場合は、各ステートメントのＩＤとこれに対応する固有表現のリストの情報を固有表現情報４１として出力して固有表現解析処理を終了する。 When all the loop processes for each regular expression are completed, the process proceeds to the next statement (S206, S201). When all the loop processing for each statement is completed, the ID of each statement and the information of the list of specific expressions corresponding thereto are output as the specific expression information 41, and the specific expression analysis process is ended.

このように、形態素列の品詞を考慮した正規表現によるマッチングと形態素列の抽出を行うことで、例えば助詞の省略などの話し言葉特有の文法を考慮した固有表現解析を行うことができる。 In this way, by performing matching with a regular expression considering the part of speech of the morpheme string and extracting the morpheme string, it is possible to perform a specific expression analysis considering a grammar peculiar to spoken words such as omission of a particle.

［結束性解析］
以下では、図２に示した談話構造解析システム２の結束性解析部５０による結束性解析の処理の詳細について説明する。図９は、談話構造解析システム２での結束性解析の例について示した図である。結束性解析処理では、まず、上段の図に示すように、談話データ１００における全てのブロックを１つのセグメントとして設定する。 [Cohesion analysis]
In the following, details of cohesiveness analysis processing by the cohesiveness analysis unit 50 of the discourse structure analysis system 2 shown in FIG. 2 will be described. FIG. 9 is a diagram showing an example of cohesiveness analysis in the discourse structure analysis system 2. In the cohesiveness analysis process, first, as shown in the upper diagram, all the blocks in the discourse data 100 are set as one segment.

その後、まず、セグメントの分割処理を行う。分割処理では、各ステートメントについてフロー情報２１において特定の種別のフローが設定されているものを判別し、当該ステートメントが属するブロックにおいてセグメントを分割する。ここで、判別するフローの種別は、談話の切れ目や内容が切り替わる際に出現すると推測されるフローであることが望ましい。 Thereafter, segment division processing is performed first. In the dividing process, it is determined for each statement that a flow of a specific type is set in the flow information 21, and a segment is divided in a block to which the statement belongs. Here, it is desirable that the type of the flow to be identified is a flow that is estimated to appear when the discontinuity or content changes.

例えば、コールセンターにおけるやり取りを始めとする複数人の談話の場合は、質問とそれに対する回答等の応答によって談話が進む場合が多いため、フローが“質問”のステートメントにおいて内容が切り替わることが多いと考えられる。従って、本実施の形態では、フロー情報２１に基づいてフローが“質問”のステートメント（以下では“質問ステートメント”と記載する場合がある）を判別し、図９の中段の図に示すように、質問ステートメントが出現するたびにセグメントを分割するものとする。 For example, in the case of multi-person discourse such as exchanges at a call center, it is often the case that the discourse progresses depending on the response of the question and the response to it, so the content is often switched in the “question” statement of the flow. It is done. Therefore, in the present embodiment, the statement of the flow “question” (hereinafter sometimes referred to as “question statement”) is determined based on the flow information 21, and as shown in the middle diagram of FIG. Each time a question statement appears, the segment shall be split.

ここで、質問ステートメントは、談話構造解析システム１のフロー解析部２０において、相手の話者に対する質問文を示すフロー表現を有するものとして特定されたステートメントである。質問ステートメントを特定するフロー表現としては、例えば、“/ですか。/”や”/よね。/”などの疑問文の文末表現などが考えられる。 Here, the question statement is a statement that is identified in the flow analysis unit 20 of the discourse structure analysis system 1 as having a flow expression indicating a question sentence for the other speaker. As a flow expression for specifying a question statement, for example, the sentence end expression of a question sentence such as “/?./” or “/like./” can be considered.

セグメントを分割した後、セグメントの統合処理を行う。統合処理では、各セグメントについて予め定められた統合条件に該当する質問ステートメントを有する場合に、当該質問ステートメントを有するセグメントを直前のセグメントと統合する。ここで、統合条件は、当該質問ステートメントを有するセグメントについて以前のセグメントから内容が継続している（以前のセグメントと結束性が高い）と推測させる表現の指定である。 After dividing the segments, segment integration processing is performed. In the integration process, when there is a question statement corresponding to a predetermined integration condition for each segment, the segment having the question statement is integrated with the immediately preceding segment. Here, the integration condition is designation of an expression that causes the segment having the question statement to be inferred that the content continues from the previous segment (highly cohesive with the previous segment).

統合条件としては、例えば、当該質問ステートメントについて、直前の質問ステートメントと共通する単語（固有名詞）がある場合や、指示詞を含む場合などが挙げられる。これらの統合条件に該当する質問ステートメントを有するセグメントは以前のセグメントと結束性が高いと判断し、図９の下段の図に示すように、直前のセグメントと統合する。なお、統合処理については、統合可能なセグメントがなくなるまで繰り返すものとする。 Examples of the integration condition include a case where the question statement has a word (proprietary noun) common to the immediately previous question statement or a case where a directive is included. A segment having a question statement corresponding to these integration conditions is determined to be highly cohesive with the previous segment, and is integrated with the immediately preceding segment as shown in the lower diagram of FIG. It should be noted that the integration process is repeated until there are no segments that can be integrated.

図１０は、談話構造解析システム２の結束性解析部５０における結束性解析処理の流れの例を示したフローチャートである。結束性解析部５０は、結束性解析処理を開始すると、まず、談話データ１００の全てのブロックを１つのセグメントに設定する（Ｓ３０１）。具体的には、例えば、全てのブロックにセグメントＩＤの初期値を割り当てるなどによって設定する。その後、セグメントの分割処理として、談話データ１００の各ブロックについて処理を繰り返すループ処理を開始する（Ｓ３０２）。 FIG. 10 is a flowchart showing an example of the flow of cohesiveness analysis processing in the cohesiveness analyzing unit 50 of the discourse structure analyzing system 2. When the cohesiveness analysis unit 50 starts cohesiveness analysis processing, first, all the blocks of the discourse data 100 are set as one segment (S301). Specifically, for example, the initial value of the segment ID is assigned to all the blocks. After that, as a segment dividing process, a loop process for repeating the process for each block of the discourse data 100 is started (S302).

各ブロックについてのループ処理を開始すると、対象のブロックに質問ステートメントが含まれているか否か（フローが“質問”のステートメントがあるか否か）を判定する（Ｓ３０３）。質問ステートメントが含まれている場合は、対象のブロックと直前のブロックとの間でセグメントを分割する（Ｓ３０４）。具体的には、例えば、対象のブロック以降の全てのブロックに次のセグメントＩＤを割り当てるなどによって分割する。その後、次のブロックの処理に移る（Ｓ３０５、Ｓ３０２）。 When the loop processing for each block is started, it is determined whether or not a question statement is included in the target block (whether or not the flow has a “question” statement) (S303). If a question statement is included, the segment is divided between the target block and the immediately preceding block (S304). Specifically, for example, the next segment ID is assigned to all blocks after the target block. Thereafter, the processing moves to the next block (S305, S302).

各ブロックについてのループ処理（分割処理）が終了した場合は、次に、セグメントの結合処理として、上記の分割処理で分割した各セグメントについて処理を繰り返すループ処理を開始する（Ｓ３０６）。各セグメントについてのループ処理を開始すると、対象のセグメントに上記の統合条件に該当する質問ステートメントがあるか否かを判定する（Ｓ３０７）。 When the loop processing (division processing) for each block is completed, next, loop processing that repeats the processing for each segment divided by the above-described division processing is started as segment combination processing (S306). When the loop processing for each segment is started, it is determined whether or not there is a question statement corresponding to the integration condition in the target segment (S307).

統合条件に該当する質問ステートメントがある場合は、対象のセグメントと直前のセグメントとを統合する（Ｓ３０８）。具体的には、例えば、対象のセグメントに含まれる各ブロックのセグメントＩＤを直前のセグメントのセグメントＩＤに更新するなどによって統合する。このとき、以降の各セグメントに含まれる各ブロックのセグメントＩＤを順次繰り上げてもよい。その後、次のセグメントの処理に移る（Ｓ３０９、Ｓ３０６）。各セグメントについてのループ処理（統合処理）が終了した場合は、各ブロックのＩＤと対応する（各ブロックが含まれる）セグメントのＩＤのリストの情報を結束性情報５１として出力して、結束性解析処理を終了する。 If there is a question statement corresponding to the integration condition, the target segment and the previous segment are integrated (S308). Specifically, for example, the integration is performed by updating the segment ID of each block included in the target segment to the segment ID of the immediately preceding segment. At this time, the segment ID of each block included in each subsequent segment may be sequentially incremented. Thereafter, the process proceeds to the next segment (S309, S306). When the loop processing (integration processing) for each segment is completed, information on a list of segment IDs corresponding to the ID of each block (including each block) is output as the cohesiveness information 51 for cohesiveness analysis. The process ends.

これにより、フロー解析によって得られたフロー情報２１に基づいて談話データ１００内の各ステートメントをセグメント化し、談話セマンティクス２００を利用する種々のアプリケーションに対する有用な情報とすることができる。 Thereby, each statement in the discourse data 100 can be segmented based on the flow information 21 obtained by the flow analysis, and can be used as useful information for various applications using the discourse semantics 200.

なお、上述のフロー解析、固有表現解析、結束性解析の各処理によって出力された談話セマンティクス２００に含まれるフロー情報２１、固有表現情報４１、結束性情報５１については、談話構造解析システム２において、ユーザが内容を参照して適宜修正することが可能なインタフェースを設けてもよい。 The flow information 21, the unique expression information 41, and the cohesiveness information 51 included in the discourse semantics 200 output by the processes of the flow analysis, the specific expression analysis, and the cohesiveness analysis described above are described in the discourse structure analysis system 2. An interface that allows the user to refer to the contents and make corrections as appropriate may be provided.

［ＦＡＱ候補抽出］
以下では、図１に示したＦＡＱ候補抽出システム１によるＦＡＱ候補抽出の処理の詳細について説明する。図１１は、ＦＡＱ候補抽出システム１によって抽出したＦＡＱ候補３００を表示する画面の例について示した図である。画面上部には、ＦＡＱ候補３００を抽出する際の基準となるキーワードを指定する入力フィールドを有する。ここでユーザがキーワードを指定すると、ＦＡＱ候補抽出システム１によって、談話データ１０１内の顧客による質問に相当するステートメントおよび既存ＦＡＱ８２内に保持された質問文から抽出されクラスタリングされたＦＡＱ候補３００の内容がリストとして表示される。 [Faq candidate extraction]
Details of the FAQ candidate extraction process by the FAQ candidate extraction system 1 shown in FIG. 1 will be described below. FIG. 11 is a diagram showing an example of a screen that displays the FAQ candidate 300 extracted by the FAQ candidate extraction system 1. In the upper part of the screen, there is an input field for designating a keyword as a reference when extracting the FAQ candidate 300. When the user designates a keyword, the FAQ candidate extraction system 1 extracts the contents of the FAQ candidate 300 that is extracted from the statement corresponding to the question by the customer in the discourse data 101 and the question sentence held in the existing FAQ 82 and clustered. Displayed as a list.

図１１の例では“ＸＸＸ物件”や“契約書”などのキーワード毎に、抽出されクラスタリングされたＦＡＱ候補３００の内容が表示されている。このとき、既に既存ＦＡＱ８２に登録されている同様の質問文が同じクラスタ内にあるＦＡＱ候補については、対応する回答文が既存ＦＡＱ８２に既に登録されているため“整備済”として表示される。一方、既存ＦＡＱ８２にクラスタ内の同様の質問文が登録されていないＦＡＱ候補については、対応する回答文が既存ＦＡＱ８２に整備されていないため“未整備”として表示される。 In the example of FIG. 11, the contents of FAQ candidates 300 extracted and clustered are displayed for each keyword such as “XXX property” and “contract”. At this time, for the FAQ candidates in which the same question text already registered in the existing FAQ 82 is in the same cluster, the corresponding answer text is already registered in the existing FAQ 82 and is displayed as “completed”. On the other hand, FAQ candidates for which similar question texts in the cluster are not registered in the existing FAQ 82 are displayed as “undeveloped” because the corresponding answer text is not maintained in the existing FAQ 82.

“未整備”のＦＡＱ候補については、スーパバイザ等のユーザが回答を作成して整備する必要がある。そのため、例えば、各ＦＡＱ候補の文章部分に設定されているハイパーリンクを指示することにより、当該質問に対する回答を入力する画面（図示しない）に遷移し、回答を整備することができるようにする。なお、本実施の形態では、スコアリング部７０によって算出された重要度の順に回答整備の優先度が高いものとして表示するようにしているが、例えば、“未整備”のものを優先的に上位に表示するなど、表示順は任意に設定することができる。 A supervisor such as a supervisor needs to prepare an answer for the “undeveloped” FAQ candidate. Therefore, for example, by instructing a hyperlink set in the sentence portion of each FAQ candidate, the screen is changed to a screen (not shown) for inputting an answer to the question so that the answer can be prepared. In this embodiment, the priority is given to the priority of answer preparation in the order of importance calculated by the scoring unit 70. For example, “not ready” is given higher priority. The display order can be arbitrarily set.

また、図１１の例では、各ＦＡＱ候補に“質問”もしくは“要求”のフローの情報が表示されている。本実施の形態では、ＦＡＱ候補３００を抽出するにあたり、質問抽出部６０は、談話データ１０１から顧客による質問に相当するステートメントを抽出する際に、談話セマンティクス２００のフロー情報２１においてフローが“質問”もしくは“要求”であるステートメント（以下では“質問・要求ステートメント”と記載する場合がある）を抽出するものとする。 Further, in the example of FIG. 11, “question” or “request” flow information is displayed for each FAQ candidate. In this embodiment, when extracting the FAQ candidate 300, the question extraction unit 60 extracts the statement corresponding to the question from the customer from the discourse data 101, and the flow is “question” in the flow information 21 of the discourse semantics 200. Alternatively, a statement that is a “request” (hereinafter sometimes referred to as “question / request statement”) is extracted.

ここで、“要求”のフローは、談話構造解析システム１のフロー解析部２０において、相手の話者に対する要求を示すフロー表現を有するステートメントに対して設定される。図５に示したフロー解析ルール２２の例において、“要求”のフローに対応するフロー表現としては、例えば、“/したい/”、“/知りたい/”などの動作要求を表す表現などが考えられる。 Here, the flow of “request” is set in the flow analysis unit 20 of the discourse structure analysis system 1 for a statement having a flow expression indicating a request for the partner speaker. In the example of the flow analysis rule 22 shown in FIG. 5, as the flow expression corresponding to the “request” flow, for example, an expression indicating an operation request such as “/ I want /” or “/ I want to know /” is considered. It is done.

図１２は、ＦＡＱ候補抽出システム１におけるＦＡＱ候補抽出処理の流れの例を示したフローチャートである。ＦＡＱ候補抽出システム１は、ＦＡＱ候補抽出処理を開始すると、まず、質問抽出部６０により、談話データ１０１と、談話セマンティクス２００のフロー情報２１とに基づいて、顧客の発話による質問・要求ステートメントを抽出する（Ｓ４０１）。すなわち、フローが“質問”もしくは“要求”であるステートメントを談話データ１０１から抽出する。 FIG. 12 is a flowchart showing an example of the flow of FAQ candidate extraction processing in the FAQ candidate extraction system 1. When the FAQ candidate extraction system 1 starts the FAQ candidate extraction process, the question extraction unit 60 first extracts a question / request statement based on the customer's utterance based on the discourse data 101 and the flow information 21 of the discourse semantics 200. (S401). That is, a statement whose flow is “question” or “request” is extracted from the discourse data 101.

次に、スコアリング部７０により、ステップＳ４０１にて抽出された質問・要求ステートメントから、談話セマンティクス２００の形態素情報３１に基づいて単語を抽出し、抽出した単語毎に、当該単語が各質問・要求ステートメントに含まれる数をそれぞれ集計した情報（転置インデックス）を作成する（Ｓ４０２）。さらに、この転置インデックスに基づいて、各質問・要求ステートメントの重要度を算出し、各質問・要求ステートメントとその重要度の情報を質問情報７１として出力する（Ｓ４０３）。 Next, the scoring unit 70 extracts words based on the morpheme information 31 of the discourse semantics 200 from the question / request statement extracted in step S401, and for each extracted word, the word corresponds to each question / request. Information (transposition index) obtained by counting the numbers included in the statement is created (S402). Further, the importance of each question / request statement is calculated based on this transposed index, and each question / request statement and its importance information are output as question information 71 (S403).

重要度の算出の手法は特に限定されず、例えば一般的に行われる手法として、対象の質問・要求ステートメントに含まれる各単語についてＴＦ−ＩＤＦ（Term Frequency-Inverse Document Frequency）値を算出し、その平均値を対象の質問・要求ステートメントの重要度とする手法を用いることができる。また、例えばＯｋａｐｉ−ＢＭ２５などの他のアルゴリズムを利用してもよい。 The method of calculating the importance is not particularly limited. For example, as a commonly performed method, a TF-IDF (Term Frequency-Inverse Document Frequency) value is calculated for each word included in the target question / request statement. A method can be used in which the average value is the importance of the target question / request statement. Further, for example, another algorithm such as Okapi-BM25 may be used.

次に、ＦＡＱ候補抽出システム１は、例えば図１１に示した画面等を介して、ユーザからＦＡＱ候補を抽出するためのキーワード８１の入力を受け付ける（Ｓ４０４）。当該ステップはＦＡＱ候補抽出処理の最初に行うようにしてもよい。また、ユーザからの入力ではなく、ＦＡＱ候補抽出システム１や他のシステムにおける公知技術を含む他の技術による解析によって、例えば、談話データ１０１内のステートメントや他のテキストデータから最近頻出している急騰キーワードを抽出し、これをキーワード８１として利用してもよい。 Next, the FAQ candidate extraction system 1 accepts input of a keyword 81 for extracting FAQ candidates from the user, for example, via the screen shown in FIG. 11 (S404). This step may be performed at the beginning of the FAQ candidate extraction process. In addition, by the analysis by other techniques including the known technique in the FAQ candidate extraction system 1 and other systems, instead of the input from the user, for example, the rapid increase that has recently occurred frequently from statements in the discourse data 101 and other text data A keyword may be extracted and used as the keyword 81.

次に、候補抽出部８０により、質問情報７１に含まれる質問・要求ステートメントに、既存ＦＡＱ８２に保持されている既存のＦＡＱの質問文を含めた集合から、ステップＳ４０４で入力されたキーワードを含む質問・要求ステートメントもしくは質問文（以下では単に“質問・要求ステートメント”と総称する場合がある）を抽出する（Ｓ４０５）。ここでは、単純にキーワードマッチングにより質問・要求ステートメントを絞り込む。なお、本実施の形態では、既存ＦＡＱ８２の質問文を含めた集合を抽出の対象としているが、質問情報７１のみを対象として抽出するようにしてもよい。 Next, a question including the keyword input in step S404 from the set including the question sentence of the existing FAQ held in the existing FAQ 82 in the question / request statement included in the question information 71 by the candidate extraction unit 80 A request statement or a question sentence (hereinafter, sometimes simply referred to as “question / request statement” in some cases) is extracted (S405). Here, the question / request statement is narrowed down simply by keyword matching. In the present embodiment, a set including the question text of the existing FAQ 82 is targeted for extraction, but only the question information 71 may be extracted.

次に、クラスタリング部９０により、ステップＳ４０５で抽出された質問・要求ステートメントを同様のものにクラスタリングしてまとめ、各クラスタの代表の質問・要求ステートメントをＦＡＱ候補３００として出力し（Ｓ４０６）、処理を終了する。なお、ＦＡＱ候補３００の形式等については特に限定されず、ＸＭＬなどを利用したデータ構造や単純なテキスト形式など種々の形式とすることができる。 Next, the clustering unit 90 clusters the question / request statements extracted in step S405 into similar ones, and outputs representative question / request statements of each cluster as FAQ candidates 300 (S406). finish. The format of the FAQ candidate 300 is not particularly limited, and various formats such as a data structure using XML or a simple text format can be used.

また、ステップＳ４０６において、クラスタリングの手法は特に限定されず、例えば一般的に行われる手法として、ステップＳ４０２で作成された転置インデックスを利用して、各ＦＡＱ候補について含まれる単語の出現頻度に基づく単語ベクトルを作成し、ベクトル空間における単語ベクトル間の代数的な距離（類似度）に基づいて、Ｋ平均法などのアルゴリズムによりクラスタリングする手法を用いることができる。 In step S406, the clustering method is not particularly limited. For example, as a generally performed method, using the transposed index created in step S402, a word based on the appearance frequency of words included in each FAQ candidate A method can be used in which vectors are created and clustered by an algorithm such as a K-average method based on an algebraic distance (similarity) between word vectors in the vector space.

なお、各クラスタから代表となる質問・要求ステートメントを選択してＦＡＱ候補とする際に、同一クラスタ内に質問情報７１に含まれる質問・要求ステートメントと既存ＦＡＱ８２に保持されている既存のＦＡＱの質問文がともに含まれる場合は、質問情報７１に含まれる質問・要求ステートメントの方から代表となる質問・要求ステートメントを選択するものとする。このとき、当該質問・要求ステートメントについては、既存ＦＡＱ８２に対応する回答文が整備されているものと取り扱い、例えばその旨（同一クラスタ内に既存ＦＡＱ８２に保持されている質問文が属する、すなわち回答が整備済である）のフラグを設定するなど識別可能なように出力する。 When selecting a representative question / request statement from each cluster to be a FAQ candidate, the question / request statement included in the question information 71 and the existing FAQ question held in the existing FAQ 82 in the same cluster. When both sentences are included, a representative question / request statement from the question / request statements included in the question information 71 is selected. At this time, the question / request statement is handled as a response sentence corresponding to the existing FAQ 82, for example, to that effect (a question sentence held in the existing FAQ 82 in the same cluster belongs, that is, the answer is It is output so that it can be identified, for example, by setting a flag that is already maintained.

また、ＦＡＱ候補３００の出力に際しては、上述したように、ステップＳ４０３で算出した重要度が高いＦＡＱ候補ほど回答を作成して整備すべき優先度が高いものとして上位にランキングして出力してもよい。出力されたＦＡＱ候補３００の内容は、例えば、図１１に示したような画面で表示される。 Further, when outputting the FAQ candidate 300, as described above, the FAQ candidate having a higher importance calculated in step S403 may be ranked higher and output as a higher priority that should be prepared by preparing an answer. Good. The contents of the output FAQ candidate 300 are displayed on a screen as shown in FIG. 11, for example.

以上に説明したように、本実施の形態のＦＡＱ候補抽出システム１によれば、談話構造の解析結果である、談話データ１０１の各ステートメントについてのフロー情報２１を利用することにより、談話の文章構造の枠組みを規定せずに、談話データ１０１から質問・要求ステートメントを抽出することができる。また、抽出した質問・要求ステートメントを重要度に基づいてスコアリングするとともに、単語ベクトルの距離（類似度）に基づいてクラスタリングすることにより、質問・要求ステートメントから効果的にＦＡＱ候補３００を抽出して出力することができる。また、既存ＦＡＱ８２の質問文と合わせて抽出を行うことで、既存のＦＡＱとの重複を考慮してＦＡＱ候補３００を抽出することができる。 As described above, according to the FAQ candidate extraction system 1 of the present embodiment, the sentence structure of the discourse is obtained by using the flow information 21 for each statement of the discourse data 101, which is the analysis result of the discourse structure. The question / request statement can be extracted from the discourse data 101 without prescribing the framework. In addition to scoring the extracted question / request statement based on importance, clustering based on the distance (similarity) of the word vectors effectively extracts the FAQ candidate 300 from the question / request statement. Can be output. Further, by extracting together with the question text of the existing FAQ 82, the FAQ candidate 300 can be extracted in consideration of duplication with the existing FAQ.

また、質問・要求ステートメントの抽出に際して、談話データ１０１の内容および談話構造の解析結果から必要な部分（フロー）のみを参照して抽出するため、他の部分にノイズやノイズによる誤った解析結果が含まれている場合でも、これらからの影響を極小化して質問・要求ステートメントを抽出することができる。 Further, when extracting the question / request statement, only the necessary part (flow) is referred to and extracted from the contents of the discourse data 101 and the analysis result of the discourse structure, so that an erroneous analysis result due to noise or noise is present in other parts. Even if it is included, it is possible to extract the question / request statement by minimizing the influence from these.

このように、ＦＡＱ候補３００を効果的・効率的に抽出することが可能となることで、スーパバイザ等によるＦＡＱの整備が容易となり、これらのＦＡＱを例えばＷｅｂサイト等で顧客に提供することで、コール数を削減させ、コールセンターの運用コストの削減につなげることも可能となる。 Thus, it becomes possible to extract FAQ candidates 300 effectively and efficiently, facilitating the maintenance of FAQs by supervisors, etc., and providing these FAQs to customers on, for example, websites, It is also possible to reduce the number of calls and reduce the operating cost of the call center.

以上、本発明者によってなされた発明を実施の形態に基づき具体的に説明したが、本発明は前記実施の形態に限定されるものではなく、その要旨を逸脱しない範囲で種々変更可能であることはいうまでもない。 As mentioned above, the invention made by the present inventor has been specifically described based on the embodiment. However, the present invention is not limited to the embodiment, and various modifications can be made without departing from the scope of the invention. Needless to say.

本発明は、音声認識処理により得られた談話データの構造を解析した結果からＦＡＱの候補を抽出するＦＡＱ候補抽出システムおよびＦＡＱ候補抽出プログラムに利用可能である。 The present invention can be applied to a FAQ candidate extraction system and a FAQ candidate extraction program that extract FAQ candidates from the result of analyzing the structure of discourse data obtained by voice recognition processing.

１…ＦＡＱ候補抽出システム、２…談話構造解析システム、３…音声認識エンジン、
１０…前処理部、
２０…フロー解析部、２１…フロー情報、２２…フロー解析ルール、
３０…形態素解析部、３１…形態素情報、
４０…固有表現解析部、４１…固有表現情報、４２…固有表現解析ルール、
５０…結束性解析部、５１…結束性情報、
６０…質問抽出部、
７０…スコアリング部、７１…質問情報、
８０…候補抽出部、８１…キーワード、８２…既存ＦＡＱ、
１００、１０１…談話データ、２００…談話セマンティクス、３００…ＦＡＱ候補。 1 ... FAQ candidate extraction system, 2 ... Discourse structure analysis system, 3 ... Speech recognition engine,
10: Pre-processing unit,
20 ... Flow analysis unit, 21 ... Flow information, 22 ... Flow analysis rule,
30 ... Morphological analyzer, 31 ... Morphological information,
40 ... proper expression analysis unit, 41 ... proper expression information, 42 ... proper expression analysis rule,
50: Cohesiveness analysis unit, 51 ... Cohesiveness information,
60 ... Question extraction unit,
70 ... scoring part, 71 ... question information,
80 ... candidate extraction unit, 81 ... keyword, 82 ... existing FAQ,
100, 101 ... Discourse data, 200 ... Discourse semantics, 300 ... FAQ candidates.

Claims

With the input of discourse semantics, which is information for grasping the semantic content of the whole discourse, which is the analysis result of the discourse structure in the discourse data, and the discourse data in which the content of the discourse is converted into text by voice recognition processing, A FAQ candidate extraction system that extracts and outputs a question sentence as a FAQ candidate from the discourse data,
The discourse semantics includes flow information including flow information indicating the semantic content of each statement in the discourse data,
A question extraction unit that extracts a question / request statement that is uttered by a customer from the discourse data and has a flow indicating that it is a question sentence or a request sentence in the flow information of the discourse semantics;
A candidate extraction unit that extracts a specified keyword from the question / request statement extracted by the question extraction unit;
The question / request statement extracted by the candidate extraction unit and the question statement in the existing FAQ are clustered into the same content, and the question / request statement that is representative of each cluster is set as a FAQ candidate. A FAQ candidate extraction system comprising: a clustering unit that outputs together with information as to whether or not a question sentence extracted from an existing FAQ is included in a cluster to which the group belongs .

The FAQ candidate extraction system according to claim 1 ,
The discourse semantics includes morpheme information including information on a morpheme sequence obtained by dividing each statement into morphemes and determining a part of speech;
Further, for the question / request statement extracted by the question extraction unit, information on the appearance frequency of the included word is acquired based on the morpheme information, and the score for calculating the importance based on the information on the appearance frequency Having a ring part,
The clustering unit outputs the question / request statements included in the FAQ candidate in the order of the importance levels.

The FAQ candidate extraction system according to claim 2 ,
The scoring unit calculates a TF-IDF value for each word included in the question / request statement extracted by the question extraction unit, and uses an average value of the TF-IDF values of the words as the importance level. FAQ candidate extraction system characterized by this.

With the input of discourse semantics, which is information for grasping the semantic content of the whole discourse, which is the analysis result of the discourse structure in the discourse data, and the discourse data in which the content of the discourse is converted into text by voice recognition processing, A FAQ candidate extraction program for causing a computer to function as a FAQ candidate extraction system that extracts and outputs a question sentence as a FAQ candidate from the discourse data,
The discourse semantics includes flow information including flow information indicating the semantic content of each statement in the discourse data,
A question extraction process for extracting a question / request statement uttered by a customer from the discourse data, and a flow indicating a question sentence or a request sentence is set in the flow information of the discourse semantics;
Candidate extraction processing for extracting the one containing the specified keyword from the set including the question sentence in the existing FAQ in the question / request statement extracted by the question extraction processing;
The question / request statements extracted by the candidate extraction process are clustered to have the same contents, and the question / request statements that are representative of each cluster are set as FAQ candidates , and the existing FAQ in the cluster to which the FAQ candidate belongs. A FAQ candidate extraction program that executes clustering processing that is output together with information indicating whether or not a question sentence extracted from is included .

In the FAQ candidate extraction program according to claim 4 ,
The discourse semantics includes morpheme information including information on a morpheme sequence obtained by dividing each statement into morphemes and determining a part of speech;
Further, for the question / request statement extracted by the question extraction process, a score for obtaining information on the appearance frequency of the contained word based on the morpheme information and calculating the importance based on the information on the appearance frequency Perform the ring process,
The FAQ candidate extraction program, wherein the clustering process outputs the question / request statements included in the FAQ candidate in the order of importance.

In the FAQ candidate extraction program according to claim 5 ,
In the scoring process, a TF-IDF value is calculated for each word included in the question / request statement extracted by the question extraction process, and an average value of the TF-IDF values of the words is used as the importance level. FAQ candidate extraction program characterized by the above.