JP6208794B2

JP6208794B2 - Conversation analyzer, method and computer program

Info

Publication number: JP6208794B2
Application number: JP2016051032A
Authority: JP
Inventors: 哲石井
Original assignee: ベクスト株式会社
Priority date: 2016-03-15
Filing date: 2016-03-15
Publication date: 2017-10-04
Anticipated expiration: 2036-03-15
Also published as: JP2017167726A

Description

本発明は、対話型テキストを解析する技術に関する。 The present invention relates to a technique for analyzing interactive text.

顧客及び潜在顧客（以下、顧客等）から寄せられる様々な情報は企業にとって有用である。例えば、既存顧客から寄せられた意見などをテキスト化し、このテキストを解析することで将来に起きるかもしれない商品またはサービスに関する問題の予兆を検出する技術がある（例えば、特許文献１）。 Various information received from customers and potential customers (hereinafter referred to as customers) is useful for companies. For example, there is a technique for detecting a sign of a problem relating to a product or service that may occur in the future by converting an opinion received from an existing customer into text and analyzing the text (for example, Patent Document 1).

特許第５０７８９６０号公報Japanese Patent No. 5078960

近年は、インターネット技術の発達及びＳＮＳ（ソーシャル・ネットワーク・サービス）の普及などにより、顧客等の声を集める手段が多様化している。しかしながら、顧客等とオペレータとが直接会話をして商品等のトラブルに対応したり、顧客等の疑問に答えたりする電話対応業務は、依然として顧客等の声を集める手段として重要である。オペレータが適切に対応することで、他の手段では得がたい情報を得られる場合もあるからである。そのためには、会話スキルの高いオペレータを養成することが重要である。 In recent years, due to the development of Internet technology and the spread of SNS (Social Network Service), means for collecting customers' voices have been diversified. However, the telephone handling operation in which the customer and the operator talk directly with each other to deal with troubles of the product and answer the questions of the customer is still important as a means of collecting the voice of the customer. This is because the operator may be able to obtain information that cannot be obtained by other means by appropriately responding. To that end, it is important to train operators with high conversation skills.

ところで、顧客等とオペレータとの会話で用いられるのは、通常は口語体（話し言葉）であって書き言葉である文語体の文章に比べて文法が不正確で曖昧であり、かつ、挨拶、相づち、感嘆詞など文脈とは関係のない言葉が大量に含まれるという特徴がある。そのため、顧客等とオペレータとの会話をテキスト化した会話テキストの自動解析技術は実用化されていない。 By the way, conversations between customers and operators are usually used in spoken language (spoken language), and the grammar is inaccurate and ambiguous compared to written sentences. It is characterized by a large amount of words that are not related to the context, such as lyrics. For this reason, automatic analysis technology for conversational text, which is a text conversation between a customer and an operator, has not been put to practical use.

そこで、本発明の目的は、会話テキストを自動解析する技術を提供することである。 Therefore, an object of the present invention is to provide a technique for automatically analyzing a conversation text.

本発明の別の目的は、会話テキストを自動解析して、話者の会話の特徴を抽出することである。 Another object of the present invention is to automatically analyze the conversation text and extract the features of the speaker's conversation.

本発明の一つの実施態様に従う会話分析装置は、少なくとも第１の話者と第２の話者との会話の内容を記述したテキストを含む会話記録データを記憶する手段と、前記会話記録データのテキストを複数のユニットに分割し、前記ユニット同士の類似度に基づいて、前記複数のユニットを複数のセグメントに分割するセグメント分け手段と、前記セグメント分け手段によって形成された複数のセグメントを複数のカテゴリに分類するカテゴリ分け手段と、前記カテゴリ分け手段による分類の結果に基づいて、前記会話に含まれる複数のカテゴリ及び前記会話の中でのカテゴリの遷移を示す会話パターンデータを記憶する手段と、を備える。 According to an embodiment of the present invention, there is provided a conversation analysis apparatus for storing conversation record data including text describing at least the contents of a conversation between a first speaker and a second speaker; Segmenting means for dividing the text into a plurality of units and dividing the plurality of units into a plurality of segments based on the similarity between the units; and the plurality of segments formed by the segmenting means in a plurality of categories Categorizing means for classifying into: and a means for storing conversation pattern data indicating a plurality of categories included in the conversation and a transition of categories in the conversation based on a result of classification by the categorizing means; Prepare.

好適な実施態様では、前記会話パターンデータを記憶する手段に記憶された複数の第１の話者の会話パターンデータを参照して、第１の話者別にカテゴリの遷移の特徴を抽出する会話パターン分析手段を、さらに備えてもよい。 In a preferred embodiment, a conversation pattern for extracting a category transition characteristic for each first speaker with reference to conversation pattern data of a plurality of first speakers stored in the means for storing the conversation pattern data. An analysis means may be further provided.

好適な実施態様では、前記セグメント分け手段は、前記複数のユニットを所定数のクラスタに分離するクラスタリング処理を行い、同一クラスタに属し、かつ、連続している複数のユニットで一つのセグメントを形成してもよい。 In a preferred embodiment, the segmenting means performs a clustering process for separating the plurality of units into a predetermined number of clusters, and forms one segment with a plurality of units belonging to the same cluster and continuous. May be.

好適な実施態様では、前記カテゴリ分け手段による分類の結果に基づいて、前記会話に含まれる各カテゴリに要した時間を算出する手段を、さらに備えてもよい。 In a preferred embodiment, there may be further provided means for calculating a time required for each category included in the conversation based on a result of classification by the categorizing means.

好適な実施態様では、前記会話記録データのテキストから不要語を削除する不要語削除手段を、さらに備え、前記セグメント分け手段は、前記不要語削除手段により不要語が削除されたテキストに基づいてセグメント分けを行ってもよい。 In a preferred embodiment, the apparatus further comprises unnecessary word deleting means for deleting unnecessary words from the text of the conversation record data, and the segmenting means performs segmentation based on the text from which unnecessary words have been deleted by the unnecessary word deleting means. Dividing may be performed.

好適な実施態様では、前記不要語削除手段は、前記複数の発話を複数の処理単位に分割し、概念検索機能を用いて前記処理単位と削除対象語との関連度を求めて前記不要語を特定してもよい。 In a preferred embodiment, the unnecessary word deleting means divides the plurality of utterances into a plurality of processing units, obtains a degree of association between the processing unit and the deletion target word using a concept search function, and determines the unnecessary words. You may specify.

本発明の一実施形態にかかる会話分析装置１の機能構成図である。It is a functional lineblock diagram of conversation analysis device 1 concerning one embodiment of the present invention. 会話属性データ３３０の一例を示す図である。It is a figure which shows an example of the conversation attribute data. 会話テキスト３５１及び編集テキスト３５３の一例を示す図である。It is a figure which shows an example of the conversation text 351 and the edit text 353. FIG. 会話パターンデータ３７０の一例を示す図である。It is a figure which shows an example of the conversation pattern data 370. カテゴリ遷移データ３９０の一例を示す図である。It is a figure which shows an example of the category transition data 390. 不要語辞書５１０の一例を示す図である。It is a figure which shows an example of the unnecessary word dictionary. クラスタリングされた会話テキストのユニットとセグメントの関係を示す図である。It is a figure which shows the relationship between the unit of the clustered conversation text, and a segment. カテゴリ辞書５３０の一例を示す図である。It is a figure which shows an example of the category dictionary 530. FIG.

以下、本発明の一実施形態に係る会話分析装置について、図面を参照して説明する。 Hereinafter, a conversation analysis device according to an embodiment of the present invention will be described with reference to the drawings.

図１は、本実施形態にかかる会話分析装置１の機能構成図である。会話分析装置１は、プロセッサ、メモリ及び入出力デバイス等を備える汎用的な情報処理装置に所定のコンピュータプログラムを実行させることで実現されたものであっても良い。 FIG. 1 is a functional configuration diagram of a conversation analysis apparatus 1 according to the present embodiment. The conversation analysis apparatus 1 may be realized by causing a general-purpose information processing apparatus including a processor, a memory, an input / output device, and the like to execute a predetermined computer program.

会話分析装置１は、同図に示すように、データを記憶する手段として、会話音声データ記憶部３１と、会話属性データ記憶部３３と、会話記録データ記憶部３５と、会話パターンデータ記憶部３７と、カテゴリ遷移記憶部３９と、を備える。 As shown in the figure, the conversation analysis apparatus 1 has a conversation voice data storage unit 31, a conversation attribute data storage unit 33, a conversation record data storage unit 35, and a conversation pattern data storage unit 37 as means for storing data. And a category transition storage unit 39.

会話音声データ記憶部３１は、複数の話者が発話した会話の音声データのファイルを記憶する。音声データファイルのファイル名には後述する会話ＩＤが含まれていても良い。 The conversation voice data storage unit 31 stores a voice data file of conversations uttered by a plurality of speakers. The file name of the audio data file may include a conversation ID described later.

会話属性データ記憶部３３は、会話音声データ記憶部３１に保存されている会話音声データの属性を示すデータを記憶する。会話属性データは、例えば、会話を行っている話者、会話が行われた日時及び状況などの情報を含んでも良い。 The conversation attribute data storage unit 33 stores data indicating the attributes of the conversation voice data stored in the conversation voice data storage unit 31. The conversation attribute data may include, for example, information such as a speaker having a conversation, the date and time of the conversation, and the situation.

図２は、会話属性データ３３０の一例を示す図である。 FIG. 2 is a diagram illustrating an example of the conversation attribute data 330.

同図に示すように会話属性データ３３０は、データ項目として会話ＩＤ３３１と、話者ＩＤ３３３とを有する。 As shown in the figure, the conversation attribute data 330 has a conversation ID 331 and a speaker ID 333 as data items.

会話ＩＤ３３１は、会話分析装置１が処理対象としている複数の会話を識別する識別情報である。 The conversation ID 331 is identification information for identifying a plurality of conversations to be processed by the conversation analysis apparatus 1.

話者ＩＤ３３３は、会話ＩＤ３３１の会話に参加している話者の識別情報である。参加している人数分の登録が可能である。話者が特定できない場合は、不明としてもよい。 The speaker ID 333 is identification information of a speaker participating in the conversation with the conversation ID 331. Registration for the number of participants is possible. If the speaker cannot be identified, it may be unknown.

図１に戻ると、会話記録データ記憶部３５は、複数の話者が発話して行われた会話のテキストのデータを記憶する。例えば、会話記録データ記憶部３５は、少なくとも第１の話者と第２の話者の発話とが含まれる会話の内容を記述したテキストを含む会話データのファイルを記憶する。会話テキストデータのファイル名に会話ＩＤが含まれていても良い。会話記録データ記憶部３５は、例えば、会話テキスト３５１及び編集テキスト３５３を記憶する。 Returning to FIG. 1, the conversation record data storage unit 35 stores text data of conversations performed by a plurality of speakers. For example, the conversation record data storage unit 35 stores a conversation data file including text describing the content of a conversation including at least a first speaker and an utterance of a second speaker. The conversation ID may be included in the file name of the conversation text data. The conversation record data storage unit 35 stores, for example, conversation text 351 and edited text 353.

図３は、会話テキスト３５１及び編集テキスト３５３の一例を示す図である。 FIG. 3 is a diagram illustrating an example of the conversation text 351 and the edit text 353.

同図Ａに示すように、会話テキスト３５１は、テキスト部３５１Ａとタイムコード部３５１Ｂとを有する。 As shown in FIG. A, the conversation text 351 has a text part 351A and a time code part 351B.

テキスト部３５１Ａは、複数（少なくとも二人）の話者の発話を含む会話の内容を記述したテキストである。会話においては、複数の話者の発話が交互にされていてもよい。会話中には、挨拶や相づちなどの文脈を特定するために不要な表現が多数含まれている。後述するように、不要語削除処理部１３がこれらの不要な表現を除去するようにしても良い。 The text part 351A is a text describing the content of a conversation including the utterances of a plurality (at least two) speakers. In conversation, the utterances of a plurality of speakers may be alternated. During the conversation, there are many unnecessary expressions to specify the context such as greetings and messages. As will be described later, the unnecessary word deletion processing unit 13 may remove these unnecessary expressions.

タイムコード部３５１Ｂは、発話毎に、各発話が開始したタイムコード及び終了したタイムコードを有する。タイムコード部３５１Ｂは、音声認識処理部１１が会話音声データに基づいて音声認識を行う際に生成されて良い。 The time code unit 351B has, for each utterance, a time code at which each utterance starts and an end time code. The time code unit 351B may be generated when the voice recognition processing unit 11 performs voice recognition based on conversation voice data.

なお、会話テキスト３５１は、音声認識処理部１１で音声データから抽出されたテキスト以外のテキストでもよい。例えば、人が入力したテキストでもよいし、ＯＣＲ（ＯｐｔｉｃａｌＣｈａｒａｃｔｅｒＲｅａｄｅｒ）等で画像から抽出したテキストでもよい。 The conversation text 351 may be text other than the text extracted from the voice data by the voice recognition processing unit 11. For example, text entered by a person may be used, or text extracted from an image by OCR (Optical Character Reader) or the like may be used.

同図Ｂは編集テキスト３５３を示す。編集テキスト３５３は、後述する不要語削除処理によって会話テキスト３５１から不要語が削除されたテキストのデータである。編集テキスト３５３は、テキスト部３５３Ａとタイムコード部３５３Ｂと有する。 FIG. 5B shows the edit text 353. The edited text 353 is text data in which an unnecessary word is deleted from the conversation text 351 by an unnecessary word deletion process described later. The edit text 353 has a text part 353A and a time code part 353B.

図１に戻ると、会話パターンデータ記憶部３７は、会話テキスト３５１の分析によって明らかになった会話パターンを示すデータを記憶する。会話パターンデータは、例えば、会話に含まれる複数のカテゴリ及び前記会話の中でのカテゴリの遷移を示すものでも良い。 Returning to FIG. 1, the conversation pattern data storage unit 37 stores data indicating a conversation pattern that is clarified by analysis of the conversation text 351. The conversation pattern data may indicate, for example, a plurality of categories included in the conversation and category transitions in the conversation.

図４は、会話パターンデータ３７０の一例を示す図である。 FIG. 4 is a diagram illustrating an example of the conversation pattern data 370.

同図に示すように、会話パターンデータ３７０は、データ構造として会話ＩＤ３７１と、セグメント３７３と、時間３７４と、カテゴリ３７５とを有する。 As shown in the figure, the conversation pattern data 370 has a conversation ID 371, a segment 373, a time 374, and a category 375 as a data structure.

セグメント３７３は、会話テキスト３５１を複数に分割して形成されたセグメントの識別情報である。ここでは、各セグメントは、会話に登場した順に、一つの会話ＩＤ３７１に対してユニークな連番が振られている。 The segment 373 is segment identification information formed by dividing the conversation text 351 into a plurality of pieces. Here, each segment is assigned a unique serial number for one conversation ID 371 in the order of appearance in the conversation.

時間３７４は、各セグメントが継続した時間である。セグメントの継続時間については後述する。 Time 374 is the time that each segment lasts. The duration of the segment will be described later.

カテゴリ３７５は各セグメントに割り当てられるカテゴリである。一つのセグメントに割り当てられるカテゴリ３７５は複数あっても良い。 A category 375 is a category assigned to each segment. There may be a plurality of categories 375 assigned to one segment.

会話パターンデータ３７０では、セグメント３７３の番号は会話内でカテゴリが登場した順序に対応する。例えば、図４の会話ＩＤ「１００１」の会話では、セグメント３７３の番号順に、カテゴリＡ→カテゴリＢまたはＣ→カテゴリＣ→カテゴリＤまたはＡの順にカテゴリが遷移したことを示す。 In the conversation pattern data 370, the number of the segment 373 corresponds to the order in which the categories appear in the conversation. For example, in the conversation with the conversation ID “1001” in FIG. 4, the category transitions in the order of category A → category B or C → category C → category D or A in the order of the segment 373.

図１に戻ると、カテゴリ遷移記憶部３９は、会話内でのカテゴリ遷移の特徴を示すデータを記憶する。例えば、カテゴリ遷移記憶部３９は、話者別のカテゴリの遷移の特徴を示す統計データを記憶しても良い。 Returning to FIG. 1, the category transition storage unit 39 stores data indicating the characteristics of the category transition in the conversation. For example, the category transition storage unit 39 may store statistical data indicating characteristics of category transitions by speaker.

図５は、カテゴリ遷移データ３９０の一例を示す図である。 FIG. 5 is a diagram illustrating an example of the category transition data 390.

同図に示すように、カテゴリ遷移データ３９０は、データ項目として、カテゴリ３９１と、話者ＩＤ３９２と、件数３９３と、時間３９４と、遷移カテゴリ３９５とを有する。件数３９３は、カテゴリ３９１及び話者ＩＤ３９２別の件数でよい。時間３９４は、各カテゴリ３９１に割り当てられた、話者ＩＤ３９２別のセグメントの継続時間の総和である。遷移カテゴリ３９５は、カテゴリ３９１から遷移した遷移先のカテゴリである。遷移カテゴリ３９５は、さらに、カテゴリ名、件数及び確率をデータ項目として有する。遷移カテゴリ３９５は、カテゴリ３９１から直接遷移したカテゴリ、つまりカテゴリ３９１の次に登場するカテゴリを対象としてカウントするだけでなく、カテゴリ３９１から他のカテゴリを経由して遷移したカテゴリ、つまり、カテゴリ３９１の次の次に登場するカテゴリを対象としてカウントしても良い。 As shown in the figure, the category transition data 390 includes, as data items, a category 391, a speaker ID 392, the number of cases 393, a time 394, and a transition category 395. The number of cases 393 may be the number of cases for each category 391 and speaker ID 392. The time 394 is a total sum of durations of segments assigned to the respective categories 391 by the speaker ID 392. The transition category 395 is a transition destination category transitioned from the category 391. The transition category 395 further includes a category name, the number of cases, and a probability as data items. The transition category 395 not only counts the category that has directly transitioned from the category 391, that is, the category that appears next to the category 391, but also the category that has transitioned from the category 391 via another category, that is, the category 391. The next category that appears next may be counted.

図１を改めて参照すると、会話分析装置１は、音声認識処理部１１と、不要語削除処理部１３と、セグメント処理部１５と、カテゴライズ処理部１７と、会話パターン分析処理部１９と、を有する。 Referring back to FIG. 1, the conversation analysis device 1 includes a speech recognition processing unit 11, an unnecessary word deletion processing unit 13, a segment processing unit 15, a categorization processing unit 17, and a conversation pattern analysis processing unit 19. .

音声認識処理部１１は、音声データを認識してテキスト抽出する音声認識処理を行う。音声認識処理部１１は、例えば、会話音声データ記憶部３１に保存されている会話の音声ファイルの音声認識を行って、会話の内容を記述したテキストを含む会話テキスト３５１を生成する。会話テキスト３５１のデータは会話記録データ記憶部３５に保存される。 The speech recognition processing unit 11 performs speech recognition processing that recognizes speech data and extracts text. For example, the voice recognition processing unit 11 performs voice recognition of a conversation voice file stored in the conversation voice data storage unit 31 and generates a conversation text 351 including text describing the contents of the conversation. The data of the conversation text 351 is stored in the conversation record data storage unit 35.

音声認識処理部１１は、例えば、音声認識により会話音声データからテキストを抽出し、会話を発話単位で区切ったテキストを含むテキスト部３１５Ａを生成する。さらに音声認識処理部１１は、会話音声データにおける発話の開始及び終了のタイムコードを特定し、テキスト部３５１Ａのテキストと対応させたタイムコード部３５１Ｂを生成する。 For example, the voice recognition processing unit 11 extracts text from conversation voice data by voice recognition, and generates a text part 315A including text obtained by dividing the conversation in units of utterances. Furthermore, the voice recognition processing unit 11 specifies the time code of the start and end of the utterance in the conversation voice data, and generates a time code unit 351B corresponding to the text of the text unit 351A.

音声認識処理部１１は、音声認識によって、それぞれの発話がどの話者によってなされているのかを識別してもよい。さらに、音声認識処理部１１は、識別された話者の特定が可能であれば、特定された話者ＩＤを会話属性データ３３０に登録してもよい。 The voice recognition processing unit 11 may identify which speaker is making each utterance by voice recognition. Furthermore, the speech recognition processing unit 11 may register the identified speaker ID in the conversation attribute data 330 if the identified speaker can be identified.

不要語削除処理部１３は、会話テキスト３５１から不必要な語を削除する。例えば、不要語削除処理部１３は、概念検索機能を用いて会話記録データ記憶部３５に保存されている会話テキスト３５１のテキスト部３５１Ａに含まれる語や文のうち、不要語を特定しても良い。不要語削除処理部１３は、例えば不要語辞書を有し、この辞書を参照して削除対象の不要語を特定するようにしても良い。 The unnecessary word deletion processing unit 13 deletes unnecessary words from the conversation text 351. For example, the unnecessary word deletion processing unit 13 may identify an unnecessary word among words and sentences included in the text portion 351A of the conversation text 351 stored in the conversation record data storage unit 35 using the concept search function. good. The unnecessary word deletion processing unit 13 may have an unnecessary word dictionary, for example, and may specify an unnecessary word to be deleted with reference to this dictionary.

図６は、不要語辞書５１０の一例を示す図である。 FIG. 6 is a diagram illustrating an example of the unnecessary word dictionary 510.

同図に示すように、不要語辞書５１０は、挨拶などの定型句の種類別に、削除対象の表現例及び削除しない表現例を有してもよい。 As shown in the figure, the unnecessary word dictionary 510 may have an expression example to be deleted and an expression example not to be deleted for each type of fixed phrase such as greetings.

不要語削除処理部１３は、会話テキスト３５１のテキストを、例えば形態素に区切り、概念検索により不要語辞書５１０に登録されている表現との関連度を算出するようにしても良い。不要語削除処理部１３は、概念検索の関連度が所定以上である処理単位のテキストを不要語として会話テキスト３５１から削除しても良い。不要語削除処理部１３は、会話テキスト３５１から不要語を削除して編集テキスト３５３を生成する。なお、概念検索は、任意の概念検索のアルゴリズムを用いて行ってもよい。 The unnecessary word deletion processing unit 13 may divide the text of the conversation text 351 into, for example, morphemes, and calculate the degree of association with the expression registered in the unnecessary word dictionary 510 by concept search. The unnecessary word deletion processing unit 13 may delete from the conversation text 351, as a unnecessary word, a text in a processing unit having a degree of relevance of concept search of a predetermined level or more. The unnecessary word deletion processing unit 13 deletes unnecessary words from the conversation text 351 and generates an edit text 353. The concept search may be performed using any concept search algorithm.

テキスト部３５１Ａのテキストにおいて、一つの発話の中の一部が不要語として削除された場合でも、その発話に対応するタイムコードは変更しなくてよい。例えば、ある発話から不要語が削除され、その発話に対応するテキストが短くなったときでも、その発話に対応するタイムコード部３５１Ｂのタイムコードを変更せずに、タイムコード部３５３Ｂを生成してもよい。 Even when a part of one utterance is deleted as an unnecessary word in the text of the text portion 351A, the time code corresponding to the utterance need not be changed. For example, even when an unnecessary word is deleted from an utterance and the text corresponding to the utterance becomes shorter, the time code part 353B is generated without changing the time code of the time code part 351B corresponding to the utterance. Also good.

なお、会話テキスト３５１に対して不要語削除処理を適用することが必須ではない。例えば、チャットを記録した会話テキストには不要語削除処理を適用しなくてもよい。 Note that it is not essential to apply the unnecessary word deletion process to the conversation text 351. For example, the unnecessary word deletion process may not be applied to the conversation text in which the chat is recorded.

セグメント処理部１５は、編集テキスト３５３を複数のセグメントに分割する。例えば、セグメント処理部１５は、テキストを所定の処理単位であるユニットに区切って処理を行っても良い。セグメント処理部１５は、各ユニットの時間を計測してもよい。さらに、セグメント処理部１５は、そのユニット同士の類似度に基づいて、会話中の複数のユニットを複数のセグメントに分類してもよい。セグメント処理部１５は、一人の話者が話し始めてから他の話者が話を始めるまでの間に続いた発話のテキスト部分を一つのユニットとしてもよいし、句読点で区切れる範囲を一つのユニットとしても良いし、一文を一つのユニットとしても良い。 The segment processing unit 15 divides the edited text 353 into a plurality of segments. For example, the segment processing unit 15 may perform processing by dividing a text into units that are predetermined processing units. The segment processing unit 15 may measure the time of each unit. Further, the segment processing unit 15 may classify a plurality of units in conversation into a plurality of segments based on the similarity between the units. The segment processing unit 15 may set the text portion of the utterance that has continued from when one speaker starts speaking until another speaker starts speaking as one unit, or the range delimited by punctuation marks as one unit. Or a sentence as a unit.

セグメント処理部１５は、さらに、セグメントが継続した時間を算出してもよい。例えば、セグメント処理部１５は、同一セグメントに割り当てられた発話の時間を、タイムコード部３５３Ｂを参照して特定し、それらの時間を足し合わせてセグメントの継続時間としてもよい。 The segment processing unit 15 may further calculate the time that the segment has continued. For example, the segment processing unit 15 may specify the utterance time assigned to the same segment with reference to the time code unit 353B, and add these times to make the duration of the segment.

セグメント処理部１５は、会話テキスト３５１を対象として複数のセグメントに分割してもよい。 The segment processing unit 15 may divide the conversation text 351 into a plurality of segments.

セグメント処理部１５は、会話に含まれる多数のユニットをクラスタリング処理によって所定数のクラスタに分類してもよい。クラスタリング処理によって分類されるクラスタ数は任意に定めることができる。いずれのクラスタにも分類されないユニットは「その他」に分類しても良い。セグメント処理部１５は、任意のクラスタリングアルゴリズムを用いることができる。 The segment processing unit 15 may classify a large number of units included in the conversation into a predetermined number of clusters by clustering processing. The number of clusters classified by the clustering process can be arbitrarily determined. Units that are not classified into any cluster may be classified as “others”. The segment processing unit 15 can use any clustering algorithm.

セグメント処理部１５は、テキストの構文解析または意味解析などを行うことなく、ユニット同士の類似度に着目してクラスタリングを行うようにしてもよい。 The segment processing unit 15 may perform clustering by paying attention to the similarity between units without performing text syntax analysis or semantic analysis.

セグメント処理部１５は、クラスタリングの結果に基づいて複数のユニットを複数のセグメントに分割する。 The segment processing unit 15 divides a plurality of units into a plurality of segments based on the result of clustering.

図７は、クラスタリングされた会話テキストのユニットとセグメントの関係を示す図である。 FIG. 7 is a diagram illustrating the relationship between clustered conversation text units and segments.

同図では、編集テキスト３５３が区切られた複数のユニット４２０に項番４１０が振られている。さらに、クラスタ４３０は４つのクラスタ（Ｃ１〜Ｃ４）及びその他を含み、各ユニット４２０が割り当てられたクラスタを示す。 In the figure, an item number 410 is assigned to a plurality of units 420 in which the edit text 353 is divided. Further, the cluster 430 includes four clusters (C1 to C4) and others, and indicates a cluster to which each unit 420 is allocated.

ここで、セグメント処理部１５は、同一クラスタに属し、かつ、連続している複数のユニットを一つのセグメントとしてもよい。一つのセグメントには一つ以上のユニットが割り当てられる。同図の例では、一つに囲まれた項番１−２、項番３−６、項番７、・・・、項番２５−２７、項番２８−２９及び項番３０が、それぞれ一つのセグメントになる。 Here, the segment processing unit 15 may set a plurality of continuous units belonging to the same cluster as one segment. One or more units are assigned to one segment. In the example of the figure, item number 1-2, item number 3-6, item number 7,..., Item number 25-27, item number 28-29 and item number 30 surrounded by one are respectively It becomes one segment.

図１を参照すると、カテゴライズ処理部１７は、セグメント処理部１５によって形成された複数のセグメントを複数のカテゴリに分類してもよい。例えば、カテゴライズ処理部１７は、カテゴリ辞書を有していて、カテゴリ辞書を参照しながら、各セグメントを予め定められた複数のカテゴリのうちのいずれかに分類しても良い。カテゴリは、例えば、会話で話題になり得るトピックを示す分類でよい。 Referring to FIG. 1, the categorization processing unit 17 may classify a plurality of segments formed by the segment processing unit 15 into a plurality of categories. For example, the categorization processing unit 17 may have a category dictionary and classify each segment into one of a plurality of predetermined categories while referring to the category dictionary. The category may be, for example, a classification indicating a topic that can become a topic in conversation.

図８は、カテゴリ辞書５３０の一例を示す図である。カテゴリ辞書５３０は、項目としてカテゴリ５３１と、模範文５３３と、重要単語５３５とを有する。 FIG. 8 is a diagram illustrating an example of the category dictionary 530. The category dictionary 530 includes a category 531, an exemplary sentence 533, and an important word 535 as items.

カテゴライズ処理部１７は、各セグメントに分類されているユニット４２０とカテゴリ辞書５３０の模範文５３３または重要単語５３５とを比較して、各セグメントにいずれかのカテゴリを割り当てる。一つのセグメントに二つ以上のカテゴリが割り当てられることもある。カテゴライズ処理部１７が、各セグメントにカテゴリを割り当てた会話パターンデータ３７０（図４参照）を生成し、会話パターンデータ記憶部３７に保存する。 The categorization processing unit 17 compares the unit 420 classified into each segment with the model sentence 533 or the important word 535 of the category dictionary 530, and assigns any category to each segment. Two or more categories may be assigned to one segment. The categorization processing unit 17 generates conversation pattern data 370 (see FIG. 4) in which a category is assigned to each segment, and stores it in the conversation pattern data storage unit 37.

会話パターン分析処理部１９は、会話パターンデータを分析して、会話の特徴を抽出する。 The conversation pattern analysis processing unit 19 analyzes conversation pattern data and extracts conversation features.

会話パターン分析処理部１９は、例えば、複数の第１の話者の会話パターンデータを参照して、第１の話者別にカテゴリの遷移の特徴を抽出してもよい。例えば、会話パターンデータ記憶部３７に、多数のオペレータがさまざまな顧客等に対応したときの会話テキスト３５１を処理して得られた会話パターンデータ３７０が保存されているとき、会話パターン分析処理部１９は、オペレータ別に、会話内でのあるカテゴリから別のカテゴリへの遷移回数をカウントしてもよいし、遷移する確率を算出しても良い。算出された確率は、カテゴリ遷移データ３９０の遷移先カテゴリ３９５に保存されてもよい。 The conversation pattern analysis processing unit 19 may extract, for example, the characteristics of category transition for each first speaker by referring to the conversation pattern data of a plurality of first speakers. For example, when the conversation pattern data storage unit 37 stores the conversation pattern data 370 obtained by processing the conversation text 351 when a large number of operators correspond to various customers or the like, the conversation pattern analysis processing unit 19 May count the number of transitions from one category to another in the conversation for each operator, or may calculate the probability of transition. The calculated probability may be stored in the transition category 395 of the category transition data 390.

また、会話パターン分析処理部１９は、会話パターンデータに基づいて、第１の話者別に各カテゴリに要した時間を算出してもよい。例えば、会話パターン分析処理部１９は、オペレータ別カテゴリ別に時間３７４を積算して、オペレータ毎に各カテゴリに要した時間を算出してもよい。算出された時間は、カテゴリ遷移データ３９０の時間３９４に保存されてもよい。 The conversation pattern analysis processing unit 19 may calculate the time required for each category for each first speaker based on the conversation pattern data. For example, the conversation pattern analysis processing unit 19 may calculate the time required for each category for each operator by accumulating the time 374 for each operator category. The calculated time may be stored in the time 394 of the category transition data 390.

会話パターン分析処理部１９は、カテゴリ遷移の算出結果として、カテゴリ遷移データ３９０（図５参照）を生成しても良い。カテゴリ遷移データ３９０はカテゴリ遷移記憶部３９に保存される。 The conversation pattern analysis processing unit 19 may generate category transition data 390 (see FIG. 5) as a result of calculating the category transition. The category transition data 390 is stored in the category transition storage unit 39.

これにより、会話で話題にしたカテゴリを抽出できる。さらに、カテゴリの継続時間、及びカテゴリ遷移に着目すると、話者ごとの会話の特徴を抽出できる。例えば、カテゴリの継続時間、カテゴリの出現頻度、特定のカテゴリから別の特定のカテゴリへ遷移する確率等を求めることで、話者別の特徴的な傾向を抽出することも可能である。 This makes it possible to extract categories that are discussed in conversation. Furthermore, focusing on the duration of the category and the category transition, the conversation characteristics for each speaker can be extracted. For example, by determining the duration of a category, the frequency of appearance of a category, the probability of transition from a specific category to another specific category, etc., it is possible to extract a characteristic tendency for each speaker.

特に、オペレータが顧客等に対応したときの会話テキストデータかカテゴリ遷移データを抽出したときには、オペレータごとの顧客応対スキルを可視化することができる。例えば、評価の高いオペレータのカテゴリ遷移の傾向を把握できれば、新人オペレータ教育に活用できる可能性がある。 In particular, when conversation text data or category transition data when an operator corresponds to a customer or the like is extracted, the customer reception skill for each operator can be visualized. For example, if the tendency of category transition of highly evaluated operators can be grasped, there is a possibility that it can be used for new operator education.

上述した本発明の実施形態は、本発明の説明のための例示であり、本発明の範囲をそれらの実施形態にのみ限定する趣旨ではない。当業者は、本発明の要旨を逸脱することなしに、他の様々な態様で本発明を実施することができる。 The above-described embodiments of the present invention are examples for explaining the present invention, and are not intended to limit the scope of the present invention only to those embodiments. Those skilled in the art can implement the present invention in various other modes without departing from the gist of the present invention.

１会話分析装置
１１音声認識処理部
１３不要語削除処理部
１５セグメント処理部
１７カテゴライズ処理部
１９会話パターン分析処理部
３１会話音声データ記憶部
３３会話属性データ記憶部
３５会話記録データ記憶部
３７会話パターンデータ記憶部
３９カテゴリ遷移記憶部 DESCRIPTION OF SYMBOLS 1 Conversation analysis apparatus 11 Speech recognition process part 13 Unnecessary word deletion process part 15 Segment process part 17 Categorize process part 19 Conversation pattern analysis process part 31 Conversation voice data storage part 33 Conversation attribute data storage part 35 Conversation record data storage part 37 Conversation pattern Data storage unit 39 Category transition storage unit

Claims

Means for storing conversation record data including text describing the content of a conversation between a first speaker who is a customer-facing operator and a second speaker including the customer ;
A clustering process is performed for dividing the text of the conversation recording data into a plurality of units, measuring the time of the units, and separating the text of the conversation recording data into a predetermined number of clusters based on the similarity between the units. The text of the conversation recording data is divided into a plurality of segments so as to form one segment with a plurality of continuous units belonging to the same cluster, and the time of the units of the same segment is added to form a segment Segment processing means for calculating the duration of
A category Rise processing means for classifying the plurality of segments formed by the segment processing means, into a plurality of categories respectively indicating topics related to the requirements or conditions of the second speaker,
And the duration of the segment, a storage means for storing the speech pattern data indicating a result of classification by the category rise processing means,
Referring to the conversation pattern data of the first speaker stored in the storage means, the conversation pattern analysis for extracting the category included in the conversation of the first speaker, the time required for the category, and the category transition Processing means;
Conversation analyzer with

The conversation analysis apparatus according to claim 1, wherein the category transition is indicated by a category name, a number of cases, and a probability of a transition destination category that has transitioned from the category.

The conversation analysis apparatus according to claim 1, wherein the time required for the category is a total sum of durations of segments classified into the category in the conversation pattern data.

An unnecessary word dictionary in which a deletion target word is registered is held in advance, unnecessary words are identified from the text of the conversation recording data with reference to the unnecessary word dictionary , and further includes unnecessary word deletion processing means for deleting,
The segment processing means, said performing segmentation based on the text of unnecessary words are removed by the unnecessary word deletion section, conversation analysis apparatus according to any one of claims 1-3.

5. The unit according to claim 1, wherein the unit is a text portion of an utterance that lasts from the time when one speaker starts speaking to the time when another speaker starts speaking, a range separated by punctuation marks, or a sentence. The conversation analysis device described in 1.

A conversation analysis method performed by an information processing apparatus including means for storing conversation record data including text describing the contents of a conversation between a first speaker who is a customer-facing operator and a second speaker including the customer There,
Segment processing means divides the text of the conversation recording data into a plurality of units, measures the time of the units, and separates the text of the conversation recording data into a predetermined number of clusters based on the similarity between the units. The conversation recording data text is divided into a plurality of segments so that a plurality of continuous units belonging to the same cluster and a plurality of continuous units form one segment. To calculate the duration of the segment by adding together ,
Category Rise processing means, a step of classifying a plurality of segments formed by the segment processing means, into a plurality of categories respectively indicating topics related to the requirements or conditions of the second speaker,
And recording the duration of the segment, the speech pattern data indicating a result of classification by the category rise processing means,
The conversation pattern analysis processing means refers to the recorded conversation pattern data of the plurality of first speakers, the category included in the conversation of the first speaker, the time required for the category, and the category transition Extracting the
Having a method.

In order to perform conversation analysis in an information processing apparatus comprising means for storing conversation record data including text describing the contents of a conversation between a first speaker who is a customer-facing operator and a second speaker including the customer Computer program,
When executed on the information processing apparatus,
Segment processing means divides the text of the conversation recording data into a plurality of units, measures the time of the units, and separates the text of the conversation recording data into a predetermined number of clusters based on the similarity between the units. The conversation recording data text is divided into a plurality of segments so that a plurality of continuous units belonging to the same cluster and a plurality of continuous units form one segment. To calculate the duration of the segment by adding together ,
Category Rise processing means, a step of classifying a plurality of segments formed by the segment processing means, into a plurality of categories respectively indicating topics related to the requirements or conditions of the second speaker,
And recording the duration of the segment, the speech pattern data indicating a result of classification by the category rise processing means,
The conversation pattern analysis processing means refers to the recorded conversation pattern data of the plurality of first speakers, the category included in the conversation of the first speaker, the time required for the category, and the category transition And a step of extracting a computer program.