JP6438674B2

JP6438674B2 - Response system, response method, and computer program

Info

Publication number: JP6438674B2
Application number: JP2014093147A
Authority: JP
Inventors: ハキム翔太松田
Original assignee: NTT Communications Corp
Current assignee: NTT Communications Corp
Priority date: 2014-04-28
Filing date: 2014-04-28
Publication date: 2018-12-19
Anticipated expiration: 2034-04-28
Also published as: JP2015211403A

Description

本発明は、音声を自動認識し応答する技術に関する。 The present invention relates to a technique for automatically recognizing and responding to voice.

従来、コールセンタでは、利用者が音声ガイダンスに従ってプッシュダイヤルすることによって、希望するサービスメニューを選択している。
一方で、音声データを解析することによって話者の感情を認識する技術や、音声データをテキストデータに変換する技術などの音声認識技術が実用化されている（例えば、特許文献１及び２参照。）。
近年では、音声認識技術を用いて通話中の利用者の感情をモニタリングすることによって、感情が怒りの状態にある利用者にはベテランのオペレータをアサインする技術が提案されている。また、音声認識技術を用いて通話内容をテキストデータに変換することによって、通話内容に関連する情報をオペレータに即座に提供する技術が提案されている。 Conventionally, in a call center, a user selects a desired service menu by performing push dialing according to voice guidance.
On the other hand, and a technique for recognizing emotion of the speaker by analyzing the audio data, voice recognition techniques, such as converting the audio data into text data has been put into practical use (for example, see Patent Documents 1 and 2 .)
In recent years, there has been proposed a technique for assigning a veteran operator to a user whose emotion is angry by monitoring the emotion of the user during a call using voice recognition technology. In addition, a technique has been proposed in which information related to a call content is immediately provided to an operator by converting the call content into text data using a voice recognition technology.

特開２０１２−１１３５４２号公報JP 2012-113542 A 特開２０１２−４７８７５号公報JP 2012-47875 A

しかしながら、従来のコールセンタでは、上記のような音声認識技術を用いて利用者の感情を認識したとしても、利用者の感情を考慮して応対するのはオペレータであり、そのような応対が可能な人的リソースを確保しなければいけないという問題があった。 However, in the conventional call center, even if the user's emotion is recognized by using the voice recognition technology as described above, it is an operator that takes into account the user's emotion, and such response is possible. There was a problem that human resources had to be secured.

上記事情に鑑み、本発明は、コールセンタサービスの提供において、利用者の感情に応じた対応を、より少ない人的リソースで行うことができる技術を提供することを目的としている。 In view of the above circumstances, an object of the present invention is to provide a technique that can perform a response according to a user's emotion with less human resources in providing a call center service.

本発明の一態様は、コールセンタの利用者の通話装置から送信される音声データに基づいて、前記利用者の感情を表す利用者感情情報を取得する感情認識部と、前記音声データをテキストデータに変換する音声変換部と、前記テキストデータと前記利用者感情情報とに基づいて、利用者に応答するメッセージをメッセージ候補の中から選択する応答部と、を備える応答システムである。 One aspect of the present invention is an emotion recognition unit that acquires user emotion information representing a user's emotion based on voice data transmitted from a call device of a call center user, and the voice data is converted into text data. A response system comprising: a voice conversion unit for conversion; and a response unit that selects a message to be responded to a user from message candidates based on the text data and the user emotion information.

本発明の一態様は、上記の応答システムであって、前記応答部は、利用者への応答の種別を表す応答種別情報とテキストデータとを対応づけて記憶する記憶部から、前記テキストデータに基づいて前記応答種別情報を取得し、取得した前記応答種別情報と、前記利用者感情情報とに基づいて、利用者に応答するメッセージをメッセージ候補の中から選択する。 One aspect of the present invention is the above-described response system, in which the response unit stores the response type information indicating the type of response to the user and text data in association with the text data. Based on the acquired response type information and the user emotion information, a message that responds to the user is selected from message candidates.

本発明の一態様は、上記の応答システムであって、前記応答部は、利用者への応答において実行すべき処理の内容を表す応答内容情報と応答種別情報とを対応づけて記憶する前記記憶部から、前記応答種別情報に基づいて前記応答内容情報を取得し、取得した前記応答内容情報と、前記利用者感情情報とに基づいて、利用者に応答するメッセージをメッセージ候補の中から選択する。 One aspect of the present invention is the above-described response system, in which the response unit stores response content information representing the content of processing to be executed in response to a user and response type information in association with each other. The response content information is acquired based on the response type information, and a message that responds to the user is selected from message candidates based on the acquired response content information and the user emotion information. .

本発明の一態様は、上記の応答システムであって、コールセンタのオペレータが過去に利用者に対して応答したメッセージのうち、利用者の感情を変化させたメッセージを、前記メッセージ候補に追加又は前記メッセージ候補から削除する追加部をさらに備える。 One aspect of the present invention is the above-described response system, in which a message in which a call center operator responds to a user in the past is a message in which a user's emotion is changed is added to the message candidate or the message candidate An additional unit for deleting from the message candidates is further provided.

本発明の一態様は、上記の応答システムであって、前記追加部は、コールセンタのオペレータが過去に利用者に対して応答したメッセージのうち、利用者の感情をより良い感情に変えることができたメッセージを前記メッセージ候補に追加する。 One aspect of the present invention is the response system described above, wherein the additional unit can change a user's emotion into a better emotion among messages that the call center operator has responded to the user in the past. Added to the message candidates.

本発明の一態様は、上記の応答システムであって、前記応答部は、前記利用者感情情報をコールセンタのオペレータに提供する。 One aspect of the present invention is the response system described above, wherein the response unit provides the user emotion information to an operator of a call center.

本発明の一態様は、コールセンタの利用者の通話装置から送信される音声データに基づいて、前記利用者の感情を表す利用者感情情報を取得する感情認識ステップと、前記音声データをテキストデータに変換する音声変換ステップと、前記テキストデータと前記利用者感情情報とに基づいて、利用者に応答するメッセージをメッセージ候補の中から選択する応答ステップと、を有する応答方法である。 One aspect of the present invention is an emotion recognition step of acquiring user emotion information representing the user's emotion based on voice data transmitted from a call device of a call center user, and the voice data is converted into text data. A response method comprising: a voice conversion step for conversion; and a response step for selecting a message to be responded to a user from among message candidates based on the text data and the user emotion information.

本発明の一態様は、コールセンタの利用者の通話装置から送信される音声データに基づいて、前記利用者の感情を表す利用者感情情報を取得する感情認識ステップと、前記音声データをテキストデータに変換する音声変換ステップと、前記テキストデータと前記利用者感情情報とに基づいて、利用者に応答するメッセージをメッセージ候補の中から選択する応答ステップと、をコンピュータに実行させるためのコンピュータプログラムである。 One aspect of the present invention is an emotion recognition step of acquiring user emotion information representing the user's emotion based on voice data transmitted from a call device of a call center user, and the voice data is converted into text data. A computer program for causing a computer to execute a voice conversion step for conversion, and a response step for selecting a message to respond to a user from message candidates based on the text data and the user emotion information. .

本発明により、コールセンタサービスの提供において、利用者の感情に応じた対応を、より少ない人的リソースで行うことが可能となる。 According to the present invention, in providing a call center service, it is possible to perform a response corresponding to a user's emotion with fewer human resources.

第１実施形態の音声自動応答システム１のシステム構成を示すシステム構成図。1 is a system configuration diagram showing a system configuration of an automatic voice response system 1 according to a first embodiment. 音声情報テーブル３１０の具体例を示す図。The figure which shows the specific example of the audio | voice information table 310. FIG. テキスト情報テーブル５１０の第１の具体例を示す図。The figure which shows the 1st specific example of the text information table 510. FIG. フロー管理テーブル７１０の具体例を示す図。The figure which shows the specific example of the flow management table 710. FIG. アクション管理テーブル７２０の具体例を示す図。The figure which shows the specific example of the action management table 720. FIG. 応答管理テーブル７３０の具体例を示す図。The figure which shows the specific example of the response management table 730. 自動応答装置６００の機能構成を示す機能ブロック図。The functional block diagram which shows the function structure of the automatic response apparatus 600. FIG. 音声自動応答システム１における音声自動応答の処理の流れを示すフローチャート。The flowchart which shows the flow of a process of the voice automatic response in the voice automatic response system. 音声自動応答システム１と利用者との通話の具体例を示す図。The figure which shows the specific example of the telephone call between the audio | voice automatic response system 1 and a user. 第２実施形態の音声自動応答システム１ａのシステム構成を示すシステム構成図。The system block diagram which shows the system configuration | structure of the audio | voice automatic response system 1a of 2nd Embodiment. 自動応答装置６００ａの機能構成を示す機能ブロック図。The functional block diagram which shows the function structure of the automatic response apparatus 600a. 応答履歴テーブル７４０の具体例を示す図。The figure which shows the specific example of the response log | history table 740. 第２実施形態の音声自動応答システム１ａにおけるテンプレート生成の処理の流れを示すフローチャート。The flowchart which shows the flow of a template production | generation process in the audio | voice automatic response system 1a of 2nd Embodiment. テキスト情報テーブル５１０の第２の具体例を示す図。The figure which shows the 2nd specific example of the text information table 510. FIG.

［第１実施形態］
図１は、第１実施形態の音声自動応答システム１のシステム構成を示すシステム構成図である。
電話機２−１〜２−３は、利用者がコールセンタ３−１〜３−３に発信するために使用する電話機である。電話機２−１〜２−３は、携帯電話やスマートフォンなどの無線通信によって通話発信する電話機であってもよいし、固定電話などの有線通信によって通話発信する電話機であってもよい。電話機２−１〜２−３はネットワーク４に接続され、ネットワーク４を介して音声自動応答システム１と通信する。以下の説明では、説明を簡単にするために、特に区別しない限り電話機２−１〜２−３を電話機２と記載する。なお、電話機２の数は、図１と異なる数であってもよい。 [First Embodiment]
FIG. 1 is a system configuration diagram showing a system configuration of an automatic voice response system 1 according to the first embodiment.
The telephones 2-1 to 2-3 are telephones used by the user to make calls to the call centers 3-1 to 3-3. The telephones 2-1 to 2-3 may be telephones that make a call by wireless communication such as a mobile phone or a smartphone, or may be telephones that make a call by wired communication such as a fixed telephone. The telephones 2-1 to 2-3 are connected to the network 4 and communicate with the automatic voice response system 1 via the network 4. In the following description, the telephone sets 2-1 to 2-3 are described as the telephone set 2 unless otherwise distinguished for the sake of simplicity. The number of telephones 2 may be different from that shown in FIG.

コールセンタ３−１〜３−３は、事業者が顧客への電話対応業務を専門に行うために設けられた事業所や施設である。コールセンタ３−１〜３−３は、同一の事業者が複数拠点に持つコールセンタであってもよいし、異なる事業者が運営するコールセンタであってもよい。以下の説明では、説明を簡単にするために、特に区別しない限りコールセンタ３−１〜３−３をコールセンタ３と記載する。コールセンタ３には、オペレータが配置される。オペレータとは、利用者に電話応対する人員である。なお、コールセンタ３の数は、図１と異なる数であってもよい。 The call centers 3-1 to 3-3 are offices and facilities provided for the business operators to specialize in the business of handling calls to customers. The call centers 3-1 to 3-3 may be call centers that the same business operator has at a plurality of locations, or may be call centers that are operated by different business operators. In the following description, in order to simplify the description, the call centers 3-1 to 3-3 are described as the call center 3 unless otherwise distinguished. An operator is arranged in the call center 3. The operator is a person who answers the user by telephone. Note that the number of call centers 3 may be different from that in FIG.

ネットワーク４は、電話機２が音声自動応答システム１に接続するためのネットワークである。ネットワーク４は、電話機２の通信方式によって複数種類のネットワークで構成されてもよい。例えば、ネットワーク４は携帯電話網であってもよいし、固定電話回線網であってもよいし、ＶｏＩＰ（Voice over Internet Protocol）網であってもよい。 The network 4 is a network for connecting the telephone 2 to the automatic voice response system 1. The network 4 may be composed of a plurality of types of networks depending on the communication method of the telephone 2. For example, the network 4 may be a mobile phone network, a fixed telephone line network, or a VoIP (Voice over Internet Protocol) network.

コールセンタ管理装置５−１〜５−３は、それぞれコールセンタ３−１〜３−３を管理及び制御する装置である。例えば、コールセンタ管理装置５−１〜５−３は、顧客管理システムなどの外部システムに接続され、コールセンタ３のオペレータに情報提供を行う。また、例えば、コールセンタ管理装置５−１〜５−３は、音声自動応答システム１の要求に応じて音声通信をコールセンタ３のオペレータに中継する。以下の説明では、説明を簡単にするために、特に区別しない限りコールセンタ管理装置５−１〜５−３をコールセンタ管理装置５と記載する。なお、コールセンタ管理装置５の数は、コールセンタ３の数に応じて、図１と異なる数であってもよい。また、コールセンタ管理装置５は、複数のコールセンタ３を管理及び制御してもよい。 The call center management devices 5-1 to 5-3 are devices that manage and control the call centers 3-1 to 3-3, respectively. For example, the call center management devices 5-1 to 5-3 are connected to an external system such as a customer management system and provide information to an operator of the call center 3. Further, for example, the call center management devices 5-1 to 5-3 relay voice communication to an operator of the call center 3 in response to a request from the voice automatic response system 1. In the following description, the call center management devices 5-1 to 5-3 are referred to as the call center management device 5 unless otherwise distinguished for the sake of simplicity. The number of call center management devices 5 may be different from that in FIG. 1 depending on the number of call centers 3. Further, the call center management device 5 may manage and control a plurality of call centers 3.

音声自動応答システム１は、各コールセンタ３と利用者の各電話機２との間にあって、利用者の通話発信に対して一次応答する。すなわち、音声自動応答システム１は、利用者の通話に対して音声による自動応答を行う機能をコールセンタ３に提供するシステムである。音声自動応答システム１は、利用者の通話を必要に応じてコールセンタ３に転送する。 The automatic voice response system 1 is between each call center 3 and each telephone 2 of the user, and makes a primary response to the user's call transmission. That is, the automatic voice response system 1 is a system that provides the call center 3 with a function of performing automatic voice response to a user's call. The automatic voice response system 1 transfers the user's call to the call center 3 as necessary.

音声自動応答システム１は、中継装置１００、感情認識装置２００、音声ＤＢ３００、音声変換装置４００、テキストＤＢ５００、自動応答装置６００及び応答ＤＢ７００を備える。
中継装置１００は、電話機２から送信された音声信号を受信する。中継装置１００は、受信した音声信号に含まれる利用者の音声とオペレータの音声とを分離する。中継装置１００は、音声信号を分離することによって取得された音声データを感情認識装置２００に中継する。また、中継装置１００は、自動応答装置６００の要求に応じて、利用者の通話をオペレータに転送する。中継装置１００は、オペレータの音声や、音声自動応答システム１の自動応答機能によって生成された音声データを電話機２に中継する。 The automatic voice response system 1 includes a relay device 100, an emotion recognition device 200, a voice DB 300, a voice conversion device 400, a text DB 500, an automatic response device 600, and a response DB 700.
The relay device 100 receives the audio signal transmitted from the telephone set 2. The relay device 100 separates the user's voice and the operator's voice included in the received voice signal. The relay device 100 relays voice data acquired by separating the voice signal to the emotion recognition device 200. Also, the relay device 100 transfers the user's call to the operator in response to a request from the automatic response device 600. The relay device 100 relays the voice of the operator and voice data generated by the automatic response function of the automatic voice response system 1 to the telephone set 2.

感情認識装置２００は、中継装置１００によって中継された音声データを受信する。感情認識装置２００は、受信した音声データを発話ごとに分割する。１つの発話は、ある人が発言を開始し、他者が発言を開始するまでに行った発言を表す。感情認識装置２００は、分割した音声データごとに、その音声データに含まれる音声を発したときの利用者の感情を認識する。音声データから話者の感情を認識する手法には、既存のどのような技術が用いられてもよい。感情認識装置２００は、音声データと、その音声データから認識された利用者の感情を表す情報（以下、「利用者感情情報」という。）とを対応づけて音声ＤＢ３００に登録する。 The emotion recognition device 200 receives the voice data relayed by the relay device 100. The emotion recognition device 200 divides the received voice data for each utterance. One utterance represents a utterance made by one person before the other person starts speaking. The emotion recognition device 200 recognizes the user's emotion when the voice included in the voice data is emitted for each divided voice data. Any existing technique may be used as a method for recognizing a speaker's emotion from voice data. Emotion recognition apparatus 200 registers voice data and information representing user emotion recognized from the voice data (hereinafter referred to as “user emotion information”) in voice DB 300 in association with each other.

音声ＤＢ３００は、音声データを蓄積するデータベースである。音声ＤＢ３００は、音声データを記憶するための音声情報テーブル３１０を保持する。音声ＤＢ３００は、音声データと利用者感情情報とを対応づけて記憶する。音声ＤＢ３００は、感情認識装置２００及び音声変換装置４００に接続される。音声情報テーブル３１０は感情認識装置２００によって登録される。 The voice DB 300 is a database that stores voice data. The voice DB 300 holds a voice information table 310 for storing voice data. The voice DB 300 stores voice data and user emotion information in association with each other. The voice DB 300 is connected to the emotion recognition device 200 and the voice conversion device 400. The voice information table 310 is registered by the emotion recognition device 200.

図２は、音声情報テーブル３１０の具体例を示す図である。
音声情報テーブル３１０は、音声ＩＤごとに音声情報レコードを有する。音声情報レコードは、音声ＩＤ、通話ＩＤ、通話区分、音声情報及び感情情報の各値を有する。音声ＩＤは、音声情報テーブル３１０に記憶される音声データの識別情報である。通話ＩＤは、音声ＩＤが示す通話の識別情報である。通話区分は、音声ＩＤが示す音声データの音声を発した主体を表す識別情報である。具体的には、通話区分は、利用者による通話か、オペレータによる通話か、音声自動応答システム１による自動応答かを示す情報である。音声情報は、音声ＩＤが示す音声データの実データである。感情情報は、音声ＩＤが示す音声データに基づいて認識された利用者感情情報である。音声情報レコードは、感情認識装置２００によって音声情報テーブル３１０に登録される。 FIG. 2 is a diagram showing a specific example of the audio information table 310.
The voice information table 310 has a voice information record for each voice ID. The voice information record has values of voice ID, call ID, call classification, voice information, and emotion information. The voice ID is identification information of voice data stored in the voice information table 310. The call ID is call identification information indicated by the voice ID. The call category is identification information that represents the subject that has produced the voice of the voice data indicated by the voice ID. Specifically, the call classification is information indicating whether the call is a call by a user, a call by an operator, or an automatic response by the voice automatic response system 1. The voice information is actual data of the voice data indicated by the voice ID. Emotion information is user emotion information recognized based on voice data indicated by a voice ID. The voice information record is registered in the voice information table 310 by the emotion recognition device 200.

図１の説明に戻る。
音声変換装置４００は、音声ＤＢ３００の音声情報テーブル３１０を参照し、音声データを取得する。音声変換装置４００は、取得した音声データに含まれる音声をテキストデータに変換する。音声データをテキストデータに変換する手法には、既存のどのような技術が用いられてもよい。 Returning to the description of FIG.
The voice conversion device 400 refers to the voice information table 310 of the voice DB 300 and acquires voice data. The voice conversion device 400 converts voice included in the acquired voice data into text data. Any existing technique may be used as a method for converting voice data into text data.

音声変換装置４００は、取得したテキストデータからコールセンタ３が提供するサービスに関連するキーワードを抽出する。テキストデータからキーワードを抽出する手法には、既存のどのような技術が用いられてもよい。例えば、音声変換装置４００は、コールセンタ３を運用する事業者の事業や業種に関する情報などを辞書情報として予め保持し、辞書情報との関連性が強い語をキーワードとして抽出してもよい。また、例えば、音声変換装置４００は、蓄積されたテキストデータに統計処理を行うことによって、辞書情報を自ら学習するように構成されてもよい。音声変換装置４００は、音声データの識別情報と、音声データから取得したキーワードとを対応づけてテキストＤＢ５００に登録する。 The voice conversion device 400 extracts keywords related to the service provided by the call center 3 from the acquired text data. Any existing technique may be used as a method for extracting a keyword from text data. For example, the voice conversion device 400 may hold information relating to the business or type of business of the business operator who operates the call center 3 in advance as dictionary information, and may extract words that are strongly related to the dictionary information as keywords. Further, for example, the voice conversion device 400 may be configured to learn dictionary information by performing statistical processing on the accumulated text data. The speech conversion apparatus 400 registers the identification information of the speech data and the keyword acquired from the speech data in the text DB 500 in association with each other.

テキストＤＢ５００は、音声データから取得されたテキストデータを蓄積するデータベースである。テキストＤＢ５００は、音声データごとのテキストデータを記憶するためのテキスト情報テーブル５１０を保持する。テキストＤＢ５００は、音声データと、音声データから取得されたテキストデータとを対応づけてテキスト情報テーブル５１０に記憶する。テキストＤＢ５００は、音声変換装置４００及び自動応答装置６００に接続される。テキスト情報テーブル５１０は音声変換装置４００によって登録される。 The text DB 500 is a database that accumulates text data acquired from voice data. The text DB 500 holds a text information table 510 for storing text data for each voice data. The text DB 500 stores voice data and text data acquired from the voice data in the text information table 510 in association with each other. The text DB 500 is connected to the voice conversion device 400 and the automatic response device 600. The text information table 510 is registered by the voice conversion device 400.

図３は、テキスト情報テーブル５１０の第１の具体例を示す図である。
テキスト情報テーブル５１０は、音声ＩＤごとにテキスト情報レコードを有する。テキスト情報レコードは、音声ＩＤ、通話ＩＤ、通話区分、テキスト情報、キーワード及び感情情報の各値を有する。音声ＩＤ、通話ＩＤ、通話区分及び感情情報は、音声情報テーブル３１０の音声情報レコードが有する音声ＩＤと同様である。テキスト情報は、音声ＩＤが示す音声データが変換されたテキストデータである。キーワードは、テキスト情報が示すテキストデータから、音声変換装置４００によって抽出されたキーワードを表す。テキスト情報レコードは、音声変換装置４００によってテキスト情報テーブル５１０に登録される。 FIG. 3 is a diagram showing a first specific example of the text information table 510.
The text information table 510 has a text information record for each voice ID. The text information record has each value of voice ID, call ID, call classification, text information, keyword, and emotion information. The voice ID, the call ID, the call classification, and the emotion information are the same as the voice ID included in the voice information record of the voice information table 310. The text information is text data obtained by converting the voice data indicated by the voice ID. The keyword represents a keyword extracted by the speech conversion device 400 from text data indicated by the text information. The text information record is registered in the text information table 510 by the voice conversion device 400.

図１の説明に戻る。
自動応答装置６００は、音声データから取得されたキーワード及び利用者感情情報に基づいて、利用者に応答するメッセージを選択する。具体的には、自動応答装置６００は、キーワード及び利用者感情情報に基づいて、応答ＤＢ７００が記憶する応答メッセージのテンプレートの中から、利用者に応答するメッセージを取得する。自動応答装置６００は、取得した応答メッセージを音声データに変換する。自動応答装置６００は、応答メッセージの変換によって生成された音声データを利用者の電話機２に送信する。なお、自動応答装置６００は、応答メッセージを示すテキストデータを取得する代わりに、予めに変換された音声データを取得するように構成されてもよい。
なお、自動応答装置６００は、利用者への応答に用いるメッセージを示すテキストデータが取得できない場合は、利用者の通話を発信先のコールセンタ３のオペレータに転送するように中継装置１００に要求する。 Returning to the description of FIG.
The automatic response device 600 selects a message to respond to the user based on the keyword and user emotion information acquired from the voice data. Specifically, the automatic response device 600 acquires a message that responds to the user from the response message template stored in the response DB 700 based on the keyword and the user emotion information. The automatic response device 600 converts the acquired response message into voice data. The automatic response device 600 transmits the voice data generated by the conversion of the response message to the user's telephone 2. Note that the automatic response device 600 may be configured to acquire voice data converted in advance, instead of acquiring text data indicating a response message.
Note that the automatic answering device 600 requests the relay device 100 to transfer the user's call to the operator of the call center 3 that is the destination when the text data indicating the message used for the response to the user cannot be acquired.

応答ＤＢ７００は、自動応答装置６００が自動応答するために必要となる情報を保持するデータベースである。例えば、応答ＤＢ７００は、フロー管理テーブル７１０、アクション管理テーブル７２０及び応答管理テーブル７３０を保持する。応答ＤＢ７００は、自動応答装置６００に接続される。応答管理テーブル７３０は自動応答装置６００によって更新される。 The response DB 700 is a database that holds information necessary for the automatic response device 600 to respond automatically. For example, the response DB 700 holds a flow management table 710, an action management table 720, and a response management table 730. The response DB 700 is connected to the automatic response device 600. The response management table 730 is updated by the automatic response device 600.

図４は、フロー管理テーブル７１０の具体例を示す図である。
フロー管理テーブル７１０は、音声自動応答システム１によって利用者に提供されるサービスが定義されたテーブルである。フロー管理テーブル７１０は、フローＩＤごとにフロー管理レコードを有する。フロー管理レコードは、フローＩＤ、関連情報及びサービス情報の各値を有する。フローＩＤは、サービスの識別情報であり、音声自動応答システム１が利用者に提供するサービスに対応する。関連情報は、フローＩＤが示すサービスに関連するテキストデータを値に持つ。サービス情報は、フローＩＤが示すサービスの内容を表す。フロー管理レコードは、音声自動応答システム１の管理者によって予めフロー管理テーブル７１０に登録される。 FIG. 4 is a diagram illustrating a specific example of the flow management table 710.
The flow management table 710 is a table in which services provided to the user by the automatic voice response system 1 are defined. The flow management table 710 has a flow management record for each flow ID. The flow management record has values of flow ID, related information, and service information. The flow ID is service identification information and corresponds to a service provided to the user by the automatic voice response system 1. The related information has text data related to the service indicated by the flow ID as a value. The service information represents the content of the service indicated by the flow ID. The flow management record is registered in advance in the flow management table 710 by the administrator of the automatic voice response system 1.

図５は、アクション管理テーブル７２０の具体例を示す図である。
アクション管理テーブル７２０は、利用者に提供されるサービスごとに、自動応答装置６００が行うべき応答の流れが定義されたテーブルである。以下では、サービスごとに定義された個々の応答をアクションと呼ぶ。アクション管理テーブル７２０は、アクションＩＤごとにアクション管理レコードを有する。アクション管理レコードは、フローＩＤ、アクションＩＤ、アクション情報及び実行フラグの各値を有する。フローＩＤは、フロー管理テーブル７１０のフロー管理レコードが有するフローＩＤと同様である。アクションＩＤは、自動応答装置６００が選択されたサービスにおいて実行するアクションの識別情報である。アクション情報は、アクションＩＤが示すアクションの内容を表す。実行フラグは、選択されたサービスにおいて、自動応答装置６００が次に実行するアクションを示す情報である。 FIG. 5 is a diagram illustrating a specific example of the action management table 720.
The action management table 720 is a table in which a response flow to be performed by the automatic response device 600 is defined for each service provided to the user. Below, each response defined for each service is called an action. The action management table 720 has an action management record for each action ID. The action management record has values of flow ID, action ID, action information, and execution flag. The flow ID is the same as the flow ID that the flow management record of the flow management table 710 has. The action ID is identification information of an action executed by the automatic response device 600 in the selected service. The action information represents the content of the action indicated by the action ID. The execution flag is information indicating an action to be executed next by the automatic response device 600 in the selected service.

なお、アクション管理レコードは、音声自動応答システム１の管理者によって予めマスタとなるアクション管理テーブル７２０に登録される。そして、アクション管理テーブル７２０は、利用者に提供されるサービスごとに実行フラグの値が初期化された状態で生成される。例えば、図５の例は、図４におけるフローＩＤの値“Ｆ００１”が示すサービスに対して生成されたアクション管理テーブル７２０を表す。 The action management record is registered in advance in the action management table 720 as a master by the administrator of the automatic voice response system 1. The action management table 720 is generated with the execution flag value initialized for each service provided to the user. For example, the example of FIG. 5 represents the action management table 720 generated for the service indicated by the flow ID value “F001” in FIG.

図６は、応答管理テーブル７３０の具体例を示す図である。
応答管理テーブル７３０は、アクション管理テーブル７２０に登録されたアクションＩＤごとに応答するメッセージのテンプレート（メッセージ候補の一例）を記憶するテーブルである。応答管理テーブル７３０は、応答ＩＤごとに応答履歴レコードを有する。応答履歴レコードは、応答ＩＤ、アクションＩＤ、アクション情報、応答情報及び感情情報の各値を有する。応答ＩＤは、応答履歴レコードの識別情報である。アクションＩＤは、アクション管理テーブル７２０のアクション管理レコードが有するアクションＩＤと同様である。アクション情報は、アクション管理テーブル７２０のアクション管理レコードが有するアクション情報と同様である。応答情報は、応答ＩＤが示す応答メッセージを表す。すなわち、応答情報は、アクションＩＤが示すアクションに用いられる音声データを生成するためのテンプレートとなるテキストデータである。感情情報は、音声情報テーブル３１０の音声情報レコード及びテキスト情報テーブル５１０のテキスト情報レコードが有する感情情報と同様である。応答履歴レコードは、音声自動応答システム１の管理者によって予め応答管理テーブル７３０に登録される。 FIG. 6 is a diagram illustrating a specific example of the response management table 730.
The response management table 730 stores a message template (an example of a message candidate) that responds for each action ID registered in the action management table 720. The response management table 730 has a response history record for each response ID. The response history record has each value of response ID, action ID, action information, response information, and emotion information. The response ID is identification information of a response history record. The action ID is the same as the action ID that the action management record of the action management table 720 has. The action information is the same as the action information included in the action management record of the action management table 720. The response information represents a response message indicated by the response ID. That is, the response information is text data serving as a template for generating voice data used for the action indicated by the action ID. The emotion information is the same as the emotion information that the voice information record of the voice information table 310 and the text information record of the text information table 510 have. The response history record is registered in advance in the response management table 730 by the administrator of the automatic voice response system 1.

図７は、自動応答装置６００の機能構成を示す機能ブロック図である。
自動応答装置６００は、バスで接続されたＣＰＵ（Central Processing Unit）やメモリや補助記憶装置などを備え、自動応答プログラムを実行する。自動応答装置６００は、自動応答プログラムの実行によって通信部６１０、応答判断部６２０及び応答生成部６３０（応答部の一例）を備える装置として機能する。
なお、自動応答装置６００の各機能の全て又は一部は、ＡＳＩＣ（Application Specific Integrated Circuit）やＰＬＤ（Programmable Logic Device）やＦＰＧＡ（Field Programmable Gate Array）等のハードウェアを用いて実現されてもよい。自動応答プログラムは、コンピュータ読み取り可能な記録媒体に記録されてもよい。コンピュータ読み取り可能な記録媒体とは、例えばフレキシブルディスク、光磁気ディスク、ＲＯＭ、ＣＤ−ＲＯＭ等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶装置である。自動応答プログラムは、電気通信回線を介して送信されてもよい。 FIG. 7 is a functional block diagram showing a functional configuration of the automatic response device 600.
The automatic response device 600 includes a CPU (Central Processing Unit), a memory, an auxiliary storage device, and the like connected by a bus, and executes an automatic response program. The automatic response device 600 functions as a device including a communication unit 610, a response determination unit 620, and a response generation unit 630 (an example of a response unit) by executing an automatic response program.
All or some of the functions of the automatic response device 600 may be realized using hardware such as an application specific integrated circuit (ASIC), a programmable logic device (PLD), or a field programmable gate array (FPGA). . The automatic response program may be recorded on a computer-readable recording medium. The computer-readable recording medium is, for example, a portable medium such as a flexible disk, a magneto-optical disk, a ROM, a CD-ROM, or a storage device such as a hard disk built in the computer system. The automatic response program may be transmitted via a telecommunication line.

通信部６１０は、ＬＡＮ（Local Area Network）等の通信インターフェースを用いて構成される。通信部６１０は、中継装置１００、感情認識装置２００、音声変換装置４００及びコールセンタ管理装置５との間で通信する。 The communication unit 610 is configured using a communication interface such as a LAN (Local Area Network). The communication unit 610 communicates with the relay device 100, the emotion recognition device 200, the voice conversion device 400, and the call center management device 5.

応答判断部６２０は、音声データから取得されたキーワードに基づいて、利用者に応答すべき応答の種別を判断する。例えば、応答判断部６２０は、利用者が希望するサービスを判断する。応答判断部６２０は、キーワードに基づいて判断された応答の種別を表す情報（以下、「応答種別情報」という。）を応答生成部６３０に出力する。 The response determination unit 620 determines the type of response to be answered to the user based on the keyword acquired from the voice data. For example, the response determination unit 620 determines a service desired by the user. The response determination unit 620 outputs information indicating the type of response determined based on the keyword (hereinafter referred to as “response type information”) to the response generation unit 630.

応答生成部６３０は、応答判断部６２０から出力された応答種別情報に基づいて、応答するアクションを表す情報（以下、「応答内容情報」という。）を取得する。応答生成部６３０は、応答内容情報と、利用者感情情報とに基づいて、利用者への応答に用いる応答メッセージを応答ＤＢ７００から取得する。例えば、応答生成部６３０は、応答判断部６２０から出力された応答種別情報に基づいて、アクション管理テーブル７２０から応答するアクションを選択する。応答判断部６２０は、選択したアクションの識別情報を応答内容情報として取得する。応答生成部６３０は、応答内容情報と、利用者感情情報とに基づいて、応答管理テーブル７３０から利用者に応答するメッセージのテキストデータを取得する。応答生成部６３０は、取得したテキストデータを音声データに変換する。応答生成部６３０は、音声データを利用者の電話機２に送信する。 Based on the response type information output from the response determination unit 620, the response generation unit 630 acquires information indicating an action to respond (hereinafter referred to as “response content information”). The response generation unit 630 acquires a response message used for a response to the user from the response DB 700 based on the response content information and the user emotion information. For example, the response generation unit 630 selects an action to respond from the action management table 720 based on the response type information output from the response determination unit 620. The response determination unit 620 acquires the identification information of the selected action as response content information. The response generation unit 630 acquires text data of a message that responds to the user from the response management table 730 based on the response content information and the user emotion information. The response generation unit 630 converts the acquired text data into voice data. The response generation unit 630 transmits voice data to the user's telephone 2.

図８は、音声自動応答システム１における音声自動応答の処理の流れを示すフローチャートである。
まず、利用者が電話機２を操作してコールセンタ３に発信する。中継装置１００は、電話機２の発信を着信する。中継装置１００は着信した通話から音声データを取得する。中継装置１００は、音声データを感情認識装置２００に中継する。感情認識装置２００は、中継された音声データを取得する（ステップＳ１０１）。感情認識装置２００は、音声データの識別情報となる音声ＩＤを生成する。また、感情認識装置２００は、中継された音声データの通話の識別情報となる通話ＩＤを生成する。 FIG. 8 is a flowchart showing the flow of automatic voice response processing in the automatic voice response system 1.
First, a user operates the telephone 2 to make a call to the call center 3. Relay device 100 receives a call from telephone 2. The relay device 100 acquires voice data from the incoming call. The relay device 100 relays voice data to the emotion recognition device 200. The emotion recognition apparatus 200 acquires the relayed voice data (step S101). The emotion recognition device 200 generates a voice ID that is identification information of the voice data. In addition, the emotion recognition device 200 generates a call ID that is identification information of the call of the relayed voice data.

例えば、通話ＩＤは、発信元の電話機２の電話番号や時刻などの情報に基づいて生成されてもよい。感情認識装置２００は、音声データから音声データに含まれる音声を発したときの利用者の感情を認識する（ステップＳ１０２）。感情認識装置２００は、音声ＩＤと、通話ＩＤと、音声データと、利用者感情情報とを音声ＤＢ３００に出力する。音声ＤＢ３００は、感情認識装置２００から出力された情報を値に持つ音声情報レコードを生成する。音声ＤＢ３００は生成した音声情報レコードを音声情報テーブル３１０に登録する。感情認識装置２００は、新規の音声情報レコードを登録したことを音声変換装置４００に通知する。 For example, the call ID may be generated based on information such as the telephone number and time of the caller telephone 2. The emotion recognition device 200 recognizes the user's emotion when the voice included in the voice data is emitted from the voice data (step S102). Emotion recognition device 200 outputs voice ID, call ID, voice data, and user emotion information to voice DB 300. The voice DB 300 generates a voice information record having information output from the emotion recognition device 200 as a value. The voice DB 300 registers the generated voice information record in the voice information table 310. The emotion recognition device 200 notifies the speech conversion device 400 that a new speech information record has been registered.

音声変換装置４００は、感情認識装置２００によって新規の音声情報レコードが登録されたことの通知を受けると、音声情報テーブル３１０を参照し、新規登録された音声情報レコードの各値を取得する。音声変換装置４００は、取得した音声データに含まれる音声をテキストデータに変換する。音声変換装置４００は、取得したテキストデータからキーワードを抽出する。そして、音声変換装置４００は、音声情報レコードから取得した音声ＩＤの値と、音声情報レコードから取得した感情情報の値と、音声データから取得したテキストデータ及びキーワードをテキストＤＢ５００に出力する。テキストＤＢ５００は、音声変換装置４００から出力された情報を値に持つテキスト情報レコードを生成する。テキストＤＢ５００は、生成したテキスト情報レコードをテキスト情報テーブル５１０に登録する。音声変換装置４００は、新規のテキスト情報レコードを登録したことを自動応答装置６００に通知する。 When receiving the notification that a new voice information record is registered by the emotion recognition apparatus 200, the voice conversion device 400 refers to the voice information table 310 and acquires each value of the newly registered voice information record. The voice conversion device 400 converts voice included in the acquired voice data into text data. The voice conversion device 400 extracts keywords from the acquired text data. Then, the voice conversion device 400 outputs the value of the voice ID acquired from the voice information record, the value of emotion information acquired from the voice information record, and the text data and keywords acquired from the voice data to the text DB 500. The text DB 500 generates a text information record having information output from the speech conversion device 400 as a value. The text DB 500 registers the generated text information record in the text information table 510. The voice conversion device 400 notifies the automatic response device 600 that a new text information record has been registered.

自動応答装置６００の応答判断部６２０は、音声変換装置４００によって新規のテキスト情報レコードが登録されたことの通知を受けると、テキスト情報テーブル５１０を参照し、新規登録されたテキスト情報レコードの各値を取得する。応答判断部６２０は、フロー管理テーブル７１０を参照し、取得したテキスト情報の値と関連性が強いテキストデータを関連情報の値に持つフロー管理レコードを選択する。応答判断部６２０は、選択したフロー管理レコードからフローＩＤの値を取得する。応答判断部６２０は、取得したフローＩＤの値を応答種別情報として応答生成部６３０に出力する。このように、応答判断部６２０は、音声データから取得したキーワードに基づいて、利用者が希望するサービスを判断する（ステップＳ１０３）。 When receiving a notification that a new text information record has been registered by the voice conversion device 400, the response determination unit 620 of the automatic response device 600 refers to the text information table 510, and each value of the newly registered text information record. To get. The response determination unit 620 refers to the flow management table 710 and selects a flow management record that has text data that is strongly related to the acquired text information value as the related information value. The response determination unit 620 acquires the value of the flow ID from the selected flow management record. The response determination unit 620 outputs the acquired flow ID value to the response generation unit 630 as response type information. As described above, the response determination unit 620 determines the service desired by the user based on the keyword acquired from the voice data (step S103).

次に、応答生成部６３０は、応答判断部６２０から出力された応答種別情報が示すサービスに対応するアクション管理テーブル７２０を選択する。応答生成部６３０は、選択したアクション管理テーブル７２０から実行フラグの値を取得する。応答生成部６３０は、実行フラグの値に基づいて、選択されたサービスにおいて次に実行すべきアクションを判断する（ステップＳ１０４）。 Next, the response generation unit 630 selects the action management table 720 corresponding to the service indicated by the response type information output from the response determination unit 620. The response generation unit 630 acquires the value of the execution flag from the selected action management table 720. Based on the value of the execution flag, the response generation unit 630 determines an action to be executed next in the selected service (step S104).

次に、応答生成部６３０は、応答管理テーブル７３０を参照し、ステップＳ１０４において判断されたアクションを示す応答内容情報をアクションＩＤの値に持つ応答管理レコードを選択する。さらに、応答生成部６３０は、選択した応答管理レコードの中から、ステップＳ１０２において判断された利用者感情情報を感情情報の値に持つ応答管理レコードを選択する。応答生成部６３０は、選択した応答管理レコードから応答情報の値を取得する。応答生成部６３０は、取得した応答情報の値を音声データに変換する。応答生成部６３０は、音声データを発信元の利用者の電話機２に送信する。このように、応答生成部６３０は、利用者の感情に基づいて選択された応答メッセージを利用者に応答する（ステップＳ１０５）。 Next, the response generation unit 630 refers to the response management table 730 and selects a response management record having response content information indicating the action determined in step S104 as an action ID value. Furthermore, the response generation unit 630 selects, from the selected response management records, a response management record having the user emotion information determined in step S102 as the emotion information value. The response generation unit 630 acquires the value of response information from the selected response management record. The response generation unit 630 converts the acquired response information value into audio data. The response generation unit 630 transmits the voice data to the telephone 2 of the caller user. As described above, the response generation unit 630 responds to the user with a response message selected based on the user's emotion (step S105).

次に、応答生成部６３０は、選択されたサービスにおけるアクションのフローにおいて、全てのアクションが行われたか否かを判定する（ステップＳ１０６）。実行していないアクションが存在する場合（ステップＳ１０６−ＮＯ）、応答判断部６２０はステップＳ１０１に処理を移し、利用者との通話を継続する。一方、全ての応答が行われた場合（ステップＳ１０６−ＹＥＳ）、応答判断部６２０は、利用者との通話を終了する。 Next, the response generation unit 630 determines whether or not all actions have been performed in the action flow in the selected service (step S106). When there is an action that has not been executed (step S106: NO), the response determination unit 620 moves the process to step S101 and continues the call with the user. On the other hand, when all responses have been made (step S106—YES), the response determination unit 620 ends the call with the user.

図９は、音声自動応答システム１と利用者との通話の具体例を示す図である。
図９において、通話１、３、５、７及び通話９に示された通話は、音声自動応答システム１に対して利用者が行った通話を表す。また、通話２、４、６及び通話８に示された通話は、利用者に対する音声自動応答システム１の応答を表す。図９に示された通話は、通話１から通話９の順に、利用者と音声自動応答システム１との間でやり取りされる。なお、利用者の通話における（）内の記載は、その通話を行ったときの利用者の感情を表す。 FIG. 9 is a diagram illustrating a specific example of a call between the automatic voice response system 1 and a user.
In FIG. 9, the calls shown in the calls 1, 3, 5, 7 and the call 9 represent calls made by the user to the automatic voice response system 1. Calls shown as calls 2, 4, 6 and 8 represent responses of the automatic voice response system 1 to the user. The call shown in FIG. 9 is exchanged between the user and the automatic voice response system 1 in the order of the call 1 to the call 9. Note that the description in parentheses in the user's call represents the user's feelings when the call is made.

利用者の通話が図９の例に示された内容である場合、例えば、図２に示された内容が感情認識装置２００によって音声情報テーブル３１０に登録される。音声変換装置４００は、音声情報レコードの音声情報として保持された音声データをテキストデータに変換する。音声変換装置４００は、音声データが変換されたテキストデータからキーワードを抽出する。その結果、例えば、図３に示された内容が音声変換装置４００によってテキスト情報テーブル５１０に登録される。 When the user's call has the contents shown in the example of FIG. 9, for example, the contents shown in FIG. 2 are registered in the voice information table 310 by the emotion recognition device 200. The voice conversion device 400 converts voice data held as voice information of the voice information record into text data. The voice conversion device 400 extracts keywords from text data obtained by converting voice data. As a result, for example, the contents shown in FIG. 3 are registered in the text information table 510 by the voice conversion device 400.

フロー管理テーブル７１０が図４に示された内容で登録されている場合、自動応答装置６００の応答判断部６２０は、例えば、“Ｖ００１”を音声ＩＤの値に持つ音声データに基づいて、利用者が“クレジットカードの利用再開“を希望していることを認識する。また、自動応答装置６００は、テキスト情報テーブル５１０を参照し、“Ｖ００１”を音声ＩＤの値に持つテキスト情報レコードが持つ感情情報の値に基づいて、その通話における利用者の感情が“怒り”であることを認識する。そして、アクション管理テーブル７２０が図５に示された内容で登録されている場合、自動応答装置６００は、“Ａ００１”をアクションＩＤの値に持つアクション管理レコードが持つアクション情報の値“顧客情報の確認”が選択されたサービスに対して実行すべきアクションであることを認識する。 When the flow management table 710 is registered with the contents shown in FIG. 4, the response determination unit 620 of the automatic response device 600 uses, for example, the voice data having “V001” as the voice ID value. Recognizes that they want to “restart credit card usage”. Further, the automatic response device 600 refers to the text information table 510, and the emotion of the user in the call is “angry” based on the emotion information value of the text information record having “V001” as the voice ID value. Recognize that. When the action management table 720 is registered with the content shown in FIG. 5, the automatic response device 600 determines that the action information value “customer information value” of the action management record having “A001” as the action ID value. Recognize that “confirm” is the action to be performed on the selected service.

そして、応答管理テーブル７３０が図６に示された内容で登録されている場合、自動応答装置６００は、“Ａ００１”をアクションＩＤの値に持ち、“怒り”を感情情報の値に持つ応答履歴レコード、すなわち“Ｒ００１”を応答ＩＤの値に持つ応答履歴レコードの応答情報の値が、利用者に応答するメッセージであることを認識する。このとき、利用者の感情が“平常”であったと仮定すると、“Ｒ００２”を応答ＩＤの値に持つ応答履歴レコードの応答情報の値が利用者に応答するメッセージとして選択される。すなわち、同じ内容のメッセージであっても、そのときの利用者の感情に適したメッセージが選択される。このように、自動応答装置６００は、利用者の感情に応じて、利用者に応答するメッセージを選択する。 When the response management table 730 is registered with the contents shown in FIG. 6, the automatic response device 600 has a response history having “A001” as the action ID value and “anger” as the emotion information value. It is recognized that the value of the response information of the response history record having the record, that is, “R001” as the response ID value is a message responding to the user. At this time, assuming that the emotion of the user is “normal”, the value of the response information in the response history record having “R002” as the response ID value is selected as a message to respond to the user. That is, even if the message has the same content, a message suitable for the user's emotion at that time is selected. Thus, the automatic response device 600 selects a message to respond to the user according to the user's emotion.

このように構成された第１実施形態の音声自動応答システム１では、自動応答装置６００は、音声データから取得された利用者の感情を表す情報に基づいて、利用者に応答するメッセージを選択する。自動応答装置６００を備える音声自動応答システム１は、例えば、“怒り”の感情を持つ利用者にはより丁寧な言葉づかいで応答し、“焦り”の感情を持つ利用者にはより短い言葉で応答することが可能となる。
そのため、実施形態の音声自動応答システム１によって、コールセンタ３の事業者は、コールセンタサービスの提供において、利用者の感情に応じた対応をより少ない人的リソースで行うことが可能となる。 In the automatic voice response system 1 according to the first embodiment configured as described above, the automatic response device 600 selects a message to respond to the user based on information representing the emotion of the user acquired from the voice data. . The automatic voice response system 1 including the automatic response device 600 responds with a more polite wording to a user having an emotion of “anger” and responds with a shorter word to a user having an emotion of “impression”, for example. It becomes possible to do.
Therefore, the voice automatic response system 1 according to the embodiment enables the service provider of the call center 3 to perform a response corresponding to the emotion of the user with less human resources in providing the call center service.

また、第１実施形態の自動応答装置６００は、音声データから抽出されたキーワードに基づいて、利用者が希望するサービスを認識し、サービスに応じた応答メッセージを生成する。そのため、自動応答装置６００を備える音声自動応答システム１は、情報伝達のための利用者の入力操作の負担を軽減し、利用者との情報のやり取りを円滑に行うことが可能となる。 In addition, the automatic response device 600 according to the first embodiment recognizes a service desired by a user based on a keyword extracted from voice data, and generates a response message corresponding to the service. Therefore, the automatic voice response system 1 including the automatic response device 600 can reduce the burden of the user's input operation for information transmission, and can smoothly exchange information with the user.

第１実施形態の音声自動応答システム１が上記の自動応答装置６００を備えることによって、コールセンタ３の事業者は、夜間などのオペレータがいない状況であっても、利用者にコールセンタのサービスを提供することが可能となる。また、夜間対応が可能なコールセンタであっても、夜間に配置されるオペレータの人員は昼間よりも少ない場合がある。その結果、コールセンタの夜間の利用において、利用者がサービスを受けるまでの待ち時間が長くなる可能性があった。また、日中の利用であってもコールセンタが混雑している状況においては、利用者は時間をおいて電話をかけ直し、混雑が解消されるまで待つ必要があった。第１実施形態の音声自動応答システム１が上記の自動応答装置６００を備えることによって、コールセンタ３の事業者は、より短い待ち時間で利用者にサービスを提供することが可能となる。 By providing the automatic answering system 1 of the first embodiment with the automatic answering device 600 described above, the service provider of the call center 3 provides the call center service to the user even in the absence of an operator such as at night. It becomes possible. Even in a call center capable of handling nighttime, there are cases where the number of operators arranged at night is less than in daytime. As a result, when the call center is used at night, there is a possibility that the waiting time until the user receives the service becomes long. Further, even in daytime use, in a situation where the call center is congested, the user has to make a call again after a while and wait until the congestion is resolved. When the automatic voice response system 1 according to the first embodiment includes the automatic response device 600 described above, the operator of the call center 3 can provide a service to the user with a shorter waiting time.

また、第１実施形態の音声自動応答システム１は、コールセンタが混雑している状況において実施形態による自動応答を行うことによって、オペレータによる応対が可能となるまでの待ち時間を活用し利用者から情報収集を行うように構成されてもよい。このように収集された情報が応対可能となったオペレータに引き継がれることによって、第１実施形態の音声自動応答システム１では、オペレータの応対時間を短縮することが可能となる。 Further, the automatic voice response system 1 of the first embodiment makes use of the waiting time until an operator can respond by performing an automatic response according to the embodiment in a situation where the call center is congested. It may be configured to collect. The information collected in this way is taken over by an operator who can respond, so that in the automatic voice response system 1 of the first embodiment, it is possible to shorten the operator's response time.

＜変形例＞
音声データの実データが音声情報テーブル３１０以外に保持される場合、音声情報テーブル３１０に登録される音声情報の値は、音声データの実データの格納場所を示す情報であってもよい。
自動応答装置６００において、応答判断部６２０及び応答生成部６３０は、両方の機能を持つ１つの機能部として構成されてもよい。 <Modification>
When the actual data of the audio data is stored other than the audio information table 310, the value of the audio information registered in the audio information table 310 may be information indicating the storage location of the actual data of the audio data.
In the automatic response device 600, the response determination unit 620 and the response generation unit 630 may be configured as one functional unit having both functions.

応答判断部６２０が出力する応答種別情報は、利用者への応答における次のアクション（応答内容情報）を取得する以外の用途に用いられてもよい。本実施形態では、応答種別情報は、利用者に提供されるサービスの種別を表す。そのため、例えば、自動応答装置６００は、カウンタ機能を備えることによって、サービスごとの利用回数を取得するように構成されてもよい。自動応答装置６００がこのように構成されることによって、コールセンタ３の事業者は、サービスの利用回数の統計を分析してサービス改善の活動につなげることができる。また、上記の自動応答装置６００は、さらに通話ＩＤを用いることによって、利用者ごとにサービスの利用回数を取得するように構成されてもよい。自動応答装置６００がこのように構成されることによって、コールセンタ３の事業者は、利用者ごとのサービスの利用回数の統計を分析して、例えば不正利用の監視などを行うことができる。 The response type information output by the response determination unit 620 may be used for purposes other than acquiring the next action (response content information) in response to the user. In the present embodiment, the response type information represents the type of service provided to the user. Therefore, for example, the automatic response device 600 may be configured to acquire the usage count for each service by providing a counter function. By configuring the automatic answering apparatus 600 in this way, the service provider of the call center 3 can analyze the statistics of the number of times of using the service and connect it to service improvement activities. In addition, the automatic answering device 600 may be configured to acquire the number of times the service is used for each user by using the call ID. By configuring the automatic answering apparatus 600 in this way, the service provider of the call center 3 can analyze the statistics of the number of times the service is used for each user and monitor, for example, unauthorized use.

自動応答装置６００は、ステップＳ１０５において、利用者への応答に用いる応答メッセージが取得できなかった場合、過去にオペレータが他の利用者に応答した音声を含む音声データを送信することによって、利用者への応答を行ってもよい。例えば、この場合、自動応答装置６００は、新規登録されたテキスト情報レコードのキーワードの値と関連性が強いテキスト情報の値を持つテキスト情報レコードを選択してもよい。自動応答装置６００は、選択されたテキスト情報レコードからテキスト情報の値を取得する。自動応答装置６００は、取得したテキスト情報の値を音声データに変換する。自動応答装置６００は、変換された音声データを利用者の電話機２に送信する。 In step S105, when the response message used for the response to the user cannot be acquired, the automatic response device 600 transmits the voice data including the voice that the operator has responded to the other user in the past. A response may be made. For example, in this case, the automatic response device 600 may select a text information record having a text information value that is strongly related to the keyword value of the newly registered text information record. The automatic response device 600 acquires the value of the text information from the selected text information record. The automatic response device 600 converts the value of the acquired text information into voice data. The automatic response device 600 transmits the converted voice data to the telephone 2 of the user.

なお、過去にオペレータが他の利用者に応答した音声を含む音声データに、名前や住所、クレジットカード番号などの個人情報が含まれる場合、自動応答装置６００は、これらの個人情報を除外するか、応答の対象となる利用者の個人情報に置き換えて音声データを送信するように構成されてもよい。また、過去にオペレータが他の利用者に応答した音声を含む音声データに個人情報が含まれる場合、自動応答装置６００は自動応答を行わず、利用者の通話をコールセンタ３のオペレータに転送することを中継装置１００に要求するように構成されてもよい。 When personal data such as name, address, and credit card number is included in voice data including voices that the operator has responded to other users in the past, the automatic response device 600 excludes such personal information. The voice data may be transmitted in place of the personal information of the user to be responded to. In addition, when personal information is included in voice data including voices that the operator has responded to other users in the past, the automatic answering device 600 does not automatically answer but transfers the user's call to the operator of the call center 3. To the relay device 100.

自動応答装置６００は、通話開始時の利用者の感情が“怒り”の状態にある場合や、自動応答によって利用者の感情が悪化した場合には、自動応答を行わず利用者の通話をコールセンタ３のオペレータに転送することを中継装置１００に要求するように構成されてもよい。 When the user's emotion at the start of the call is in an “anger” state or when the user's emotion deteriorates due to the automatic response, the automatic answering device 600 does not perform an automatic response and does not perform the automatic response. It may be configured to request the relay apparatus 100 to transfer to the third operator.

自動応答装置６００は、利用者の通話をオペレータに転送する際、通話開始から転送までの間に利用者とやり取りされた情報をオペレータに提供してもよい。この機能によって、例えば、オペレータは、利用者から聞き出した情報を再度利用者に質問する必要がなくなり、サービスの効率を向上させることができるとともに、利用者の感情が悪化するのを抑制することができる。 The automatic answering device 600 may provide the operator with information exchanged with the user between the start of the call and the transfer when the user's call is transferred to the operator. With this function, for example, the operator does not need to ask the user again about the information heard from the user, so that the service efficiency can be improved and the user's feelings can be prevented from deteriorating. it can.

音声自動応答システム１は、自動応答装置６００によって選択された応答メッセージを自動応答以外の用途に用いてもよい。例えば、自動応答装置６００は、選択した応答メッセージを応答の例としてオペレータに提供するように構成されてもよい。自動応答装置６００がこのように構成されることによって、例えば、経験の浅いオペレータでもより適切な応答を行うことが可能となる。さらに、自動応答装置６００は、応答メッセージの選択に用いた利用者感情情報をオペレータに提供してもよい。例えば、利用者感情情報が提供されることによって、オペレータは、把握することが難しい他国の利用者の感情を知ることが可能となる。 The automatic voice response system 1 may use the response message selected by the automatic response device 600 for purposes other than the automatic response. For example, the automatic response device 600 may be configured to provide the selected response message to the operator as an example of a response. By configuring the automatic response device 600 in this way, for example, even an inexperienced operator can perform a more appropriate response. Furthermore, the automatic response device 600 may provide the user emotion information used for selecting the response message to the operator. For example, by providing user emotion information, the operator can know the emotions of users in other countries that are difficult to grasp.

音声自動応答システム１は、音声データに基づいて利用者の言語種別を判断し、利用者の言語に応じた応答メッセージを選択するように構成されてもよい。この場合、例えば、感情認識装置２００は、利用者の言語種別を判断する。音声ＤＢ３００及びテキストＤＢ５００は、音声データの識別情報と言語種別とを対応づけて記憶する。応答ＤＢ７００は、応答メッセージの識別情報と言語種別とを対応づけて記憶する。自動応答装置６００は、応答内容情報、感情情報及び言語種別に基づいて、応答ＤＢ７００から応答メッセージを取得する。 The automatic voice response system 1 may be configured to determine a user's language type based on the voice data and select a response message according to the user's language. In this case, for example, the emotion recognition device 200 determines the language type of the user. The voice DB 300 and the text DB 500 store voice data identification information and a language type in association with each other. The response DB 700 stores the identification information of the response message and the language type in association with each other. The automatic response device 600 acquires a response message from the response DB 700 based on the response content information, emotion information, and language type.

なお、利用者の言語種別に応じて取得された応答メッセージは、利用者への自動応答に用いられてもよいし、上述したように、応答の例としてオペレータに提供されてもよい。言語種別に応じた応答メッセージがオペレータに提供されることによって、母国語以外の言語で対応するオペレータの応対の品質を向上させることができる。 Note that the response message acquired in accordance with the language type of the user may be used for an automatic response to the user, or may be provided to the operator as an example of the response as described above. By providing the operator with a response message corresponding to the language type, it is possible to improve the quality of response of the operator who corresponds in a language other than the native language.

音声自動応答システム１は、利用者の通話をオペレータの母国語に翻訳する機能を備えてもよい。この場合、例えば、音声変換装置４００は、テキストデータに変換した利用者の通話をオペレータの母国語に翻訳する。テキストＤＢ５００は、音声データの識別情報と翻訳されたテキストデータとを対応づけて記憶する。自動応答装置６００は、母国語に翻訳された利用者の通話をオペレータに提供する。利用者の通話が母国語に翻訳されてオペレータに提供されることによって、母国語以外の言語で対応するオペレータが利用者の通話の内容をより正確に理解することが可能となる。 The automatic voice response system 1 may have a function of translating a user's call into the operator's native language. In this case, for example, the voice conversion device 400 translates the user's call converted into text data into the native language of the operator. Text DB 500 stores identification information of speech data and translated text data in association with each other. The automatic answering device 600 provides the operator's call translated into the native language to the operator. By translating the user's call into the native language and providing it to the operator, the operator corresponding to the language other than the native language can understand the content of the user's call more accurately.

［第２実施形態］
図１０は、第２実施形態の音声自動応答システム１ａのシステム構成を示すシステム構成図である。
音声自動応答システム１ａは、自動応答装置６００に代えて自動応答装置６００ａを備える点、応答ＤＢ７００に代えて応答ＤＢ７００ａを備える点で第１実施形態の音声自動応答システム１と異なる。音声自動応答システム１ａのその他の構成は、音声自動応答システム１と同様である。図１０では、音声自動応答システム１と同様の構成要素には、図１と同様の符号を付すことによって説明を省略する。 [Second Embodiment]
FIG. 10 is a system configuration diagram showing a system configuration of the automatic voice response system 1a of the second embodiment.
The automatic voice response system 1a differs from the automatic voice response system 1 of the first embodiment in that an automatic response device 600a is provided instead of the automatic response device 600 and a response DB 700a is provided instead of the response DB 700. Other configurations of the automatic voice response system 1 a are the same as those of the automatic voice response system 1. In FIG. 10, the same components as those in the automatic voice response system 1 are denoted by the same reference numerals as those in FIG.

図１１は、自動応答装置６００ａの機能構成を示す機能ブロック図である。
自動応答装置６００ａは、応答生成部６３０に代えて応答生成部６３０ａを備える点、テンプレート生成部６４０をさらに備える点で第１実施形態の自動応答装置６００と異なる。自動応答装置６００ａのその他の構成は、自動応答装置６００と同様である。図１１では、自動応答装置６００と同様の機能を有する機能部には、図７と同様の符号を付すことによって説明を省略する。 FIG. 11 is a functional block diagram showing a functional configuration of the automatic response device 600a.
The automatic response device 600a is different from the automatic response device 600 of the first embodiment in that a response generation unit 630a is provided instead of the response generation unit 630 and a template generation unit 640 is further provided. The other configuration of the automatic response device 600a is the same as that of the automatic response device 600. In FIG. 11, functional units having the same functions as those of the automatic response apparatus 600 are denoted by the same reference numerals as those in FIG.

応答生成部６３０ａは、応答生成部６３０の機能に加えて、利用者への応答の履歴を応答ＤＢ７００ａに登録する機能を有する。この機能により応答生成部６３０ａは、オペレータの音声データの識別情報と、オペレータが実行したアクションを示す応答内容情報とを対応づけて応答ＤＢ７００ａに登録する。 In addition to the function of the response generation unit 630, the response generation unit 630a has a function of registering a history of responses to the user in the response DB 700a. With this function, the response generation unit 630a registers the identification information of the voice data of the operator and the response content information indicating the action executed by the operator in the response DB 700a.

テンプレート生成部６４０は、オペレータによる過去の応答の履歴を参照し、品質の高い応答を自動応答に用いられる応答メッセージのテンプレートに追加する。例えば、テンプレート生成部６４０は、利用者の感情をより良い感情に変化させることができた応答を品質の高い応答として抽出する。例えば、テンプレート生成部６４０は、通話開始時に“怒り”の状態にあった利用者の感情を、通話終了時に“喜び”に変えることができた応答を抽出してもよい。テンプレート生成部６４０は、抽出した応答を新規テンプレートとして、応答管理テーブル７３０に追加する。 The template generation unit 640 refers to the history of past responses by the operator, and adds a high-quality response to a response message template used for an automatic response. For example, the template generation unit 640 extracts a response that can change the user's emotion to a better emotion as a high-quality response. For example, the template generation unit 640 may extract a response that can change the emotion of the user who was in an “anger” state at the start of the call to “joy” at the end of the call. The template generation unit 640 adds the extracted response as a new template to the response management table 730.

応答ＤＢ７００ａは、応答履歴テーブル７４０をさらに備える点で第１実施形態の応答ＤＢ７００と異なる。
図１２は、応答履歴テーブル７４０の具体例を示す図である。
応答履歴テーブル７４０は、利用者への応答において実行されたアクションの履歴を保持するテーブルである。応答履歴テーブル７４０は、通話ＩＤごとに応答履歴レコードを有する。応答履歴レコードは、通話ＩＤ、音声ＩＤ及びアクションＩＤの各値を有する。通話ＩＤ及び音声ＩＤは、音声情報レコード及びテキスト情報レコードが有する通話ＩＤ及び音声ＩＤと同様である。アクションＩＤは、アクション管理レコードが有するアクションＩＤと同様である。応答履歴レコードは、利用者への応答のアクションが実行されると応答生成部６３０ａによって応答履歴テーブル７４０に登録される。 The response DB 700a is different from the response DB 700 of the first embodiment in that it further includes a response history table 740.
FIG. 12 is a diagram illustrating a specific example of the response history table 740.
The response history table 740 is a table that holds a history of actions executed in response to a user. The response history table 740 has a response history record for each call ID. The response history record has each value of call ID, voice ID, and action ID. The call ID and the voice ID are the same as the call ID and the voice ID included in the voice information record and the text information record. The action ID is the same as the action ID included in the action management record. The response history record is registered in the response history table 740 by the response generation unit 630a when an action of response to the user is executed.

図１３は、第２実施形態の音声自動応答システム１ａにおけるテンプレート生成の処理の流れを示すフローチャートである。
まず、利用者がオペレータとの間で通話を行うと、感情認識装置２００は、ステップＳ１０２と同様の処理を行い、音声データから利用者の感情を認識する。また、音声変換装置４００は、ステップＳ１０３と同様の処理を行い、音声データをテキストデータに変換する。自動応答装置６００の応答判断部６２０は、ステップＳ１０３と同様の処理を行い、利用者が希望するサービスを判断する。応答判断部６２０は、利用者が希望するサービスとして判断されたサービスの識別情報を応答種別情報として応答生成部６３０ａに出力する。応答生成部６３０ａは、応答種別情報に対応するアクション管理テーブル７２０を参照し、オペレータの応答に対応するアクションを判断する。応答生成部６３０ａは、対象の通話の識別情報、音声データの識別情報及び応答のアクションを示す応答内容情報を値に持つ応答履歴レコードを生成する。応答生成部６３０ａは、生成した応答履歴レコードを応答履歴テーブル７４０に登録する。 FIG. 13 is a flowchart showing a flow of template generation processing in the automatic voice response system 1a of the second embodiment.
First, when the user makes a call with the operator, the emotion recognition apparatus 200 performs the same processing as step S102 and recognizes the user's emotion from the voice data. In addition, the voice conversion device 400 performs the same processing as step S103, and converts voice data into text data. The response determination unit 620 of the automatic response device 600 performs the same process as in step S103, and determines the service desired by the user. The response determination unit 620 outputs the identification information of the service determined as the service desired by the user to the response generation unit 630a as response type information. The response generation unit 630a refers to the action management table 720 corresponding to the response type information, and determines an action corresponding to the operator's response. The response generation unit 630a generates a response history record having the identification information of the target call, the identification information of the voice data, and the response content information indicating the response action as values. The response generation unit 630a registers the generated response history record in the response history table 740.

テンプレート生成部６４０は、テキスト情報テーブル５１０を参照し、過去の通話において通話終了時の感情情報が通話開始時の感情情報よりもより良い感情に変化した通話ＩＤに対応する応答履歴テーブル７４０を選択する（ステップＳ２０１）。
テンプレート生成部６４０は、選択した応答履歴テーブル７４０に登録されている応答履歴レコードの音声ＩＤの値を取得する。テンプレート生成部６４０は、テキスト情報テーブル５１０を参照し、応答履歴レコードから取得した音声ＩＤの値を音声ＩＤの値に持つテキスト情報レコードを選択する。テンプレート生成部６４０は、選択したテキスト情報レコードからテキスト情報の値と、感情情報の値とを取得する。このように、テンプレート生成部６４０は、オペレータの応答履歴に基づいて、オペレータが応答したメッセージを取得する（ステップＳ２０２）。 The template generation unit 640 refers to the text information table 510 and selects the response history table 740 corresponding to the call ID in which the emotion information at the end of the call has changed to a better emotion than the emotion information at the start of the call in the past call. (Step S201).
The template generation unit 640 acquires the value of the voice ID of the response history record registered in the selected response history table 740. The template generation unit 640 refers to the text information table 510 and selects a text information record having the voice ID value acquired from the response history record as the voice ID value. The template generation unit 640 acquires the value of text information and the value of emotion information from the selected text information record. As described above, the template generation unit 640 acquires a message that the operator has responded to based on the response history of the operator (step S202).

テンプレート生成部６４０は、テキスト情報レコードから取得したテキスト情報及び感情情報の値を応答ＤＢ７００ａに出力する。応答ＤＢ７００ａは、テンプレート生成部６４０から出力された値を持つ応答管理レコードを生成する。応答ＤＢ７００ａは、生成した応答管理レコードを応答管理テーブル７３０に登録する（ステップＳ２０３）。 The template generation unit 640 outputs the text information and emotion information values acquired from the text information record to the response DB 700a. The response DB 700a generates a response management record having the value output from the template generation unit 640. The response DB 700a registers the generated response management record in the response management table 730 (step S203).

図１４は、テキスト情報テーブル５１０の第２の具体例を示す図である。
図１４では、利用者とオペレータとの間で図３の例のテキスト情報テーブル５１０が示す通話が行われたことを表している。
例えば、テンプレート生成部６４０は、過去の通話における応答履歴テーブル７４０のうち、通話終了時の感情情報が、通話開始時の感情情報よりもより良い感情となったテーブルとして、図１２の例に示す内容の応答履歴テーブル７４０を選択する。テンプレート生成部６４０は、選択した応答履歴テーブル７４０から、応答履歴レコードの各値を取得する。 FIG. 14 is a diagram showing a second specific example of the text information table 510.
FIG. 14 shows that a call indicated by the text information table 510 in the example of FIG. 3 is performed between the user and the operator.
For example, the template generation unit 640 is shown in the example of FIG. 12 as a table in which the emotion information at the end of the call becomes a better emotion than the emotion information at the start of the call in the response history table 740 in the past call. The content response history table 740 is selected. The template generation unit 640 acquires each value of the response history record from the selected response history table 740.

テンプレート生成部６４０は、テキスト情報テーブル５１０を参照し、応答履歴レコードから取得した音声ＩＤ及び通話ＩＤの値を音声ＩＤ及び通話ＩＤの値にもつテキスト情報レコードを選択する。テンプレート生成部６４０は、選択したテキスト情報レコードからテキスト情報の値と、感情情報の値とを取得する。テンプレート生成部６４０は、テキスト情報レコードから取得したテキスト情報及びテキスト情報レコードから取得した感情情報の値を応答ＤＢ７００ａに出力する。応答ＤＢ７００ａは、テンプレート生成部６４０から出力された応答情報及び感情情報の値に持つ応答管理レコードを生成する。応答ＤＢ７００ａは、生成した応答管理レコードを応答管理テーブル７３０に登録する。 The template generation unit 640 refers to the text information table 510 and selects a text information record having the voice ID and call ID values acquired from the response history record as the voice ID and call ID values. The template generation unit 640 acquires the value of text information and the value of emotion information from the selected text information record. The template generation unit 640 outputs the text information acquired from the text information record and the value of emotion information acquired from the text information record to the response DB 700a. The response DB 700a generates a response management record having values of response information and emotion information output from the template generation unit 640. The response DB 700a registers the generated response management record in the response management table 730.

上記の処理によって、図１４の例において“Ｖ００２”、“Ｖ００４”、“Ｖ００６”及び“Ｖ００８”を音声ＩＤの値に持つテキスト情報レコードのテキスト情報の値が応答管理テーブル７３０に追加されるメッセージとして選択される。 The message in which the text information value of the text information record having “V002”, “V004”, “V006”, and “V008” as the voice ID values is added to the response management table 730 in the example of FIG. Selected as.

このように構成された第２実施形態の音声自動応答システム１ａでは、自動応答装置６００ａは、過去のオペレータの応答において利用者の感情をより良い感情にすることができた応答を、自動応答装置６００ａが選択する応答メッセージのテンプレートに追加する。そのため、音声自動応答システム１ａは、自動応答機能の精度を自律的に向上させることが可能となる。 In the voice automatic response system 1a according to the second embodiment configured as described above, the automatic response device 600a generates a response that can make the user's emotion better in the response of the past operator. It adds to the template of the response message which 600a selects. Therefore, the voice automatic response system 1a can autonomously improve the accuracy of the automatic response function.

＜変形例＞
自動応答装置６００ａは、ステップＳ２０１において、過去の通話における感情の変化の度合いに基づいて、応答履歴テーブル７４０を選択するように構成されてもよい。また、自動応答装置６００ａは、選択された応答履歴テーブル７４０に含まれる応答メッセージに所定の重み付けを行い、重みに応じて応答メッセージをテンプレートに追加するように構成されてもよい。例えば、選択された応答メッセージが複数の応答履歴テーブル７４０に存在する場合、自動応答装置６００ａは、その応答メッセージによって利用者の感情が変化した回数で重み付けを行ってもよい。 <Modification>
In step S201, the automatic response device 600a may be configured to select the response history table 740 based on the degree of change in emotion in the past call. The automatic response device 600a may be configured to perform a predetermined weight on the response message included in the selected response history table 740 and add the response message to the template according to the weight. For example, when the selected response message exists in a plurality of response history tables 740, the automatic response device 600a may weight the number of times the user's emotion has changed due to the response message.

自動応答装置６００ａは、過去のオペレータの応答において、利用者の感情をより悪い感情にしてしまった応答メッセージを、応答管理テーブル７３０から削除するように構成されてもよい。また、自動応答装置６００ａが、そのような応答をＮＧ集として保持しオペレータに提供すれば、オペレータの応答品質を向上させることも可能となる。自動応答装置６００ａによるテンプレートの応答メッセージの追加において、同一のアクションＩＤ及び感情情報に対応するテンプレートが複数登録された場合、自動応答装置６００ａは、選択された複数のテンプレートから最新のテンプレートを用いて応答するように構成されてもよい。 The automatic response device 600a may be configured to delete, from the response management table 730, a response message that has made the user's emotion worse in the past operator's response. Further, if the automatic response device 600a holds such a response as an NG collection and provides it to the operator, the response quality of the operator can be improved. When a plurality of templates corresponding to the same action ID and emotion information are registered in addition of a template response message by the automatic response device 600a, the automatic response device 600a uses the latest template from the selected templates. It may be configured to respond.

自動応答装置６００は、ステップＳ２０２において取得されたメッセージに名前や住所、クレジットカード番号などの個人情報が含まれる場合、自動応答装置６００は、これらの個人情報を除外するか、応答の対象となる利用者の個人情報に置き換えたメッセージを生成してもよい。 When the message acquired in step S202 includes personal information such as a name, an address, and a credit card number, the automatic response device 600 excludes the personal information or becomes a response target. A message replaced with personal information of the user may be generated.

上述した実施形態における自動応答装置６００及び６００ａをコンピュータで実現するようにしてもよい。その場合、この機能を実現するためのプログラムをコンピュータ読み取り可能な記録媒体に記録して、この記録媒体に記録されたプログラムをコンピュータシステムに読み込ませ、実行することによって実現してもよい。なお、ここでいう「コンピュータシステム」とは、ＯＳや周辺機器等のハードウェアを含むものとする。また、「コンピュータ読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ＲＯＭ、ＣＤ−ＲＯＭ等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶装置のことをいう。さらに「コンピュータ読み取り可能な記録媒体」とは、インターネット等のネットワークや電話回線等の通信回線を介してプログラムを送信する場合の通信線のように、短時間の間、動的にプログラムを保持するもの、その場合のサーバやクライアントとなるコンピュータシステム内部の揮発性メモリのように、一定時間プログラムを保持しているものも含んでもよい。また上記プログラムは、前述した機能の一部を実現するためのものであってもよく、さらに前述した機能をコンピュータシステムにすでに記録されているプログラムとの組み合わせで実現できるものであってもよく、ＦＰＧＡ等のプログラマブルロジックデバイスを用いて実現されるものであってもよい。 The automatic response devices 600 and 600a in the above-described embodiment may be realized by a computer. In that case, a program for realizing this function may be recorded on a computer-readable recording medium, and the program recorded on this recording medium may be read into a computer system and executed. Here, the “computer system” includes an OS and hardware such as peripheral devices. The “computer-readable recording medium” refers to a storage device such as a flexible medium, a magneto-optical disk, a portable medium such as a ROM and a CD-ROM, and a hard disk incorporated in a computer system. Furthermore, the “computer-readable recording medium” dynamically holds a program for a short time like a communication line when transmitting a program via a network such as the Internet or a communication line such as a telephone line. In this case, a volatile memory inside a computer system serving as a server or a client in that case may be included and a program held for a certain period of time. Further, the program may be a program for realizing a part of the above-described functions, and may be a program capable of realizing the functions described above in combination with a program already recorded in a computer system. It may be realized using a programmable logic device such as an FPGA.

以上、この発明の実施形態について図面を参照して詳述してきたが、具体的な構成はこの実施形態に限られるものではなく、この発明の要旨を逸脱しない範囲の設計等も含まれる。 The embodiment of the present invention has been described in detail with reference to the drawings. However, the specific configuration is not limited to this embodiment, and includes designs and the like that do not depart from the gist of the present invention.

１、１ａ…音声自動応答システム，２、２−１〜２−３…電話機，３、３−１〜３−３…コールセンタ，４…ネットワーク，５、５−１〜５−３…コールセンタ管理装置，１００…中継装置，２００…感情認識装置，３００…音声ＤＢ，３１０…音声情報テーブル，４００…音声変換装置，５００…テキストＤＢ，５１０…テキスト情報テーブル，６００、６００ａ…自動応答装置，６１０…通信部，６２０…応答判断部，６３０、６３０ａ…応答生成部，６４０…テンプレート生成部，７００、７００ａ…応答ＤＢ，７１０…フロー管理テーブル，７２０…アクション管理テーブル，７３０…応答管理テーブル，７４０…応答履歴テーブル DESCRIPTION OF SYMBOLS 1, 1a ... Voice automatic response system, 2, 2-1 to 2-3 ... Telephone, 3, 3-1 to 3-3 ... Call center, 4 ... Network, 5, 5-1 to 5-3 ... Call center management apparatus , 100 ... Relay device, 200 ... Emotion recognition device, 300 ... Voice DB, 310 ... Voice information table, 400 ... Voice conversion device, 500 ... Text DB, 510 ... Text information table, 600, 600a ... Automatic response device, 610 ... Communication unit, 620... Response judgment unit, 630, 630a ... Response generation unit, 640 ... Template generation unit, 700, 700a ... Response DB, 710 ... Flow management table, 720 ... Action management table, 730 ... Response management table, 740 ... Response history table

Claims

An emotion recognition unit that acquires user emotion information representing the user's emotion based on voice data transmitted from a call device of a call center user;
A voice conversion unit that converts the voice data into text data;
Based on the text data and the user emotion information, a response message for the user is selected from message candidates, a response unit that converts the selected response message into voice data and transmits it to the call device;
Among the response messages that the call center operator responded to past users, an addition unit that adds or deletes response messages that change the past user's emotions to the message candidates;
With
The adding unit weights the response message that has changed the emotion of the past user according to the number of times used in the past response, and adds the response message to the message candidate according to the weight or Delete,
Response system.

The response unit is a case in which the voice data transmitted to the call device is second voice data that the call center operator responds to a past user, and the second voice data is personal. When voice indicating information is included, voice data obtained by excluding voice indicating the personal information from the second voice data, or voice data obtained by replacing the voice indicating the personal information with personal information of the current user Send to call device,
The response system according to claim 1.

The addition unit adds a message that can change the emotion of the user to a better emotion among the messages that the call center operator has responded to the user in the past, to the message candidate.
The response system according to claim 1 or 2 .

The response unit acquires the response type information based on the text data from a storage unit that stores response type information indicating the type of response to the user and text data in association with each other, and acquires the acquired response type Based on the information and the user emotion information, a message that responds to the user is selected from message candidates.
The response system according to any one of claims 1 to 3 .

The response unit stores the response content information based on the response type information from the storage unit that stores the response content information indicating the content of the process to be executed in response to the user and the response type information in association with each other. Acquiring and selecting a message to respond to the user from among the message candidates based on the acquired response content information and the user emotion information;
The response system according to any one of claims 1 to 4 .

The response unit provides the user emotion information to a call center operator.
The response system according to any one of claims 1 to 5 .

An emotion recognition step of acquiring user emotion information representing the user's emotion based on voice data transmitted from a call device of a call center user;
A voice conversion step of converting the voice data into text data;
A response step of selecting a response message for the user from message candidates based on the text data and the user emotion information, converting the selected response message into voice data and transmitting it to the call device;
An addition step of adding or deleting a response message that has changed the emotion of the past user among response messages that the call center operator responded to the past user;
Have
In the adding step, the response message in which the past user's emotion is changed is weighted according to the number of times used in the past response, and the response message is added to the message candidate according to the weight or Delete,
Response method.

The computer program for functioning a computer as a response system as described in any one of Claim 1 to 6 .