JP2017152948A

JP2017152948A - Information provision method, information provision program, and information provision system

Info

Publication number: JP2017152948A
Application number: JP2016033952A
Authority: JP
Inventors: 藤戸　初夫; Hatsuo Fujito; 初夫藤戸; 山田　和範; Kazunori Yamada; 山田　　和範
Original assignee: Bank of Tokyo Mitsubishi UFJ Trust Co
Current assignee: MUFG Bank Ltd
Priority date: 2016-02-25
Filing date: 2016-02-25
Publication date: 2017-08-31

Abstract

PROBLEM TO BE SOLVED: To create a method for providing a user with optimum information on a service of a company by creating a new point of contact between the company and the user via a communication network.SOLUTION: An information provision method from a company to a user includes: displaying a virtual operator on a communication terminal of the user; and, via an interaction between the user and the virtual operator, selecting an optimum answer from a plurality of answers corresponding to a request of the user to provide the selected answer to the user. The interaction is executed by the user inputting voice information via the communication terminal, transmitting the voice information to a voice recognition server via a network, the voice recognition server recognizing the voice information, an interaction server understanding an intention of the recognized voice information, selecting an answer message corresponding to the intention understood by the interaction server from a database, a voice synthesis server outputting the answer message, and outputting the output answer message as voice of the virtual operator displayed on the communication terminal.SELECTED DRAWING: Figure 3

Description

本発明は、事業者から顧客あるいはユーザへの情報提供方法、および当該情報提供方法を実現するためのプログラムとシステムに関する。 The present invention relates to an information providing method from a business operator to a customer or user, and a program and system for realizing the information providing method.

コンピュータや携帯電話などの通信端末の大幅な普及と、通信ネットワークの飛躍的な発展に伴い、各種サービスを提供する事業者と顧客あるいはユーザ（以下、本明細書、特許請求の範囲において、ユーザとユーザを総じてユーザと記す）との間のビジネス形態も大きく変化している。例えば金融商品をサービスとして提供する銀行や証券会社では、従来ユーザは店舗に足を運ぶ、あるいは電話などによって事業者と直接口頭で通話することにより、事業者が提供するサービスを購入したり、サービスに係る種々の問題を解決していた。現在では、通信ネットワークの発展により、ユーザは通信ネットワークを経由してサービスの提供を受けたり、問題解決に至ることができるようになってきた。典型的な例としてインターネットバンキングが挙げられ、ユーザは通信端末を操作することにより、時間と場所を選ばずに、各種商取引が可能となっている。このような商取引の変化は金融業界のみならず、様々な事業形態でも同様である。 With the widespread use of communication terminals such as computers and mobile phones, and the rapid development of communication networks, businesses and customers or users who provide various services (hereinafter referred to as users in this specification and claims) The form of business between the user and the user as a whole is also changing greatly. For example, in banks and securities companies that provide financial products as services, users can purchase services provided by business operators by visiting the store, or by making a verbal call directly to the business operator over the telephone, etc. Various problems related to the problem were solved. Nowadays, with the development of communication networks, users have been able to receive services via the communication network and solve problems. A typical example is Internet banking, and a user can perform various commercial transactions by operating a communication terminal regardless of time and place. Such changes in commerce are the same not only in the financial industry but also in various business forms.

また最近では人工知能を備えた音声認識技術が発展し、ユーザからの要望に対する対応や苦情の処理など、従来事業者あるいはそのオペレータが直接対処してきた業務に対する支援システムも開発されている。例えば特許文献１には、ユーザが通信端末に対して音声情報を入力し、入力された音声情報の解析結果に基づいてユーザの要求や質問、苦情の意図を抽出することで、ユーザに対するより適切かつ迅速な対応を支援するシステムが開示されている。 Recently, speech recognition technology equipped with artificial intelligence has been developed, and a support system has been developed for work that has been directly handled by the operator or its operator, such as handling requests from users and handling complaints. For example, in Patent Document 1, the user inputs voice information to the communication terminal, and the user's request, question, and complaint intention are extracted based on the analysis result of the input voice information. A system that supports quick response is disclosed.

特開平０５−２１６４９３号公報JP 05-216493 A

本発明の課題の一つは、通信ネットワークを通じて事業者とユーザとの新たな接点を創出し、これを利用することで、事業者のサービスに関する最適な情報をユーザに提供する方法を創成することである。あるいは本発明の課題の一つは、携帯端末を介した自然な通話という手段を通じて、ユーザと事業者との距離感を低減させてより親密なユーザ−事業者関係を構築し、ユーザに新しいユーザ体験を享受する機会を与えることである。あるいは本発明の課題の一つは、ユーザ対応による人的資源の消耗を回避することで事業者の業務負担を軽減するとともに、豊富な情報を、事業者の知識や経験、能力に制限されることなく、ユーザに提供することで、ユーザに対して高品質で高付加価値を有するサービスを適時に提供する方法を創成することである。あるいは本発明の課題の一つは、上記方法を実現するためのプログラム、およびシステムを提供することである。 One of the problems of the present invention is to create a new point of contact between a business operator and a user through a communication network, and to create a method for providing the user with optimal information related to the business of the business by using this. It is. Alternatively, one of the problems of the present invention is to build a more intimate user-business relationship by reducing the sense of distance between the user and the business through means such as a natural phone call via a mobile terminal, so that the user can Giving you the opportunity to enjoy the experience. Alternatively, one of the problems of the present invention is to reduce the business burden on the business operator by avoiding the consumption of human resources due to user correspondence, and to limit the abundant information to the knowledge, experience, and ability of the business operator. It is to create a method of providing a high-quality and high-value-added service to the user in a timely manner by providing it to the user. Alternatively, one of the objects of the present invention is to provide a program and a system for realizing the above method.

本発明の実施形態の一つは、情報提供システムである。該情報提供システムは、通信端末から送信されたユーザの音声情報を認識する音声認識部と、音声情報に対する回答メッセージを、複数のメッセージが格納されたデータベースから選択する対話部と、選択された回答メッセージを、少なくとも音声情報および文字情報のいずれか一つとして通信端末に送信する送信部を有し、対話部は、回答メッセージを一つに絞り切れない場合、ユーザに質問するための問合せメッセージをデータベースから選択し、選択された問合せメッセージを通信端末に送信する。 One embodiment of the present invention is an information providing system. The information providing system includes a voice recognition unit that recognizes user voice information transmitted from a communication terminal, a dialog unit that selects a response message for the voice information from a database in which a plurality of messages are stored, and a selected answer. A transmitting unit that transmits the message to the communication terminal as at least one of voice information and character information, and the dialog unit displays an inquiry message for asking a user a question when the answer message cannot be narrowed down to one Select from the database and send the selected inquiry message to the communication terminal.

上記情報提供システムはさらに通信端末を含むことができる。通信端末は、仮想オペレータを表示するように構成することができる。また上記情報提供システムはさらに、回答メッセージ、問合せメッセージを通信端末で再生する際、回答メッセージ、問合せメッセージを再生しない時と比較し、仮想オペレータの一部が大きな動作を行うように構成することができる。 The information providing system may further include a communication terminal. The communication terminal can be configured to display a virtual operator. In addition, the information providing system may be configured such that when the answer message and the inquiry message are reproduced on the communication terminal, a part of the virtual operator performs a large operation compared to when the answer message and the inquiry message are not reproduced. it can.

上記情報提供システムはさらに、ユーザの標準生体情報が格納される第１のサーバを有し、通信端末はユーザの生体情報を取り込む画像取込装置を有し、生体情報と標準生体情報に基づいてユーザが承認された場合に、ユーザに音声情報の入力を許容するように構成することができる。 The information providing system further includes a first server in which standard biometric information of the user is stored, and the communication terminal includes an image capturing device that captures the biometric information of the user, based on the biometric information and the standard biometric information. It can be configured to allow the user to input voice information when the user is approved.

上記情報提供システムはさらに、ユーザと事業者との通話のための第２のサーバをさらに有することができる。 The information providing system may further include a second server for calling between the user and the operator.

上記情報提供システムはさらに、音声認識部によって認識された音声情報、選択された回答メッセージ、および問合せメッセージを第２のサーバが格納するように構成することができる。 The information providing system can be further configured such that the second server stores the voice information recognized by the voice recognition unit, the selected answer message, and the inquiry message.

上記情報提供システムはさらに地図情報が格納される第３のサーバを有し、事業者の複数の店舗に関する情報を格納するように第１のサーバを構成し、表示部に通信端末の位置と複数の店舗の情報を表示するように通信端末を構成することができる。 The information providing system further includes a third server in which map information is stored, the first server is configured to store information on a plurality of stores of the business operator, and the position of the communication terminal and the plurality of information are displayed on the display unit. The communication terminal can be configured to display the store information.

音声情報に基づいて複数の店舗から少なくとも一つの店舗を選択するように対話部を構成し、選択された店舗への道順を通信端末へ送信するように第３のサーバを構成することができる。 The dialogue unit can be configured to select at least one store from a plurality of stores based on the audio information, and the third server can be configured to transmit the route to the selected store to the communication terminal.

本発明の実施形態の一つは、ユーザによって音声入力された質問に対応する回答メッセージを複数のメッセージから絞り込み、回答メッセージを一つに絞り切れない場合には、ユーザに質問するための問合せメッセージを複数のメッセージから選択するように構成されるサーバである。 In one embodiment of the present invention, an answer message corresponding to a question inputted by voice by a user is narrowed down from a plurality of messages, and when the answer message cannot be narrowed down to one, an inquiry message for asking a question to the user Is a server configured to select from a plurality of messages.

サーバは、ユーザが入力した音声情報を認識する音声認識サーバから質問が伝達されるように構成することができる。 The server can be configured such that the question is transmitted from a voice recognition server that recognizes voice information input by the user.

サーバは、回答メッセージと問合せメッセージが音声合成サーバへ伝達されるように構成することができる。 The server can be configured such that the answer message and the inquiry message are transmitted to the speech synthesis server.

サーバは、ユーザの生体情報を格納するように構成される第１のサーバと接続されてもよい。 The server may be connected to a first server configured to store user biometric information.

第１のサーバは、事業者の店舗情報を格納するように構成することができる。 The first server can be configured to store business store information.

第１のサーバは、ユーザの質問、選択された回答メッセージ、選択された問合せメッセージの少なくとも一つを格納するように構成される第２のサーバと接続することができる。 The first server may be connected to a second server configured to store at least one of a user question, a selected answer message, and a selected query message.

本発明の実施形態の一つは、通信端末から送信される、ユーザの質問を含む音声情報を認識し、認識されたユーザの質問に対応する回答メッセージを複数の回答メッセージから選択し、選択された回答メッセージを、少なくとも音声情報および文字情報のいずれか一つとして通信端末に送信し、回答メッセージが一つに絞り切れない場合、ユーザに質問するための問合せメッセージを複数の回答メッセージから選択し、選択された問合せメッセージを通信端末に送信することを実行させるプログラムである。 One embodiment of the present invention recognizes voice information including a user's question transmitted from a communication terminal, selects an answer message corresponding to the recognized user's question from a plurality of answer messages, and is selected. The response message is transmitted to the communication terminal as at least one of voice information and character information, and when the response message cannot be narrowed down to one, the inquiry message for asking the user a question is selected from a plurality of response messages. , A program for executing transmission of a selected inquiry message to a communication terminal.

上記プログラムは、通信端末が取り込むユーザの生体情報を受信し、生体情報を用いてユーザの認証を行うことを実行させるように構成することができる。 The program can be configured to receive user biometric information captured by the communication terminal and execute user authentication using the biometric information.

上記プログラムは、ユーザの質問、選択された回答メッセージ、選択された問合せメッセージの少なくとも一つを格納することをさらに実行させるように構成することができる。 The program may be further configured to store at least one of a user question, a selected answer message, and a selected inquiry message.

上記プログラムは、格納されたユーザの質問、選択された回答メッセージ、選択された問合せメッセージの少なくとも一つを参照し、通信端末と事業者の電話端末とを接続することをさらに実行させるように構成することができる。 The program is configured to refer to at least one of the stored user question, the selected answer message, and the selected inquiry message, and to further connect the communication terminal and the telephone terminal of the operator. can do.

上記プログラムは、格納されたユーザの質問、選択された回答メッセージ、選択された問合せメッセージの少なくとも一つを参照し、通信端末に事業者の店舗情報を表示することをさらに実行させるように構成することができる。 The program refers to at least one of the stored user question, the selected answer message, and the selected inquiry message, and is configured to further execute displaying the store information of the operator on the communication terminal. be able to.

上記プログラムは、通信端末から質問を文字情報として受信し、回答メッセージを通信端末に文字情報として送信することをさらに実行させるように構成することができる。 The program can be configured to further execute receiving a question as character information from the communication terminal and transmitting an answer message as character information to the communication terminal.

上記プログラムは、音声情報内に含まれる質問が検出できないとき、通信端末から質問を文字情報として受信し、回答メッセージを通信端末に文字情報として送信することをさらに実行させるように構成することができる。 The program can be configured to further execute receiving a question as text information from the communication terminal and transmitting an answer message to the communication terminal as text information when a question included in the voice information cannot be detected. .

上記プログラムは、通信端末から通信端末周囲の音量データを受信し、音量データに応じて通信端末から質問を文字情報として受信することを実行させるように構成することができる。 The program can be configured to receive volume data around the communication terminal from the communication terminal and to receive a question as character information from the communication terminal according to the volume data.

本発明の実施形態の一つは、音声認識サーバ、対話サーバ、音声合成サーバが含まれるシステムに音声情報を送信し、音声情報に含まれるユーザの質問に対する回答メッセージ、質問に対する問合せメッセージの少なくとも一つを受信し、回答メッセージと問合せメッセージを、少なくとも音声情報および文字情報のいずれか一つとして出力することを通信端末に実行させるプログラムである。 One embodiment of the present invention transmits speech information to a system including a speech recognition server, a dialogue server, and a speech synthesis server, and at least one of an answer message for a user question and an inquiry message for a question included in the speech information. Is a program that causes a communication terminal to execute a response message and an inquiry message as at least one of voice information and character information.

上記プログラムは、通信端末に記憶されるように構成することができる。 The program can be configured to be stored in a communication terminal.

上記プログラムは、通信ネットワークから取得されるように構成することができる。 The program can be configured to be acquired from a communication network.

上記プログラムは、仮想オペレータを表示するように構成することができる。また上記プログラムは、前記回答メッセージ、前記問合せメッセージを前記通信端末で再生する際、前記回答メッセージ、前記問合せメッセージを再生しない時と比較し、前記仮想オペレータの一部が大きな動作を行うように構成することができる。 The program can be configured to display a virtual operator. The program is configured such that when the answer message and the inquiry message are reproduced on the communication terminal, a part of the virtual operator performs a large operation compared to when the answer message and the inquiry message are not reproduced. can do.

上記プログラムは、ユーザの生体情報を取得するように構成することができる。 The said program can be comprised so that a user's biometric information may be acquired.

上記プログラムは、通信端末と事業者の電話端末とを接続するように構成することができる。 The program can be configured to connect the communication terminal and the telephone terminal of the business operator.

上記プログラムは、通信端末の位置情報を取得し、通信端末に地図を表示し、事業者の店舗情報を通信端末に表示するように構成することができる。 The said program can be comprised so that the positional information on a communication terminal may be acquired, a map may be displayed on a communication terminal, and a shop's shop information may be displayed on a communication terminal.

上記プログラムは、通信端末の位置から事業者の店舗までの経路を表示するように構成することができる。 The program can be configured to display a route from the position of the communication terminal to the business operator's store.

上記プログラムは、質問を文字情報として送信することをさらに実行させるように構成することができる。 The program can be configured to further execute sending the question as character information.

上記プログラムは、通信端末周囲の音量を測定し、音量のデータを音声認識サーバ、対話サーバ、音声合成サーバのうち少なくとも一つに送信することを実行させるように構成することができる。 The program can be configured to measure the volume around the communication terminal and to transmit the volume data to at least one of a voice recognition server, a dialogue server, and a voice synthesis server.

上記プログラムは、音量に応じて、質問を文字情報として送信することをさらに実行させるように構成することができる。 The program can be configured to further execute transmitting the question as character information in accordance with the volume.

本発明の実施形態の一つは、通信端末にユーザが入力した音声情報を送信し、送信された音声情報を認識し、認識された音声情報に含まれるユーザの質問に対応する回答メッセージを複数の回答メッセージから選択し、選択された回答メッセージを、少なくとも音声情報および文字情報のいずれか一つとして通信端末に送信し、回答メッセージが一つに絞り切れない場合、ユーザに質問するための問合せメッセージを複数の回答メッセージから選択し、選択された問合せメッセージを通信端末に送信する情報提供方法である。 In one embodiment of the present invention, voice information input by a user is transmitted to a communication terminal, the transmitted voice information is recognized, and a plurality of answer messages corresponding to user questions included in the recognized voice information are received. Inquiry to ask the user a question if the selected response message is transmitted to the communication terminal as at least one of voice information and character information and the response message cannot be narrowed down to one. This is an information providing method in which a message is selected from a plurality of reply messages and the selected inquiry message is transmitted to a communication terminal.

上記情報提供方法は、通信端末に仮想オペレータを表示させることをさらに含むことができる。また、前記回答メッセージ、前記問合せメッセージを前記通信端末で再生する際、前記回答メッセージ、前記問合せメッセージを再生しない時と比較し、前記仮想オペレータの一部を大きく動かすことを含むことができる。 The information providing method may further include displaying a virtual operator on the communication terminal. In addition, when the answer message and the inquiry message are reproduced on the communication terminal, a part of the virtual operator may be largely moved as compared with a case where the answer message and the inquiry message are not reproduced.

上記情報提供方法は、通信端末が取り込むユーザの生体情報を送信し、送信された生体情報を用いてユーザの認証を行うことをさらに含むことができる。 The information providing method may further include transmitting user biometric information captured by the communication terminal and authenticating the user using the transmitted biometric information.

上記情報提供方法は、ユーザの質問、選択された回答メッセージ、選択された問合せメッセージの少なくとも一つを格納することをさらに含むことができる。 The information providing method may further include storing at least one of a user question, a selected answer message, and a selected inquiry message.

上記情報提供方法は、格納されたユーザの質問、選択された回答メッセージ、選択された問合せメッセージの少なくとも一つを参照し、通信端末と事業者の電話端末とを接続することをさらに含むことができる。 The information providing method may further include referring to at least one of the stored user question, the selected answer message, and the selected inquiry message, and connecting the communication terminal and the operator's telephone terminal. it can.

上記情報提供方法は、格納されたユーザの質問、選択された回答メッセージ、選択された問合せメッセージの少なくとも一つを参照し、通信端末に事業者の店舗情報を表示することを含むことができる。 The information providing method may include displaying at least one of the stored user question, the selected answer message, and the selected inquiry message, and displaying the store information of the operator on the communication terminal.

上記情報提供方法は、質問を文字情報（テキスト）として通信端末に入力し、質問を送信し、回答メッセージを通信端末に文字情報として送信し、回答メッセージを前記通信端末に表示することを含むことができる。 The information providing method includes inputting a question as character information (text) to a communication terminal, transmitting the question, transmitting an answer message as character information to the communication terminal, and displaying the answer message on the communication terminal. Can do.

上記情報提供方法は、通信端末を用いて周囲の音量を測定することを含むことができる。また音量に応じて、質問を文字情報として通信端末に入力し、質問を送信し、回答メッセージを通信端末に文字情報として送信し、回答メッセージを通信端末に表示することを含むことができる。 The information providing method may include measuring surrounding sound volume using a communication terminal. Also, depending on the volume, it may include inputting the question as character information to the communication terminal, transmitting the question, transmitting the answer message to the communication terminal as character information, and displaying the answer message on the communication terminal.

ユーザと仮想オペレータの対話を行うことで、親密なユーザ―事業者関係を構築することができる。また、ユーザと仮想オペレータの対話を通じて、ユーザが要望する情報を、事業者が提供しえる数多くの情報の中から見つけ出すことが可能となる。すなわち、事業者が提供する多くの情報の中から「選択する」という煩雑な作業をユーザに強制することなく、対話を通じ、必要とする情報へユーザを「案内、誘導する」ことができ、新しいユーザ体験を提供することができる。また、ユーザ対応における人的資源の消耗を回避することで事業者の業務の効率化と生産性の向上を達成し、均一かつ豊富な情報を、事業者やそのオペレータの知識や経験、能力に制限されることなく、ユーザに提供することで、高品質で高付加価値を有するサービスを適時に提供することができる。さらにユーザは、事業者が管理する資源量とその変動を容易に把握することができる。 By having the user and the virtual operator interact, an intimate user-business relationship can be established. In addition, through the interaction between the user and the virtual operator, it is possible to find out information desired by the user from among a lot of information that can be provided by the business operator. In other words, the user can be “guided and guided” to the required information through dialogue without forcing the user to perform the complicated task of “selecting” from the large amount of information provided by the operator. A user experience can be provided. In addition, by avoiding the consumption of human resources in response to users, the business efficiency of the business operator and the improvement of productivity can be achieved, and uniform and abundant information can be used for the knowledge, experience and ability of the business operator and its operators. By providing the user without limitation, a service having high quality and high added value can be provided in a timely manner. Further, the user can easily grasp the amount of resources managed by the business operator and the variation thereof.

本発明の実施形態の情報提供方法に適用される通信端末の構成図。The block diagram of the communication terminal applied to the information provision method of embodiment of this invention. 本発明の実施形態の情報提供方法を説明するフローチャート。The flowchart explaining the information provision method of embodiment of this invention. 本発明の実施形態の情報提供方法において表示される画像の一例。An example of the image displayed in the information provision method of embodiment of this invention. 本発明の実施形態の情報提供方法を説明するフローチャート。The flowchart explaining the information provision method of embodiment of this invention. 本発明の実施形態の情報提供方法を説明するフローチャート。The flowchart explaining the information provision method of embodiment of this invention. 本発明の実施形態の情報提供方法において表示される画像の一例。An example of the image displayed in the information provision method of embodiment of this invention. 本発明の実施形態の情報提供方法において表示される画像の一例。An example of the image displayed in the information provision method of embodiment of this invention. 本発明の実施形態の情報提供方法を説明するフローチャート。The flowchart explaining the information provision method of embodiment of this invention. 本発明の実施形態の情報提供システムの構成例。The structural example of the information provision system of embodiment of this invention. 本発明の実施形態の情報提供方法において表示される画像の一例。An example of the image displayed in the information provision method of embodiment of this invention. 本発明の実施形態の情報提供方法において表示される画像の一例。An example of the image displayed in the information provision method of embodiment of this invention. 本発明の実施形態の情報提供方法において表示される画像の一例。An example of the image displayed in the information provision method of embodiment of this invention. 本発明の実施形態の情報提供方法において表示される画像の一例。An example of the image displayed in the information provision method of embodiment of this invention. 本発明の実施形態の情報提供方法を説明するフローチャート。The flowchart explaining the information provision method of embodiment of this invention. 本発明の実施形態の情報提供方法を説明するフローチャート。The flowchart explaining the information provision method of embodiment of this invention. 本発明の実施形態の情報提供方法を説明するフローチャート。The flowchart explaining the information provision method of embodiment of this invention.

以下、本発明の一実施形態における情報提供方法、ならびにそれを実現するためのプログラムとシステムについて、図面を参照しながら詳細に説明する。以下に示す実施形態は本発明の実施形態の一例であって、本発明はこれらの実施形態に限定されるものではない。 Hereinafter, an information providing method according to an embodiment of the present invention, and a program and a system for realizing the method will be described in detail with reference to the drawings. The following embodiments are examples of the embodiments of the present invention, and the present invention is not limited to these embodiments.

＜第１実施形態＞
本発明の第１実施形態における情報提供方法について、図面を参照しながら説明する。本実施形態では、ユーザが通信端末を用いて通信端末上に表示される、事業者のオペレータを模した画像（本明細書および請求項において、仮想オペレータと記す）との自然な対話を行い、これによって事業者がユーザに適した回答や情報を提供するための情報提供方法、ならびにそれを実行するためのプログラムについて説明する。 <First Embodiment>
An information providing method according to the first embodiment of the present invention will be described with reference to the drawings. In the present embodiment, the user performs a natural dialogue with an image imitating an operator of a business operator (displayed as a virtual operator in the present specification and claims) displayed on the communication terminal using the communication terminal, In this manner, an information providing method for a provider to provide answers and information suitable for the user, and a program for executing the information will be described.

［１．準備］
図１に本実施形態の情報提供方法で使用可能な通信端末１００のシステム図、図２に本実施形態の情報提供方法のフローチャートを示す。最初にユーザは、事業者が提供するプログラムを含むアプリケーションがインストールされた通信端末１００を用意する。アプリケーションは媒体に格納された状態で提供されてもよく、通信ネットワークを介して配布され、ユーザが取得できる状態であってもよい。 [1. Preparation]
FIG. 1 shows a system diagram of a communication terminal 100 that can be used in the information providing method of the present embodiment, and FIG. 2 shows a flowchart of the information providing method of the present embodiment. First, the user prepares the communication terminal 100 in which an application including a program provided by the operator is installed. The application may be provided in a state stored in a medium, or may be distributed via a communication network and acquired by a user.

通信端末１００はユーザが操作可能な端末であり、据え付け型（デスクトップ型）コンピュータ、ノート型コンピュータ、あるいは携帯電話やタブレット、スマートフォンなどの携帯通信端末などが挙げられる。通信端末１００の構成例を図２に示す。図２に示すように、通信端末１００にはＣＰＵ（中央演算ユニット）１０２、メモリ１０４、表示部１０６、入力部１０８、音声入力部１１０、音声出力部１１２、通信部１１４、画像取込装置１１６などが含まれる。 The communication terminal 100 is a terminal that can be operated by the user, and includes a stationary (desktop) computer, a notebook computer, or a portable communication terminal such as a mobile phone, a tablet, or a smartphone. A configuration example of the communication terminal 100 is shown in FIG. As shown in FIG. 2, the communication terminal 100 includes a CPU (Central Processing Unit) 102, a memory 104, a display unit 106, an input unit 108, an audio input unit 110, an audio output unit 112, a communication unit 114, and an image capturing device 116. Etc. are included.

ＣＰＵ１０２は通信端末１００に格納された種々のプログラムを解釈、実行する機能を有する。メモリ１０４はプログラムやデータなどの情報を格納する機能を有し、フラッシュメモリなどの不揮発性メモリ素子を含有する。表示部１０６は液晶表示装置やＥＬ有機装置などを有しており、ユーザのインターフェースとして機能する。入力部１０８は、ユーザが物理的接触を通じて情報を通信端末に入力するインターフェースとしての機能を有し、典型的にはタッチパネルやキーボード、入力ボタンなどが挙げられる。音声入力部１１０は音声を取り込み、入力された音声情報を通信部１１４へ伝達する機能を有する。音声出力部１１２はサーバから送信される音声情報や、電話回線などの通信ネットワークを通じて他の通信端末から送信される音声情報を再生する機能を有する。通信部１１４は通信ネットワークを通してサーバや他の通信端末との間で音声情報や文字情報などを送受信する機能を有する。 The CPU 102 has a function of interpreting and executing various programs stored in the communication terminal 100. The memory 104 has a function of storing information such as programs and data, and includes a nonvolatile memory element such as a flash memory. The display unit 106 includes a liquid crystal display device, an EL organic device, and the like, and functions as a user interface. The input unit 108 has a function as an interface through which a user inputs information to the communication terminal through physical contact, and typically includes a touch panel, a keyboard, an input button, and the like. The voice input unit 110 has a function of capturing voice and transmitting the input voice information to the communication unit 114. The audio output unit 112 has a function of reproducing audio information transmitted from a server and audio information transmitted from another communication terminal through a communication network such as a telephone line. The communication unit 114 has a function of transmitting and receiving voice information, character information, and the like with a server and other communication terminals through a communication network.

ここで本明細書、特許請求の範囲におけるサーバとは、事業者自身が維持、管理して利用するサーバでもよく、事業者とは異なる情報処理サービス提供者などが維持、管理、提供するサーバであってもよい。以下に示すように、本実施形態の情報提供方法を実現するためのシステムの多くは情報処理サービス提供者が管理、提供するサーバを利用することができる。事業者が公開できない情報、例えばユーザの個人情報に係る情報などは、事業者自らが維持、管理するサーバ内に格納して使用することが好ましい。 Here, the server in this specification and claims may be a server that is maintained, managed, and used by the operator itself, or a server that is maintained, managed, and provided by an information processing service provider that is different from the provider. There may be. As shown below, many of the systems for realizing the information providing method of the present embodiment can use a server managed and provided by an information processing service provider. Information that cannot be disclosed by the operator, for example, information related to the user's personal information, is preferably stored and used in a server that is maintained and managed by the operator.

［２．起動］
図２を参照すると、ユーザがアプリケーションを起動すると（ステップ２００）、プログラムは通信端末の表示部１０６に仮想オペレータを表示する（ステップ２０２）。図３は仮想オペレータの一例であり、通信端末１００の表示部１０６に表示した状態を例示したものである。仮想オペレータはユーザ対応を行う従業員（以下、オペレータと記す）などを模したものでよい。また、仮想オペレータが人間の自然な動作（例えば表情の変化、頭や手の動きなど）を行うように設定し、ユーザに親近感を抱かせるようにプログラムを構成してもよい。 [2. Start-up]
Referring to FIG. 2, when the user starts an application (step 200), the program displays a virtual operator on the display unit 106 of the communication terminal (step 202). FIG. 3 is an example of a virtual operator, and illustrates a state displayed on the display unit 106 of the communication terminal 100. The virtual operator may be a model of an employee who handles a user (hereinafter referred to as an operator). Further, the program may be configured so that the virtual operator performs natural human actions (for example, changes in facial expressions, head and hand movements, etc.) and makes the user feel close.

上述したように、図３ではオペレータを模した仮想オペレータを示しているが、仮想オペレータの形態に制約はない。仮想オペレータの性別や容貌、服装、あるいは仮想オペレータの背景などをユーザが任意に選択、設定できるようにプログラムを構成してもよい（ステップ２０４）。これにより、ユーザが単に個人的嗜好に基づいて仮想オペレータを選択できるのみならず、ユーザ専任の仮想オペレータを固定することもできる。このような機能を付与することで、ユーザに所謂「お得意様」意識を抱かせることができ、より親密なユーザ―事業者関係の構築に寄与することができる。 As described above, FIG. 3 shows a virtual operator imitating an operator, but there is no restriction on the form of the virtual operator. The program may be configured so that the user can arbitrarily select and set the gender and appearance of the virtual operator, the clothes, or the background of the virtual operator (step 204). Thus, the user can not only select a virtual operator based on personal preference, but also can fix a virtual operator dedicated to the user. By giving such a function, it is possible to make the user have a so-called “customer” consciousness and contribute to the establishment of a more intimate user-business relationship.

ここでプログラムは、必要に応じてユーザに対する認証プロセスを実行するように構成されていてもよい。例えばユーザの要求や問い合わせ、質問（以下、本明細書、特許請求の範囲において、総じて質問と記す）に係る内容が高いレベルの機密性を要する場合には、認証プロセス（ステップ２０６）を実行することが好ましい。 Here, the program may be configured to execute an authentication process for the user as necessary. For example, if the content of a user request, inquiry, or question (hereinafter referred to as a question in the present specification and claims) is required to have a high level of confidentiality, an authentication process (step 206) is executed. It is preferable.

認証プロセス（ステップ２０６）は、例えば通信端末１００に備えられた画像取込装置１１６を用いてユーザの顔の画像、指紋や静脈のパターンなどの生体情報を取り込み、あらかじめサーバに格納されたユーザの標準データ（標準生体情報）と比較し、その結果に基づいて行うことができる。あるいはユーザにパスワードの入力を要請し、サーバに格納されたユーザのパスワードとの比較を行うことで認証プロセスを行ってもよい。あるいはユーザに音声入力を要請し、入力された音声（声紋）を利用してユーザ認証を行ってもよい。認証プロセスにおいて、通信端末１００がユーザ以外の者によって操作されていると判断された場合、事業者に自動的に通知するようプログラムを構成してもよい。 In the authentication process (step 206), for example, the image capturing device 116 provided in the communication terminal 100 is used to capture biometric information such as an image of a user's face, a fingerprint or a vein pattern, and the user's information stored in the server in advance. Comparison with standard data (standard biological information) can be performed based on the result. Alternatively, the authentication process may be performed by requesting the user to input a password and comparing the password with the user password stored in the server. Alternatively, user authentication may be requested and user authentication may be performed using the input voice (voice print). In the authentication process, if it is determined that the communication terminal 100 is operated by a person other than the user, the program may be configured to automatically notify the operator.

ユーザ認証が正常に終了し、通信端末がユーザ自身によって操作されていると判断された場合には、アプリケーションにおいて事業者が提供するサービス内容をユーザごとに変更してもよい。あるいは、そのユーザに対して通知するべき回答メッセージを音声として出力する、あるいは文字として表示部１０６に表示してもよい（ステップ２０８）。 When it is determined that the user authentication is normally completed and the communication terminal is operated by the user, the service content provided by the operator in the application may be changed for each user. Alternatively, an answer message to be notified to the user may be output as voice or displayed on the display unit 106 as characters (step 208).

［３．基本動作］ [3. basic action]

本実施形態に係る情報提供方法、およびそのプログラムでは、図３に例示されたように、仮想オペレータ以外に、例えばアイコン３００を表示するように構成される。このアイコン３００は音声入力部１１０をアクティブにするアイコンであり、ユーザはこのアイコン３００を操作することで音声入力を開始することができる。また、事業者の店舗情報を表示するモードに移行するためのアイコン３０２を表示するようにプログラムが構成されていてもよい。後述するように、このアイコン３０２を用いることで、例えば事業者の各店舗が提供するサービス内容、店舗の混雑状況や位置、店舗までの道順などを表示するようにプログラムを構成してもよい。 As illustrated in FIG. 3, the information providing method and the program according to the present embodiment are configured to display, for example, an icon 300 in addition to the virtual operator. The icon 300 is an icon that activates the voice input unit 110, and the user can start voice input by operating the icon 300. Moreover, the program may be comprised so that the icon 302 for shifting to the mode which displays a shop information of a provider may be displayed. As will be described later, by using this icon 302, the program may be configured to display, for example, the service contents provided by each store of the business operator, the congestion status and position of the store, the route to the store, and the like.

ユーザはまずアイコン３００を操作して音声入力部１１０をアクティブにする。この後ユーザは、事業者に対する質問を口頭で述べる（ステップ２１０）。プログラムの命令に従い、音声入力部１１０はユーザの質問を音声情報として通信部１１４へ伝達し、通信部１１４は通信ネットワークを経由し、音声認識サーバのインターフェースとなるＡＰＩ（アプリケーションプログラミングインターフェース）へ音声情報を送信する（ステップ２１２）。送信された音声情報は、音声認識サーバにおいて解析され、文字データ（テキストデータ）へ変換される（ステップ２１４）。 First, the user operates the icon 300 to activate the voice input unit 110. Thereafter, the user verbally asks questions about the business (step 210). In accordance with the instructions of the program, the voice input unit 110 transmits the user's question as voice information to the communication unit 114, and the communication unit 114 passes the communication network to the API (application programming interface) serving as an interface of the voice recognition server. Is transmitted (step 212). The transmitted voice information is analyzed in the voice recognition server and converted into character data (text data) (step 214).

質問の入力は音声による入力に限られず、通信端末１００の入力部１０８を用いて文字情報を入力することで行っても構わない。例えば発声が禁じられている場合や、喧噪な環境下でアプリケーションを使用したい場合には文字入力（テキスト入力）が有効である。このような場合、後述するような文字入力、出力のみによる操作モード（ウェブチャットモード）へ移行することも可能である。 The input of the question is not limited to voice input, and may be performed by inputting character information using the input unit 108 of the communication terminal 100. For example, character input (text input) is effective when utterance is prohibited or when an application is to be used in a frivolous environment. In such a case, it is possible to shift to an operation mode (web chat mode) based only on character input and output as will be described later.

テキストデータへ変換された音声情報は対話サーバで解析され、ユーザの質問の意図が理解、把握される（ステップ２１６）。必要に応じて対話サーバは、当該意図が正しく理解されているかを確認するためのメッセージ（確認メッセージ）を、種々のメッセージが蓄積されたサーバあるいはサーバ内のデータベースから選択する（ステップ２１８）。その後、音声合成サーバへ確認メッセージの文字情報が伝達され、音声合成サーバは当該文字情報を音声情報へ変換し、音声合成サーバのＡＰＩを通してユーザの通信端末１００へ音声情報を送信する（ステップ２１８）。その後プログラムの命令に従い、通信端末１００が受信した音声情報を音声出力部１１２が音声として再生する（ステップ２２０）。確認メッセージは文字情報として表示部１０６上に表示してもよく、それと同時に音声として再生してもよい。音声合成サーバによって音声情報が生成されるため、実際の人間が発するような声を実現することができる。 The voice information converted into the text data is analyzed by the dialogue server, and the intention of the user's question is understood and grasped (step 216). If necessary, the dialogue server selects a message (confirmation message) for confirming whether or not the intention is correctly understood from a server in which various messages are stored or a database in the server (step 218). Thereafter, the character information of the confirmation message is transmitted to the voice synthesis server, and the voice synthesis server converts the character information into voice information, and transmits the voice information to the user communication terminal 100 through the API of the voice synthesis server (step 218). . Thereafter, the audio output unit 112 reproduces the audio information received by the communication terminal 100 as audio in accordance with the instructions of the program (step 220). The confirmation message may be displayed on the display unit 106 as character information, and may be reproduced as voice at the same time. Since voice information is generated by the voice synthesis server, it is possible to realize a voice uttered by an actual person.

ユーザは、対話サーバが意図を正しく理解していると認識すれば、アイコン３００に触れて音声入力部１１０をアクティブにし、その旨を音声情報として音声入力部１１０に入力する、あるいは入力部１０８を用いてテキスト入力することができる（ステップ２２２）。一方、対話サーバが意図を正しく理解していないと認識した場合、アイコン３００に触れて音声入力部１１０をアクティブにし、質問を再度音声入力部１１０に入力する、あるいは入力部１０８を操作してテキスト形式で質問を再度入力すればよい（ステップ２２２）。 If the user recognizes that the dialog server correctly understands the intention, the user touches the icon 300 to activate the voice input unit 110 and inputs the voice information to the voice input unit 110 as voice information. Can be used to enter text (step 222). On the other hand, when the dialog server recognizes that the intention is not correctly understood, the voice input unit 110 is activated by touching the icon 300 and the question is input to the voice input unit 110 again, or the input unit 108 is operated to enter the text. The question may be input again in the form (step 222).

ユーザの質問の意図を理解した後、対話サーバは絞り込みエンジン（検索エンジン）などを用いて回答メッセージの絞り込みを行う（ステップ２２４）。そして対話サーバ内のデータベースに蓄積された種々の回答メッセージから適合する回答メッセージを選択する（ステップ２２６）。選択された回答メッセージの文字情報は音声合成サーバで音声情報に変換され、音声合成サーバのＡＰＩを通してユーザの通信端末１００へ送信される（ステップ２２６）。この時、音声情報とともに文字情報として回答メッセージを通信端末１００へ送信してもよい。プログラムの命令に従い、通信端末１００が受信した音声情報を音声出力部１１２が再生する（ステップ２２８）。また同時に、回答メッセージを表示部１０６上にテキストとして表示してもよい。 After understanding the intention of the user's question, the dialogue server narrows down the answer messages using a narrowing engine (search engine) or the like (step 224). Then, a suitable answer message is selected from various answer messages stored in the database in the dialog server (step 226). The character information of the selected answer message is converted into speech information by the speech synthesis server and transmitted to the user communication terminal 100 through the API of the speech synthesis server (step 226). At this time, the reply message may be transmitted to the communication terminal 100 as text information together with the voice information. The voice output unit 112 reproduces the voice information received by the communication terminal 100 in accordance with the instructions of the program (step 228). At the same time, the answer message may be displayed as text on the display unit 106.

データベースには事業者が作成した定型のメッセージを回答メッセージとして蓄積してもよく、対話サーバは蓄積されたメッセージから回答メッセージを選択することができる。ただし本実施形態の情報提供方法はこれに限られず、回答メッセージを選択するだけでなく、人工知能としての機能を対話サーバに搭載し、ユーザとの対話を通じた学習によって新たな回答メッセージを作成し、これを音声合成サーバで音声情報に変換した後に通信端末１００へ送信してもよい。すなわち、ユーザとの対話から新たに結論を獲得する機能、およびユーザとの対話履歴から学習する機能を対話サーバに付与し、これらの機能を利用して回答メッセージを作成するようにしてもよい。 A fixed message created by the operator may be stored as a reply message in the database, and the dialog server can select a reply message from the stored messages. However, the information providing method according to the present embodiment is not limited to this. In addition to selecting an answer message, a function as an artificial intelligence is installed in the dialogue server, and a new answer message is created by learning through dialogue with the user. This may be transmitted to the communication terminal 100 after being converted into voice information by the voice synthesis server. That is, a function for acquiring a new conclusion from a dialog with the user and a function for learning from a dialog history with the user may be given to the dialog server, and an answer message may be created using these functions.

例えば、ユーザと仮想オペレータの対話を通じ、ユーザが満足する最終的な回答メッセージが得られた場合、この時の一連の対話を一つのコンポーネントとして格納する。このコンポーネントに含まれるユーザの音声情報を形態素解析し、最終的な回答メッセージが誘導されるに至ったキーワードを抽出する。同時にキーワードに対して重みづけを行うとともにキーワード同士の関連性（文法的関連性や位置的関連性）をパラメータ化し、これらの情報を最終的に選択された回答メッセージに割り付ける。このようなプロセスを蓄積することにより、それぞれの回答メッセージを最終的に必要とするユーザが与えるであろうキーワードとその関連性を学習する。したがって、新たな対話が開始されたとき、ユーザの音声情報から抽出されるキーワードとその関連性を解析し、その解析結果に基づいて過去に選択された回答メッセージから最も適合するものを選択することができる。その結果、同様の、あるいは類似する質問に対して、常に同じ最適な回答を選択することができる。また、必要に応じて複数の回答メッセージを音声合成サーバで融合して新たなメッセージを生成し、ユーザに提供することも可能となる。 For example, when a final reply message satisfying the user is obtained through the dialogue between the user and the virtual operator, the series of dialogue at this time is stored as one component. The voice information of the user included in this component is morphologically analyzed, and the keyword that led to the final response message is extracted. At the same time, the keywords are weighted and the relevance (grammatical relevance and positional relevance) between the keywords is parameterized, and this information is finally assigned to the selected answer message. By accumulating such processes, the keyword and the relevance that the user who ultimately needs each answer message will give are learned. Therefore, when a new conversation is started, the keyword extracted from the user's voice information and its relation are analyzed, and the most suitable answer message selected in the past based on the analysis result is selected. Can do. As a result, the same optimal answer can always be selected for similar or similar questions. In addition, a plurality of response messages can be merged by the speech synthesis server as needed to generate a new message and provide it to the user.

ユーザは、再生された音声情報あるいは表示された文字情報によって質問に対する満足な回答が得られたと判断した場合にはアプリケーションを終了することができる（ステップ２３０）。一方、ユーザが十分な回答が得られなかったと判断した場合には、再度ユーザは音声入力、あるいはテキスト入力を行って同じ質問、あるいは類似する質問を入力し、回答を求めることができる（ステップ２３２）。この後のプロセスは上述したプロセスと同じである。 If the user determines that a satisfactory answer to the question has been obtained from the reproduced voice information or the displayed character information, the user can end the application (step 230). On the other hand, if the user determines that a sufficient answer has not been obtained, the user can input the same question or a similar question again by voice input or text input to obtain an answer (step 232). ). The subsequent processes are the same as those described above.

ステップ２２４において、ユーザの質問に対する回答メッセージが絞り込めない場合、あるいは数多くの回答メッセージを選択せざるを得ない場合、ユーザに対する質問（問合せメッセージ）を生成あるいは選択して通信端末１００に送信し、ユーザに対してさらなる情報提供を要請してもよい（ステップ２３４）。問合せメッセージは音声として、あるいはテキストとして、あるいはその両方として通信端末１００で再生、表示される。ユーザは問合せメッセージに対して音声、あるいは文字で回答することができる（ステップ２２２）。問合せメッセージの送信とその回答というステップを繰り返すことにより、ユーザと仮想オペレータの対話を通じて、ユーザが要望する回答や情報を事業者が提供しえる数多くの情報の中から見つけ出すことが可能となる。したがって、本発明の本実施形態に係る情報提供方法により、事業者が提供する多くの情報の中から「選択する」という煩雑な作業をユーザに強要することなく、対話を通じ、必要とする回答や情報へユーザをリアルタイムに「案内、誘導する」ことができ、新しいユーザ体験を提供することができる。 In step 224, if the answer message for the user's question cannot be narrowed down or if a large number of answer messages must be selected, a question (inquiry message) for the user is generated or selected and transmitted to the communication terminal 100; The user may be requested to provide further information (step 234). The inquiry message is reproduced and displayed on the communication terminal 100 as voice, text, or both. The user can answer the inquiry message by voice or text (step 222). By repeating the steps of sending the inquiry message and answering it, it becomes possible to find out the answer and information desired by the user from among a lot of information that can be provided by the operator through the dialogue between the user and the virtual operator. Therefore, the information providing method according to the present embodiment of the present invention does not require the user to perform a complicated task of “selecting” from a lot of information provided by the operator, and through the dialogue, It can “guide and guide” users to information in real time and provide a new user experience.

なお図２には示していないが、対話サーバのデータベースにはユーザと雑談を行うためのメッセージが格納されていてもよく、ユーザは仮想オペレータと雑談を行うようにプログラムが構成されていてもよい。これにより、ユーザの緊張感を解きほぐすことができ、より親密なユーザ―事業者関係を構築することができる。さらに、通信ネットワーク上でのリアルタイムコミュニケーション（チャット）が可能なサイトに移行するためのアイコンを表示するようにプログラムが構成されていてもよい。これにより、例えば事業者から提供された有益な情報を第三者と速やかに共有することも可能となり、ユーザに対するサービス向上が図れるのみならず、事業者も効果的な宣伝効果を獲得することができる。 Although not shown in FIG. 2, a message for chatting with the user may be stored in the database of the dialogue server, and the program may be configured so that the user can chat with the virtual operator. . As a result, the tension of the user can be relieved and a more intimate user-business relationship can be established. Furthermore, the program may be configured to display an icon for shifting to a site capable of real-time communication (chat) on the communication network. This makes it possible to quickly share, for example, useful information provided by the operator with a third party, which not only improves the service for the user, but also allows the operator to obtain an effective advertising effect. it can.

また、上述した種々のメッセージを音声出力部１１２で再生する際、仮想オペレータが実際に喋っているように見せるため、仮想オペレータの口や目を動かすようにしてもよい。あるいは手を動かすなど、説明を行うジェスチャーをするようにしてもよい。すなわち、メッセージが再生されていない時と比較し、仮想オペレータの一部がより大きな動作を行うようにプログラムを構成してもよい。 Further, when the various messages described above are reproduced by the voice output unit 112, the virtual operator's mouth and eyes may be moved in order to make it appear that the virtual operator is actually speaking. Or you may make it carry out the gesture which explains, such as moving a hand. That is, the program may be configured such that a part of the virtual operator performs a larger operation compared to when the message is not reproduced.

上述したように本実施形態では、仮想オペレータとユーザとの対話を通し、ユーザは様々な質問を事業者に伝えることができる。このため、使用環境が許容する限り、ユーザはキーボードやタッチパネルを用いる文字入力操作から解放され、より簡便に種々の質問を事業者に伝達することが可能となる。また、音声合成技術を活用して仮想オペレータと対話するため、ユーザは無機質な対話を強制されることなく、自然な対話を行うことができる。一方ユーザは、事業者あるいはそのオペレータとの対話ではなく、あくまで仮想オペレータとの対話によって回答を得るため、ユーザは必要以上に気を遣うことなく様々な質問を投げかけることができ、新しいユーザ体験を享受することができる。 As described above, in the present embodiment, the user can communicate various questions to the operator through the dialogue between the virtual operator and the user. Therefore, as long as the usage environment permits, the user is freed from the character input operation using the keyboard and the touch panel, and can easily transmit various questions to the business operator. In addition, since the user interacts with the virtual operator using the speech synthesis technology, the user can perform a natural conversation without being forced to perform an inorganic conversation. On the other hand, users get answers through dialogues with virtual operators, not dialogues with operators or their operators, so users can ask various questions without worrying more than necessary, creating new user experiences. You can enjoy it.

また上述したステップでは、事業者のオペレータはユーザとの対話に直接関与しない。したがってユーザは、オペレータの知識や経験、能力に左右されることなく、均一かつ豊富な情報をリアルタイムで獲得することができる。また事業者はこの段階では人的資源が不要であるため、ユーザ対応における人的資源の消耗を避けることができ、業務の効率化と生産性の向上を実現することができる。 In the steps described above, the operator of the business operator is not directly involved in the dialog with the user. Therefore, the user can acquire uniform and abundant information in real time without being influenced by the knowledge, experience, and ability of the operator. In addition, since the operator does not need human resources at this stage, it is possible to avoid the consumption of human resources in correspondence with the user, and it is possible to improve the efficiency of business and the improvement of productivity.

［４．通話モードへの移行］
多くのユーザが頻繁に行う質問（ＦＡＱ）に対しては、あらかじめ回答メッセージを周到に準備することができ、高いユーザ満足度を得ることが可能である。しかしながらユーザの質問の意図に対し、対話サーバ内のデータベースに蓄積されたメッセージデータだけでは対応できない場合がある。例えばＦＡＱ以外の質問や、複雑な相談内容、あるいは機密性の高い情報が質問に含まれる場合、上述した方法では対応できない場合がある。そのような場合、電話回線を利用して直接口頭で事業者あるいはそのオペレータと通話するモードに切り替えることができる。 [4. Switch to call mode]
Answers frequently asked by many users (FAQ) can be prepared in advance, and high user satisfaction can be obtained. However, there are cases where the intention of the user's question cannot be dealt with only by the message data stored in the database in the dialogue server. For example, in the case where a question other than FAQ, complicated consultation contents, or highly confidential information is included in the question, the method described above may not be available. In such a case, it is possible to switch to a mode in which a telephone call is used to directly talk to the operator or its operator.

例えば図４に示すように、サーバからの回答メッセージを音声情報として再生した後（ステップ２２４、２２６）、通話モードへ切り替えるためのアイコン（電話アイコン、図３では図示せず）を通信端末１００の表示部１０６に表示するようにプログラムを構成してもよい（ステップ４０４）。プログラムの命令に従い、ユーザが電話アイコンを操作することで、通信端末１００はＡＰＩを介して通信サービス連携サーバに接続される。通信サービス連携サーバは通信ネットワークにユーザからの電話を接続させ、事業者、あるいはそのオペレータとの電話回線による通話を可能にする（ステップ４０６）。 For example, as shown in FIG. 4, after the reply message from the server is reproduced as voice information (steps 224 and 226), an icon (telephone icon, not shown in FIG. 3) for switching to the call mode is displayed on the communication terminal 100. The program may be configured to display on the display unit 106 (step 404). The communication terminal 100 is connected to the communication service cooperation server via the API when the user operates the telephone icon according to the command of the program. The communication service cooperation server connects the telephone from the user to the communication network, and enables a telephone call with the operator or the operator via the telephone line (step 406).

通話モードへ切り替えるための電話アイコンは、アプリケーション起動の段階から表示部１０６に表示するようにしてもよい。しかし事業者の人的資源によっては、ユーザによる通話モードへの移行をある程度制約する必要がある。この場合には、電話アイコンはアプリケーションの起動時には表示せず、対話サーバが回答メッセージを選択、送信し（ステップ２２４）、通信端末１００が少なくとも一度回答を再生、表示（ステップ２２６）した後に電話アイコンを表示するよう、プログラムを構成してもよい。これにより、オペレータの負担が低減され、業務の効率化が図れる。 The telephone icon for switching to the call mode may be displayed on the display unit 106 from the application activation stage. However, depending on the operator's human resources, it is necessary to restrict the user's transition to the call mode to some extent. In this case, the telephone icon is not displayed when the application is started, and the dialog server selects and transmits an answer message (step 224), and the communication terminal 100 reproduces and displays the answer at least once (step 226). The program may be configured to display. As a result, the burden on the operator is reduced, and the efficiency of work can be improved.

あるいは、ユーザが通話モードへの切り替えを自ら行わなくても、対話サーバ内の回答メッセージではユーザの質問に対応できないと対話サーバが判断した後に電話アイコンを表示するよう、プログラムを構成してもよい。この場合、対話サーバはユーザに通話モードに切り替える旨を通知する（ステップ４００）と同時に通信サービス連携サーバにアクセスし、ユーザと事業者、あるいはそのオペレータとの電話回線を接続してもよい（ステップ４０２）。 Alternatively, the program may be configured to display a telephone icon after the dialog server determines that the answer message in the dialog server cannot respond to the user's question even if the user does not switch to the call mode. . In this case, the dialog server notifies the user that the call mode is to be switched (step 400), and at the same time, accesses the communication service cooperation server to connect the telephone line between the user and the operator or the operator (step). 402).

あるいは、ユーザの質問の解析の結果、ユーザと事業者との対話が必須であると対話サーバが判断した後に電話アイコンを表示部１０６に表示し、ユーザに通話モードへの移行を促してもよい。 Alternatively, as a result of analyzing the user's question, a telephone icon may be displayed on the display unit 106 after the dialog server determines that a dialog between the user and the operator is essential, and the user may be prompted to switch to the call mode. .

通話モードへ移行する際、ユーザとサーバ間との対話内容や履歴（ログ）を格納し、その情報を担当するオペレータに表示してもよい（ステップ４０６）。これにより、オペレータはより効率よくユーザの質問に対する回答を検討することができる。また、ユーザもすでに行った会話を再度繰り返す必要性が低くなる。このため、事業者はより高品質で手厚いサービスを迅速にユーザに提供することができる。 When the mode is shifted to the call mode, the contents and history (log) of dialogue between the user and the server may be stored and displayed to the operator in charge (step 406). Thereby, the operator can consider the answer to the user's question more efficiently. In addition, it is less necessary for the user to repeat the conversation that has already been performed. For this reason, the business operator can promptly provide the user with a higher quality and more profound service.

通信サービス連携サーバが通信ネットワークにユーザからの電話を接続させる際、対話内容を解析し（ステップ４０８）、その結果に基づいて最も適切なオペレータを選択し（ステップ４１０）、ユーザと選択されたオペレータと間の電話通信を確立してもよい（ステップ４１２）。これにより、ユーザは質問に対して最も適切なオペレータと通話することが可能となる。 When the communication service cooperation server connects the telephone from the user to the communication network, the contents of the dialogue are analyzed (step 408), the most appropriate operator is selected based on the result (step 410), and the user and the selected operator are selected. A telephone communication may be established between and (step 412). This allows the user to talk to the most appropriate operator for the question.

［５．地図モードへの移行］
上述したように、ユーザの通信端末１００の表示部１０６上に、事業者の店舗にアクセスするためのアイコン３０２を表示するようにプログラムを構成してもよい（図３参照）。ユーザがこのアイコン３０２を操作することにより、地図モードへの移行が可能となる。具体的なフローを図５に示す。なお図５に示す各ステップは一例であり、例えばこれらのステップの順番を適宜入れ替えてもよい。 [5. Transition to map mode]
As described above, the program may be configured to display the icon 302 for accessing the operator's store on the display unit 106 of the user's communication terminal 100 (see FIG. 3). When the user operates this icon 302, it is possible to shift to the map mode. A specific flow is shown in FIG. Each step shown in FIG. 5 is an example, and for example, the order of these steps may be appropriately changed.

ユーザによる操作（ステップ５００）によって地図モードが起動されると（ステップ５０２）、プログラムの命令に従い、通信端末１００は内蔵されたＧＰＳ（全地球測位システム）機能を利用して位置座標を取り込む。あるいは、ユーザ自身が住所や付近のランドマーク情報を、音声情報として音声入力部１１０に入力することで、あるいは文字情報として入力部１０８を用いて入力することで、通信端末１００の位置座標を取り込むようにプログラムが構成されていてもよい。これより、通信端末の位置が特定される（ステップ５０４）。通信端末１００がＧＰＳ機能を搭載していない場合、あるいはＧＰＳ機能が利用できないような環境下では、通信端末１００は付近の通信サービス事業者の基地局と通信を行って通信端末１００が位置するエリアの特定が行われるよう、プログラムが構成されていてもよい。この場合、位置情報特定サーバやＣＴＩ（ＣｏｍｐｕｔｅｒＴｅｌｅｐｈｏｎｙＩｎｔｅｇｒａｔｉｏｎ）サーバなどを利用して通信端末１００の位置の特定が行われる。位置特定サーバやＣＴＩサーバは、情報処理サービス提供者や通信サービス事業者が管理、提供するサーバ内に構築されてもよい。 When the map mode is activated by the user's operation (step 500) (step 502), the communication terminal 100 takes in the position coordinates using a built-in GPS (global positioning system) function according to the instructions of the program. Alternatively, the user himself / herself inputs an address or nearby landmark information into the voice input unit 110 as voice information or by using the input unit 108 as character information, thereby capturing the position coordinates of the communication terminal 100. The program may be configured as described above. Thereby, the position of the communication terminal is specified (step 504). When the communication terminal 100 is not equipped with a GPS function or in an environment where the GPS function cannot be used, the communication terminal 100 communicates with a base station of a nearby communication service provider to locate the communication terminal 100 The program may be configured so that the identification is performed. In this case, the position of the communication terminal 100 is specified using a position information specifying server, a CTI (Computer Telephony Integration) server, or the like. The location specifying server and the CTI server may be constructed in a server managed and provided by an information processing service provider or a communication service provider.

通信端末１００の位置が特定された後、プログラムの命令に従って通信端末１００は、地図情報を提供する事業者などが運営する情報配信サーバにアクセスして地図データを取得し、表示部１０６上に表示する（ステップ５０６）。プログラムはさらに、サーバ内に格納された店舗の位置情報や、店舗が提供するサービス内容を通信端末１００に取得させ（ステップ５０８）、この情報に基づき、表示部１０６上に店舗を表示させる（ステップ５１０）。例えば図６に示すように、通信端末１００付近の地図を表示部１０６上に表示し、かつ店舗を表すアイコン６０２、６０４を表示する。 After the position of the communication terminal 100 is specified, the communication terminal 100 accesses the information distribution server operated by the provider providing the map information and acquires the map data according to the instructions of the program, and displays it on the display unit 106 (Step 506). The program further causes the communication terminal 100 to acquire the location information of the store stored in the server and the service content provided by the store (step 508), and displays the store on the display unit 106 based on this information (step 508). 510). For example, as shown in FIG. 6, a map near the communication terminal 100 is displayed on the display unit 106, and icons 602 and 604 representing stores are displayed.

店舗の位置や、店舗が提供する具体的なサービス内容、機能、営業時間などのステーショナリーな情報だけでなく、混雑状況や資源量（商品の在庫やユーザに割り当て可能な人的資源量など）といった時間とともに逐次変動するパラメータなどを事業者がサーバ内のデータベースに逐次格納し、これらの情報やパラメータを通信端末１００が取得して表示部１０６上に表示するよう、プログラムを構成してもよい（ステップ５０８、５１０）。この場合、店舗のアイコンを操作することで、各店舗に関する上記情報やパラメータを表示するようにしてもよい。これによりユーザは、事業者が管理する資源量とその変動を容易に把握することができる。 Not only stationery, stationery information such as specific service contents, functions, and business hours provided by the store, but also the congestion status and the amount of resources (product inventory, amount of human resources that can be allocated to users, etc.) The program may be configured so that the business operator sequentially stores parameters that change sequentially with time in a database in the server, and the communication terminal 100 acquires and displays these information and parameters on the display unit 106 ( Steps 508, 510). In this case, you may make it display the said information and parameter regarding each store by operating the icon of a store. As a result, the user can easily grasp the amount of resources managed by the business operator and the variation thereof.

さらに本実施形態の情報提供方法では、ユーザの現時点の位置からユーザがアクセスを希望する店舗までの道順を表示してもよい。例えば通信端末１００が、ユーザの位置座標と目的とする店舗の位置情報を情報配信サーバへ送信し、適切な経路の探索を要求し（ステップ５１２）、情報配信サーバ内で探索された経路を受信し、その経路を表示部１０６上に表示する（ステップ５１４）ようにしてもよい。経路の表示は、例えば図７に示したように、ユーザの現在地と目的とする店舗までの経路をアイコンを用いて行うことができる。ここでは複数の円形のアイコンを用いて経路を示している。また、目的地までの距離や所要時間などを表示するようにしてもよい。 Furthermore, in the information providing method of the present embodiment, a route from the current position of the user to the store that the user desires to access may be displayed. For example, the communication terminal 100 transmits the position coordinates of the user and the position information of the target store to the information distribution server, requests a search for an appropriate route (step 512), and receives the route searched in the information distribution server. Then, the route may be displayed on the display unit 106 (step 514). For example, as shown in FIG. 7, the route can be displayed by using icons to indicate the current location of the user and the route to the target store. Here, the route is shown using a plurality of circular icons. In addition, the distance to the destination and the required time may be displayed.

ここではユーザがアイコン３０２を操作することで（ステップ５００）地図モードへ切り替わる例を述べたが、地図モードへの切り替えはユーザと仮想オペレータの対話に基づいて行ってもよい。例えば、ユーザと仮想オペレータ間の対話を解析し、ユーザの質問に対する最終的な回答が店舗への案内を含むと対話サーバが判断した場合（ステップ５１６）、対話サーバはユーザの質問に対応できる適切な店舗を選択し、その店舗への案内を通知するとともに（ステップ５１８）地図モードを起動し（ステップ５０２）、表示部１０６上にユーザの現在地と該当する店舗の位置情報、ならびに店舗への経路などを表示してもよい。 Here, an example in which the user switches to the map mode by operating the icon 302 (step 500) has been described, but the switch to the map mode may be performed based on the dialogue between the user and the virtual operator. For example, if the dialog server analyzes the dialog between the user and the virtual operator and determines that the final answer to the user's question includes a guide to the store (step 516), the dialog server can respond appropriately to the user's question. An appropriate store is selected, a guide to the store is notified (step 518), the map mode is activated (step 502), the current location of the user and the location information of the corresponding store on the display unit 106, and the route to the store Etc. may be displayed.

上述したように本実施形態の情報提供方法では、ユーザと仮想オペレータの対話を通じ、ユーザの求めるサービスに適合した店舗を絞り込むことができ、ユーザに対して質の高いサービスを提供することができる。つまり、ユーザの質問に対して複数の回答を提供してユーザに選択させるのではなく、ユーザと仮想オペレータの対話を通じ、ユーザに適した回答へユーザを導くことができる。 As described above, according to the information providing method of the present embodiment, it is possible to narrow down the stores suitable for the service requested by the user through the dialogue between the user and the virtual operator, and to provide a high-quality service to the user. That is, instead of providing a plurality of answers to the user's question and allowing the user to select, the user can be guided to an answer suitable for the user through the dialogue between the user and the virtual operator.

［６．ウェブチャットモードへの移行］
ユーザの周囲の環境や通信端末の使用状況、あるいはユーザ自身の希望により、音声入力や音声出力を行わず、テキスト入力、出力のみで対話を行い、これを通じてユーザの求める情報を提供するモード（以下、ウェブチャットモードと記す）へ移行できるようにしてもよい。例えばユーザ自身が発声を禁止されている場合、通信端末からの音声発信が禁じられている場合、喧噪な環境に起因してユーザの音声と周囲の雑音が区別できない場合、ユーザが通信端末の音声出力部１１２を他の用途（音楽や映像の視聴など）に用いている場合などには、自動的に、あるいはユーザが自主的にウェブチャットモードへ移行できるようにプログラムを構成してもよい。ウェブチャットモードでは、ユーザが入力した質問、および回答メッセージの両者が時間順に表示部１０６に表示される。この時のフローの一例を図８に示す。 [6. Transition to web chat mode]
A mode that provides dialogue with text input and output without providing voice input or output depending on the user's surrounding environment, communication terminal usage conditions, or the user's own preferences, and providing information requested by the user through this (referred to below) It may be possible to shift to a web chat mode. For example, when the user himself / herself is prohibited from uttering, when voice transmission from the communication terminal is prohibited, when the user's voice and ambient noise cannot be distinguished due to a busy environment, the user can hear the voice of the communication terminal. When the output unit 112 is used for other purposes (such as music or video viewing), the program may be configured so that the user can automatically shift to the web chat mode. In the web chat mode, both the question entered by the user and the answer message are displayed on the display unit 106 in time order. An example of the flow at this time is shown in FIG.

例えば通信端末１００の起動後（ステップ２００）、仮想オペレータが表示された画面（図３参照）上にウェブチャットモードへ移行するためのアイコンを表示し（ステップ７００）、ユーザが任意に操作できるようにプログラムを構成してもよい。このアイコンをユーザが操作することで（ステップ７０２）仮想オペレータが表示されていた画面が切り替わり、対話形式のテキストが表示される画面（ウェブチャット画面）に移行するようにすればよい（ステップ７０４）。この後、ユーザはテキスト入力することで（ステップ７０６）で対話を開始することができる。 For example, after the communication terminal 100 is activated (step 200), an icon for shifting to the web chat mode is displayed on the screen (see FIG. 3) on which the virtual operator is displayed (step 700) so that the user can arbitrarily operate it. A program may be configured. When the user operates this icon (step 702), the screen on which the virtual operator is displayed is switched to a screen (web chat screen) on which interactive text is displayed (step 704). . Thereafter, the user can start the dialogue by inputting text (step 706).

この場合、音声合成サーバから音声情報を送らない、あるいは音声情報を受信した通信端末１００は音声情報を再生せず、表示部１０６に音声情報のテキストのみを表示すればよい。あるいは対話サーバで選択、あるいは生成された回答メッセージを文字情報のまま通信端末１００へ送信してもよい。 In this case, the communication terminal 100 that does not send the voice information from the voice synthesis server or receives the voice information does not reproduce the voice information, and only displays the text of the voice information on the display unit 106. Alternatively, the reply message selected or generated by the dialog server may be transmitted to the communication terminal 100 as the character information.

あるいはアプリケーションの起動後（ステップ２００）、音声出力部１１２の使用状況を確認するようプログラムが命令し(ステップ７１０)、音声出力部１１２が他の用途で使用されていると判断された場合、ウェブチャットモードへの移行の可否を表示部１０６に表示し（ステップ７１２）、ユーザによるモード移行を選択させてもよい。 Alternatively, after the application is started (step 200), the program instructs to check the usage status of the audio output unit 112 (step 710), and if it is determined that the audio output unit 112 is used for other purposes, the web Whether or not to switch to the chat mode is displayed on the display unit 106 (step 712), and the mode shift by the user may be selected.

あるいは何らかの物理量、例えば通信端末１００の周囲の環境パラメータの変化に応じてウェブチャットモードへ切り替えるようにプログラムを構成してもよい。例えば通信端末１００周辺の音量を測定し、その音量データを対話サーバなどに送信し（ステップ７２０）、ある一定レベル以上であればウェブチャットモードへ移行する旨を表示部１０６に表示し（ステップ７２２）、ウェブチャットモードへ自動的に移行する（ステップ７０４）ようにしてもよい。周囲の音量測定は、例えばアプリケーションを起動した後、ユーザが声入力部１１０をアクティブにする前に行うことができる。ウェブチャットモードへ移行する閾値としては、例えば９０デシベル以上、あるいは１１０デシベル以上とすればよい。また、ウェブチャットモードへの移行の判断は、通信端末１００側で行ってもよく、対話サーバ側で行ってもよい。 Alternatively, the program may be configured to switch to the web chat mode according to a change in some physical quantity, for example, an environmental parameter around the communication terminal 100. For example, the sound volume around the communication terminal 100 is measured, and the sound volume data is transmitted to a dialogue server or the like (step 720), and if it is above a certain level, the display unit 106 displays that the mode is shifted to the web chat mode (step 722). ), The mode may be automatically shifted to the web chat mode (step 704). The surrounding sound volume can be measured, for example, after the application is started and before the user activates the voice input unit 110. The threshold value for shifting to the web chat mode may be set to 90 dB or more, or 110 dB or more, for example. Further, the determination of the transition to the web chat mode may be performed on the communication terminal 100 side or the conversation server side.

図示しないが、通信端末１００の周囲の雑音とユーザが入力した音声情報が区別できない場合、ウェブチャットモードへ移行するようにプログラムを構成してもよい。例えば音声認識サーバが音声情報の解析を行う際に音声区間検出を行ってもユーザの音声情報が含まれる区間を検出できない場合、あるいは検出区間が一定の時間に満たない場合、あるいは入力された音声情報から雑音を十分に除去できずユーザの音声（質問）を検出、解析できない場合、ウェブチャットモードへの移行を案内すると同時に、ウェブチャットモードへ移行するためのアイコンを表示部１０６に表示してもよい。あるいは雑音の少ない環境への移動をユーザに要請する回答メッセージを通信端末１００に送信し、併せてアイコンを表示してもよい。 Although not shown, when the noise around the communication terminal 100 and the voice information input by the user cannot be distinguished, the program may be configured to shift to the web chat mode. For example, when the voice recognition server analyzes the voice information and detects the voice section, the section including the user's voice information cannot be detected, or the detected section does not reach a certain time, or the input voice When noise cannot be sufficiently removed from information and a user's voice (question) cannot be detected and analyzed, an icon for shifting to the web chat mode is displayed on the display unit 106 at the same time as guiding the shift to the web chat mode. Also good. Alternatively, an answer message requesting the user to move to an environment with less noise may be transmitted to the communication terminal 100 and an icon may be displayed.

また、対話内容に基づいてウェブチャットモードへ移行するようにプログラムを構成してもよい。たとえばユーザの音声情報に機密性の高い個人情報が含まれていると対話サーバが判断した場合、あるいは対話サーバが選択、生成する回答メッセージ内にそのような個人情報を含ませる必要がある場合、ウェブチャットモードへの移行を案内するようにしてもよい。
［７．他のモードへの移行］
モードの移行は、上述した通話モードや地図モード、ウェブチャットモードへの移行に限られない。例えばユーザと仮想オペレータ間の対話の解析に基づき、あるいはユーザの判断に基づき、事業者が通信ネットワークを経由して提供する種々のサービス（以下、ネットワークサービスと記す）へアクセスするように、プログラムが構成されていてもよい。ユーザ自らの判断でネットワークサービスへのアクセスを許容する場合、例えば図３で示した表示上に適当なアイコンを表示させ、これをユーザが操作することでネットワークサービスへ能動的にアクセスできるようにしてもよい。ネットワークサービスとしては、例えば金融業界の場合、各銀行が提供するインターネットバンキングサービスなどが挙げられる。 Moreover, you may comprise a program so that it may transfer to web chat mode based on the content of interaction. For example, if the conversation server determines that the user's voice information contains sensitive personal information, or if the conversation server needs to include such personal information in an answer message that it selects and generates, The transition to the web chat mode may be guided.
[7. Transition to other modes]
The mode shift is not limited to the above-described call mode, map mode, and web chat mode. For example, based on analysis of interaction between the user and the virtual operator, or based on the user's judgment, the program can access various services (hereinafter referred to as network service) provided by the operator via the communication network. It may be configured. When allowing access to the network service at the user's own discretion, for example, an appropriate icon is displayed on the display shown in FIG. 3, and the user can actively access the network service by operating the icon. Also good. Examples of the network service include an internet banking service provided by each bank in the case of the financial industry.

逆に、対話サーバが選択した最適の回答が、ユーザによるネットワークサービスの利用である場合、対話サーバからの回答メッセージを通信端末１００へ送信するとともにネットワークサービスへリンクするアイコンを表示部１０６上に表示してもよい。 On the contrary, when the optimum answer selected by the dialog server is the use of the network service by the user, an answer message from the dialog server is transmitted to the communication terminal 100 and an icon linked to the network service is displayed on the display unit 106. May be.

上述したように、本実施形態の情報提供方法では、ある条件が満たされた場合、あるいは何らかの物理量、例えば通信端末１００の周囲の騒音レベルや雑音の周波数などが変化した場合、あるいはユーザが希望した場合、様々なモードへ移行することができる。モードの移行はユーザが自主的に行うこともでき、自動的に行うことも可能である。あるいは対話を通じてユーザにモードの意向を促すこともできる。このように、起動時に仮想オペレータが表示される最上位レイヤー（アプリケーションレイヤー）から、種々の下位レイヤーへユーザインタフェースが切り替わり、ユーザの質問に適合する情報や回答メッセージを提供することができる。 As described above, in the information providing method of the present embodiment, when a certain condition is satisfied, or when some physical quantity, for example, a noise level or a noise frequency around the communication terminal 100 changes, or a user desires In this case, it is possible to shift to various modes. The mode transition can be performed voluntarily by the user or automatically. Alternatively, it is possible to prompt the user for the mode through dialogue. As described above, the user interface is switched from the uppermost layer (application layer) where the virtual operator is displayed at the time of activation to various lower layers, and information and answer messages suitable for the user's question can be provided.

＜第２実施形態＞
本実施形態では、第１実施形態で述べた情報提供方法を実現するためのシステム構成に関し、図面を参照して説明する。通信端末１００のシステム構成は第１実施形態で述べたため（図１参照）、ここでは割愛する。 Second Embodiment
In the present embodiment, a system configuration for realizing the information providing method described in the first embodiment will be described with reference to the drawings. Since the system configuration of the communication terminal 100 has been described in the first embodiment (see FIG. 1), it is omitted here.

なお、本実施形態で述べるシステム構成は一つの例であり、本発明は以下に示す実施形態に限定されるものではない。以下で説明する各種サーバの機能や要素は、他のサーバと重複していてもよく、例えばデータベースなどは複数のサーバによって管理、使用されることもある。また、各サーバを制御するユニット（コントローラなど）は、複数のサーバによって共有され、これら複数のサーバを制御するように構成されてもよい。さらに、一つのサーバは別のサーバの構成要素である場合もあり、この場合、上記一つのサーバは別のサーバの一部分と認識することができる。例えばある機能を有するサーバＡがサーバＢの構成要素の一つである場合、サーバＡは、サーバＢを含むシステムにおける一つの機能部と認識、記述することができる。 The system configuration described in the present embodiment is an example, and the present invention is not limited to the embodiment described below. Functions and elements of various servers described below may be duplicated with other servers. For example, a database may be managed and used by a plurality of servers. A unit (such as a controller) that controls each server may be shared by a plurality of servers and configured to control the plurality of servers. Furthermore, one server may be a component of another server. In this case, the one server can be recognized as a part of another server. For example, when the server A having a certain function is one of the components of the server B, the server A can be recognized and described as one functional unit in the system including the server B.

［１．音声認識］
ユーザが入力する音声は通信端末１００の音声入力部１１０で取り込まれ、音声情報として通信部１１４へ伝達される（図１参照）。通信部１１４は通信ネットワークを経由し、図９に示す音声認識サーバ８００のインターフェースとなるＡＰＩ８１０へ音声情報を送信する。送信された音声情報は、音声認識サーバ８００内の音声認識エンジン８２０において解析される。 [1. voice recognition]
The voice input by the user is captured by the voice input unit 110 of the communication terminal 100 and transmitted to the communication unit 114 as voice information (see FIG. 1). The communication unit 114 transmits voice information to the API 810 serving as an interface of the voice recognition server 800 illustrated in FIG. 9 via the communication network. The transmitted voice information is analyzed by the voice recognition engine 820 in the voice recognition server 800.

図９に例示したように、音声認識エンジン８２０は例えば特徴量抽出部８３０、デコーダ８４０、テキスト出力部８５０を有している。特徴量抽出部８３０は送信された音声情報を分析し、特徴量パラメータを抽出する。デコーダ８４０は、言語モデル８４２、音響モデル８４４、語彙辞書８４６などのデータベースを参照しながら特徴量パラメータから最も優れた候補を計算し、認識結果として出力する。出力された認識結果はテキスト出力部８５０によってテキストデータへ変換、出力される。 As illustrated in FIG. 9, the speech recognition engine 820 includes, for example, a feature amount extraction unit 830, a decoder 840, and a text output unit 850. The feature quantity extraction unit 830 analyzes the transmitted voice information and extracts feature quantity parameters. The decoder 840 calculates the best candidate from the feature parameter while referring to databases such as the language model 842, the acoustic model 844, the vocabulary dictionary 846, and outputs the result as a recognition result. The output recognition result is converted into text data by the text output unit 850 and output.

このような音声認識システムを利用することで、ユーザが仮想オペレータに対して口語（話し言葉）で語り掛けた質問がテキストに変換される。 By using such a speech recognition system, a question spoken by the user in spoken language (spoken language) to the virtual operator is converted into text.

［２．音声合成］
ユーザの通信端末１００で再生される音声は、音声合成サーバ９００のインターフェースとなるＡＰＩ９１０から通信ネットワークを経由して通信端末１００に送信される（図１０参照）。音声合成サーバ９００は音声合成エンジン９２０を有しており、図１０に例示したように、音声合成エンジン９２０は例えばテキスト入力部９３０、言語解析部９４０、波形生成部９５０、送信部９６０を有している。 [2. Speech synthesis]
Voice reproduced by the user's communication terminal 100 is transmitted from the API 910 serving as an interface of the voice synthesis server 900 to the communication terminal 100 via the communication network (see FIG. 10). The speech synthesis server 900 includes a speech synthesis engine 920. As illustrated in FIG. 10, the speech synthesis engine 920 includes, for example, a text input unit 930, a language analysis unit 940, a waveform generation unit 950, and a transmission unit 960. ing.

テキスト入力部９３０では、後述する対話サーバで選択されたメッセージの文字情報が入力される。この文字情報は言語解析部９４０において言語解析され、データベースである言語辞書９４２を参照しながらアクセントの位置や読み情報などが解析される。波形生成部９５０では、実際に収録された音声から得られた波形データを用いて作成された音声合成データベース９４４を参照し、音声の波形を生成する。波形生成部９５０で生成された音声データは送信部９６０によって、ＡＩＰ９１０を介して通信端末１００へ送信される。なお、送信部９６０は音声データだけでなく、入力される文字情報を文字（テキスト）として送信することもできる。 In the text input unit 930, character information of a message selected by the dialogue server described later is input. This character information is subjected to language analysis in the language analysis unit 940, and the accent position and reading information are analyzed while referring to the language dictionary 942 which is a database. The waveform generation unit 950 refers to a speech synthesis database 944 created using waveform data obtained from actually recorded speech, and generates speech waveforms. The voice data generated by the waveform generation unit 950 is transmitted by the transmission unit 960 to the communication terminal 100 via the AIP 910. Note that the transmission unit 960 can transmit not only voice data but also input character information as characters (text).

このような音声合成システムを利用することで、仮想オペレータは人が話すのと同様の声でユーザと対話することができ、ユーザに対して親近感を抱かせることで事業者に対する距離感を減少させることができる。 By using such a speech synthesis system, the virtual operator can interact with the user in the same voice as a person speaks, reducing the sense of distance to the operator by making the user feel close Can be made.

［３．対話］
ユーザから送信された音声メッセージが音声認識サーバ８００によって解析されて文字情報に変換された後、対話サーバ１０００によって適切な回答が生成、選択される（図１１参照）。対話サーバ１０００はＡＰＩ１０１０の他、対話エンジン１０２０、事業者管理サーバ１０４０、管理サーバ１０５０、情報データベース１０６０などを有する（図１１参照）。 [3. Dialogue]
After the voice message transmitted from the user is analyzed by the voice recognition server 800 and converted into character information, an appropriate answer is generated and selected by the dialogue server 1000 (see FIG. 11). In addition to the API 1010, the dialogue server 1000 includes a dialogue engine 1020, a provider management server 1040, a management server 1050, an information database 1060, and the like (see FIG. 11).

対話エンジン１０２０は対話コントローラ１０２２、セレクタ１０２４、応答生成部１０２６などを含有することができる。対話コントローラ１０２２は対話エンジン１０２０を制御するものであり、ユーザの意図を把握したり、セレクタ１０２４に適切なメッセージを選択させる機能を有する。セレクタ１０２４は対話コントローラ１０２２が理解したユーザの意図に対して適切な回答メッセージあるいは問合せメッセージ（１０３２、１０３４、１０３６、１０３８）を絞り込み、選択する機能を有する。セレクタ１０２４は絞り込みエンジンとも呼ばれる。応答生成部１０２６は、選択されたメッセージに基づいてユーザに対する回答を作成する機能を有する。応答生成部１０２６は、複数のメッセージを融合して一つのメッセージを作成してもよい。 The dialogue engine 1020 can include a dialogue controller 1022, a selector 1024, a response generation unit 1026, and the like. The dialogue controller 1022 controls the dialogue engine 1020, and has a function of grasping the user's intention and causing the selector 1024 to select an appropriate message. The selector 1024 has a function of narrowing down and selecting an appropriate answer message or inquiry message (1032, 1034, 1036, 1038) for the user's intention understood by the dialog controller 1022. The selector 1024 is also called a refinement engine. The response generation unit 1026 has a function of creating an answer to the user based on the selected message. The response generation unit 1026 may create a single message by fusing a plurality of messages.

情報データベース１０６０は事業者のビジネスに関連する用語や知識などを格納することができ、対話エンジン１０２０によって参照される。事業者管理サーバ１０４０は、例えば店舗の機能や位置情報、提供できるサービスの内容などの情報や、店舗の混雑状況や商品の在庫状況といった時間とともに変動するパラメータを格納できるデータベース１０４２を含むことができる。データベース１０４２に、ユーザの生体情報の標準データなどの個人情報を格納してもよい。応答生成部１０２６は、選択した応答メッセージにデータベース１０４２に格納された個人情報を適宜融合して回答メッセージを生成してもよい。データベース１０４２にユーザ情報を格納する場合、事業者管理サーバは事業者によって管理されることが好ましい。 The information database 1060 can store terms and knowledge related to the business of the operator and is referred to by the dialogue engine 1020. The business management server 1040 can include a database 1042 that can store parameters such as store functions and location information, contents of services that can be provided, and parameters that vary with time, such as store congestion and product inventory. . The database 1042 may store personal information such as standard data of user biometric information. The response generation unit 1026 may generate an answer message by appropriately merging personal information stored in the database 1042 with the selected response message. In the case where user information is stored in the database 1042, the operator management server is preferably managed by the operator.

なお対話サーバは、ユーザと対話サーバ間の対話内容や履歴を格納する通信サービス連携サーバ１１００（後述）と接続されてもよい。あるいは、データベース１０４２内に上述した対話内容や履歴を格納するようシステムを構成してもよい。 The dialogue server may be connected to a communication service cooperation server 1100 (described later) that stores dialogue contents and history between the user and the dialogue server. Alternatively, the system may be configured to store the above-described conversation contents and history in the database 1042.

メッセージ（１０３２、１０３４、１０３６、１０３８）は、ユーザからの様々な質問に対応する回答メッセージあるいは問合せメッセージであり、文字情報として準備、管理することができる。例えばメッセージ１０３２は、ＦＡＱに対応する定型的な回答メッセージとすることができる。メッセージ１０３４は、例えば回答メッセージを絞り込めないとき、ユーザに対して質問するための問合せメッセージとすることができる。メッセージ１０３６は、ユーザと事業者との雑談に対応するためのメッセージとすることができる。メッセージ１０３２、１０３４、１０３６は、事業者管理サーバ１０４０上で編集することができる。メッセージ１０３８は、音声認識サーバ８００や対話サーバ１０００を管理、維持する事業者によって管理される管理サーバ１０５０によって編集することができ、ユーザの事業者の事業とは直接的関連性の低い対話のために用意される。 Messages (1032, 1034, 1036, 1038) are answer messages or inquiry messages corresponding to various questions from the user, and can be prepared and managed as character information. For example, the message 1032 can be a standard reply message corresponding to FAQ. The message 1034 can be, for example, an inquiry message for asking a question when the answer message cannot be narrowed down. The message 1036 can be a message for responding to a chat between the user and the operator. The messages 1032, 1034, and 1036 can be edited on the operator management server 1040. The message 1038 can be edited by the management server 1050 managed by the operator that manages and maintains the voice recognition server 800 and the dialogue server 1000, and is for a dialogue that is not directly related to the business of the user's operator. To be prepared.

問合せメッセージ１０３４を準備することで、多くの回答から選択させるという負担をユーザに課すことなく、ユーザが望む回答へユーザを導くことができる。 By preparing the inquiry message 1034, the user can be guided to an answer desired by the user without imposing a burden on the user to select from many answers.

雑談に対応するためのメッセージ１０３６を事業者が準備、編集することで、事業者の事業内容と関連性の薄い対話であっても、ユーザは種々の会話を仮想オペレータと行うことがでる。これによって事業者とより親近感のあるコミュニケーションを行うことが可能となる。 By preparing and editing a message 1036 for responding to a chat, the user can perform various conversations with the virtual operator even if the conversation is less relevant to the business content of the business. This makes it possible to communicate more closely with the operator.

［４．通話モード］
ユーザと仮想オペレータの間の対話から通話モードに移行する際には、通信端末１００はインターフェースであるＡＰＩ１１１０を介して、通信サービス連携サーバ１１００と接続される（図１２参照）。通信サービス連携サーバ１１００には、通信ステータス管理データベース１１３０、窓口データベース１１４０などが含まれる。 [4. Call mode]
When shifting from the dialogue between the user and the virtual operator to the call mode, the communication terminal 100 is connected to the communication service cooperation server 1100 via the API 1110 which is an interface (see FIG. 12). The communication service cooperation server 1100 includes a communication status management database 1130, a window database 1140, and the like.

通信ステータス管理データベース１１３０にはユーザと仮想オペレータの間の対話履歴や対話内容を格納することができ、この内容は事業者管理サーバ１０４０上で事業者またはそのオペレータが閲覧することができる。窓口データベース１１４０は、オペレータに関する情報、たとえばオペレータの所属する部署や業務範囲、知識や経験などの情報を格納することができる。ユーザと仮想オペレータの間の対話ログや対話内容をコントローラ１１２０が解析し、その結果に基づいてコントローラ１１２０は窓口データベース１１４０を参照し、最適なオペレータを選択することができる。なお、通信ステータス管理データベース１１３０や窓口データベース１１４０を通信サービス連携サーバ１１００内に設置せず、これらのサーバに格納される情報を事業者管理サーバ１０４０で管理してもよい。 The communication status management database 1130 can store a history of dialogue and contents of dialogue between the user and the virtual operator, and this content can be browsed by the operator or the operator on the operator management server 1040. The window database 1140 can store information related to the operator, for example, information such as a department to which the operator belongs, a business scope, knowledge and experience. The controller 1120 analyzes the dialogue log and dialogue contents between the user and the virtual operator, and based on the result, the controller 1120 can refer to the window database 1140 and select the optimum operator. Note that the communication status management database 1130 and the window database 1140 may not be installed in the communication service cooperation server 1100, but the information stored in these servers may be managed by the operator management server 1040.

通話モードへ移行する際には、ＡＰＩ１１１０を介し、電話通信サービスを提供する事業者の通信ネットワーク１１５０へ電話接続の要求を送信し、同時に接続先の電話番号を伝える。これにより、ユーザと事業者あるいはオペレータとの通話が可能となる。 When shifting to the call mode, a request for telephone connection is transmitted to the communication network 1150 of the provider providing the telephone communication service via the API 1110, and at the same time, the telephone number of the connection destination is transmitted. As a result, a call between the user and the operator or the operator becomes possible.

［５．地図モード］
地図モードへ移行する際のシステムは、図１３に示すように、例えば地図情報を提供する事業者などが運営する情報配信サーバ１２００、対話サーバ１０００、事業者管理サーバ１０４０などで構成することができる。 [5. Map mode]
As shown in FIG. 13, the system for shifting to the map mode can be configured by an information distribution server 1200, a dialogue server 1000, an operator management server 1040, etc. operated by an operator providing map information, for example. .

情報配信サーバ１２００には地図情報データベース１２３０が含まれている。ＡＰＩ１２１０を介して通信端末１００は位置情報を情報配信サーバ１２００に送信し、情報配信サーバ１２００から地図情報を受け取ることができる。事業者管理サーバ１０４０は、店舗の機能や位置情報、提供できるサービスの内容などの情報や、混雑状況や商品の在庫状況といった時間とともに変動するパラメータを格納できるデータベース１０４２を含むことができる。対話サーバ１０００はユーザからの質問を解析し、その結果に基づいてデータベース１０４２を参照しつつ、最適な店舗を選択することができる。通信端末１００はＡＰＩ１０１０を介して対話サーバ１０００と接続され、対話サーバ１０００によって選択された店舗とそれに係る情報とパラメータを取得する。プログラムの命令に従い、情報配信サーバ１２００から受け取った地図情報と、選択された店舗の位置情報が表示部１０６上に表示される。 The information distribution server 1200 includes a map information database 1230. The communication terminal 100 can transmit position information to the information distribution server 1200 and receive map information from the information distribution server 1200 via the API 1210. The business management server 1040 can include a database 1042 that can store information such as store functions and location information, the contents of services that can be provided, and parameters that vary with time, such as congestion status and product inventory status. The dialogue server 1000 can analyze a question from the user and select an optimal store while referring to the database 1042 based on the result. The communication terminal 100 is connected to the dialog server 1000 via the API 1010, and acquires the store selected by the dialog server 1000, information related thereto, and parameters. The map information received from the information distribution server 1200 and the location information of the selected store are displayed on the display unit 106 in accordance with the program instructions.

情報配信サーバ１２００は経路探索手段１２２０を有しており、経路探索手段１２２０には地図情報データベース１２３０に加え、例えば経路探索手段１２２０、探索用データベース１２４０、道路交通情報データベース１２５０などが含まれる。経路探索手段１２２０は、地図情報データベース１２３０、探索用データベース１２４０、道路交通情報データベース１２５０に格納されている情報を参照しつつ、ユーザと店舗までの最良の経路を探索、決定する機能を有する。 The information distribution server 1200 includes route search means 1220. The route search means 1220 includes, for example, a route search means 1220, a search database 1240, and a road traffic information database 1250 in addition to the map information database 1230. The route search means 1220 has a function of searching for and determining the best route to the user and the store while referring to information stored in the map information database 1230, the search database 1240, and the road traffic information database 1250.

本実施例では、事業者として銀行を例に挙げ、ユーザが日本円を米国ドルに両替するケースを想定し、図１４を参照しつつ説明を行う。 In the present embodiment, a bank is taken as an example of a business operator, and a case where a user exchanges Japanese yen into US dollars will be described with reference to FIG.

最初にユーザは、通信端末１００にインストールされた第１実施形態で示したプログラムを含むアプリケーションを起動する。その後アイコン３００（図３参照）を操作して音声入力部１１０をアクティブにし、仮想オペレータに対話する要領で質問を口頭で述べる。例えば、「日本円を両替したい」と述べればよい。発声が困難な状況であれば、テキスト入力によって質問を入力してもよい。プログラムの命令に従って通信端末１００の通信部１１４は、ユーザの音声情報を音声認識システムへ送信する（Ｓ１）。音声認識サーバ８００は音声情報を文字情報へ変換し、対話サーバ１０００へ伝達する（Ｓ２）。 First, the user starts an application including the program shown in the first embodiment installed in the communication terminal 100. Thereafter, the voice input unit 110 is activated by operating the icon 300 (see FIG. 3), and the question is verbally described in a manner of interacting with the virtual operator. For example, “I want to exchange Japanese yen”. If it is difficult to speak, a question may be input by text input. The communication unit 114 of the communication terminal 100 transmits the user's voice information to the voice recognition system according to the program command (S1). The voice recognition server 800 converts the voice information into character information and transmits it to the dialogue server 1000 (S2).

対話サーバ１０００は対話エンジン１０２０を用いて質問の意図を理解するが、例示した音声情報には、外貨の種類や両替額といった情報が含まれていないため、事業者はユーザに対して最善の情報を提供することができない。そこで対話サーバはさらなる情報をユーザから得るため、準備された問合せメッセージから最適な問合せメッセージを選択する、あるいは新たに作成し、これを音声合成サーバ９００へ伝達する（Ｓ３）。例えば、「外貨の種類を教えてください」、「両替する日本円の金額を教えてください」などといった問合せメッセージを選択もしくは作成する。 The dialogue server 1000 uses the dialogue engine 1020 to understand the intention of the question. However, since the exemplified voice information does not include information such as the type of foreign currency and the amount of exchange, the business operator provides the best information to the user. Can not provide. In order to obtain further information from the user, the dialogue server selects or newly creates an optimal inquiry message from the prepared inquiry messages, and transmits it to the speech synthesis server 900 (S3). For example, an inquiry message such as “Tell me the type of foreign currency” or “Tell me the amount of Japanese yen to exchange” is selected or created.

音声合成サーバ９００は上記問合せメッセージを言語解析して波形を生成し、音声情報として通信端末１００へ送信する（Ｓ４）。なお、音声情報と同時に問合せメッセージの文字情報を音声合成サーバ９００あるいは対話サーバ１０００から通信端末１００に送信してもよい。通信端末１００は音声出力部１１２を用いて音声情報を再生するとともに、表示部１０６上に問合せメッセージを表示する。 The speech synthesis server 900 performs a linguistic analysis on the inquiry message to generate a waveform, and transmits the waveform as speech information to the communication terminal 100 (S4). Note that the character information of the inquiry message may be transmitted from the speech synthesis server 900 or the dialogue server 1000 to the communication terminal 100 simultaneously with the speech information. The communication terminal 100 reproduces voice information using the voice output unit 112 and displays an inquiry message on the display unit 106.

ユーザは問合せメッセージに対する回答を、Ｓ１のステップと同様に行う。回答としては、例えば「米国ドルに両替したい」、「日本円は１０万円程度」などの回答が考えられる。通信端末１００はこの回答を音声認識サーバ８００に送信し（Ｓ５）、音声認識サーバ８００はユーザの音声を文字情報に変換して対話サーバ１０００に伝達する（Ｓ６）。 The user responds to the inquiry message in the same manner as in step S1. Possible answers include, for example, “I want to exchange for US dollars” and “Japanese yen is about 100,000 yen”. The communication terminal 100 transmits this answer to the voice recognition server 800 (S5), and the voice recognition server 800 converts the user's voice into character information and transmits it to the dialogue server 1000 (S6).

対話サーバ１０００はユーザからの回答を解析する。本実施例では日本円からＵＳドルへの両替であるため、ユーザは店舗へ直接アクセスする必要がある。このため対話サーバ１０００は、ユーザを店舗へ案内することが必須であると判断し、ユーザを店舗へ案内するための回答メッセージを生成、あるいは選択する。回答メッセージの例として、「お近くの当行支店にご案内いたします」のようなメッセージが挙げられる。その後回答メッセージの音声情報を音声合成サーバ９００へ伝達する（Ｓ７）。 The dialogue server 1000 analyzes the answer from the user. In this embodiment, since the exchange is from Japanese yen to US dollars, the user needs to access the store directly. For this reason, the dialogue server 1000 determines that it is essential to guide the user to the store, and generates or selects an answer message for guiding the user to the store. An example of the reply message is a message such as “I will guide you to a nearby branch office”. Thereafter, the voice information of the reply message is transmitted to the voice synthesis server 900 (S7).

この時、対話サーバ１０００は事業者管理サーバ１０４０にアクセスし（Ｓ８）、ユーザの要求を満たす店舗の候補、ならびに店舗の混雑状況や商品の在庫状況などのパラメータを取得する（Ｓ９）。これは、店舗によっては保管する外貨の種類や量が異なり、また、取引状況によっては在庫量が大幅に変動している可能性があるためである。これらの情報に基づいてユーザの要求に対して最適な店舗を決定し、その位置情報を通信端末１００に送信する。同時に、対話サーバ１０００は通信端末１００に対して地図モードへの移行要求を送信する（Ｓ１０）。 At this time, the dialogue server 1000 accesses the operator management server 1040 (S8), and acquires store candidates that satisfy the user's request, and parameters such as store congestion and product inventory (S9). This is because the type and amount of foreign currency to be stored differs depending on the store, and the inventory amount may fluctuate greatly depending on the transaction status. Based on these pieces of information, an optimal store for the user's request is determined, and the position information is transmitted to the communication terminal 100. At the same time, the dialogue server 1000 transmits a request for shifting to the map mode to the communication terminal 100 (S10).

音声合成サーバ９００は回答メッセージから音声情報を生成し、この音声情報と回答メッセージの文字情報を通信端末１００に送信する（Ｓ１１）。 The voice synthesis server 900 generates voice information from the reply message, and transmits the voice information and character information of the reply message to the communication terminal 100 (S11).

通信端末１００は内蔵されたＧＰＳ機能を利用して位置座標を取得し、通信端末１００の位置情報と受信した店舗の位置情報を情報配信サーバ１２００に送信し、付近の地図情報とともに経路探索を要求する（Ｓ１２）。情報配信サーバ１２００は通信端末１００の要求に応え、必要な地図情報と経路の探索結果を通信端末１００に送信する（Ｓ１３）。 The communication terminal 100 acquires position coordinates using the built-in GPS function, transmits the position information of the communication terminal 100 and the received position information of the store to the information distribution server 1200, and requests a route search together with nearby map information. (S12). In response to the request from the communication terminal 100, the information distribution server 1200 transmits necessary map information and a route search result to the communication terminal 100 (S13).

その後通信端末１００は、プログラムの命令に従い、表示部１０６上に付近の地図、目的とする店舗の位置、ならびに店舗までの経路を表示する。 Thereafter, the communication terminal 100 displays a map of the vicinity, the position of the target store, and the route to the store on the display unit 106 according to the instructions of the program.

このように、ユーザの質問に対して対話を通して対応することで、最適な回答をユーザに提供することができる。この時ユーザは、複数の回答から最適な回答を選択するという労力を払う必要がないため、最適な回答に速やかにたどり着くことができる。 Thus, by responding to the user's question through dialogue, an optimal answer can be provided to the user. At this time, the user does not need to pay the effort of selecting the optimum answer from a plurality of answers, and can quickly reach the optimum answer.

また、このような対話によるサービスはユーザが操作する通信端末１００を用いて達成できるため、ユーザは店舗に訪れて質問をする労力を払う必要もなく、また、店舗の営業時間や店舗での待ち時間などを考慮する必要もない。このように、本実施形態の情報提供方法やそれを実行するためのプログラムを用いることによって、ユーザは時間や場所を選ばずに種々の問題を解決することができる。 In addition, since the service based on such an interaction can be achieved by using the communication terminal 100 operated by the user, the user does not have to pay labor to visit the store and ask a question. There is no need to consider time. Thus, by using the information providing method of the present embodiment and the program for executing it, the user can solve various problems regardless of time and place.

またユーザは、事業者やそのオペレータの知識や経験、能力に制約されることなく、均一かつ豊富な情報を獲得することができる。例えば本実施例の事例では、各店舗における米国ドルの現金の在庫状況は刻一刻と変化するため、従来、ユーザと直接対応するオペレータによってはその変動を把握することができず、在庫が不十分な店舗を案内する可能性があった。しかしながら本実施形態の情報提供方法を実施することにより、全てのユーザに対して正確な情報を提供することができる。同時に事業者は、ユーザ対応における人的資源の消耗を回避することで業務の効率化と生産性の向上が達成できる。 In addition, the user can acquire uniform and abundant information without being restricted by the knowledge, experience, and ability of the business operator and its operator. For example, in the case of this example, the US dollar cash inventory status at each store changes from moment to moment, so it has been difficult to grasp the fluctuations by operators who directly correspond to the user, and the inventory is insufficient. There was a possibility to guide a special store. However, by implementing the information providing method of this embodiment, accurate information can be provided to all users. At the same time, business operators can achieve more efficient operations and improved productivity by avoiding the consumption of human resources in response to users.

本実施例では、実施例１と異なるケースを想定して説明を行う。具体的には、ユーザが銀行口座のキャッシュカードを紛失した例を、図１５を参照しつつ説明する。 In the present embodiment, a description will be given assuming a case different from the first embodiment. Specifically, an example in which the user has lost a bank account cash card will be described with reference to FIG.

実施例１と同様に、最初にユーザは、通信端末１００にインストールされた第１実施形態で示したプログラムを含むアプリケーションを起動する。その後アイコン３００を操作して音声入力部１１０をアクティブにし、仮想オペレータに対話する要領で質問を口頭で述べる。例えば、「キャッシュカードをなくしたのだが」と述べることができる。発声が困難な状況であれば、テキスト入力によって質問を入力してもよい。プログラムの命令に従って通信端末１００の通信部１１４は、ユーザの音声情報を音声認識システムへ送信する（図１５、Ｓ１）。音声認識サーバ８００は音声情報を文字情報へ変換し、対話サーバ１０００へ伝達する（Ｓ２）。 Similar to the first embodiment, the user first starts an application including the program shown in the first embodiment installed in the communication terminal 100. Thereafter, the voice input unit 110 is activated by operating the icon 300, and the question is verbally described in a manner of interacting with the virtual operator. For example, you can say “I lost my cash card”. If it is difficult to speak, a question may be input by text input. According to the command of the program, the communication unit 114 of the communication terminal 100 transmits the user's voice information to the voice recognition system (FIG. 15, S1). The voice recognition server 800 converts the voice information into character information and transmits it to the dialogue server 1000 (S2).

対話サーバ１０００は対話エンジン１０２０を用いて質問の意図を理解する。対話サーバ１０００は質問に対する回答メッセージを選択し、その文字情報を音声合成サーバ９００に伝達する（Ｓ３）。音声合成サーバ９００は上記問合せメッセージを言語解析して波形を生成し、音声情報、および文字情報として通信端末１００へ送信する（Ｓ４）。通信端末１００は音声出力部１１２を用いて音声情報を再生するとともに、表示部１０６上に問合せメッセージを表示する。 The dialogue server 1000 uses the dialogue engine 1020 to understand the intention of the question. The dialogue server 1000 selects an answer message for the question, and transmits the character information to the speech synthesis server 900 (S3). The speech synthesis server 900 performs a linguistic analysis on the inquiry message to generate a waveform, and transmits the waveform to the communication terminal 100 as speech information and character information (S4). The communication terminal 100 reproduces voice information using the voice output unit 112 and displays an inquiry message on the display unit 106.

本実施例で想定したケースでは、事業者はユーザに対して速やかに所定の部署、例えば喪失受付センターなどへ通知するよう要求する必要がある。したがって対話サーバは回答メッセージを選択すると同時に、事業者管理サーバ１０４０に対して該当部署の電話番号の送信を要求する（Ｓ５）。事業者管理サーバ１０４０は対話サーバ１０００の要求に従って必要な情報を送信する（Ｓ６）。 In the case assumed in this embodiment, the business operator needs to promptly notify the user to a predetermined department, for example, the loss reception center. Therefore, the dialog server selects the reply message and requests the business management server 1040 to transmit the telephone number of the corresponding department (S5). The business entity management server 1040 transmits necessary information in accordance with a request from the dialogue server 1000 (S6).

本実施例のケースにおける回答メッセージは、例えば「キャッシュカードでのお取引ができないよう設定しますので、喪失受付センターへお繋ぎします」のようなメッセージが挙げられる。これによりユーザは通話モードへ切り替わることを予測することができる。 The reply message in the case of the present embodiment is, for example, a message such as “I will make a transaction with a cash card, so I will connect to the loss reception center”. Thereby, the user can predict switching to the call mode.

対話サーバ１０００は通信端末１００に対して通話モードへの切り替えを要求する（Ｓ７）。同時に対話サーバ１０００は通信サービス連携サーバ１１００に対して回線の接続を要求し、該当部署の電話番号を通知する（Ｓ８）。通信サービス連携サーバ１１００は対話サーバ１０００からの要求に応えてユーザの通信端末１００と所定の部署とを電話回線で接続し（Ｓ９）、ユーザとオペレータとの通話が開始される。 The dialogue server 1000 requests the communication terminal 100 to switch to the call mode (S7). At the same time, the dialog server 1000 requests the communication service cooperation server 1100 to connect the line and notifies the telephone number of the corresponding department (S8). In response to a request from the dialogue server 1000, the communication service cooperation server 1100 connects the user's communication terminal 100 and a predetermined department via a telephone line (S9), and a call between the user and the operator is started.

なお、ここではユーザの意向にかかわらず、通話モードへ切り替わる例を示したが、通話モードへの切り替えの要否をユーザの判断に委ねてもよい。例えば通話モードへの切り替え要求を通信端末１００が対話サーバ１０００から受信した際（Ｓ７）、電話アイコンを表示部１０６上に表示し、電話アイコンの操作によって通話が開始されるようにしてもよい。 In addition, although the example which switches to telephone call mode was shown here irrespective of a user's intention, you may entrust the necessity of the switching to telephone call mode to a user's judgment. For example, when the communication terminal 100 receives a request for switching to the call mode from the dialogue server 1000 (S7), a telephone icon may be displayed on the display unit 106, and the telephone call may be started by operating the telephone icon.

本実施例で想定した質問はＦＡＱの一種であり、例えば事業者が提供するサイト上でも回答を得ることも可能な場合が多い。しかしながら本実施例のケースはユーザにとっては一種の緊急事態であり、一刻でも早い問題解決を要望しているため、通信端末１００から事業者のサイトへアクセスする精神的余裕を持たない場合が多い。 The question assumed in this embodiment is a kind of FAQ, and for example, it is often possible to obtain an answer even on a site provided by a business operator. However, the case of the present embodiment is a kind of emergency situation for the user, and since it is desired to solve the problem as soon as possible, it often does not have a mental margin for accessing the site of the operator from the communication terminal 100.

これに対して本実施形態では、スマートフォンなどの常時携帯可能な通信端末１００にインストールされたアプリケーションを起動し、口頭で質問を述べるだけで自動的に所定の部署への通話を開始することが可能である。したがって本発明の実施形態により、ユーザに対してより簡便、かつ迅速なサービスを提供することができる。 On the other hand, in the present embodiment, it is possible to automatically start a call to a predetermined department simply by starting an application installed in a communication terminal 100 that is always portable, such as a smartphone, and simply speaking a question. It is. Therefore, according to the embodiment of the present invention, it is possible to provide a simpler and quicker service to the user.

上述した実施形態や実施例では、主に金融事業に関連する例を取り上げて説明を行ったが、本発明の実施は金融業界に限られない。例えば食品業界、外食業界、物流業界、建設業界、空港・海運業界など、あらゆる業界、業種において実施、適用が可能である。その一例として本実施例では、ユーザが飲食店を利用する際のケースを想定し、図１６を参照しつつ説明を行う。なおここでの事業者とは、飲食店に関する情報を収集し、それをユーザに提供する個人あるいは団体であり、各飲食店はこの事業者に対して必要な情報を提供する。 In the above-described embodiments and examples, description has been given mainly taking examples related to the financial business, but the implementation of the present invention is not limited to the financial industry. For example, it can be implemented and applied in all industries and industries such as the food industry, restaurant industry, logistics industry, construction industry, airport and shipping industry. As an example, in the present embodiment, a case where a user uses a restaurant is assumed, and description will be made with reference to FIG. Here, the operator is an individual or an organization that collects information about restaurants and provides them to users, and each restaurant provides necessary information to the operators.

実施例１、２と同様にユーザはまず、通信端末１００にインストールされた第１実施形態で示したプログラムを含むアプリケーションを起動する。その後アイコン３００を操作して音声入力部１１０をアクティブにし、仮想オペレータに対話する要領で質問を口頭で述べる。例えば、「お腹すいた」とか、「ご飯が食べたい」と述べることができる。発声が困難な状況であれば、テキスト入力によって質問を入力してもよい。プログラムの命令に従って通信端末１００の通信部１１４は、ユーザの音声情報を音声認識システムへ送信する（Ｓ１）。音声認識サーバ８００は音声情報を文字情報へ変換し、対話サーバ１０００へ伝達する（Ｓ２）。 As in the first and second embodiments, the user first starts an application including the program shown in the first embodiment installed in the communication terminal 100. Thereafter, the voice input unit 110 is activated by operating the icon 300, and the question is verbally described in a manner of interacting with the virtual operator. For example, you can say "I'm hungry" or "I want to eat rice". If it is difficult to speak, a question may be input by text input. The communication unit 114 of the communication terminal 100 transmits the user's voice information to the voice recognition system according to the program command (S1). The voice recognition server 800 converts the voice information into character information and transmits it to the dialogue server 1000 (S2).

対話サーバ１０００は対話エンジン１０２０を用いて質問の意図を理解するが、上記の音声情報には、具体的な情報が全く含まれていない。例えばユーザが何を食べたいのか、どの程度の量を食べたいのか、アルコールを飲みたいのか、人数はどの程度か、喫煙者であるのかどうか、などといった情報は全く含まれていない。したがって事業者はユーザに対して最善の情報を提供することができない。そこで対話サーバはさらなる情報をユーザから得るため、準備された問合せメッセージから最適なメッセージを選択する、あるいは新たに作成し、これを音声合成サーバ９００へ伝達する（Ｓ３）。問合せメッセージとしては、例えば「どのようなものを召し上がりますか？」などといった自由回答形式のメッセージでもよく、あるいは「和食、中華、洋食のうちいずれが良いでしょうか？」といった選択形式のメッセージでもよい。 Although the dialogue server 1000 uses the dialogue engine 1020 to understand the intention of the question, the above voice information does not contain any specific information. For example, it does not contain any information such as what the user wants to eat, how much he wants to eat, whether he wants to drink alcohol, how many people are, whether he is a smoker, etc. Therefore, the business operator cannot provide the best information to the user. In order to obtain further information from the user, the dialogue server selects an optimum message from the prepared inquiry messages or creates a new message and transmits it to the speech synthesis server 900 (S3). The inquiry message may be a free-answer message such as “What do you want to eat?” Or a choice-type message such as “Which is Japanese, Chinese or Western?” .

音声合成サーバ９００は上記問合せメッセージを言語解析して波形を生成し、音声情報として通信端末１００へ送信する（Ｓ４）。実施例１、２と同様に、音声情報と同時に文字情報を音声合成サーバ９００あるいは対話サーバ１０００から通信端末１００に送信してもよい。通信端末１００は音声出力部１１２を用いて音声情報を再生するとともに、表示部１０６上に問合せメッセージを表示する。 The speech synthesis server 900 performs a linguistic analysis on the inquiry message to generate a waveform, and transmits the waveform as speech information to the communication terminal 100 (S4). Similarly to the first and second embodiments, text information may be transmitted from the speech synthesis server 900 or the dialogue server 1000 to the communication terminal 100 simultaneously with the speech information. The communication terminal 100 reproduces voice information using the voice output unit 112 and displays an inquiry message on the display unit 106.

同時に対話サーバ１０００は通信端末１００に対して位置情報の提供を要求する（Ｓ５）。この要求に応えて通信端末１００は、内蔵するＧＰＳ機能を利用して位置座標を獲得し、その情報を対話サーバ１０００に送信する（Ｓ６）。 At the same time, the dialogue server 1000 requests the communication terminal 100 to provide location information (S5). In response to this request, the communication terminal 100 acquires position coordinates using the built-in GPS function, and transmits the information to the dialogue server 1000 (S6).

この間、ユーザは問合せメッセージに対する回答を、Ｓ１のステップと同様に行う。回答としては、例えば「パスタが食べたい」といった具体的な回答でもよく、「温かいものが良い」といった抽象的な回答でもよい。通信端末１００はこの回答を音声認識サーバ８００に送信し（Ｓ７）、音声認識サーバ８００はユーザの音声を文字情報に変換して対話サーバ１０００に伝達する（Ｓ８）。 During this time, the user answers the inquiry message in the same manner as in step S1. The answer may be a concrete answer such as “I want to eat pasta” or an abstract answer such as “Warm food is good”. The communication terminal 100 transmits this answer to the voice recognition server 800 (S7), and the voice recognition server 800 converts the user's voice into character information and transmits it to the dialogue server 1000 (S8).

対話サーバ１０００はユーザからの回答を解析する。ユーザからの回答では最終的な飲食店にまで絞り込むことは困難であると対話サーバ１０００が判断した場合、再度問合せメッセージを選択、作成し、音声合成サーバ９００を介して通信端末１００へ送信する。そしてユーザはそれに対する回答を口頭で述べる。このステップを繰り返すことで、例えば「温かいものが食べたい」、「和食が良い」、「お酒も出してくれるところ」、「人数は３名」、「喫煙できるところがいい」といったように、より具体的な情報をユーザから取得することができる。 The dialogue server 1000 analyzes the answer from the user. When the dialog server 1000 determines that it is difficult to narrow down to the final restaurant based on the answer from the user, it selects and creates an inquiry message again and transmits it to the communication terminal 100 via the speech synthesis server 900. And the user verbally gives an answer to that. By repeating this step, for example, “I want to eat warm food”, “Good Japanese food”, “A place to serve sake”, “3 people”, “A place where you can smoke” Specific information can be acquired from the user.

対話サーバ１０００は事業者管理サーバ１０４０に対し、ユーザの位置座標付近に登録された飲食店の情報を要求する（Ｓ９）。事業者管理サーバ１０４０はこの要求に応え、データベース１０４２から要求された情報を対話サーバ１０００に送信する（Ｓ１０）。 The dialogue server 1000 requests the operator management server 1040 for information on restaurants registered near the position coordinates of the user (S9). In response to this request, the business entity management server 1040 transmits the information requested from the database 1042 to the dialogue server 1000 (S10).

対話サーバ１０００はユーザから得られた情報を解析し、セレクタ１０２４が有する絞り込み機能を用いてデータベース１０４２から取得した情報の中からユーザの希望に最もよく適合する回答を選択する。 The dialogue server 1000 analyzes the information obtained from the user, and selects an answer that best fits the user's wishes from the information obtained from the database 1042 using the narrowing function of the selector 1024.

次に対話サーバ１０００は回答が得られた旨をユーザに伝えるための回答メッセージを選択し、これを音声合成サーバ９００へ伝達する（Ｓ１１）。この時の回答メッセージとしては例えば、「お客様のご希望に最適なお店をご紹介します」というようなメッセージでよい。同時に対話サーバ１０００は情報配信サーバ１２００に対し、選択された回答、すなわち選択された飲食店の位置情報を要求する（Ｓ１２）。情報配信サーバ１２００は対話サーバ１０００からの要求された情報を対話サーバ１０００、あるいは直接通信端末１００へ送信する（Ｓ１３）。 Next, the dialogue server 1000 selects an answer message for informing the user that an answer has been obtained, and transmits this to the speech synthesis server 900 (S11). As an answer message at this time, for example, a message such as “I will introduce the best shop for customer's request” may be used. At the same time, the dialogue server 1000 requests the information distribution server 1200 for the selected answer, that is, the location information of the selected restaurant (S12). The information distribution server 1200 transmits the requested information from the dialogue server 1000 to the dialogue server 1000 or directly to the communication terminal 100 (S13).

音声合成サーバ９００は対話サーバ１０００から送られた回答メッセージを音声情報に変換し、通信端末１００へ送信する（Ｓ１４）。 The voice synthesis server 900 converts the reply message sent from the dialogue server 1000 into voice information and sends it to the communication terminal 100 (S14).

さらに対話サーバ１０００は通信端末１００に対して地図モードへの意向を要求する（Ｓ１５）。これに応えて通信端末１００は情報配信サーバ１２００に対して通信端末１００の位置情報と選択された飲食店の位置情報とともに経路探索を要求する（Ｓ１６）。情報配信サーバ１２００は通信端末１００の要求に応え、必要な地図情報と経路の探索結果を通信端末１００に送信する（Ｓ１７）。 Furthermore, the dialogue server 1000 requests the communication terminal 100 to make an intention for the map mode (S15). In response to this, the communication terminal 100 requests the information distribution server 1200 for a route search together with the position information of the communication terminal 100 and the position information of the selected restaurant (S16). In response to the request from the communication terminal 100, the information distribution server 1200 transmits necessary map information and a route search result to the communication terminal 100 (S17).

その後通信端末１００は、表示部１０６上に付近の地図、目的とする飲食店の位置、ならびに飲食店までの経路を表示する。 Thereafter, the communication terminal 100 displays a map of the vicinity, the position of the target restaurant, and the route to the restaurant on the display unit 106.

このように、ユーザの質問に対して対話を通じて対応することで、最適な回答をユーザに提供することができる。この時ユーザは、複数の回答から最適な回答を選択するという労力を払う必要がないため、最適な回答に速やかにたどり着くことができる。 In this way, by responding to the user's question through dialogue, an optimal answer can be provided to the user. At this time, the user does not need to pay the effort of selecting the optimum answer from a plurality of answers, and can quickly reach the optimum answer.

上述した各実施形態は、相互に矛盾しない限りにおいて、適宜組み合わせて実施することができる。また、各実施形態を基にして、当業者が適宜構成要素やプロセスの追加、削除もしくは設計変更を行ったものも、本発明の要旨を備えている限り、本発明の範囲に含まれる。 The above-described embodiments can be implemented in appropriate combination as long as they do not contradict each other. In addition, components that are appropriately added, deleted, or changed in design by those skilled in the art based on each embodiment are also included in the scope of the present invention as long as they include the gist of the present invention.

また、上述した各実施形態の態様によりもたらされる作用効果とは異なる他の作用効果であっても、本明細書の記載から明らかなもの、又は、当業者において容易に予測し得るものについては、当然に本発明によりもたらされるものと解される。 In addition, even for other operational effects different from the operational effects brought about by the aspects of the above-described embodiments, those that are apparent from the description of the present specification, or that can be easily predicted by those skilled in the art, Of course, it is understood that the present invention provides.

１００：通信端末、１０２：中央演算ユニット、１０４：メモリ、１０６：表示部、１０８：入力部、１１０：音声入力部、１１２：音声出力部、１１４：通信部、１１６：画像取込装置、２００：ステップ、２０２：ステップ、２０４：ステップ、２０６：ステップ、２０８：ステップ、２１０：ステップ、２１２：ステップ、２１４：ステップ、２１６：ステップ、２１８：ステップ、２２０：ステップ、２２２：ステップ、２２４：ステップ、２２６：ステップ、２２８：ステップ、２３０：ステップ、２３２：ステップ、２３４：ステップ、３００：アイコン、３０２：アイコン、４００：ステップ、４０２：ステップ、４０４：ステップ、４０６：ステップ、４０８：ステップ、４１０：ステップ、４１２：ステップ、６０２：アイコン、６０４：アイコン、７００：ステップ、７０２：ステップ、７０４：ステップ、７０６：ステップ、７１０：ステップ、７１２：ステップ、７２０：ステップ、７２２：ステップ、８００：音声認識サーバ、８１０：ＡＰＩ、８２０：音声認識エンジン、８３０：特徴量抽出部、８４０：デコーダ、８４２：言語モデル、８４４：音響モデル、８４６：語彙辞書、８５０：テキスト出力部、９００：音声合成サーバ、９１０：ＡＰＩ、９２０：音声合成エンジン、９３０：テキスト入力部、９４０：言語解析部、９４２：言語辞書、９４４：音声合成データベース、９５０：波形生成部、９６０：送信部、１０００：対話サーバ、１０１０：ＡＰＩ、１０２０：対話エンジン、１０２２：対話コントローラ、１０２４：セレクタ、１０２６：応答生成部、１０３２：メッセージ、１０３４：メッセージ、１０３６：メッセージ、１０３８：メッセージ、１０４０：事業者管理サーバ、１０４２：データベース、１０５０：管理サーバ、１０６０：情報データベース、１１００：通信サービス連携サーバ、１１１０：ＡＰＩ、１１２０：コントローラ、１１３０：通信ステータス管理データベース、１１４０：窓口データベース、１１５０：通信ネットワーク、１２００：情報配信サーバ、１２１０：ＡＰＩ、１２２０：経路探索手段、１２３０：地図情報データベース、１２４０：探索用データベース、１２５０：道路交通情報データベース 100: Communication terminal, 102: Central processing unit, 104: Memory, 106: Display unit, 108: Input unit, 110: Audio input unit, 112: Audio output unit, 114: Communication unit, 116: Image capturing device, 200 : Step, 202: Step, 204: Step, 206: Step, 208: Step, 210: Step, 212: Step, 214: Step, 216: Step, 218: Step, 220: Step, 222: Step, 224: Step 226: Step, 228: Step, 230: Step, 232: Step, 234: Step, 300: Icon, 302: Icon, 400: Step, 402: Step, 404: Step, 406: Step, 408: Step, 410 : Step, 412: Step, 602: A 604: Icon, 700: Step, 702: Step, 704: Step, 706: Step, 710: Step, 712: Step, 720: Step, 722: Step, 800: Voice recognition server, 810: API, 820: Speech recognition engine, 830: feature extraction unit, 840: decoder, 842: language model, 844: acoustic model, 846: vocabulary dictionary, 850: text output unit, 900: speech synthesis server, 910: API, 920: speech synthesis Engine, 930: text input unit, 940: language analysis unit, 942: language dictionary, 944: speech synthesis database, 950: waveform generation unit, 960: transmission unit, 1000: dialog server, 1010: API, 1020: dialog engine, 1022: Dialog controller, 1024: Selector, 1 26: Response generation unit, 1032: Message, 1034: Message, 1036: Message, 1038: Message, 1040: Operator management server, 1042: Database, 1050: Management server, 1060: Information database, 1100: Communication service cooperation server, 1110: API, 1120: Controller, 1130: Communication status management database, 1140: Window database, 1150: Communication network, 1200: Information distribution server, 1210: API, 1220: Route search means, 1230: Map information database, 1240: Search Database 1250: Road traffic information database

Claims

A voice recognition unit for recognizing voice information of a user transmitted from a communication terminal;
A dialog unit for selecting a reply message to the voice information from a database in which a plurality of messages are stored;
A transmission unit that transmits the selected response message to the communication terminal as at least one of voice information and character information;
The information providing system in which the dialog unit selects an inquiry message for asking the user a question from the database when the answer message cannot be narrowed down to one, and transmits the selected inquiry message to the communication terminal.

The information providing system according to claim 1, further comprising the communication terminal.

The communication terminal is configured to display a virtual operator;
2. When the response message and the inquiry message are reproduced on the communication terminal, a part of the virtual operator is configured to perform a large operation compared to when the answer message and the inquiry message are not reproduced. 1. The information providing system according to 1.

A first server storing the standard biometric information of the user;
The communication terminal has an image capturing device that captures the user's biological information,
The information providing system according to claim 1, wherein the user is allowed to input the audio information when the user is approved based on the biometric information and the standard biometric information.

The information providing system according to claim 1, further comprising a second server for a call between the user and a business operator.

The information providing system according to claim 5, wherein the second server is configured to store the voice information recognized by a voice recognition unit, the selected answer message, and the inquiry message.

A third server storing map information;
The first server is configured to store information regarding a plurality of stores of the operator;
The information providing system according to claim 4, wherein the communication terminal is configured to display a position of the communication terminal and information of the plurality of stores on a display unit.

The dialogue unit is configured to select at least one store from the plurality of stores based on the audio information,
The information providing system according to claim 7, wherein the third server is configured to transmit a route to the selected store to the communication terminal.

If the answer message corresponding to the question inputted by voice by the user is narrowed down from a plurality of messages, and the answer message cannot be narrowed down to one, the inquiry message for asking the user a question is selected from the plurality of messages. Server configured to.

The server according to claim 9, wherein the question is transmitted from a voice recognition server that recognizes voice information input by a user.

The server of claim 9, wherein the answer message and the inquiry message are configured to be communicated to a speech synthesis server.

The server according to claim 9, connected to a first server configured to store biometric information of the user.

The server of claim 12, wherein the first server is configured to store business store information.

14. A second server configured to store at least one of the user's question, the selected answer message, and the selected inquiry message is connected to the first server. Server described in.

Recognize voice information including user questions sent from communication terminals,
Selecting a response message corresponding to the question of the recognized user from a plurality of response messages;
The selected response message is transmitted to the communication terminal as at least one of voice information and character information,
If the answer message cannot be narrowed down to one, select an inquiry message for asking a question from the plurality of answer messages,
A program for causing a computer to execute transmission of the selected inquiry message to the communication terminal.

The program according to claim 15, wherein the user's biometric information captured by the communication terminal is received and the user is authenticated using the biometric information.

The program according to claim 15, further comprising storing at least one of the question of the user, the selected answer message, and the selected inquiry message.

Refer to at least one of the stored question of the user, the selected answer message, the selected inquiry message;
The program according to claim 15, further comprising connecting the communication terminal and a telephone terminal of a business operator.

Refer to at least one of the stored question of the user, the selected answer message, the selected inquiry message;
The program according to claim 15, further causing the store information of the business operator to be displayed on the communication terminal.

Receiving the question as text information from the communication terminal;
The program according to claim 15, further causing the reply message to be transmitted as character information to the communication terminal.

When the question included in the voice information cannot be detected, the question is received as character information from the communication terminal,
The program according to claim 15, further causing the reply message to be transmitted as character information to the communication terminal.

Receiving volume data around the communication terminal from the communication terminal;
The program according to claim 21, wherein the program is executed to receive the question as character information from the communication terminal according to the volume data.

Sends speech information to a system that includes a speech recognition server, dialogue server, speech synthesis server,
Receiving at least one of an answer message to the user's question included in the audio information, an inquiry message to the question,
A program for causing a communication terminal to output the answer message and the inquiry message as at least one of voice information and character information.

24. The program according to claim 23, wherein the program is configured to be stored in the communication terminal.

The program of claim 23, wherein the program is configured to be obtained from a communication network.

When the program displays a virtual operator and the answer message and the inquiry message are reproduced on the communication terminal, a part of the virtual operator performs a large operation compared to when the answer message and the inquiry message are not reproduced. 24. The program of claim 23, configured to perform.

24. The program of claim 23, wherein the program is configured to obtain user biometric information.

The program according to claim 23, wherein the program is configured to connect the communication terminal and a telephone terminal of a business operator.

The program acquires location information of the communication terminal,
Displaying a map on the communication terminal;
The program according to claim 23, configured to display store information of a business operator on the communication terminal.

30. The program according to claim 29, wherein the program further displays a route from the position of the communication terminal to the business operator's store.

The program according to claim 23, further causing the question to be transmitted as character information.

24. The program according to claim 23, wherein a volume around the communication terminal is measured, and the volume data is transmitted to at least one of the voice recognition server, the dialogue server, and the voice synthesis server.

The program according to claim 32, further causing the question to be transmitted as character information in accordance with the volume.