JP2003259013A

JP2003259013A - Inquiring method and inquiring system

Info

Publication number: JP2003259013A
Application number: JP2002059932A
Authority: JP
Inventors: Mitsuru Endo; 充遠藤; Makoto Nishizaki; 誠西崎; Natsuki Saito; 夏樹齋藤
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2002-03-06
Filing date: 2002-03-06
Publication date: 2003-09-12

Abstract

<P>PROBLEM TO BE SOLVED: To solve the problem that a high recognition accuracy of the voice recognition and a low communication cost are required for an inquiring system. <P>SOLUTION: An information terminal makes a question in a voice file to prepare a question mail attached to an electronic mail, and sends the question mail to a mail answer server. The mail answer server prepares and sends an answer mail, utilizing the voice recognition, and the information terminal receives the answer mail from the mail answer server, thus obtaining an inquiring system having a high recognition accuracy of the voice recognition and a low communication cost. <P>COPYRIGHT: (C)2003,JPO

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、音声認識を利用し
た問い合わせ方法および問い合わせシステムに関する。TECHNICAL FIELD The present invention relates to an inquiry method and an inquiry system using voice recognition.

【０００２】[0002]

【従来の技術】従来、問い合わせシステムは特開平05-2
60184号公報に記載されたものが知られている。2. Description of the Related Art Conventionally, an inquiry system is disclosed in Japanese Patent Laid-Open No. 05-2.
The one described in Japanese Patent No. 60184 is known.

【０００３】図６に従来の問い合わせシステムの構成を
示している。図６において、電話機４１は無線を介して
中継機４２に接続され、中継機４２は電話回線を介して
電話応答サーバー４３に接続されている。データベース
４４は直接、電話応答サーバー４３と接続されている。FIG. 6 shows the configuration of a conventional inquiry system. In FIG. 6, a telephone 41 is wirelessly connected to a relay 42, and the relay 42 is connected to a telephone response server 43 via a telephone line. The database 44 is directly connected to the telephone response server 43.

【０００４】電話機４１は、質問音声を送話したり、応
答音声を受話したりするものである。中継機４２は、電
話機４１から電話応答サーバー４３に信号を中継した
り、電話応答サーバー４３から電話機４１に信号を中継
したりするものである。電話応答サーバー４３は、質問
音声の内容に応じてデータベース４４を検索し、応答文
を作成し、応答音声を返すものである。データベース４
４は、個人／団体名と住所と電話番号との対応を記述し
た電話帳情報などのデータを保持している。The telephone 41 is for transmitting a question voice and receiving a response voice. The repeater 42 relays a signal from the telephone 41 to the telephone response server 43 and a signal from the telephone response server 43 to the telephone 41. The telephone response server 43 searches the database 44 according to the content of the question voice, creates a response sentence, and returns the response voice. Database 4
Reference numeral 4 holds data such as telephone directory information in which correspondences between individual / group names, addresses, and telephone numbers are described.

【０００５】図７は電話応答サーバー４３が質問音声を
処理して応答音声を返す際の動作手順を示したフローチ
ャートである。FIG. 7 is a flowchart showing an operation procedure when the telephone response server 43 processes a question voice and returns a response voice.

【０００６】ユーザーが電話応答サーバー４３に電話を
かけると、電話応答サーバー４３は、回線を接続する
（ステップ５１）。次に、ユーザーは質問音声（例えば
「松下太郎さんの電話番号は？」という音声）を発声す
る（ステップ５２）。次に、電話応答サーバー４３は、
質問音声を音声認識処理して音声認識結果を得る（ステ
ップ５３）。認識結果として、例えば、“松下太郎さん
の電話番号は”というテキスト結果が得られる。When the user makes a call to the telephone response server 43, the telephone response server 43 connects the line (step 51). Next, the user utters a question voice (for example, a voice "Taro Matsushita's telephone number?") (Step 52). Next, the telephone response server 43
The question voice is subjected to voice recognition processing to obtain a voice recognition result (step 53). As a recognition result, for example, a text result "Taro Matsushita's telephone number is" is obtained.

【０００７】次に、電話応答サーバー４３は、認識結果
から“松下太郎”が名前であり、“電話番号”の問い合
わせであることを分析する。それから、電話応答サーバ
ー４３は、データベース４４を検索して“松下太郎”の
電話番号を調べる。例えばその結果を１２３４‐５６７
８番とする。そして、応答文“松下太郎さんの電話番号
は、１２３４‐５６７８です。”というテキストを生成
する（ステップ５４）。Next, the telephone response server 43 analyzes from the recognition result that "Taro Matsushita" is the name and the inquiry is "telephone number". Then, the telephone response server 43 searches the database 44 for the telephone number of "Taro Matsushita". For example, the result is 1234-567
Number 8 Then, the response sentence "Taro Matsushita's telephone number is 1234-5678" is generated (step 54).

【０００８】次に、電話応答サーバー４３は、応答文の
テキストを音声合成処理して応答音声を生成し、電話回
線に出力する（ステップ５５）。次に、ユーザーは応答
音声を聞いて電話を切り、電話応答サーバー４３は電話
回線を切断する（ステップ５６）。以上の処理手順によ
り、問い合わせに対する応答処理が完了する。Next, the telephone response server 43 performs voice synthesis processing on the text of the response sentence to generate a response voice and outputs it to the telephone line (step 55). Next, the user hears the response voice and hangs up, and the telephone response server 43 disconnects the telephone line (step 56). With the above processing procedure, the response processing to the inquiry is completed.

【０００９】[0009]

【発明が解決しようとする課題】この問い合わせシステ
ムにおいては、音声認識の認識精度が高いことと通信コ
ストが安いことが要求されている。In this inquiry system, it is required that the recognition accuracy of voice recognition is high and the communication cost is low.

【００１０】本発明は、音声認識の認識精度が高く、通
信コストが安い問い合わせシステムを実現することを目
的とする。It is an object of the present invention to realize an inquiry system with high recognition accuracy of voice recognition and low communication cost.

【００１１】[0011]

【課題を解決するための手段】この課題を解決するため
に本発明は、情報端末にて質問を音声ファイルにして、
電子メールに添付して質問メールとし、質問メールをメ
ール応答サーバーに送信し、メール応答サーバーにて音
声認識を利用して応答メールを作成して送信し、情報端
末にてメール応答サーバーからの応答メールを受信する
もので、電話を介して音声認識を利用する代わりに音声
ファイルと電子メールを介するように構成したものであ
る。In order to solve this problem, the present invention makes a question a voice file in an information terminal,
Attach it to an e-mail as a question e-mail, send the question e-mail to the e-mail response server, use the e-mail response server to create and send a response e-mail using voice recognition, and use the information terminal to respond from the e-mail response server It receives mail and is configured to go through voice files and email instead of using voice recognition over the phone.

【００１２】これにより、音声認識の認識精度が高く、
通信コストが安い問い合わせシステムが得られる。As a result, the recognition accuracy of voice recognition is high,
An inquiry system with low communication costs can be obtained.

【００１３】[0013]

【発明の実施の形態】本発明の請求項１に記載の発明
は、情報端末にて質問を音声ファイルにして、電子メー
ルに添付して質問メールとし、質問メールをメール応答
サーバーに送信し、メール応答サーバーにて音声認識を
利用して応答メールを作成して送信し、情報端末にてメ
ール応答サーバーからの応答メールを受信することを特
徴としたものであり、通信コストがメールの送受信の時
間に対してしかかからないという作用を有する。BEST MODE FOR CARRYING OUT THE INVENTION In the invention described in claim 1 of the present invention, a question is converted into a voice file in an information terminal, attached to an electronic mail to form a question mail, and the question mail is transmitted to a mail response server, The mail response server uses voice recognition to create and send a reply mail, and the information terminal receives the reply mail from the mail reply server. It has the effect that it only takes time.

【００１４】請求項２に記載の発明は、請求項１記載の
問い合わせ方法において、音声ファイルは電話の音声帯
域の制限をかけずに作成することを特徴としたものであ
り、電話の音声帯域の制約を受けないために音声認識の
精度を確保できるという作用を有する。The invention according to claim 2 is characterized in that, in the inquiry method according to claim 1, the voice file is created without limiting the voice band of the telephone. Since there is no restriction, it has the effect of ensuring the accuracy of voice recognition.

【００１５】請求項３に記載の発明は、請求項１または
請求項２記載の問い合わせ方法において、上記応答メー
ルはテキスト形式としたことを特徴ととしたものであ
り、応答文がテキスト形式なので情報を再読しやすく、
かつ通信コストが音声形式よりも安いという作用を有す
る。The invention according to claim 3 is characterized in that, in the inquiry method according to claim 1 or 2, the response mail is in text format, and since the response text is in text format, information Is easy to reread
In addition, the communication cost is lower than that of the voice format.

【００１６】請求項４に記載の発明は、情報端末とメー
ル応答サーバーからなる問い合わせシステムにおいて、
複数の情報端末とメール応答サーバーからなる問い合わ
せシステムにおいて、質問内容を音声ファイルにする音
声ファイル作成部と、前記音声ファイルを電子メールに
添付して質問メールとし、質問メールをメール応答サー
バーに送信するメール送信部と、メール応答サーバーか
らの応答メールを受信する応答メール受信部とを備えた
情報端末と、前記情報端末からの質問メールを受ける質
問メール受信部と、前記受信した音声ファイルからの質
問内容を音声認識によりテキストデータに変換する音声
認識部と、前記テキストデータを解析し、その解析され
た質問内容からデータベースを検索する検索部と、前記
質問内容と検索結果から応答文を作成する応答文作成部
と、前記応答文を音声ファイルに変換する音声合成部
と、前記音声ファイルを応答メールとして添付して送信
する送信部とを備えたメール応答サーバーとを具備する
ことを特徴とする問い合わせシステムとしたものであ
り、通信コストがメールの送受信の時間に対してしかか
からないという作用を有する。The invention according to claim 4 is an inquiry system comprising an information terminal and a mail response server,
In an inquiry system including a plurality of information terminals and a mail response server, a voice file creation unit for converting the content of a question into a voice file, attaching the voice file to an email as a question mail, and transmitting the question mail to a mail response server An information terminal equipped with a mail sending unit and a reply mail receiving unit for receiving a reply mail from a mail reply server, a question mail receiving unit for receiving a question mail from the information terminal, and a question from the received voice file A voice recognition unit that converts the content into text data by voice recognition, a search unit that analyzes the text data and searches a database from the analyzed question content, and a response that creates a response sentence from the question content and the search result. A sentence creation unit, a voice synthesis unit that converts the response sentence into a voice file, and the voice file The inquiry system is characterized by comprising a mail response server having a sending unit for sending and attaching as a reply mail, and the communication cost only affects the time for sending and receiving mail. Have.

【００１７】請求項５に記載の発明は、請求項４記載の
問い合わせシステムにおいて、音声ファイルは電話の音
声帯域の制限をかけずに作成することを特徴ととしたも
のであり、電話の音声帯域の制約を受けないために音声
認識の精度を確保できるという作用を有する。The invention according to claim 5 is the inquiry system according to claim 4, characterized in that the voice file is created without limiting the voice band of the telephone. The effect of ensuring the accuracy of speech recognition can be ensured because it is not restricted by.

【００１８】請求項６に記載の発明は、請求項４または
請求項５記載の問い合わせシステムにおいて、上記応答
メールはテキスト形式としたことを特徴としたものであ
り、応答文がテキスト形式なので情報を再読しやすく、
かつ通信コストが音声形式よりも安いという作用を有す
る。The invention according to claim 6 is characterized in that, in the inquiry system according to claim 4 or claim 5, the response mail is in a text format. Easy to reread,
In addition, the communication cost is lower than that of the voice format.

【００１９】請求項７に記載の発明は、質問内容を音声
ファイルにして、電子メールに添付した質問メールを受
信し、受信した質問メールの音声ファイルから音声認識
を利用してテキスト文に変換し、テキストに変換された
質問内容を分析し、その分析結果から予め応答内容を蓄
積したデータベースを検索し、検索結果と前記分析結果
から応答文を作成し、応答メールとして送信することを
特徴としたものであり、応答文がテキスト形式なので情
報を再読しやすく、かつ通信コストが音声形式よりも安
いという作用を有する。According to the invention of claim 7, the question content is converted into a voice file, the question mail attached to the electronic mail is received, and the voice file of the received question mail is converted into a text sentence by using voice recognition. Characterized by analyzing the question contents converted into text, searching a database in which the answer contents are accumulated in advance from the analysis result, creating a response sentence from the search result and the analysis result, and transmitting it as a response mail. Since the response text is in the text format, it has the effect of making it easier to reread the information and reducing the communication cost compared to the voice format.

【００２０】請求項８に記載の発明は、請求項７記載の
問い合わせに対する応答方法において、応答メールは、
応答文を音声合成により音声ファイルにして添付するこ
とを特徴とするもので、音声信号のみしか取り扱えない
端末でも利用することができるという作用を有する。According to the invention described in claim 8, in the response method to the inquiry according to claim 7, the response mail is
It is characterized in that the response sentence is attached as a voice file by voice synthesis, and it has an effect that it can be used even in a terminal that can handle only voice signals.

【００２１】請求項９に記載の発明は、質問内容を音声
ファイルにして、電子メールに添付した質問メールを受
ける質問メール受信部と、前記受信した音声ファイルか
らの質問内容を音声認識によりテキストデータに変換す
る音声認識部と、前記テキストデータを解析し、その解
析された質問内容からデータベースを検索する検索部
と、前記質問内容と検索結果から応答文を作成する応答
文作成部と、前記応答文を応答メールとして送信する送
信部とを具備したことを特徴とするもので、通信コスト
がメールの送受信の時間に対してしかかからないという
作用を有する。According to a ninth aspect of the present invention, the question content is converted into a voice file, the question mail receiving section receives the question mail attached to the electronic mail, and the question content from the received voice file is converted into text data by voice recognition. A voice recognition unit that converts the text data, a search unit that analyzes the text data and searches the database from the analyzed question contents, a response sentence creation unit that creates a response sentence from the question contents and search results, and the response It is characterized by including a transmitting unit for transmitting a sentence as a response mail, and has an effect that the communication cost only affects the time for transmitting and receiving the mail.

【００２２】請求項１０に記載の発明は、請求項９記載
のメール応答サーバー装置において、送信部は、前記応
答文を音声ファイルに変換する音声合成部を設け、音声
ファイルを添付して送信することを特徴とするもので、
音声信号のみしか取り扱えない端末でも利用することが
できるという作用を有する。According to a tenth aspect of the present invention, in the mail response server device according to the ninth aspect, the transmitting section is provided with a voice synthesizing section for converting the response sentence into a voice file, and the voice file is attached and transmitted. It is characterized by
It has an effect that it can be used even in a terminal that can handle only audio signals.

【００２３】以下、本発明の実施の形態について、図を
用いて説明する。Embodiments of the present invention will be described below with reference to the drawings.

【００２４】（実施の形態１）図１は、本発明の実施の
形態１における問い合わせシステムの構成を示した図で
ある。図１において、情報端末１１は無線を介して中継
機１２に接続され、中継機１２は電話回線を介してプロ
バイダ１３に接続され、プロバイダ１３はインターネッ
ト回線を介してメール応答サーバー１４と接続されてい
る。データベース１５は直接、メール応答サーバー１４
と接続されている。情報端末１１としては、携帯電話、
ＰＤＡ等を利用することができる。(Embodiment 1) FIG. 1 is a diagram showing a configuration of an inquiry system according to Embodiment 1 of the present invention. In FIG. 1, the information terminal 11 is wirelessly connected to a relay device 12, the relay device 12 is connected to a provider 13 via a telephone line, and the provider 13 is connected to a mail response server 14 via an internet line. There is. The database 15 is directly the mail response server 14
Connected with. As the information terminal 11, a mobile phone,
A PDA or the like can be used.

【００２５】中継機１２は、質問メールを情報端末１１
からプロバイダ１３に中継する。プロバイダ１３は、質
問メールを中継機１２からメール応答サーバー１４に転
送する。The relay machine 12 sends the inquiry mail to the information terminal 11
Relay to the provider 13. The provider 13 transfers the inquiry mail from the relay machine 12 to the mail response server 14.

【００２６】次に、情報端末１１の構成図を図２に示
し、以下に説明する。Next, a configuration diagram of the information terminal 11 is shown in FIG. 2 and will be described below.

【００２７】情報端末１１は、発声された質問をマイク
２１から入力し音声入力部２２で音声ファイルに変換す
る。質問メール作成部２３では、変換された音声ファイ
ルを電子メールに添付する。メール送信部２４では、質
問メールとし、送信する。また、応答メール受信部２５
で、質問メールに対する応答メールを受信して、添付さ
れている音声ファイルまたはテキストを出力部２６で再
生または表示する。The information terminal 11 inputs a spoken question from the microphone 21 and converts it into a voice file by the voice input unit 22. The question mail creating unit 23 attaches the converted voice file to the electronic mail. The mail sending unit 24 sends it as a question mail. Also, the response mail receiving unit 25
Then, the response mail to the question mail is received, and the attached voice file or text is reproduced or displayed on the output unit 26.

【００２８】次に、メール応答サーバー１４の構成図を
図３に示し、以下に説明する。Next, a configuration diagram of the mail response server 14 is shown in FIG. 3 and will be described below.

【００２９】情報端末１１からの質問メールをメール受
信部３１で受信し、質問メールに添付された音声ファイ
ルを取り出す。音声認識部３２は、取り出された音声フ
ァイルからの音声信号を音声認識によりテキストデータ
に変換する。分析部３３は、音声認識部３２からのテキ
ストデータから質問メールの内容を分析する。検索部３
４は、質問メールの内容に応じてデータベース１５を検
索し、応答文作成部３５で質問メールの内容と検索結果
を基に応答文を作成する。音声合成部３６は、作成され
た応答文を音声合成により音声ファイルに変換する。送
信部３７は、作成された音声ファイルを添付した応答メ
ールを作成し、プロバイダ１３に送信する。なお、本実
施の形態のデータベース１５は、個人／団体名と住所と
電話番号との対応を記述した電話帳情報などのデータを
保持しているものとする。The inquiry mail from the information terminal 11 is received by the mail reception unit 31, and the voice file attached to the inquiry mail is taken out. The voice recognition unit 32 converts the voice signal from the extracted voice file into text data by voice recognition. The analysis unit 33 analyzes the content of the question mail from the text data from the voice recognition unit 32. Search section 3
Reference numeral 4 searches the database 15 according to the content of the question mail, and the response sentence creating unit 35 creates a response sentence based on the content of the question email and the search result. The voice synthesizing unit 36 converts the created response sentence into a voice file by voice synthesis. The sending unit 37 creates a response mail with the created voice file attached, and sends it to the provider 13. The database 15 of the present embodiment is assumed to hold data such as telephone directory information in which correspondences between individual / group names, addresses, and telephone numbers are described.

【００３０】メール応答サーバー１４から送信された応
答メールは、プロバイダ１３、中継機１２を介して、情
報端末１１にて受信させる。The reply mail sent from the mail reply server 14 is received by the information terminal 11 via the provider 13 and the relay device 12.

【００３１】次に、図４〜図５を参照して問い合わせシ
ステムの動作について、詳細に説明する。Next, the operation of the inquiry system will be described in detail with reference to FIGS.

【００３２】図４は、情報端末１１を用いて質問メール
を作成する際の動作手順を示したフローチャートであ
る。FIG. 4 is a flow chart showing an operation procedure when creating a question mail using the information terminal 11.

【００３３】ユーザーは、情報端末１１の音声入力機能
を用いて、質問文（例えば「松下太郎さんの電話番号は
？」という発声）をマイク２１から入力して、音声入力
部２２で音声ファイルを作成する（ステップ２１）。音
声ファイルのファイル形式としては、ＭＰ３（ＭＰＥＧ
−１ Audio Layer3）、ＷＭＡ（Ｗｉｎｄｏｗｓ（登
録商標）ＭｅｄｉｏＡｕｄｉｏ）あるいはＡＤＰＣ
Ｍ（Adaptive Differencial Pulse Code Modulation）
などがよく知られている規格フォーマットである。The user uses the voice input function of the information terminal 11 to input a question sentence (for example, "utterance of Mr. Taro Matsushita's telephone number") from the microphone 21, and the voice input unit 22 outputs the voice file. Create (step 21). The file format of the audio file is MP3 (MPEG
-1 Audio Layer3), WMA (Windows (registered trademark) Media Audio) or ADPC
M (Adaptive Differential Pulse Code Modulation)
Is a well-known standard format.

【００３４】次に、ユーザーは、質問メール作成部２３
にてメール作成ソフトを起動し、題名と本文に何も書い
ていない電子メールに質問文の音声ファイルを添付し、
宛先にメール応答サーバー１４のメールアドレスを書き
込んだ質問メールを作成する（ステップ２２）。次に、
メール送信部２４で、質問メールを送信する（ステップ
２３）。Next, the user is asked by the question mail creating section 23.
Start the mail creation software at, attach the voice file of the question text to the e-mail with nothing written in the subject and text,
A question mail in which the mail address of the mail response server 14 is written is created (step 22). next,
The mail sending unit 24 sends a question mail (step 23).

【００３５】以上の処理手順により、質問メールが送信
される。次に、メール応答サーバー１４が質問メール
を処理して応答メールを作成する応答メール作成処理の
動作手順の例を示したフローチャートを、図５に示し以
下に説明する。The inquiry mail is transmitted by the above processing procedure. Next, a flow chart showing an example of an operation procedure of a response mail creating process in which the mail response server 14 processes a question mail and creates a response mail is shown in FIG. 5 and will be described below.

【００３６】メール受信部３１で、ユーザーからの質問
メールを受信し（ステップ３１）、質問メールに添付さ
れている音声ファイルを取り出す（ステップ３２）。The mail receiving unit 31 receives the question mail from the user (step 31) and takes out the voice file attached to the question mail (step 32).

【００３７】次に、音声認識部３２は、音声ファイルか
らの音声信号を音声認識処理によりテキストデータを音
声認識結果として得る（ステップ３３）。音声認識処理
は、音声信号を特徴パラメータに変換し、予め用意した
音響モデルと言語モデルと単語の発音辞書とを参照し
て、入力に対して音響的にかつ言語的に尤らしい単語系
列を探索することで実現できる。特徴パラメータはケプ
ストラム係数、音響モデルはＨＭＭ、言語モデルはN-gr
amモデル、探索方法はビームサーチ法などが知られてい
る。音声ファイルのヘッダーから標本化周波数等の情報
を得て、音声認識の制御パラメーターを設定することが
できる。認識結果として、例えば、“松下太郎さんの電
話番号は”というテキストの結果が得られる。Next, the voice recognition unit 32 obtains text data as a voice recognition result by voice recognition processing of the voice signal from the voice file (step 33). The speech recognition process converts a speech signal into a characteristic parameter, searches for a word sequence that is acoustically and linguistically likely to be input with reference to a prepared acoustic model, language model, and pronunciation dictionary of words. It can be realized by doing. Feature parameters are cepstrum coefficients, acoustic model is HMM, language model is N-gr
The am model and the beam search method are known as search methods. Information such as the sampling frequency can be obtained from the header of the voice file to set the control parameters for voice recognition. As a recognition result, for example, a text result "Taro Matsushita's telephone number is" is obtained.

【００３８】次に、分析部３３は、音声認識部３２の認
識結果から“松下太郎”が名前であり、“電話番号”の
問い合わせであることを分析する。分析の方法は、スロ
ットフィリング方式などの公知の方法で実現できる。ス
ロットフィリング方式は、必要なスロット（「名前」や
「問い合わせの対象」等の項目）と、各スロットに当て
はまるキーワードを予め用意しておき、テキストデータ
からキーワードと一致する部分を抽出してスロットに当
てはめる方式であり、これにより、テキストデータの問
い合わせ内容を分析できる。Next, the analysis unit 33 analyzes from the recognition result of the voice recognition unit 32 that "Taro Matsushita" is the name and the inquiry is "telephone number". The analysis method can be realized by a known method such as a slot filling method. The slot filling method prepares necessary slots (items such as "name" and "object of inquiry") and keywords that apply to each slot in advance, and extracts the part that matches the keyword from the text data and stores it in the slot. This is a method to be applied, which enables analysis of inquiry contents of text data.

【００３９】それから、検索部３４は、分析結果を基
に、データベース１５を検索して“松下太郎”の電話番
号を調べる。例えば、その結果を１２３４‐５６７８番
とする。そして、応答文作成部３５は、分析結果及び検
索結果を基に応答文“松下太郎さんの電話番号は、１２
３４‐５６７８です。”というテキストを生成する（ス
テップ３４）。応答文の生成は、予め問い合わせ対象ご
とに応答文パターン（例えば電話番号の問い合わせ用の
応答文パターンは”〔名前〕さんの電話番号は、〔電話
番号〕です”）を用意しておき、〔〕内を具体的な内容
に置きかえることで実現できる。Then, the search unit 34 searches the database 15 for the telephone number of "Taro Matsushita" based on the analysis result. For example, the result is set to 1234-5678. Then, the response sentence creating unit 35 determines that the response sentence “Taro Matsushita's telephone number is 12” based on the analysis result and the search result.
It is 34-5678. The text "" is generated (step 34). The response sentence is generated in advance for each inquiry target (for example, the response sentence pattern for inquiring a telephone number is "[name] 's telephone number is [phone number]. ]] ”) Is prepared and the contents in [] are replaced with concrete contents.

【００４０】次に、音声合成部３６は、応答文のテキス
トを音声合成処理して音声ファイルを作成する（ステッ
プ３５）。音声合成処理は、テキスト文を発音記号の列
に変換し、予め用意しておいた発音記号に対する波形デ
ータを、発音記号の列にしたがって接続することで実現
できる。Next, the voice synthesizing unit 36 performs voice synthesis processing on the text of the response sentence to create a voice file (step 35). The voice synthesis processing can be realized by converting a text sentence into a sequence of phonetic symbols and connecting waveform data for phonetic symbols prepared in advance according to the sequence of phonetic symbols.

【００４１】次に、送信部３７は、題名と本文に何も書
いていない電子メールに応答文の音声ファイルまたは応
答文のテキストを添付し、宛先に質問メールの送信者の
アドレスを書き込んだ応答メールを作成し（ステップ３
６）、応答メールを送信する（ステップ３７）。Next, the transmission unit 37 attaches the voice file of the response sentence or the text of the response sentence to the electronic mail in which the subject and the text are not written, and writes the address of the sender of the question mail in the response. Create an email (step 3
6) A reply mail is transmitted (step 37).

【００４２】以上の処理手順により、応答メールが送信
される。The response mail is transmitted by the above processing procedure.

【００４３】応答メールの受信は、情報端末１１の自動
受信機能によって受信される。応答メール受信部２５
は、質問メールに対する応答メールを受信する。出力部
２６は、添付されている音声ファイルまたはテキストを
スピーカ２８またはディスプレイ２７で再生または表示
する。The reception of the reply mail is received by the automatic reception function of the information terminal 11. Response mail receiver 25
Receives a reply mail to the question mail. The output unit 26 reproduces or displays the attached voice file or text on the speaker 28 or the display 27.

【００４４】あるいは、メールの受信は、適当なタイミ
ングを見計らって、ユーザーが手動で行うことも可能で
ある。ユーザーは、応答メールに添付された音声ファイ
ルを再生することにより、電話番号を知ることができ
る。Alternatively, the user can manually receive the mail at an appropriate timing. The user can know the telephone number by playing the audio file attached to the response mail.

【００４５】以上の手順により、質問メールと応答メー
ルのやりとりにより問い合わせ処理が完了する。By the above procedure, the inquiry process is completed by exchanging the question mail and the reply mail.

【００４６】なお、以上の説明では、質問文も応答文も
音声ファイルを添付する方法で説明したが、質問文は音
声ファイルを添付し、応答文はメール本文にテキスト形
式で送信する場合についても同様に実施可能である。In the above explanation, the method of attaching the voice file to both the question text and the response text has been explained, but the case where the question text is attached to the voice file and the response text is transmitted in the text form in the mail body is also applicable. It can be similarly implemented.

【００４７】また、以上の説明では、メール応答サーバ
ーは、音声認識を行ってから応答文を生成するまで、一
括処理で行う方法で説明したが、音声認識結果をテキス
ト形式でユーザーに一旦返信し、ユーザーが音声認識結
果を確認後再送付し、そのテキスト文の質問文を基にメ
ール応答サーバーが応答文を生成するようにしてもよ
い。Further, in the above description, the mail response server performs batch processing from the voice recognition to the generation of the response sentence, but the voice recognition result is temporarily returned to the user in a text format. The user may confirm the voice recognition result and resend it, and the mail response server may generate the response sentence based on the question sentence of the text sentence.

【００４８】また、以上の説明では、音声ファイルをそ
のまま添付する方法で説明したが、情報端末で音声ファ
イルを圧縮し、メール応答サーバーで音声ファイルを解
凍するようにしてもよい。Also, in the above description, the method of directly attaching the voice file has been described, but the voice file may be compressed by the information terminal and the voice file may be decompressed by the mail response server.

【００４９】また、音声認識結果の信頼度から、認識結
果が信頼できるかどうかの判定を行い、信頼できない場
合には、応答できない旨の通知とともに、発声方法を説
明したガイド文をユーザーに送るようにしてもよい。Further, it is judged from the reliability of the voice recognition result whether or not the recognition result is reliable, and if the recognition result is not reliable, a guide sentence explaining the utterance method should be sent to the user together with a notification that the response cannot be made. You may

【００５０】[0050]

【発明の効果】以上のように本発明によれば、電話を介
して音声認識を利用する代わりに音声ファイルと電子メ
ールを介するように構成したもので、音声認識の認識精
度が高く、通信コストが安い問い合わせシステムが得ら
れるという有利な効果が得られる。As described above, according to the present invention, the voice file and the electronic mail are used instead of the voice recognition via the telephone, the recognition accuracy of the voice recognition is high, and the communication cost is high. The advantageous effect that a cheap inquiry system can be obtained is obtained.

[Brief description of drawings]

【図１】本発明の実施の形態１における問い合わせシス
テムを示す概略構成図FIG. 1 is a schematic configuration diagram showing an inquiry system according to a first embodiment of the present invention.

【図２】本発明の実施の形態１における情報端末の構成
図FIG. 2 is a configuration diagram of an information terminal according to the first embodiment of the present invention.

【図３】本発明の実施の形態１におけるメール応答サー
バーの構成図FIG. 3 is a configuration diagram of a mail response server according to the first embodiment of the present invention.

【図４】質問メールを作成する際の動作手順を示したフ
ローチャートFIG. 4 is a flowchart showing an operation procedure when creating a question mail.

【図５】質問メールを処理して応答メールを作成する際
の動作手順の例を示したフローチャートFIG. 5 is a flowchart showing an example of an operation procedure when processing a question mail and creating a response mail.

【図６】従来の問い合わせシステムを示す概略構成図FIG. 6 is a schematic configuration diagram showing a conventional inquiry system.

【図７】質問音声を処理して応答音声を返す際の動作手
順の例を示したフローチャートFIG. 7 is a flowchart showing an example of an operation procedure when processing a question voice and returning a response voice.

[Explanation of symbols]

１１情報端末１２中継機１３プロバイダ１４メール応答サーバー１５データベース２１マイク２２音声入力部２３質問メール作成部２４メール送信部２５応答メール受信部２６出力部２７ディスプレイ２８スピーカ３１メール受信部３２音声認識部３３分析部３４検索部３５応答文作成部３６音声合成部３７送信部 11 Information terminal 12 repeaters 13 providers 14 Email response server 15 Database 21 microphone 22 Voice input section 23 Question mail composer 24 E-mail sending section 25 Response mail receiver 26 Output section 27 display 28 speakers 31 Email Receiver 32 Speech recognition unit 33 Analysis Department 34 Search Department 35 Response sentence creation section 36 Speech synthesizer 37 Transmitter

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩテーマコート゛(参考）Ｇ１０Ｌ 15/28 Ｈ０４Ｍ 3/42 ＲＨ０４Ｍ 3/42 Ｚ 11/00 ３０２Ｇ１０Ｌ 3/00 ５５１Ａ 11/00 ３０２５６１Ｈ５７１Ａ (72)発明者齋藤夏樹大阪府門真市大字門真1006番地松下電器産業株式会社内Ｆターム(参考） 5D015 AA05 KK02 LL12 5K015 AA07 GA07 5K024 BB07 GG03 GG05 5K101 KK02 LL01 NN08 NN11 NN21─────────────────────────────────────────────────── ─── Continuation of front page (51) Int.Cl. ⁷ Identification code FI theme code (reference) G10L 15/28 H04M 3/42 R H04M 3/42 Z 11/00 302 G10L 3/00 551A 11/00 302 561H 571A (72) Inventor Natsuki Saito 1006 Kadoma, Kadoma City, Osaka Prefecture Matsushita Electric Industrial Co., Ltd. F term (reference) 5D015 AA05 KK02 LL12 5K015 AA07 GA07 5K024 BB07 GG03 GG05 5K101 KK02 LL01 NN08 NN11 NN11 NN11

Claims

[Claims]

1. The information terminal converts the question into a voice file, attaches it to an e-mail to form a question mail, transmits the question mail to a mail response server, and the mail response server uses voice recognition to reply the response mail. An inquiry method characterized by creating and transmitting and receiving a reply mail from the mail reply server at the information terminal.

2. The inquiry method according to claim 1, wherein the voice file is created without limiting the voice band of the telephone.

3. The inquiry method according to claim 1, wherein the response mail is in a text format.

4. An inquiry system comprising a plurality of information terminals and a mail response server, and a voice file creating section for converting the content of a question into a voice file, and attaching the voice file to an electronic mail to form a question mail, and the question mail is mailed. An information terminal having a mail sending unit for sending to a response server and a reply mail receiving unit for receiving a reply mail from the mail reply server; a question mail receiving unit for receiving a question mail from the information terminal; A voice recognition unit that converts question contents from a voice file into text data by voice recognition, a search unit that analyzes the text data and searches a database from the analyzed question contents, and a response from the question contents and search results A response sentence creating unit for creating a sentence; a voice synthesis unit for converting the response sentence into a voice file; An inquiry system, comprising: a mail response server including a transmission unit that attaches and transmits a voice recording file as a response mail.

5. The inquiry system according to claim 4, wherein the voice file is created without limiting the voice band of the telephone.

6. The inquiry system according to claim 4 or 5, wherein the response mail is in a text format.

7. The question converted into a text by receiving a question mail attached to an e-mail with the question content as a voice file, converting the voice file of the received question mail into a text sentence by using voice recognition, and converting into a text. A method for responding to an inquiry, which comprises analyzing a content, searching a database in which response contents are accumulated in advance from the analysis result, creating a response sentence from the search result and the analysis result, and transmitting it as a response mail.

8. The response mail is characterized in that the response sentence is attached as a voice file by voice synthesis.
How to respond to the inquiries listed.

9. A question mail receiving unit for receiving a question mail attached to an e-mail, the question contents being a voice file, and a voice recognition unit for converting the question contents from the received voice file into text data by voice recognition. , A search unit that analyzes the text data and searches a database from the analyzed question contents, a response sentence creation unit that creates a response sentence from the question contents and search results, and sends the response sentence as a response mail A mail response server device comprising a transmission unit.

10. The mail response server device according to claim 9, wherein the transmission unit is provided with a voice synthesizing unit for converting the response sentence into a voice file, and the voice file is attached and transmitted.