JPS6382155A

JPS6382155A - Telephone set

Info

Publication number: JPS6382155A
Application number: JP61229018A
Authority: JP
Inventors: Yoshimi Betsushiyo; 別所　由実; Takeshi Norimatsu; 武志則松; Hiroyuki Naono; 博之直野
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1986-09-26
Filing date: 1986-09-26
Publication date: 1988-04-12

Abstract

PURPOSE:To accurately recognize the voice of user by using a voice storage section so as to store a response voice of the user in requesting a voice response to the user by means of a synthesized voice and giving a voice by the user in response to the user and outputting the voice in a voice storage section to a general terminal control section after all the synthesized voice is outputted. CONSTITUTION:In requesting a voice response by the user by means of the synthesized voice and inputting a voice by the user in response to the request, a switch 11 is set to as to input the input voice to the voice storage section 10. Then the voice storage section 10 stores the response voice of the user for a prescribed time and after all the synthesized voice is outputted, the voice in the voice storage section 10 is outputted to the general terminal control section 5. Thus, even when the user applies voice reply during the output of the synthesized voice, the voice of the user is inputted accurately to the voice recognition section 2 to attain the recognition processing.

Description

【発明の詳細な説明】産業上の利用分野本発明は、電話装置に用いられている多機能な音声によ
り制御するための音声認識部、音声合成部を持つ電話装
置に関するものである。DETAILED DESCRIPTION OF THE INVENTION Field of the Invention The present invention relates to a telephone device having a voice recognition section and a voice synthesis section for controlling multifunctional voices used in the telephone device.

従来の技術近年、三者通話や不在光転送など多様な機能を可能とす
る電話装置が市場に投入され注目されている。初期の電
話装置では、多機能を可能にするために、ダイヤル番号
と機能を一対一に対応させておいて、利用者がダイヤル
にて機能番号を入力することで機能を動作させるという
方法をとっていた。しかしながら、機能の多様化が進む
につれて、機能番号を記憶することが困難となり、ダイ
３　ページャル入力も操作が複雑であり扱いにくいという不都合が
生じてきた。そこで、従来の電話装置では、上記問題点
を解決するため、電話装置内に音声認識部と音声合成部
を設け、合成音声にて機能名人力時全利用者に知らせ、
利用者は合成音声に応じて機能名を音声にて入力し、利
用者音声を認識するとこで機能全動作させる方法がとら
れた。以上の方法により、機能番号全記憶する必要もな
く、多機能を容易に扱うことが可能となった。BACKGROUND OF THE INVENTION In recent years, telephone devices capable of various functions such as three-party calling and missed optical transfer have been introduced into the market and are attracting attention. In order to enable multiple functions, early telephone devices had a one-to-one correspondence between dial numbers and functions, and the user operated the functions by inputting the function number by dialing. was. However, as functions have become more diverse, it has become difficult to memorize function numbers, and the operation of the die 3 pager input has become complicated and difficult to handle. Therefore, in order to solve the above problems, conventional telephone devices have a voice recognition section and a voice synthesis section in the telephone device, and notify all users when the function is effective using synthesized voice.
The user inputs the function name aloud according to the synthesized voice, and when the user's voice is recognized, the full function is activated. The above method makes it possible to easily handle multiple functions without having to memorize all function numbers.

以下、図面全参照しながら、上述したような従来の電話
装置について説明を行う。第３図は、従来の電話装置の
ブロック図である。第３図において、１は音声合成部、
２は音声認識部、３は特殊端末制御部、４はシステム制
御部、１２は一般端末制御部、１３にハイブリッドコイ
ル、１４は音声入力端子、８は音声出力端子、１５は番
号入力端子である。以上のように構成された電話装置に
ついて、その動作を説明する。Hereinafter, the conventional telephone device as described above will be explained with reference to all the drawings. FIG. 3 is a block diagram of a conventional telephone device. In FIG. 3, 1 is a speech synthesis section;
2 is a voice recognition unit, 3 is a special terminal control unit, 4 is a system control unit, 12 is a general terminal control unit, 13 is a hybrid coil, 14 is a voice input terminal, 8 is a voice output terminal, and 15 is a number input terminal. . The operation of the telephone device configured as described above will be explained.

利用者が電話装置の各機能を使用する際には、まず利用
者は番号入力端子１５より機能スタート用番号を入力し
、入力された信号は一般端末制御部１２を経てシステム
制御部４に入力される。When a user uses each function of the telephone device, the user first inputs a function start number from the number input terminal 15, and the input signal is input to the system control unit 4 via the general terminal control unit 12. be done.

次にシステム制御部４より特殊端末制御部３を経て音声
合成部１に合成音声を要求し、音声合成部１より利用者
に機能名を要求するための合成音声が出力される。合成
音声は、たとえば「機能名をどうぞ」でもよい。合成音
声は、特殊端末制御部３．システム制御部４．一般端末
制御部１２゜ハイブリッドコイル１３を経て音声出力端
子８より出力される。利用者は合成音声に答えて、必要
とする機能名音声を音声入力端子１４より入力し、機能
名音声は、ハイブリッドコイル１３．一般端末制御部１
２．システム制御部４．特殊端末制御部３を経て音声認
識部２に入力され、認識処理後、認識結果は特殊端末制
御部３に出力される。たとえば、機能名は「不在転送」
であったとする。Next, the system control section 4 requests a synthesized speech from the speech synthesis section 1 via the special terminal control section 3, and the speech synthesis section 1 outputs the synthesized speech for requesting the user for the function name. The synthesized speech may be, for example, "Please give me the function name." The synthesized voice is generated by the special terminal control unit 3. System control unit 4. The general terminal control unit 12 is outputted from the audio output terminal 8 via the hybrid coil 13. In response to the synthesized voice, the user inputs the desired function name voice from the voice input terminal 14, and the function name voice is input to the hybrid coil 13. General terminal control unit 1
2. System control unit 4. It is input to the speech recognition section 2 via the special terminal control section 3, and after recognition processing, the recognition result is output to the special terminal control section 3. For example, the feature name is "Call Forwarding"
Suppose it was.

次に、特殊端末制御部３より音声合成部１に認識結果確
認用合成音声を要求し、音声合成部１より合成音声が出
力される。合成音声は、たとえば「不在転送ですか」で
もよい。合成音声は、機能６　ページ名を要求する上記合成音声と同様に音声出力端子８より
出力される。利用者は合成音声に答えて、認識結果の正
誤判断音声を音声入力端子１４より入力する。正誤判断
音声は、たとえば「はい」または「いいえ」であったと
する。正誤判断音声は、機能名音声と同様に音声認識部
２に入力され、認識処理後、認識結果は特殊端末制御部
３に出力される。Next, the special terminal control section 3 requests the speech synthesis section 1 for synthesized speech for confirming the recognition result, and the speech synthesis section 1 outputs the synthesized speech. The synthesized voice may be, for example, "Is this call forwarded?" The synthesized speech is output from the speech output terminal 8 in the same way as the synthesized speech for requesting the page name in function 6. In response to the synthesized speech, the user inputs speech for determining whether the recognition result is correct or incorrect from the speech input terminal 14. It is assumed that the correct/incorrect judgment voice is, for example, “yes” or “no”. The correct/incorrect judgment voice is input to the voice recognition unit 2 in the same way as the function name voice, and after recognition processing, the recognition result is output to the special terminal control unit 3.

正認識結果の場合には、特殊端末制御部３よりシステム
制御部４に機能を動作するための信号が送られ、誤認識
結果の場合には、特殊端末制御部３よシ音声合成部１に
、再度、機能名要求用合成音声を要求する。In the case of a correct recognition result, a signal for operating a function is sent from the special terminal control unit 3 to the system control unit 4, and in the case of an incorrect recognition result, a signal is sent from the special terminal control unit 3 to the system control unit 1. , requests the synthesized voice for requesting the function name again.

発明が解決しようとする問題点しかしながら、上記のような構成では、一般端末制御部
１２とハイブリッドコイル１３間の音声用回線は２線で
あるので、合成音声と利用者の応答音声を同時に通信す
ることは無理である。そのために、合成音声の出力中に
利用者が音声応答を始めると、音声応答は一般端末制御
部１２に正確６　ヘ一／゛に出力されず、音声認識が不可能になる。上記のような
従来の動作においては、通常は合成音声と音声応答が同
時に入力されないが、上記従来装置全実際に使用してみ
ると、利用者が合成音声が終わるときを待たずに音声応
答を始めてしまうことが頻繁に起こったため、その度に
音声認識が不可能になるという問題点を有していた。Problems to be Solved by the Invention However, in the above configuration, since the voice line between the general terminal control unit 12 and the hybrid coil 13 is two wires, the synthesized voice and the user's response voice are communicated simultaneously. That is impossible. For this reason, if the user starts a voice response while the synthesized voice is being output, the voice response will not be output to the general terminal control unit 12 exactly 6/2, making voice recognition impossible. In the conventional operation described above, synthesized speech and voice response are usually not input at the same time, but when all of the above conventional devices are actually used, it is possible for the user to input voice response without waiting for the synthesized voice to finish. This has caused the problem that voice recognition becomes impossible each time it occurs.

本発明は、上記問題点に鑑み、合成音声により利用者に
音声応答を要求し利用者が要求に応じて音声を入力する
際に、上記音声保存部に利用者の応答音声を記憶し、合
成音声が全て出力された後に、音声保存部内の音声を一
般端子制御部に出力することにより、合成音声の出力中
に利用者が音声応答を行ったとしても、利用者の音声全
正確に音声認識部に入力し認識処理を可能とすることが
できる電話装置を提供するものである。In view of the above-mentioned problems, the present invention stores the user's response voice in the voice storage unit and synthesizes it when the user is requested to give a voice response using synthesized voice and the user inputs voice in response to the request. By outputting the voice in the voice storage unit to the general terminal control unit after all voices are output, even if the user makes a voice response while the synthesized voice is being output, all of the user's voices can be accurately recognized. The present invention provides a telephone device that can perform recognition processing by inputting information into a section.

問題点を解決するための手段この目的を達成するために本発明の電話装置は、利用者
の音声を認識する音声認識部と、利用者に音声入力を要
求したり上記音声認識部での認識結７　ページ果全利用者に知らせるための合成音声を制御する音声合
成部と、上記音声認識部と音声合成部を制御する特殊端
末制御部と、合成音声及び利用者の音声を出力する音声
出力端子と、利用者の音声を入力する音声入力端子と、
番号を入力するための番号入力端子と、利用者の音声全
保存する音声保存部と、上記各端子及び音声保存部を内
蔵している電話端末機を制御する一般端末制御部と、上
記各制御部を制御するシステム制御部とから構成されて
いる。Means for Solving the Problems In order to achieve this object, the telephone device of the present invention includes a voice recognition unit that recognizes the user's voice, and a voice recognition unit that requests voice input from the user and performs recognition by the voice recognition unit. Result 7: a speech synthesis unit that controls synthesized speech to notify all users of the page result; a special terminal control unit that controls the speech recognition unit and speech synthesis unit; and a speech output that outputs the synthesized speech and the user's voice. a terminal, an audio input terminal for inputting the user's voice,
A number input terminal for inputting a number, an audio storage unit that stores all of the user's voice, a general terminal control unit that controls the telephone terminal that incorporates each of the above terminals and the audio storage unit, and each of the above controls. and a system control section that controls the system.

作用この構成によって、合成音声により利用者に音声応答を
要求し利用者が要求に応じて音声を入力する際に、上記
音声保存部に利用者の応答音声を記憶し、合成音声が全
て音声出力端子より出力された後に、上記音声保存部内
の音声を上記−膜端子制御部に出力する。Effect: With this configuration, when a voice response is requested from the user using synthesized voice and the user inputs voice in response to the request, the user's response voice is stored in the voice storage section, and all the synthesized voice is output as voice. After being output from the terminal, the audio in the audio storage section is output to the membrane terminal control section.

実施例以下、本発明の一実施例における電話装置について第１
図を参照しながら説明する。第１図において、１は音声
合成部、２は音声認識部、３は特殊端末制御部、４はシ
ステム制御部、５は一般端末制御部、６はハイブリッド
コイル、７は音声入力端子、８は音声出力端子、９は番
号入力端子、１０は音声保存部、１１はスイッチであり
、第２図に示す従来例と同じものは同一の番号を付与し
ている。Embodiment Hereinafter, the first example of a telephone device according to an embodiment of the present invention will be described.
This will be explained with reference to the figures. In FIG. 1, 1 is a speech synthesis section, 2 is a speech recognition section, 3 is a special terminal control section, 4 is a system control section, 5 is a general terminal control section, 6 is a hybrid coil, 7 is an audio input terminal, and 8 is a An audio output terminal, 9 a number input terminal, 10 an audio storage section, 11 a switch, and the same parts as in the conventional example shown in FIG. 2 are given the same numbers.

以上のように構成された電話装置について、以下その動
作を第２図の音声会話例のフローチャートを用いて説明
する。The operation of the telephone device configured as described above will be explained below using the flowchart of an example of voice conversation shown in FIG.

利用者が電話装置の多機能全使用する際には。When the user uses all the functions of the telephone device.

利用者は番号入力端子９より機能スタート用番号を入力
しく会話例２ｏ）、入力された信号は音声保存部１０．
ハイブリッドコイル６、一般端末制御部６を経てシステ
ム制御部４に入力され、システム制御部４より特殊端末
制御部３を経て音声合成部１に合成音声を要求し、利用
者に機能名を要求するための合成音声が音声合成部１よ
り出力される。合成音声は、たとえば「機能名をどうぞ
」でもよい（会話例２１）。合成音声は、特殊端末９　
ページ制御部３．システム制御部４．一般端末制御部６゜ハイ
ブリッドコイル６を経て音声出力端子８よシ出力される
。また、番号信号を受けた音声保存部１０では、以後の
利用者の入力音声を保存するために、スイッチ１１を音
声保存部１０側（Ａ端子）にセットする。The user inputs the function start number from the number input terminal 9 (conversation example 2o), and the input signal is sent to the voice storage section 10.
The signal is input to the system control unit 4 via the hybrid coil 6 and the general terminal control unit 6, and the system control unit 4 requests the speech synthesis unit 1 via the special terminal control unit 3 for synthesized speech, and requests the function name from the user. The synthesized speech for this purpose is output from the speech synthesis section 1. The synthesized speech may be, for example, "Please give me the function name" (conversation example 21). The synthesized voice is produced by special terminal 9.
Page control section 3. System control unit 4. The signal is output from the audio output terminal 8 via the general terminal control unit 6 and the hybrid coil 6. Further, the voice storage unit 10 that receives the number signal sets the switch 11 to the voice storage unit 10 side (terminal A) in order to store the user's input voice from now on.

次に利用者は、合成音声に答えて、必要とする機能名音
声を音声入力端子７より入力する。たとえば、機能名は
「不在転送」であったとする（会話例２２）。機能名音
声は、ハイブリッドコイル６に出力される前に、−旦、
音声保存部１ｏに出力され、一定時間音声保存部１０に
保存された後、ハイブリッドコイル６、一般端末制御部
５．システム制御部４．特殊端末制御部３を経て、音声
認識部２に入力される。音声保存部１ｏでの保存時間は
、合成音声長より少し短いことが好ましく。Next, the user answers the synthesized voice and inputs the desired function name voice from the voice input terminal 7. For example, assume that the function name is "call forwarding" (conversation example 22). Function name: Before the sound is output to the hybrid coil 6,
After being output to the audio storage unit 1o and stored in the audio storage unit 10 for a certain period of time, the hybrid coil 6, the general terminal control unit 5. System control unit 4. The signal is input to the speech recognition section 2 via the special terminal control section 3. Preferably, the storage time in the audio storage unit 1o is slightly shorter than the synthesized audio length.

たとえば合成音声が１機能名をどうぞ」の場合は、約１
秒間が適当である。認識処理後、認識結果は特殊端末制
御部３に出力され、特殊端末制御部３よシ音声合成部１
に認識結果確認用合成音声を要１　ｏへ一／゛求し、音声合成部１より合成音声が出力される。For example, if the synthesized voice is "1 function name please", approximately 1
Seconds is appropriate. After the recognition process, the recognition result is output to the special terminal control section 3, and the special terminal control section 3 then sends it to the speech synthesis section 1.
Then, the synthesized speech for confirming the recognition result is requested from the speech synthesizer 1, and the synthesized speech is output from the speech synthesis section 1.

合成音声は、たとえば「不在転送ですか」でもよい（会
話例２３）。合成音声は、上記機能名要求用合成音声と
同様に音声出力端子８より出力される。利用者は合成音
声に答えて、認識結果の正誤判断音声を音声入力端子７
より入力する。正誤判断音声は、たとえば「はい」また
け「いいえ」であったとする（会話例２４）。正誤判断
音声は、機能名音声と同様に、一定時間音声保存部１ｏ
に保存された後、音声認識部２に入力され、認識処理後
、認識結果は特殊端末制御部３に出力される。The synthesized voice may be, for example, "Is this a call forwarding?" (conversation example 23). The synthesized speech is output from the speech output terminal 8 in the same way as the synthesized speech for requesting the function name. The user answers the synthesized voice and inputs the voice to judge whether the recognition result is correct or incorrect at the voice input terminal 7.
Enter more information. It is assumed that the correct/incorrect judgment voice is, for example, “yes” and “no” (conversation example 24). The correct/incorrect judgment sound is stored in the sound storage unit 1o for a certain period of time, similar to the function name sound.
After being stored in , it is input to the speech recognition section 2 , and after recognition processing, the recognition result is output to the special terminal control section 3 .

正認識結果の場合には、特殊端末制御部３よりシステム
制御部４に機能を動作するための信号が送られ、機能の
動作が始まる（会話例２６）。誤認識結果の場合には、
特殊端末制御部３より音声合成部１に、再度、機能名を
要求する合成音声を要求する。In the case of a correct recognition result, a signal for operating the function is sent from the special terminal control unit 3 to the system control unit 4, and the operation of the function is started (conversation example 26). In case of incorrect recognition results,
The special terminal control section 3 requests the speech synthesis section 1 again for synthesized speech requesting the function name.

以上の処理後機能動作が終了すると、システム制御部４
より動作終了信号を一般端末制御部５を経て音声保存部
１０に出力する。次に音声保存部１１　ペー。When the above post-processing functional operations are completed, the system control unit 4
Then, an operation end signal is outputted to the audio storage section 10 via the general terminal control section 5. Next, the audio storage section 11 page.

１０で、スイッチ１１をハイブリッドコイル６側（Ｂ端
子側）にセットする。つまりスイッチ１１は、通常の通
話の際には入力音声ヲ７・イプリ・ノドコイル６に入力
するように動作し、各機能を使用する際には音声保存部
１ｏに入力するように動作する。At step 10, the switch 11 is set to the hybrid coil 6 side (B terminal side). In other words, the switch 11 operates to input the input voice to the input voice 7, input voice coil 6 during a normal call, and operates to input the voice to the voice storage section 1o when using each function.

以上のように、本実施例によれば、合成音声により利用
者に音声応答を要求し利用者が要求に応じて音声を入力
する際に、入力音声を音声保存部１ｏに入力できるよう
にスイッチ１１をセットし、音声保存部１０にて利用者
の応答音声を一定時間記憶し、合成音声が全て出力され
た後に、音声保存部１ｏ内の音声を一般端子制御部５に
出力することにより１合成音声の出力中に利用者が音声
応答を行ったとしても、利用者の音声を正確に音声認識
部２に入力し認識処理を可能とすることができる。As described above, according to this embodiment, when a synthesized voice is used to request a voice response from the user and the user inputs voice in response to the request, a switch is provided so that the input voice can be input into the voice storage section 1o. 11, the voice storage unit 10 stores the user's response voice for a certain period of time, and after all the synthesized voices are output, the voice in the voice storage unit 1o is output to the general terminal control unit 5. Even if the user makes a voice response while the synthesized voice is being output, the user's voice can be accurately input to the voice recognition unit 2 and recognition processing can be performed.

なお１本実施例では、音声保存部１０での記憶時間を一
定時間に固定しているが、各合成音声が出力し終わるま
でを記憶時間としてもよい。この際の動作を第２図の音
声会話例のフローチャートを用いて以下に説明する。Note that in this embodiment, the storage time in the audio storage unit 10 is fixed at a certain time, but the storage time may be set as the time until each synthesized voice is outputted. The operation at this time will be explained below using the flowchart of the example voice conversation shown in FIG.

利用者が電話装置の多機能を使用する際には、利用者は
番号入力端子９より機能スタート用番号を入力しく会話
例２０）、入力された信号は音声保存部１０．一般端末
制御部６を経てシステム制御部４に入力され、システム
制御部４より特殊端末制御部３を経て音声合成部１に合
成音声を要求し、音声合成部１より利用者に機能名を要
求するための合成音声が出力される。合成音声は、たと
えば「機能名をどうぞ。」でもよい（会話例２１）。When the user uses the multi-functions of the telephone device, the user should input a function start number from the number input terminal 9 (conversation example 20), and the input signal is stored in the voice storage section 10. It is input to the system control unit 4 via the general terminal control unit 6, the system control unit 4 requests the speech synthesis unit 1 to synthesize speech via the special terminal control unit 3, and the speech synthesis unit 1 requests the user for the function name. A synthesized voice for the purpose is output. The synthesized speech may be, for example, "Please give me the function name." (Example Conversation 21).

合成音声は、特殊端末制御部３．システム制御部４、一
般端末制御部５．ハイブリッドコイル６’ｚ経て音声出
力端子８より出力される。合成音声が全て出力されると
、特殊端末制御部３より合成音声終了信号がシステム制
御部４ｊ一般端末制御部５を経て音声保存部１ｏに入力
される。一方、番号入力端子９より番号信号を受けた音
声保存部１０では、以後の利用者の入力音声を保存する
ために、スィッチ１１全音声保存部１０側（Ａ端子１３
　ページ側）にセットする。The synthesized voice is generated by the special terminal control unit 3. System control section 4, general terminal control section 5. The signal is output from the audio output terminal 8 via the hybrid coil 6'z. When all the synthesized voices have been output, a synthesized voice end signal is input from the special terminal control section 3 to the system control section 4j and the general terminal control section 5 to the voice storage section 1o. On the other hand, in the voice storage section 10 that receives the number signal from the number input terminal 9, in order to save the user's input voice from now on, the switch 11 is switched on the side of the entire voice storage section 10 (A terminal 13
page side).

次に利用者は、合成音声に答えて必要とする機能名音声
を音声入力端子７より入力する。たとえば、機能名に「
不在転送」であったとする（会話例２２）。機能名音声
は、ノＡイブリッドコイル６に出力される前に、−旦音
声保存部１０に出力されるが、この時に、合成音声終了
信号が既に音声保存部１ｏに出力されていたら、機能名
音声を記憶せずに、そのまま一般端末制御部５に機能名
音声を出力する。しかし、終了信号がまだ出力されてい
なかったら、機能名音声は音声保存部１０に一旦保存さ
れ、終了信号が出力されると、機能名音声は、ハイブリ
ッドコイル６、一般端末制御部５、システム制御部４．
特殊端末制御部３を経て、音声認識部２に入力される。Next, the user inputs the desired function name voice from the voice input terminal 7 in response to the synthesized voice. For example, in the feature name "
Suppose that the message was "Non-call transfer" (Conversation example 22). The function name voice is output to the voice storage unit 10 for -1 time before being output to the NoA hybrid coil 6. At this time, if the synthesized voice end signal has already been output to the voice storage unit 1o, the function name The function name voice is directly output to the general terminal control section 5 without storing the voice. However, if the end signal has not been output yet, the function name voice is temporarily stored in the voice storage unit 10, and when the end signal is output, the function name voice is transmitted to the hybrid coil 6, the general terminal control unit 5, the system control Part 4.
The signal is input to the speech recognition section 2 via the special terminal control section 3.

認識処理後、認識結果は特殊端末制御部３に出力され、
特殊端末制御部３より音声合成部１に認識結果確認用合
成音声を要求し、音声合成部１より合成音声が出力され
る。合成音声は、たとえば「不在転送ですか」でもよい
（会話例２３）。合１４　ヘー。After the recognition process, the recognition result is output to the special terminal control unit 3,
The special terminal control section 3 requests the speech synthesis section 1 for synthesized speech for checking the recognition result, and the speech synthesis section 1 outputs the synthesized speech. The synthesized voice may be, for example, "Is this a call forwarding?" (conversation example 23). 14 Heh.

成音声は、上記機能名を要求する合成音声と同様に音声
出力端子８より出力される。利用者は合成音声に答えて
、認識結果の正誤判断音声を音声入力端子７より入力す
る。正誤判断音声は、たとえば「はい」または「いいえ
」であったとする（会話例２４）。正誤判断音声は、機
能名音声と同様の条件で、音声保存部１ｏに保存された
後、音声認識部２に入力され、認識処理後、認識結果は
特殊端末制御部３に出力される。The synthesized speech is output from the speech output terminal 8 in the same way as the synthesized speech requesting the function name. In response to the synthesized speech, the user inputs speech for determining whether the recognition result is correct or incorrect from the speech input terminal 7. It is assumed that the correct/incorrect judgment voice is, for example, “yes” or “no” (conversation example 24). The correct/incorrect judgment voice is stored in the voice storage unit 1o under the same conditions as the function name voice, and then input to the voice recognition unit 2. After recognition processing, the recognition result is output to the special terminal control unit 3.

正認識結果の場合には、特殊端末制御部３よりシステム
制御部４に機能を動作するための信号が送られ機能の動
作が始まる（会話例２５）。誤認識結果の場合には、特
殊端末制御部３より音声合成部１に、再度、機能名を要
求する合成音声を要求するように動作する。In the case of a correct recognition result, a signal for operating the function is sent from the special terminal control section 3 to the system control section 4, and the operation of the function is started (conversation example 25). In the case of an erroneous recognition result, the special terminal control section 3 operates to request the speech synthesis section 1 again for a synthesized speech requesting the function name.

本実施例では、前述した実施例の効果に加えて、応答音
声を保管する際の時間的な無駄を省くことができ、処理
時間を短縮できる。In this embodiment, in addition to the effects of the embodiments described above, it is possible to eliminate waste of time when storing response voices and shorten processing time.

発明の効果本発明は、合成音声により利用者に音声応答を１５　ペ
ーノ要求し利用者が要求に応じて音声を入力する際に、音声
保存部にて利用者の応答音声を記憶し、合成音声が全て
出力された後に、音声保存部内の音声を一般端子制御部
に出力することにより、合成音声の出力中に利用者が音
声応答を行ったとしても。Effects of the Invention The present invention provides that when a synthesized voice is used to request a voice response from a user and the user inputs a voice in response to the request, the voice storage unit stores the user's response voice, and the voice response is stored in the synthesized voice. By outputting the voice in the voice storage unit to the general terminal control unit after all the voices have been output, even if the user makes a voice response while the synthesized voice is being output.

利用者の音声を正確に音声認識部に入力し認識処理を可
能とすることができる優れた電話装置を実現するもので
ある。The objective is to realize an excellent telephone device that can accurately input the user's voice to the voice recognition unit and perform recognition processing.

[Brief explanation of the drawing]

第１図は本発明の一実施例における電話装置のブロック
図、第２図は同音声会話例のフローチャート、第３図は
従来の電話装置のブロック図である。１・・・・・・音声合成部、２・・・・・・音声認識部
、３・・・・・特殊端末制御部、４・・・・・・システ
ム制御部、５・・・・・・一般端末制御部、６・・・・
・・ハイブリッドコイル、７・・・・・・音声入力端子
、８・・・・・・音声出力端子、９・・・・・・番号入
力端子、１ｏ・・・・・・音声保存部、１１・・・・・
・スイッチ。代理人の氏名　弁理士　中　尾　敏　男　ほか１名第１
図７−−−ｔ？入カフ＃Ｉ）８−、　　±ｊ３″ ９−−一番、号△力　・Ｉ覧話請宋碗を第２図匡＝＝＝刀−利庁着り人斧青声ローＳｔＭ８入瞥戸Ｂびｂ＋ｐ第３図８−１す叡４あ叫−−−・Ｉ　入力　・１１５−　番５・・　・７を話鯖渫械FIG. 1 is a block diagram of a telephone device according to an embodiment of the present invention, FIG. 2 is a flowchart of an example of the same voice conversation, and FIG. 3 is a block diagram of a conventional telephone device. 1...Speech synthesis unit, 2...Speech recognition unit, 3...Special terminal control unit, 4...System control unit, 5...・General terminal control unit, 6...
...Hybrid coil, 7...Audio input terminal, 8...Audio output terminal, 9...Number input terminal, 1o...Audio storage section, 11・・・・・・
·switch. Name of agent: Patent attorney Toshio Nakao and 1 other person No. 1
Figure 7---t? Input cuff #I) 8-, ±j3″ 9--Ichiban, No. △ force ・I View story call Song bowl 2nd figure 匡 === Sword - Licho wearer Ax Seisei low StM8 Irybetsu door Bbi b+p Fig. 3 8-1 S 4 A shout --- I Input ・1 15- No. 5... ・ Speak 7 to the mackerel fishing machine

Claims

[Claims]

When controlling the multi-functions used in a telephone device by voice response, there is a voice recognition unit that recognizes the user's voice, and a voice recognition unit that requests voice input from the user and sends the recognition results from the voice recognition unit to the user. a speech synthesis unit that controls synthesized speech to inform the user; a special terminal control unit that controls the speech recognition unit and the speech synthesis unit; an audio output terminal that outputs the synthesized speech and the user's voice; A voice input terminal for inputting a number, a number input terminal for inputting a number, a voice storage section for storing the user's voice, and a two-wire type and a four-wire type for branching the voice output terminal and voice input terminal. a general terminal control unit that controls a telephone terminal incorporating a hybrid coil that performs mutual conversion, the number input terminal, voice input terminal, voice output terminal, voice storage unit, and hybrid coil, and each of the above control units. and a system control unit that controls the user, and when the user requests a voice response using synthesized voice and the user inputs voice in response to the request, stores the user's response voice in the voice storage unit, A telephone device characterized in that after all synthesized speech is output from the speech output terminal, the speech in the speech storage section is output to the general terminal control section.