JP2000267694A

JP2000267694A - Voice recognition device

Info

Publication number: JP2000267694A
Application number: JP11074360A
Authority: JP
Inventors: Hiroyuki Iwa; 博之岩
Original assignee: Kojima Press Industry Co Ltd
Current assignee: Kojima Industries Corp
Priority date: 1999-03-18
Filing date: 1999-03-18
Publication date: 2000-09-29

Abstract

PROBLEM TO BE SOLVED: To provide a voice recognition device with good operability in which even the cancellation of a voice command can be performed by voice, which dispenses with operation of a cancel switch, and from which the cancel switch itself can be eliminated. SOLUTION: The device has a voice command group with a hierarchical structure that includes, in each layer, final voice commands for commanding final equipment operations and intermediate voice commands, which require selection of a final command in a lower layer before an equipment operation is performed. Each layer of the voice command group includes a voice command that cancels the input voice command or a voice command that ends voice recognition.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical field of the invention] The present invention relates to a voice recognition device that controls the operation of equipment based on voice commands, and more particularly to a voice recognition device whose voice commands have a hierarchical structure.

[0002]

[Prior art] A voice recognition device is a system that identifies a voice command from a user's spoken instruction and sends an operation signal based on that command to a device, so that the device can be controlled by the user's voice alone.

[0003] One example of a voice recognition device currently on the market is the car navigation system. When the destination of the vehicle is entered, a car navigation system displays the route from the current position, obtained using GPS, to the destination on a map on a dedicated screen and provides road guidance. By using voice input to set the destination, in place of the button operations or touch-panel operations used originally, road guidance to the destination can be displayed on the screen automatically simply by speaking the destination, which further improves operability.

[0004] In general, the voice commands used in such a voice recognition device have a hierarchical structure, both to simplify the commands so that the driver can easily understand how to operate the equipment and to improve the rate at which the device recognizes voice commands. Consequently, several voice commands must be input in sequence before the final equipment operation is performed.

[0005] FIG. 4 shows an example of a hierarchical structure of voice commands, as used in a voice recognition device that controls a vehicle air conditioner and audio system. Air-conditioning control for an automobile includes switching of air outlets (FACE, FOOT, DEF, etc.), switching between inside and outside air, temperature setting, and turning the air conditioner ON/OFF. Audio control includes switching among compact disc, tape, AM/FM radio, and so on, playback of compact discs and tapes, radio tuning, and volume adjustment. The first layer contains the selection commands "air conditioning", "audio", "front defroster", "inside air", and "outside air". "Air conditioning" and "audio" are not final operation instructions to a device but intermediate commands that require selection of a branched voice command in the second or a lower layer. From the second layer down, branched final operation commands are mixed with intermediate commands that have further branched commands beneath them. The first layer also provides final operation commands that the driver may want to use right away, such as "front defroster", "inside air", and "outside air".

[0006] FIG. 5 is a block diagram showing the schematic configuration of a conventional voice recognition device used for automotive air conditioning and audio. The voice recognition device 200 consists of a voice recognition and voice synthesis unit 110 and a control unit 120. Within the unit 110, the voice recognition section is connected to a microphone 130 through which the driver speaks commands, and the voice synthesis section is connected to a speaker 140. The control unit 120 is connected to a recognition start switch 150 for starting voice recognition and to a cancel switch 160 for cancelling an instruction given by voice command. The control unit 120 is also connected so as to output operation control signals to the air conditioning 170 and the audio 180.

[0007] When the driver presses the recognition start switch 150, a recognition start signal is sent to the control unit 120 and voice recognition starts. First, a guide voice prompting selection of a first-layer voice command is played from the speaker 140. The guide voice is synthesized by the voice synthesis section. When the driver answers by speaking a chosen voice command into the microphone 130, the voice recognition section of the unit 110 compares it with the voice commands stored and registered in the storage means provided in the control unit 120. If the input is not recognized as any of the first-layer voice commands, it is treated as an input error and the guide voice prompts the driver to speak again. If it is recognized as a first-layer voice command, a guide voice prompting selection of a second-layer voice command is then played. When a final operation command is recognized, the control unit 120 sends an operation control signal to the air conditioning 170 or the audio 180, and the operation the driver wants is carried out.

[0008] If, when inputting a voice command, the driver mistakenly speaks a registered voice command other than the intended one, or speaks the intended command but the voice recognition device misrecognizes it as a different command, an equipment operation that the driver does not want will be carried out. A cancel button for cancelling a voice command is therefore provided, and the driver cancels the immediately preceding voice command by pressing it.

[0009] The cancel operation procedure is as follows.

[0010] 1. Driver: Presses the recognition start switch.

[0011] 2. Device: Plays a guide announcing the start of recognition. "Recognition processing will start. Would you like to operate the air conditioning or the audio?"

3. Driver: Speaks a first-layer voice command. "Air conditioning"

4. Device: Prompts for selection of a second-layer voice command. "Air conditioning will be operated. Please choose an operation item."

5. Driver: What the driver actually wanted was "front defroster" in the first layer, so the driver presses the cancel button to cancel the first-layer voice command.

[0012] 6. Device: Returns to the first layer and again prompts for selection of a first-layer voice command. "Recognition processing will start. Would you like to operate the air conditioning or the audio?"

7. Driver: Speaks the first-layer voice command again. "Front defroster"

8. Device: Plays the operation guide for starting the front defroster and ends the operation. "The windshield will be defogged."

[Problem to be solved by the invention] However, with a conventional voice recognition device that has a cancel switch, the cancel switch must be pressed every time a voice command is to be cancelled, which is bothersome. This halves the advantage of a voice recognition device, namely that equipment can be operated by voice input alone.

[0013] An object of the present invention is therefore to provide a voice recognition device in which voice commands can also be cancelled by voice, so that operation of a cancel switch is unnecessary and the cancel switch can be eliminated.

[0014]

[Means for solving the problem] The above object of the present invention is effectively achieved by a device that, in controlling the operation of equipment based on voice commands, has a voice command group with a hierarchical structure, the group including in each layer final voice commands for final equipment operation commands and intermediate voice commands for intermediate commands, which require selection of a final voice command in a lower layer before the equipment is operated, wherein each layer of the voice command group includes a voice command that cancels an input voice command or a voice command that ends the voice recognition processing.

[0015] The object is achieved more effectively by a voice recognition device that controls the operation of equipment based on voice commands and comprises: storage means storing a voice command group with a hierarchical structure, the group including in each layer final voice commands for issuing final equipment operation commands and intermediate voice commands for intermediate commands; voice recognition means that recognizes an input voice command by comparing the user's voice input with the voice commands in the storage means; guidance means that, when the input voice command is an intermediate voice command, prompts for voice input using synthesized speech from a voice synthesis unit so that a lower-layer voice command is selected; and control means that, when the input voice command is a final voice command, sends an operation command signal to the corresponding equipment, wherein each layer of the voice command group includes a voice command that cancels an input voice command or a voice command that ends the voice recognition processing.

[0016]

[Embodiments of the invention] Embodiment 1. Embodiment 1 of the voice recognition device according to the present invention is described below with reference to the drawings. FIG. 1 is an overall configuration diagram showing an embodiment of the voice recognition device according to the present invention, which controls the operation of automotive air conditioning and audio by voice commands. The voice recognition device 100 consists of a voice recognition unit 10, a voice synthesis unit 15, and a control unit 20. The voice recognition unit 10 is connected to a microphone 30 through which the driver inputs voice commands, and the voice synthesis unit 15 is connected to a speaker 40. The control unit 20 is connected to a recognition start switch 50 for starting voice recognition. When the driver speaks a voice command into the microphone 30, the voice recognition unit 10 compares it with the voice commands stored and registered in the storage means provided in the control unit 20 and recognizes the input voice command. The control unit 20 then sends an operation control signal to the air conditioning 70 or the audio 80, and the operation the driver wants is carried out.

[0017] Unlike the prior art, this embodiment has no cancel switch for cancelling an instruction given by voice command.

[0018] FIG. 2 is a system diagram showing an example of a command group with a hierarchical structure for controlling the operation of the air conditioning and audio in the embodiment of the present invention shown in FIG. 1. The first layer contains "air conditioning", "front defroster", "front defroster off", "audio", and so on. "Air conditioning" and "audio" are not final instruction commands but intermediate commands with options in the second layer. The other commands are final instruction commands that directly instruct a device to operate. An "end processing" command is also provided. When the "end processing" command is selected, the entire voice recognition operation ends.

[0019] The second layer contains "set temperature", "air conditioner", "rear defroster", and so on, which are the selection commands under the first-layer intermediate command "air conditioning", and "CD changer", "tape", "FM", "AM", and so on, which are the selection commands under the intermediate command "audio". Here too, "set temperature", "tape", "FM", and "AM" are intermediate commands. Each group in the second layer has "end processing" and "cancel" commands. The "cancel" command cancels the command given in the preceding layer. Command groups continue in the same way in the third layer, the fourth layer, and below.
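The layered command group of paragraphs [0018] and [0019] can be pictured as a small tree. The following Python sketch is only an illustration under stated assumptions: the names `Command`, `make`, `ROOT`, and `vocabulary` are hypothetical, the English command words are glosses of the Japanese originals, and the options beneath the second-layer intermediate commands are left empty because the text does not list them.

```python
from dataclasses import dataclass, field
from typing import Dict, List

END = "end processing"   # English gloss of the end command (assumed wording)
CANCEL = "cancel"        # English gloss of the cancel command (assumed wording)


@dataclass
class Command:
    name: str
    intermediate: bool = False  # True: a lower-layer choice is still required
    children: Dict[str, "Command"] = field(default_factory=dict)


def make(name, *children, intermediate=False):
    """Build a command; a command with children is always intermediate."""
    return Command(name, intermediate or bool(children),
                   {c.name: c for c in children})


ROOT = make(
    "root",
    make("air conditioning",
         make("set temperature", intermediate=True),  # lower layer not listed in the text
         make("air conditioner"),
         make("rear defroster")),
    make("front defroster"),
    make("front defroster off"),
    make("audio",
         make("CD changer"),
         make("tape", intermediate=True),
         make("FM", intermediate=True),
         make("AM", intermediate=True)),
)


def vocabulary(node: Command, layer: int) -> List[str]:
    """Words accepted at `layer` when choosing among the children of `node`."""
    words = list(node.children)
    words.append(END)            # every layer can end the recognition processing
    if layer >= 2:
        words.append(CANCEL)     # from layer 2 down, the previous choice can be cancelled
    return words
```

For example, `vocabulary(ROOT, 1)` lists the four first-layer commands plus "end processing", while `vocabulary(ROOT.children["audio"], 2)` additionally contains "cancel".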

[0020] Next, the procedure for cancelling a command in this embodiment is shown.

[0021] 1. Driver: Presses the recognition start switch.

[0022] 2. Device: Plays a guide announcing the start of recognition. "Recognition processing will start. Would you like to operate the air conditioning or the audio?"

3. Driver: Speaks a first-layer voice command. "Air conditioning"

4. Device: Prompts for selection of a second-layer voice command. "Air conditioning will be operated. Please choose an operation item."

5. Driver: What the driver actually wanted was "front defroster" in the first layer, so the driver says "cancel" to cancel the first-layer voice command.

[0023] 6. Device: Returns to the first layer and again prompts for selection of a first-layer voice command. "Recognition processing will start. Would you like to operate the air conditioning or the audio?"

7. Driver: Speaks the first-layer voice command again. "Front defroster"

8. Device: Plays the operation guide for starting the front defroster and ends the operation. "The windshield will be defogged."

Next, the operation of the voice recognition device of the present invention during the series of voice command operations in the above procedure is described with reference to the operation flowcharts shown in FIGS. 3(A) and 3(B).

[0024] When the driver presses the recognition start switch 50, the system starts operating. In step S10 the value 1 is assigned to the variable N, and in step S11 the guide for layer N is played. Since N = 1 at this point, the layer-1 guide is played: "Recognition processing will start. Would you like to operate the air conditioning or the audio?" Following this guide, when the driver speaks the first-layer voice command "air conditioning" in step S12, step S13 determines whether "air conditioning" is a layer-N command; if it is not, the process returns to step S11. If it is a layer-N command, the process proceeds to step S14, which determines whether the voice input is the end command; if so, processing ends. If it is not the end command, the process proceeds to step S15, which determines whether it is the cancel command, and step S16 then determines whether it is an intermediate command. If it is neither, step S17 executes the layer-N command. For example, when N = 1, the first-layer "air conditioning" command is executed.

[0025] If the voice input is an intermediate command in step S16, the process proceeds to step S18, increments the variable N by 1, and returns to step S11. If the voice input is the cancel command in step S15, the process proceeds to step S19, decrements the variable N by 1, and returns to step S11. In this way, voice commands in the second and third layers are selected in turn and executed. If the cancel command is spoken partway through, the process returns to the first layer and repeats the steps from S11.
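The loop through steps S10 to S19 can be sketched as follows. This is a minimal illustration and not the device's implementation: a scripted list of utterances stands in for the voice recognition unit, and `TREE`, `run_dialog`, and the log strings are hypothetical names introduced here. Cancel is accepted only below the first layer, which is an assumption based on FIG. 2 as described in the text.

```python
from typing import Iterable, List

END = "end processing"
CANCEL = "cancel"

# Nested dict: a dict value marks an intermediate command, None a final one.
TREE = {
    "air conditioning": {"air conditioner": None, "rear defroster": None},
    "audio": {"CD changer": None},
    "front defroster": None,
}


def run_dialog(utterances: Iterable[str], tree=TREE):
    """Return (executed_command_or_None, log) for a scripted input sequence."""
    log: List[str] = []
    path = [tree]                        # S10: len(path) plays the role of N = 1
    for heard in utterances:             # S12: one voice input per iteration
        n = len(path)
        log.append(f"guide layer {n}")   # S11: guide for layer N
        layer = path[-1]
        valid = set(layer) | {END} | ({CANCEL} if n > 1 else set())
        if heard not in valid:           # S13: not a layer-N command, prompt again
            continue
        if heard == END:                 # S14: end command stops everything
            log.append("end")
            return None, log
        if heard == CANCEL:              # S15 -> S19: N = N - 1
            path.pop()
            continue
        if layer[heard] is not None:     # S16 -> S18: N = N + 1
            path.append(layer[heard])
            continue
        log.append(f"execute {heard}")   # S17: final command
        return heard, log
    return None, log
```

Running `run_dialog(["air conditioning", "cancel", "front defroster"])` reproduces the cancel procedure of paragraphs [0021] to [0023]: the guide is played for layer 1, layer 2, and layer 1 again before "front defroster" is executed.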

[0026] After the layer-N command is executed in step S17, step S20 determines whether a predetermined time has elapsed, and step S21 determines whether a cancel voice input has been made. If the result is YES in step S20 or NO in step S21, the series of operations ends; otherwise, the process proceeds to step S22, where execution of the layer-N command is stopped, and returns to step S11 to play the layer-N guide.
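One reading of steps S20 to S22 is a short window after execution during which a spoken cancel still aborts the command. The sketch below assumes that reading; `after_execute`, the timestamped event list, and the 3-second default are hypothetical, since the text says only "a predetermined time".

```python
from typing import Iterable, Tuple

CANCEL = "cancel"


def after_execute(events: Iterable[Tuple[float, str]],
                  window_s: float = 3.0) -> str:
    """events: (seconds since execution began, utterance), in time order.

    Returns "abort" if a cancel utterance arrives inside the window,
    otherwise "done". The window length is an assumed value.
    """
    for t, heard in events:
        if t >= window_s:      # S20: the predetermined time has elapsed
            break
        if heard == CANCEL:    # S21: cancel voice input inside the window
            return "abort"     # S22: stop execution, go back to the S11 guide
    return "done"
```

With the default window, `after_execute([(1.0, "cancel")])` returns "abort", whereas a cancel spoken after 5 seconds leaves the command in effect.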

[0027] With the configuration of this embodiment, pressing a cancel switch is no longer necessary and the cancel switch can be omitted, so operability improves and the number of parts is reduced. In addition, because an "end processing" command is provided in each layer, the voice recognition processing can be ended at any stage of the voice recognition operation, which removes the bother of moving a hand to cancel the operation.

[0028] The word used for the cancel voice command is preferably a longer word that is not easily confused with the words of other voice commands, because misrecognition is then less likely. For example, "end voice recognition processing" is preferable to "end processing", and "hierarchy up" is preferable to "cancel".
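As a rough illustration of checking a vocabulary for easily confused words, the sketch below flags pairs of command words whose spellings are very similar. It is only a text-based proxy under loud assumptions: a real device would compare acoustic or phonetic patterns of the Japanese words, the English glosses are used here for readability, and `confusable_pairs` and the 0.8 threshold are hypothetical.

```python
from difflib import SequenceMatcher
from itertools import combinations


def confusable_pairs(words, threshold=0.8):
    """Return (a, b, score) for word pairs whose spelling similarity
    is at or above `threshold` (a stand-in for acoustic similarity)."""
    pairs = []
    for a, b in combinations(words, 2):
        score = SequenceMatcher(None, a, b).ratio()
        if score >= threshold:
            pairs.append((a, b, round(score, 2)))
    return pairs
```

Applied to `["front defroster", "front defroster off", "hierarchy up"]`, only the two defroster commands are flagged, while a distinctive control word such as "hierarchy up" is not close to either of them.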

[0029] Embodiment 2. Each layer of the voice command group may include a voice command that cancels an input voice command or a voice command that ends the voice recognition processing, and a cancel switch that cancels an input voice command may be provided as well, so that cancellation can be performed either by voice command or by the cancel switch.

[0030] Embodiment 3. The configuration of the present invention can also be applied to a device in which voice recognition is started by voice input rather than by a recognition start switch. In that case, the air conditioning or audio can be operated completely hands-free. This is particularly effective because, even if "start recognition" is commanded by mistake, there is no need to hunt for a cancel switch in a hurry; the command can be cancelled by voice immediately.

[0031] Embodiment 4. The configuration of the present invention can be applied in the same way not only to automotive air conditioning and audio equipment but also to car navigation systems, televisions, telephones with a hands-free function, and the like.

[0032]

[Effects of the invention] The voice recognition device of the present invention has, in each layer of a voice command group with a hierarchical structure, a cancel command that cancels the voice command of the preceding layer or a recognition end command that ends voice recognition. Because voice commands are cancelled by voice, the cancel switch can be eliminated, cancelling a voice command with a cancel switch becomes unnecessary, and operability in voice recognition processing is markedly improved.

[Brief description of the drawings]

FIG. 1 is a block diagram showing the overall configuration of Embodiment 1 of the voice recognition device according to the present invention and its peripheral devices.

FIG. 2 is a system diagram showing an example of a command group with a hierarchical structure for controlling the operation of the air conditioning and audio in the embodiment of the present invention shown in FIG. 1.

FIG. 3 is a diagram showing the operation flow of the voice recognition device of the present invention.

FIG. 4 is a system diagram showing an example of a command group for controlling the operation of the air conditioning and audio in a conventional voice recognition device.

FIG. 5 is a block diagram showing the overall configuration of a conventional voice recognition device and its peripheral devices.

[Explanation of symbols]

10 voice recognition unit, 15 voice synthesis unit, 20 control unit, 30 microphone, 40 speaker, 50 recognition start switch, 70 air conditioning, 80 audio.

Claims

1. A voice recognition device that controls the operation of equipment based on voice commands, comprising a voice command group having a hierarchical structure, the voice command group including, in each layer, final voice commands for final equipment operation commands and intermediate voice commands for intermediate commands that require selection of a final voice command in a lower layer before the equipment is operated, wherein each layer of the voice command group includes a voice command that cancels an input voice command or a voice command that ends the voice recognition processing.

2. A voice recognition device that controls the operation of equipment based on voice commands, comprising: storage means storing a voice command group having a hierarchical structure, the voice command group including, in each layer, final voice commands for final equipment operation commands and intermediate voice commands for intermediate commands that require selection of a voice command for a final equipment operation command in a lower layer before the equipment is operated; voice recognition means that recognizes an input voice command by comparing the user's voice input with the voice commands in the storage means; guidance means that, when the input voice command is an intermediate voice command, prompts for voice input using synthesized speech from a voice synthesis unit so that a lower-layer voice command is selected; and control means that, when the input voice command is a final voice command, sends an operation command signal to the corresponding equipment, wherein each layer of the voice command group includes a voice command that cancels an input voice command or a voice command that ends the voice recognition processing.