JP2001042891A

JP2001042891A - Speech recognition apparatus, speech recognition mounting device, speech recognition mounting system, speech recognition method, and memory medium

Info

Publication number: JP2001042891A
Application number: JP11212451A
Authority: JP
Inventors: Yasushi Sofugawa; 靖曽布川
Original assignee: Suzuki Motor Corp
Current assignee: Suzuki Motor Corp
Priority date: 1999-07-27
Filing date: 1999-07-27
Publication date: 2001-02-16

Abstract

PROBLEM TO BE SOLVED: To improve the operability, to reduce the cost over the entire part of a speed recognition apparatus and system and to enhance the performance of speech recognition function by constituting the apparatus and system so that the speech recognition can be started by speech input. SOLUTION: A center control unit 100 is capable of starting the speech recognition by inputting the speech of a previously recorded keyword from a microphone 150. If a user desires to use a word 'command recognition start' as the start keyword at this time, the user inputs the two words, 'keyword registration' and 'command recognition start' from the microphone 150. The 'command recognition start' is then registered as the start keyword. If the user desires to activate CD reproduction in a CD unit 110a by the speech recognition function thereafter, the user inputs the 'command recognition start' from the microphone 150 and in succession, the user inputs the operation keyword previously registered for the start of the CD reproduction action from the microphone. The reproduction action is then started.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、音声認識機能を有
する装置やシステムに適用される音声認識装置、音声認
識搭載装置、音声認識搭載システム、音声認識方法、及
び記憶媒体に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voice recognition device, a voice recognition mounting device, a voice recognition mounting system, a voice recognition method, and a storage medium applied to a device or system having a voice recognition function.

【０００２】[0002]

【従来の技術】近年では、音声認識機能が搭載されたカ
ーオーディオ装置、カーナビゲーションシステム、携帯
電話等の装置やシステムが多く使用されてきている。こ
の音声認識機能とは、使用者が操作スイッチやボタンを
操作する代わりに、音声（上記操作に対して予め設定し
ておいた任意の言葉、以下、「操作キーワード」と言
う）を入力することで、所望する動作（以下、「音声認
識機能による動作」と言う）を実行させることができる
機能である。2. Description of the Related Art In recent years, devices and systems such as a car audio device, a car navigation system, and a mobile phone having a voice recognition function have been widely used. This voice recognition function means that a user inputs a voice (an arbitrary word set in advance for the above operation, hereinafter referred to as an “operation keyword”) instead of operating an operation switch or button. This is a function that can execute a desired operation (hereinafter, referred to as “operation by voice recognition function”).

【０００３】例えば、図６示すような、上記音声認識機
能が搭載されたカーオーディオ装置４００がある。この
カーオーディオ装置４００では、次のようにして、音声
認識機能による動作が実行される。For example, as shown in FIG. 6, there is a car audio device 400 equipped with the voice recognition function. In the car audio device 400, the operation by the voice recognition function is executed as follows.

【０００４】先ず、使用者は、ＣＤ（Compact Disk）ユ
ニットやＭＤ（Magnetic Disk ）ユニット等を含むユニ
ット部４３０での所望する動作（ここでは、「ＣＤ再生
開始動作」とする）を音声認識機能によって行わせる直
前に、音声認識開始スイッチ４８０をＯＮに操作する。
これにより、コントローラ４７０は、今から使用者から
の操作キーワードの入力があることを認識する。First, a user performs a desired operation (here, referred to as “CD reproduction start operation”) in a unit section 430 including a CD (Compact Disk) unit, an MD (Magnetic Disk) unit, and the like by a voice recognition function. Immediately before the operation is performed, the voice recognition start switch 480 is turned on.
Thereby, the controller 470 recognizes that an operation keyword has been input from the user.

【０００５】そこで、使用者は、「ＣＤ再生開始動作」
に対して予め設定しておいた操作キーワード（”ＣＤス
タート”等）をマイク４５０から入力する。この入力さ
れた操作キーワードは、コントローラ４７０によって認
識され、それに対応する制御コマンドに変換されて、ド
ライバ４４０を介してユニット部４３０のＣＤユニット
１１０ａに供給されると共に、モニタ４１０やスピーカ
４６０に供給される。これにより、ＣＤユニット１１０
ａでのＣＤ再生開始動作が実行され、例えば、ＣＤ再生
による音楽がスピーカ４６０から出力されたり、その音
楽の題名がモニタ４１０に表示されたりする。[0005] Therefore, the user has to perform a "CD reproduction start operation".
, An operation keyword (“CD start” or the like) set in advance is input from the microphone 450. The input operation keyword is recognized by the controller 470, converted into a corresponding control command, supplied to the CD unit 110a of the unit section 430 via the driver 440, and supplied to the monitor 410 and the speaker 460. You. Thereby, the CD unit 110
The CD playback start operation at “a” is executed. For example, music by CD playback is output from the speaker 460 or the title of the music is displayed on the monitor 410.

【０００６】したがって、使用者は、上述のような音声
認識機能により、車を運転をしながらキー操作部４２０
のボタンやスイッチを操作することなく、ユニット部４
３０の各種ユニットでの所望する動作を実行させること
ができる。Accordingly, the user operates the key operating section 420 while driving the car by the above-described voice recognition function.
Unit unit 4 without operating any buttons or switches
A desired operation can be executed in each of the 30 units.

【０００７】[0007]

【発明が解決しようとする課題】ところで、上述したよ
うに従来の音声認識機能では、音声認識開始スイッチ４
８０をＯＮに操作することで、今から操作キーワードを
入力することを当該音声認識機能に対して認識させるよ
うに構成されている。すなわち、音声認識機能による動
作指示のための操作キーワードの入力の開始（以下、
「音声認識の開始」と言う）を、音声認識開始スイッチ
４８０で行なうように構成されている。これは、常に操
作キーワードの入力が可能な状態（音声認識開始スイッ
チ４８０がＯＮ状態）にしておくと、ある動作を実行さ
せるための操作キーワード以外の、単なる同乗者との会
話の音声や、ラジオから出力される音声、或いはノイズ
等までもが、操作キーワードの認識の対象となってしま
い、この結果、使用者が意図していない時に突然誤動作
する可能性があるためである。As described above, in the conventional voice recognition function, the voice recognition start switch 4 is used.
By turning on 80, the voice recognition function recognizes that an operation keyword is to be input. That is, the start of input of an operation keyword for an operation instruction by the voice recognition function (hereinafter, referred to as an operation keyword).
"Start of speech recognition") is performed by a speech recognition start switch 480. This is because if the operation keyword can be always input (the voice recognition start switch 480 is in the ON state), the voice of the conversation with the fellow passenger other than the operation keyword for executing a certain operation, or the radio This is because even voices or noises output from the terminal are subject to recognition of the operation keyword, and as a result, there is a possibility of malfunctioning suddenly when the user does not intend.

【０００８】したがって、従来の音声認識機能では、音
声認識開始スイッチ４８０を設け、音声認識開始スイッ
チ４８０がＯＮに操作されてから入力された音声のみ
を、操作キーワードの認識の対象として受け付けるよう
にしている。Therefore, in the conventional voice recognition function, the voice recognition start switch 480 is provided, and only the voice input after the voice recognition start switch 480 is turned on is received as the target of the operation keyword recognition. I have.

【０００９】しかしながら、音声認識開始スイッチ４８
０をＯＮに操作しないかぎり、入力音声が操作キーワー
ドの認識の対象として受け付けられないということは、
使用者は、ある所望する動作を音声認識機能によって実
行させたいときにはその都度、音声認識開始スイッチ４
８０を操作しなければならないということになる。これ
は、非常に面倒な操作であり、操作性向上のための音声
認識機能としての効果が薄れてしまうことになる。However, the voice recognition start switch 48
Unless 0 is turned on, the fact that the input voice is not accepted as a recognition target of the operation keyword means that
Whenever the user wants to execute a desired operation by the voice recognition function, the voice recognition start switch 4
You have to operate 80. This is a very troublesome operation, and the effect as the voice recognition function for improving the operability is weakened.

【００１０】また、上述した理由により音声認識開始ス
イッチ４８０を取り付ける必要性があることから、その
ための取り付け費用がかかり、また、取り付ける場所を
確保する必要があると共に、当該取り付け場所としては
使用者が扱いやすい場所であることが好ましい。これ
は、装置或いはシステム全体のコストアップの問題につ
ながると共に、例えば、車載装置であれば内装のデザイ
ン設計上に制限が生じるという問題につながる。[0010] Further, since it is necessary to mount the voice recognition start switch 480 for the above-mentioned reason, it is necessary to mount the voice recognition start switch 480, and it is necessary to secure a mounting place. It is preferable that the place is easy to handle. This leads to a problem of an increase in the cost of the apparatus or the entire system, and also a problem that, for example, in the case of an in-vehicle apparatus, there is a restriction on the design of the interior.

【００１１】そこで、本発明は、上記の欠点を除去する
ために成されたもので、音声認識の開始をも音声入力で
行えるように構成することで、操作性向上を図ることが
でき、さらには装置やシステム全体のコストダウン、及
び音声認識機能の高性能化をも図ることができる、音声
認識装置、音声認識搭載装置、音声認識搭載システム、
音声認識方法、及び記憶媒体を提供することを目的とす
る。Therefore, the present invention has been made to eliminate the above-mentioned drawbacks, and the operability can be improved by configuring so that voice recognition can be started by voice input. Can reduce the cost of the device and the entire system, and can also improve the performance of the voice recognition function, voice recognition device, voice recognition device, voice recognition system,
An object of the present invention is to provide a voice recognition method and a storage medium.

【００１２】[0012]

【課題を解決するための手段】斯かる目的下において、
第１の発明は、任意の機能に対して実行指示するための
操作キーワードの音声入力を認識する操作認識手段と、
上記操作認識手段での上記操作キーワードの認識動作開
始を指示するための開始キーワードの音声入力を認識す
る開始認識手段とを備え、上記操作認識手段は、上記開
始認識手段の認識結果に基づいて、上記操作キーワード
の認識動作を開始することを特徴とする。For such a purpose,
A first invention is an operation recognizing means for recognizing a voice input of an operation keyword for instructing execution of an arbitrary function,
Start recognition means for recognizing a voice input of a start keyword for instructing a start of a recognition operation of the operation keyword in the operation recognition means, wherein the operation recognition means is based on a recognition result of the start recognition means, The operation of recognizing the operation keyword is started.

【００１３】第２の発明は、上記第１の発明において、
上記操作認識手段は、上記開始認識手段により上記開始
キーワードの音声入力が認識されてから所定時間の間、
上記操作キーワードの認識動作を実行することを特徴と
する。According to a second aspect of the present invention, in the first aspect,
The operation recognition means, for a predetermined time after the start recognition means recognizes the voice input of the start keyword,
The operation of recognizing the operation keyword is performed.

【００１４】第３の発明は、上記第１の発明において、
上記開始認識手段により上記開始キーワードの音声入力
が認識されてから所定期間の間、又は上記開始認識手段
により上記開始キーワードの音声入力が認識されてから
上記操作認識手段により上記操作キーワードの音声入力
が認識されるまでの間、音の出力を禁止する制御手段を
備えたことを特徴とする。According to a third aspect, in the first aspect,
The voice input of the operation keyword is performed by the operation recognizing unit for a predetermined period of time after the voice input of the start keyword is recognized by the start recognition unit, or after the voice input of the start keyword is recognized by the start recognition unit. Until the recognition, a control means for prohibiting the output of the sound is provided.

【００１５】第４の発明は、上記第１の発明において、
上記開始キーワードの登録を指示するための登録キーワ
ードの音声入力を認識する登録認識手段と、上記登録認
識手段により上記登録キーワードの音声入力が認識され
た後の入力音声を上記開始キーワードとして登録する登
録手段とを備え、上記開始認識手段は、上記登録手段に
より登録された開始キーワードの音声入力を認識するこ
とを特徴とする。[0015] In a fourth aspect based on the first aspect,
Registration recognizing means for recognizing a voice input of the registered keyword for instructing the registration of the start keyword, and registration for registering the input voice after the voice input of the registered keyword is recognized by the registration recognizing means as the start keyword Means, wherein the start recognition means recognizes a voice input of the start keyword registered by the registration means.

【００１６】第５の発明は、上記第４の発明において、
上記登録手段は、上記登録認識手段により上記登録キー
ワードの音声入力が認識されてから所定時間の間の入力
音声を上記開始キーワードとして登録することを特徴と
する。According to a fifth aspect, in the fourth aspect,
The registration means registers an input voice for a predetermined time from the recognition of the voice input of the registered keyword by the registration recognition means as the start keyword.

【００１７】第６の発明は、上記第４の発明において、
上記登録認識手段により上記登録キーワードの音声入力
が認識されてから所定時間の間、又は上記登録認識手段
により上記登録キーワードの音声入力が認識されてから
上記登録手段により上記開始キーワードの登録が終了す
るまでの間、音の出力を禁止する制御手段を備えたこと
を特徴とする。According to a sixth aspect based on the fourth aspect,
The registration of the start keyword is completed by the registration unit for a predetermined time period after the voice input of the registered keyword is recognized by the registration recognition unit, or after the voice input of the registered keyword is recognized by the registration recognition unit. In the meantime, there is provided a control means for prohibiting sound output.

【００１８】第７の発明は、上記第１の発明において、
上記開始キーワードの登録を指示するための操作手段
と、上記操作手段により上記開始キーワードの登録が指
示された後の入力音声を上記開始キーワードとして登録
する登録手段とを備え、上記開始認識手段は、上記登録
手段により登録された開始キーワードの音声入力を認識
することを特徴とする。According to a seventh aspect, in the first aspect,
Operating means for instructing the registration of the start keyword, and registering means for registering, as the start keyword, the input voice after the registration of the start keyword is instructed by the operating means, wherein the start recognition means comprises: It is characterized in that the voice input of the start keyword registered by the registration means is recognized.

【００１９】第８の発明は、上記第７の発明において、
上記登録手段は、上記操作手段により上記開始キーワー
ドの登録が指示されてから所定時間の間の入力音声を上
記開始キーワードとして登録することを特徴とする。According to an eighth aspect, in the seventh aspect,
The registration means registers an input voice for a predetermined time after the instruction of the start keyword is instructed by the operation means as the start keyword.

【００２０】第９の発明は、上記第７の発明において、
上記操作手段により上記開始キーワードの登録が指示さ
れてから所定時間の間、又は上記操作手段により上記開
始キーワードの登録が指示されてから上記登録手段によ
り上記開始キーワードの登録が終了するまでの間、音の
出力を禁止する制御手段を備えることを特徴とする。According to a ninth aspect, in the seventh aspect,
During a predetermined time after the start keyword is instructed by the operation means, or until the start keyword registration is completed by the registration means after the start keyword is instructed by the operation means, It is characterized by comprising control means for prohibiting output of sound.

【００２１】第１０の発明は、複数の機能を有し、当該
複数の機能のうちの任意の機能を操作キーワードの音声
入力によって実行させることが可能な音声認識搭載装置
であって、請求項１〜９の何れかに記載の音声認識装置
を有することを特徴とする。According to a tenth aspect, there is provided a voice recognition mounting apparatus having a plurality of functions, wherein any of the plurality of functions can be executed by voice input of an operation keyword. A speech recognition device according to any one of claims 1 to 9.

【００２２】第１１の発明は、複数の機器が通信可能に
接続されてなる音声認識搭載システムであって、上記複
数の機器のうち少なくとも１つの機器は、請求項１〜９
の何れかに記載の音声認識装置を有し、当該音声認識装
置によって他の機器の動作制御を行なうことを特徴とす
る。An eleventh invention is a voice recognition system comprising a plurality of devices communicably connected to each other, wherein at least one of the plurality of devices is one of the first to ninth embodiments.
Wherein the voice recognition device controls the operation of another device.

【００２３】第１２の発明は、入力された操作キーワー
ドの音声を認識し、当該操作キーワードに基づいて対応
する機能を実行させるための音声認識方法であって、上
記操作キーワードの音声認識の開始を、開始キーワード
の音声入力を待って行なうことを特徴とする。A twelfth invention is a speech recognition method for recognizing a speech of an input operation keyword and executing a corresponding function based on the operation keyword, wherein the speech recognition of the operation keyword is started. , After the voice input of the start keyword is performed.

【００２４】第１３の発明は、請求項１〜９の何れかに
記載の音声認識装置の機能をコンピュータに実施させる
ためのプログラムを記憶したコンピュータが読み取り可
能な記憶媒体であることを特徴とする。According to a thirteenth aspect, the present invention is a computer-readable storage medium storing a program for causing a computer to execute the functions of the voice recognition device according to any one of the first to ninth aspects. .

【００２５】[0025]

【発明の実施の形態】以下、本発明の実施の形態につい
て図面を用いて説明する。Embodiments of the present invention will be described below with reference to the drawings.

【００２６】本発明は、例えば、図１に示すような、自
動車の運転席に設けられるセンターコントロールユニッ
ト１００に適用される。The present invention is applied to, for example, a center control unit 100 provided in a driver's seat of an automobile as shown in FIG.

【００２７】センターコントロールユニット１００は、
音声認識機能を有し、図１及び図２に示すように、音声
認識機能による動作指示の対象となる各種ユニットが設
けられたユニット部１１０と、メインの電源スイッチ等
が設けられたキー操作部１２０と、ユニット部１１０の
状態等を表示するためのモニタ１３０とを備えている。
また、センターコントロールユニット１００には、後述
するマイク１５０やスピーカ１６０が内蔵されている。The center control unit 100
As shown in FIGS. 1 and 2, a unit section 110 having a voice recognition function and provided with various units to be operated by the voice recognition function, and a key operation section provided with a main power switch and the like 120 and a monitor 130 for displaying the state of the unit section 110 and the like.
The center control unit 100 includes a microphone 150 and a speaker 160 described later.

【００２８】ユニット部１１０は、ＣＤユニット１１０
ａやＭＤユニットの他、図示していないラジオやカーナ
ビゲーションユニット等も含んでおり、これらのユニッ
トは、キー操作部１２０の操作により動作させることも
可能であり、マイク１５０から使用者が所望する動作に
対応する操作キーワードを入力することによっても動作
させることが可能なようになされている。The unit section 110 includes a CD unit 110
a and an MD unit, as well as a radio and a car navigation unit (not shown). These units can be operated by operating the key operation unit 120. The operation can also be performed by inputting an operation keyword corresponding to the operation.

【００２９】キー操作部１２０には、メインの電源スイ
ッチの他、音声認識機能による動作指示以外の動作のた
めのキーや、ユニット部１１０に対する各種動作指示の
ためのキー、後述するキーワード登録キー１２０ａ等の
各種キーが設けられている。The key operation unit 120 includes a main power switch, a key for operations other than an operation instruction by a voice recognition function, a key for various operation instructions to the unit unit 110, and a keyword registration key 120a to be described later. Etc. are provided.

【００３０】モニタ１３０は、ユニット部１１０の各種
ユニットの現在状態や、ユニット１１０部の動作に伴っ
た各種情報（ＣＤユニット１１０ａでのＣＤ再生動作に
よる再生トラック情報や、カーナビゲーションユニット
による地図表示等）等を表示する。The monitor 130 displays the current state of various units of the unit 110, various information associated with the operation of the unit 110 (reproduced track information by a CD reproducing operation of the CD unit 110a, map display by a car navigation unit, etc.). ) Etc. are displayed.

【００３１】マイク１５０は、ユニット部１１０を音声
認識機能によって動作させるための操作キーワード等の
音声を入力するためのものであり、スピーカ１６０は、
マイク１５０から入力された音声のリピート出力や、ユ
ニット部１１０の動作に伴った出力（ＣＤユニット１１
０ａでのＣＤ再生動作による音楽の出力や、ラジオの出
力等）等を行なうためのものである。The microphone 150 is for inputting a voice such as an operation keyword for operating the unit unit 110 by the voice recognition function.
Repeat output of the audio input from the microphone 150 and output accompanying the operation of the unit unit 110 (CD unit 11
0a, music output, radio output, etc.).

【００３２】ここで、従来装置では音声認識の開始（ユ
ニット部１１０を音声認識機能によって動作させるため
の操作キーワードの入力の開始）の指示を音声認識開始
スイッチで行なっていたのに対して、本実施の形態にお
けるセンターコントロールユニット１００では、上記音
声認識の開始を、予め登録したキーワード（以下、「開
始キーワード」と言う）の音声をマイク１５０から入力
することで行えるようになされている。Here, in the conventional apparatus, the start of voice recognition (start of input of an operation keyword for operating the unit 110 by the voice recognition function) is instructed by the voice recognition start switch. In the center control unit 100 according to the embodiment, the start of the voice recognition can be performed by inputting a voice of a keyword registered in advance (hereinafter, referred to as a “start keyword”) from the microphone 150.

【００３３】例えば、センターコントロールユニット１
００では、開始キーワードを登録するために用いるキー
ワード（以下、「登録キーワード」と言う）がデフォル
トとして予め用意されている。ここでは、例えば、登録
キーワードを”キーワードを登録”といった言葉として
いる。したがって、使用者が”コマンド認識開始”とい
う言葉を開始キーワードとして使用したい場合には、”
キーワードを登録”及び”コマンド認識開始”の２つの
言葉をマイク１５０から入力すれば、”コマンド認識開
始”が開始キーワードとして登録される。その後、使用
者がＣＤユニット１１０ａでのＣＤ再生を音声認識機能
によって動作させたい場合には、使用者は、先ず、開始
キーワードである”コマンド認識開始”をマイク１５０
から入力し、その入力に続いて、ＣＤ再生動作開始に対
して予め登録しておいた操作キーワード（”ＣＤスター
ト”等）をマイク１５０から入力すれば、ＣＤユニット
１１０ａでのＣＤ再生動作が開始されることになる。For example, the center control unit 1
In 00, a keyword used for registering a start keyword (hereinafter referred to as “registered keyword”) is prepared in advance as a default. Here, for example, the registered keyword is a word such as “register a keyword”. Therefore, if the user wants to use the word “command recognition start” as the start keyword,
When two words, "register keyword" and "command recognition start", are input from the microphone 150, "command recognition start" is registered as a start keyword, and then the user performs voice recognition for CD playback in the CD unit 110a. When the user wants to operate by the function, the user first inputs the start keyword “command recognition start” to the microphone 150.
Then, if an operation keyword (“CD start” or the like) registered in advance for the start of the CD reproduction operation is input from the microphone 150 after the input, the CD reproduction operation in the CD unit 110a starts. Will be done.

【００３４】そこで、上述のようなセンターコントロー
ルユニット１００での動作を実現するために、センター
コントロールユニット１００の内部には、図２に示すよ
うな構成を有するコントローラ１７０が設けられてい
る。Therefore, in order to realize the operation of the center control unit 100 as described above, a controller 170 having a configuration as shown in FIG. 2 is provided inside the center control unit 100.

【００３５】コントローラ１７０は、例えば、ＣＰＵ、
ＲＯＭ、及びＲＡＭを含むマイクロコンピュータシステ
ムから構成され、ＣＰＵがＲＯＭに予め記憶された所定
の処理プログラムを実行することで、上記図２の音声認
識処理部１７１、音声合成処理部１７２、音声出力処理
部１７３、動作制御処理部１７４、及びキー入力処理部
１７５を含む構成を実現している。The controller 170 includes, for example, a CPU,
The microcomputer is constituted by a microcomputer system including a ROM and a RAM. The CPU executes a predetermined processing program stored in the ROM in advance, so that the voice recognition processing unit 171, the voice synthesis processing unit 172, the voice output processing A configuration including a unit 173, an operation control processing unit 174, and a key input processing unit 175 is realized.

【００３６】音声認識処理部１７１は、開始キーワード
の登録を指示するための登録キーワードを認識する登録
キーワード認識部１７１ａと、音声認識の開始を指示す
るための開始キーワードを認識する開始キーワード認識
部１７１ｂと、ユニット部１１０の各種ユニットに対す
る動作指示を表す操作キーワードを認識する操作キーワ
ード認識部１７１ｃとを含んでおり、これらの各認識部
１７１ａ〜１７１ｃによって、マイク１５０から入力さ
れた音声が登録キーワード、開始キーワード、及び操作
キーワードの何れのキーワードであるかを認識する。こ
の認識の方法としては、ここでは次のような方法を一例
として採用している。The speech recognition processing section 171 has a registered keyword recognizing section 171a for recognizing a registered keyword for instructing registration of a start keyword, and a start keyword recognizing section 171b for recognizing a start keyword for instructing start of speech recognition. And an operation keyword recognizing unit 171c for recognizing an operation keyword indicating an operation instruction for various units of the unit unit 110. The voices input from the microphone 150 are registered by the recognizing units 171a to 171c. It recognizes which keyword is the start keyword or the operation keyword. As a method of this recognition, the following method is adopted as an example here.

【００３７】例えば、音声認識処理部１７１は、マイク
１５０から入力された音声を一旦テキスト形式のデータ
の変換してメモリ１７６に記憶する。すなわち、音声認
識処理部１７１は、マイク１５０からの入力音声をその
まま音声データとして記憶するのではなく、１つの言葉
（単語）としてのテキストデータに変換してからメモリ
１７６に記憶する。For example, the voice recognition processing section 171 temporarily converts voice input from the microphone 150 into text format data and stores the data in the memory 176. That is, the voice recognition processing unit 171 does not store the input voice from the microphone 150 as voice data as it is, but converts it into text data as one word and stores it in the memory 176.

【００３８】このとき、メモリ１７６には、予め、登録
キーワードのテキストデータ（以下、「登録キーワード
データ」と言う）、及び各種の操作キーワードのテキス
トデータ（以下、「操作キーワードデータ」と言う）が
設定されている。At this time, the text data of the registered keyword (hereinafter, referred to as “registered keyword data”) and the text data of various operation keywords (hereinafter, referred to as “operation keyword data”) are stored in the memory 176 in advance. Is set.

【００３９】ここでの登録キーワードデータは、”キー
ワードを登録”という音声に対応したテキストデータと
している。また、操作キーワードデータは、例えば、ユ
ニット部１１０のＣＤユニット１１０ａに対する、ＣＤ
再生動作開始を示す”ＣＤスタート”、ＣＤ再生動作終
了を示す”ＣＤストップ”、ＣＤの入れ替え動作を示
す”ＣＤチェンジ”等といった各種動作を示す音声に対
応したテキストデータとしている。The registered keyword data here is text data corresponding to the voice of "register a keyword". The operation keyword data is, for example, a CD corresponding to the CD unit 110a of the unit unit 110.
This is text data corresponding to voices indicating various operations such as "CD start" indicating the start of the reproducing operation, "CD stop" indicating the end of the CD reproducing operation, and "CD change" indicating the switching operation of the CD.

【００４０】尚、ここでは、登録キーワードについて
は、センターコントロールユニット１００自体に対して
出荷時等に予めメモリ１７６に設定されているものとし
ている。一方の操作キーワードについては、センターコ
ントロールユニット１００自体に対して出荷時等に予め
メモリ１７６に設定されているものとしてもよいし、使
用者が実際に使用する前に予め所定の操作を行なうこと
で、任意のキーワード（使用者が使いやすいキーワード
等）を各種動作に対応させて設定できるようにしてもよ
い。また、メモリ１７６に現在設定されている操作キー
ワードのデータを、使用者が任意のキーワードに変更で
きるようにしてもよい。Here, it is assumed that the registered keywords are set in the memory 176 in advance to the center control unit 100 at the time of shipment. One of the operation keywords may be set in the memory 176 in advance at the time of shipment or the like to the center control unit 100 itself, or by performing a predetermined operation in advance before the user actually uses the operation keyword. Alternatively, an arbitrary keyword (such as a keyword that is easy for the user to use) may be set in accordance with various operations. The user may be able to change the data of the operation keyword currently set in the memory 176 to an arbitrary keyword.

【００４１】したがって、音声認識処理部１７１は、メ
モリ１７６に一旦記憶した入力音声のテキストデータ
と、メモリ１７６に予め設定されている各種キーワード
データとを比較することで、上記入力音声が登録キーワ
ードであるか操作キーワードであるかを認識する。Therefore, the speech recognition processing unit 171 compares the text data of the input speech once stored in the memory 176 with various keyword data preset in the memory 176, so that the input speech is a registered keyword. Recognize whether it is an operation keyword.

【００４２】また、音声認識処理部１７１は、上述の登
録キーワードや操作キーワードと同様にして、後述する
処理によって使用者から入力された開始キーワードをテ
キストデータに変換してメモリ１７６に設定（登録）す
るようにもなされているため、その後にマイク１５０か
ら入力された音声については、メモリ１７６に設定され
た開始キーワードデータとのマッチングによって、当該
音声が開始キーワードであるかの認識も行なわれること
になる。The speech recognition processing unit 171 converts the start keyword input by the user into text data by the processing described later and sets it in the memory 176 in the same manner as the above-described registered keyword and operation keyword. Since the voice input from the microphone 150 after that is matched with the start keyword data set in the memory 176, it is also recognized that the voice is the start keyword. Become.

【００４３】音声合成処理部１７２は、音声認識処理部
１７１での認識対象となるキーワード、すなわちマイク
１５０から入力された音声を音声出力処理部１７３を介
してマイク１６０から出力（リピート出力）するための
処理を実行する。The speech synthesis processing unit 172 outputs a keyword to be recognized by the speech recognition processing unit 171, that is, a speech input from the microphone 150 from the microphone 160 via the speech output processing unit 173 (repeat output). Execute the processing of

【００４４】例えば、マイク１５０から”ＣＤスター
ト”という音声が入力された場合、この音声は、音声認
識処理部１７１により、メモリ１７６にテキストデータ
として記憶され、操作キーワードとして認識される。音
声合成処理部１７２は、”ＣＤスタート”という音声を
入力した使用者に対して、当該音声が音声認識処理部１
７１の認識対象となり、操作キーワードとして受け付け
られたことを知らせるために、メモリ１７６に記憶され
た当該音声のデータを、スピーカ１６０からの出力デー
タとして音声出力処理部１７３に与える。音声出力処理
部１７３は、音声合成処理１７２からの出力データを音
声としてスピーカ１６０から出力する。したがって、こ
の場合には、マイク１６０から”ＣＤスタート”という
音声が入力され音声認識処理部１７１で操作キーワード
として受け付けられると、スピーカ１６０から”ＣＤス
タート”という音声がリピート出力されることになる。For example, when a voice "CD start" is input from the microphone 150, the voice is stored as text data in the memory 176 by the voice recognition processing unit 171, and is recognized as an operation keyword. The voice synthesizing unit 172 provides the user with the input of the voice “CD start” the voice to the voice recognition processing unit 1.
The voice data stored in the memory 176 is given to the voice output processing unit 173 as output data from the speaker 160 in order to notify that the voice has been recognized as an operation keyword and has been received as an operation keyword. The audio output processing unit 173 outputs the output data from the audio synthesis processing 172 as audio from the speaker 160. Therefore, in this case, when the voice “CD start” is input from the microphone 160 and is accepted as an operation keyword by the voice recognition processing unit 171, the voice “CD start” is repeatedly output from the speaker 160.

【００４５】また、音声合成処理部１７２は、上述のリ
ピート出力の他、開始キーワードの登録の際の処理手順
を使用者に対して促すためのメッセージや、開始キーワ
ードの登録が終了した際にその旨を使用者に知らせるた
めのメッセージ等を、音声出力処理部１７３を介してス
ピーカ１６０から出力するための処理を実行する。In addition to the above-described repeat output, the speech synthesis processing unit 172 provides a message for prompting the user to perform a processing procedure at the time of registration of the start keyword, and a message when the registration of the start keyword is completed. A process for outputting a message or the like for notifying the user to the effect from the speaker 160 via the audio output processing unit 173 is executed.

【００４６】例えば、マイク１６０から”キーワードを
登録”という音声が入力された場合、この音声は、音声
認識処理部１７１により登録キーワードとして認識され
る。音声合成処理部１７２は、”キーワードを登録”と
いう音声を入力した使用者に対して、次に使用者が行な
うべき処理を知らせるために、”開始キーワードを登録
する準備ができました。開始キーワードを言って下さ
い。”といったメッセージデータを生成し、これをスピ
ーカ１６０からの出力データとして音声出力処理部１７
３に与える。音声出力処理部１７３は、音声合成処理部
１７２からの出力データを音声としてスピーカ１６０か
ら出力する。したがって、この場合には、マイク１６０
から”キーワードを登録”という音声が入力され音声認
識処理部１７１で登録キーワードとして受け付けられる
と、スピーカ１６０から”開始キーワードを登録する準
備ができました。開始キーワードを言って下さい。”と
いう音声が出力されることになる。For example, when a voice "register a keyword" is input from the microphone 160, this voice is recognized by the voice recognition processing unit 171 as a registered keyword. The voice synthesis processing unit 172 is ready to register the “start keyword” in order to notify the user who has input the voice “register keyword” of the process to be performed next by the user. Is generated, and the message data is output as the output data from the speaker 160.
Give to 3. The audio output processing unit 173 outputs the output data from the audio synthesis processing unit 172 as audio from the speaker 160. Therefore, in this case, the microphone 160
When the voice of “register keyword” is input and received as a registered keyword in the voice recognition processing unit 171, the voice “Ready to register the starting keyword. Please say the starting keyword.” Is output from the speaker 160. Will be output.

【００４７】音声出力処理部１７３は、上述したような
音声合成処理部１７２から与えられたデータに対応する
音をスピーカ１６０から出力する他、後述する動作制御
処理部１７４から与えられたデータに対応する音もスピ
ーカ１６０から出力する。The voice output processing unit 173 outputs a sound corresponding to the data provided from the voice synthesis processing unit 172 as described above from the speaker 160, and outputs a sound corresponding to the data provided from the operation control processing unit 174 described later. The sound to be played is also output from the speaker 160.

【００４８】キー入力処理部１７５は、キー操作部１２
０の操作状態を検出して、その検出結果を動作制御処理
部１７４へ与える。The key input processing unit 175 includes the key operation unit 12
The operation state of “0” is detected, and the detection result is provided to the operation control processing unit 174.

【００４９】動作制御処理部１７４は、センターコント
ロールユニット１００全体の動作制御を司るものであ
り、特に、音声認識処理部１７１での入力音声の認識結
果や、キー入力処理部１７５でのキー操作部１２０の操
作状態検出結果に基づいて、センターコントロールユニ
ット１００全体の動作を制御する。The operation control processing section 174 controls the operation of the center control unit 100 as a whole. In particular, the result of recognition of the input voice by the voice recognition processing section 171 and the key operation section by the key input processing section 175 The operation of the entire center control unit 100 is controlled based on the operation state detection result of 120.

【００５０】例えば、マイク１５０から”ＣＤスター
ト”という音声が入力された場合、この音声は、音声認
識処理部１７１により操作キーワードとして認識され
る。動作制御処理部１７４は、音声認識処理部１７１の
認識結果により、”ＣＤスタート”という操作キーワー
ドがマイク１５０から入力されたことを把握すると、当
該操作キーワードに対応した制御コマンド（ＣＤユニッ
ト１１０ａでのＣＤ再生動作開始を指示する制御コマン
ド）を生成し、これをドライバ１４０に与える。ドライ
バ１４０は、動作制御処理部１７４からの制御コマンド
に従って、ＣＤユニット１１０ａを駆動する。これによ
り、ＣＤユニット１１０ａでは、ＣＤ再生動作が開始す
る。このとき、動作制御処理部１７４は、必要に応じ
て、ＣＤユニット１１０ａで再生されているトラック情
報をモニタ１３０で表示するための動作制御等も行な
う。For example, when a voice "CD start" is input from the microphone 150, this voice is recognized by the voice recognition processing unit 171 as an operation keyword. When the operation control processing unit 174 recognizes that the operation keyword “CD start” has been input from the microphone 150 based on the recognition result of the speech recognition processing unit 171, the operation control processing unit 174 controls the control command (the CD unit 110a) corresponding to the operation keyword. A control command for instructing the start of the CD reproduction operation is generated, and the generated control command is given to the driver 140. The driver 140 drives the CD unit 110a according to a control command from the operation control processing unit 174. As a result, the CD playback operation is started in the CD unit 110a. At this time, the operation control processing unit 174 also performs operation control for displaying the track information reproduced by the CD unit 110a on the monitor 130, if necessary.

【００５１】また、キー操作部１２０にてＣＤ再生動作
開始を指示するためのキーが操作された場合、この操作
状態は、キー入力処理部１７５により検出される。動作
制御処理部１７４は、キー入力処理部１７５の検出結果
に対応した制御コマンド（ＣＤユニット１１０ａでのＣ
Ｄ再生動作開始を指示する制御コマンド）を生成し、こ
れをドライバ１４０に与える。ドライバ１４０は、動作
制御処理部１７４からの制御コマンドに従って、ＣＤユ
ニット１１０ａを駆動する。これにより、ＣＤユニット
１１０ａでは、ＣＤ再生動作が開始する。このときも、
動作制御処理部１７４は、必要に応じて、ＣＤユニット
１１０ａで再生されているトラック情報をモニタ１３０
で表示するための動作制御等も行なう。When a key for instructing the start of the CD reproduction operation is operated by the key operation section 120, this operation state is detected by the key input processing section 175. The operation control processing unit 174 receives a control command (C in the CD unit 110a) corresponding to the detection result of the key input processing unit 175.
A control command for instructing the start of the D reproduction operation is generated, and the generated control command is given to the driver 140. The driver 140 drives the CD unit 110a according to a control command from the operation control processing unit 174. As a result, the CD playback operation is started in the CD unit 110a. Again,
The operation control processing unit 174 monitors the track information reproduced by the CD unit 110a as necessary.
Also, operation control for displaying the information is performed.

【００５２】上述のようなコントローラ１７０を有する
センターコントロールユニット１００では、音声認識を
開始するための開始キーワードを、登録キーワードを音
声入力することによって登録することも可能であり、キ
ー操作部１２０に設けられたキーワード登録キー１２０
ａの操作によっても登録することが可能となっている。
以下、これらの２パターンの開始キーワードの登録処
理、及びその登録後の操作キーワードの入力によるユニ
ット部１１０に対する動作指示処理について、図３及び
図４に示すフローチャートを用いて説明する。In the center control unit 100 having the controller 170 as described above, a start keyword for starting speech recognition can be registered by inputting a registered keyword by voice. Keyword registration key 120
It is also possible to register by the operation of a.
Hereinafter, the registration processing of these two patterns of start keywords and the operation instruction processing for the unit unit 110 by inputting the operation keywords after the registration will be described with reference to the flowcharts shown in FIGS.

【００５３】（１）開始キーワードを登録キーワードの
音声入力により登録する場合（図３参照）(1) When the start keyword is registered by voice input of the registered keyword (see FIG. 3)

【００５４】ステップＳ２０１：先ず、センターコント
ロールユニット１００本体の電源がＯＮされると、マイ
ク１５０からの音声入力待ち状態となる。この状態にお
いて、音声認識処理部１７１は、マイク１５０から音声
が入力されたか否かを判別する。この判別の結果、マイ
ク１５０から音声が入力された場合には次のステップＳ
２０１からの処理に進み、そうでない場合にはそのまま
音声入力待ち状態となる。尚、センターコントロールユ
ニット１００本体の電源ＯＮ状態とは、ユニット部１１
０の各種ユニットが動作可能な状態であり、車載ユニッ
トであればＡＣＣのＯＮ状態を示す。Step S201: First, when the power of the main body of the center control unit 100 is turned on, the apparatus enters a state of waiting for a voice input from the microphone 150. In this state, the voice recognition processing unit 171 determines whether or not voice has been input from the microphone 150. If the result of this determination is that speech has been input from the microphone 150, the next step S
The process proceeds to step 201, and if not, the process directly waits for a voice input. The power ON state of the main body of the center control unit 100 means that the unit 11
0 indicates that the various units are operable. If the unit is a vehicle-mounted unit, the ACC indicates an ON state.

【００５５】ステップＳ２０２：マイク１５０から音声
が入力されると、音声認識処理部１７１は、登録キーワ
ード認識部１７１ａによって、その入力音声のテキスト
データと、メモリ１７６に予め設定されている登録キー
ワードとを比較することで、当該入力音声が登録キーワ
ード（”キーワードを登録”等）であるか否かを判別す
る。この判別の結果、入力音声が登録キーワードである
場合には次のステップＳ２０３からの処理に進み、そう
でない場合には後述するステップＳ２０８からの処理に
進む。Step S202: When a voice is input from the microphone 150, the voice recognition processing unit 171 uses the registered keyword recognition unit 171a to convert the text data of the input voice and the registered keyword preset in the memory 176. By performing the comparison, it is determined whether or not the input voice is a registered keyword (eg, “register a keyword”). If the result of this determination is that the input speech is a registered keyword, the process proceeds to the next step S203; otherwise, the process proceeds to a step S208 described later.

【００５６】ステップＳ２０３：ステップＳ２０２の判
別の結果、入力音声が登録キーワードである場合、すな
わち登録キーワードの入力が認識された場合、音声認識
処理部１７１は、その旨を動作制御処理部１７４へ通知
する。これを受けた動作制御処理部１７４は、ユニット
部１１０のＣＤユニット１１０ａやＭＤユニット等が動
作している場合、その動作しているユニットに対してＭ
ＵＴＥをかける。これにより、使用者の声以外のものか
らの音（ＣＤ、ＭＤの再生により出力されている音、ラ
ジオの音等）の反応を防ぐことができる。また、動作制
御処理部１７４は、登録キーワードが入力されてから所
定時間（例えば、５秒）内に入力された開始キーワード
を有効とするためのタイマ（図示せず）をセットする。
尚、このとき、動作制御処理部１７４により、使用者に
対して、当該登録キーワードを受け付け開始キーワード
を登録できる状態となったことを知らせるようにしても
よい。具体的には例えば、動作制御処理部１７４が、音
声出力処理部１７３により、アラーム音をスピーカ１６
０から発生させたり、”開始キーワードを登録する準備
ができました。開始キーワードを言って下さい。”とい
ったメッセージ音をスピーカ１６０から発生させたりす
る。この場合、開始キーワードの入力を有効とする所定
時間については、上記のアラーム音やメッセージの出力
時間を考慮した時間を設定するようにする。Step S203: If the result of the determination in step S202 is that the input voice is a registered keyword, that is, if the input of the registered keyword has been recognized, the voice recognition processing unit 171 notifies the operation control processing unit 174 of this fact. I do. When the operation control processing unit 174 receives this, when the CD unit 110a, the MD unit, or the like of the unit unit 110 is operating, the operation control processing unit 174 sends the M to the operating unit.
Apply UTE. As a result, it is possible to prevent a reaction from a sound other than the user's voice (a sound output from reproduction of a CD or MD, a sound of a radio, or the like). Further, the operation control processing unit 174 sets a timer (not shown) for validating the start keyword input within a predetermined time (for example, 5 seconds) after the input of the registered keyword.
At this time, the operation control processing unit 174 may notify the user that the registered keyword has been accepted and the start keyword can be registered. Specifically, for example, the operation control processing unit 174 causes the sound output processing unit 173 to output an alarm sound to the speaker 16.
0 or a message sound such as "Ready to register start keyword. Please say start keyword." In this case, as the predetermined time for which the input of the start keyword is valid, a time is set in consideration of the output time of the alarm sound and the message.

【００５７】ステップＳ２０４：ステップＳ２０３によ
りユニット部１１０がＭＵＴＥ状態となると、マイク１
５０からの音声入力待ち状態となる。このとき、動作制
御処理部１７４は、例えば、音声入力待ち状態であるこ
とを示すメッセージをモニタ１３０へ表示させる。そし
て、音声認識処理部１７１は、マイク１５０から音声が
入力されたか否かを判別する。この判別の結果、マイク
１５０から音声が入力された場合にはステップＳ２０６
の処理に進み、そうでない場合にはステップＳ２０５の
処理に進む。Step S204: When the unit 110 enters the MUTE state in step S203, the microphone 1
It is in a state of waiting for a voice input from 50. At this time, the operation control processing unit 174 causes the monitor 130 to display, for example, a message indicating that it is in a voice input waiting state. Then, the voice recognition processing unit 171 determines whether or not voice has been input from the microphone 150. If the result of this determination is that speech has been input from the microphone 150, step S206
Otherwise, the process proceeds to step S205.

【００５８】ステップＳ２０５：ステップＳ２０４の判
別の結果、マイク１５０から音声が入力されない場合、
動作制御処理部１７４は、ステップＳ２０３にて設定し
たタイマにより、登録キーワードが入力されユニット部
１１０に対してＭＵＴＥをかけてから一定時間経過した
か否かを判別する。この判別の結果、一定時間を経過し
ていない場合には再びステップＳ２０４へと戻って音声
入力待ち状態となり、一定時間を経過した場合にはその
まま後述するステップＳ２０７へと進む。Step S205: If no voice is input from the microphone 150 as a result of the determination in step S204,
The operation control processing unit 174 determines whether or not a predetermined time has elapsed since the registered keyword was input and MUTE was applied to the unit unit 110 using the timer set in step S203. As a result of this determination, if the predetermined time has not elapsed, the flow returns to step S204 again to wait for voice input, and if the predetermined time has elapsed, the flow proceeds directly to step S207 described later.

【００５９】ステップＳ２０６：ステップＳ２０４の判
別の結果、マイク１５０から音声が入力された場合、音
声認識処理部１７１は、開始キーワード認識部１７１ｂ
により、当該音声を開始キーワードデータとしてメモリ
１７６に記憶（登録）する。そして、音声合成処理部１
７２は、音声出力処理部１７３により、音声認識処理部
１７１によって登録された開始キーワードをスピーカ１
６０から出力する。したがって、マイク１５０から入力
された、使用者が開始キーワードとして使用したい言葉
（”コマンド認識開始”等）の音声が、スピーカ１６０
からリピート出力される。これにより使用者は、自分が
入力した言葉が開始キーワードとして受け付けられたこ
とを把握することができる。Step S206: If the result of determination in step S204 is that speech has been input from the microphone 150, the speech recognition processing section 171 starts the start keyword recognition section 171b.
Thus, the voice is stored (registered) in the memory 176 as start keyword data. Then, the speech synthesis processing unit 1
72, the voice output processing unit 173 transmits the start keyword registered by the voice recognition processing unit 171 to the speaker 1;
Output from 60. Therefore, the voice of the word (“start command recognition” or the like) that the user wants to use as the start keyword, input from the microphone 150, is output from the speaker 160.
Is output repeatedly. This allows the user to know that the word entered by the user has been accepted as the start keyword.

【００６０】ステップＳ２０７：ステップＳ２０６によ
り、開始キーワードの登録が終了すると、動作制御処理
部１７４は、ステップＳ２０３にてユニット部１１０へ
かけたＭＵＴＥを解除する。そして、動作制御処理部１
７４は、音声出力処理部１７３により、”開始キーワー
ドを登録しました”といったメッセージをスピ−カ１６
０から出力させる。また、ステップＳ２０５のタイムア
ウトの判別の結果、登録キーワードが入力されユニット
部１１０に対してＭＵＴＥをかけてから一定時間内に音
声の入力がなかった場合、動作制御処理部１７４は、ス
テップＳ２０３にてユニット部１１０へかけたＭＵＴＷ
を解除し、音声出力処理部１７３により、例えば、”開
始キーワードの受付を終了しました”といったメッセー
ジや、”ばいばい”といったメッセージをスピ−カ１６
０から出力させる。その後、ステップＳ２０１へと戻
り、再び音声入力待ち状態となる。Step S207: When the registration of the start keyword is completed in step S206, the operation control processing section 174 releases the MUTE applied to the unit section 110 in step S203. Then, the operation control processing unit 1
Numeral 74 denotes a message such as "registered start keyword" by the voice output processing unit 173.
Output from 0. In addition, as a result of the determination of the timeout in step S205, when the registered keyword is input and no voice is input within a predetermined time after the MUTE is applied to the unit unit 110, the operation control processing unit 174 determines in step S203 MUTW applied to the unit 110
Is canceled, and a message such as "acceptance of the start keyword has been completed" or a message such as "bye bye" is output by the voice output processing unit 173.
Output from 0. After that, the process returns to step S201, and again enters the voice input waiting state.

【００６１】ステップＳ２０８：一方、ステップＳ２０
２の判別の結果、入力音声が登録キーワードでない場
合、音声認識処理部１７１は、開始キーワード認識部１
７１ｂによって、その入力音声のテキストデータと、上
述したステップＳ２０６にてメモリ１７６に登録された
開始キーワードとを比較することで、当該入力音声が開
始キーワード（”コマンド認識開始”等）であるか否か
を判別する。この判別の結果、入力音声が開始キーワー
ドである場合には次のステップＳ２０９からの処理に進
み、そうでない場合にはステップＳ２０１へと戻って再
び音声入力待ち状態となる。Step S208: On the other hand, step S20
If the result of determination in step 2 is that the input voice is not a registered keyword, the voice recognition processing unit
71b, by comparing the text data of the input voice with the start keyword registered in the memory 176 in the above-described step S206, whether or not the input voice is a start keyword (“command recognition start” or the like) is determined. Is determined. As a result of this determination, if the input voice is the start keyword, the process proceeds to the next step S209, and if not, the process returns to step S201 to again wait for voice input.

【００６２】ステップＳ２０９：ステップＳ２０８の判
別の結果、入力音声が開始キーワードである場合、すな
わち開始キーワードの入力が認識された場合、音声認識
処理部１７１は、その旨を動作制御処理部１７４へ通知
する。これを受けた動作制御処理部１７４は、上述した
ステップＳ２０３と同様に、ユニット部１１０のＣＤユ
ニット１１０ａやＭＤユニット等が動作している場合、
その動作しているユニットに対してＭＵＴＥをかける。
また、動作制御処理部１７４は、開始キーワードが入力
されてから所定時間（例えば、５秒）内に入力された操
作キーワードを有効とするためのタイマ（図示せず）を
セットする。尚、このとき、動作制御処理部１７４が、
使用者に対して、当該開始キーワードを受け付け操作キ
ーワードを入力できる状態となったことを知らせるよう
にしてもよい。具体的には例えば、動作制御処理部１７
４が、音声出力処理部１７３により、”操作キーワード
を言って下さい”或いは”何にしますか？”といったメ
ッセージや、本ユニットが呼ばれたものとしてのその返
事のように”はい、何ですか？”といったメッセージを
スピーカ１６０から発生させたりする。この場合、操作
キーワードの入力を有効とする所定時間については、上
記のメッセージの出力時間を考慮した時間を設定するよ
うにする。Step S209: If the result of the determination in step S208 is that the input speech is the start keyword, that is, if the input of the start keyword has been recognized, the speech recognition processing unit 171 notifies the operation control processing unit 174 of that fact. I do. The operation control processing unit 174 that has received this, when the CD unit 110a, the MD unit, or the like of the unit unit 110 is operating, as in step S203 described above,
Apply MUTE to the operating unit.
Further, the operation control processing unit 174 sets a timer (not shown) for validating the operation keyword input within a predetermined time (for example, 5 seconds) after the input of the start keyword. At this time, the operation control processing unit 174
The user may be notified that the start keyword is accepted and the operation keyword can be input. Specifically, for example, the operation control processing unit 17
4. The voice output processing unit 173 displays a message such as "Please say the operation keyword" or "What do you want to do?" Or a reply as if this unit was called "Yes, what is it?" Or a message such as "?" In this case, as the predetermined time for which the input of the operation keyword is valid, a time is set in consideration of the output time of the message.

【００６３】ステップＳ２１０：ステップＳ２０９によ
りユニット部１１０がＭＵＴＥ状態となると、マイク１
５０からの操作キーワードの入力待ち状態となる。この
とき、動作制御処理部１７４は、例えば、操作キーワー
ド入力待ち状態であることを示すメッセージをモニタ１
３０へ表示させる。そして、音声認識処理部１７１は、
マイク１５０から操作キーワードの音声が入力されたか
否かを判別する。すなわち、音声認識処理部１７１は、
操作キーワード認識部１７１ｃにより、マイク１５０か
ら入力された音声のテキストデータと、メモリ１７６に
予め設定されている各種操作キーワードとを比較するこ
とで、当該入力音声が操作キーワードであるか否かを判
別する。この判別の結果、操作キーワードが入力された
場合にはステップＳ２１２の処理に進み、そうでない場
合にはステップＳ２１１の処理に進む。Step S210: When the unit section 110 enters the MUTE state in step S209, the microphone 1
It is in a state of waiting for the input of the operation keyword from 50. At this time, for example, the operation control processing unit 174 monitors the monitor 1 for a message indicating that the operation keyword input is waiting.
30 is displayed. Then, the voice recognition processing unit 171
It is determined whether or not the voice of the operation keyword has been input from microphone 150. That is, the voice recognition processing unit 171
The operation keyword recognition unit 171c compares the text data of the voice input from the microphone 150 with various operation keywords preset in the memory 176 to determine whether the input voice is the operation keyword. I do. If the result of this determination is that an operation keyword has been input, the flow proceeds to the processing in step S212; otherwise, the flow proceeds to the processing in step S211.

【００６４】ステップＳ２１１：ステップＳ２１０の判
別の結果、マイク１５０から操作キーワードが入力され
ていない場合、動作制御処理部１７４は、ステップＳ２
０９にて設定したタイマにより、開始キーワードが入力
されユニット部１１０に対してＭＵＴＥをかけてから一
定時間経過したか否かを判別する。この判別の結果、一
定時間を経過していない場合には再びステップＳ２１０
へと戻って操作キーワードの入力待ち状態となり、一定
時間を経過した場合にはそのまま後述するステップＳ２
１３へと進む。Step S211: If the result of determination in step S210 is that no operation keyword has been input from the microphone 150, the operation control processing unit 174 proceeds to step S2.
The timer set in step 09 determines whether a predetermined time has elapsed since the start keyword was input and MUTE was applied to the unit 110. If the result of this determination is that the fixed time has not elapsed, step S210 is performed again.
Then, the process returns to step S2 to wait for an input of an operation keyword.
Proceed to 13.

【００６５】ステップＳ２１２：ステップＳ２１０の判
別の結果、マイク１５０から操作キーワードが入力され
た場合、音声認識処理部１７１は、当該操作キーワード
を示すデータを音声合成処理部１７２及び動作制御処理
部１７４へ供給する。これを受けた音声合成処理部１７
２は、音声出力処理部１７３により、上記操作キーワー
ドをスピーカ１６０から出力させる。したがって、マイ
ク１５０から入力された操作キーワードの音声が、スピ
ーカ１６０からリピート出力される。これにより使用者
は、自分が入力した操作キーワードが受け付けられたこ
とを把握することができる。また、動作制御処理部１７
４は、上記操作キーワードに対応する制御コマンドをド
ライバ１４０へ供給する。例えば、上記操作キーワード
が、ＣＤユニット１１０ａでのＣＤ再生動作の開始を示
す”ＣＤスタート”であった場合、動作制御処理部１７
４は、ＣＤ再生動作開始を示す制御コマンドを生成し、
これをドライバ１４０へ供給する。ドライバ１４０は、
動作制御処理部１７４からの制御コマンドにより、ＣＤ
ユニット１１０ａでのＣＤ再生動作を開始させる。さら
に、動作制御処理部１７４は、上記制御コマンドに基づ
く動作によって発生する情報（ＣＤの再生音やＣＤの再
生トラックの情報等）を、スピーカ１６０やモニタ１３
０から出力するための動作制御も行う。Step S212: If the result of determination in step S210 is that an operation keyword has been input from the microphone 150, the speech recognition processing unit 171 sends data indicating the operation keyword to the speech synthesis processing unit 172 and the operation control processing unit 174. Supply. Speech synthesis processing unit 17 receiving this
2 causes the audio output processing unit 173 to output the operation keyword from the speaker 160. Therefore, the voice of the operation keyword input from microphone 150 is repeatedly output from speaker 160. This allows the user to know that the operation keyword input by the user has been accepted. The operation control processing unit 17
4 supplies a control command corresponding to the operation keyword to the driver 140. For example, when the operation keyword is “CD start” indicating the start of the CD reproduction operation in the CD unit 110a, the operation control processing unit 17
4 generates a control command indicating the start of CD playback operation,
This is supplied to the driver 140. The driver 140
According to a control command from the operation control processing unit 174, the CD
The unit 110a starts the CD playback operation. Further, the operation control processing unit 174 transmits information (such as CD reproduction sound and CD reproduction track information) generated by the operation based on the control command to the speaker 160 and the monitor 13.
Operation control for outputting from 0 is also performed.

【００６６】ステップＳ２１３：ステップＳ２１２によ
り、操作キーワードに対応する処理（コマンド処理）が
終了すると、動作制御処理部１７４は、ステップＳ２０
９にてユニット部１１０へかけたＭＵＴＥを解除する。
また、ステップＳ２１１のタイムアウトの判別の結果、
開始キーワードが入力されユニット部１１０に対してＭ
ＵＴＥをかけてから一定時間内に操作キーワードの入力
がなかった場合、動作制御処理部１７４は、ステップＳ
２０９にてユニット部１１０へかけたＭＵＴＥを解除
し、音声出力処理部１７３により、例えば、”操作キー
ワードの受付を終了しました”といったメッセージ
や、”ばいばい”といったメッセージをスピ−カ１６０
から出力させる。その後、ステップＳ２０１へと戻り、
再び音声入力待ち状態となる。Step S213: When the processing (command processing) corresponding to the operation keyword is completed in step S212, the operation control processing unit 174 proceeds to step S20.
At 9, the MUTE applied to the unit 110 is released.
Also, as a result of the determination of the timeout in step S211,
The start keyword is input, and M
If the operation keyword has not been input within a certain period of time since the user entered UTE, the operation control processing unit 174 proceeds to step S
At step 209, the MUTE applied to the unit 110 is released, and the voice output processing unit 173 outputs a message such as "the operation keyword has been accepted" or a message "bye".
Output from Then, returning to step S201,
The voice input wait state is set again.

【００６７】（２）開始キーワードをキー操作により登
録する場合（図４参照）(2) When starting keyword is registered by key operation (see FIG. 4)

【００６８】ステップＳ３０１：先ず、センターコント
ロールユニット１００本体の電源がＯＮされると、マイ
ク１５０からの音声入力待ち状態となる。この状態にお
いて、音声認識処理部１７１は、マイク１５０から音声
が入力されたか否かを判別する。この判別の結果、マイ
ク１５０から音声が入力された場合には後述するステッ
プＳ３０１からの処理に進み、そうでない場合には次の
ステップＳ３０８からの処理に進む。尚、センターコン
トロールユニット１００本体の電源ＯＮ状態とは、ユニ
ット部１１０の各種ユニットが動作可能な状態であり、
車載ユニットであればＡＣＣのＯＮ状態を示す。Step S301: First, when the power of the main body of the center control unit 100 is turned on, the apparatus enters a state of waiting for a voice input from the microphone 150. In this state, the voice recognition processing unit 171 determines whether or not voice has been input from the microphone 150. As a result of the determination, if a voice is input from the microphone 150, the process proceeds to step S301, which will be described later; otherwise, the process proceeds to the next step S308. The power ON state of the center control unit 100 is a state in which various units of the unit section 110 can operate.
In the case of an in-vehicle unit, it indicates the ON state of ACC.

【００６９】ステップＳ３０８：ステップＳ３０１の判
別の結果、マイク１５０からの音声入力が無い場合、動
作制御処理部１７４は、キー入力処理部１７５からのキ
ー操作部１２０の操作状態検出結果により、キー操作部
１２０のキーワード登録キー１２０ａが操作されたか否
かを判別する。この判別の結果、キーワード登録キー１
２０ａが操作された場合には次のステップＳ３０９から
の処理に進み、そうでない場合にはステップＳ３０１へ
戻って再び音声入力待ち状態となる。Step S308: As a result of the determination in step S301, when there is no voice input from the microphone 150, the operation control processing unit 174 determines the key operation based on the operation state detection result of the key operation unit 120 from the key input processing unit 175. It is determined whether or not the keyword registration key 120a of the unit 120 has been operated. As a result of this determination, the keyword registration key 1
If the button 20a has been operated, the process proceeds from the next step S309, and if not, the process returns to step S301 to wait for a voice input again.

【００７０】ステップＳ３０９：ステップＳ３０８の判
別の結果、キー操作部１２０のキーワード登録キー１２
０ａが操作された場合、すなわち開始キーワードを登録
することを指示された場合、動作制御処理部１７４は、
ユニット部１１０のＣＤユニット１１０ａやＭＤユニッ
ト等が動作している場合にはその動作しているユニット
に対してＭＵＴＥをかける。これにより、使用者の声以
外のものからの音（ＣＤの再生により出力されている
音、ラジオの音等）への反応を防ぐことができる。ま
た、動作制御処理部１７４は、キー操作部１２０のキー
ワード登録キー１２０ａが操作されてから所定時間（例
えば、５秒）内に入力された開始キーワードを有効とす
るためのタイマ（図示せず）をセットする。尚、このと
き、動作制御処理部１７４が、使用者に対して、当該キ
ーワード登録キー１２０ａ１２０ａによる指示を受け付
け開始キーワードを登録できる状態となったことを知ら
せるようにしてもよい。具体的には例えば、動作制御処
理部１７４が、音声出力処理部１７３により、アラーム
音をスピーカ１６０から発生させたり、”開始キーワー
ドを登録する準備ができました。開始キーワードを言っ
て下さい。”といったメッセージをスピーカ１６０から
発生させたりする。この場合、開始キーワードの入力を
有効とする所定時間については、上記のアラーム音やメ
ッセージの出力時間を考慮した時間を設定するようにす
る。Step S309: As a result of the determination in step S308, the keyword registration key 12 of the key operation unit 120
When 0a is operated, that is, when an instruction to register a start keyword is issued, the operation control processing unit 174
When the CD unit 110a, the MD unit or the like of the unit unit 110 is operating, MUTE is applied to the operating unit. As a result, it is possible to prevent a reaction to a sound (a sound output by reproducing a CD, a radio sound, or the like) from a sound other than the user's voice. The operation control processing unit 174 includes a timer (not shown) for validating a start keyword input within a predetermined time (for example, 5 seconds) after the keyword registration key 120a of the key operation unit 120 is operated. Is set. At this time, the operation control processing unit 174 may notify the user that the instruction by the keyword registration keys 120a and 120a has been accepted and the start keyword can be registered. Specifically, for example, the operation control processing unit 174 causes the audio output processing unit 173 to generate an alarm sound from the speaker 160, or "Ready to register a start keyword. Please say the start keyword." Is generated from the speaker 160. In this case, as the predetermined time for which the input of the start keyword is valid, a time is set in consideration of the output time of the alarm sound and the message.

【００７１】ステップＳ３１０：ステップＳ３０９によ
りユニット部１１０がＭＵＴＥ状態となると、マイク１
５０からの音声入力待ち状態となる。このとき、動作制
御処理部１７４は、例えば、音声入力待ち状態であるこ
とを示すメッセージをモニタ１３０へ表示させる。そし
て、音声認識処理部１７１は、マイク１５０から音声が
入力されたか否かを判別する。この判別の結果、マイク
１５０から音声が入力された場合にはステップＳ３１２
の処理に進み、そうでない場合にはステップＳ３１１の
処理に進む。Step S310: When the unit section 110 enters the MUTE state in step S309, the microphone 1
It is in a state of waiting for a voice input from 50. At this time, the operation control processing unit 174 causes the monitor 130 to display, for example, a message indicating that it is in a voice input waiting state. Then, the voice recognition processing unit 171 determines whether or not voice has been input from the microphone 150. If the result of this determination is that speech has been input from the microphone 150, step S312
Otherwise, the process proceeds to step S311.

【００７２】ステップＳ３１１：ステップＳ３１０の判
別の結果、マイク１５０から音声が入力されない場合、
動作制御処理部１７４は、ステップＳ３０９にて設定し
たタイマにより、キーワード登録キー１２０ａが操作さ
れユニット部１１０に対してＭＵＴＥをかけてから一定
時間経過したか否かを判別する。この判別の結果、一定
時間を経過していない場合には再びステップＳ３１０へ
と戻って音声入力待ち状態となり、一定時間を経過した
場合にはそのまま後述するステップＳ３１３へと進む。Step S311: If no voice is input from the microphone 150 as a result of the determination in step S310,
The operation control processing unit 174 determines whether or not a predetermined time has elapsed since the keyword registration key 120a was operated and MUTE was applied to the unit unit 110 by using the timer set in step S309. If the result of this determination is that the predetermined time has not elapsed, the flow returns to step S310 to wait for voice input, and if the predetermined time has elapsed, the flow proceeds directly to step S313 described below.

【００７３】ステップＳ３１２：ステップＳ３１０の判
別の結果、マイク１５０から音声が入力された場合、音
声認識処理部１７１は、開始キーワード認識部１７１ｂ
により、当該音声を開始キーワードデータとしてメモリ
１７６に記憶（登録）する。そして、音声合成処理部１
７２は、音声出力処理部１７３により、音声認識処理部
１７１によって登録された開始キーワードをスピーカ１
６０から出力する。したがって、マイク１５０から入力
された、使用者が開始キーワードとして使用したい言葉
（”コマンド認識開始”等）の音声が、スピーカ１６０
からリピート出力される。これにより使用者は、自分が
入力した言葉が開始キーワードとして受け付けられたこ
とを把握することができる。Step S312: As a result of the determination in step S310, when a voice is input from the microphone 150, the voice recognition processing section 171 starts the start keyword recognition section 171b.
Thus, the voice is stored (registered) in the memory 176 as start keyword data. Then, the speech synthesis processing unit 1
72, the voice output processing unit 173 transmits the start keyword registered by the voice recognition processing unit 171 to the speaker 1;
Output from 60. Therefore, the voice of the word (“start command recognition” or the like) that the user wants to use as the start keyword, input from the microphone 150, is output from the speaker 160.
Is output repeatedly. This allows the user to know that the word entered by the user has been accepted as the start keyword.

【００７４】ステップＳ３１３：ステップＳ３１２によ
り、開始キーワードの登録が終了すると、動作制御処理
部１７４は、ステップＳ３０９にてユニット部１１０へ
かけたＭＵＴＥを解除する。そして、動作制御処理部１
７４は、音声出力処理部１７３により、”開始キーワー
ドを登録しました”といったメッセージをスピ−カ１６
０から出力させる。また、ステップＳ３１１のタイムア
ウトの判別の結果、キーワード登録キー１２０ａが操作
されユニット部１１０に対してＭＵＴＥをかけてから一
定時間内に音声の入力がなかった場合、動作制御処理部
１７４は、ステップＳ３０９にてユニット部１１０へか
けたＭＵＴＥを解除し、音声出力処理部１７３により、
例えば、”開始キーワードの受付を終了しました”とい
ったメッセージや、”ばいばい”といったメッセージを
スピ−カ１６０から出力させる。その後、ステップＳ３
０１へと戻り、再び音声入力待ち状態となる。Step S313: When the registration of the start keyword is completed in step S312, the operation control processing section 174 releases the MUTE applied to the unit section 110 in step S309. Then, the operation control processing unit 1
Numeral 74 denotes a message such as "registered start keyword" by the voice output processing unit 173.
Output from 0. Also, as a result of the determination of the timeout in step S311, if the keyword registration key 120a is operated and MUTE is not applied to the unit unit 110, and there is no voice input within a predetermined time, the operation control processing unit 174 proceeds to step S309. The MUTE applied to the unit unit 110 is released at, and the audio output processing unit 173 outputs
For example, a message such as "the reception of the start keyword has been completed" or a message such as "bye" is output from the speaker 160. Then, step S3
The process returns to 01, and again enters the voice input waiting state.

【００７５】ステップＳ３０２：ステップＳ３０１の判
別の結果、マイク１５０から音声の入力があった場合、
音声認識処理部１７１は、開始キーワード認識部１７１
ｂによって、その入力音声のテキストデータと、上述し
たステップＳ２０６にてメモリ１７６に登録された開始
キーワードとを比較することで、当該入力音声が開始キ
ーワード（”コマンド認識開始”等）であるか否かを判
別する。この判別の結果、入力音声が開始キーワードで
ある場合には次のステップＳ３０３からの処理に進む。
一方、入力音声が開始キーワードでない場合、すなわち
音声が入力されがその音声が開始キーワードでない場
合、図５に示すように、入力音声が登録キーワードであ
るか否かをチェックするために、上記図３に示したよう
なステップＳ２０２からの処理を実行する。その後、上
記図４のステップＳ３０１へと戻って再び音声入力待ち
状態となる。これにより、開始キーワードの登録を、登
録キーワードの音声入力によっても、後述するステップ
Ｓ３０９からの処理により、キーワード登録キー１２０
ａの操作によっても行なうことができる。Step S302: If the result of determination in step S301 is that a voice has been input from the microphone 150,
The speech recognition processing unit 171 includes a start keyword recognition unit 171.
b, by comparing the text data of the input voice with the start keyword registered in the memory 176 in step S206 described above, whether or not the input voice is the start keyword (“command recognition start” or the like) Is determined. If the result of this determination is that the input speech is the start keyword, the process proceeds to the next step S303.
On the other hand, if the input voice is not the start keyword, that is, if the voice is input but the voice is not the start keyword, as shown in FIG. 5, in order to check whether the input voice is a registered keyword, as shown in FIG. The processing from step S202 shown in FIG. After that, the process returns to step S301 in FIG. Thus, the registration of the start keyword can be performed by inputting the registered keyword by voice or by performing the processing from step S309 described later.
It can also be performed by the operation of a.

【００７６】ステップＳ３０３：ステップＳ３０２の判
別の結果、入力音声が開始キーワードである場合、すな
わち開始キーワードの入力が認識された場合、音声認識
処理部１７１は、その旨を動作制御処理部１７４へ通知
する。これを受けた動作制御処理部１７４は、上述した
ステップＳ３０９と同様に、ユニット部１１０のＣＤユ
ニット１１０ａやＭＤユニット等が動作している場合に
はその動作しているユニットに対してＭＵＴＥをかけ
る。また、動作制御処理部１７４は、開始キーワードが
入力されてから所定時間（例えば、５秒）内に入力され
た操作キーワードを有効とするためのタイマ（図示せ
ず）をセットする。尚、このとき、上述したステップＳ
２０９と同様に、動作制御処理部１７４が、使用者に対
して、当該開始キーワードを受け付け操作キーワードを
入力できる状態となったことを知らせるようにしてもよ
い。Step S303: If the input speech is the start keyword as a result of the determination in step S302, that is, if the input of the start keyword is recognized, the speech recognition processing unit 171 notifies the operation control processing unit 174 of the fact. I do. When the operation control processing unit 174 receives this, similarly to step S309 described above, when the CD unit 110a, the MD unit, or the like of the unit unit 110 is operating, the operation control processing unit 174 applies MUTE to the operating unit. . Further, the operation control processing unit 174 sets a timer (not shown) for validating the operation keyword input within a predetermined time (for example, 5 seconds) after the input of the start keyword. At this time, the above-described step S
Similarly to 209, the operation control processing unit 174 may notify the user that the start keyword is accepted and the operation keyword can be input.

【００７７】ステップＳ３０４：ステップＳ３０３によ
りユニット部１１０がＭＵＴＥ状態となると、マイク１
５０からの操作キーワードの入力待ち状態となる。この
とき、動作制御処理部１７４は、例えば、操作キーワー
ド入力待ち状態であることを示すメッセージをモニタ１
３０へ表示させる。そして、音声認識処理部１７１は、
マイク１５０から操作キーワードの音声が入力されたか
否かを判別する。すなわち、音声認識処理部１７１は、
操作キーワード認識部１７１ｃにより、マイク１５０か
ら入力された音声のテキストデータと、メモリ１７６に
予め設定されている各種操作キーワードとを比較するこ
とで、当該入力音声が操作キーワードであるか否かを判
別する。この判別の結果、操作キーワードが入力された
場合にはステップＳ３０６の処理に進み、そうでない場
合にはステップＳ３０５の処理に進む。Step S304: When the unit section 110 enters the MUTE state in step S303, the microphone 1
It is in a state of waiting for the input of the operation keyword from 50. At this time, for example, the operation control processing unit 174 monitors the monitor 1 for a message indicating that the operation keyword input is waiting.
30 is displayed. Then, the voice recognition processing unit 171
It is determined whether or not the voice of the operation keyword has been input from microphone 150. That is, the voice recognition processing unit 171
The operation keyword recognition unit 171c compares the text data of the voice input from the microphone 150 with various operation keywords preset in the memory 176 to determine whether the input voice is the operation keyword. I do. If the result of this determination is that an operation keyword has been input, the flow proceeds to the processing in step S306; otherwise, the flow proceeds to the processing in step S305.

【００７８】ステップＳ３０５：ステップＳ３０４の判
別の結果、マイク１５０から操作キーワードが入力され
ない場合、動作制御処理部１７４は、ステップＳ３０３
にて設定したタイマにより、開始キーワードが入力され
ユニット部１１０に対してＭＵＴＥをかけてから一定時
間経過したか否かを判別する。この判別の結果、一定時
間を経過していない場合には再びステップＳ３０４へと
戻って操作キーワードの入力待ち状態となり、一定時間
を経過した場合にはそのまま後述するステップＳ３０７
へと進む。Step S305: If the operation keyword is not input from the microphone 150 as a result of the determination in step S304, the operation control processing unit 174 proceeds to step S303.
Then, it is determined whether or not a predetermined time has elapsed since the start keyword was input and MUTE was applied to unit unit 110 by the timer set in. If the result of this determination is that the predetermined time has not elapsed, the flow returns to step S304 again to wait for input of an operation keyword, and if the predetermined time has elapsed, step S307 described later is used as it is.
Proceed to.

【００７９】ステップＳ３０６：ステップＳ３０４の判
別の結果、マイク１５０から操作キーワードが入力され
た場合、音声認識処理部１７１は、当該操作キーワード
を示すデータを音声合成処理部１７２及び動作制御処理
部１７４へ供給する。これを受けた音声合成処理部１７
２は、音声出力処理部１７３により、上記操作キーワー
ドをスピーカ１６０から出力させる。したがって、マイ
ク１５０から入力された操作キーワードの音声が、スピ
ーカ１６０からリピート出力される。これにより使用者
は、自分が入力した操作キーワードが受け付けられたこ
とを把握することができる。また、動作制御処理部１７
４は、上記操作キーワードに対応する制御コマンドをド
ライバ１４０へ供給する。例えば、上記操作キーワード
が、ＣＤユニット１１０ａでのＣＤ再生動作の開始を示
す”ＣＤスタート”であった場合、動作制御処理部１７
４は、ＣＤ再生動作開始を示す制御コマンドを生成し、
これをドライバ１４０へ供給する。ドライバ１４０は、
動作制御処理部１７４からの制御コマンドにより、ＣＤ
ユニット１１０ａでのＣＤ再生動作を開始させる。さら
に、動作制御処理部１７４は、上記制御コマンドに基づ
く動作によって発生する情報（ＣＤの再生音やＣＤの再
生トラックの情報等）を、スピーカ１６０やモニタ１３
０から出力するための動作制御も行う。Step S306: As a result of the determination in step S304, when an operation keyword is input from the microphone 150, the voice recognition processing unit 171 sends data indicating the operation keyword to the voice synthesis processing unit 172 and the operation control processing unit 174. Supply. Speech synthesis processing unit 17 receiving this
2 causes the audio output processing unit 173 to output the operation keyword from the speaker 160. Therefore, the voice of the operation keyword input from microphone 150 is repeatedly output from speaker 160. This allows the user to know that the operation keyword input by the user has been accepted. The operation control processing unit 17
4 supplies a control command corresponding to the operation keyword to the driver 140. For example, when the operation keyword is “CD start” indicating the start of the CD reproduction operation in the CD unit 110a, the operation control processing unit 17
4 generates a control command indicating the start of CD playback operation,
This is supplied to the driver 140. The driver 140
According to a control command from the operation control processing unit 174, the CD
The unit 110a starts the CD playback operation. Further, the operation control processing unit 174 transmits information (such as CD reproduction sound and CD reproduction track information) generated by the operation based on the control command to the speaker 160 and the monitor 13.
Operation control for outputting from 0 is also performed.

【００８０】ステップＳ３０７：ステップＳ３０６によ
り、操作キーワードに対応する処理（コマンド処理）が
終了すると、動作制御処理部１７４は、ステップＳ３０
３にてユニット部１１０へかけたＭＵＴＥを解除する。
また、ステップＳ３０５のタイムアウトの判別の結果、
開始キーワードが入力されユニット部１１０に対してＭ
ＵＴＥをかけてから一定時間内に操作キーワードの入力
がなかった場合、動作制御処理部１７４は、ステップＳ
３０３にてユニット部１１０へかけたＭＵＴＥを解除
し、音声出力処理部１７３により、例えば、”操作キー
ワードの受付を終了しました”といったメッセージ
や、”ばいばい”といったメッセージをスピ−カ１６０
から出力させる。その後、ステップＳ３０１へと戻り、
再び音声入力待ち状態となる。Step S307: When the processing (command processing) corresponding to the operation keyword ends in step S306, the operation control processing unit 174 proceeds to step S30.
In step 3, the MUTE applied to the unit 110 is released.
Also, as a result of the determination of the timeout in step S305,
The start keyword is input, and M
If the operation keyword has not been input within a certain period of time since the user entered UTE, the operation control processing unit 174 proceeds to step S
At 303, the MUTE applied to the unit unit 110 is released, and the voice output processing unit 173 outputs, for example, a message such as "the reception of the operation keyword has been completed" or a message such as "bye".
Output from After that, returning to step S301,
The voice input wait state is set again.

【００８１】上述のように本実施の形態では、音声認識
を開始するための開始キーワードを、登録キーワードを
用いた音声入力、或いはキーワード登録キー１２０ａの
操作に基づいて予め登録しておき、その開始キーワード
がマイク１５０から入力されることで、音声認識を開始
するように構成した。As described above, in the present embodiment, a start keyword for starting speech recognition is registered in advance based on a voice input using a registered keyword or an operation of the keyword registration key 120a. The speech recognition is started when a keyword is input from the microphone 150.

【００８２】これにより、従来装置で設けられていた音
声認識の開始のための音声認識開始スイッチ（上記図６
参照）を省くことができるため、装置或いはシステム全
体のコストダウンを図ることができる。また、上記音声
認識開始スイッチを設ける場所を確保する必要がないた
め、車の内装のデザインに制限が生じることはない。さ
らに、使用者にとっても、音声認識の開始の度に音声認
識開始スイッチをＯＮするといった非常に煩わしい操作
を行なう必要がなくなるため、操作性を大幅に向上させ
ることができる。As a result, a speech recognition start switch for starting speech recognition provided in the conventional apparatus (see FIG. 6)
) Can be omitted, so that the cost of the apparatus or the entire system can be reduced. Further, since it is not necessary to secure a place where the voice recognition start switch is provided, there is no restriction on the design of the interior of the car. Further, the user does not need to perform a very troublesome operation such as turning on a voice recognition start switch every time voice recognition is started, so that operability can be greatly improved.

【００８３】また、開始キーワードが音声入力されてか
ら、一定時間内の操作キーワードの音声入力を受け付け
るようにしたので、常に操作キーワードの入力が可能な
状態にしておくと、当該操作キーワード以外の音が認識
されてしまい、使用者が意図していない時に突然誤動作
する可能性がある、ということを確実に防ぐことができ
る。さらに、開始キーワードが入力されてからＭＵＴＥ
をかけるようにしたので、余分なＭＵＴＥがかかること
なく、その後の操作キーワードの認識を正確に行なうこ
とができる。Further, since the input of the operation keyword within a certain period of time is accepted after the start keyword is input by voice, if the operation keyword can always be input, the sound other than the operation keyword can be input. Can be surely prevented, and there is a possibility that a malfunction may occur suddenly when the user does not intend. Furthermore, after the start keyword is input, MUTE
, The subsequent operation keywords can be accurately recognized without extra MUTE.

【００８４】また、通常では、登録キーワードと開始キ
ーワードの２つのキーワードのみしか受け付けない、す
なわち入力された音声が登録キーワードであるか、開始
キーワードであるかの、２つのキーワードのみの認識を
行なうようにしているため、キーワードの誤認識を防ぐ
ことができる。Normally, only two keywords, a registered keyword and a start keyword, are accepted, that is, only two keywords, that is, whether the input voice is a registered keyword or a start keyword, are recognized. , It is possible to prevent erroneous recognition of keywords.

【００８５】また、開始キーワードとして、使用者が利
用しやすく覚えやすい言葉を任意に登録することができ
るため、使用者は、自分が登録した開始キーワードを忘
れることなく正確に使用することができる。さらに、開
始キーワードとして、普段使用しない言葉を登録すれ
ば、同乗者の会話等の中の言葉が開始キーワードとして
認識されにくいため、キーワードの誤認識を確実に防ぐ
ことができる。さらにまた、開始キーワードとして、使
用者の好きな言葉を登録すれば、その言葉の音声入力
（呼びかけ）により反応するため、本装置に愛着がわ
き、装置の魅力を高めることができる。Also, since words that are easy for the user to use and easy to remember can be arbitrarily registered as start keywords, the user can use the registered start keywords accurately without forgetting them. Furthermore, if words that are not usually used are registered as start keywords, words in conversations of passengers and the like are unlikely to be recognized as start keywords, so that erroneous recognition of keywords can be reliably prevented. Furthermore, if a user's favorite word is registered as a start keyword, the user responds by voice input (calling) of the word, so that the user can be more attached to the present apparatus and the attractiveness of the apparatus can be enhanced.

【００８６】また、開始キーワードの登録を、登録キー
ワードの音声入力と、キー操作部１２０のキーワード登
録キー１２０ａの操作との何れの方法でも行なえるよう
にしたので、使用者は、そのときの状況に応じて、利用
しやすい方法で、開始キーワードの登録を行なうことが
できる。例えば、開始キーワードの登録は頻繁に行なう
作業ではないので、キー操作部１２０のキーワード登録
キー１２０ａの操作により行なうようにしてもよい。Further, since the start keyword can be registered by either the voice input of the registered keyword or the operation of the keyword registration key 120a of the key operation unit 120, the user can check the situation at that time. , The start keyword can be registered in an easy-to-use method. For example, since the registration of the start keyword is not frequently performed, it may be performed by operating the keyword registration key 120a of the key operation unit 120.

【００８７】尚、上述した本実施の形態において、次の
ような構成を採用するようにしてもよい。In the above-described embodiment, the following configuration may be adopted.

【００８８】（１）ユニット部１１０としては、ＣＤユ
ニット１１０ａやＭＤユニットに限られることはなく、
例えば、ＣＤユニット１１０ａ及びＭＤユニットに加え
て、ラジオ、ＴＶ、チューナ、カーナビゲーションシス
テム、エアコン等のユニットを含ませるようにしてもよ
いし、或いは、これらのユニットの一部を含むようにし
てもよい。(1) The unit section 110 is not limited to the CD unit 110a or the MD unit.
For example, in addition to the CD unit 110a and the MD unit, a unit such as a radio, a TV, a tuner, a car navigation system, and an air conditioner may be included, or a part of these units may be included.

【００８９】（２）上記図２に示したようなコントロー
ラ１７０の機能を、ＣＤユニット１１０ａやカーナビゲ
ーションシステム等の各ユニット自体に持たせるように
してもよい。また、上記図２に示したようなコントロー
ラ１７０は、上記図１に示したような車載装置に限ら
ず、例えば、携帯電話等の電話機や、オーディオ装置、
その他の音声認識機能を有する装置やシステムに適用可
能である。(2) Each unit such as the CD unit 110a and the car navigation system may have the function of the controller 170 as shown in FIG. 2 described above. The controller 170 as shown in FIG. 2 is not limited to the in-vehicle device as shown in FIG. 1, but may be, for example, a telephone such as a mobile phone, an audio device,
The present invention is applicable to other devices and systems having a voice recognition function.

【００９０】（３）入力音声の認識の際、当該入力音声
をテキストデータに変換してメモリ１７６に記憶するよ
うにしたが、例えば、当該入力音声をそのままサンプリ
ングしてメモリ１７６に記憶するようにしてもよい。(3) When recognizing an input voice, the input voice is converted into text data and stored in the memory 176. For example, the input voice is sampled as it is and stored in the memory 176. You may.

【００９１】（４）開始キーワードをキー操作部１２０
のキーワード登録キー１２０ａの操作によって登録する
場合、キーワード登録キー１２０ａが押下され、開始キ
ーワードとしての音声が入力された後、再度キーワード
登録キー１２０ａが押下されたときに、当該入力音声を
開始キーワードとして登録するようにしてもよい。(4) The start keyword is input to the key operation unit 120
When the keyword is registered by operating the keyword registration key 120a, when the keyword registration key 120a is pressed and the voice as the start keyword is input, and when the keyword registration key 120a is pressed again, the input voice is used as the start keyword. You may make it register.

【００９２】（５）開始キーワードの登録を音声入力で
行ない、ユニット部１１０に対する全ての動作指示をも
音声認識機能によって行なうように構成した場合、キー
操作部１２０は必ずしも設ける必要はない。(5) When the start keyword is registered by voice input and all operation instructions to the unit section 110 are also performed by the voice recognition function, the key operation section 120 is not necessarily provided.

【００９３】（６）音声入力待ち状態や、ユニット部１
１０の動作状態等の情報を使用者に示すために、モニタ
１３０での表示を行なうようにしたが、これは必ずしも
必要ではなく、モニタ１３０を設けない構成とするよう
にしてもよい。また、モニタ１３０に各種情報を表示す
る代わりに、当該情報をスピーカ１６０から音声として
出力するようにしてもよい。(6) Waiting for voice input, unit unit 1
The display on the monitor 130 is performed in order to show the information such as the operation state of the device 10 to the user. However, this is not always necessary, and the monitor 130 may not be provided. Further, instead of displaying various kinds of information on the monitor 130, the information may be output from the speaker 160 as audio.

【００９４】（７）本発明の目的は、上述した実施の形
態の各機能を実現するソフトウェアのプログラムコード
を記憶した記憶媒体を、システム或いは装置に供給し、
そのシステム或いは装置のコンピュータ（又はＣＰＵや
ＭＰＵ）が記憶媒体に格納されたプログラムコードを読
みだして実行することによっても、達成されることは言
うまでもない。この場合、記憶媒体から読み出されたプ
ログラムコード自体が本実施の形態の機能を実現するこ
ととなり、そのプログラムコードを記憶した記憶媒体は
本発明を構成することとなる。プログラムコードを供給
するための記憶媒体としては、ＲＯＭ、フロッピーディ
スク、ハードディスク、光ディスク、光磁気ディスク、
ＣＤ−ＲＯＭ、ＣＤ−Ｒ、磁気テープ、不揮発性のメモ
リカード等を用いることができる。また、コンピュータ
が読みだしたプログラムコードを実行することにより、
本実施の形態の機能が実現されるだけでなく、そのプロ
グラムコードの指示に基づき、コンピュータ上で稼動し
ているＯＳ等が実際の処理の一部又は全部を行い、その
処理によって本実施の形態の機能が実現される場合も含
まれることは言うまでもない。さらに、記憶媒体から読
み出されたプログラムコードが、コンピュータに挿入さ
れた拡張機能ボードやコンピュータに接続された機能拡
張ユニットに備わるメモリに書き込まれた後、そのプロ
グラムコードの指示に基づき、その機能拡張ボードや機
能拡張ユニットに備わるＣＰＵなどが実際の処理の一部
又は全部を行い、その処理によって本実施の形態の機能
が実現される場合も含まれることは言うまでもない。(7) An object of the present invention is to provide a system or an apparatus with a storage medium storing a program code of software for realizing each function of the above-described embodiment,
It is needless to say that the present invention is also achieved when a computer (or CPU or MPU) of the system or apparatus reads out and executes a program code stored in a storage medium. In this case, the program code itself read from the storage medium implements the functions of the present embodiment, and the storage medium storing the program code constitutes the present invention. As storage media for supplying the program code, ROM, floppy disk, hard disk, optical disk, magneto-optical disk,
A CD-ROM, CD-R, magnetic tape, nonvolatile memory card, or the like can be used. Also, by executing the program code read by the computer,
Not only the functions of the present embodiment are realized, but also an OS or the like running on a computer performs a part or all of the actual processing based on the instructions of the program code. It is needless to say that the case where the function is realized is also included. Further, after the program code read from the storage medium is written to a memory provided in an extension function board inserted into the computer or a function extension unit connected to the computer, the function extension is performed based on the instruction of the program code. It goes without saying that a CPU or the like provided in the board or the function expansion unit performs part or all of the actual processing, and the processing realizes the functions of the present embodiment.

【００９５】[0095]

【発明の効果】以上説明したように本発明では、任意の
機能を動作させるための操作キーワードの音声認識の開
始を、開始キーワードの音声入力によって行なえるよう
に構成したので、従来からの音声認識の開始のためのス
イッチ等を設ける必要はない。これにより、装置或いは
システム全体のコストダウンを図ることができる。As described above, according to the present invention, speech recognition of an operation keyword for operating an arbitrary function can be started by speech input of a start keyword. It is not necessary to provide a switch or the like for starting the operation. Thereby, the cost of the apparatus or the entire system can be reduced.

【００９６】また、使用者は、音声認識により上記任意
の機能を動作させたい度に、その都度上記スイッチをＯ
Ｎする等といった非常に煩わしい操作を行なう必要がな
くなるため、操作性を大幅に向上させることができる。Each time the user wants to operate the above-mentioned arbitrary function by voice recognition, the user turns on the above switch every time.
Since it is not necessary to perform a very troublesome operation such as N, the operability can be greatly improved.

【００９７】また、登録キーワードと開始キーワードの
２つのキーワードのみの音声入力の認識を行ない、開始
キーワードである場合に、次の操作キーワードの音声入
力の認識を行なうことができるように構成されているた
め、キーワードの誤認識を防ぐことができる。Further, the speech input of only the two keywords, the registered keyword and the start keyword, is recognized, and if the keyword is the start keyword, the speech input of the next operation keyword can be recognized. Therefore, erroneous recognition of the keyword can be prevented.

【００９８】また、開始キーワードを登録するように構
成した場合には、開始キーワードとして、使用者が利用
しやすく覚えやすい言葉等を任意に登録することができ
るため、使用者は、自分が登録した開始キーワードを忘
れることなく正確に使用することができる。さらに、開
始キーワードとして、普段使用しない言葉を登録すれ
ば、会話中の言葉等が開始キーワードとして認識されに
くいため、キーワードの誤認識を確実に防ぐことができ
る。さらにまた、開始キーワードとして、使用者の好き
な言葉を登録すれば、その言葉の音声入力（呼びかけ）
により反応するため、本発明を適用した装置やシステム
に愛着がわき、当該装置やシステムの魅力を高めること
ができる。When the start keyword is configured to be registered, words that are easy for the user to use and easy to remember can be arbitrarily registered as the start keyword. It can be used accurately without forgetting the starting keyword. Furthermore, if words that are not usually used are registered as start keywords, words in conversation are difficult to be recognized as start keywords, so that erroneous recognition of keywords can be reliably prevented. Furthermore, if a user's favorite word is registered as a starting keyword, voice input of that word (call)
, The attachment to the device or system to which the present invention is applied is enhanced, and the attraction of the device or system can be enhanced.

【００９９】また、開始キーワードの登録を、登録キー
ワードの音声入力と、キー操作との何れの方法でも行な
えるように構成した場合には、使用者は、そのときの状
況に応じて、利用しやすい方法で、開始キーワードの登
録を行なうことができる。If the start keyword is registered so that it can be registered by either the voice input of the registered keyword or the key operation, the user can use the key word in accordance with the situation at that time. The start keyword can be registered in an easy way.

【０１００】また、開始キーワードが音声入力されてか
ら、一定時間内の操作キーワードの音声入力を受け付け
るように構成した場合には、確実に操作キーワードの音
声入力を認識することができる。これにより、当該認識
の誤りによって誤動作することを確実に防ぐくことがで
き、高性能な音声認識を提供することができる。[0100] Further, when the voice input of the operation keyword within a certain period of time is accepted after the voice input of the start keyword, the voice input of the operation keyword can be surely recognized. This can reliably prevent malfunction due to the recognition error, and provide high-performance speech recognition.

【０１０１】また、開始キーワードの音声入力が認識さ
れてから操作キーワードの音声入力がされる間に、音楽
やラジオ等の音の出力の禁止をするように構成した場合
には、さらに確実に操作キーワードの音声入力を認識す
ることができる。If the system is configured to prohibit the output of sound such as music or radio while the voice input of the operation keyword is performed after the voice input of the start keyword is recognized, the operation can be more reliably performed. The voice input of the keyword can be recognized.

[Brief description of the drawings]

【図１】本発明を適用した、自動車の運転席に設けられ
るセンターコントロールユニットの構成を説明するため
の図である。FIG. 1 is a diagram for explaining a configuration of a center control unit provided in a driver seat of an automobile to which the present invention is applied.

【図２】上記センターコントロールユニットの内部構成
を示すブロック図である。FIG. 2 is a block diagram showing an internal configuration of the center control unit.

【図３】上記センターコントロールユニットにおいて、
音声認識を開始することを示す開始キーワードを、登録
キーワードの音声入力により登録する場合の上記センタ
ーコントロールユニットの動作を説明するためのフロー
チャートである。FIG. 3 In the center control unit,
It is a flowchart for demonstrating the operation | movement of the said center control unit at the time of registering the start keyword which shows starting speech recognition by the voice input of a registered keyword.

【図４】上記開始キーワードをキーワード登録キーの操
作により登録する場合の上記センターコントロールユニ
ットの動作を説明するためのフローチャートである。FIG. 4 is a flowchart illustrating an operation of the center control unit when the start keyword is registered by operating a keyword registration key.

【図５】上記開始キーワードをキーワード登録キーの操
作により登録する場合において、入力音声が開始キーワ
ードでない場合の処理を説明するためのフローチャート
である。FIG. 5 is a flowchart illustrating a process when the input voice is not a start keyword when the start keyword is registered by operating a keyword registration key.

【図６】従来の音声認識機能の構成を示すブロック図で
ある。FIG. 6 is a block diagram showing a configuration of a conventional voice recognition function.

[Explanation of symbols]

１００センターコントロールユニット１１０ユニット部１２０キー操作部１３０モニタ１４０ドライバ１５０マイク１６０スピーカ１７０コントローラ１７１音声認識処理部１７１ａ登録キーワード認識部１７１ｂ開始キーワード認識部１７１ｃ操作キーワード認識部１７２音声合成処理部１７３音声出力処理部１７４動作制御処理部１７５キー入力処理部１７６メモリ 100 center control unit 110 unit section 120 key operation section 130 monitor 140 driver 150 microphone 160 speaker 170 controller 171 voice recognition processing section 171a registered keyword recognition section 171b start keyword recognition section 171c operation keyword recognition section 172 voice synthesis processing section 173 voice output processing Unit 174 operation control processing unit 175 key input processing unit 176 memory

Claims

[Claims]

An operation recognizing means for recognizing a voice input of an operation keyword for instructing execution of an arbitrary function, and a start keyword for instructing a start of a recognition operation of the operation keyword by the operation recognizing means A speech recognition device, comprising: start recognition means for recognizing a voice input, wherein the operation recognition means starts a recognition operation of the operation keyword based on a recognition result of the start recognition means.

2. The operation recognition unit according to claim 1, wherein the operation recognition unit executes the operation keyword recognition operation for a predetermined time after the start recognition unit recognizes the voice input of the start keyword. Voice recognition device.

3. The operation by the operation recognizing means for a predetermined period of time after the speech input of the start keyword is recognized by the start recognizing means, or after the speech input of the start keyword is recognized by the start recognizing means. 2. The speech recognition apparatus according to claim 1, further comprising control means for inhibiting output of a sound until the speech input of the keyword is recognized.

4. A registration recognizing means for recognizing a voice input of a registered keyword for instructing registration of the start keyword, and starting the input voice after the voice recognition of the registered keyword is recognized by the registration recognizing means. 2. A speech recognition apparatus according to claim 1, further comprising registration means for registering as a keyword, wherein said start recognition means recognizes a speech input of the start keyword registered by said registration means.

5. The registration unit according to claim 4, wherein the registration unit registers an input voice for a predetermined time after the registration recognition unit recognizes a voice input of the registration keyword as the start keyword. Voice recognition device.

6. A method according to claim 1, wherein said registration recognition means recognizes the voice input of said registered keyword for a predetermined period of time, or after said registration recognition means recognizes the voice input of said registered keyword, said registration means recognizes said start keyword. 5. The speech recognition apparatus according to claim 4, further comprising control means for prohibiting output of a sound until the registration of the speech is completed.

7. An operating unit for instructing the registration of the start keyword, and a registering unit for registering an input voice after the instruction of the registration of the start keyword by the operating unit as the start keyword, 2. The speech recognition apparatus according to claim 1, wherein the start recognition unit recognizes a speech input of the start keyword registered by the registration unit.

8. A speech recognition apparatus according to claim 7, wherein said registering means registers an input voice for a predetermined time from when the registration of said start keyword is instructed by said operation means as said start keyword. apparatus.

9. The registration of the start keyword is completed by the registration means for a predetermined time after the registration of the start keyword is instructed by the operation means, or after the registration of the start keyword is instructed by the operation means. 8. The speech recognition apparatus according to claim 7, further comprising control means for prohibiting output of a sound until the sound is output.

10. A voice recognition-equipped device which has a plurality of functions and is capable of executing any of the plurality of functions by voice input of an operation keyword. A voice recognition mounting device comprising the voice recognition device according to any one of claims 1 to 3.

11. A speech recognition mounting system in which a plurality of devices are communicably connected, wherein at least one of the plurality of devices is the speech recognition device according to any one of claims 1 to 9. Wherein the voice recognition device controls the operation of another device.

12. A voice recognition method for recognizing a voice of an input operation keyword and executing a corresponding function based on the operation keyword, wherein the start of the voice recognition of the operation keyword is determined based on the start keyword. A speech recognition method characterized by performing the process after waiting for a speech input.

13. A computer-readable storage medium storing a program for causing a computer to execute the functions of the speech recognition device according to claim 1.