JP2975633B2

JP2975633B2 - Voice recognition method

Info

Publication number: JP2975633B2
Application number: JP2082535A
Authority: JP
Inventors: 正一亀井
Original assignee: Sanyo Denki Co Ltd
Current assignee: Sanyo Denki Co Ltd
Priority date: 1990-03-29
Filing date: 1990-03-29
Publication date: 1999-11-10
Anticipated expiration: 2014-11-10
Also published as: JPH03280100A

Description

[Detailed Description of the Invention]

(A) Field of Industrial Application

The present invention relates to a voice recognition method for a system that mechanically recognizes speech uttered by a speaker.

(B) Prior Art

Various voice recognition devices have been developed as systems for mechanically recognizing speech uttered by a speaker, and in recent years they have been adopted in, for example, telephone systems that can auto-dial by voice (Japanese Patent Laid-Open No. 62-81152).

However, no matter how far research and development of voice recognition techniques has reduced misrecognition, speech uttered by a human inherently contains uncertain elements, and the influence of ambient noise cannot be avoided. As a result, misrecognition still cannot be eliminated completely.

Therefore, auto-dial telephone systems such as the one described above adopt a two-stage recognition processing method. In the first step, the destination name spoken by the caller is recognized, and the recognition result is displayed or announced by synthesized speech. In the second step, only when the announced result is correct, the user has an execution command word such as "dial" recognized so that the destination in the recognition result is dialed; when the announced result is wrong, the user instead has a non-execution command word such as "cancel" recognized.

However, with this conventional two-stage recognition processing method, every time the destination name is misrecognized the user (that is, the speaker) must input the non-execution command word and then re-input the destination name, which places a heavy burden on the user.

(C) Problem to Be Solved by the Invention

The present invention has been made to eliminate the above drawback, and provides a voice recognition method that assumes misrecognition can occur and reduces the burden on the user.

(D) Means for Solving the Problem

The voice recognition method of the present invention comprises the following processing steps.

A first voice recognition processing step of recognizing speech uttered by a speaker to obtain a plurality of recognition candidates;

a recognition candidate notification processing step of sequentially announcing to the speaker the plurality of recognition candidates obtained in the first voice recognition processing;

a second voice recognition processing step of recognizing a command word uttered by the speaker, when a specific recognition candidate is announced in the recognition candidate notification processing, to indicate that the candidate corresponds to the speaker's utterance in the first voice recognition processing;

a guidance notification processing step of issuing a guidance announcement that asks the speaker whether to execute the recognition candidate notification processing again, if no command word has been recognized by the second voice recognition processing by the time all candidates have been announced;

a third voice recognition processing step of recognizing a command word, identical to the command word used in the second voice recognition processing, that the speaker utters in response to the guidance announcement; and

a recognition candidate re-notification processing step of executing the recognition candidate notification processing again when the command word is recognized in the third voice recognition processing.

(E) Operation

In the voice recognition method of the present invention, a plurality of recognition candidates are announced one after another, so even if the first candidate is wrong, the desired recognition result can be found among the candidates that follow. The speaker utters a command word such as "OK" for that candidate, and if the command word is recognized, the candidate can be output as the correct recognition result. Moreover, if the command word could not be recognized, or the speaker was unable to utter it in time, the speaker can respond to the guidance announcement asking whether to repeat the candidates by uttering the same command word. If the command word is correctly recognized at this point, this both confirms that the command word itself can be recognized and causes the recognition candidates to be announced again.

(F) Embodiment

An embodiment in which the present invention is applied to a wireless auto-dial telephone mounted in an automobile is described below with reference to the drawings.

FIGS. 1 and 2 show the electrical configuration of the wireless auto-dial telephone; FIG. 1 shows the master unit and FIG. 2 shows the slave unit.

The slave unit of FIG. 2 includes: a microphone 1 for voice input; an amplifying unit 2 that amplifies the voice signal input from the microphone 1; a transmitting unit 3 including a modulation circuit that modulates a carrier wave with the amplified voice signal and an amplifier circuit that amplifies the modulated wave output from the modulation circuit; an antenna coupler 4; a transmitting/receiving antenna 5; a receiving unit 6 including an amplifier circuit that amplifies the received modulated wave and a demodulation circuit that demodulates the amplified modulated wave; an amplifying unit 7 that amplifies the voice signal reproduced by the receiving unit 6; a speaker 8 that outputs the amplified voice signal as sound; a key input unit 9 having a numeric keypad and various function keys; and a control unit 10 that controls the transmitting unit 3, the receiving unit 6, and the key input unit 9.

The master unit of FIG. 1 includes: a loop detection unit 27 that detects the state of the line loop; a voice recognition unit 25 for performing auto-dialing by voice input; a line control unit 26 that supplies the line with the telephone number corresponding to the speech recognized by the voice recognition unit 25 and also supplies transmit and receive signals to the line; a transmitting/receiving antenna 21; an antenna coupler 22; a receiving unit 23 including an amplifier circuit that amplifies the received modulated wave and a demodulation circuit that demodulates the amplified modulated wave; a changeover switch 24 for selecting whether the voice signal reproduced by the receiving unit 23 is supplied to the line control unit 26 or to the voice recognition unit 25; a key input unit 34 having a numeric keypad and various function keys; a voice synthesis unit 28 for generating various synthesized speech; a transmitting unit 29 including a modulation circuit that modulates a carrier wave with the output signal of the voice synthesis unit 28 or the received-call signal from the line control unit 26 and an amplifier circuit that amplifies the modulated wave output from the modulation circuit; and a control unit 30 that controls these components.

The voice recognition unit 25 includes: an analysis unit 41 that performs frequency analysis on the input voice signal; a voice pattern creation unit 42 that creates, from the analysis result of the analysis unit 41, a time series of speech spectra, that is, a voice pattern; an input voice pattern memory 45 for storing the voice pattern created by the pattern creation unit 42; a first standard pattern memory 43 that stores standard voice patterns of a plurality of call destination names; a second standard pattern memory 44 that stores standard voice patterns of a plurality of command words; and an identification unit 46 that compares the voice pattern output from the voice pattern creation unit 42 with the standard patterns and determines the similarity between them.

The control unit 30 includes a main memory 31 for storing programs and necessary data, a recognition result memory 32 that stores call destination names in descending order of similarity in accordance with the voice recognition of a call destination name, and a counter 33 for measuring time.

The control unit 30 also receives a detection signal from a steering wheel rotation angle detector 35 that detects the rotation angle of the steering wheel of the automobile in which the telephone is mounted, a detection signal from a brake operation detector 36 that detects the operating state of the brake, and a detection signal from a vehicle speed detector 37 that detects the vehicle speed.

To perform auto-dialing by voice with this wireless auto-dial telephone, the command words and the call destination names are registered by voice in advance. The command words include "off-hook", which takes the telephone off-hook, and the confirmation command word "OK".

Voice registration of the command words and call destination names is performed as follows. First, the registration key provided on the key input unit 9 of the slave unit is pressed. The control unit 10 of the slave unit then outputs a registration operation start signal. In the transmitting unit 3, the carrier wave is modulated by this registration operation start signal, and the modulated wave is transmitted via the coupler 4 and the antenna 5. The modulated wave is received by the antenna 21 of the master unit and sent to the receiving unit 23 via the coupler 22, where the received modulated signal is demodulated and amplified. The registration operation start signal reproduced by the receiving unit 23 is sent to the control unit 30, which places the master unit in the registration operation state.

Instead of pressing the registration key on the key input unit 9 of the slave unit, the registration key on the key input unit 34 of the master unit may be pressed. In this case, the output of the key input unit 34 is input to the control unit 30 of the master unit, which places it in the registration operation state.

Next, a command word or call destination name is input through the microphone 1 of the slave unit. The voice signal output from the microphone 1 is amplified by the amplifying unit 2 and then sent to the transmitting unit 3. In the transmitting unit 3, the carrier wave is modulated by the voice signal and the modulated wave is amplified. The modulated wave output from the transmitting unit 3 is transmitted via the coupler 4 and the antenna 5.

This modulated wave is received by the antenna 21 of the master unit and sent to the receiving unit 23 via the coupler 22. In the receiving unit 23, the received modulated signal is demodulated and amplified. The voice signal reproduced by the receiving unit 23 is sent to the voice recognition unit 25 via the changeover switch 24.

In the voice recognition unit 25, the voice analysis unit 41 first performs frequency analysis on the input voice signal, and the pattern creation unit 42 creates a voice pattern from the analysis result. This voice pattern is stored in the first standard pattern memory 43 when the input speech is a call destination name, and in the second standard pattern memory 44 when the input speech is a command word.

Through these operations, the voice patterns of all call destination names and all command words are stored in the first or second standard pattern memory, respectively.

When a call destination name is registered by voice, the telephone number of that destination is entered after the voice input of each name, using the numeric keypad of the key input unit 9 of the slave unit or the key input unit 34 of the master unit. The telephone number corresponding to each voice-registered call destination is stored in the main memory 31.

FIGS. 3(I) to 3(IV) show the call origination procedure performed by the control unit 30 of the master unit.

First, after turning on the power of the slave unit, the user (speaker) speaks the off-hook command word "off-hook" into the microphone 1. When the off-hook command word is input to the microphone 1 (step S1), the voice signal from the microphone 1 is amplified, the carrier wave is modulated by the amplified voice signal, and the modulated wave is transmitted from the antenna 5 of the slave unit. This modulated wave is received by the antenna 21 of the master unit and demodulated. The reproduced voice signal is sent via the changeover switch 24 to the voice recognition unit 25, where voice recognition processing is performed (step S2).

In the voice recognition unit 25, the voice analysis unit 41 first performs frequency analysis on the input voice signal. Next, the pattern creation unit 42 creates a voice pattern from the analysis result, and the created voice pattern is stored in the input voice pattern memory 45. The identification unit 46 then compares the created voice pattern with all of the command word standard patterns stored in the second standard pattern memory 44 and determines their similarity.

When the identification unit 46 recognizes the input speech as "off-hook" (step S3), the line control unit 26 forms a DC loop and the telephone goes off-hook (step S4). After this, a safety confirmation process is performed to check whether driving conditions are safe for the user to operate the telephone (step S5). The details of this process are described later.

When the safety confirmation process confirms that conditions are safe, the voice synthesis unit 28 outputs a synthesized speech signal representing a guidance message that prompts the user to input the name of the call destination, for example "Please input the destination name". This signal is sent to the transmitting unit 29, where the carrier wave is modulated by it. The modulated wave is amplified and then transmitted via the antenna 21.

This modulated wave is received by the antenna 5 of the slave unit and demodulated. The reproduced synthesized speech signal is amplified and sent to the speaker 8, which outputs the message "Please input the destination name" (step S6).

After this, when the user speaks a call destination name, for example "Sanyo", into the microphone 1 (step S7), the voice signal is received by the master unit via wireless communication, and the voice recognition unit 25 calculates its similarity to every destination standard pattern in the first standard pattern memory 43. The call destination names are then stored in the recognition result memory 32 in descending order of similarity (step S8). After this, the candidate rank n is incremented by 1 (step S9). Because the candidate rank n is initialized to n = 0, n = 1 on the first pass.

After this, the safety confirmation process is performed (step S10). When it confirms that conditions are safe, the n-th candidate among the call destinations stored in the recognition result memory 32 is read out. On the first pass this is the first candidate, that is, the voice synthesis unit 28 outputs a synthesized speech signal corresponding to the call destination name with the highest similarity. Based on this output, the n-th candidate call destination name is output from the speaker 8 of the slave unit (step S11).

A timing operation for measuring a predetermined time is also started (step S12). This predetermined time is set to a suitable period (for example, 3 seconds) within which the user can input the confirmation command word "OK" in response to the spoken call destination name. The user speaks the confirmation command word "OK" into the microphone 1 of the slave unit only if the announced call destination name is the one the user originally spoke.

When the user speaks "OK" into the microphone 1 of the slave unit within the predetermined time (step S13), the voice signal is received by the master unit via wireless communication, and the voice recognition unit 25 compares the input voice pattern with all of the command word standard patterns in the second standard pattern memory 44 (step S15). If this voice recognition identifies the input speech as "OK" (step S16), the telephone number corresponding to the call destination name announced in step S11 is read from the main memory 31 and sent to the line via the line control unit 26 (step S17). As a result, a call is placed to the call destination.

When the loop detection unit 27 detects within a fixed time that the call destination has gone off-hook (step S18), the changeover switch 24 is switched so that the voice signal input to the slave unit is supplied to the line control unit 26 (step S19). This makes conversation with the call destination possible. When the call ends (step S20), the contents of the recognition result memory 32 are cleared and the candidate rank n is reset (n = 0) (step S21), and the process ends.

If there is no voice input within the predetermined time after the timing operation is started in step S12 (step S14), or if there is voice input but it is not recognized as "OK" (step S16), it is checked whether the call destination name announced in step S11 is the final candidate (n = N) (step S22).

If it is not the final candidate (n < N), the process returns to step S9 and the candidate rank n is incremented by 1. The safety confirmation process is then performed (step S10), and when safety is confirmed, the n-th candidate, that is, the next candidate call destination name, is read from the recognition result memory 32. The voice synthesis unit 28 outputs a synthesized speech signal corresponding to this call destination, and based on it the synthesized speech is output from the speaker 8 of the slave unit (step S11). The timing operation is also started (step S12).

The system then waits for voice input of the confirmation command word "OK" for the predetermined time (steps S13 and S14). If voice input occurs while waiting and the voice recognition identifies the input speech as the confirmation command word "OK" (steps S13, S15, and S16), the processing for auto-dialing and conversation (steps S17 to S21) is performed.

If there is no voice input while waiting for the confirmation command word "OK" (step S14), or if there is voice input but it is not recognized as the confirmation command word "OK" (step S16), it is checked whether the destination name just announced is the final candidate (step S22). If it is not the final candidate (n < N), the process returns to step S9.

If the destination name just announced is the final candidate (n = N), the safety confirmation process is performed (step S23), and then a guidance message asking the user whether to output the call destination names again in candidate order, for example "Output the destination names again?", is output as speech (step S24). Timing of a predetermined time is also started (step S25). This predetermined time is set to a suitable period within which the user can respond to the guidance message with the same confirmation command word used in steps S12 and S13, in this case "OK". The user speaks the confirmation command word "OK" into the microphone 1 of the slave unit only if the user wants the call destination names output again. When the user speaks "OK" into the microphone 1 of the slave unit within this predetermined time (step S26), the voice signal is received by the master unit via wireless communication, and the voice recognition unit 25 compares it with all of the command word standard patterns in the second standard pattern memory 44 (step S28).

If the voice recognition in step S28 by the voice recognition unit 25 identifies the input speech as "OK" (step S29), the candidate rank n is reset (step S30). The process then returns to step S9, and the processing is performed again: output of the next candidate call destination name, waiting for voice input of the confirmation command word, output of the telephone number when the confirmation command word is input, and so on.

If there is no voice input within the predetermined time after the timing operation is started in step S25, or if there is voice input but it is not recognized as "OK" (YES in step S27 or NO in step S29), the contents of the recognition result memory 32 are cleared and the candidate rank n is reset (step S31), and the process ends.

The processing in steps S24 to S30 of the above embodiment is the characteristic feature of the present invention. Specifically, in response to the voice guidance "Output the destination names again?", the user inputs the same "OK" that serves as the confirmation command word for the spoken call destination names, and this input is subjected to recognition processing.

That is, while the call destination names (the recognition candidates) are being output in the first round, an "OK" spoken for the desired name may fail to be recognized because of ambient noise. If the "OK" spoken in response to "Output the destination names again?" is recognized, this can be taken as an indication that "OK" is also likely to be recognized while the call destination names are output in the second round. Recognition of the command word "OK" in this way serves both as a test of whether the word itself can be recognized and as the command to output the call destination names again.

FIG. 4 shows the details of the safety confirmation process. In this process, it is first determined whether the rotation angle detected by the steering wheel rotation angle detector 35 is smaller than a predetermined reference angle (step S41).

If the detected rotation angle is smaller than the reference angle, it is next determined from the output of the brake operation detector 36 whether the brake is off (step S42).

If the brake is off, it is next determined whether the speed detected by the vehicle speed detector 37 is lower than a predetermined reference speed (step S43). If the detected speed is lower than the reference speed, this process ends and control moves to the speech output step that follows it: step S6, S11, or S24 (see FIG. 3).

If the detected rotation angle is larger than the reference angle in step S41, if the brake is on in step S42, or if the detected speed is higher than the reference speed in step S43, the process returns to step S41.

Consequently, the speech output in step S6, S11, or S24 (see FIG. 3) that follows the safety confirmation process is inhibited until the detected rotation angle is smaller than the reference angle, the brake is off, and the detected speed is lower than the reference speed.

In step S2 of the above embodiment, the input voice pattern is compared with all the command word standard patterns stored in the second standard pattern memory 44. Alternatively, the input voice pattern may be compared only with the standard pattern of the off-hook command word stored in the second standard pattern memory 44.

The destination standard voice patterns and the command word standard voice patterns are stored in separate memories 43 and 44, but they may instead be stored in a single standard pattern memory.

In step S8 above, the similarity is calculated against all the destination standard patterns in the first standard pattern memory 43. Alternatively, the similarity may be calculated only against those destination standard patterns in the first standard pattern memory 43 that fall within a range specified in advance.

For example, the destinations are divided into predetermined groups, the destination standard patterns are stored in the first standard pattern memory 43 group by group, and group name designation data is entered by key input, voice input or the like before the telephone operation starts. In step S8, the similarity is then calculated against all the destination standard patterns that belong to the designated group.

Further, when there are several users, the destination standard patterns are stored in the first standard pattern memory 43 user by user, and user name designation data is entered by key input, voice input or the like before the telephone operation starts. In step S8, the similarity is then calculated against all the destination standard patterns registered for the designated user.

Further, when there are several users, the destination standard patterns may be stored in the first standard pattern memory 43 by user and by predetermined group, with user name designation data and group name designation data entered by key input, voice input or the like before the telephone operation starts. In step S8, the similarity is then calculated against all the destination standard patterns registered for the designated user and the designated group.
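Restricting the step S8 comparison to a designated user and group can be sketched as below. The `Template` record, the `rank_candidates` function, and the toy `neg_squared_distance` similarity measure are all hypothetical; the patent does not specify how patterns are represented or how similarity is computed.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Sequence, Tuple

Pattern = Tuple[float, ...]


@dataclass(frozen=True)
class Template:
    """A destination standard pattern tagged with its user and group (hypothetical layout)."""
    name: str
    user: str
    group: str
    pattern: Pattern


def neg_squared_distance(a: Pattern, b: Pattern) -> float:
    """Toy similarity: larger means more alike. Stands in for the real measure."""
    return -sum((x - y) ** 2 for x, y in zip(a, b))


def rank_candidates(templates: Sequence[Template],
                    input_pattern: Pattern,
                    similarity: Callable[[Pattern, Pattern], float],
                    user: Optional[str] = None,
                    group: Optional[str] = None) -> List[str]:
    """Compare the input only with templates of the designated user/group,
    then return destination names in descending order of similarity."""
    selected = [
        t for t in templates
        if (user is None or t.user == user)
        and (group is None or t.group == group)
    ]
    ranked = sorted(selected,
                    key=lambda t: similarity(input_pattern, t.pattern),
                    reverse=True)
    return [t.name for t in ranked]
```

Passing neither `user` nor `group` corresponds to the base embodiment, in which every destination standard pattern is compared.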

In the above embodiment, the call destination names from the first candidate to the final candidate are output by voice in sequence. If no command word is input by voice in response, or if a voice input is made but is not recognized as "OK", the user is asked whether the call destination names should be output again from the first candidate (see steps S22 to S24). Alternatively, regardless of the user's intention, the output of the call destination names from the first candidate to the final candidate may be repeated until the repetition count m reaches a predetermined count M.

The processing procedure for this case is shown by the broken line in FIG. 3. When it is determined in step S22 that the call destination name just output by voice is the final candidate (n = Nm), the repetition count m is incremented by 1 (step S51). The repetition count m is set to 0 at initialization.

It is then determined whether the repetition count m is greater than the predetermined count M (step S52). If the repetition count m is smaller than the predetermined count M, the candidate rank n is reset (step S53) and the process returns to step S9. The call destination names are therefore output in sequence from the first candidate, and if the confirmation command word "OK" is input for one of them, the corresponding telephone number is output.

If the call destination names from the first candidate to the final candidate have been output but the confirmation command word "OK" has not been input, or a voice input was made but was not recognized as "OK", then, as shown in FIG. 3 (IV), the repetition count m is again incremented by 1 (step S51) and it is determined whether m is greater than the predetermined count M (step S52). If m is greater than M, the contents of the recognition result memory 32 are cleared, the candidate rank n and the repetition count m are reset (step S54), and the process ends.
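The repeat-until-M variant (steps S51 to S54) can be sketched as a bounded loop. The function name `confirm_with_repeats` and the `heard_ok` callback are hypothetical stand-ins for the voice output and the "OK" recognition. The text describes only the cases where m is smaller or greater than M, so stopping when m equals M is an assumption of this sketch.

```python
from typing import Callable, Optional, Sequence


def confirm_with_repeats(candidates: Sequence[str],
                         heard_ok: Callable[[str], bool],
                         max_rounds: int) -> Optional[str]:
    """Announce the candidates round after round until "OK" is heard.

    heard_ok(name) stands in for announcing `name` and then recognizing
    the confirmation command word. Returns the confirmed name, or None
    once the repetition count reaches max_rounds (M in the text).
    """
    m = 0  # repetition count, 0 at initialization
    while True:
        # One full pass from the first to the final candidate (S9, S11).
        for name in candidates:
            if heard_ok(name):      # S16: recognized as "OK"
                return name         # S17: caller outputs the number
        m += 1                      # S51: final candidate reached
        if m >= max_rounds:         # S52 (equality handling is assumed)
            return None             # S54: clear results and end
        # S53: candidate rank is reset by restarting the for loop
```

For example, with `max_rounds=3` a user whose "OK" is lost to noise in the first round still gets two more passes through the candidate list.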

In the two embodiments above, the command words "off-hook" and "OK" are input by voice to go off-hook and to confirm, but other command words may be used. Alternatively, a function key for each command may be provided in the key input unit 34 or 9 of the master unit or the slave unit, so that going off-hook and confirming are performed by key input.

FIG. 5 shows part of the call processing procedure of the control unit 30 in still another embodiment of the present invention, and replaces FIG. 3 (II). The rest of the processing procedure follows FIG. 3 (I), (III) and (IV).

In FIG. 5, steps that are the same as in FIG. 3 (II) carry the same step numbers, and their description is omitted. The processing of this embodiment differs from that of FIG. 3 only in that steps S61 and S62 are added.

In this embodiment, if the confirmation command word "OK" is not input by voice for the call destination name that was output by voice, the next candidate is output, as in FIG. 3.

However, if an incorrect call destination name has been announced and the user speaks the desired call destination name again within a fixed time, the voice confirmation of the call destination name is automatically carried out again on the basis of that voice input (see steps S61 and S62).

That is, suppose the call destination name of, say, the first candidate is output by voice in step S11 but is not the one the user wants. When the user speaks the call destination name again, the voice pattern of that input is stored in the input voice pattern memory 45. The input voice pattern is then compared with all the command word standard patterns in the second standard pattern memory 44, and the similarities are determined (step S15).

Next, the highest similarity α among the similarities between the command word standard patterns and the input voice pattern is compared with a predetermined reference value αo for discriminating destination names from command words (step S61). Because call destination names and command words are acoustically distant from each other, when a call destination name is input by voice as described above, the similarity α between the input voice pattern and the command word standard patterns is lower than the reference value αo.

The process therefore proceeds from step S61 to step S62, where the contents of the recognition result memory 32 are cleared and the candidate rank n is reset. The process then returns to step S8, where the voice pattern of the call destination name stored in the input voice pattern memory 45 in step S15 is compared with all the destination standard patterns in the first standard pattern memory 43, and the call destination names are stored in the recognition result memory 32 in descending order of similarity. The processing from step S9 onward is then carried out.

If the call destination name output by voice in step S11 is the one the user wants and the user inputs the confirmation command word "OK", the similarity α to the command word standard patterns is higher than the reference value αo, so the process proceeds from step S61 to step S16. In step S16 the input voice is recognized as "OK", and the telephone number corresponding to the call destination name just output by voice is output (step S17).
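The step S61 decision amounts to a threshold test on the best command word score. The sketch below assumes the similarities from step S15 are available as a mapping from command word to score; the function name `route_reply`, the return labels, and the numeric values are illustrative only. How the case α = αo is handled is not stated in the text, so treating it as a command word is an assumption.

```python
from typing import Mapping, Optional, Tuple


def route_reply(command_scores: Mapping[str, float],
                alpha_0: float) -> Tuple[str, Optional[str]]:
    """Decide whether a reply was a command word or a new destination name.

    command_scores maps each command word to its similarity with the
    input voice pattern (the result of step S15). alpha_0 is the
    destination/command discrimination reference value.
    """
    best_word, alpha = max(command_scores.items(), key=lambda kv: kv[1])
    if alpha < alpha_0:
        # S61 -> S62: far from every command word, so treat the input as
        # a re-spoken destination name and re-run recognition from S8.
        return ("destination_name", None)
    # S61 -> S16: close enough to a command word to be checked as "OK".
    return ("command", best_word)
```

In practice αo would have to be tuned so that destination names reliably score below it against every command word pattern.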

FIGS. 6 (I) and (II) show part of the call processing procedure of the control unit 30 in still another embodiment of the present invention, and replace FIGS. 3 (I) and (II), respectively. The rest of the processing procedure follows FIGS. 3 (III) and (IV).

In FIG. 6, steps that are the same as in FIG. 3 carry the same step numbers, and their description is omitted.

In this embodiment, unlike FIG. 3, only the first-candidate call destination name is output once the call destination name has been recognized. The update of the candidate rank n in step S9 of FIG. 3 and the processing from step S22 onward for repeated output of destination names are therefore not executed. Steps S71 to S76 are added in this embodiment.

In the off-hook state, when the voice guidance "Please input the destination name" is given in step S6, timing of a predetermined period starts (step S71). If there is no voice input within this period (step S72), the off-hook state is released (step S73). The process then returns to step S1 and waits for voice input of the off-hook command word.

If there is a voice input within this period (step S7), the input voice pattern is compared with all the destination standard patterns in the first standard pattern memory 43 (step S8).

Next, the highest similarity β among the similarities between the destination standard patterns and the input voice pattern is compared with a predetermined noise discrimination reference value βo (step S74).

If the similarity β is smaller than the reference value βo, the voice input is judged to be due to noise; the off-hook state is released (step S73), and the process returns to step S1 and waits for voice input of the off-hook command word.

If the similarity β is larger than the reference value βo in step S74, the safety confirmation process is performed (step S10) and the call destination name with the highest similarity (the first candidate) is then output by voice (step S11). Timing of a predetermined period also starts (step S12).

If there is no voice input within this period (step S14), the off-hook state is released (step S75), and the process returns to step S1 and waits for voice input of the off-hook command word.

If there is a voice input within this period (step S13), the input voice pattern is compared with all the command word standard patterns in the second standard pattern memory 44 (step S15).

Next, the highest similarity β among the similarities between the command word standard patterns and the input voice pattern is compared with the predetermined noise discrimination reference value βo (step S76).

If the similarity β is smaller than the reference value βo, the voice input is judged to be due to noise; the off-hook state is released (step S75), and the process returns to step S1 and waits for voice input of the off-hook command word.

If the similarity β is larger than the reference value βo in step S76, it is determined whether the input voice is "OK" (step S16). If the input voice is not recognized as "OK", the off-hook state is released (step S75), and the process returns to step S1 and waits for voice input of the off-hook command word.

If the input voice is recognized as "OK" in step S16, the telephone number corresponding to the call destination name just output by voice is output (step S17).
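The FIG. 6 flow (timeout, noise rejection by βo, then confirmation) can be condensed into the sketch below. It assumes each utterance has already been scored against the relevant standard patterns, with `None` standing for "no voice input within the timed period"; `check_utterance`, `single_candidate_dial` and the `phone_book` mapping are hypothetical names. The safety confirmation (S10) and the voice output itself (S11) are left out for brevity.

```python
from typing import Mapping, Optional

Scores = Optional[Mapping[str, float]]


def check_utterance(scores: Scores, beta_0: float) -> Optional[str]:
    """Return the best-matching word, or None on timeout or noise."""
    if not scores:
        return None                 # S72 / S14: nothing heard in time
    word, beta = max(scores.items(), key=lambda kv: kv[1])
    if beta < beta_0:
        return None                 # S74 / S76: judged to be noise
    return word


def single_candidate_dial(name_scores: Scores,
                          reply_scores: Scores,
                          beta_0: float,
                          phone_book: Mapping[str, str]) -> Optional[str]:
    """Return the number to dial, or None if off-hook should be released."""
    name = check_utterance(name_scores, beta_0)    # S7, S8, S74
    if name is None:
        return None                                # S73, back to S1
    # Only the first candidate is announced here (S10, S11 omitted).
    reply = check_utterance(reply_scores, beta_0)  # S13, S15, S76
    if reply != "OK":                              # S16
        return None                                # S75, back to S1
    return phone_book[name]                        # S17
```

Every failure path returns `None`, matching the description in which the off-hook state is released and the device waits for the off-hook command word again.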

FIGS. 7 (I) and (II) show part of the call processing procedure of the control unit 30 in still another embodiment of the present invention, and replace FIGS. 3 (I) and (II), respectively. The rest of the processing procedure follows FIGS. 3 (III) and (IV).

In FIG. 7, steps that are the same as in FIG. 3 carry the same step numbers, and their description is omitted.

In the processing of FIG. 3, the call destination names are output by voice in order from the first candidate on the basis of the recognition result for the voice input. In this embodiment, the call destination name is not input by voice; instead, the call destination names are output by voice in a predetermined order. The wait for destination name input in step S7 of FIG. 3 and the speech recognition processing in step S8 are therefore not performed.

In step S9a, which corresponds to step S9 of FIG. 3, it is the predetermined order n that is updated rather than a candidate rank based on the speech recognition result. Accordingly, in step S11a, which corresponds to step S11 of FIG. 3, the n-th call destination name in the predetermined order is output by voice. In all other respects the processing is the same as in FIG. 3, so a detailed description is omitted.

In this embodiment too, the command words "off-hook" and "OK" are input by voice to go off-hook and to confirm, but other command words may be used. Alternatively, a function key for each command may be provided in the key input unit 34 or 9 of the master unit or the slave unit, so that going off-hook and confirming are performed by key input.

In all the embodiments described above, the device may go off-hook automatically when the voice input device (handset or microphone) is lifted from its rest position, as with an ordinary telephone, or it may be made to go off-hook by key input or voice input while the microphone or handset remains in its rest position.

All the embodiments above apply the present invention to a wireless auto-dial telephone, but the invention can of course also be applied to an auto-dial telephone that is not wireless.

Further, the above description covers a configuration in which the synthesized voice generated by the voice synthesis unit 28 announces the initial guidance "Please input the destination name", the call destination names that are the plural recognition candidates, and the guidance "Do you want to output the destination names again?". A configuration in which these are instead shown on a display is also possible.

(G) Effects of the Invention

As is clear from the above description, in the speech recognition method of the present invention, a command word such as "OK" uttered for the desired recognition candidate during the first round of notification of the plural recognition candidates may fail to be recognized because of ambient noise. If the same command word, such as "OK", is then recognized in response to the guidance voice asking whether the recognition candidates should be notified again, this can be taken to indicate that the command word is also likely to be recognized during the second round of notification. In a speech recognition system that adopts this method, the recognition of the command word therefore serves both as a test of whether the word itself can be recognized and as the command to notify the recognition candidates again, which greatly reduces the operational burden on the speaker.

[Brief description of the drawings]

FIGS. 1 and 2 are block diagrams showing the electrical configuration of the master unit and the slave unit of a wireless auto-dial telephone to which the speech recognition method of the present invention is applied, and FIGS. 3 to 7 are processing flowcharts.

1: microphone, 3: transmitter, 4: coupler, 5: antenna, 21: antenna, 22: coupler, 23: receiver, 25: speech recognition unit, 26: line control unit, 27: loop detection unit, 31: recognition result memory, 33: counter, 46: identification unit.

Continuation of the front page

(56) References: JP-A-62-81152 (JP, A); JP-A-64-81998 (JP, A); JP-A-63-118198 (JP, A); JP-A-61-73998 (JP, A); JP-A-60-241128 (JP, A); JP-A-1-166100 (JP, A); JP-A-1-154100 (JP, A); Japanese Patent No. 2646080 (JP, B2); JP-B-6-8999 (JP, B2); JP-B-8-33741 (JP, B2); JP-B-1-43960 (JP, B2); JP-B-6-34188 (JP, B2); WO 89/4035 (WO, A1)

(58) Fields searched (Int. Cl.⁶, DB name): G10L 3/00 561

Claims

(57) [Claims]

1. A speech recognition method comprising: a first speech recognition processing step of recognizing speech uttered by a speaker to obtain a plurality of recognition candidates; a recognition candidate notification processing step of sequentially notifying the speaker of the plurality of recognition candidates obtained by the first speech recognition processing; a second speech recognition processing step of recognizing a command word voice that the speaker utters, when a specific recognition candidate is notified in the recognition candidate notification processing, to indicate that this candidate corresponds to the speaker's utterance in the first speech recognition processing; a guidance notification processing step of, when the command word voice has not been recognized by the second speech recognition processing by the time notification of all the candidates in the recognition candidate notification processing is completed, issuing guidance that asks the speaker whether to execute the recognition candidate notification processing again; a third speech recognition processing step of recognizing a command word voice that the speaker utters in response to the guidance in the guidance notification processing and that is the same as the command word voice in the second speech recognition processing; and a recognition candidate re-notification processing step of executing the recognition candidate notification processing again when the command word voice is recognized in the third speech recognition processing.