JP3018759B2

JP3018759B2 - Specific speaker type speech recognition device

Info

Publication number: JP3018759B2
Application number: JP19604492A
Authority: JP
Inventors: 英人袋井
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1992-06-30
Filing date: 1992-06-30
Publication date: 2000-03-13
Anticipated expiration: 2015-03-13
Also published as: JPH0619493A

Abstract

PURPOSE:To register speech data, required for specified speaker system speech recognition, only by a speech by utilizing the unspecified speaker system speech recognition. CONSTITUTION:A file 6 for unspecified speakers contains speech data B1-Bm by the unspecified speaker system as to predetermined speeches corresponding to necessary indications to be given to the device side from the outside when speech data A1-An are registered in a file 5 for unspecified speakers. When a user inputs a predetermined speech to a microphone 7, an unspecified speaker system speech recognition part 3 recognizes speech data appearing at the output of an A/D converter 9 on the basis of the speech data B1-Bm registered in the unspecified speaker file 6 and a control part 1 controls the registration of the speech data A1-An in the unspecified speaker file 5 according to the recognition result. Further, an answer from the device side is vocalized through a speech synthesis part 4 and a speaker 10.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は特定話者方式音声認識装
置に関し、特に特定話者の音声データを登録する操作を
不特定話者方式による音声認識にて音声で行う特定話者
方式音声認識装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a specific speaker type speech recognition apparatus, and more particularly to a specific speaker type speech recognition in which an operation for registering speech data of a specific speaker is performed by voice recognition using an unspecified speaker type. Related to the device.

【０００２】[0002]

【従来の技術】音声認識装置は、予め音声をデータ化し
たファイルを持ち、外部より入力されてくる音声が、そ
のファイルの中のデータと一致するか否かを判別するも
のであり、話し手を特定した特定話者方式と話し手を特
定しない不特定話者方式との２つの方式がある。2. Description of the Related Art A voice recognition apparatus has a file in which voice is converted into data in advance, and determines whether or not voice input from the outside matches data in the file. There are two methods, a specified specific speaker method and an unspecified speaker method in which the speaker is not specified.

【０００３】不特定話者方式は、音声データを広く一般
の使用者の音声の中から登録し、音声認識を行わせるも
のである。この方式では、予め装置にファイル化されて
いる言葉であれば、誰の音声に対しても応答可能である
ことが特徴となっている。In the unspecified speaker system, voice data is registered from a wide range of general users' voices, and voice recognition is performed. This system is characterized in that it can respond to anyone's voice as long as the words are filed in the device in advance.

【０００４】これに対し、特定話者方式は、その音声の
データファイルを特定の使用者が事前に自分の音声で装
置に登録し、音声認識を行わせるものである。この方式
では、使用者が登録する言葉を自由に選択できるのが特
徴である。On the other hand, in the specific speaker system, a specific user registers a data file of the voice in the apparatus in advance with his / her own voice, and performs voice recognition. This system is characterized in that the user can freely select words to be registered.

【０００５】従って、相手先の名前等のキーワードを発
声すれば予めそのキーワードに対応して登録しておいた
ダイヤル番号に自動的に発呼する音声ダイヤル機能付き
自動車電話等の如く、登録音声を任意なものとすること
が必要な装置では、特定話者方式が一般に採用されてい
る。Therefore, when a keyword such as the name of the other party is uttered, the registered voice is automatically transmitted to a dial number registered in advance corresponding to the keyword, such as a car telephone with a voice dial function. In a device that needs to be optional, a specific speaker system is generally adopted.

【０００６】[0006]

【発明が解決しようとする課題】このように特定話者方
式は、不特定話者方式にない優れた利点を有している
が、前述したように使用者が事前に自分の音声をデータ
として登録しなければならない。そして、従来は、この
音声データを登録するのに、音声認識装置にキーボード
等の操作部が必要であり、使用者は自ら複雑なキー入力
操作を行う必要があった。As described above, the specific speaker system has an excellent advantage that is not provided by the unspecified speaker system. You must register. Conventionally, in order to register the voice data, an operation unit such as a keyboard is required in the voice recognition device, and the user has to perform a complicated key input operation by himself.

【０００７】このため複雑なキー入力操作を行うことに
よる操作性の悪さ、及び操作部のコスト，スペースが必
要になるという問題点があった。For this reason, there have been problems that the operability is poor due to the complicated key input operation, and that the cost and space of the operation unit are required.

【０００８】本発明は、この様な点に鑑みてなされたも
のであり、特定話者方式の音声認識に必要な音声データ
の登録に不特定話者方式の音声認識を利用することによ
り、特定話者音声データの登録操作を複雑なキー入力操
作を行わずに音声で行えるようにした特定話者方式音声
認識装置を提供することを目的とする。SUMMARY OF THE INVENTION The present invention has been made in view of the above points, and uses an unspecified speaker type speech recognition for registration of speech data necessary for a specific speaker type speech recognition. An object of the present invention is to provide a specific-speaker-type speech recognition device that can register speaker voice data by voice without performing complicated key input operations.

【０００９】[0009]

【課題を解決するための手段】本発明は上記の目的を達
成するために、音声ダイヤル機能付き無線電話における
特定話者方式音声認識装置において、ダイヤル番号と特
定話者音声データとの組を複数格納し得る特定話者用フ
ァイルと、マイクロフォンを通じて外部より入力された
音声の認識を前記特定話者用ファイルに登録された特定
話者音声データに基づいて行い、認識した特定話者音声
に対応するダイヤル番号を出力する特定話者方式音声認
識部と、前記特定話者音声データを前記特定話者用ファ
イルへ登録する際に外部より装置側に対し与える必要の
ある指示毎に、予め定められた音声についての不特定話
者方式による不特定話者音声データを記憶する不特定話
者用ファイルと、前記マイクロフォンを通じて外部より
入力された音声の認識を前記不特定話者用ファイルに登
録された不特定話者音声データに基づいて行い、認識し
た不特定話者音声に対応する指示を出力する不特定話者
方式音声認識部と、前記マイクロフォンを通じて外部よ
り音声が入力されたとき、前記特定話者方式音声認識部
からダイヤル番号が出力された場合はそのダイヤル番号
を用いて発呼を行い、前記不特定話者方式音声認識部か
ら指示が出力された場合は音声による暗証コードの入力
とその照合処理を行って照合がとれた場合に限り、その
指示に従って特定話者方式による特定話者音声データの
前記特定話者用ファイルへの登録を制御する制御部とを
備えている。SUMMARY OF THE INVENTION In order to achieve the above object, the present invention provides a specific speaker type speech recognition apparatus for a radio telephone with a voice dial function, wherein a plurality of pairs of a dial number and specific speaker voice data are provided. Recognition of the specific speaker file that can be stored and the voice input from the outside through the microphone are performed based on the specific speaker voice data registered in the specific speaker file and correspond to the recognized specific speaker voice. A specific speaker type speech recognition unit that outputs a dial number, and a predetermined instruction for each instruction that needs to be given to the device side from the outside when registering the specific speaker voice data in the specific speaker file. An unspecified speaker file storing unspecified speaker voice data according to the unspecified speaker method for the voice, and a file of an externally input voice through the microphone. An unspecified speaker type speech recognition unit that performs recognition based on unspecified speaker voice data registered in the unspecified speaker file, and outputs an instruction corresponding to the recognized unspecified speaker voice; and the microphone When a voice is input from the outside through the specific speaker system voice recognition unit and a dial number is output, a call is made using the dial number, and an instruction is issued from the unspecified speaker system voice recognition unit. If it is output , input the password by voice
And a control unit for controlling registration of specific speaker voice data in the specific speaker file in accordance with the instruction according to the instruction only when the collation processing is performed and the collation is performed .

【００１０】また、音声合成部と該音声合成部で合成さ
れた音声を出力するスピーカとを備え、前記制御部は、
特定話者音声データの前記特定話者用ファイルへの登録
制御に関する装置側からの応答を前記音声合成部および
前記スピーカを通じて音声にて行うようにしている。[0010] The apparatus further includes a voice synthesizing unit and a speaker for outputting a voice synthesized by the voice synthesizing unit.
A response from the device regarding registration control of the specific speaker voice data to the specific speaker file is made by voice through the voice synthesis unit and the speaker.

【００１１】[0011]

【作用】本発明の特定話者方式音声認識装置において
は、不特定話者用ファイルが、特定話者音声データを特
定話者用ファイルへ登録する際に外部より装置側に対し
与える必要のある指示に対応する予め定められた音声に
ついての不特定話者方式による不特定話者音声データを
保持しており、利用者が予め定められた音声をマイクロ
フォンに入力すると、不特定話者方式音声認識部が、マ
イクロフォンを通じて入力したその音声を不特定話者用
ファイルに登録された不特定話者音声データに基づいて
認識し、制御部が、この認識結果に基づいて特定話者方
式による特定話者音声データの特定話者用ファイルへの
登録を制御する。また、装置側からの応答を音声合成部
およびスピーカを通じて音声にて行う。In the specific speaker system speech recognition apparatus of the present invention, the unspecified speaker file needs to be provided from the outside to the apparatus when registering the specific speaker voice data in the specific speaker file. It holds unspecified speaker voice data in the unspecified speaker system for the predetermined voice corresponding to the instruction, and when the user inputs the predetermined voice to the microphone, the unspecified speaker system voice recognition is performed. The unit recognizes the voice input through the microphone based on the unspecified speaker voice data registered in the unspecified speaker file, and the control unit controls the specific speaker based on the specific speaker method based on the recognition result. Controls registration of voice data to a specific speaker file. In addition, a response from the device is made by voice through a voice synthesis unit and a speaker.

【００１２】[0012]

【実施例】次に、本発明の実施例について図面を参照し
て詳細に説明する。Next, embodiments of the present invention will be described in detail with reference to the drawings.

【００１３】図１は本発明の一実施例の構成図であり、
音声ダイヤル機能付き自動車電話に本発明を適用したも
のである。FIG. 1 is a block diagram of an embodiment of the present invention.
The present invention is applied to a car telephone with a voice dial function.

【００１４】前述したように、音声ダイヤル機能付き自
動車電話は、相手先の名前等のキーワードを発声すれば
予めそのキーワードに対応して登録されたダイヤル番号
に自動的に発呼するものであり、登録する相手先の名前
等のキーワードやダイヤル番号は利用者毎に相違するの
で、特定話者方式が適している。As described above, a car telephone with a voice dialing function automatically calls a dial number registered in advance in response to a keyword such as the name of the other party when the keyword is spoken. Keywords such as the name of the destination to be registered and dial numbers differ for each user, so the specific speaker system is suitable.

【００１５】この実施例の装置は、装置全体の主たる制
御を行う制御部１と、利用者の音声を入力するマイクロ
フォン７と、その出力を増幅するアンプ８と、その出力
をディジタル化するＡ／Ｄ変換器９と、ダイヤル番号Ｅ
１〜Ｅｎおよびそれらに対応する音声データＡ１〜Ａｎ
を記憶する特定話者用ファイル５と、Ａ／Ｄ変換器９の
出力に現れる音声データに対する音声認識を特定話者用
ファイル５に記憶された音声データＡ１〜Ａｎに基づい
て行い、一致した即ち認識した音声データＡｉに対応す
るダイヤル番号Ａｉを制御部１に出力する特定話者方式
音声認識部２と、指示・登録ｅ１〜ｅｍおよびそれらに
対応する音声データＢ１〜Ｂｍを記憶する不特定話者用
ファイル６と、Ａ／Ｄ変換器９の出力に現れる音声デー
タに対する音声認識を不特定話者用ファイル６に記憶さ
れた音声データＢ１〜Ｂｍに基づいて行い、一致した音
声データＢｊに対応する指示・登録ｅｊを制御部１に出
力する不特定話者方式音声認識部３と、制御部１から指
示された音声を合成する音声合成部４と、その出力をア
ナログ信号に変換するＤ／Ａ変換器１２と、その出力を
増幅するアンプ１１と、その出力を音声に変換するスピ
ーカ１０と、アンテナ１４を通じて無線により図示しな
い基地局と信号の授受を行う送受信機１３とを備えてい
る。The apparatus of this embodiment has a control unit 1 for performing main control of the entire apparatus, a microphone 7 for inputting a user's voice, an amplifier 8 for amplifying the output, and an A / A for digitizing the output. D converter 9 and dial number E
1 to En and their corresponding audio data A1 to An
Is recognized based on the voice data A1 to An stored in the specific speaker file 5 and the voice data appearing in the output of the A / D converter 9 is stored in the file 5 for the specific speaker. A specific speaker type voice recognition unit 2 that outputs a dial number Ai corresponding to the recognized voice data Ai to the control unit 1, and an unspecified voice that stores instructions / registrations e1 to em and voice data B1 to Bm corresponding thereto. The speech recognition for the speaker file 6 and the speech data appearing at the output of the A / D converter 9 is performed based on the speech data B1 to Bm stored in the unspecified speaker file 6 and corresponds to the matched speech data Bj. Speaker / speech recognition unit 3 that outputs an instruction / registration ej to the control unit 1, a voice synthesis unit 4 that synthesizes the voice specified by the control unit 1, and converts the output to an analog signal A D / A converter 12, an amplifier 11 for amplifying the output, a speaker 10 for converting the output to a voice, and a transceiver 13 for transmitting and receiving a signal to and from a base station (not shown) wirelessly via an antenna 14. ing.

【００１６】図２〜図４は制御部１の処理の一例を示す
フローチャートであり、以下各図を参照して本実施例の
動作を説明する。FIGS. 2 to 4 are flowcharts showing an example of the processing of the control unit 1. The operation of this embodiment will be described below with reference to the drawings.

【００１７】なお、不特定話者用ファイル６の音声デー
タＢ１〜Ｂｍには、例えば「登録開始」という音声，
「登録終了」という音声，数字の「０」から「９」まで
の音声に対応する音声データが記憶されており、それら
に対応する指示・登録ｅ１〜ｅｍの内容は、登録要求の
指示，登録終了の指示，数字の０から９までの数値とす
る。The voice data B1 to Bm of the unspecified speaker file 6 include, for example, a voice of "registration start",
The voice data corresponding to the voice of “registration end” and voices of numbers “0” to “9” are stored, and the contents of the instructions / registrations e1 to em corresponding to the voices are the instruction and registration of the registration request. End instruction, number from 0 to 9

【００１８】図１に示す装置の電源が投入されると、制
御部１は、図２に示す処理を開始し、マイクロフォン７
から音声が入力されたか否かをＡ／Ｄ変換器９の出力に
よって監視する（Ｓ１）。そして、何等かの音声が入力
されると、不特定話者方式音声認識部３から登録要求が
出力されたか否か（Ｓ２）、特定話者方式音声認識部２
からダイヤル番号が出力されてダイヤル要求されたか否
かを判定し（Ｓ３）、何れでもない場合にはノイズ或い
は誤入力として処理Ｓ１に戻り、不特定話者方式音声認
識部３から登録要求が出力されていれば処理Ｓ５へ移行
し、特定話者方式音声認識部２からダイヤル番号が出力
されていれば処理Ｓ４へ進む。When the power supply of the apparatus shown in FIG. 1 is turned on, the control unit 1 starts the processing shown in FIG.
It is monitored by the output of the A / D converter 9 whether or not a voice has been input from (S1). Then, when any voice is input, whether or not a registration request is output from the unspecified speaker system speech recognition unit 3 (S2), the specific speaker system speech recognition unit 2
It is determined whether or not a dial number has been output and a dial request has been made (S3). If not, the process returns to step S1 as noise or erroneous input, and a registration request is output from the unspecified speaker system voice recognition unit 3. If the dial number has been output from the specific speaker system voice recognition unit 2, the process proceeds to step S5.

【００１９】今、利用者がキーワードの音声データおよ
びそれに対応するダイヤル番号を不特定話者用ファイル
５に登録するために、「登録開始」なる音声を発したと
すると、それが不特定話者方式音声認識部３にて認識さ
れて登録要求が制御部１に出力されるため、制御部１は
処理Ｓ２から処理Ｓ５以降の登録処理へ進むことにな
る。Now, if the user utters a "registration start" voice in order to register the voice data of the keyword and the corresponding dial number in the file 5 for an unspecified speaker, it is assumed that the voice is "unregistered speaker". Since the registration request is recognized and output to the control unit 1 by the system voice recognition unit 3, the control unit 1 proceeds from the processing S2 to the registration processing of the processing S5 and thereafter.

【００２０】なお、第三者による音声データの不正登録
を防止するために、処理Ｓ２と処理Ｓ５との間に、音声
による暗証コードの入力とその照合処理とを追加し、照
合がとれた場合に限り処理Ｓ５以降の登録処理へ進むよ
うにしても良い。In order to prevent unauthorized registration of voice data by a third party, an input of a personal identification code by voice and a verification process thereof are added between the processing S2 and the processing S5. The process may proceed to the registration process after the process S5.

【００２１】制御部１は処理Ｓ５において、例えば「登
録番地を指定して下さい」といった登録番地指定促進メ
ッセージ音声を音声合成部４で合成させ、スピーカ１０
から発生させる。そして、音声入力を待つ（Ｓ６）。In step S5, the control unit 1 synthesizes a registered address designation prompt message such as "Please specify a registered address" in the voice synthesizing unit 4, and the speaker 10
Generate from. Then, it waits for a voice input (S6).

【００２２】上記のメッセージ音声に応答して、利用者
が例えば「１」と発声すると、それが不特定話者方式音
声認識部３にて認識され、制御部１へ数値１が出力され
る。制御部１はこの数値「１」を登録番地の指定値とし
て認識し（Ｓ７でＹＥＳ）、内部変数ｉにこの数値
「１」を設定し（Ｓ９）、処理Ｓ１０へ進む。When the user utters, for example, "1" in response to the above-mentioned message voice, it is recognized by the unspecified speaker system voice recognition unit 3 and a numerical value 1 is output to the control unit 1. The control unit 1 recognizes this numerical value “1” as the designated value of the registration address (YES in S7), sets this numerical value “1” in the internal variable i (S9), and proceeds to processing S10.

【００２３】制御部１は処理Ｓ１０において、例えば
「ダイヤル番号を入力して下さい」といったダイヤル番
号入力促進メッセージ音声を音声合成部４で合成させて
スピーカ１０から発生させ、音声入力を待つ（Ｓ１
１）。In step S10, the control unit 1 synthesizes a voice of a dial number input prompt message such as "Please input a dial number" in the voice synthesizing unit 4, generates it from the speaker 10, and waits for voice input (S1).
1).

【００２４】次に利用者がダイヤル番号として例えば
「０４５９３９２３５４」を発声すると、その各々の数
値が不特定話者方式音声認識部３にて認識され、それら
の数値が制御部１へ出力されるので、制御部１はダイヤ
ル番号として必要な形式（例えば桁数等）が整っていれ
ばこれらの数値をダイヤル番号として認識し（Ｓ１２で
ＹＥＳ）、このダイヤル番号を内部変数ｉが示す登録番
地のダイヤル番号Ｅｉとして特定話者用ファイル５に格
納する（Ｓ１３）。ダイヤル番号としての形式を有して
いないデータの場合は、ノイズ／誤入力として処理し
（Ｓ１２でＮＯ）、処理Ｓ１１に戻って音声入力を再度
受け付ける。Next, when the user utters, for example, “0459392354” as a dial number, the respective numerical values are recognized by the unspecified speaker system voice recognition unit 3, and these numerical values are output to the control unit 1. If the format (for example, the number of digits) necessary for the dial number is in order, the control unit 1 recognizes these numerical values as the dial number (YES in S12), and dials the dial number at the registered address indicated by the internal variable i. The number Ei is stored in the specific speaker file 5 (S13). If the data does not have a format as a dial number, it is processed as noise / erroneous input (NO in S12), and the process returns to step S11 to accept a voice input again.

【００２５】なお、認識したダイヤル番号を音声合成部
４で音声に変換し、それをスピーカ１０から発生させ、
利用者に確認させる処理を付加するようにしても良い。The recognized dial number is converted into a voice by the voice synthesizing unit 4 and generated from the speaker 10.
A process for allowing the user to confirm may be added.

【００２６】次に制御部１は、例えば「キーワードを入
力して下さい」といったキーワード入力促進メッセージ
音声を音声合成部４で合成してスピーカ１０から発生さ
せ（Ｓ１４）、音声入力を待つ（Ｓ１５）。Next, the control unit 1 synthesizes the voice of a keyword input prompting message such as "Please input a keyword" by the voice synthesizing unit 4 and generates it from the speaker 10 (S14), and waits for voice input (S15). .

【００２７】このメッセージ音声に応答して利用者が例
えば「ふくろい」と発生すると、制御部１はＡ／Ｄ変換
器９の出力に現れる信号を処理して得た音声データを内
部変数ｉが示す登録番地の音声データＡｉとして特定話
者用ファイル５に格納する（Ｓ１６）。そして、処理Ｓ
５に戻る。When the user generates, for example, “Fukuro” in response to the message voice, the control unit 1 processes the signal appearing at the output of the A / D converter 9 and stores the voice data obtained by processing the internal variable i as an internal variable i. It is stored in the specific speaker file 5 as the voice data Ai of the registered address shown (S16). And processing S
Return to 5.

【００２８】以上の一連の動作で、特定話者用ファイル
５の登録番地「１」に、ダイヤル番号「０４５９３９２
３５４」およびキーボード「ふくろい」という音声にか
かる音声データが登録されたことになる。In the above series of operations, the dial number “059392” is added to the registered address “1” of the specific speaker file 5.
354 "and the keyboard" Fukuroi "are registered.

【００２９】利用者は以上のような操作を繰り返すこと
により、特定話者用ファイル５に必要なだけのダイヤル
番号とキーワードに対応する音声データとを登録する。
そして、最後に「登録終了」と発生すると、それが不特
定話者方式音声認識部３で認識されて制御部１に登録終
了の指示が出力され、制御部１はこれを処理Ｓ８で認識
し、処理Ｓ１に戻る。なお、処理Ｓ６において音声入力
が検出されたが、登録番地の指定や登録終了の指定でな
い場合は、処理Ｓ６に戻って音声入力を再度受け付け
る。The user repeats the above operation to register the necessary dial numbers and the voice data corresponding to the keywords in the specific speaker file 5.
Then, when "registration end" occurs at last, it is recognized by the unspecified speaker system voice recognition unit 3 and an instruction of registration end is output to the control unit 1, and the control unit 1 recognizes this in the processing S8. The process returns to step S1. It should be noted that if a voice input is detected in the process S6, but the registration address is not specified or the registration end is not specified, the process returns to the process S6 to receive the voice input again.

【００３０】さて、特定話者用ファイル５に必要なデー
タが登録された後、利用者が自ら登録したキーワードを
発声すると、それが特定話者方式音声認識部２で認識さ
れ、制御部１へ認識したキーワードに対応するダイヤル
番号が出力される。After the necessary data is registered in the specific speaker file 5, when the user utters the registered keyword, the specific speaker system voice recognition unit 2 recognizes the keyword and sends the keyword to the control unit 1. The dial number corresponding to the recognized keyword is output.

【００３１】制御部１はこれを処理Ｓ３で認識し、送受
信機１３およびアンテナ１４を通じて図示しない基地局
に対し、入力されたダイヤル番号を用いた発呼を行う
（Ｓ４）。The controller 1 recognizes this in step S3, and makes a call to the base station (not shown) through the transceiver 13 and the antenna 14 using the input dial number (S4).

【００３２】以上のように、本実施例では、不特定話者
方式の音声認識機能を使用して、ダイヤル番号およびそ
れに対応する音声データの登録を、音声だけで行えるよ
うにしている。自動車電話の場合、運転中にキーボード
等を操作して音声データ等を登録することは安全面から
著しく困難であったが、本実施例の音声ダイヤル機能付
き自動車電話によれば、運転中であっても登録が可能と
なるのでその実用的価値は非常に大きいと言える。As described above, in the present embodiment, the dial number and the corresponding voice data can be registered only by voice using the voice recognition function of the unspecified speaker system. In the case of a car phone, it is extremely difficult to register voice data and the like by operating a keyboard or the like during driving, but according to the car phone with a voice dial function of the present embodiment, it is difficult to register while driving. However, since registration is possible, its practical value is very large.

【００３３】[0033]

【発明の効果】以上説明したように本発明の特定話者方
式音声認識装置は、特定話者方式の音声認識に必要な音
声データの登録操作を音声にて行うことができるので、
複雑なキー操作が不要となり、操作性が向上する。As described above, the specific speaker system speech recognition apparatus of the present invention can perform the registration operation of the speech data required for the specific speaker system speech recognition by voice.
No complicated key operation is required, and operability is improved.

【００３４】また、キーボード等の操作部を省略あるい
は簡略化することができ、更に音声入力用のマイクロフ
ォンは特定話者方式の音声認識用として既に存在するも
のを利用するので、装置の小型化，低価格化が達成でき
る。Further, an operation unit such as a keyboard can be omitted or simplified, and a microphone for voice input that already exists for voice recognition of a specific speaker system is used. Low price can be achieved.

【００３５】更に、装置側からの応答を音声で行う構成
では、応答を出力する表示部等が不要となり、且つ、登
録も対話形式で行うことができる。Further, in the configuration in which the response from the device side is made by voice, a display unit for outputting the response becomes unnecessary, and the registration can be performed in an interactive manner.

[Brief description of the drawings]

【図１】本発明の一実施例の構成図である。FIG. 1 is a configuration diagram of an embodiment of the present invention.

【図２】制御部の処理例の一部を示すフローチャートで
ある。FIG. 2 is a flowchart illustrating a part of a processing example of a control unit;

【図３】制御部の処理例の他の部分を示すフローチャー
トである。FIG. 3 is a flowchart illustrating another part of the processing example of the control unit.

【図４】制御部の処理例の残りの部分を示すフローチャ
ートである。FIG. 4 is a flowchart illustrating a remaining part of the processing example of the control unit.

[Explanation of symbols]

１…制御部２…特定話者方式音声認識部３…不特定話者方式音声認識部４…音声合成部５…特定話者用ファイル６…不特定話者用ファイル７…マイクロフォン８…アンプ９…Ａ／Ｄ変換器１０…スピーカ１１…アンプ１２…Ｄ／Ａ変換器１３…送受信機１４…アンテナ DESCRIPTION OF SYMBOLS 1 ... Control part 2 ... Specific speaker system speech recognition part 3 ... Unspecified speaker system speech recognition part 4 ... Speech synthesis part 5 ... File for specific speaker 6 ... File for unspecified speaker 7 ... Microphone 8 ... Amplifier 9 ... A / D converter 10 ... Speaker 11 ... Amplifier 12 ... D / A converter 13 ... Transceiver 14 ... Antenna

フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩＨ０４Ｑ 7/38 Ｇ１０Ｌ 3/00 ５５１ＡＨ０４Ｂ 7/26 １０９Ｑ (56)参考文献特開平４−238398（ＪＰ，Ａ) 特開平２−312426（ＪＰ，Ａ) 特開平１−97044（ＪＰ，Ａ) 特開昭63−8798（ＪＰ，Ａ) 特開昭63−32596（ＪＰ，Ａ) 特開昭59−177599（ＪＰ，Ａ) 特開平３−157696（ＪＰ，Ａ) 実開昭59−61658（ＪＰ，Ｕ) 特公平３−76759（ＪＰ，Ｂ２) 特公平７−40704（ＪＰ，Ｂ２) (58)調査した分野(Int.Cl.⁷，ＤＢ名) G10L 15/00 - 15/28 H04M 1/27 H04Q 7/38 Continuation of the front page (51) Int.Cl. ⁷ identification symbol FI H04Q 7/38 G10L 3/00 551A H04B 7/26 109Q (56) References JP-A-4-238398 (JP, A) JP-A-2- 312426 (JP, A) JP-A-1-97044 (JP, A) JP-A-63-8798 (JP, A) JP-A-63-32596 (JP, A) JP-A-59-177599 (JP, A) JP-A-3-157696 (JP, A) JP-A-59-61658 (JP, U) JP-B 3-76759 (JP, B2) JP-B 7-40704 (JP, B2) (58) (Int.Cl. ⁷ , DB name) G10L 15/00-15/28 H04M 1/27 H04Q 7/38

Claims

(57) [Claims]

1. A specific speaker type speech recognition device for a radio telephone with a voice dial function, comprising: a specific speaker file capable of storing a plurality of sets of dial numbers and specific speaker voice data; A specific speaker type voice recognition unit that performs recognition of the recognized voice based on the specific speaker voice data registered in the specific speaker file and outputs a dial number corresponding to the recognized specific speaker voice; When registering the speaker voice data in the specific speaker file, for each instruction that needs to be given from the outside to the device side, unspecified speaker voice data based on a predetermined speaker method for a predetermined voice is An unspecified speaker file to be stored, and an unspecified speaker registered in the unspecified speaker file for recognizing speech input from outside through the microphone An unspecified speaker type speech recognition unit that performs based on speaker audio data and outputs an instruction corresponding to the recognized unspecified speaker voice; and when a voice is input from outside through the microphone, the specific speaker type speech If the dial number is outputted from the recognition unit performs a call using the dialed number, wherein when the instruction from the unspecified speaker system voice recognition unit is output to the input of the PIN code by voice the verification process
And a controller for controlling the registration of the specific speaker voice data in the specific speaker file in accordance with the instruction only when the verification is performed. Method speech recognition device.

2. A device comprising: a voice synthesizer; and a speaker for outputting a voice synthesized by the voice synthesizer, wherein the controller is configured to control registration of specific speaker voice data in the specific speaker file. 2. The specific speaker type speech recognition device according to claim 1, wherein a response from the speaker is made by speech through the speech synthesis unit and the speaker.