JPS63183498A

JPS63183498A - Registration type voice input/output device

Info

Publication number: JPS63183498A
Application number: JP62016973A
Authority: JP
Inventors: 北野　正明; 正宏浜田; 博之直野
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1987-01-27
Filing date: 1987-01-27
Publication date: 1988-07-28

Abstract

(57) [Abstract] This publication contains application data from before electronic filing, so no abstract data is recorded.

Description

Detailed Description of the Invention

Field of Industrial Application

The present invention relates to a registration-type voice input/output device used for issuing commands to various devices by voice.

Prior Art

In recent years, with advances in speech information processing such as voice recognition and voice synthesis, and in LSI technology, voice recognition devices and voice synthesis devices have begun to be used in industrial and consumer equipment, and registration-type voice input/output devices that combine a voice recognition device with a voice synthesis device have also begun to be put into practical use.

An example of a conventional registration-type voice input/output device is described below with reference to the drawings.

FIG. 2 is a block diagram of a conventional registration-type voice input/output device.

In FIG. 2, 1 is a microphone that converts the air vibrations of speech into an electrical signal; 2 is a voice section detection means that detects the voice section of the electrical speech signal (hereinafter, "voice signal") input through the microphone 1; and 3 is an LPC cepstrum analysis means that converts the voice signal in the voice section detected by the voice section detection means 2 into LPC cepstrum coefficients, which are one kind of speech feature parameter. S1 is a recognition/registration changeover switch that sends the LPC cepstrum coefficients produced by the LPC cepstrum analysis means 3 to the pattern matching means 4 (described later) during recognition and to the standard pattern storage RAM 5 (described later) during registration. 5 is the standard pattern storage RAM, which stores the voice signal input through the microphone 1 at registration after it has been converted into LPC cepstrum coefficients.
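The source does not describe how the LPC cepstrum analysis means 3 computes its coefficients. As a minimal sketch only, the following shows the commonly used recursion that derives cepstrum coefficients from LPC predictor coefficients, assuming the all-pole model H(z) = 1 / (1 - Σ a_k z^-k). The function name `lpc_to_cepstrum`, the sign convention, and the number of coefficients are assumptions for illustration, not details taken from the patent.

```python
def lpc_to_cepstrum(a, n_ceps):
    """Convert LPC predictor coefficients a_1..a_p into cepstrum c_1..c_n.

    Assumed model (not stated in the source): H(z) = 1 / (1 - sum_k a_k z^-k).
    `a` holds a_1..a_p at indices 0..p-1.
    """
    p = len(a)
    c = []
    for n in range(1, n_ceps + 1):
        # The direct term a_n exists only while n <= p.
        acc = a[n - 1] if n <= p else 0.0
        # Recursive term: sum over k of (k/n) * c_k * a_(n-k), with 1 <= n-k <= p.
        for k in range(max(1, n - p), n):
            acc += (k / n) * c[k - 1] * a[n - k - 1]
        c.append(acc)
    return c


# For a first-order model the cepstrum has the closed form c_n = a^n / n.
coeffs = lpc_to_cepstrum([0.5], 4)
```

The first-order case is a convenient sanity check because its cepstrum is known in closed form.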

4 is a pattern matching means which, during recognition, obtains the degree of similarity between the LPC cepstrum coefficients of the voice signal input through the microphone 1 and the LPC cepstrum coefficients of each standard pattern stored in the standard pattern storage RAM 5, and takes the most similar pattern as the recognition result. The microphone 1, voice section detection means 2, LPC cepstrum analysis means 3, pattern matching means 4, standard pattern storage RAM 5, and recognition/registration changeover switch S1 together constitute the registration-type voice recognition device.
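The pattern matching step described above can be sketched as a nearest-template search. The source does not state which similarity measure is used; the sketch below assumes fixed-length feature vectors and treats a smaller Euclidean distance as a higher similarity. Real utterances vary in length, so a practical system would need some form of time alignment, which is omitted here. The names `match` and `standard_patterns` are hypothetical.

```python
import math


def match(input_pattern, standard_patterns):
    """Return the label of the stored pattern closest to the input.

    `standard_patterns` maps a label to a feature vector (an assumed layout
    standing in for the standard pattern storage RAM 5).
    """
    if not standard_patterns:
        raise ValueError("no standard patterns registered")
    # Smaller distance is taken to mean higher similarity (an assumption).
    return min(
        standard_patterns,
        key=lambda label: math.dist(input_pattern, standard_patterns[label]),
    )


patterns = {"one": [1.0, 0.0], "two": [0.0, 1.0]}
result = match([0.9, 0.1], patterns)
```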

8 is a synthesis pattern storage ROM, a memory in which ADPCM synthesis patterns are stored when the registration-type voice input/output device is manufactured; these patterns are used to produce synthesized speech that outputs information and guides the user in handling the registration-type voice recognition device. 7 is an ADPCM synthesis means, which reproduces a synthesis pattern stored in the synthesis pattern storage ROM 8 as an electrical voice signal. 6 is a speaker, which converts the electrical voice signal reproduced by the ADPCM synthesis means 7 into air vibrations. The synthesis pattern storage ROM 8, ADPCM synthesis means 7, and speaker 6 together constitute the voice synthesis device.

11 is a controlled device, which is controlled according to the recognition result of the pattern matching means 4 of the registration-type voice recognition device. The controlled device 11 also instructs the ADPCM synthesis means 7 of the voice synthesis device which synthesis pattern in the synthesis pattern storage ROM 8 to reproduce.

The operation of the conventional registration-type voice input/output device configured as described above is explained below.

During registration, the controlled device 11 instructs the ADPCM synthesis means 7 to read from the synthesis pattern storage ROM 8 and reproduce a synthesis pattern that prompts the user to utter a voice for registration (for example, "Please say 'ichi'"). The voice signal reproduced by the ADPCM synthesis means 7 is converted into air vibrations by the speaker 6 and reaches the user's ears. In response to this synthesized voice, the user utters the voice to be registered. This voice is converted into an electrical signal by the microphone 1, its voice section is detected by the voice section detection means 2, and it is converted into LPC cepstrum coefficients by the LPC cepstrum analysis means 3. During registration, the recognition/registration changeover switch S1 is connected to the side that sends the LPC cepstrum coefficients to the standard pattern storage RAM 5, and the coefficients are stored there as a standard pattern. This registration operation is repeated for as many commands as the registration-type voice recognition device needs in order to control the controlled device 11.

Next, during recognition, the controlled device 11 instructs the ADPCM synthesis means 7 to read from the synthesis pattern storage ROM 8 and reproduce a synthesis pattern that prompts the user to utter a voice for recognition (for example, "Please give a command"). The voice signal reproduced by the ADPCM synthesis means 7 is converted into air vibrations by the speaker 6 and reaches the user's ears. The user utters a voice in response to this synthesized voice. This voice is converted into an electrical signal by the microphone 1, its voice section is detected by the voice section detection means 2, and it is converted into LPC cepstrum coefficients by the LPC cepstrum analysis means 3.

During recognition, the recognition/registration changeover switch S1 is connected to the side that sends the LPC cepstrum coefficients to the pattern matching means 4. The pattern matching means 4 obtains the degree of similarity between the LPC cepstrum coefficients of the standard patterns stored in the standard pattern storage RAM 5 at registration and those of the input voice, and takes the most similar pattern as the recognition result. The ADPCM synthesis means 7 then reads from the synthesis pattern storage ROM 8 and reproduces a synthesis pattern that asks the user to state whether this recognition result is correct (for example, "It is XX, correct?"). The subsequent processing is the same as in the recognition operation described above. The user utters a word meaning confirmation or correction (for example, "hai" (yes) or "iie" (no)). If the recognition result for this utterance means confirmation, the controlled device 11 is controlled according to the earlier recognition result; if it means correction, the user is again prompted to utter a command for controlling the controlled device 11.

Problems to Be Solved by the Invention

With the above configuration, however, the synthesis patterns used to ask the user whether the recognition result is correct are limited to those stored in advance in the synthesis pattern storage ROM. When the user has the registration-type voice input/output device perform recognition with the user's own voice commands, a pre-stored synthesis pattern corresponding to the meaning of each voice command has to be used for confirmation, which can make confirmation cumbersome.

In view of the above problem, the present invention provides a registration-type voice input/output device that, even when the user has the device perform recognition using the user's own voice commands, can play back the recognition result as the user's own voice command through the voice synthesis device.

Means for Solving the Problems

To achieve the above object, the registration-type voice input/output device of the present invention comprises a registration-type voice recognition device and a voice synthesis device that, at voice registration and simultaneously with registration for recognition, converts the speaker's voice into feature parameters effective for voice synthesis and stores them, and reproduces them as voice at the time of use.

Operation

With the above configuration, at registration the present invention converts the speaker's voice into feature parameters effective for voice recognition and registers them, and at the same time converts the same voice into feature parameters effective for voice synthesis and stores them. At recognition, the degree of similarity between the input voice and each registered set of recognition feature parameters is obtained, and the most similar one is taken as the recognition result. The synthesis feature parameters corresponding to this recognition result are then reproduced by the voice synthesis device, so that the user's own voice commands can also be answered back.
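The dual registration described in this section can be sketched as two parallel stores filled from the same utterance: one holds recognition features and the other holds a playable synthesis pattern, and the index of the recognition result selects the pattern to answer back. The class and method names, the use of Euclidean distance, and the caller-supplied feature functions are all assumptions made for illustration; the patent describes hardware blocks, not this interface.

```python
import math


class RegisteredVoiceIO:
    """Toy model: each registration fills two stores from one utterance."""

    def __init__(self, recognition_features, synthesis_encode):
        # Both callables are hypothetical stand-ins for the analysis means.
        self.recognition_features = recognition_features
        self.synthesis_encode = synthesis_encode
        self.standard_patterns = []   # used for matching
        self.synthesis_patterns = []  # used for answer-back

    def register(self, voice):
        """Store recognition features and a synthesis pattern together."""
        self.standard_patterns.append(self.recognition_features(voice))
        self.synthesis_patterns.append(self.synthesis_encode(voice))
        return len(self.standard_patterns) - 1

    def recognize(self, voice):
        """Return (index, synthesis pattern) of the closest registered entry."""
        if not self.standard_patterns:
            raise ValueError("no commands registered")
        features = self.recognition_features(voice)
        best = min(
            range(len(self.standard_patterns)),
            key=lambda i: math.dist(features, self.standard_patterns[i]),
        )
        return best, self.synthesis_patterns[best]


device = RegisteredVoiceIO(recognition_features=list, synthesis_encode=tuple)
device.register([1.0, 0.0])
device.register([0.0, 1.0])
index, answer_back = device.recognize([0.1, 0.8])
```

Keeping the two stores index-aligned is what lets the recognition result select the user's own recording without any extra lookup table.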

Embodiment

A registration-type voice input/output device according to one embodiment of the present invention is described below with reference to the drawings.

FIG. 1 is a block diagram of one embodiment of the registration-type voice input/output device of the present invention.

In FIG. 1, 1 is a microphone, 2 is a voice section detection means, 3 is an LPC cepstrum analysis means, 4 is a pattern matching means, 5 is a standard pattern storage RAM, and S1 is a recognition/registration changeover switch. These are the same as in the conventional example and together constitute the registration-type voice recognition device.
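The patent does not say how the voice section detection means 2 finds the voice section. One simple approach, shown here purely as an assumption, is short-time energy thresholding: the signal is cut into frames, and the section runs from the first to the last frame whose mean energy exceeds a threshold. The function name, frame length, and threshold value are hypothetical.

```python
def detect_voice_section(samples, frame_len=4, threshold=0.01):
    """Return (start, end) sample indices of the detected section, or None.

    A frame counts as speech when its mean squared amplitude exceeds
    `threshold`; the section spans the first through the last such frame.
    """
    active_starts = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        energy = sum(s * s for s in frame) / frame_len
        if energy > threshold:
            active_starts.append(start)
    if not active_starts:
        return None
    return active_starts[0], active_starts[-1] + frame_len


silence = [0.0] * 8
burst = [0.5, -0.5, 0.5, -0.5] * 2
section = detect_voice_section(silence + burst + silence)
```

Only the samples inside the returned range would then be passed on for analysis.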

8 is a synthesis pattern storage ROM, 7 is an ADPCM synthesis means, and 6 is a speaker; these are the same as in the conventional example. 9 is an ADPCM analysis means, which at voice registration converts the voice signal of the voice section detected by the voice section detection means 2 into feature parameters using ADPCM, a coding method effective for voice synthesis. 10 is a synthesis pattern storage RAM that stores the feature parameters produced by the ADPCM analysis means 9. These elements together constitute the voice synthesis device.
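To illustrate the roles of the ADPCM analysis means 9 (encoding at registration) and the ADPCM synthesis means 7 (decoding at playback), here is a deliberately simplified adaptive differential PCM sketch. It is not any standardized ADPCM codec and is not taken from the patent: the 4-bit code range, the step-size adaptation rule, and all names are assumptions chosen to keep the example short.

```python
MIN_STEP, MAX_STEP = 1, 2048


def _adapt(step, code):
    """Grow the step after large codes, shrink it after small ones."""
    if abs(code) >= 4:
        step *= 2
    elif abs(code) <= 1:
        step //= 2
    return max(MIN_STEP, min(MAX_STEP, step))


def adpcm_encode(samples, step=4):
    """Encode integer samples as signed 4-bit difference codes (-8..7)."""
    codes, predicted = [], 0
    for s in samples:
        code = max(-8, min(7, round((s - predicted) / step)))
        codes.append(code)
        # Track the value the decoder will reconstruct, not the true sample.
        predicted += code * step
        step = _adapt(step, code)
    return codes


def adpcm_decode(codes, step=4):
    """Rebuild samples by accumulating scaled codes with the same adaptation."""
    out, predicted = [], 0
    for code in codes:
        predicted += code * step
        out.append(predicted)
        step = _adapt(step, code)
    return out


stored_pattern = adpcm_encode([0, 4, 8, 12])   # what a pattern RAM would hold
playback = adpcm_decode(stored_pattern)
```

Because the encoder predicts from its own reconstructed value and both sides adapt the step from the code alone, the decoder stays in step with the encoder without any side information.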

11 is a controlled device, which is the same as in the conventional example except that it issues reproduction commands for synthesis patterns in the synthesis pattern storage RAM 10 as well as in the synthesis pattern storage ROM 8.

The operation of the registration-type voice input/output device of this embodiment, configured as described above, is explained below.

During registration, the controlled device 11 instructs the ADPCM synthesis means 7 to read from the synthesis pattern storage ROM 8 and reproduce a synthesis pattern that prompts the user to utter a voice for registration (for example, "Please say 'ichi'"). The voice signal reproduced by the ADPCM synthesis means 7 is converted into synthesized voice by the speaker 6 and reaches the user's ears. In response to this synthesized voice, the user utters the voice to be registered. This voice is converted into an electrical signal by the microphone 1, its voice section is detected by the voice section detection means 2, and it is converted into LPC cepstrum coefficients by the LPC cepstrum analysis means 3. During registration, the recognition/registration changeover switch S1 is connected to the side that sends the LPC cepstrum coefficients to the standard pattern storage RAM 5, and the coefficients are stored there as a standard pattern.

Meanwhile, the voice signal in the voice section detected by the voice section detection means 2 is converted into feature parameters by the ADPCM analysis means 9 and stored in the synthesis pattern storage RAM 10. This registration operation is repeated for as many commands as the registration-type voice recognition device needs in order to control the controlled device 11.

Next, during recognition, the controlled device 11 instructs the ADPCM synthesis means 7 to read from the synthesis pattern storage ROM 8 and reproduce a synthesis pattern that prompts the user to utter a voice for recognition (for example, "Please give a command"). The voice signal reproduced by the ADPCM synthesis means 7 is converted into synthesized voice by the speaker 6 and reaches the user's ears. The user utters a voice in response to this synthesized voice. This voice is converted into an electrical signal by the microphone 1, its voice section is detected by the voice section detection means 2, and it is converted into LPC cepstrum coefficients by the LPC cepstrum analysis means 3.

During recognition, the recognition/registration changeover switch S1 is connected to the side that sends the LPC cepstrum coefficients to the pattern matching means 4. The pattern matching means 4 obtains the degree of similarity between the LPC cepstrum coefficients of the standard patterns stored in the standard pattern storage RAM 5 at registration and those of the input voice, and takes the most similar pattern as the recognition result. The ADPCM synthesis means 7 then reads from the synthesis pattern storage ROM 8 and the synthesis pattern storage RAM 10 and reproduces a synthesis pattern that asks the user to state whether this recognition result is correct (for example, "It is XX, correct?"). The subsequent processing is the same as in the recognition operation described above. The user utters a word meaning confirmation or correction (for example, "hai" (yes) or "iie" (no)). If the recognition result for this utterance means confirmation, the controlled device 11 is controlled according to the earlier recognition result; if it means correction, the user is again prompted to utter a command for controlling the controlled device 11.
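The confirmation dialogue described above can be sketched as a small loop: prompt for a command, recognize it, answer back with the user's own stored pattern followed by a fixed confirmation prompt, then act only on a "yes". The callables, the prompt identifiers, the `"yes"` label, and the attempt limit are all hypothetical; the source describes the flow in prose and does not specify a retry limit.

```python
def run_command_dialog(listen, recognize, play, control, max_attempts=3):
    """Run the prompt / answer-back / confirm cycle.

    listen()         -> captured utterance
    recognize(voice) -> (label, user's stored synthesis pattern)
    play(pattern)    -> output a pattern through the synthesizer
    control(label)   -> operate the controlled device
    Returns the confirmed label, or None if no attempt was confirmed.
    """
    for _ in range(max_attempts):
        play("PROMPT_COMMAND")            # fixed prompt, e.g. from a ROM
        command, user_pattern = recognize(listen())
        play(user_pattern)                # user's own recording, e.g. from a RAM
        play("PROMPT_CONFIRM")            # fixed "..., correct?" prompt
        reply, _ = recognize(listen())
        if reply == "yes":
            control(command)
            return command
        # Any other reply is treated as a correction: ask again.
    return None


# Scripted example: the first recognition is wrong and the user rejects it.
vocab = {
    "mumbled": ("tv on", "pat_tv"),
    "no": ("no", None),
    "lights on": ("lights on", "pat_lights"),
    "yes": ("yes", None),
}
script = iter(["mumbled", "no", "lights on", "yes"])
played, controlled = [], []
result = run_command_dialog(
    lambda: next(script), lambda v: vocab[v], played.append, controlled.append
)
```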

As described above, according to this embodiment, at registration the feature parameters of the standard pattern used for voice recognition are stored in the standard pattern storage RAM 5 and, at the same time, the synthesis pattern used for voice synthesis is stored in the synthesis pattern storage RAM 10, so the speaker's own voice commands can be answered back. Furthermore, because LPC cepstrum coefficients are used as the feature parameters for voice recognition, high-quality voice recognition can be performed. Likewise, because ADPCM is used for the feature parameters for voice synthesis, high-quality synthesized sound can be reproduced.

Effects of the Invention

At registration, the present invention converts the speaker's voice into feature parameters effective for voice recognition and registers them, and at the same time converts the same voice into feature parameters effective for voice synthesis and stores them. At recognition, the degree of similarity between the input voice and each registered voice is obtained, and the most similar one is taken as the recognition result. The synthesis feature parameters corresponding to the recognition result are then reproduced by the voice synthesis device. The user's own voice commands can therefore also be answered back in synthesized sound. In addition, because feature parameters effective for voice recognition are used for recognition and feature parameters effective for voice synthesis are used for synthesis, both high-quality voice recognition and high-quality voice synthesis can be performed. The invention thus realizes a registration-type voice input/output device that provides a number of excellent effects.

[Brief Description of the Drawings]

FIG. 1 is a block diagram of a registration-type voice input/output device according to one embodiment of the present invention, and FIG. 2 is a block diagram of a conventional registration-type voice input/output device.

1: microphone
2: voice section detection means
3: LPC cepstrum analysis means
4: pattern matching means
5: standard pattern storage RAM
S1: recognition/registration changeover switch
6: speaker
7: ADPCM synthesis means
8: synthesis pattern storage ROM
9: ADPCM analysis means
10: synthesis pattern storage RAM
11: controlled device

Agent: Patent Attorney Toshio Nakao and one other

[FIG. 1] [FIG. 2]

Claims

[Claims]

A registration-type voice input/output device comprising: a registration-type voice recognition device that converts the voice of the speaker using the device into feature parameters effective for voice recognition and registers them in advance, and that, at the time of use, obtains the degree of similarity between an input voice and the registered voices and outputs the most similar one as the recognition result; and a voice synthesis device for outputting information and giving guidance on handling the registration-type voice input/output device, which, substantially simultaneously with said registration, converts the voice of the speaker into feature parameters effective for voice synthesis and stores them, and reproduces them as voice at the time of use.