JPH02193196A

JPH02193196A - Voice recognizing device

Info

Publication number: JPH02193196A
Application number: JP1013426A
Authority: JP
Inventors: Shoichi Kamei; 亀井　正一; Masayuki Iida; 正幸飯田; Hiroki Onishi; 宏樹大西; Shinichi Tsurufuji; 鶴藤　真一; Kazuyoshi Okura; 計美大倉
Original assignee: Sanyo Electric Co Ltd
Current assignee: Sanyo Electric Co Ltd
Priority date: 1989-01-23
Filing date: 1989-01-23
Publication date: 1990-07-30
Anticipated expiration: 2013-04-28
Also published as: JP2744039B2

Abstract

PURPOSE:To contrive the reduction of troublesomeness at the time of registration by executing together a voice pattern input of a recognition word and a key input of recognition result display use data at the time of registration. CONSTITUTION:A voice inputted from a microphone 11 is amplified by an amplifier 12, sent to a recognition use voice analyzing part 13 and a synthesis use voice analyzing part 14, and analyzed to a recognition use parameter and a synthesis use parameter, respectively. Subsequently, the recognition use parameter is stored in a standard voice pattern memory 15, the synthesis use parameter is stored in a synthesis use pattern memory 17, and until the registration of all recognition words is ended, this processing is repeated. Subsequently, a synthetic pattern is sent to a voice synthesizing part 20 in order from the head of the synthesis use pattern memory 17, and the recognition word is synthesized and outputted. A user confirms this composite tone, and thereafter, inputs recognition result display use data from a key input part 18 and confirms it in a display part 19. In such a way, a voice input and a key input can be executed together, and troublesomeness at the time of registration is reduced.

Description

【発明の詳細な説明】（イ）　産業上の利用分野本発明は操作性の優れた登録手段を備えた特定話者用の
音声認識装置に関するものである。DETAILED DESCRIPTION OF THE INVENTION (a) Field of Industrial Application The present invention relates to a speech recognition device for a specific speaker, which is equipped with a registration means with excellent operability.

ｃ口）　従来の技術使用者が予め認識語を登録して使う特定話者音声認識装
置においては、認識結果として認識語に対応した文字や
数字などを使用者が予め入力しておく必要があるが、入
力音声パタンと認識結果表示用データは一組の対応づけ
られたデータであるため、−語毎に、認識語の音声登録
に続けて表示用データをキー人力するという方法が取ら
れていた。しかし、この方法では音声の発声とキー人力
を交互に行なわねばならず、操作が煩わしくなり使用者
の負担増の原因になっていた。(c) In conventional speaker-specific speech recognition devices in which the user registers recognition words in advance, it is necessary for the user to input letters, numbers, etc. that correspond to the recognition words as recognition results in advance. However, since the input speech pattern and the data for displaying recognition results are a set of data that are associated with each other, a method is used in which the data for display is entered manually after registering the speech of the recognition word for each word. Ta. However, with this method, it is necessary to alternately utter the voice and press the keys manually, which makes the operation cumbersome and increases the burden on the user.

（ハ）　発明が解決しようとする課題認識語の音声パタンの登録だけを先にまとめて行い、そ
の後で認識結果表示用データをまとめてキー人力すると
いう方法にすれば、音声入力とキー人力を分けて行なう
ことができ、操作はわかりやすくなるが、語数が増える
と何番目にどの語を発声したかを覚えておくのは困難で
あり、この点が問題である。(C) The problem to be solved by the invention If only the voice patterns of the recognition words are registered at once, and then the data for displaying the recognition results is input manually, voice input and keyboard input can be reduced. This can be done separately, making the operation easier to understand, but as the number of words increases, it becomes difficult to remember which word was uttered in which order, which is a problem.

（ニ）　課題を解決するための手段本発明による音声認識装置では、認識語の登録時に、＃
、’ｌ　品用の音声パタンとして分析して格納すると同
時に、合成用の音声パタンとして分析したデータを格納
する手段と、該合成用音声パタンを合成する手段と、音
声合成後に入力された認識結果の表示用データを認識用
音声パタンに対応づけた位置に格納する手段を設け、認
識語の登録をまとめて行なった後で、該合成用音声パタ
ンを最初から順番に音声合成することにより、その認識
語に対応する認識結果の表示用データを入力することが
可能となる。(d) Means for Solving the Problems In the speech recognition device according to the present invention, when registering a recognition word, #
,'l A means for analyzing and storing the data as a speech pattern for a product and at the same time storing the analyzed data as a speech pattern for synthesis, a means for synthesizing the speech pattern for synthesis, and a recognition result input after speech synthesis. By providing a means for storing the display data in a position corresponding to the speech pattern for recognition, registering the recognition words all at once, and then synthesizing the speech patterns for synthesis in order from the beginning. It becomes possible to input display data of recognition results corresponding to recognition words.

（ホ）　作用本発明によれば、音声登録の際に何番目に何という認識
語を登録したかを覚えておく必要がなく、音声の入力と
認識結果の表示用データのキ入力を、それぞれ、まとめ
て行なうことが可能であるので、登録時の煩わしさが軽
減される。(e) Effects According to the present invention, there is no need to remember which recognition word was registered in which position when registering a voice, and the input of voice and the key input of data for displaying recognition results can be performed separately. , can be done all at once, which reduces the hassle of registration.

（へ）　実施例第１図に本発明の音声認識装置の一実施例を示す。(f) Examples FIG. 1 shows an embodiment of the speech recognition device of the present invention.

同図によって、音声登録時に認識語の音声登録と認識結
果表示用データのキー人力をそれぞれ、まとめて行なう
場合の処理の流れを以下に示す。Referring to the same figure, the flow of processing will be shown below when voice registration of recognized words and key manual input of recognition result display data are performed simultaneously at the time of voice registration.

マイクロホン１１より入力された音声は、増幅器１２で
振幅が飽和しない程度に増幅され、認識用の音声分析部
１３と合成用の音声分析部１４に送られ、それぞれ、認
識用パラメータと合成用パラメータに分析される。そし
て、認識用パラメタは標準音声パタンメモリ１５に格納
され、合成用パラメータは合成用パタンメモリ１７に格
納される。全ての認識語の登録が終了するまでこの処理
が繰り返される。The voice input from the microphone 11 is amplified by the amplifier 12 to an extent that the amplitude is not saturated, and is sent to the voice analysis section 13 for recognition and the voice analysis section 14 for synthesis, where it is converted into recognition parameters and synthesis parameters, respectively. be analyzed. The recognition parameters are stored in the standard speech pattern memory 15, and the synthesis parameters are stored in the synthesis pattern memory 17. This process is repeated until registration of all recognition words is completed.

次に、合成用パタンメモリ１７の先頭から順番に合成パ
タンか音声合成部２０に送られて、認識語が合成出力さ
れる。使用者は、この合成音を確認した後で、認識結果
表示用データをキー人力部１８から入力し表示部１９に
おいて確認する。Next, the synthesis patterns are sequentially sent to the speech synthesis section 20 from the beginning of the synthesis pattern memory 17, and the recognized words are synthesized and output. After confirming this synthesized voice, the user inputs recognition result display data from the key input section 18 and confirms it on the display section 19.

以上の如くして、全ての認識語に対する認識結果表示用
データの入力が終了するまでこの処理が繰り返される。As described above, this process is repeated until the input of recognition result display data for all recognized words is completed.

尚、認識時には、マイクロホン１１より入力された音声
は、増幅器１２で振幅が飽和しない程度に増幅され、認
識用の音声分析部１３で分析されて、入力音声パタンか
作成される。該入力音声パタンと標準音声パタンメモリ
１５内の標準音声パタンとで、マツチング部１６におい
てパタンマツチングを行い、最も距離の小さい標準音声
パタンを算出し、認識語を決定する。そして、表示部１
９において認識語に対する認識結果表示用データが表示
される。At the time of recognition, the voice input from the microphone 11 is amplified by the amplifier 12 to such an extent that the amplitude is not saturated, and analyzed by the voice analysis section 13 for recognition to create an input voice pattern. A matching section 16 performs pattern matching between the input speech pattern and the standard speech pattern in the standard speech pattern memory 15, calculates the standard speech pattern with the smallest distance, and determines a recognized word. Then, display section 1
At 9, recognition result display data for the recognized word is displayed.

（ト）　　発明の効果以上に説明した如く、本発明によれば、登録時の認識語
の音声パタン入力と認識結果表示用データのキー人力を
、それぞれ、まとめて行なうことができるので登録時の
煩わしさが軽減される。(G) Effects of the Invention As explained above, according to the present invention, inputting the speech pattern of the recognized word at the time of registration and key human input of the recognition result display data can be performed at the same time. The hassle is reduced.

また、合成音を確認することにより人力音声が正しく登
録されていることが確認できるため、登録音声のアップ
デートが省け、使用者の負担が軽減できる。Furthermore, since it is possible to confirm that the human voice has been correctly registered by checking the synthesized voice, updating the registered voice can be omitted and the burden on the user can be reduced.

[Brief explanation of the drawing]

第１図は本発明による音声認識装置の一実施例を示す構
成図である。１１・・・マイクロフォン、１６・・・パタンマツチン
グ部、１２・・・増幅器、１７・・・合成用音声パタン
メモリ、１３・・・認識用音声分析部、１８・・・表示
用データ入力部、１４・・・合成用音声分析部、１９・
・・表示部、１５・・・標準音声パタンメモリ、２０・
・・音声合成部。FIG. 1 is a block diagram showing an embodiment of a speech recognition device according to the present invention. 11...Microphone, 16...Pattern matching section, 12...Amplifier, 17...Speech pattern memory for synthesis, 13...Speech analysis section for recognition, 18...Data input section for display , 14...Speech analysis unit for synthesis, 19.
...Display section, 15...Standard audio pattern memory, 20.
...Speech synthesis section.

Claims

[Claims]

(1) In a specific speaker speech recognition device equipped with a speech input means and a speech analysis means, a memory for storing speech patterns for recognition at the time of speech registration, and speech data for synthesizing input speech corresponding to each speech pattern are stored. a storage means comprising a memory for storing and a memory for storing display data of the recognition result;
Means for analyzing input speech into speech patterns for recognition at the time of speech registration and simultaneously analyzing and storing speech data for synthesis, means for speech-synthesizing the stored speech data for synthesis, and displaying input recognition results after speech synthesis. A method is provided to store the data for recognition in a location that corresponds to the voice pattern for recognition, and the input voice is recorded when registering recognition words, and when all recognition words have been registered, they are played back in order from the beginning. A speech recognition device characterized by inputting data for displaying recognition results.