JPH02193196A - Voice recognizing device - Google Patents
Voice recognizing deviceInfo
- Publication number
- JPH02193196A JPH02193196A JP1013426A JP1342689A JPH02193196A JP H02193196 A JPH02193196 A JP H02193196A JP 1013426 A JP1013426 A JP 1013426A JP 1342689 A JP1342689 A JP 1342689A JP H02193196 A JPH02193196 A JP H02193196A
- Authority
- JP
- Japan
- Prior art keywords
- recognition
- speech
- voice
- input
- synthesis
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000015572 biosynthetic process Effects 0.000 claims abstract description 22
- 238000003786 synthesis reaction Methods 0.000 claims abstract description 22
- 230000002194 synthesizing effect Effects 0.000 claims abstract description 4
- 238000000034 method Methods 0.000 claims description 5
- 239000002131 composite material Substances 0.000 abstract 1
- 230000000694 effects Effects 0.000 description 2
- 229920006395 saturated elastomer Polymers 0.000 description 2
- 238000010586 diagram Methods 0.000 description 1
Abstract
Description
【発明の詳細な説明】
(イ) 産業上の利用分野
本発明は操作性の優れた登録手段を備えた特定話者用の
音声認識装置に関するものである。DETAILED DESCRIPTION OF THE INVENTION (a) Field of Industrial Application The present invention relates to a speech recognition device for a specific speaker, which is equipped with a registration means with excellent operability.
c口) 従来の技術
使用者が予め認識語を登録して使う特定話者音声認識装
置においては、認識結果として認識語に対応した文字や
数字などを使用者が予め入力しておく必要があるが、入
力音声パタンと認識結果表示用データは一組の対応づけ
られたデータであるため、−語毎に、認識語の音声登録
に続けて表示用データをキー人力するという方法が取ら
れていた。しかし、この方法では音声の発声とキー人力
を交互に行なわねばならず、操作が煩わしくなり使用者
の負担増の原因になっていた。(c) In conventional speaker-specific speech recognition devices in which the user registers recognition words in advance, it is necessary for the user to input letters, numbers, etc. that correspond to the recognition words as recognition results in advance. However, since the input speech pattern and the data for displaying recognition results are a set of data that are associated with each other, a method is used in which the data for display is entered manually after registering the speech of the recognition word for each word. Ta. However, with this method, it is necessary to alternately utter the voice and press the keys manually, which makes the operation cumbersome and increases the burden on the user.
(ハ) 発明が解決しようとする課題
認識語の音声パタンの登録だけを先にまとめて行い、そ
の後で認識結果表示用データをまとめてキー人力すると
いう方法にすれば、音声入力とキー人力を分けて行なう
ことができ、操作はわかりやすくなるが、語数が増える
と何番目にどの語を発声したかを覚えておくのは困難で
あり、この点が問題である。(C) The problem to be solved by the invention If only the voice patterns of the recognition words are registered at once, and then the data for displaying the recognition results is input manually, voice input and keyboard input can be reduced. This can be done separately, making the operation easier to understand, but as the number of words increases, it becomes difficult to remember which word was uttered in which order, which is a problem.
(ニ) 課題を解決するための手段
本発明による音声認識装置では、認識語の登録時に、#
、’l 品用の音声パタンとして分析して格納すると同
時に、合成用の音声パタンとして分析したデータを格納
する手段と、該合成用音声パタンを合成する手段と、音
声合成後に入力された認識結果の表示用データを認識用
音声パタンに対応づけた位置に格納する手段を設け、認
識語の登録をまとめて行なった後で、該合成用音声パタ
ンを最初から順番に音声合成することにより、その認識
語に対応する認識結果の表示用データを入力することが
可能となる。(d) Means for Solving the Problems In the speech recognition device according to the present invention, when registering a recognition word, #
,'l A means for analyzing and storing the data as a speech pattern for a product and at the same time storing the analyzed data as a speech pattern for synthesis, a means for synthesizing the speech pattern for synthesis, and a recognition result input after speech synthesis. By providing a means for storing the display data in a position corresponding to the speech pattern for recognition, registering the recognition words all at once, and then synthesizing the speech patterns for synthesis in order from the beginning. It becomes possible to input display data of recognition results corresponding to recognition words.
(ホ) 作用
本発明によれば、音声登録の際に何番目に何という認識
語を登録したかを覚えておく必要がなく、音声の入力と
認識結果の表示用データのキ入力を、それぞれ、まとめ
て行なうことが可能であるので、登録時の煩わしさが軽
減される。(e) Effects According to the present invention, there is no need to remember which recognition word was registered in which position when registering a voice, and the input of voice and the key input of data for displaying recognition results can be performed separately. , can be done all at once, which reduces the hassle of registration.
(へ) 実施例 第1図に本発明の音声認識装置の一実施例を示す。(f) Examples FIG. 1 shows an embodiment of the speech recognition device of the present invention.
同図によって、音声登録時に認識語の音声登録と認識結
果表示用データのキー人力をそれぞれ、まとめて行なう
場合の処理の流れを以下に示す。Referring to the same figure, the flow of processing will be shown below when voice registration of recognized words and key manual input of recognition result display data are performed simultaneously at the time of voice registration.
マイクロホン11より入力された音声は、増幅器12で
振幅が飽和しない程度に増幅され、認識用の音声分析部
13と合成用の音声分析部14に送られ、それぞれ、認
識用パラメータと合成用パラメータに分析される。そし
て、認識用パラメタは標準音声パタンメモリ15に格納
され、合成用パラメータは合成用パタンメモリ17に格
納される。全ての認識語の登録が終了するまでこの処理
が繰り返される。The voice input from the microphone 11 is amplified by the amplifier 12 to an extent that the amplitude is not saturated, and is sent to the voice analysis section 13 for recognition and the voice analysis section 14 for synthesis, where it is converted into recognition parameters and synthesis parameters, respectively. be analyzed. The recognition parameters are stored in the standard speech pattern memory 15, and the synthesis parameters are stored in the synthesis pattern memory 17. This process is repeated until registration of all recognition words is completed.
次に、合成用パタンメモリ17の先頭から順番に合成パ
タンか音声合成部20に送られて、認識語が合成出力さ
れる。使用者は、この合成音を確認した後で、認識結果
表示用データをキー人力部18から入力し表示部19に
おいて確認する。Next, the synthesis patterns are sequentially sent to the speech synthesis section 20 from the beginning of the synthesis pattern memory 17, and the recognized words are synthesized and output. After confirming this synthesized voice, the user inputs recognition result display data from the key input section 18 and confirms it on the display section 19.
以上の如くして、全ての認識語に対する認識結果表示用
データの入力が終了するまでこの処理が繰り返される。As described above, this process is repeated until the input of recognition result display data for all recognized words is completed.
尚、認識時には、マイクロホン11より入力された音声
は、増幅器12で振幅が飽和しない程度に増幅され、認
識用の音声分析部13で分析されて、入力音声パタンか
作成される。該入力音声パタンと標準音声パタンメモリ
15内の標準音声パタンとで、マツチング部16におい
てパタンマツチングを行い、最も距離の小さい標準音声
パタンを算出し、認識語を決定する。そして、表示部1
9において認識語に対する認識結果表示用データが表示
される。At the time of recognition, the voice input from the microphone 11 is amplified by the amplifier 12 to such an extent that the amplitude is not saturated, and analyzed by the voice analysis section 13 for recognition to create an input voice pattern. A matching section 16 performs pattern matching between the input speech pattern and the standard speech pattern in the standard speech pattern memory 15, calculates the standard speech pattern with the smallest distance, and determines a recognized word. Then, display section 1
At 9, recognition result display data for the recognized word is displayed.
(ト) 発明の効果
以上に説明した如く、本発明によれば、登録時の認識語
の音声パタン入力と認識結果表示用データのキー人力を
、それぞれ、まとめて行なうことができるので登録時の
煩わしさが軽減される。(G) Effects of the Invention As explained above, according to the present invention, inputting the speech pattern of the recognized word at the time of registration and key human input of the recognition result display data can be performed at the same time. The hassle is reduced.
また、合成音を確認することにより人力音声が正しく登
録されていることが確認できるため、登録音声のアップ
デートが省け、使用者の負担が軽減できる。Furthermore, since it is possible to confirm that the human voice has been correctly registered by checking the synthesized voice, updating the registered voice can be omitted and the burden on the user can be reduced.
第1図は本発明による音声認識装置の一実施例を示す構
成図である。
11・・・マイクロフォン、16・・・パタンマツチン
グ部、12・・・増幅器、17・・・合成用音声パタン
メモリ、13・・・認識用音声分析部、18・・・表示
用データ入力部、14・・・合成用音声分析部、19・
・・表示部、15・・・標準音声パタンメモリ、20・
・・音声合成部。FIG. 1 is a block diagram showing an embodiment of a speech recognition device according to the present invention. 11...Microphone, 16...Pattern matching section, 12...Amplifier, 17...Speech pattern memory for synthesis, 13...Speech analysis section for recognition, 18...Data input section for display , 14...Speech analysis unit for synthesis, 19.
...Display section, 15...Standard audio pattern memory, 20.
...Speech synthesis section.
Claims (1)
声認識装置において、音声登録時の認識用音声パタンを
格納するメモリ、個々の該音声パタンに対応する入力音
声の合成用音声データを格納するメモリ、及び認識結果
の表示用データを格納するメモリからなる記憶手段と、
音声登録時に入力音声を認識用音声パタンに分析すると
同時に合成用音声データに分析して格納する手段と、格
納した合成用音声データを音声合成する手段と、音声合
成後に入力された認識結果の表示用データを認識用音声
パタンに対応づけた位置に格納する手段を設け、 認識語の登録時に入力音声を録音しておき、全認識語の
登録が終了した時点で、先頭から順番に再生しながら認
識結果の表示用データを入力することを特徴とした音声
認識装置。(1) In a specific speaker speech recognition device equipped with a speech input means and a speech analysis means, a memory for storing speech patterns for recognition at the time of speech registration, and speech data for synthesizing input speech corresponding to each speech pattern are stored. a storage means comprising a memory for storing and a memory for storing display data of the recognition result;
Means for analyzing input speech into speech patterns for recognition at the time of speech registration and simultaneously analyzing and storing speech data for synthesis, means for speech-synthesizing the stored speech data for synthesis, and displaying input recognition results after speech synthesis. A method is provided to store the data for recognition in a location that corresponds to the voice pattern for recognition, and the input voice is recorded when registering recognition words, and when all recognition words have been registered, they are played back in order from the beginning. A speech recognition device characterized by inputting data for displaying recognition results.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP1013426A JP2744039B2 (en) | 1989-01-23 | 1989-01-23 | Voice recognition device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP1013426A JP2744039B2 (en) | 1989-01-23 | 1989-01-23 | Voice recognition device |
Publications (2)
Publication Number | Publication Date |
---|---|
JPH02193196A true JPH02193196A (en) | 1990-07-30 |
JP2744039B2 JP2744039B2 (en) | 1998-04-28 |
Family
ID=11832812
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP1013426A Expired - Fee Related JP2744039B2 (en) | 1989-01-23 | 1989-01-23 | Voice recognition device |
Country Status (1)
Country | Link |
---|---|
JP (1) | JP2744039B2 (en) |
-
1989
- 1989-01-23 JP JP1013426A patent/JP2744039B2/en not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
JP2744039B2 (en) | 1998-04-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP3968133B2 (en) | Speech recognition dialogue processing method and speech recognition dialogue apparatus | |
JP2008309856A (en) | Speech recognition device and conference system | |
JP2000105596A5 (en) | ||
JPH04204700A (en) | Speech recognition device | |
JPH10105191A (en) | Speech recognition device and microphone frequency characteristic converting method | |
JPH02193196A (en) | Voice recognizing device | |
US20020049597A1 (en) | Audio recognition method and device for sequence of numbers | |
JP3139679B2 (en) | Voice input device and voice input method | |
JP3437492B2 (en) | Voice recognition method and apparatus | |
JPH09218696A (en) | Speech recognition device | |
JPH01285998A (en) | Speech recognizing device | |
JPH05341705A (en) | Conversation training device | |
JPH0844388A (en) | Word voice recognizing device for specified speaker | |
JPH0619493A (en) | Specified speaker system speech recognizing device | |
JPH01159700A (en) | Phoneme parameter producing apparatus | |
JP3752738B2 (en) | Voice recognition device | |
JPH02129686A (en) | Conversation aid apparatus | |
JPH0556519B2 (en) | ||
JPH103295A (en) | Voice recognition device | |
JP2005148764A (en) | Method and device for speech recognition interaction | |
JPH04181998A (en) | Device and method for speech recognition | |
JPS59176791A (en) | Voice registration system | |
JPS6370296A (en) | Word registration | |
JPS6287993A (en) | Voice recognition equipment | |
JPH045697A (en) | Word accent registering method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
LAPS | Cancellation because of no payment of annual fees |