JPH02193196A - Voice recognizing device - Google Patents

Voice recognizing device

Info

Publication number
JPH02193196A
Authority
JP
Japan
Prior art keywords
recognition
speech
voice
input
synthesis
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP1013426A
Other languages
Japanese (ja)
Other versions
JP2744039B2 (en)
Inventor
Shoichi Kamei
亀井 正一
Masayuki Iida
正幸 飯田
Hiroki Onishi
宏樹 大西
Shinichi Tsurufuji
鶴藤 真一
Kazuyoshi Okura
計美 大倉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sanyo Electric Co Ltd
Original Assignee
Sanyo Electric Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sanyo Electric Co Ltd
Priority to JP1013426A
Publication of JPH02193196A
Application granted
Publication of JP2744039B2
Anticipated expiration
Expired - Fee Related

Abstract

PURPOSE: To reduce the burden of registration by allowing the speech-pattern input of the recognition words and the key input of the recognition-result display data each to be performed in a single batch.

CONSTITUTION: Speech input from a microphone 11 is amplified by an amplifier 12, sent to a recognition speech analysis section 13 and a synthesis speech analysis section 14, and analyzed into recognition parameters and synthesis parameters, respectively. The recognition parameters are stored in a standard speech pattern memory 15 and the synthesis parameters in a synthesis pattern memory 17, and this processing is repeated until all recognition words have been registered. The synthesis patterns are then sent to a speech synthesis section 20 in order from the head of the synthesis pattern memory 17, and each recognition word is synthesized and output. After confirming the synthesized speech, the user enters the recognition-result display data from a key input section 18 and checks it on a display section 19. In this way the speech input and the key input can each be done in a batch, which reduces the burden of registration.

Description

DETAILED DESCRIPTION OF THE INVENTION

(a) Field of Industrial Application
The present invention relates to a speaker-dependent speech recognition device equipped with registration means that offer excellent operability.

(b) Prior Art
In a speaker-dependent speech recognition device, in which the user registers recognition words in advance, the user must also enter beforehand the characters, digits, or other data to be displayed as the recognition result for each recognition word. Because an input speech pattern and its recognition-result display data form an associated pair, the conventional method has been to key in the display data immediately after registering the speech for each word, one word at a time. With this method, however, the user must alternate between speaking and key entry, which makes the operation cumbersome and increases the burden on the user.

(c) Problem to Be Solved by the Invention
If the speech patterns of all recognition words are registered first in one batch, and the recognition-result display data is then keyed in as a second batch, speech input and key input can be performed separately and the operation becomes easier to follow. As the number of words grows, however, it becomes difficult to remember which word was spoken in which position, and this is the problem.

(d) Means for Solving the Problem
The speech recognition device according to the present invention is provided with: means that, when a recognition word is registered, analyze the input and store it as a speech pattern for recognition and at the same time store data analyzed as a speech pattern for synthesis; means for synthesizing speech from the synthesis speech patterns; and means for storing the recognition-result display data entered after speech synthesis at a location associated with the corresponding recognition speech pattern. After all recognition words have been registered in one batch, the synthesis speech patterns are synthesized in order from the first, which makes it possible to enter the recognition-result display data corresponding to each recognition word.

(e) Operation
According to the present invention, the user does not need to remember which recognition word was registered in which position during speech registration, and the speech input and the key input of the recognition-result display data can each be performed in a single batch, which reduces the burden of registration.

(f) Embodiment
FIG. 1 shows an embodiment of the speech recognition device of the present invention.

With reference to FIG. 1, the following describes the processing flow when, at registration time, the speech registration of the recognition words and the key input of the recognition-result display data are each performed in a batch.

Speech input from the microphone 11 is amplified by the amplifier 12 to a level at which the amplitude does not saturate, and is sent to the recognition speech analysis section 13 and the synthesis speech analysis section 14, where it is analyzed into recognition parameters and synthesis parameters, respectively. The recognition parameters are stored in the standard speech pattern memory 15, and the synthesis parameters are stored in the synthesis pattern memory 17. This processing is repeated until all recognition words have been registered.
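The registration pass described above can be sketched in Python as follows. This is only an illustration under stated assumptions: the patent does not specify the analysis methods, so `analyze_for_recognition` (mean absolute amplitude per segment) and `analyze_for_synthesis` (a plain copy of the samples) are hypothetical stand-ins, and the `Registry` class and its field names are invented for the example.

```python
from dataclasses import dataclass, field


@dataclass
class Registry:
    """Hypothetical container mirroring the three memories in the description."""
    recognition_patterns: list = field(default_factory=list)  # standard speech pattern memory 15
    synthesis_patterns: list = field(default_factory=list)    # synthesis pattern memory 17
    display_data: list = field(default_factory=list)          # display data, entered later


def analyze_for_recognition(samples, segments=4):
    # Assumed stand-in feature: mean absolute amplitude of each segment.
    size = max(1, len(samples) // segments)
    features = []
    for i in range(segments):
        chunk = samples[i * size:(i + 1) * size]
        features.append(sum(abs(x) for x in chunk) / len(chunk) if chunk else 0.0)
    return features


def analyze_for_synthesis(samples):
    # Assumed stand-in: keep the raw samples so they can be played back later.
    return list(samples)


def register_utterance(registry, samples):
    """Analyze one utterance both ways and store the results at the same index."""
    registry.recognition_patterns.append(analyze_for_recognition(samples))
    registry.synthesis_patterns.append(analyze_for_synthesis(samples))
    registry.display_data.append(None)  # no display data yet for this word
    return len(registry.recognition_patterns) - 1
```

Keeping the three lists index-aligned is one simple way to realize the "location associated with the recognition speech pattern" that the text mentions; other layouts would serve equally well.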

Next, the synthesis patterns are sent to the speech synthesis section 20 in order from the head of the synthesis pattern memory 17, and each recognition word is synthesized and output. After confirming the synthesized speech, the user enters the recognition-result display data from the key input section 18 and checks it on the display section 19.

This processing is repeated in the same way until the recognition-result display data has been entered for all recognition words.
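A minimal Python sketch of this second pass is shown below. The function name `enter_display_data` and the `play` and `read_keys` callbacks are assumptions made for illustration; they stand in for the speech synthesis section 20 and the key input section 18, whose interfaces the patent does not describe.

```python
def enter_display_data(synthesis_patterns, play, read_keys):
    """Play each stored pattern in registration order and collect the keyed-in data.

    `play` and `read_keys` are hypothetical callbacks standing in for the
    speech synthesis section and the key input section.
    """
    display_data = []
    for pattern in synthesis_patterns:
        play(pattern)                      # user hears the word that was registered
        display_data.append(read_keys())   # stored at the same index as the pattern
    return display_data


# Usage with fake hardware: playback is logged, key input comes from a list.
played = []
keys = iter(["1", "2"])
result = enter_display_data([[0.1], [0.2]], played.append, lambda: next(keys))
```

Because playback follows registration order, the user never has to recall which word was spoken in which position; the index alone ties each entry to its recognition pattern.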

During recognition, speech input from the microphone 11 is amplified by the amplifier 12 to a level at which the amplitude does not saturate, and is analyzed by the recognition speech analysis section 13 to create an input speech pattern. The matching section 16 performs pattern matching between this input speech pattern and the standard speech patterns in the standard speech pattern memory 15, finds the standard speech pattern with the smallest distance, and thereby determines the recognized word. The display section 19 then displays the recognition-result display data for the recognized word.
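The smallest-distance selection can be sketched in Python as below. The patent does not say which distance measure the matching section 16 uses, so Euclidean distance over fixed-length feature vectors is an assumption here, and the function name `recognize` is hypothetical.

```python
import math


def recognize(input_pattern, standard_patterns, display_data):
    """Return the display data of the standard pattern closest to the input.

    Assumes every pattern is a feature vector of the same length and uses
    Euclidean distance, which is a stand-in for the unspecified measure.
    """
    best_index = min(
        range(len(standard_patterns)),
        key=lambda i: math.dist(input_pattern, standard_patterns[i]),
    )
    return display_data[best_index]


# Usage: three registered words with their keyed-in display data.
templates = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
labels = ["zero", "one", "two"]
shown = recognize([1.0, 0.1], templates, labels)
```

A real speaker-dependent recognizer would typically also need to cope with utterances of different lengths, which this fixed-length sketch does not attempt.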

(g) Effects of the Invention
As explained above, according to the present invention, the speech-pattern input of the recognition words and the key input of the recognition-result display data at registration time can each be performed in a single batch, which reduces the burden of registration.

Furthermore, by listening to the synthesized speech, the user can confirm that the input speech was registered correctly, so updating the registered speech afterwards becomes unnecessary and the burden on the user is reduced.

BRIEF DESCRIPTION OF THE DRAWING

FIG. 1 is a block diagram showing an embodiment of the speech recognition device according to the present invention.

11: microphone
12: amplifier
13: recognition speech analysis section
14: synthesis speech analysis section
15: standard speech pattern memory
16: pattern matching section
17: synthesis speech pattern memory
18: display data input section
19: display section
20: speech synthesis section

Claims (1)

[Claims]
(1) A speaker-dependent speech recognition device having speech input means and speech analysis means, comprising:
storage means consisting of a memory that stores the recognition speech patterns created at speech registration, a memory that stores speech data for synthesizing the input speech corresponding to each of those speech patterns, and a memory that stores recognition-result display data;
means for analyzing the input speech into a recognition speech pattern at speech registration and, at the same time, analyzing it into synthesis speech data and storing that data;
means for synthesizing speech from the stored synthesis speech data; and
means for storing the recognition-result display data entered after speech synthesis at a location associated with the recognition speech pattern;
wherein the input speech is recorded when the recognition words are registered and, once all recognition words have been registered, the recognition-result display data is entered while the recordings are played back in order from the first.
JP1013426A 1989-01-23 1989-01-23 Voice recognition device Expired - Fee Related JP2744039B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP1013426A JP2744039B2 (en) 1989-01-23 1989-01-23 Voice recognition device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP1013426A JP2744039B2 (en) 1989-01-23 1989-01-23 Voice recognition device

Publications (2)

Publication Number Publication Date
JPH02193196A true JPH02193196A (en) 1990-07-30
JP2744039B2 JP2744039B2 (en) 1998-04-28

Family

ID=11832812

Family Applications (1)

Application Number Title Priority Date Filing Date
JP1013426A Expired - Fee Related JP2744039B2 (en) 1989-01-23 1989-01-23 Voice recognition device

Country Status (1)

Country Link
JP (1) JP2744039B2 (en)

Also Published As

Publication number Publication date
JP2744039B2 (en) 1998-04-28

Legal Events

Date Code Title Description
LAPS Cancellation because of no payment of annual fees