JP2003099091A

JP2003099091A - Speech recognition device and speech recognition method

Info

Publication number: JP2003099091A
Application number: JP2001289263A
Authority: JP
Inventors: Takashi Tomoe; 孝友枝
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2001-09-21
Filing date: 2001-09-21
Publication date: 2003-04-04
Anticipated expiration: 2021-09-21
Also published as: JP4797307B2

Abstract

PROBLEM TO BE SOLVED: To provide a speech recognition device in which user's reuttering is not required, work load of the user is made light and quick and precise correcting and supplementing functions for recognition results are provided. SOLUTION: A search processing section 3 conducts a recognition process for voice recorded in a voice storage section 1 using grammar information and a recognition dictionary section 2. A search control section 4 transmits recognition result candidate word strings obtained by the recognition process of the section 3 to a recognition result candidate display section 5 and makes a request to the section 3 so that the section 3 conducts voice recognition employing only a candidate word string, that starts with a correct character string notified by a correct character storage section 8, as a recognition object word string and voice recorded in a voice storage section. A correct character input section 7 inputs the characters of the correct word string one character at a time. The section 8 records the correct character string inputted from the section 7 when the correct word string is not included in the recognition result candidate word string displayed in the section 5 and notifies the above to the section 4.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、音声認識装置及び
音声認識方法に関するものであり、特に誤認識結果の修
正を迅速、かつ高精度に行うことが出きる音声認識装置
及び音声認識方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voice recognition device and a voice recognition method, and more particularly to a voice recognition device and a voice recognition method capable of correcting an erroneous recognition result quickly and highly accurately.

【０００２】[0002]

【従来の技術】従来の音声認識装置における誤認識単語
の修正方法の例が、「自分の声でパソコンが動く（村上
弘子著１９９９年８月１０日株式会社ＮＥＣクリ
エイティブ発行）」の２２〜２５ページに記載されてい
る。この従来技術では、誤認識した単語の修正として
「認識結果候補を複数列挙し、いずれかをユーザが選択
する」、「再発声及び再認識処理を行うことで誤認識結
果を正解単語と置換する」、「誤認識された箇所の正解
結果をキーボードで入力することにより修正する」とい
う方法が用いられている。2. Description of the Related Art An example of a method for correcting an erroneously recognized word in a conventional voice recognition device is "Personal computer moves with one's own voice (by Hiroko Murakami, published on August 10, 1999 by NEC Creative Co., Ltd.), 22-25. It is listed on the page. In this conventional technique, as a correction of a word that has been erroneously recognized, "a plurality of recognition result candidates are listed and a user selects one of them" and "A rerecognition process and a re-recognition process are performed to replace the erroneous recognition result with the correct word. "," Correct by inputting the correct answer result of the erroneously recognized part with the keyboard ".

【０００３】一方、文字入力装置や文字認識装置におい
ては、入力したい文字列の全てをユーザが打ち込まなく
とも、文字列入力装置側が文字列の補完を行う「補完機
能」が発明されており、その従来例が「http://www.hll
a.is.tsukuba.ac.jp/~yas/ipe/nitiniti2-enshu-1996/1
996-10-21/unix-completion.html」に記載されている。
これは、ユーザが入力した文字列に対応する正解候補文
字列だけを表示、選択できるものであり、文字列入力途
中で候補が少数又は一意に決定された際に、文字列を全
て入力しなくても正解候補を選択できるという機能であ
る。On the other hand, in the character input device and the character recognition device, a "complementary function" has been invented in which the character string input device side complements the character string even if the user does not type in all the desired character string. The conventional example is "http: //www.hll
a.is.tsukuba.ac.jp/~yas/ipe/nitiniti2-enshu-1996/1
996-10-21 / unix-completion.html ".
This is to display and select only the correct answer candidate character string corresponding to the character string input by the user, and when the candidates are few or uniquely decided during the character string input, the entire character string is not input. However, it is a function that can select the correct answer candidate.

【０００４】[0004]

【発明が解決しようとする課題】上述した従来の音声認
識装置は、認識結果候補に正解がない場合、「再発声を
行う」、「正解をキーボードで入力した後、漢字変換を
行う」などのユーザへの負担が大きかった。また、従来
の文字入力装置等における補完機能では、正解文字列を
全て打ち込まなくてはよいものの、候補が多く存在する
場合、例えば音声認識のように類似単語が数百以上存在
する場合、依然として多くの文字列を入力する必要があ
った。The above-mentioned conventional speech recognition apparatus, when there is no correct answer in the recognition result candidates, "repeat voice", "input the correct answer on the keyboard, and then perform Kanji conversion", etc. The burden on the user was heavy. Further, in the complementary function in the conventional character input device, etc., although it is not necessary to enter all correct answer character strings, if there are many candidates, for example, if there are hundreds or more similar words such as voice recognition, many still remain. I had to enter the string.

【０００５】本発明の目的は、上記の従来技術の問題点
を回避しつつ、ユーザの入力した正解文字列を利用し、
バックアップ辞書も含めた認識辞書を用いて、ユーザの
発声した誤認識された音声データに対して再認識処理を
行うことにより、ユーザの再発声を必要としない、ユー
ザへの負担の軽く、迅速かつ高精度な認識結果の修正機
能及び補完機能を備えた音声認識装置を提供することに
ある。An object of the present invention is to use the correct character string input by the user while avoiding the above-mentioned problems of the prior art,
The recognition dictionary including the backup dictionary is used to re-recognize the erroneously recognized voice data uttered by the user, thereby eliminating the need for the user to re-voice. An object of the present invention is to provide a voice recognition device having a highly accurate recognition result correction function and a complementary function.

【０００６】[0006]

【課題を解決するための手段】本願の第１の発明は、音
声認識装置において、入力された音声を記録する音声記
憶部と、認識単語辞書情報を持つ認識辞書部と、予め用
意された文法情報及び前記認識辞書部を用いて前記音声
記憶部に記録されている音声に対し認識処理を行うサー
チ処理部と、前記サーチ処理部の認識処理による認識結
果候補単語列を認識結果候補表示部に渡すとともに、正
解文字記憶部から正解文字列を通知された場合通知され
た前記正解文字列で始まる候補単語列のみを認識対象単
語列として前記音声記憶部に記録されている前記音声を
用いて音声認識を行う再認識処理を前記サーチ処理部に
要求するサーチ制御部と、前記認識結果候補単語列を表
示する認識結果候補表示部と、正解単語列の文字を一文
字ずつ入力する正解文字入力部と、前記認識結果候補表
示部に表示されている前記認識結果候補単語列の中に正
解単語列が含まれていない場合に前記正解文字入力部か
ら入力された前記正解文字列を記録し前記サーチ制御部
に通知する前記正解文字記憶部とを含んで構成されるこ
とを特徴とする。A first invention of the present application is, in a voice recognition device, a voice storage section for recording an input voice, a recognition dictionary section having recognition word dictionary information, and a prepared grammar. A search processing unit that performs recognition processing on the voice recorded in the voice storage unit using the information and the recognition dictionary unit, and a recognition result candidate word string by the recognition processing of the search processing unit on the recognition result candidate display unit. When the correct answer character string is notified from the correct answer character storage unit, only the candidate word string starting with the notified correct answer character string is recognized as the recognition target word string by using the voice recorded in the voice storage unit. A search control unit that requests re-recognition processing for recognition to the search processing unit, a recognition result candidate display unit that displays the recognition result candidate word string, and a character that inputs characters of the correct word string one by one. Record the correct answer character string input from the correct answer character input section when a correct answer word string is not included in the recognition result candidate word string displayed in the character input section and the recognition result candidate display section. The correct answer character storage unit for notifying the search control unit is also included.

【０００７】本願の第２の発明は、第１の発明の前記サ
ーチ処理部は、前記再認識処理時には前記認識辞書部よ
りも多くの単語を有するバックアップ辞書を利用して前
記音声認識を行うことを特徴とする。According to a second invention of the present application, the search processing section of the first invention performs the voice recognition by using a backup dictionary having more words than the recognition dictionary section during the re-recognition processing. Is characterized by.

【０００８】本願の第３の発明は、第１の発明の前記正
解文字入力部における前記文字入力終了の通知を受けて
未知語単語登録要求を出力する前記サーチ制御部と、前
記未知語単語登録要求を受けて入力された未知語単語に
より前記正解文字記憶部に記録されている前記文字列の
前記認識辞書部への単語登録を行う未知語単語登録部を
含んで構成されることを特徴とする。A third invention of the present application is the search control section which outputs an unknown word word registration request upon receipt of the character input end notification in the correct character input section of the first invention, and the unknown word word registration. An unknown word word registration unit for registering a word of the character string recorded in the correct character storage unit in the recognition dictionary unit by an unknown word word input in response to a request. To do.

【０００９】本願の第４の発明は、音声認識方法におい
て、入力された音声を記録する音声記憶部と認識単語辞
書情報を持つ認識辞書部とを予め備え、予め用意された
文法情報及び前記認識辞書部を用いて前記音声記憶部に
記録されている音声に対し認識処理を行い、前記認識処
理による認識結果である認識結果候補単語列を予め備え
た認識結果候補表示部に表示し、前記認識結果候補表示
部に表示されている前記認識結果候補単語列の中に正解
単語列が含まれていない場合に予め用意された正解文字
入力部から前記正解単語列の文字が一文字ずつ入力され
ると、入力された前記正解文字列を予め用意された正解
文字記憶部に記録し、記録された前記正解文字記憶部に
おける前記正解文字列で始まる候補単語列のみを認識対
象単語列として前記音声記憶部に記録されている前記音
声を用いて音声認識を行う再認識処理を行うことを特徴
とする。In a fourth aspect of the present invention, in the voice recognition method, a voice storage section for recording an input voice and a recognition dictionary section having recognition word dictionary information are provided in advance, and the prepared grammar information and the recognition are provided. A recognition process is performed on the voice recorded in the voice storage unit by using the dictionary unit, and a recognition result candidate word string that is a recognition result by the recognition process is displayed on a recognition result candidate display unit that is provided in advance. When the characters of the correct answer word string are input character by character from the correct answer character input section prepared in advance when the correct answer word string is not included in the recognition result candidate word strings displayed in the result candidate display section. , The input correct answer character string is recorded in a prepared correct answer character storage unit, and only candidate word strings starting with the correct answer character string in the recorded correct answer character storage unit are used as recognition target word strings. And performing re-recognition processing for speech recognition using the voice recorded in the voice storage unit.

【００１０】本願の第５の発明は、第４の発明の前記再
認識処理時には前記認識辞書部よりも多くの単語を有す
るバックアップ辞書を利用して前記音声認識を行うこと
を特徴とする。A fifth aspect of the present invention is characterized in that during the re-recognition process according to the fourth aspect, the voice recognition is performed using a backup dictionary having more words than the recognition dictionary section.

【００１１】本願の第６の発明は、第４の発明の前記正
解文字入力部における前記文字入力終了の通知を受けて
未知語単語登録要求を出力し、前記未知語単語登録要求
に応じて入力された未知語単語を前記正解文字記憶部に
記録されている前記文字列について前記認識辞書部への
単語登録を行うことを特徴とする。According to a sixth aspect of the present invention, an unknown word word registration request is output in response to the character input end notification in the correct character input section of the fourth aspect, and the unknown word word registration request is input. The registered unknown word is registered in the recognition dictionary unit for the character string recorded in the correct character storage unit.

【００１２】[0012]

【発明の実施の形態】次に、本発明の実施の形態につい
て図面を参照して詳細に説明する。BEST MODE FOR CARRYING OUT THE INVENTION Next, embodiments of the present invention will be described in detail with reference to the drawings.

【００１３】図１は、本発明の一実施の形態を示す音声
認識装置のブロック図である。FIG. 1 is a block diagram of a voice recognition apparatus showing an embodiment of the present invention.

【００１４】図１を参照すると、本発明の音声認識装置
は、音声記憶部１と、認識辞書部２と、サーチ処理部３
と、サーチ制御部４と、認識結果候補表示部５と、候補
選択入力部６と、正解文字入力部７と、正解文字記憶部
８と、未知語単語登録部９と、バックアップ辞書部１０
とから構成される。Referring to FIG. 1, the voice recognition device of the present invention includes a voice storage unit 1, a recognition dictionary unit 2, and a search processing unit 3.
, Search control unit 4, recognition result candidate display unit 5, candidate selection input unit 6, correct answer character input unit 7, correct answer character storage unit 8, unknown word word registration unit 9, and backup dictionary unit 10.
Composed of and.

【００１５】音声記憶部１は入力された音声を記録す
る。The voice storage unit 1 records the input voice.

【００１６】認識辞書部２は認識単語辞書情報を持つ。The recognition dictionary unit 2 has recognition word dictionary information.

【００１７】サーチ処理部３は自然言語文法、統計言語
モデル又はネットワーク文法などの文法情報及び認識辞
書部２を用いて音声記憶部１に記録されている音声に対
し認識処理を行う。The search processing unit 3 uses the grammatical information such as natural language grammar, statistical language model or network grammar, and the recognition dictionary unit 2 to perform recognition processing on the voice recorded in the voice storage unit 1.

【００１８】サーチ制御部４はサーチ処理部３に入力さ
れた音声の認識処理を行うよう要求し、認識結果候補単
語列を認識結果候補表示部５に渡すとともに、正解文字
記憶部８から正解文字列を通知された場合、通知された
正解文字列で始まる候補単語列のみを認識結果候補表示
部に表示し、候補単語列を全て認識結果候補表示部に表
示しても、まだ認識結果候補表示部に空きがある場合
や、候補単語列の個数が一定数以下になってしまった場
合、正解文字記憶部８から通知された正解文字列で始ま
る単語列のみを認識対象単語列とし、認識辞書としてバ
ックアップ辞書部１０に記録されている認識辞書も利用
しながら音声記憶部１に記録されている音声に対して再
度音声認識を行うようサーチ処理部３に指示し、すべて
の正解文字列の入力完了の通知を受けてユーザに未知語
単語登録の指示を出力する。The search control unit 4 requests the search processing unit 3 to perform recognition processing of the input voice, passes the recognition result candidate word string to the recognition result candidate display unit 5, and at the same time stores the correct character from the correct character storage unit 8. When a column is notified, only the candidate word string starting with the notified correct character string is displayed in the recognition result candidate display part, and even if all candidate word strings are displayed in the recognition result candidate display part, the recognition result candidate display is still displayed. When there is a space in the copy or when the number of candidate word strings is less than a certain number, only the word string starting with the correct answer character string notified from the correct answer character storage unit 8 is set as the recognition target word string, and the recognition dictionary. As an instruction, the search processing unit 3 is instructed to perform voice recognition again on the voice recorded in the voice storage unit 1 while using the recognition dictionary recorded in the backup dictionary unit 10, and all correct character strings are input. And outputs an instruction of the unknown word word registered in the user receives a notification of completion.

【００１９】認識結果候補表示部５は認識結果の候補単
語列をユーザに通知する。The recognition result candidate display section 5 notifies the user of the candidate word string of the recognition result.

【００２０】候補選択入力部６は認識結果候補表示部５
に表示されている候補単語列の中に正解がある場合、ユ
ーザから通知された正解の候補単語を確定する。The candidate selection input unit 6 is a recognition result candidate display unit 5
If there is a correct answer in the candidate word string displayed in, the correct candidate word notified by the user is determined.

【００２１】正解文字入力部７は認識結果候補表示部５
に正解単語列が含まれていない場合、ユーザが正解単語
列の文字を一文字ずつ入力するためのものである。The correct character input unit 7 is a recognition result candidate display unit 5
When the correct answer word string is not included in, the user inputs characters of the correct answer word string one by one.

【００２２】正解文字記憶部８は正解文字入力部７から
入力された正解文字列を追加記録し、今までに入力され
た正解文字列をサーチ制御部４に通知する。The correct character storage unit 8 additionally records the correct character string input from the correct character input unit 7, and notifies the search control unit 4 of the correct character strings input so far.

【００２３】未知語単語登録部９はユーザによる未知語
単語登録要求を受けて正解文字記憶部８に記録されてい
る文字列の単語登録を行う。The unknown word registration unit 9 receives a word registration request from the user and performs word registration of the character string recorded in the correct character storage unit 8.

【００２４】バックアップ辞書部１０は認識辞書部２に
おける認識単語辞書情報だけでなく、固有名詞や専門用
語など、より多くの認識単語辞書情報を有する。The backup dictionary unit 10 has not only the recognized word dictionary information in the recognition dictionary unit 2 but also more recognized word dictionary information such as proper nouns and technical terms.

【００２５】次に、図２を参照しながら本発明の音声認
識装置の動作ついて説明する。（ステップ１）：入力された音声を音声記憶部１に記録
する。（ステップ２）：サーチ制御部４は、サーチ処理部３に
入力された音声の認識処理を行うよう要求する。サーチ
処理部３は、自然言語文法、統計言語モデル又はネット
ワーク文法などの文法情報及び認識単語辞書情報を持つ
認識辞書部２を用いて、音声記憶部１に記録されている
音声に対し、認識処理を行う。（ステップ３）：サーチ処理部３は認識処理が終了する
と、認識結果の候補単語列をサーチ制御部４に渡す。サ
ーチ制御部４は認識結果候補単語列を認識結果候補表示
部５に渡し、認識結果の候補単語列をユーザに通知す
る。（ステップ４）：認識結果候補表示部５に表示されてい
る候補単語列の中に正解がある場合ステップ５に行き、
認識結果候補表示部５に表示されている候補単語列の中
に正解がない場合ステップ７に行く。（ステップ５）、（ステップ６）：ユーザは候補選択入
力部６に第何位候補が正解であるかを通知し、単語を確
定することができる。Next, the operation of the speech recognition apparatus of the present invention will be described with reference to FIG. (Step 1): The input voice is recorded in the voice storage unit 1. (Step 2): The search control unit 4 requests the search processing unit 3 to perform recognition processing of the input voice. The search processing unit 3 uses the recognition dictionary unit 2 having grammatical information such as natural language grammar, statistical language model or network grammar, and recognized word dictionary information to recognize the voice recorded in the voice storage unit 1. I do. (Step 3): When the recognition processing is completed, the search processing unit 3 passes the candidate word string of the recognition result to the search control unit 4. The search control unit 4 passes the recognition result candidate word string to the recognition result candidate display unit 5, and notifies the user of the recognition result candidate word string. (Step 4): If there is a correct answer in the candidate word string displayed in the recognition result candidate display unit 5, go to Step 5 and
If there is no correct answer in the candidate word string displayed on the recognition result candidate display section 5, the process goes to step 7. (Step 5), (Step 6): The user can notify the candidate selection / input unit 6 of the number of the correct answer and determine the word.

【００２６】また、第一候補が正解の場合には、そのま
ま次の発声を行うことにより、候補選択入力部に何も入
力しなくても、サーチ制御部４は第一位候補が正解であ
ると判断し、次の音声の処理を行うことができる。上記
の２つのどちらかに該当する場合、音声記憶部１に記録
されている音声の認識処理は終了となる。（ステップ７）：ユーザが正解文字入力部に全ての文字
列を入力したのでなければステップ８に行き、ユーザが
正解文字入力部に全ての文字列を入力したことを通知す
ると、ステップ１３以降を実行する。（ステップ８）：ユーザは正解文字入力部７に正解単語
列の文字入力を一文字ずつ行う。When the first candidate is the correct answer, the next utterance is performed as it is, so that the first candidate is the correct answer in the search control unit 4 without inputting anything in the candidate selection input unit. Then, the next voice processing can be performed. If either of the above two cases applies, the recognition processing of the voice recorded in the voice storage unit 1 ends. (Step 7): If the user has not entered all the character strings in the correct answer character input section, go to Step 8 and notify that the user has entered all the character strings in the correct answer character input section. Run. (Step 8): The user inputs the correct word string into the correct character input unit 7 character by character.

【００２７】但し、認識結果候補表示部５に途中まで正
解文字列の含まれている単語列候補がある場合、ユーザ
はどこまでが正解であるかを指定することにより、先頭
から途中までの複数文字の入力を一気に行うこともでき
る。（ステップ９）：正解文字入力部７は正解文字記憶部８
に、正解文字列を追加記録する。正解文字記憶部８は、
今までに入力された正解文字列をサーチ制御部４に通知
する。（ステップ１０）：サーチ制御部４は通知された正解文
字列で始まる候補単語列のみを認識結果候補表示部５に
表示する。（ステップ１１）：候補単語列を全て認識結果候補表示
部５に表示しても、まだ認識結果候補表示部に空きがあ
る場合や、候補単語列の個数が一定数以下になってしま
った場合などには、ステップ１２を実行する。そうでな
い場合、ステップ８に戻る。（ステップ１２）：サーチ制御部４は、サーチ処理部３
に対し正解文字記憶部から通知された正解文字列で始ま
る単語列のみを認識対象単語列とし、音声記憶部１に記
録されている音声に対して再度音声認識を行うよう指示
する。この際、認識辞書としてバックアップ辞書部１０
に記録されている認識辞書も利用することにより、最初
の認識処理時よりも多くの語彙に対して認識処理を行
う。このため、ユーザがバックアップ辞書１０に登録さ
れている単語を発声していた場合には、高精度かつ迅速
な候補修正が可能となる。再認識処理による新しい認識
結果の候補単語列を求め、ステップ３に戻る。（ステップ１３）〜（ステップ１５）：サーチ制御部４
は、ユーザが入力した文字列に該当する単語列を認識結
果として得ることができなかったのは、ユーザが認識辞
書に登録されていない未知語を入力したためと判断し、
ユーザに単語登録を行うよう通知し、未知語単語登録部
９により正解文字記憶部８に記録されている文字列の未
知語単語登録を行う。However, when there is a word string candidate in which the correct answer character string is included in the recognition result candidate display portion 5, the user designates up to what is the correct answer, so that the plural characters from the beginning to the middle You can also input all at once. (Step 9): Correct answer character input unit 7 is correct answer character storage unit 8
In addition, the correct answer character string is additionally recorded. The correct character storage unit 8 is
The search control unit 4 is notified of the correct answer character string input so far. (Step 10): The search control unit 4 displays only the candidate word string starting with the notified correct character string on the recognition result candidate display unit 5. (Step 11): When all the candidate word strings are displayed on the recognition result candidate display unit 5, but there is still space in the recognition result candidate display unit, or when the number of candidate word strings is below a certain number. For example, step 12 is executed. If not, return to step 8. (Step 12): The search control unit 4 makes the search processing unit 3
On the other hand, only the word string starting with the correct character string notified from the correct character storage unit is set as the recognition target word string, and the voice recorded in the voice storage unit 1 is instructed to perform the voice recognition again. At this time, the backup dictionary unit 10 is used as a recognition dictionary.
By using the recognition dictionary recorded in, the recognition processing is performed for more vocabulary than in the first recognition processing. Therefore, when the user utters a word registered in the backup dictionary 10, it is possible to correct the candidate with high accuracy and speed. A candidate word string of a new recognition result is obtained by the re-recognition process, and the process returns to step 3. (Step 13) to (Step 15): Search control unit 4
Judges that the word string corresponding to the character string input by the user cannot be obtained as the recognition result because the user has input an unknown word that is not registered in the recognition dictionary,
The user is notified to perform word registration, and the unknown word registration unit 9 registers the unknown word of the character string recorded in the correct character storage unit 8.

【００２８】[0028]

【発明の効果】以上説明したように、本発明は、ユーザ
の入力した正解文字列を利用し、バックアップ辞書も含
めた認識辞書を用いて、ユーザの発声した誤認識された
音声データに対して再認識処理を行うことにより、ユー
ザの再発声を必要としない、ユーザへの負担の軽く、迅
速かつ高精度な認識結果の修正機能及び補完機能を備え
た音声認識装置を提供することが出来る効果がある。As described above, according to the present invention, the correct character string input by the user is used, and the recognition dictionary including the backup dictionary is used for the erroneously recognized voice data uttered by the user. By performing the re-recognition process, it is possible to provide a voice recognition device that does not require a re-voice of the user, has a light burden on the user, and has a quick and highly accurate recognition result correction function and a complementary function. There is.

[Brief description of drawings]

【図１】本発明の一実施の形態を示す音声認識装置のブ
ロック図である。FIG. 1 is a block diagram of a voice recognition device showing an embodiment of the present invention.

【図２】図１に示す本発明の音声認識装置の動作フロー
図である。FIG. 2 is an operation flow diagram of the voice recognition device of the present invention shown in FIG.

[Explanation of symbols]

１音声記憶部２認識辞書部３サーチ処理部４サーチ制御部５認識結果候補表示部６候補選択入力部７正解文字入力部８正解文字記憶部９未知語単語登録部１０バックアップ辞書部 1 Voice memory 2 Recognition dictionary section 3 Search processing unit 4 Search control section 5 Recognition result candidate display 6 Candidate selection input section 7 Correct answer input section 8 Correct answer character storage 9 Unknown word registration section 10 Backup dictionary section

Claims

[Claims]

1. In a voice recognition device, a voice storage unit for recording an input voice, a recognition dictionary unit having recognition word dictionary information, and a voice storage unit using previously prepared grammatical information and the recognition dictionary unit. A search processing unit that performs recognition processing on the voice recorded in the unit, and a recognition result candidate word string by the recognition processing of the search processing unit is passed to the recognition result candidate display unit, and the correct character string is stored from the correct character storage unit. When notified, the search processing unit is subjected to re-recognition processing for performing voice recognition using only the notified candidate word string starting with the correct character string as the recognition target word string as the recognition target word string using the voice recorded in the voice storage unit. A requesting search control unit, a recognition result candidate display unit for displaying the recognition result candidate word string, a correct character input unit for inputting characters of the correct answer word string one by one, and the recognition result candidate display If the correct answer word string is not included in the recognition result candidate word strings displayed in the section, the correct answer character string input from the correct answer character input unit is recorded, and the correct answer is notified to the search control unit. A voice recognition device comprising a character storage unit.

2. The voice recognition according to claim 1, wherein the search processing unit performs the voice recognition using a backup dictionary having more words than the recognition dictionary unit during the re-recognition process. apparatus.

3. The search control unit which outputs an unknown word word registration request upon receipt of the character input end notification in the correct character input unit, and an unknown word word input in response to the unknown word word registration request. The voice recognition device according to claim 1, further comprising an unknown word registration unit that registers a word of the character string recorded in the correct character storage unit into the recognition dictionary unit.

4. A voice recognition method, comprising a voice storage unit for recording an input voice and a recognition dictionary unit having recognition word dictionary information in advance, and using the prepared grammatical information and the recognition dictionary unit, A recognition process is performed on the voice recorded in the voice storage unit, a recognition result candidate word string that is a recognition result of the recognition process is displayed on a recognition result candidate display unit that is provided in advance, and is displayed on the recognition result candidate display unit. When the correct answer word string is not included in the recognition result candidate word strings that have been input and the characters of the correct answer word string are input one by one from the prepared correct answer character input unit, the input correct answer is input. A character string is recorded in a correct character storage unit prepared in advance, and only candidate word strings starting with the correct character string in the recorded correct character character storage unit are recorded in the voice memory unit as recognition target word strings. A voice recognition method characterized by performing a re-recognition process for performing voice recognition using the voice that has been recorded.

5. The voice recognition method according to claim 4, wherein during the re-recognition process, the voice recognition is performed by using a backup dictionary having more words than the recognition dictionary unit.

6. The unknown character word registration request is output in response to the notification of the character input completion in the correct character input unit, and the unknown word word input in response to the unknown word registration request is stored in the correct character storage unit. 5. The voice recognition method according to claim 4, wherein the word is registered in the recognition dictionary unit for the character string recorded in.