JP2003099091A - Speech recognition device and speech recognition method

Info

Publication number: JP2003099091A
Authority: JP (Japan)
Prior art keywords: recognition, voice, unit, word, character
Legal status: Granted (the legal status is an assumption and is not a legal conclusion)
Application number: JP2001289263A
Other languages: Japanese (ja)
Other versions: JP4797307B2 (en)
Inventor: Takashi Tomoe (孝 友枝)
Current Assignee: NEC Corp (the listed assignees may be inaccurate)
Original Assignee: NEC Corp

Events:
Application filed by NEC Corp
Priority to JP2001289263A (patent JP4797307B2)
Publication of JP2003099091A
Application granted
Publication of JP4797307B2
Current legal status: Expired - Fee Related

Abstract

PROBLEM TO BE SOLVED: To provide a speech recognition device that does not require the user to repeat an utterance, places little burden on the user, and offers fast, accurate functions for correcting and completing recognition results. SOLUTION: A search processing unit 3 performs recognition processing on the voice recorded in a voice storage unit 1, using grammar information and a recognition dictionary unit 2. A search control unit 4 passes the recognition result candidate word strings obtained by the search processing unit 3 to a recognition result candidate display unit 5. When a correct character storage unit 8 notifies it of a correct character string, the search control unit 4 requests the search processing unit 3 to perform recognition again on the voice recorded in the voice storage unit 1, with only candidate word strings that start with that correct character string as recognition target word strings. A correct character input unit 7 accepts the characters of the correct word string one character at a time. When the correct word string is not among the candidate word strings displayed on the display unit 5, the correct character storage unit 8 records the correct character string entered through the input unit 7 and notifies the search control unit 4 of it.

Description

Detailed Description of the Invention

[0001]

[Technical Field of the Invention] The present invention relates to a speech recognition device and a speech recognition method, and in particular to a speech recognition device and a speech recognition method that can correct erroneous recognition results quickly and with high accuracy.

[0002]

[Prior Art] Examples of methods for correcting misrecognized words in a conventional speech recognition device are described on pages 22 to 25 of 「自分の声でパソコンが動く」 ("A personal computer that moves with one's own voice", by Hiroko Murakami, published August 10, 1999 by NEC Creative Co., Ltd.). This prior art corrects a misrecognized word by one of three methods: listing several recognition result candidates and letting the user select one; having the user repeat the utterance and running recognition again, so that the erroneous result is replaced with the correct word; or having the user type the correct result for the misrecognized portion on the keyboard.

[0003] Meanwhile, for character input devices and character recognition devices, a "completion function" has been invented in which the input device completes the character string even if the user does not type all of it. A conventional example is described at "http://www.hlla.is.tsukuba.ac.jp/~yas/ipe/nitiniti2-enshu-1996/1996-10-21/unix-completion.html". This function displays only the candidate strings that match what the user has typed so far and lets the user select among them. Once the candidates have narrowed to a few, or to exactly one, partway through the input, the user can select the correct candidate without typing the whole string.
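
As a rough illustration of the completion function described above, the following Python sketch filters a word list by the typed prefix and completes the input once a single match remains. The function names `complete` and `resolve` and the sample vocabulary are assumptions made for this example; they do not come from the patent or from the cited page.

```python
from typing import List, Optional


def complete(prefix: str, vocabulary: List[str]) -> List[str]:
    """Return the vocabulary entries that start with the typed prefix."""
    return [word for word in vocabulary if word.startswith(prefix)]


def resolve(prefix: str, vocabulary: List[str]) -> Optional[str]:
    """Return the single remaining match, or None while the prefix is ambiguous."""
    matches = complete(prefix, vocabulary)
    return matches[0] if len(matches) == 1 else None


# Hypothetical vocabulary for illustration only.
vocabulary = ["recognition", "recognizer", "record", "search"]

print(complete("rec", vocabulary))   # several entries still match
print(resolve("reco", vocabulary))   # still ambiguous, so None
print(resolve("recor", vocabulary))  # one entry left, so it is completed
```

With a small vocabulary a few characters are enough. With hundreds of similar words, as paragraph [0004] points out for speech recognition, the prefix has to grow much longer before the match becomes unique.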

[0004]

[Problems to Be Solved by the Invention] The conventional speech recognition device described above places a heavy burden on the user when none of the recognition result candidates is correct: the user must either repeat the utterance or type the correct answer on the keyboard and then perform kanji conversion. The completion function of conventional character input devices spares the user from typing the entire correct string, but when many candidates exist, for example in speech recognition, where there may be several hundred or more similar words, the user still has to enter many characters.

[0005] The object of the present invention is to provide a speech recognition device that avoids the above problems of the prior art and offers functions for correcting and completing recognition results that are fast, highly accurate, and light on the user, without requiring the user to repeat the utterance. It does so by using the correct character string entered by the user, together with a recognition dictionary that includes a backup dictionary, to perform re-recognition processing on the misrecognized voice data that the user uttered.

[0006]

[Means for Solving the Problems] The first invention of the present application is a speech recognition device comprising: a voice storage unit that records input voice; a recognition dictionary unit that holds recognition word dictionary information; a search processing unit that performs recognition processing on the voice recorded in the voice storage unit, using grammar information prepared in advance and the recognition dictionary unit; a search control unit that passes the recognition result candidate word strings produced by the search processing unit to a recognition result candidate display unit and that, when notified of a correct character string by a correct character storage unit, requests the search processing unit to perform re-recognition processing, that is, speech recognition on the voice recorded in the voice storage unit with only candidate word strings starting with the notified correct character string as recognition target word strings; the recognition result candidate display unit, which displays the recognition result candidate word strings; a correct character input unit through which the characters of the correct word string are entered one character at a time; and the correct character storage unit, which, when the correct word string is not among the recognition result candidate word strings displayed on the recognition result candidate display unit, records the correct character string entered through the correct character input unit and notifies the search control unit of it.
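
The re-recognition processing of the first invention can be sketched as follows, under loud assumptions. The patent does not specify how the search processing unit scores candidates, so `toy_score` below substitutes plain string similarity from Python's `difflib` for an acoustic score, and the stored utterance is represented by a string rather than a waveform. The names `toy_score`, `re_recognize`, and `n_best` are hypothetical.

```python
from difflib import SequenceMatcher
from typing import Callable, List


def toy_score(stored_audio: str, word: str) -> float:
    """Stand-in for an acoustic score (assumption: string similarity only)."""
    return SequenceMatcher(None, stored_audio, word).ratio()


def re_recognize(stored_audio: str, correct_prefix: str, vocabulary: List[str],
                 score: Callable[[str, str], float] = toy_score,
                 n_best: int = 3) -> List[str]:
    """Re-run recognition on the stored audio, restricted to words that
    start with the correct character string typed so far."""
    targets = [w for w in vocabulary if w.startswith(correct_prefix)]
    ranked = sorted(targets, key=lambda w: score(stored_audio, w), reverse=True)
    return ranked[:n_best]


# The recorded utterance is reused; the user does not speak again.
stored_audio = "tomoeda"  # placeholder for the recorded voice
vocabulary = ["tomato", "tomoda", "tomoeda", "domoeda", "tamoeda"]

print(re_recognize(stored_audio, "to", vocabulary))
```

The point of the sketch is the restriction step: acoustically close competitors such as "domoeda" are excluded from the search before scoring, because they do not start with the characters the user has already confirmed.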

[0007] The second invention of the present application is the first invention in which, during the re-recognition processing, the search processing unit performs the speech recognition using a backup dictionary that holds more words than the recognition dictionary unit.
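
One way to picture the role of the backup dictionary is as a larger word set that is merged with the recognition dictionary only during re-recognition. The sketch below assumes both dictionaries are plain sets of strings, which the patent does not state; the function name `re_recognition_vocabulary` and the sample words are hypothetical.

```python
from typing import List, Set


def re_recognition_vocabulary(correct_prefix: str,
                              recognition_dictionary: Set[str],
                              backup_dictionary: Set[str]) -> List[str]:
    """Build the recognition target list for the second pass: the union of
    both dictionaries, restricted to words starting with the typed prefix."""
    merged = recognition_dictionary | backup_dictionary
    return sorted(w for w in merged if w.startswith(correct_prefix))


# Hypothetical contents; the backup dictionary holds rarer words.
recognition_dictionary = {"tokyo", "tomato"}
backup_dictionary = {"tokyo", "tomoeda", "tsukuba"}

print(re_recognition_vocabulary("to", recognition_dictionary, backup_dictionary))
```

Because the prefix already prunes the search space, the second pass can afford a larger vocabulary than the first pass.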

[0008] The third invention of the present application is the first invention in which the search control unit outputs an unknown word registration request upon being notified that character input at the correct character input unit has finished, and which further comprises an unknown word registration unit that, based on the unknown word entered in response to the unknown word registration request, registers the character string recorded in the correct character storage unit as a word in the recognition dictionary unit.

[0009] The fourth invention of the present application is a speech recognition method in which a voice storage unit that records input voice and a recognition dictionary unit that holds recognition word dictionary information are provided in advance, the method comprising: performing recognition processing on the voice recorded in the voice storage unit, using grammar information prepared in advance and the recognition dictionary unit; displaying the recognition result candidate word strings obtained by the recognition processing on a recognition result candidate display unit provided in advance; when the correct word string is not among the recognition result candidate word strings displayed on the recognition result candidate display unit and the characters of the correct word string are entered one character at a time through a correct character input unit prepared in advance, recording the entered correct character string in a correct character storage unit prepared in advance; and performing re-recognition processing, that is, speech recognition on the voice recorded in the voice storage unit with only candidate word strings starting with the correct character string recorded in the correct character storage unit as recognition target word strings.

[0010] The fifth invention of the present application is the fourth invention in which, during the re-recognition processing, the speech recognition is performed using a backup dictionary that holds more words than the recognition dictionary unit.

[0011] The sixth invention of the present application is the fourth invention in which an unknown word registration request is output upon notification that character input at the correct character input unit has finished, and, for the unknown word entered in response to the unknown word registration request, the character string recorded in the correct character storage unit is registered as a word in the recognition dictionary unit.

[0012]

[Embodiments of the Invention] Next, an embodiment of the present invention will be described in detail with reference to the drawings.

[0013] FIG. 1 is a block diagram of a speech recognition device according to one embodiment of the present invention.

[0014] Referring to FIG. 1, the speech recognition device of the present invention comprises a voice storage unit 1, a recognition dictionary unit 2, a search processing unit 3, a search control unit 4, a recognition result candidate display unit 5, a candidate selection input unit 6, a correct character input unit 7, a correct character storage unit 8, an unknown word registration unit 9, and a backup dictionary unit 10.

[0015] The voice storage unit 1 records the input voice.

[0016] The recognition dictionary unit 2 holds recognition word dictionary information.

[0017] The search processing unit 3 performs recognition processing on the voice recorded in the voice storage unit 1, using the recognition dictionary unit 2 and grammar information such as a natural language grammar, a statistical language model, or a network grammar.

[0018] The search control unit 4 requests the search processing unit 3 to perform recognition processing on the input voice and passes the recognition result candidate word strings to the recognition result candidate display unit 5. When notified of a correct character string by the correct character storage unit 8, it displays on the recognition result candidate display unit 5 only the candidate word strings that start with the notified string. If the display unit still has free space after all such candidate word strings are displayed, or if the number of candidate word strings has fallen to or below a fixed number, it instructs the search processing unit 3 to perform speech recognition again on the voice recorded in the voice storage unit 1, with only word strings starting with the notified correct character string as recognition target word strings and with the recognition dictionary recorded in the backup dictionary unit 10 also in use. On being notified that the entire correct character string has been entered, it outputs an instruction to the user to register the unknown word.
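
The two conditions under which the search control unit 4 triggers re-recognition can be written down as a small decision function. This is a minimal sketch: the patent gives no concrete numbers, so `display_slots` (the capacity of the display unit) and `min_candidates` (the "fixed number") are assumed parameters, and `filter_and_decide` is a hypothetical name.

```python
from typing import List, Tuple


def filter_and_decide(candidates: List[str], correct_prefix: str,
                      display_slots: int = 5,
                      min_candidates: int = 2) -> Tuple[List[str], bool]:
    """Return the candidates to display and whether to request re-recognition."""
    # Keep only candidates that start with the confirmed correct characters.
    shown = [c for c in candidates if c.startswith(correct_prefix)]
    # Condition 1: all candidates fit and the display still has free space.
    has_free_slots = len(shown) < display_slots
    # Condition 2: the number of candidates is at or below the fixed number.
    too_few = len(shown) <= min_candidates
    return shown[:display_slots], has_free_slots or too_few


candidates = ["tomato", "tomoda", "domoeda", "tokyo", "toyama"]
print(filter_and_decide(candidates, "to", display_slots=3, min_candidates=1))
print(filter_and_decide(candidates, "tom", display_slots=3, min_candidates=1))
```

In the first call four candidates survive the filter and the three-slot display is full, so no re-recognition is requested. In the second call only two survive, a slot is free, and the function signals that a second pass is worthwhile.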

[0019] The recognition result candidate display unit 5 presents the recognition result candidate word strings to the user.

[0020] When the correct answer is among the candidate word strings displayed on the recognition result candidate display unit 5, the candidate selection input unit 6 confirms the candidate word that the user indicates as correct.

[0021] The correct character input unit 7 lets the user enter the characters of the correct word string one character at a time when the correct word string is not shown on the recognition result candidate display unit 5.

[0022] The correct character storage unit 8 appends the characters entered through the correct character input unit 7 to its record and notifies the search control unit 4 of the correct character string entered so far.

[0023] The unknown word registration unit 9 receives an unknown word registration request from the user and registers the character string recorded in the correct character storage unit 8 as a word.

[0024] The backup dictionary unit 10 holds not only the recognition word dictionary information of the recognition dictionary unit 2 but also additional recognition word dictionary information, such as proper nouns and technical terms.

[0025] Next, the operation of the speech recognition device of the present invention will be described with reference to FIG. 2.

(Step 1): The input voice is recorded in the voice storage unit 1.
(Step 2): The search control unit 4 requests the search processing unit 3 to perform recognition processing on the input voice. The search processing unit 3 performs recognition processing on the voice recorded in the voice storage unit 1, using the recognition dictionary unit 2, which holds recognition word dictionary information, and grammar information such as a natural language grammar, a statistical language model, or a network grammar.
(Step 3): When recognition processing ends, the search processing unit 3 passes the recognition result candidate word strings to the search control unit 4. The search control unit 4 passes them to the recognition result candidate display unit 5, which presents them to the user.
(Step 4): If the correct answer is among the candidate word strings displayed on the recognition result candidate display unit 5, the process goes to step 5; if it is not, the process goes to step 7.
(Step 5), (Step 6): The user tells the candidate selection input unit 6 which ranked candidate is correct, and the word is confirmed.

[0026] If the first candidate is correct, the user can simply make the next utterance. Even with nothing entered at the candidate selection input unit 6, the search control unit 4 then judges that the first candidate is correct and proceeds to process the next voice. In either of these two cases, recognition processing of the voice recorded in the voice storage unit 1 ends.

(Step 7): If the user has not yet entered the entire character string at the correct character input unit 7, the process goes to step 8. When the user gives notice that the entire character string has been entered, step 13 and the subsequent steps are executed.
(Step 8): The user enters the characters of the correct word string at the correct character input unit 7 one character at a time.

[0027] However, when the recognition result candidate display unit 5 shows a word string candidate that is correct up to some point, the user can enter several characters at once, from the beginning up to that point, by indicating how far the candidate is correct.

(Step 9): The correct character input unit 7 appends the entered characters to the correct character string recorded in the correct character storage unit 8. The correct character storage unit 8 notifies the search control unit 4 of the correct character string entered so far.
(Step 10): The search control unit 4 displays on the recognition result candidate display unit 5 only the candidate word strings that start with the notified correct character string.
(Step 11): If the recognition result candidate display unit 5 still has free space after all candidate word strings are displayed, or if the number of candidate word strings has fallen to or below a fixed number, step 12 is executed. Otherwise, the process returns to step 8.
(Step 12): The search control unit 4 instructs the search processing unit 3 to perform speech recognition again on the voice recorded in the voice storage unit 1, with only word strings starting with the correct character string notified by the correct character storage unit 8 as recognition target word strings. Because the recognition dictionary recorded in the backup dictionary unit 10 is also used at this time, recognition covers a larger vocabulary than in the first pass. Consequently, if the user uttered a word registered in the backup dictionary unit 10, the candidates can be corrected quickly and with high accuracy. The re-recognition processing yields new recognition result candidate word strings, and the process returns to step 3.
(Step 13) to (Step 15): The search control unit 4 judges that no word string matching the character string entered by the user could be obtained as a recognition result because the user input an unknown word that is not registered in the recognition dictionary. It notifies the user to register the word, and the unknown word registration unit 9 registers the character string recorded in the correct character storage unit 8 as an unknown word.
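
Steps 2 to 15 can be sketched as one loop, again under stated assumptions. The recorded voice is a plain string, `rank` uses `difflib` string similarity in place of the real recognizer, the step 11 condition is simplified to "fewer candidates than display slots", and the user is simulated as someone who types the intended word one character at a time. The names `rank`, `correct`, and `n_best` are hypothetical.

```python
from difflib import SequenceMatcher
from typing import List, Set, Tuple


def rank(stored_audio: str, words: List[str], n_best: int) -> List[str]:
    """Toy recognizer: order words by string similarity to the stored 'audio'."""
    def similarity(word: str) -> float:
        return SequenceMatcher(None, stored_audio, word).ratio()
    return sorted(words, key=similarity, reverse=True)[:n_best]


def correct(stored_audio: str, correct_word: str,
            recognition_dictionary: Set[str], backup_dictionary: Set[str],
            n_best: int = 3) -> Tuple[str, int, bool]:
    """Simulate steps 2 to 15 for a user whose intended word is correct_word.

    Returns (confirmed word, characters typed, registered as unknown word).
    """
    # Steps 2-3: first pass with the recognition dictionary only.
    candidates = rank(stored_audio, sorted(recognition_dictionary), n_best)
    typed = ""
    while correct_word not in candidates:              # step 4
        if typed == correct_word:                      # step 7: all typed
            recognition_dictionary.add(correct_word)   # steps 13-15
            return correct_word, len(typed), True
        typed += correct_word[len(typed)]              # steps 8-9
        candidates = [c for c in candidates if c.startswith(typed)]  # step 10
        if len(candidates) < n_best:                   # step 11 (simplified)
            merged = recognition_dictionary | backup_dictionary
            targets = sorted(w for w in merged if w.startswith(typed))
            candidates = rank(stored_audio, targets, n_best)  # step 12
    return correct_word, len(typed), False             # steps 5-6


# Hypothetical dictionaries; the intended word exists only in the backup.
recognition_dictionary = {"tomato", "domino", "potato"}
backup_dictionary = {"tomoeda", "tomoda", "tomita"}
result = correct("tomoeda", "tomoeda", recognition_dictionary, backup_dictionary)
print(result)
```

In this toy run a single typed character is enough: the prefix removes most first-pass candidates, the second pass consults the backup dictionary, and the intended word appears without the user speaking again. A word found in neither dictionary falls through to the registration branch once every character has been typed.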

[0028]

[Effects of the Invention] As described above, the present invention uses the correct character string entered by the user, together with a recognition dictionary that includes a backup dictionary, to perform re-recognition processing on the misrecognized voice data that the user uttered. It thereby provides a speech recognition device with functions for correcting and completing recognition results that are fast, highly accurate, and light on the user, without requiring the user to repeat the utterance.

[Brief Description of the Drawings]

FIG. 1 is a block diagram of a speech recognition device according to one embodiment of the present invention.

FIG. 2 is an operation flow diagram of the speech recognition device of the present invention shown in FIG. 1.

[Explanation of Symbols]

1 Voice storage unit
2 Recognition dictionary unit
3 Search processing unit
4 Search control unit
5 Recognition result candidate display unit
6 Candidate selection input unit
7 Correct character input unit
8 Correct character storage unit
9 Unknown word registration unit
10 Backup dictionary unit

Claims (6)

[Claims]

[Claim 1] A speech recognition device comprising: a voice storage unit that records input voice; a recognition dictionary unit that holds recognition word dictionary information; a search processing unit that performs recognition processing on the voice recorded in the voice storage unit, using grammar information prepared in advance and the recognition dictionary unit; a search control unit that passes the recognition result candidate word strings produced by the recognition processing of the search processing unit to a recognition result candidate display unit and that, when notified of a correct character string by a correct character storage unit, requests the search processing unit to perform re-recognition processing, in which speech recognition is performed on the voice recorded in the voice storage unit with only candidate word strings starting with the notified correct character string as recognition target word strings; the recognition result candidate display unit, which displays the recognition result candidate word strings; a correct character input unit through which the characters of the correct word string are entered one character at a time; and the correct character storage unit, which, when the correct word string is not among the recognition result candidate word strings displayed on the recognition result candidate display unit, records the correct character string entered through the correct character input unit and notifies the search control unit of it.
[Claim 2] The speech recognition device according to claim 1, wherein, during the re-recognition processing, the search processing unit performs the speech recognition using a backup dictionary that holds more words than the recognition dictionary unit.
[Claim 3] The speech recognition device according to claim 1, wherein the search control unit outputs an unknown word registration request upon being notified that character input at the correct character input unit has finished, the device further comprising an unknown word registration unit that, based on the unknown word entered in response to the unknown word registration request, registers the character string recorded in the correct character storage unit as a word in the recognition dictionary unit.
[Claim 4] A speech recognition method in which a voice storage unit that records input voice and a recognition dictionary unit that holds recognition word dictionary information are provided in advance, the method comprising: performing recognition processing on the voice recorded in the voice storage unit, using grammar information prepared in advance and the recognition dictionary unit; displaying the recognition result candidate word strings obtained by the recognition processing on a recognition result candidate display unit provided in advance; when the correct word string is not among the recognition result candidate word strings displayed on the recognition result candidate display unit and the characters of the correct word string are entered one character at a time through a correct character input unit prepared in advance, recording the entered correct character string in a correct character storage unit prepared in advance; and performing re-recognition processing, in which speech recognition is performed on the voice recorded in the voice storage unit with only candidate word strings starting with the correct character string recorded in the correct character storage unit as recognition target word strings.
[Claim 5] The speech recognition method according to claim 4, wherein, during the re-recognition processing, the speech recognition is performed using a backup dictionary that holds more words than the recognition dictionary unit.
[Claim 6] The speech recognition method according to claim 4, wherein an unknown word registration request is output upon notification that character input at the correct character input unit has finished, and, for the unknown word entered in response to the unknown word registration request, the character string recorded in the correct character storage unit is registered as a word in the recognition dictionary unit.

Priority Applications (1)

Application number: JP2001289263A
Priority date: 2001-09-21
Filing date: 2001-09-21
Title: Speech recognition apparatus and speech recognition method

Publications (2)

JP2003099091A (en), published 2003-04-04
JP4797307B2 (en), published 2011-10-19

Family

ID=19111780

Family Applications (1)

Application number: JP2001289263A
Title: Speech recognition apparatus and speech recognition method
Priority date: 2001-09-21
Filing date: 2001-09-21
Status: Expired - Fee Related
Publication: JP4797307B2 (en)

Country Status (1)

JP: JP4797307B2 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS61239378A (en) * 1985-04-16 1986-10-24 Toshiba Corp Discrimination processor
JPH02163874A (en) * 1988-12-16 1990-06-25 Nippon Telegr & Teleph Corp <Ntt> Word dictionary production system
JP2000056795A (en) * 1998-08-03 2000-02-25 Fuji Xerox Co Ltd Speech recognition device

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2010055044A (en) * 2008-04-22 2010-03-11 Ntt Docomo Inc Device, method and system for correcting voice recognition result
JP4709887B2 (en) * 2008-04-22 2011-06-29 株式会社エヌ・ティ・ティ・ドコモ Speech recognition result correction apparatus, speech recognition result correction method, and speech recognition result correction system
TWI427620B (en) * 2008-04-22 2014-02-21 Ntt Docomo Inc A speech recognition result correction device and a speech recognition result correction method, and a speech recognition result correction system

Also Published As

Publication number Publication date
JP4797307B2 (en) 2011-10-19

Legal Events

Code Title Description
RD01 Notification of change of attorney; Free format text: JAPANESE INTERMEDIATE CODE: A7421; Effective date: 20050317
RD01 Notification of change of attorney; Free format text: JAPANESE INTERMEDIATE CODE: A7421; Effective date: 20070118
RD01 Notification of change of attorney; Free format text: JAPANESE INTERMEDIATE CODE: A7421; Effective date: 20080612
A621 Written request for application examination; Free format text: JAPANESE INTERMEDIATE CODE: A621; Effective date: 20080812
RD01 Notification of change of attorney; Free format text: JAPANESE INTERMEDIATE CODE: A7421; Effective date: 20090512
A977 Report on retrieval; Free format text: JAPANESE INTERMEDIATE CODE: A971007; Effective date: 20100903
A131 Notification of reasons for refusal; Free format text: JAPANESE INTERMEDIATE CODE: A131; Effective date: 20100928
TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model); Free format text: JAPANESE INTERMEDIATE CODE: A01; Effective date: 20110705
RD01 Notification of change of attorney; Free format text: JAPANESE INTERMEDIATE CODE: A7421; Effective date: 20110705
A01 Written decision to grant a patent or to grant a registration (utility model); Free format text: JAPANESE INTERMEDIATE CODE: A01
A61 First payment of annual fees (during grant procedure); Free format text: JAPANESE INTERMEDIATE CODE: A61; Effective date: 20110718
FPAY Renewal fee payment (event date is renewal date of database); Free format text: PAYMENT UNTIL: 20140812; Year of fee payment: 3
R150 Certificate of patent or registration of utility model; Free format text: JAPANESE INTERMEDIATE CODE: R150
LAPS Cancellation because of no payment of annual fees