JP2015215390A

JP2015215390A - Speech recognition dictionary update device, speech recognition dictionary update method, and program

Info

Publication number: JP2015215390A
Application number: JP2014096577A
Authority: JP
Inventors: 太一浅見; Taichi Asami; 祥子山畠; Shoko Yamahata; 浩和政瀧; Hirokazu Masataki
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2014-05-08
Filing date: 2014-05-08
Publication date: 2015-12-03
Anticipated expiration: 2034-05-08
Also published as: JP5921601B2

Abstract

PROBLEM TO BE SOLVED: To provide a speech recognition dictionary update device which can suppress selection of unnecessary words by an operator.SOLUTION: A speech recognition dictionary update device comprises: a recognition dictionary storage unit which stores a recognition dictionary; an unregistered word management unit which uses related documents as an input to extract and store an unregistered word appeared in the related documents by a trigger of the word addition instruction of the operator, and not registered in the recognition dictionary; a registration management unit which selects the unregistered words by prescribed criteria to provide the operator with the selected and unregistered words, outputs the unregistered words that are instructed to exclude from registration by the operator as excluded words, and updates an extended recognition dictionary to which the unregistered words other than the excluded words are added as a new recognition dictionary; an excluded word storage unit which stores the excluded words; and an unregistered word correction unit which removes words that matches with the excluded words from among the words newly stored in the unregistered word management unit by a trigger of the word addition instruction by the operator.

Description

本発明は、音声認識に用いられる音声認識辞書に単語を追加して更新する音声認識辞書更新装置、音声認識辞書更新方法、プログラムに関する。 The present invention relates to a speech recognition dictionary updating apparatus, a speech recognition dictionary updating method, and a program for updating a speech recognition dictionary used for speech recognition by adding words.

音声認識システムに用いる認識辞書に追加で登録すべき単語を自動的に選出し、選出された単語を認識辞書に追加する音声認識辞書更新装置が特許文献１に開示されている。この音声認識辞書更新装置は、音声認識システム運用者（以下、単に運用者ともいう）が入力した関連文書から認識辞書に登録されていない未登録単語を抽出し、音声認識システム運用中に蓄積された音声と認識結果を用いて未登録単語それぞれのタスク関連度と平均認識信頼度を算出し、タスク関連度と平均認識信頼度から未登録単語それぞれの登録優先度を算出し、登録優先度の高い未登録単語を追加すべき単語として選出する。 Japanese Patent Application Laid-Open No. 2004-151867 discloses a speech recognition dictionary updating apparatus that automatically selects additional words to be registered in a recognition dictionary used in the speech recognition system and adds the selected words to the recognition dictionary. This speech recognition dictionary update device extracts unregistered words that are not registered in the recognition dictionary from related documents input by a speech recognition system operator (hereinafter also referred to simply as an operator) and accumulates them during operation of the speech recognition system. The task relevance and average recognition reliability of each unregistered word are calculated using the voice and the recognition result, and the registration priority of each unregistered word is calculated from the task relevance and average recognition reliability. Select a high unregistered word to add.

以下図１、図２を参照して、特許文献１の音声認識辞書更新装置について説明する。図１は、特許文献１の音声認識辞書更新装置９の構成を示すブロック図である。図２は、特許文献１の音声認識辞書更新装置９の動作を示すフローチャートである。 Hereinafter, the speech recognition dictionary updating apparatus of Patent Document 1 will be described with reference to FIGS. 1 and 2. FIG. 1 is a block diagram showing the configuration of the speech recognition dictionary updating device 9 of Patent Document 1. FIG. 2 is a flowchart showing the operation of the speech recognition dictionary update device 9 of Patent Document 1.

図１に示すように、特許文献１の音声認識辞書更新装置９は、関連文書記憶部７１０と、未登録単語抽出部１１０と、未登録単語記憶部８１０と、未登録単語特徴量抽出部２１０と、入力音声記憶部７２０と、音声認識部１２０と、認識結果記憶部８２０と、認識結果特徴量抽出部２２０と、タスク関連度算出部３１０と、認識辞書記憶部７３０と、暫定認識辞書登録部１３０と、暫定認識辞書記憶部８３０と、登録優先度算出部３２０と、暫定音声認識部１４０と、暫定認識結果記憶部８４０と、認識信頼度算出部２３０と、認識辞書登録部３３０と、追加登録単語確認除外部０３０と、認識辞書更新部０４０と、拡張認識辞書記憶部９００を含む構成である。 As shown in FIG. 1, the speech recognition dictionary update device 9 of Patent Document 1 includes a related document storage unit 710, an unregistered word extraction unit 110, an unregistered word storage unit 810, and an unregistered word feature amount extraction unit 210. An input voice storage unit 720, a voice recognition unit 120, a recognition result storage unit 820, a recognition result feature quantity extraction unit 220, a task relevance calculation unit 310, a recognition dictionary storage unit 730, and a provisional recognition dictionary registration Unit 130, provisional recognition dictionary storage unit 830, registration priority calculation unit 320, provisional speech recognition unit 140, provisional recognition result storage unit 840, recognition reliability calculation unit 230, recognition dictionary registration unit 330, The configuration includes an additional registered word check exclusion unit 030, a recognition dictionary update unit 040, and an extended recognition dictionary storage unit 900.

音声認識システム利用者（以下、単に利用者ともいう）は音声認識辞書更新装置９に音声を入力する。入力された音声（以下、入力音声という）は入力音声記憶部７２０に記憶されると同時に、認識辞書記憶部７３０に記憶されている認識辞書を用いて音声認識部１２０で音声認識され、認識結果が認識結果記憶部８２０に記憶される。 A voice recognition system user (hereinafter also simply referred to as a user) inputs voice to the voice recognition dictionary update device 9. The input voice (hereinafter referred to as input voice) is stored in the input voice storage unit 720, and at the same time, the voice recognition unit 120 recognizes the voice using the recognition dictionary stored in the recognition dictionary storage unit 730, and the recognition result. Is stored in the recognition result storage unit 820.

音声認識システム運用者は音声認識辞書更新装置９に関連文書を入力する。入力された関連文書は関連文書記憶部７１０に記憶される。 The voice recognition system operator inputs a related document to the voice recognition dictionary update device 9. The input related document is stored in the related document storage unit 710.

音声認識システム運用者の単語追加指示を契機として、図２に示す手順で以下のように単語追加処理と認識辞書の更新処理が行われる。 In response to a word addition instruction from the voice recognition system operator, a word addition process and a recognition dictionary update process are performed as follows in the procedure shown in FIG.

（ア）未登録単語抽出部１１０が、関連文書記憶部７１０に記憶された関連文書と、認識辞書記憶部７３０に記憶された認識辞書から、関連文書に出現しているが認識辞書に登録されていない未登録単語を抽出し、未登録単語記憶部８１０に記憶させる（Ｓ１１０）。 (A) The unregistered word extraction unit 110 appears in the related document from the related document stored in the related document storage unit 710 and the recognition dictionary stored in the recognition dictionary storage unit 730, but is registered in the recognition dictionary. Unregistered unregistered words are extracted and stored in the unregistered word storage unit 810 (S110).

（イ）未登録単語特徴量抽出部２１０が、未登録単語それぞれについて、未登録単語が含まれる関連文書中の文とその前後ｎ個の文（共起窓）の集合に含まれる単語である複数の単語と当該未登録単語との間の共起頻度をベクトル値とする共起頻度ベクトルを生成する（Ｓ２１０）。 (A) The unregistered word feature extraction unit 210 is a word included in a set of a sentence in a related document including an unregistered word and n sentences (co-occurrence windows) before and after the unregistered word. A co-occurrence frequency vector having a co-occurrence frequency between a plurality of words and the unregistered word as a vector value is generated (S210).

（ウ）認識結果特徴量抽出部２２０が、認識結果記憶部８２０に記憶された認識結果から、認識結果をｍ文ごとに分割した発話窓の集合に含まれる単語の生起頻度をベクトル値とする単語頻度ベクトルを生成する（Ｓ２２０）。 (C) From the recognition result stored in the recognition result storage unit 820, the recognition result feature amount extraction unit 220 sets the occurrence frequency of words included in the set of utterance windows obtained by dividing the recognition result for every m sentences as a vector value. A word frequency vector is generated (S220).

（エ）タスク関連度算出部３１０が、未登録単語それぞれの共起頻度ベクトルと単語頻度ベクトルから、未登録単語それぞれのタスク関連度を算出する。タスク関連度は共起頻度ベクトルと単語頻度ベクトルのコサイン距離から算出され、システム利用者の入力音声の内容と未登録単語との間の意味的な関連の高さを表す（Ｓ３１０）。 (D) The task relevance calculation unit 310 calculates the task relevance of each unregistered word from the co-occurrence frequency vector and word frequency vector of each unregistered word. The task relevance level is calculated from the cosine distance between the co-occurrence frequency vector and the word frequency vector, and represents the height of the semantic relationship between the contents of the input voice of the system user and the unregistered word (S310).

（オ）暫定認識辞書登録部１３０が、認識辞書記憶部７３０に記憶された認識辞書に、未登録単語記憶部８１０に記憶された未登録単語を全て追加した暫定認識辞書を生成し、暫定認識辞書記憶部８３０に記憶させる（Ｓ１３０）。 (E) The provisional recognition dictionary registration unit 130 generates a provisional recognition dictionary in which all the unregistered words stored in the unregistered word storage unit 810 are added to the recognition dictionary stored in the recognition dictionary storage unit 730, and provisional recognition is performed. The data is stored in the dictionary storage unit 830 (S130).

（カ）暫定音声認識部１４０が、暫定認識辞書を用いて、入力音声記憶部７２０に記憶された音声を音声認識した暫定認識結果を生成し、暫定認識結果記憶部８４０に記憶させる（Ｓ１４０）。 (F) The provisional speech recognition unit 140 uses the provisional recognition dictionary to generate a provisional recognition result obtained by recognizing the speech stored in the input speech storage unit 720, and stores the provisional recognition result in the provisional recognition result storage unit 840 (S140). .

（キ）認識信頼度算出部２３０が、暫定認識結果を用いて、未登録単語それぞれの認識結果出力の正解らしさを表す指標である認識信頼度と、当該未登録単語の出現回数をもとに、未登録単語それぞれの平均認識信頼度を算出する（Ｓ２３０）。 (G) The recognition reliability calculation unit 230 uses the provisional recognition result based on the recognition reliability that is an index representing the correctness of the recognition result output of each unregistered word and the number of appearances of the unregistered word. Then, the average recognition reliability of each unregistered word is calculated (S230).

（ク）登録優先度算出部３２０が、未登録単語それぞれのタスク関連度と平均認識信頼度を用いて、未登録単語それぞれの登録優先度を算出する。登録優先度は、例えばタスク関連度と平均認識信頼度の重み付き和として算出される（Ｓ３２０）。 (H) The registration priority calculation unit 320 calculates the registration priority of each unregistered word using the task relevance and the average recognition reliability of each unregistered word. The registration priority is calculated, for example, as a weighted sum of task relevance and average recognition reliability (S320).

（ケ）認識辞書登録部３３０が、登録優先度が事前に設定された閾値θ以上となる未登録単語を全て選出し（Ｓ３３０１）、選出された未登録単語を追加登録単語確認除外部０３０に出力する。閾値θは大きくすれば登録すべき未登録単語が選出されなくなるリスクが増加し、小さくすれば不要な未登録単語が選出されてしまうリスクが増加する。θとして通常は0.1程度の値が利用される。追加登録単語確認除外部０３０は入力された未登録単語の一覧を（例えばディスプレイに表示する等の方法で）音声認識システム運用者に提示し、音声認識システム運用者は登録すべきでないと判断した未登録単語をインタフェース（例えばチェックボックス等）を用いて指定する。追加登録単語確認除外部０３０は指定されなかった未登録単語を認識辞書登録部３３０に出力する（Ｓ０３０）。認識辞書登録部３３０は追加登録単語確認除外部０３０から入力された指定されなかった未登録単語を、認識辞書記憶部７３０に記憶された認識辞書に追加した拡張認識辞書を生成し、拡張認識辞書記憶部９００に記憶させる（Ｓ３３０２）。以上のステップＳ３３０１、Ｓ０３０、Ｓ３３０２からなる手順全体をステップＳ３３０と呼ぶ。 (K) The recognition dictionary registration unit 330 selects all unregistered words whose registration priority is equal to or higher than a preset threshold value θ (S3301), and selects the selected unregistered words to the additional registered word check exclusion unit 030. Output. Increasing the threshold θ increases the risk that unregistered words to be registered will not be selected, while decreasing the threshold θ increases the risk that unnecessary unregistered words will be selected. A value of about 0.1 is usually used as θ. The additional registered word confirmation exclusion unit 030 presents the list of input unregistered words to the voice recognition system operator (for example, by displaying it on a display) and determines that the voice recognition system operator should not register. An unregistered word is designated using an interface (for example, a check box). The additional registered word confirmation exclusion unit 030 outputs the unregistered unspecified word to the recognition dictionary registration unit 330 (S030). The recognition dictionary registration unit 330 generates an extended recognition dictionary in which the unregistered unregistered words input from the additional registered word check exclusion unit 030 are added to the recognition dictionary stored in the recognition dictionary storage unit 730, and the extended recognition dictionary The data is stored in the storage unit 900 (S3302). The entire procedure including the above steps S3301, S030, and S3302 is referred to as step S330.

（コ）認識辞書更新部０４０が、認識辞書記憶部７３０の内容を拡張認識辞書の内容に更新する（Ｓ０４０）。 (E) The recognition dictionary update unit 040 updates the contents of the recognition dictionary storage unit 730 with the contents of the extended recognition dictionary (S040).

以上の手順により認識辞書が更新され、それ以降に音声認識システム利用者が入力した音声は更新後の認識辞書を用いて音声認識部１２０で音声認識されることになる。更新後の認識辞書には、タスク関連度が高く、かつ認識信頼度が高い（＝実際に入力音声において発声されている可能性が高い）単語が新たに登録されているため、更新前よりも高精度に音声認識を行うことができる。 The recognition dictionary is updated by the above procedure, and the speech input by the speech recognition system user thereafter is recognized by the speech recognition unit 120 using the updated recognition dictionary. In the updated recognition dictionary, words with high task relevance and high recognition reliability (= highly likely to be uttered in the input speech) are newly registered. Speech recognition can be performed with high accuracy.

上記の手順（ケ）／（Ｓ３３０）において音声認識システム運用者による未登録単語の確認／除外を行うことにより、明らかに発声されないであろう不要な単語が誤って選出された場合に、当該不要な単語が認識辞書に追加され、認識結果に誤認識として出現してしまう危険性を低減することができる。 In the above procedure (K) / (S330), when an unregistered word is confirmed / excluded by the voice recognition system operator, an unnecessary word that will not be clearly spoken is selected by mistake. The risk that a simple word is added to the recognition dictionary and appears as a recognition error in the recognition result can be reduced.

なお、本発明では、以下の特許文献１の数式を利用する。
式（１）「手順（イ）共起頻度ベクトルの生成」
式（２）「手順（ウ）単語頻度ベクトルの生成」
式（３）「手順（キ）平均認識信頼度の算出」
式（４）「手順（キ）平均認識信頼度の算出に用いる単語ごとの認識信頼度の算出」
式（５）「手順（ク）登録優先度の算出」 In the present invention, the following mathematical formula of Patent Document 1 is used.
Formula (1) “Procedure (b) Generation of co-occurrence frequency vector”
Formula (2) “Procedure (C) Generation of Word Frequency Vector”
Formula (3) "Procedure (ki) Calculation of average recognition reliability"
Formula (4) “Procedure (g) Calculation of recognition reliability for each word used for calculation of average recognition reliability”
Formula (5) "Procedure (ku) Registration priority calculation"

特開２０１３−１７１２２２号公報JP 2013-171222 A

特許文献１のような従来技術を用いた音声認識システムを長期間に渡って運用する場合、音声認識システム運用者が単語追加指示を行うのは１度だけではない。システム運用中に新たな関連文書を入手できた場合や、入力される音声中で発声される単語が時間の経過とともに変化してきた場合には、音声認識システム運用者は、そのつど単語追加指示を行って認識辞書を更新し、音声認識精度を維持することになる。 When a speech recognition system using the conventional technology such as Patent Document 1 is operated for a long period of time, the speech recognition system operator does not give a word addition instruction only once. When new related documents are available during system operation, or when the words uttered in the input speech change over time, the speech recognition system operator must give a word addition instruction each time. The recognition dictionary is updated to maintain the speech recognition accuracy.

従来技術において音声認識システム運用者が２回目以降に単語追加指示を行った際、１回目に従来技術の手順（ケ）／（Ｓ３３０）において除外した不要な単語が再度追加すべき未登録単語として選出されてしまう場合が多い。単語追加処理では１回目とは別の関連文書と入力音声が使われるものの、未登録単語の選出アルゴリズムは１回目と変わらないためである。そのため、音声認識システム運用者は単語追加指示を行うたびに、毎回毎回同じ不要な単語を除外する作業が発生し、システム運用作業の効率が低下する。２回目に新たに選出されてしまった不要な単語も存在するため、３回目、４回目と単語追加実行を繰り返すたびに不要な単語の除外作業の量は増加する傾向があり、さらなる作業効率の低下を招く。この作業で除外しそこねた不要な単語は拡張認識辞書に登録されてその後も残り続けるため、不要な単語の除外作業には慎重さが求められる。このように音声認識システム運用者は、回を負うごとに増加する不要な単語の除外作業を慎重に行わなければならないため、システム運用コストが増大してしまうという課題があった。 In the prior art, when the voice recognition system operator issues a word addition instruction for the second and subsequent times, unnecessary words excluded in the first procedure (K) / (S330) are unregistered words to be added again. In many cases, it will be elected. This is because the word addition process uses a related document and input speech different from the first time, but the algorithm for selecting unregistered words is the same as the first time. For this reason, every time a voice recognition system operator gives an instruction to add a word, an operation to exclude the same unnecessary word occurs every time, and the efficiency of the system operation operation decreases. Since there are unnecessary words that have been newly selected for the second time, the amount of unnecessary word exclusion tends to increase each time the third and fourth times and the word addition execution is repeated. Incurs a decline. Unnecessary words that are left out in this work are registered in the extended recognition dictionary and continue to remain after that, so care must be taken when removing unnecessary words. As described above, the voice recognition system operator has to carefully exclude unnecessary words that are increased every time he / she takes time, and there is a problem that the system operation cost increases.

そこで本発明は、運用者の不要な単語の選出作業を抑制することができる音声認識辞書更新装置を提供することを目的とする。 Therefore, an object of the present invention is to provide a speech recognition dictionary updating apparatus that can suppress an operator's unnecessary word selection work.

本発明の音声認識辞書更新装置は、認識辞書記憶部と、未登録単語管理部と、登録管理部と、除外単語記憶部と、未登録単語補正部を含む。 The speech recognition dictionary update device of the present invention includes a recognition dictionary storage unit, an unregistered word management unit, a registration management unit, an excluded word storage unit, and an unregistered word correction unit.

認識辞書記憶部は、認識辞書を記憶する。未登録単語管理部は、関連文書を入力とし、運用者の単語追加指示を契機として、関連文書中に出現し、認識辞書に登録されていない単語である未登録単語を抽出して記憶する。登録管理部は、未登録単語を所定の基準で選出して、選出された未登録単語を運用者に提示し、選出された未登録単語のうち除外する旨の運用者指示があった未登録単語を除外単語として出力し、除外単語以外の未登録単語を認識辞書に追加した拡張認識辞書を新たな認識辞書として認識辞書記憶部に記憶された認識辞書を更新する。除外単語記憶部は、除外単語を記憶する。未登録単語補正部は、運用者の単語追加指示を契機として、未登録単語管理部に新たに記憶された未登録単語のうち、除外単語に一致する単語を削除する。 The recognition dictionary storage unit stores a recognition dictionary. The unregistered word management unit extracts and stores an unregistered word that is a word that appears in the related document and is not registered in the recognition dictionary with an input of the related document and an operator's word addition instruction. The registration management unit selects unregistered words based on a predetermined criterion, presents the selected unregistered words to the operator, and has received an operator instruction to exclude the selected unregistered words. The recognition dictionary stored in the recognition dictionary storage unit is updated with the extended recognition dictionary in which the unregistered word other than the excluded word is added to the recognition dictionary as a new recognition dictionary. The excluded word storage unit stores excluded words. An unregistered word correction | amendment part deletes the word which corresponds to an exclusion word among unregistered words newly memorize | stored in the unregistered word management part by an operator's word addition instruction | indication.

本発明の音声認識辞書更新装置によれば、運用者の不要な単語の選出作業を抑制することができる。 According to the speech recognition dictionary updating apparatus of the present invention, it is possible to suppress an operator's unnecessary word selection work.

特許文献１の音声認識辞書更新装置の構成を示すブロック図。The block diagram which shows the structure of the speech recognition dictionary update apparatus of patent document 1. FIG. 特許文献１の音声認識辞書更新装置の動作を示すフローチャート。10 is a flowchart showing the operation of the speech recognition dictionary update device of Patent Document 1. 実施例１の音声認識辞書更新装置の構成を示すブロック図。1 is a block diagram illustrating a configuration of a speech recognition dictionary update device according to Embodiment 1. FIG. 実施例１の音声認識辞書更新装置の動作を示すフローチャート。5 is a flowchart showing the operation of the speech recognition dictionary updating apparatus according to the first embodiment. 実施例２の音声認識辞書更新装置の構成を示すブロック図。The block diagram which shows the structure of the speech recognition dictionary update apparatus of Example 2. FIG. 実施例２の音声認識辞書更新装置の動作を示すフローチャート。9 is a flowchart showing the operation of the speech recognition dictionary updating apparatus according to the second embodiment. 実施例３の音声認識辞書更新装置の構成を示すブロック図。FIG. 9 is a block diagram illustrating a configuration of a speech recognition dictionary updating apparatus according to a third embodiment. 実施例３の音声認識辞書更新装置の動作を示すフローチャート。10 is a flowchart showing the operation of the speech recognition dictionary updating apparatus according to the third embodiment. 実施例４の音声認識辞書更新装置の構成を示すブロック図。FIG. 9 is a block diagram illustrating a configuration of a speech recognition dictionary updating apparatus according to a fourth embodiment. 実施例４の音声認識辞書更新装置の動作を示すフローチャート。10 is a flowchart showing the operation of the speech recognition dictionary updating apparatus according to the fourth embodiment. 実施例１の変形例の音声認識辞書更新装置の構成を示すブロック図。The block diagram which shows the structure of the speech recognition dictionary update apparatus of the modification of Example 1. FIG. 実施例１の変形例の音声認識辞書更新装置の動作を示すフローチャート。7 is a flowchart showing the operation of the speech recognition dictionary updating apparatus according to a modification of the first embodiment. 実施例２の変形例の音声認識辞書更新装置の構成を示すブロック図。FIG. 9 is a block diagram illustrating a configuration of a speech recognition dictionary update device according to a modification of the second embodiment. 実施例２の変形例の音声認識辞書更新装置の動作を示すフローチャート。10 is a flowchart showing the operation of the speech recognition dictionary updating apparatus according to a modification of the second embodiment. 実施例３の変形例の音声認識辞書更新装置の構成を示すブロック図。FIG. 10 is a block diagram illustrating a configuration of a speech recognition dictionary update device according to a modification of the third embodiment. 実施例３の変形例の音声認識辞書更新装置の動作を示すフローチャート。10 is a flowchart showing the operation of the speech recognition dictionary updating apparatus according to a modification of the third embodiment. 実施例４の変形例の音声認識辞書更新装置の構成を示すブロック図。FIG. 10 is a block diagram illustrating a configuration of a speech recognition dictionary update device according to a modification of the fourth embodiment. 実施例４の変形例の音声認識辞書更新装置の動作を示すフローチャート。10 is a flowchart showing the operation of the speech recognition dictionary updating apparatus according to a modification of the fourth embodiment.

以下、本発明の実施の形態について、詳細に説明する。なお、同じ機能を有する構成部には同じ番号を付し、重複説明を省略する。 Hereinafter, embodiments of the present invention will be described in detail. In addition, the same number is attached | subjected to the structure part which has the same function, and duplication description is abbreviate | omitted.

以下、図３、図４を参照して、本発明の実施例１の音声認識辞書更新装置１について説明する。図３は、本実施例の音声認識辞書更新装置１の構成を示すブロック図である。図４は、本実施例の音声認識辞書更新装置１の動作を示すフローチャートである。 Hereinafter, the speech recognition dictionary updating apparatus 1 according to the first embodiment of the present invention will be described with reference to FIGS. FIG. 3 is a block diagram illustrating the configuration of the speech recognition dictionary updating apparatus 1 according to the present embodiment. FIG. 4 is a flowchart showing the operation of the speech recognition dictionary updating apparatus 1 according to the present embodiment.

図３に示す通り、特許文献１の音声認識辞書更新装置９（図１）との違いは、特許文献１の音声認識辞書更新装置９における追加登録単語確認除外部０３０が追加登録単語確認除外保存部０５０に置き換えられている点、特許文献１の音声認識辞書更新装置９に含まれていない除外単語記憶部０６０と未登録単語補正部０７０が追加されている点である。 As shown in FIG. 3, the difference from the speech recognition dictionary update device 9 (FIG. 1) of Patent Literature 1 is that the additional registration word confirmation exclusion unit 030 in the speech recognition dictionary update device 9 of Patent Literature 1 stores additional registration word confirmation exclusion. This is a point that is replaced by the part 050, and an excluded word storage part 060 and an unregistered word correction part 070 that are not included in the speech recognition dictionary update device 9 of Patent Document 1 are added.

以下、新規な構成である追加登録単語確認除外保存部０５０と未登録単語補正部０７０の動作を図４を参照して説明する。なお図３に示された構成のうち、図１と同じ参照番号が付与されている構成の動作は前述と同じであるから適宜説明を略す。 Hereinafter, the operations of the additional registered word confirmation exclusion storing unit 050 and the unregistered word correcting unit 070 which are new configurations will be described with reference to FIG. In addition, since the operation | movement of the structure to which the same reference number as FIG. 1 is provided among the structures shown in FIG. 3 is the same as the above-mentioned, description is abbreviate | omitted suitably.

＜追加登録単語確認除外保存部０５０＞
入力：認識辞書登録部３３０で選出された未登録単語
出力：音声認識システム運用者に指定されなかった未登録単語（認識辞書登録部３３０へ）、音声認識システム運用者に指定された未登録単語（除外単語記憶部０６０へ）
処理：追加登録単語確認除外保存部０５０は、認識辞書登録部３３０が未登録単語を選出した（Ｓ３３０１）後に、以下の処理を行う。
１）追加登録単語確認除外保存部０５０は、入力された未登録単語の一覧を（例えばディスプレイに表示する等の方法で）音声認識システム運用者に提示し、音声認識システム運用者は登録すべきでないと判断した未登録単語をインタフェース（例えばチェックボックス等）を用いて指定する。追加登録単語確認除外部０５０は指定されなかった未登録単語を認識辞書登録部３３０に出力する（Ｓ０３０）。
２）追加登録単語確認除外保存部０５０は、さらに、音声認識システム運用者に指定された未登録単語（除外単語）を除外単語記憶部０６０に記憶させる（Ｓ０５０）。 <Additional registered word confirmation exclusion storage unit 050>
Input: Unregistered word selected by recognition dictionary registration unit 330 Output: Unregistered word not specified by speech recognition system operator (to recognition dictionary registration unit 330), Unregistered word specified by speech recognition system operator (To excluded word storage unit 060)
Processing: The additional registered word confirmation exclusion saving unit 050 performs the following processing after the recognition dictionary registration unit 330 selects an unregistered word (S3301).
1) The additional registered word confirmation exclusion storage unit 050 presents the list of input unregistered words to the voice recognition system operator (for example, by displaying it on a display), and the voice recognition system operator should register An unregistered word determined not to be specified is specified using an interface (for example, a check box). The additional registered word confirmation excluding unit 050 outputs the unregistered unspecified word to the recognition dictionary registering unit 330 (S030).
2) The additional registered word confirmation exclusion storing unit 050 further stores the unregistered word (excluded word) designated by the voice recognition system operator in the excluded word storage unit 060 (S050).

＜未登録単語補正部０７０＞
入力：未登録単語記憶部８１０に記憶されている未登録単語、除外単語記憶部０６０に記憶されている除外単語
出力：除外単語を取り除いた未登録単語（未登録単語記憶部８１０へ）
処理：未登録単語補正部０７０は、未登録単語抽出部１１０が抽出した未登録単語を未登録単語記憶部８１０に記憶させた（Ｓ１１０）後に以下の処理（Ｓ０７０）を行う。
１）未登録単語補正部０７０は、未登録単語記憶部８１０に記憶されている未登録単語と、除外単語記憶部０６０に記憶されている除外単語を取得する。
２）未登録単語補正部０７０は、未登録単語のうち、除外単語に一致するものを取り除く。
３）未登録単語補正部０７０は、未登録単語記憶部８１０の内容を、取り除かれずに残った未登録単語に書き換える。
※１回目の実行時には除外単語記憶部０６０に除外単語が１つも記憶されていないため、上記の手順２）で取り除かれる単語はなく、従来技術と同じ出力となる。 <Unregistered word correction unit 070>
Input: Unregistered word stored in unregistered word storage unit 810, Excluded word stored in excluded word storage unit 060 Output: Unregistered word after removal of excluded word (to unregistered word storage unit 810)
Process: The unregistered word correction unit 070 stores the unregistered word extracted by the unregistered word extraction unit 110 in the unregistered word storage unit 810 (S110), and then performs the following process (S070).
1) The unregistered word correction unit 070 acquires an unregistered word stored in the unregistered word storage unit 810 and an excluded word stored in the excluded word storage unit 060.
2) The unregistered word correction unit 070 removes unregistered words that match the excluded words.
3) The unregistered word correction unit 070 rewrites the contents of the unregistered word storage unit 810 with the unregistered words remaining without being removed.
* Since no excluded word is stored in the excluded word storage unit 060 at the time of the first execution, there is no word removed in the above procedure 2), and the output is the same as that of the prior art.

以上の処理によって音声認識システム運用者が除外した単語を記憶し、未登録単語の抽出処理の直後に除外単語を取り除くことにより、２回目以降の単語追加実行時に、
＊過去に除外した不要な未登録単語が再度選出されなくなる
＊共起頻度ベクトル生成処理（Ｓ２１０）、タスク関連度算出処理（Ｓ３１０）、暫定認識結果生成処理（Ｓ１４０）、登録優先度算出処理（Ｓ３２０）の処理時間が短縮される
＊Ｓ２３０で算出される平均認識信頼度の正確性が向上し、結果として拡張認識辞書の質が向上し、音声認識精度の改善効果が大きくなる
という効果が得られる。 By storing the words excluded by the voice recognition system operator through the above process, and removing the excluded words immediately after the unregistered word extraction process,
* Unnecessary unregistered words excluded in the past will not be selected again * Co-occurrence frequency vector generation processing (S210), task relevance calculation processing (S310), provisional recognition result generation processing (S140), registration priority calculation processing ( The processing time of S320) is shortened. * Accuracy of the average recognition reliability calculated in S230 is improved. As a result, the quality of the extended recognition dictionary is improved, and the effect of improving the speech recognition accuracy is obtained. It is done.

以下、図５、図６を参照して、本発明の実施例２の音声認識辞書更新装置２について説明する。図５は、本実施例の音声認識辞書更新装置２の構成を示すブロック図である。図６は、本実施例の音声認識辞書更新装置２の動作を示すフローチャートである。 Hereinafter, the speech recognition dictionary updating apparatus 2 according to the second embodiment of the present invention will be described with reference to FIGS. 5 and 6. FIG. 5 is a block diagram illustrating the configuration of the speech recognition dictionary updating apparatus 2 according to the present embodiment. FIG. 6 is a flowchart showing the operation of the speech recognition dictionary updating apparatus 2 of the present embodiment.

図５に示す通り、実施例１の音声認識辞書更新装置１との違いは、実施例１における未登録単語補正部０７０が、本実施例において不要単語推定除去部０８０に置き換えられている点のみである。実施例１の未登録単語補正部０７０では過去に音声認識システム運用者によって除外された除外単語と一致する未登録単語のみを取り除いていたが、それに加えて実施例２の不要単語推定除外部０８０では除外単語から推定した（必ずしも除外単語と一致するとは限らない）不要単語も取り除く。実施例２では実施例１よりも多くの不要単語が取り除かれるため、音声認識システム運用者の不要単語除外作業量の削減効果、単語追加の処理時間削減効果、平均認識信頼度の正確性向上による音声認識精度の改善効果がより大きなものとなる。 As shown in FIG. 5, the only difference from the speech recognition dictionary updating apparatus 1 of the first embodiment is that the unregistered word correcting unit 070 in the first embodiment is replaced with an unnecessary word estimation removing unit 080 in the present embodiment. It is. The unregistered word correction unit 070 of the first embodiment only removes unregistered words that match the excluded words that were previously excluded by the speech recognition system operator. In addition, the unnecessary word estimation exclusion unit 080 of the second embodiment is used. Then, unnecessary words estimated from the excluded words (which do not necessarily match the excluded words) are also removed. In the second embodiment, more unnecessary words are removed than in the first embodiment. Therefore, it is possible to reduce the amount of unnecessary words excluded by the voice recognition system operator, reduce the processing time for adding words, and improve the accuracy of the average recognition reliability. The effect of improving the speech recognition accuracy is greater.

以下、新規な構成である不要単語推定除去部０８０の動作を図６を参照して説明する。なお図５に示された構成のうち、図３や図１と同じ参照番号が付与されている構成の動作は前述と同じであるから適宜説明を略す。 Hereinafter, the operation of the unnecessary word estimation removing unit 080 having a novel configuration will be described with reference to FIG. Note that, in the configuration shown in FIG. 5, the operation of the configuration given the same reference numerals as those in FIG. 3 and FIG.

＜不要単語推定除去部０８０＞
入力：未登録単語記憶部８１０に記憶されている未登録単語、除外単語記憶部０６０に記憶されている除外単語
出力：除外単語を取り除いた未登録単語（未登録単語記憶部８１０へ）
処理：不要単語推定除去部０８０は、未登録単語抽出部１１０が抽出した未登録単語を、未登録単語記憶部８１０に記憶させた（Ｓ１１０）後に以下の処理（Ｓ０８０）を行う。
１）不要単語推定除去部０８０は、未登録単語記憶部８１０に記憶されている未登録単語と、除外単語記憶部０６０に記憶されている除外単語を取得する。
２）不要単語推定除去部０８０は、以下のルールで推定除外単語を求める。
(ア)不要単語推定除去部０８０は、未登録単語のうち、除外単語と一致するものは推定除外単語とする。
(イ)不要単語推定除去部０８０は、除外単語のうち記号（ひらがな・カタカナ・漢字・アルファベット以外の文字）のみで構成されている記号除外単語を集め、記号除外単語中に現れる全ての記号の集合（除外記号集合）を生成する。未登録単語のうち、除外記号集合に含まれる記号のみで構成されているものは推定除外単語とする。
３）不要単語推定除去部０８０は、未登録単語のうち、推定除外単語に一致するものを取り除く。
４）不要単語推定除去部０８０は、未登録単語記憶部８１０の内容を、取り除かれずに残った未登録単語に書き換える。
※実施例１と同様に、１回目の実行時には除外単語記憶部０６０に除外単語が１つも記憶されていないため、上記の手順２）で推定除外単語は一つも得られず、従来技術と同じ出力となる。 <Unnecessary word estimation removal unit 080>
Input: Unregistered word stored in unregistered word storage unit 810, Excluded word stored in excluded word storage unit 060 Output: Unregistered word after removal of excluded word (to unregistered word storage unit 810)
Process: The unnecessary word estimation removal unit 080 stores the unregistered word extracted by the unregistered word extraction unit 110 in the unregistered word storage unit 810 (S110), and then performs the following process (S080).
1) The unnecessary word estimation removal unit 080 acquires an unregistered word stored in the unregistered word storage unit 810 and an excluded word stored in the excluded word storage unit 060.
2) The unnecessary word estimation removing unit 080 obtains an estimated excluded word according to the following rule.
(A) Unnecessary word estimation removing unit 080 determines an unregistered word that matches an excluded word as an estimated excluded word.
(B) The unnecessary word estimation removing unit 080 collects symbol excluded words composed only of symbols (characters other than hiragana, katakana, kanji, and alphabets) among the excluded words, and calculates all the symbols that appear in the symbol excluded words. A set (exclusion symbol set) is generated. Among the unregistered words, words that are composed only of symbols included in the excluded symbol set are assumed to be excluded words.
3) The unnecessary word estimation removing unit 080 removes unregistered words that match the estimated excluded words.
4) The unnecessary word estimation removal unit 080 rewrites the contents of the unregistered word storage unit 810 with the unregistered words remaining without being removed.
* Similar to Example 1, since no excluded word is stored in the excluded word storage unit 060 at the first execution, no estimated excluded word is obtained in the above step 2), which is the same as the prior art. Output.

本技術分野では、関連文書としてＷｅｂから収集した文書などが入力されるため、関連文書には思わぬ記号が含まれている場合が多い。 In this technical field, since a document collected from the Web is input as a related document, the related document often includes an unexpected symbol.

記号には、音声認識システムの利用用途次第で重要な記号と不要な記号が存在する。例えば音楽に関する内容の音声を認識するシステムでは「＃」（シャープ）や「♭」（フラット）は重要な記号だが、音楽以外の内容ではこれらの記号は不要な場合が多い。完全に自動で重要な記号と不要な記号を区別することは難しいため、実施例２では利用用途を熟知した音声認識システム運用者が除外した単語を活用することで不要な記号を特定している。例えば上記手順２）のルール（イ）の第１文で不要な記号の特定を行っている。 There are important symbols and unnecessary symbols depending on the application of the speech recognition system. For example, “#” (sharp) and “♭” (flat) are important symbols in a system that recognizes audio related to music, but these symbols are often unnecessary for content other than music. Since it is difficult to distinguish important symbols from unnecessary symbols completely automatically, in Example 2, unnecessary symbols are identified by using words excluded by a voice recognition system operator who is familiar with the usage. . For example, unnecessary symbols are specified in the first sentence of the rule (b) in the procedure 2).

また、ある記号が不要でも、その記号が特殊な単語の一部として現れているときには重要となる場合もある（例えば認識辞書に「＃」は不要だが、プログラミング言語の「Ｃ＃」は登録したいと音声認識システム運用者が考える場合）。上記手順２）のルール（イ）の第２文では、不要な記号のみから構成される単語を除去している。これによって、「＃」を音声認識システム運用者が除外したとき、次回以降の単語追加実行時に「＃＃」は除外記号集合に含まれる記号のみで構成されているため自動的に除去されるが「Ｃ＃」は除外記号集合に含まれない文字（アルファベットのＣ）を単語の記述に含むため、除去されずに未登録単語の選出処理に渡されることになり、重要な単語が誤って除去される事態を防いでいる。 Even if a symbol is unnecessary, it may be important when the symbol appears as part of a special word (for example, “#” is not necessary in the recognition dictionary, but “C #” in the programming language is desired to be registered. Voice recognition system operator). In the second sentence of the rule (b) in the procedure 2), a word composed only of unnecessary symbols is removed. As a result, when the operator of the speech recognition system excludes “#”, “##” is automatically removed because it is composed only of symbols included in the excluded symbol set when the next word addition is executed. Since “C #” includes a character (alphabet C) that is not included in the exclusion symbol set in the word description, it is passed to the selection process of unregistered words without being removed, and important words are erroneously removed. The situation is prevented.

手順２）のルール（イ）の追加により、不要単語推定除去部０８０では実施例１の未登録単語補正部０７０よりも多くの単語が取り除かれることになり、
＊過去に除外した不要な未登録単語が再度選出されなくなるのに加えて、不要な記号のみからなる不要な未登録単語も選出されなくなるので、実施例１より大きく作業効率が改善する
＊共起頻度ベクトル生成処理（Ｓ２１０）、タスク関連度算出処理（Ｓ３１０）、暫定認識結果生成処理（Ｓ１４０）、登録優先度算出処理（Ｓ３２０）の処理時間が実施例１より大きく短縮される
＊Ｓ２３０で算出される平均認識信頼度の正確性が実施例１より大きく向上し、結果として音声認識精度の改善効果がより大きくなる
という効果を得られる。 By adding the rule (b) in step 2), the unnecessary word estimation removal unit 080 removes more words than the unregistered word correction unit 070 of the first embodiment.
* Unnecessary unregistered words excluded in the past will not be selected again, and unnecessary unregistered words consisting only of unnecessary symbols will not be selected, so work efficiency will be greatly improved compared to Example 1. * Co-occurrence The processing time of the frequency vector generation process (S210), the task relevance calculation process (S310), the provisional recognition result generation process (S140), and the registration priority calculation process (S320) is greatly shortened compared to the first embodiment * calculated in S230 The accuracy of the average recognition reliability is greatly improved as compared with the first embodiment, and as a result, the effect of improving the speech recognition accuracy can be obtained.

以下、図７、図８を参照して、本発明の実施例３の音声認識辞書更新装置３について説明する。図７は、本実施例の音声認識辞書更新装置３の構成を示すブロック図である。図８は、本実施例の音声認識辞書更新装置３の動作を示すフローチャートである。 Hereinafter, the speech recognition dictionary updating apparatus 3 according to the third embodiment of the present invention will be described with reference to FIGS. FIG. 7 is a block diagram illustrating the configuration of the speech recognition dictionary updating device 3 according to the present embodiment. FIG. 8 is a flowchart showing the operation of the speech recognition dictionary updating apparatus 3 of the present embodiment.

図７に示す通り、実施例１または２の音声認識辞書更新装置１、２との違いは、実施例１および２における追加登録単語確認除外保存部０５０が登録優先度保存部０９０に置き換えられている点、実施例１および２の音声認識辞書更新装置１、２に含まれない除外登録優先度記憶部０９１と登録優先度閾値算出部０９２が追加されている点である。なお、本実施例では、実施例２の構成をベースに上述の変更（０５０を０９０に置換、０９１、０９２を追加する変更）を加えているが、これに限定されず、実施例１の構成に、同様の変更（０５０を０９０に置換、０９１、０９２を追加する変更）を加えた構成としてもよい。 As shown in FIG. 7, the difference from the speech recognition dictionary update apparatuses 1 and 2 of the first or second embodiment is that the additional registered word confirmation exclusion storing unit 050 in the first and second embodiments is replaced with a registration priority storing unit 090. In other words, an exclusion registration priority storage unit 091 and a registration priority threshold value calculation unit 092 which are not included in the speech recognition dictionary updating apparatuses 1 and 2 of the first and second embodiments are added. In the present embodiment, the above-described changes (changes in which 050 is replaced with 090 and 091 and 092 are added) are added based on the configuration of the second embodiment, but the present invention is not limited to this, and the configuration of the first embodiment Further, a configuration in which similar changes (replace 050 with 090 and changes to add 091 and 092) may be made.

認識辞書登録部３３０は、事前に設定された閾値以上の登録優先度を持つ未登録単語を選出する。本実施例では、音声認識システム運用者が除外した未登録単語それぞれの登録優先度（除外登録優先度）を保存し、次回以降の単語追加実行時には除外登録優先度から算出した閾値を認識辞書登録部３３０に設定することにより、不要な未登録単語の選出を実施例１または２よりも大きく抑制する。 The recognition dictionary registration unit 330 selects unregistered words having a registration priority equal to or higher than a preset threshold value. In this embodiment, the registration priority (exclusion registration priority) of each unregistered word excluded by the voice recognition system operator is saved, and the threshold value calculated from the exclusion registration priority is registered in the recognition dictionary when the next word addition is executed. By setting in the unit 330, selection of unnecessary unregistered words is suppressed more than in the first or second embodiment.

以下、新規な構成である登録優先度保存部０９０と登録優先度閾値算出部０９２の動作を図８を参照して説明する。なお図７に示された構成のうち、図５、図３、図１と同じ参照番号が付与されている構成の動作は前述と同じであるから適宜説明を略す。 The operations of the registration priority storage unit 090 and the registration priority threshold calculation unit 092, which are new configurations, will be described below with reference to FIG. In the configuration shown in FIG. 7, the operation of the configuration given the same reference numerals as those in FIGS. 5, 3, and 1 is the same as described above, and therefore the description thereof is omitted as appropriate.

＜登録優先度保存部０９０＞
入力：認識辞書登録部３３０で選出された未登録単語
出力：音声認識システム運用者に指定されなかった未登録単語（認識辞書登録部３３０へ）、音声認識システム運用者に指定された未登録単語（除外単語記憶部０６０へ）、音声認識システム運用者に指定された未登録単語の登録優先度（除外登録優先度記憶部０９１へ）
処理：登録優先度保存部０９０は、認識辞書登録部３３０が未登録単語を選出した（Ｓ３３０１）後に、以下の処理を行う。
１）登録優先度保存部０９０は、入力された未登録単語の一覧を（例えばディスプレイに表示する等の方法で）音声認識システム運用者に提示し、音声認識システム運用者は登録すべきでないと判断した未登録単語をインタフェース（例えばチェックボックス等）を用いて指定する。登録優先度保存部０９０は、指定されなかった未登録単語を認識辞書登録部３３０に出力する（従来技術のＳ０３０と同じ処理）。
２）登録優先度保存部０９０は、さらに、音声認識システム運用者に指定された未登録単語（除外単語）を除外単語記憶部０６０に記憶させるとともに、除外単語の登録優先度を除外登録優先度記憶部０９１に記憶させる（Ｓ０９０）。このとき、過去の単語追加実行時に記憶されていた除外登録優先度は消去してから、今回の除外登録優先度を記憶させる。除外登録優先度は複数個（音声認識システム運用者によって除外された未登録単語の個数と同じ数）存在し、N_allはその個数を指す。 <Registration priority storage unit 090>
Input: Unregistered word selected by recognition dictionary registration unit 330 Output: Unregistered word not specified by speech recognition system operator (to recognition dictionary registration unit 330), Unregistered word specified by speech recognition system operator (To exclusion word storage unit 060), registration priority of unregistered words designated by the voice recognition system operator (to exclusion registration priority storage unit 091)
Processing: The registration priority storage unit 090 performs the following processing after the recognition dictionary registration unit 330 selects an unregistered word (S3301).
1) The registration priority storage unit 090 presents a list of input unregistered words to the voice recognition system operator (for example, by displaying it on a display), and the voice recognition system operator should not register. The determined unregistered word is designated using an interface (for example, a check box). The registration priority storage unit 090 outputs an unregistered unspecified word to the recognition dictionary registration unit 330 (the same processing as S030 of the conventional technology).
2) The registration priority storage unit 090 further stores the unregistered word (excluded word) designated by the voice recognition system operator in the excluded word storage unit 060 and sets the registration priority of the excluded word as the excluded registration priority. The data is stored in the storage unit 091 (S090). At this time, the exclusion registration priority stored at the time of past word addition is deleted, and then the current exclusion registration priority is stored. There are a plurality of exclusion registration priorities (the same number as the number of unregistered words excluded by the voice recognition system operator), and N_all indicates the number.

＜登録優先度閾値算出部０９２＞
入力：除外登録優先度記憶部０９１に記憶されている除外登録優先度
出力：登録優先度閾値（認識辞書登録部３３０へ）
処理：登録優先度閾値算出部０９２は、登録優先度算出部３２０が未登録単語それぞれの登録優先度を算出した（Ｓ３２０）後に、以下の処理を行う。
１）登録優先度閾値算出部０９２は、認識辞書登録部３３０に設定されている登録優先度閾値θを取得する。
２）登録優先度閾値算出部０９２は、除外登録優先度記憶部０９１に記憶されている除外登録優先度を全て取得する。
３）登録優先度閾値算出部０９２は、除外登録優先度の個数（N_all）が所定の値Ｎよりも小さい場合、手順１で取得した登録優先度閾値θをそのまま認識辞書登録部３３０に設定して処理を終了する。Ｎは登録優先度閾値の変更判断をするために必要な除外登録優先度の個数を表し、５０程度の値が用いられる。
４）登録優先度閾値算出部０９２は、除外登録優先度の個数がＮ以上となっている場合、登録優先度閾値θに、登録優先度閾値最大上昇幅Δθを加えたθ＋Δθを計算し、除外登録優先度のうち値がθ＋Δθ以下となっている低登録優先度の個数（N_small）をカウントする。Δθを大きくすれば登録優先度閾値の上昇量が大きくなって未登録単語が選出されにくくなり、小さくすれば登録優先度の閾値の上昇量が小さくなって不要単語の選出を抑制しにくくなる。Δθには通常０．０５〜０．１程度の値が用いられる。
５）登録優先度閾値算出部０９２は、除外登録優先度の総数（N_all）のうち、低登録優先度の個数（N_small）の割合（N_small / N_all）が所定の値Ｒ以上となっている場合、低登録優先度の中での最大の登録優先度を、登録優先度閾値θの更新値として認識辞書登録部３３０に設定する。N_small / N_allがＲ未満となっていた場合は、手順１で取得した登録優先度閾値θをそのまま認識辞書登録部３３０に設定する。Ｒは、除外登録優先度が閾値付近にどの程度密集している場合に閾値上昇の判断をするかを表すパラメータであり、通常は０．８程度の値が用いられる。
※１回目の実行時には除外登録優先度記憶部０９１に除外登録優先度が１つも記憶されていないため、上記の手順３でN_allがＮ以上とならず、登録優先度閾値の変更は行われない。 <Registration priority threshold calculation unit 092>
Input: Exclusion registration priority stored in exclusion registration priority storage unit 091 Output: Registration priority threshold (to recognition dictionary registration unit 330)
Process: The registration priority threshold calculation unit 092 performs the following process after the registration priority calculation unit 320 calculates the registration priority of each unregistered word (S320).
1) The registration priority threshold value calculation unit 092 acquires the registration priority threshold value θ set in the recognition dictionary registration unit 330.
2) The registration priority threshold value calculation unit 092 acquires all the exclusion registration priorities stored in the exclusion registration priority storage unit 091.
3) When the number of excluded registration priorities (N_all) is smaller than a predetermined value N, the registration priority threshold value calculation unit 092 sets the registration priority threshold value θ acquired in step 1 in the recognition dictionary registration unit 330 as it is. To finish the process. N represents the number of exclusion registration priorities necessary for determining the registration priority threshold change, and a value of about 50 is used.
4) The registration priority threshold calculation unit 092 calculates θ + Δθ by adding the registration priority threshold maximum increase width Δθ to the registration priority threshold θ when the number of exclusion registration priorities is N or more, and excludes it. The number (N_small) of low registration priorities whose values are equal to or less than θ + Δθ among the registration priorities is counted. Increasing Δθ increases the amount of increase in the registration priority threshold and makes it difficult to select unregistered words, while decreasing it decreases the amount of increase in the registration priority threshold and makes it difficult to suppress selection of unnecessary words. A value of about 0.05 to 0.1 is usually used for Δθ.
5) The registration priority threshold calculation unit 092 determines that the ratio (N_small / N_all) of the number of low registration priorities (N_small) out of the total number of excluded registration priorities (N_all) is equal to or greater than a predetermined value R The maximum registration priority among the low registration priorities is set in the recognition dictionary registration unit 330 as the update value of the registration priority threshold value θ. If N_small / N_all is less than R, the registration priority threshold value θ acquired in step 1 is set in the recognition dictionary registration unit 330 as it is. R is a parameter that indicates how dense the exclusion registration priority is in the vicinity of the threshold value, and the threshold value increase is determined. Usually, a value of about 0.8 is used.
* At the first execution, since no exclusion registration priority is stored in the exclusion registration priority storage unit 091, N_all does not exceed N in the above procedure 3, and the registration priority threshold is not changed. .

認識辞書登録部３３０は登録優先度が閾値θ以上となっている未登録単語を選出する。このとき、登録優先度が閾値θを少しだけ上回る未登録単語は、選出された未登録単語の中では低めの登録優先度を持ち、不要単語である可能性が比較的高い。 The recognition dictionary registration unit 330 selects unregistered words whose registration priority is equal to or higher than the threshold θ. At this time, an unregistered word whose registration priority slightly exceeds the threshold value θ has a lower registration priority among the selected unregistered words, and has a relatively high possibility of being an unnecessary word.

音声認識システム運用者が不要単語を除外した時、閾値θを少しだけ上回る未登録単語が数多く除外されていた場合、閾値を上昇させるべきだと考えられる。実施例３はこの考えに基づき、音声認識システム運用者が除外した不要単語の登録優先度（除外登録優先度）を活用して、次回以降の単語追加実行時に不要単語が選出されにくくなるように閾値θを上昇させるかどうかを判断する。 When the speech recognition system operator excludes unnecessary words, if many unregistered words slightly exceeding the threshold θ are excluded, it is considered that the threshold should be increased. Example 3 is based on this idea, and uses the registration priority (exclusion registration priority) of unnecessary words excluded by the voice recognition system operator so that unnecessary words are not easily selected at the next and subsequent word addition executions. It is determined whether or not the threshold value θ is increased.

上記の手順３〜５により、以下のように登録優先度閾値が設定される。
＊音声認識システム運用者が一定数Ｎ以上の未登録単語を除外したとき、
＊登録優先度閾値を少しだけ（Δθだけ）上回る未登録単語が数多く（割合Ｒ以上）除外されていた場合、
＊次回の単語追加実行時に、それら登録優先度閾値θを少しだけ上回る未登録単語を全て除外するように（低登録優先度の最大値に）登録優先度閾値を設定し直す。 By the above steps 3 to 5, the registration priority threshold is set as follows.
* When the voice recognition system operator excludes a certain number of unregistered words,
* If many unregistered words (ratio R or more) that slightly exceed the registration priority threshold (by Δθ) are excluded,
* When the next word addition is executed, the registration priority threshold is reset so that all unregistered words that slightly exceed the registration priority threshold θ are excluded (to the maximum value of the low registration priority).

実施例３では、以上のように、前回実行時に音声認識システム運用者が閾値付近の未登録単語を数多く除外した場合、それらが再度選出されないように閾値を変更する。これにより、実施例１や２の効果に加えて不要単語の選出を抑制し、音声認識システム運用者の不要単語除外作業をさらに削減することができる。 In the third embodiment, as described above, when the speech recognition system operator excludes many unregistered words near the threshold at the previous execution, the threshold is changed so that they are not selected again. Thereby, in addition to the effect of Example 1 and 2, selection of an unnecessary word can be suppressed and the unnecessary word exclusion operation | work of a speech recognition system operator can further be reduced.

以下、図９、図１０を参照して、本発明の実施例４の音声認識辞書更新装置４について説明する。図９は、本実施例の音声認識辞書更新装置４の構成を示すブロック図である。図１０は、本実施例の音声認識辞書更新装置４の動作を示すフローチャートである。 Hereinafter, the speech recognition dictionary updating apparatus 4 according to the fourth embodiment of the present invention will be described with reference to FIGS. FIG. 9 is a block diagram showing the configuration of the speech recognition dictionary update device 4 of this embodiment. FIG. 10 is a flowchart showing the operation of the speech recognition dictionary update device 4 of this embodiment.

図９に示す通り、実施例１の音声認識辞書更新装置１との違いは、実施例１における未登録単語補正部０７０が類似未登録単語除外部０９５に置き換えられている点である。 As shown in FIG. 9, the difference from the speech recognition dictionary updating apparatus 1 of the first embodiment is that the unregistered word correction unit 070 in the first embodiment is replaced with a similar unregistered word exclusion unit 095.

音声認識システム運用者が過去に除外した不要な単語と文字列として類似した単語は不要である可能性が高い（例えば「ＰＣ−９８０１」が不要な場合は、「ＰＣ−８８０１」や「ＰＣ−９８０１Ｆ」も不要と考えられる）。実施例４の音声認識辞書更新装置４は、この考えに基づき、不要な単語の選出を実施例１の音声認識辞書更新装置１よりも大きく抑制する。 There is a high possibility that a word similar to an unnecessary word excluded by the voice recognition system operator in the past is unnecessary (for example, when “PC-9801” is unnecessary, “PC-8801” or “PC-”). 9801F "is also considered unnecessary). Based on this idea, the speech recognition dictionary update device 4 according to the fourth embodiment suppresses selection of unnecessary words more than the speech recognition dictionary update device 1 according to the first embodiment.

実施例１の未登録単語補正部０７０は、過去に音声認識システム運用者に除外された単語（除外単語）と完全に一致する単語のみを未登録単語から取り除いていたが、実施例４の類似未登録単語除外部０９５はそれに加えて、除外単語と類似した単語（除外単語と編集距離の近い単語）も未登録単語から取り除く。これにより、不要な未登録単語の選出を実施例１よりも大きく抑制する。 The unregistered word correction unit 070 of the first embodiment removes only words that completely match words (excluded words) that were previously excluded by the speech recognition system operator from the unregistered words. In addition to that, the unregistered word exclusion unit 095 also removes words similar to the excluded words (words having an editing distance close to the excluded words) from the unregistered words. Thereby, selection of an unnecessary unregistered word is suppressed more than in the first embodiment.

以下、類似未登録単語除外部０９５の動作を図１０を参照して説明する。なお図９に示された構成のうち、図３と同じ参照番号が付与されている構成の動作は前述した実施例１と同じであるから適宜説明を略す。 Hereinafter, the operation of the similar unregistered word exclusion unit 095 will be described with reference to FIG. Note that, in the configuration shown in FIG. 9, the operation of the configuration given the same reference numerals as in FIG. 3 is the same as that in the first embodiment described above, and therefore the description thereof is omitted as appropriate.

＜類似未登録単語除外部０９５＞
入力：未登録単語記憶部８１０に記憶されている未登録単語、除外単語記憶部０６０に記憶されている除外単語
出力：除外単語と同一の単語、および類似する単語を取り除いた未登録単語（未登録単語記憶部８１０へ）
処理：類似未登録単語除外部０９５は、未登録単語抽出部１１０が抽出した未登録単語を未登録単語記憶部８１０に記憶させた（Ｓ１１０）後に以下の処理（Ｓ０９５）を行う。
１）類似未登録単語除外部０９５は、未登録単語記憶部８１０に記憶されている未登録単語と、除外単語記憶部０６０に記憶されている除外単語を取得する。
２）類似未登録単語除外部０９５は、未登録単語それぞれについて、以下の処理を行う。
２−１）類似未登録単語除外部０９５は、当該未登録単語と除外単語それぞれとの間の編集距離を算出し、それぞれの編集距離を当該未登録単語の文字数で除した正規化編集距離を算出する。編集距離は例えば参考非特許文献１などに開示されている既存の方法で算出する。正規化編集距離は、当該未登録単語を構成する文字列のうち除外単語と異なっている部分がどの程度あるかを表す実数値であり、当該未登録単語が除外単語と完全に一致する場合は０となり、異なる部分が増加するにしたがって値は大きくなる（例えば、４文字の未登録単語と４文字の除外単語があり、一致する文字が１文字もない場合、編集距離＝４、正規化編集距離＝４÷４＝１．０、となる）。
２−２）類似未登録単語除外部０９５は、正規化編集距離のうち一つでも所定の正規化編集距離閾値τ以下となる場合、当該未登録単語を取り除く。τ＝０と設定した場合は除外単語と完全一致する未登録単語のみが取り除かれるため、実施例１と同じ出力となる。通常はτ＝０．２程度の値に設定する。τ＝０．２と設定すると、例えば５文字の未登録単語のうち４文字以上が除外単語と一致する場合に取り除かれることになる。
３）類似未登録単語除外部０９５は、未登録単語記憶部８１０の内容を、取り除かれずに残った未登録単語に書き換える。
※１回目の実行時には除外単語記憶部０６０に除外単語が１つも記憶されていないため、上記の手順３で取り除かれる単語はなく、従来技術と同じ出力となる。 <Similar Unregistered Word Exclusion Unit 095>
Input: Unregistered word stored in the unregistered word storage unit 810, Excluded word stored in the excluded word storage unit 060: Unregistered word (unregistered word removed from the same word as the excluded word and similar words) Registered word storage unit 810)
Process: The similar unregistered word exclusion unit 095 stores the unregistered word extracted by the unregistered word extraction unit 110 in the unregistered word storage unit 810 (S110), and then performs the following process (S095).
1) The similar unregistered word exclusion unit 095 acquires an unregistered word stored in the unregistered word storage unit 810 and an excluded word stored in the excluded word storage unit 060.
2) The similar unregistered word exclusion unit 095 performs the following process for each unregistered word.
2-1) The similar unregistered word exclusion unit 095 calculates an edit distance between the unregistered word and each excluded word, and calculates a normalized edit distance obtained by dividing each edit distance by the number of characters of the unregistered word. calculate. The edit distance is calculated by an existing method disclosed in Reference Non-Patent Document 1, for example. The normalized edit distance is a real value indicating how much of the character string constituting the unregistered word is different from the excluded word, and when the unregistered word completely matches the excluded word 0, and the value increases as the number of different parts increases (for example, if there are 4 unregistered words and 4 excluded words and there is no matching character, edit distance = 4, normalized edit Distance = 4 ÷ 4 = 1.0).
2-2) The similar unregistered word exclusion unit 095 removes the unregistered word when at least one of the normalized edit distances is equal to or less than a predetermined normalized edit distance threshold τ. When τ = 0 is set, only unregistered words that completely match the excluded words are removed, so the output is the same as in the first embodiment. Usually, a value of about τ = 0.2 is set. When τ = 0.2 is set, for example, when 4 or more characters out of 5 unregistered words match the excluded word, they are removed.
3) The similar unregistered word exclusion unit 095 rewrites the contents of the unregistered word storage unit 810 with the remaining unregistered words that have not been removed.
* Since no excluded word is stored in the excluded word storage unit 060 at the time of the first execution, there is no word removed in the above procedure 3, and the output is the same as that of the conventional technique.

以上の処理によって音声認識システム運用者が除外した単語（除外単語）を記憶し、未登録単語の抽出処理の直後に除外単語に類似した単語を取り除くことにより、２回目以降の単語追加実行時に、
＊過去に除外した不要な未登録単語が再度選出されなくなるのに加えて、過去に除外した未登録単語に類似した不要な未登録単語も選出されなくなるので、実施例１より大きく作業効率が改善する
＊共起頻度ベクトル生成処理（Ｓ２１０）、タスク関連度算出処理（Ｓ３１０）、暫定認識結果生成処理（Ｓ１４０）、登録優先度算出処理（Ｓ３２０）の処理時間が実施例１より大きく短縮される
＊Ｓ２３０で算出される平均認識信頼度の正確性が実施例１より大きく向上し、結果として音声認識精度の改善効果がより大きくなる
という効果を得られる。
（参考非特許文献１：Daniel Jurafsky and James H. Martin: “Speech and Language Processing: An introduction to natural language processing, computational linguistics, and speech recognition”, Prentice Hall, 2009） The words (excluded words) excluded by the voice recognition system operator by the above process are stored, and words similar to the excluded words are removed immediately after the unregistered word extraction process.
* Unnecessary unregistered words excluded in the past are not selected again, and unnecessary unregistered words similar to unregistered words excluded in the past are also not selected, so work efficiency is greatly improved compared to the first embodiment. * Co-occurrence frequency vector generation processing (S210), task relevance calculation processing (S310), provisional recognition result generation processing (S140), registration priority calculation processing (S320) processing time is greatly shortened compared to the first embodiment. * The accuracy of the average recognition reliability calculated in S230 is greatly improved as compared with the first embodiment, and as a result, the effect of improving the speech recognition accuracy can be obtained.
(Reference Non-Patent Document 1: Daniel Jurafsky and James H. Martin: “Speech and Language Processing: An introduction to natural language processing, computational linguistics, and speech recognition”, Prentice Hall, 2009)

＜発明の効果＞
本発明によって２回目以降の単語追加実行時に不要な単語の選出が抑制されるため、音声認識システム運用者が除外すべき単語が減少し、運用作業の効率が向上する。結果として、音声認識システム運用中の認識辞書メンテナンスコストが低減される。 <Effect of the invention>
According to the present invention, selection of unnecessary words at the second and subsequent word addition executions is suppressed, so that the number of words that should be excluded by the voice recognition system operator is reduced and the efficiency of operation work is improved. As a result, the recognition dictionary maintenance cost during operation of the speech recognition system is reduced.

さらに本発明では、未登録単語の選出処理の前（従来技術の手順（ア）の直後、手順（イ）の直前のタイミング）に不要な単語の除外を行うため、従来技術での未登録単語それぞれに対して行う処理（手順（イ）の共起頻度ベクトル生成処理、手順（エ）のタスク関連度算出処理、手順（ク）の登録優先度算出処理）にかかる時間が短縮される。加えて、暫定認識辞書からも不要な単語が除外されるため暫定認識辞書のサイズが小さくなり、手順（カ）の暫定認識結果生成処理（暫定認識辞書による音声認識処理）にかかる時間も短縮される。これらの処理時間削減効果により、音声認識システム運用者が単語追加実行指示を行ってから単語追加処理が完了するまで待つ時間が短縮され、２回目以降の単語追加時のさらなる作業効率向上につながる。 Furthermore, in the present invention, unnecessary words are excluded before the selection process of unregistered words (immediately after the prior art procedure (A) and immediately before the procedure (A)). The time required for each of the processes (procedure (b) co-occurrence frequency vector generation process, procedure (d) task relevance calculation process, and procedure (c) registration priority calculation process) is reduced. In addition, since unnecessary words are also excluded from the temporary recognition dictionary, the size of the temporary recognition dictionary is reduced, and the time taken for the temporary recognition result generation processing (voice recognition processing by the temporary recognition dictionary) of the procedure (f) is shortened. The Due to these processing time reduction effects, the time for the voice recognition system operator to wait until the word addition processing is completed after issuing the word addition execution instruction is reduced, and the work efficiency is further improved at the second and subsequent word addition.

さらに、暫定認識辞書から不要な単語が削減されると、手順（キ）で算出される平均認識信頼度の正確性が向上する。特許文献１の段落［００３７］に記載されている通り、認識信頼度は、算出される。以下に、特許文献１の段落［００３７］を引用する。 Furthermore, when unnecessary words are reduced from the provisional recognition dictionary, the accuracy of the average recognition reliability calculated in the procedure (g) is improved. As described in paragraph [0037] of Patent Document 1, the recognition reliability is calculated. The paragraph [0037] of Patent Document 1 is cited below.

＜特許文献１の段落［００３７］＞
認識信頼度の算出方法は既知のいかなる方法も用いることができるが、例えば以下のように算出することができる（より詳しくは、「李, 河原, 鹿野, “2パス探索アルゴリズムにおける高速な単語事後確率に基づく信頼度算出法”, 情報処理学会研究報告, Vol.2003, No.124, pp.281-286」参照）。音声認識システムは入力音声に対する音響モデルおよび言語モデルの尤度が一番高い単語列を認識結果として出力する。しかし、もし入力音声の発音が曖昧であったり、単語列の繋がりが不自然でない単語が複数あったりしたときには、尤度が同程度の対立候補が多数現れ、音声認識システムが最適な認識結果の識別に困難をきたすことがある。音声認識システムが出力を決定するときに有力な対立候補が多数存在すれば認識信頼度は低くなり、逆に対立候補がほとんど存在しなければ認識信頼度は高くなる。 <Patent Document 1 Paragraph [0037]>
Any known method can be used to calculate the recognition reliability. For example, it can be calculated as follows (for more details, see “Li, Kawahara, Shikano,“ Fast word posterior in the two-pass search algorithm. Probability calculation method based on probability ", Information Processing Society of Japan, Vol.2003, No.124, pp.281-286)). The speech recognition system outputs a word string having the highest likelihood of the acoustic model and the language model for the input speech as a recognition result. However, if the pronunciation of the input speech is ambiguous, or if there are multiple words that are not unnaturally connected, a large number of conflicting candidates with the same likelihood appear, and the speech recognition system It can be difficult to identify. If there are many influential conflict candidates when the speech recognition system determines the output, the recognition reliability is low. Conversely, if there are few conflict candidates, the recognition reliability is high.

＜特許文献１の段落［００３７］終わり＞
暫定認識辞書に不要な単語が多いと、それらの単語が暫定音声認識処理中に上記「有力な対立候補」となってしまう可能性が高くなり、本来高い平均認識信頼度が付与されるべき単語の平均認識信頼度が低くなってしまうケースが増加する。暫定認識辞書から不要な単語を事前に削減しておくことで認識信頼度の正確性が向上するため、平均認識信頼度を用いて算出される登録優先度の正確性も向上し、その結果、追加すべき未登録単語がより正確に選出されるようになる。本来追加すべき単語が選出されないケースや追加すべきでない単語が選出されてしまうケースが減少するため、最終的に生成される拡張認識辞書の質が向上し、認識辞書更新後により大きな音声認識精度の向上効果が得られる。 <End of paragraph [0037] of Patent Document 1>
If there are many unnecessary words in the provisional recognition dictionary, there is a high possibility that these words will become the above-mentioned “potential conflict candidates” during the provisional speech recognition processing, and words that should originally be given a high average recognition reliability. The number of cases where the average recognition reliability of the is reduced. Since the accuracy of recognition reliability is improved by reducing unnecessary words from the provisional recognition dictionary in advance, the accuracy of registration priority calculated using the average recognition reliability is also improved. Unregistered words to be added are selected more accurately. The number of cases where words that should originally be added are not selected and cases where words that should not be added are reduced is reduced, so the quality of the extended recognition dictionary that is finally generated is improved and the speech recognition accuracy is increased after the recognition dictionary is updated. The improvement effect is obtained.

＜本発明のポイント＞
本発明は、「音声認識システム運用者によって除外された単語は確実に不要単語である」との考えに基づき、２回目以降の単語追加実行時に不要単語が選出されてしまうことを防ぐ。具体的には、従来技術の手順（ケ）において音声認識システム運用者によって除外された不要単語を記憶しておき、２回目以降の単語追加実行時に、従来技術の未登録単語の抽出処理（手順（ア）、Ｓ１１０）の直後のタイミングで不要単語の除外を行う。 <Points of the present invention>
The present invention prevents an unnecessary word from being selected during the second and subsequent word addition execution based on the idea that “the word excluded by the voice recognition system operator is definitely an unnecessary word”. Specifically, unnecessary words excluded by the speech recognition system operator in the prior art procedure (K) are stored, and prior art unregistered word extraction processing (procedure) (A) Unnecessary words are excluded at the timing immediately after S110).

このタイミングで不要単語の除外処理を行うことにより、それ以降の様々な処理にかかる時間の削減効果（＜発明の効果＞の第２段落）および、追加すべき未登録単語の選出の正確性を向上させる効果（＜発明の効果＞の第３段落）を得る。 By performing unnecessary word exclusion processing at this timing, it is possible to reduce the time required for various subsequent processing (the second paragraph of <Effects of the invention>) and the accuracy of selecting unregistered words to be added. The effect to improve (the third paragraph of <Effect of invention>) is obtained.

［変形例１］
以下、図１１、図１２を参照して、本発明の要部に注目するために従来技術の構成要素の一部を集約して実現した、実施例１の変形例（変形例１）である音声認識辞書更新装置５について説明する。図１１は、本変形例の音声認識辞書更新装置５の構成を示すブロック図である。図１２は、本変形例の音声認識辞書更新装置５の動作を示すフローチャートである。 [Modification 1]
Hereinafter, with reference to FIG. 11 and FIG. 12, a modification of the first embodiment (modification 1) realized by consolidating some of the constituent elements of the prior art to focus on the main part of the present invention. The speech recognition dictionary update device 5 will be described. FIG. 11 is a block diagram showing the configuration of the speech recognition dictionary update device 5 of this modification. FIG. 12 is a flowchart showing the operation of the speech recognition dictionary update device 5 of this modification.

図１１に示すように本変形例の音声認識辞書更新装置５は、実施例１に含まれる構成要件を全て過不足なく含んでいる。ただし、関連文書記憶部７１０と、未登録単語抽出部１１０と、未登録単語記憶部８１０は、未登録単語管理部５１００に含まれる。また、未登録単語特徴量抽出部２１０と、入力音声記憶部７２０と、音声認識部１２０と、認識結果記憶部８２０と、認識結果特徴量抽出部２２０と、タスク関連度算出部３１０は、タスク関連度管理部５２１０に含まれる構成である。また、暫定認識辞書登録部１３０と、暫定認識辞書記憶部８３０と、暫定音声認識部１４０と、暫定認識結果記憶部８４０と、認識信頼度算出部２３０は、平均認識信頼度管理部５２２０に含まれる構成である。また、音声認識辞書更新装置５は、タスク関連度管理部５２１０、平均認識信頼度管理部５２２０、登録優先度算出部３２０の三つの構成要件をさらに集約した登録優先度管理部５２００を含む。また、認識辞書登録部３３０と、追加登録単語確認除外保存部０５０と、認識辞書更新部０４０と、拡張認識辞書記憶部９００は、登録管理部５３００に含まれる構成である。本変形例の音声認識辞書更新装置５は、上述の新たな構成要件に加え、認識辞書記憶部７３０と、除外単語記憶部０６０と、未登録単語補正部０７０を含む。本変形例の音声認識辞書更新装置５は実施例１と同じ動作を実行するが、実施例１に存在する各構成要件が、これらを含む新たな構成要件内に集約されているため、各構成要件を集約してなる新たな構成要件の動作について、異なる表現でその動作例を開示するものとする。 As shown in FIG. 11, the speech recognition dictionary update device 5 according to the present modification includes all the constituent elements included in the first embodiment without excess or deficiency. However, the related document storage unit 710, the unregistered word extraction unit 110, and the unregistered word storage unit 810 are included in the unregistered word management unit 5100. The unregistered word feature quantity extraction unit 210, the input voice storage unit 720, the voice recognition unit 120, the recognition result storage unit 820, the recognition result feature quantity extraction unit 220, and the task relevance calculation unit 310 This is a configuration included in the association degree management unit 5210. In addition, the provisional recognition dictionary registration unit 130, the provisional recognition dictionary storage unit 830, the provisional speech recognition unit 140, the provisional recognition result storage unit 840, and the recognition reliability calculation unit 230 are included in the average recognition reliability management unit 5220. It is the composition which is. The speech recognition dictionary updating apparatus 5 includes a registration priority management unit 5200 that further aggregates three component requirements of a task relevance management unit 5210, an average recognition reliability management unit 5220, and a registration priority calculation unit 320. Further, the recognition dictionary registration unit 330, the additional registered word confirmation exclusion storage unit 050, the recognition dictionary update unit 040, and the extended recognition dictionary storage unit 900 are included in the registration management unit 5300. The speech recognition dictionary update device 5 of the present modification includes a recognition dictionary storage unit 730, an excluded word storage unit 060, and an unregistered word correction unit 070 in addition to the above-described new configuration requirements. The speech recognition dictionary update device 5 of the present modification performs the same operation as that of the first embodiment, but the respective component requirements existing in the first embodiment are aggregated in new component requirements including these components. An example of the operation of a new configuration requirement obtained by integrating the requirements is disclosed in different expressions.

前述同様、認識辞書記憶部７３０は、認識辞書を記憶している。未登録単語管理部５１００は、関連文書を入力とし、運用者の単語追加指示を契機として、関連文書中に出現し、認識辞書に登録されていない単語である未登録単語を抽出して記憶する（Ｓ５１００）。 As described above, the recognition dictionary storage unit 730 stores a recognition dictionary. The unregistered word management unit 5100 extracts and stores an unregistered word that is a word that appears in the related document and is not registered in the recognition dictionary when the related document is input and the operator adds a word. (S5100).

登録優先度管理部５２００は、各未登録単語の登録優先度を算出する（Ｓ５２００）。より詳細には、タスク関連度管理部５２１０は、利用者の入力音声を入力とし、利用者の入力音声の内容と未登録単語それぞれとの間の意味的な関連の高さを表すタスク関連度を、未登録単語ごとに算出する（Ｓ５２１０）。平均認識信頼度管理部５２２０は、認識辞書に未登録単語を追加した暫定認識辞書を用いて、利用者の入力音声を音声認識した結果である暫定認識結果を生成し、暫定認識結果に基づいて各未登録単語の認識結果出力の正解らしさを表す指標の出現回数平均である平均認識信頼度を算出する（Ｓ５２２０）。登録優先度算出部３２０は、実施例１と同様に、各未登録単語のタスク関連度と平均認識信頼度に基づいて、各未登録単語の登録優先度を算出する。 The registration priority management unit 5200 calculates the registration priority of each unregistered word (S5200). More specifically, the task relevance management unit 5210 takes the input voice of the user as an input, and indicates the degree of semantic relevance between the contents of the user input voice and each unregistered word. Is calculated for each unregistered word (S5210). The average recognition reliability management unit 5220 generates a temporary recognition result that is a result of voice recognition of the user's input voice using a temporary recognition dictionary in which an unregistered word is added to the recognition dictionary, and based on the temporary recognition result An average recognition reliability that is an average of the number of appearances of an index representing the correctness of the recognition result output of each unregistered word is calculated (S5220). Similar to the first embodiment, the registration priority calculation unit 320 calculates the registration priority of each unregistered word based on the task association degree and the average recognition reliability of each unregistered word.

登録管理部５３００は、未登録単語を所定の基準（例えば前述の登録優先度とその閾値）で選出して、選出された未登録単語を運用者に提示し、選出された未登録単語のうち除外する旨の運用者指示があった未登録単語を除外単語として出力し、除外単語以外の未登録単語を認識辞書に追加した拡張認識辞書を新たな認識辞書として認識辞書記憶部７３０に記憶された認識辞書を更新する（Ｓ５３００）。除外単語記憶部０６０は、除外単語を記憶する（Ｓ０６０）。未登録単語補正部０７０は、運用者の単語追加指示を契機として、未登録単語管理部５１００に新たに記憶された未登録単語のうち、除外単語に一致する単語を削除する（Ｓ０７０）。 The registration management unit 5300 selects an unregistered word based on a predetermined criterion (for example, the above-described registration priority and its threshold value), presents the selected unregistered word to the operator, and among the selected unregistered words An unregistered word for which an operator instruction to exclude is output as an excluded word, and an extended recognition dictionary in which unregistered words other than the excluded word are added to the recognition dictionary is stored in the recognition dictionary storage unit 730 as a new recognition dictionary. The recognition dictionary is updated (S5300). The excluded word storage unit 060 stores excluded words (S060). The unregistered word correction unit 070 deletes a word that matches the excluded word from unregistered words newly stored in the unregistered word management unit 5100, triggered by the operator's word addition instruction (S070).

このように、本変形例の音声認識辞書更新装置５によれば、実施例１の音声認識辞書更新装置１と同様に、運用者の不要な単語の選出作業を抑制することができる。 As described above, according to the speech recognition dictionary update device 5 of the present modification, it is possible to suppress unnecessary word selection work by the operator, as with the speech recognition dictionary update device 1 of the first embodiment.

［変形例２］
以下、図１３、図１４を参照して、本発明の要部に注目するために従来技術の構成要素の一部を集約して実現した、実施例２の変形例（変形例２）である音声認識辞書更新装置６について説明する。図１３は、本変形例の音声認識辞書更新装置６の構成を示すブロック図である。図１４は、本変形例の音声認識辞書更新装置６の動作を示すフローチャートである。 [Modification 2]
Hereinafter, with reference to FIG. 13 and FIG. 14, a modification of the second embodiment (modification 2) realized by consolidating some of the constituent elements of the prior art in order to focus on the main part of the present invention. The speech recognition dictionary update device 6 will be described. FIG. 13 is a block diagram showing the configuration of the speech recognition dictionary update device 6 of this modification. FIG. 14 is a flowchart showing the operation of the speech recognition dictionary update device 6 of this modification.

図１３に示すように本変形例の音声認識辞書更新装置６は、実施例２に含まれる構成要件を全て過不足なく含んでいる。ただし、本変形例の音声認識辞書更新装置６は、変形例１と同様に、実施例２に存在する構成要件を集約した新たな構成要件である、未登録単語管理部５１００、登録優先度管理部５２００、登録管理部５３００を含む。これらの構成は、変形例１と同様であるから説明を略す。本変形例の音声認識辞書更新装置６は、上述の新たな構成要件に加え、認識辞書記憶部７３０と、除外単語記憶部０６０と、不要単語推定除去部０８０を含む。 As shown in FIG. 13, the speech recognition dictionary updating apparatus 6 of this modification includes all the constituent elements included in the second embodiment without excess or deficiency. However, the speech recognition dictionary update device 6 according to the present modification, like the first modification, is a new configuration requirement in which the configuration requirements existing in the second embodiment are aggregated, the unregistered word management unit 5100, the registration priority management. Part 5200 and registration management part 5300. Since these configurations are the same as those of the first modification, description thereof is omitted. The speech recognition dictionary updating apparatus 6 according to the present modification includes a recognition dictionary storage unit 730, an excluded word storage unit 060, and an unnecessary word estimation removal unit 080 in addition to the above-described new components.

ステップＳ５１００、Ｓ５２００、Ｓ５３００、Ｓ０６０は変形例１と同様に実行される。不要単語推定除去部０８０は、運用者の単語追加指示を契機として、未登録単語管理部に新たに記憶された未登録単語のうち、除外単語に一致する単語、および除外単語に類似する単語を削除する（Ｓ０８０）。前述したように、例えば不要単語推定除去部０８０は、除外単語のうち記号のみで構成されている単語である記号除外単語の集合を除外記号集合として生成し、未登録単語管理部５１００に新たに記憶された未登録単語のうち、除外記号集合に含まれる記号のみで構成される未登録単語を削除する（Ｓ０８０）。 Steps S5100, S5200, S5300, and S060 are executed in the same manner as in the first modification. Unnecessary word estimation removal unit 080 takes a word that matches an excluded word and a word similar to the excluded word among unregistered words newly stored in the unregistered word management unit, triggered by the operator's word addition instruction. Delete (S080). As described above, for example, the unnecessary word estimation removal unit 080 generates a set of symbol excluded words, which are words composed only of symbols among the excluded words, as an excluded symbol set and newly adds them to the unregistered word management unit 5100. Of the stored unregistered words, unregistered words composed only of symbols included in the excluded symbol set are deleted (S080).

このように、本変形例の音声認識辞書更新装置６によれば、実施例２の音声認識辞書更新装置２と同様の効果を得ることが出来る。 Thus, according to the speech recognition dictionary update device 6 of this modification, the same effect as the speech recognition dictionary update device 2 of Example 2 can be obtained.

［変形例３］
以下、図１５、図１６を参照して、本発明の要部に注目するために従来技術の構成要素の一部を集約して実現した、実施例３の変形例（変形例３）である音声認識辞書更新装置７について説明する。図１５は、本変形例の音声認識辞書更新装置７の構成を示すブロック図である。図１６は、本変形例の音声認識辞書更新装置７の動作を示すフローチャートである。 [Modification 3]
Hereinafter, with reference to FIG. 15 and FIG. 16, it is the modification (modification 3) of Example 3 implement | achieved by integrating some components of a prior art in order to pay attention to the principal part of this invention. The speech recognition dictionary update device 7 will be described. FIG. 15 is a block diagram showing the configuration of the speech recognition dictionary update device 7 of this modification. FIG. 16 is a flowchart showing the operation of the speech recognition dictionary update device 7 of this modification.

図１５に示すように本変形例の音声認識辞書更新装置７は、実施例３に含まれる構成要件を全て過不足なく含んでいる。ただし、本変形例の音声認識辞書更新装置７は、変形例１、２と同様に、実施例３に存在する構成要件を集約した新たな構成要件である、未登録単語管理部５１００、登録優先度管理部５２００を含む。これらの構成は、変形例１、２と同様であるから説明を略す。また、本変形例の音声認識辞書更新装置７は、上述の構成要件に加え、認識辞書登録部３３０、認識辞書更新部０４０、拡張認識辞書記憶部９００、登録優先度保存部０９０、除外登録優先度記憶部０９１、登録優先度閾値算出部０９２を集約した登録管理部７３００を含む。さらに、本変形例の音声認識辞書更新装置５は、認識辞書記憶部７３０と、除外単語記憶部０６０と、不要単語推定除去部０８０を含む。 As shown in FIG. 15, the speech recognition dictionary updating apparatus 7 of this modification includes all the constituent elements included in the third embodiment without excess or deficiency. However, the speech recognition dictionary update device 7 of the present modified example is a new constituent requirement in which constituent features existing in the third embodiment are aggregated, as in the first and second modified examples, the unregistered word management unit 5100, and the registration priority. A degree management unit 5200 is included. Since these configurations are the same as those of the first and second modifications, description thereof is omitted. In addition to the above-described configuration requirements, the speech recognition dictionary update device 7 of the present modification includes a recognition dictionary registration unit 330, a recognition dictionary update unit 040, an extended recognition dictionary storage unit 900, a registration priority storage unit 090, and exclusion registration priority. A registration management unit 7300 that includes a degree storage unit 091 and a registration priority threshold value calculation unit 092. Furthermore, the speech recognition dictionary update device 5 of the present modification includes a recognition dictionary storage unit 730, an excluded word storage unit 060, and an unnecessary word estimation removal unit 080.

ステップＳ５１００、Ｓ５２００、Ｓ０６０、Ｓ０８０は変形例２と同様に実行される。登録管理部７３００は、出力した除外単語の登録優先度に基づいて登録優先度閾値を設定および更新し、登録優先度閾値を前記所定の基準として前記未登録単語を選出する（Ｓ７３００）。 Steps S5100, S5200, S060, and S080 are executed in the same manner as in the second modification. The registration management unit 7300 sets and updates a registration priority threshold based on the registration priority of the output excluded word, and selects the unregistered word using the registration priority threshold as the predetermined reference (S7300).

このように、本変形例の音声認識辞書更新装置７によれば、実施例３の音声認識辞書更新装置３と同様の効果を得ることが出来る。 Thus, according to the speech recognition dictionary update device 7 of this modification, the same effect as the speech recognition dictionary update device 3 of Example 3 can be obtained.

［変形例４］
以下、図１７、図１８を参照して、本発明の要部に注目するために従来技術の構成要素の一部を集約して実現した、実施例４の変形例（変形例４）である音声認識辞書更新装置８について説明する。図１７は、本変形例の音声認識辞書更新装置８の構成を示すブロック図である。図１８は、本変形例の音声認識辞書更新装置８の動作を示すフローチャートである。 [Modification 4]
Hereinafter, with reference to FIG. 17 and FIG. 18, a modified example (modified example 4) of the fourth embodiment realized by consolidating some of the constituent elements of the prior art in order to pay attention to the main part of the present invention. The speech recognition dictionary update device 8 will be described. FIG. 17 is a block diagram showing a configuration of the speech recognition dictionary update device 8 of the present modification. FIG. 18 is a flowchart showing the operation of the speech recognition dictionary update device 8 of the present modification.

図１７に示すように本変形例の音声認識辞書更新装置８は、実施例４に含まれる構成要件を全て過不足なく含んでいる。本変形例の音声認識辞書更新装置８は、変形例１の音声認識辞書更新装置５の構成とほとんど同じであるが、変形例１における未登録単語補正部０７０が、本変形例において類似未登録単語除外部０９５に変更されている点のみ異なる。 As shown in FIG. 17, the speech recognition dictionary update device 8 of the present modification includes all the constituent requirements included in the fourth embodiment without excess or deficiency. The voice recognition dictionary update device 8 of the present modification is almost the same as the configuration of the voice recognition dictionary update device 5 of the first modification, but the unregistered word correction unit 070 in the first modification is similar unregistered in this modification. The only difference is that the word exclusion unit 095 is changed.

ステップＳ５１００、Ｓ５２００、Ｓ５３００、Ｓ０６０は変形例１と同様に実行される。次に、類似未登録単語除外部０９５は、運用者の単語追加指示を契機として、未登録単語管理部に新たに記憶された未登録単語を構成する文字列のうち除外単語と異なっている部分の程度を表す値である正規化編集距離を算出し、正規化編集距離に基づいて、新たに記憶された未登録単語を削除する（Ｓ０９５）。 Steps S5100, S5200, S5300, and S060 are executed in the same manner as in the first modification. Next, the similar unregistered word exclusion unit 095 is a part that is different from the excluded word in the character string that forms the unregistered word newly stored in the unregistered word management unit, triggered by the operator's word addition instruction. The normalized edit distance, which is a value representing the degree of, is calculated, and the newly stored unregistered word is deleted based on the normalized edit distance (S095).

このように、本変形例の音声認識辞書更新装置８によれば、実施例４の音声認識辞書更新装置４と同様の効果を得ることが出来る。 Thus, according to the speech recognition dictionary update device 8 of this modification, the same effect as the speech recognition dictionary update device 4 of Example 4 can be obtained.

上述の各種の処理は、記載に従って時系列に実行されるのみならず、処理を実行する装置の処理能力あるいは必要に応じて並列的にあるいは個別に実行されてもよい。その他、本発明の趣旨を逸脱しない範囲で適宜変更が可能であることはいうまでもない。 The various processes described above are not only executed in time series according to the description, but may also be executed in parallel or individually as required by the processing capability of the apparatus that executes the processes. Needless to say, other modifications are possible without departing from the spirit of the present invention.

また、上述の構成をコンピュータによって実現する場合、各装置が有すべき機能の処理内容はプログラムによって記述される。そして、このプログラムをコンピュータで実行することにより、上記処理機能がコンピュータ上で実現される。 Further, when the above-described configuration is realized by a computer, processing contents of functions that each device should have are described by a program. The processing functions are realized on the computer by executing the program on the computer.

この処理内容を記述したプログラムは、コンピュータで読み取り可能な記録媒体に記録しておくことができる。コンピュータで読み取り可能な記録媒体としては、例えば、磁気記録装置、光ディスク、光磁気記録媒体、半導体メモリ等どのようなものでもよい。 The program describing the processing contents can be recorded on a computer-readable recording medium. As the computer-readable recording medium, for example, any recording medium such as a magnetic recording device, an optical disk, a magneto-optical recording medium, and a semiconductor memory may be used.

また、このプログラムの流通は、例えば、そのプログラムを記録したＤＶＤ、ＣＤ−ＲＯＭ等の可搬型記録媒体を販売、譲渡、貸与等することによって行う。さらに、このプログラムをサーバコンピュータの記憶装置に格納しておき、ネットワークを介して、サーバコンピュータから他のコンピュータにそのプログラムを転送することにより、このプログラムを流通させる構成としてもよい。 The program is distributed by selling, transferring, or lending a portable recording medium such as a DVD or CD-ROM in which the program is recorded. Furthermore, the program may be distributed by storing the program in a storage device of the server computer and transferring the program from the server computer to another computer via a network.

このようなプログラムを実行するコンピュータは、例えば、まず、可搬型記録媒体に記録されたプログラムもしくはサーバコンピュータから転送されたプログラムを、一旦、自己の記憶装置に格納する。そして、処理の実行時、このコンピュータは、自己の記録媒体に格納されたプログラムを読み取り、読み取ったプログラムに従った処理を実行する。また、このプログラムの別の実行形態として、コンピュータが可搬型記録媒体から直接プログラムを読み取り、そのプログラムに従った処理を実行することとしてもよく、さらに、このコンピュータにサーバコンピュータからプログラムが転送されるたびに、逐次、受け取ったプログラムに従った処理を実行することとしてもよい。また、サーバコンピュータから、このコンピュータへのプログラムの転送は行わず、その実行指示と結果取得のみによって処理機能を実現する、いわゆるＡＳＰ（Application Service Provider）型のサービスによって、上述の処理を実行する構成としてもよい。なお、本形態におけるプログラムには、電子計算機による処理の用に供する情報であってプログラムに準ずるもの（コンピュータに対する直接の指令ではないがコンピュータの処理を規定する性質を有するデータ等）を含むものとする。 A computer that executes such a program first stores, for example, a program recorded on a portable recording medium or a program transferred from a server computer in its own storage device. When executing the process, the computer reads a program stored in its own recording medium and executes a process according to the read program. As another execution form of the program, the computer may directly read the program from a portable recording medium and execute processing according to the program, and the program is transferred from the server computer to the computer. Each time, the processing according to the received program may be executed sequentially. Also, the program is not transferred from the server computer to the computer, and the above-described processing is executed by a so-called ASP (Application Service Provider) type service that realizes the processing function only by the execution instruction and result acquisition. It is good. Note that the program in this embodiment includes information that is used for processing by an electronic computer and that conforms to the program (data that is not a direct command to the computer but has a property that defines the processing of the computer).

また、この形態では、コンピュータ上で所定のプログラムを実行させることにより、本装置を構成することとしたが、これらの処理内容の少なくとも一部をハードウェア的に実現することとしてもよい。 In this embodiment, the present apparatus is configured by executing a predetermined program on a computer. However, at least a part of these processing contents may be realized by hardware.

Claims

A recognition dictionary storage unit for storing the recognition dictionary;
An unregistered word management unit that extracts and stores an unregistered word that is a word that appears in the related document and is not registered in the recognition dictionary, with an input of the related document and triggered by an operator's word addition instruction; ,
The unregistered word is selected based on a predetermined criterion, the selected unregistered word is presented to the operator, and an operator instruction is given to exclude the selected unregistered word. A registration management unit that updates the recognition dictionary stored in the recognition dictionary storage unit as a new recognition dictionary with an extended recognition dictionary added to the recognition dictionary as an unregistered word other than the excluded word,
An excluded word storage unit for storing the excluded word;
Triggered by the operator's word addition instruction, among unregistered words newly stored in the unregistered word management unit, an unregistered word correction unit that deletes a word that matches the excluded word;
A speech recognition dictionary update device including:

A recognition dictionary storage unit for storing the recognition dictionary;
An unregistered word management unit that extracts and stores an unregistered word that is a word that appears in the related document and is not registered in the recognition dictionary, with an input of the related document and triggered by an operator's word addition instruction; ,
The unregistered word is selected based on a predetermined criterion, the selected unregistered word is presented to the operator, and an operator instruction is given to exclude the selected unregistered word. A registration management unit that updates the recognition dictionary stored in the recognition dictionary storage unit as a new recognition dictionary with an extended recognition dictionary added to the recognition dictionary as an unregistered word other than the excluded word,
An excluded word storage unit for storing the excluded word;
An unnecessary word that deletes a word that matches the excluded word and a word that is similar to the excluded word among unregistered words newly stored in the unregistered word management unit, triggered by the operator's word addition instruction An estimation removal unit;
A speech recognition dictionary update device including:

The speech recognition dictionary update device according to claim 2,
The unnecessary word estimation removing unit
A set of symbol excluded words that are words composed only of symbols among the excluded words is generated as an excluded symbol set, and the excluded symbol set among the unregistered words newly stored in the unregistered word management unit The speech recognition dictionary update apparatus which deletes the unregistered word comprised only by the symbol contained in.

A recognition dictionary storage unit for storing the recognition dictionary;
An unregistered word management unit that extracts and stores an unregistered word that is a word that appears in the related document and is not registered in the recognition dictionary, with an input of the related document and triggered by an operator's word addition instruction; ,
The unregistered word is selected based on a predetermined criterion, the selected unregistered word is presented to the operator, and an operator instruction is given to exclude the selected unregistered word. A registration management unit that updates the recognition dictionary stored in the recognition dictionary storage unit as a new recognition dictionary with an extended recognition dictionary added to the recognition dictionary as an unregistered word other than the excluded word,
An excluded word storage unit for storing the excluded word;
Normalization that is a value representing the degree of a portion that is different from the excluded word in the character string constituting the unregistered word newly stored in the unregistered word management unit, triggered by the operator's word addition instruction A similar unregistered word excluding unit that calculates an edit distance and deletes the newly stored unregistered word based on the normalized edit distance;
A speech recognition dictionary update device including:

The speech recognition dictionary update device according to any one of claims 1 to 4,
The average recognition reliability is assumed to be the average number of appearances of an index representing the correctness of the recognition result output of each unregistered word,
The task relevance level represents the height of the semantic relevance between the content of the user's input speech and each of the unregistered words,
The registration priority is an index based on the task relevance and the average recognition reliability,
The registration management unit
A speech recognition dictionary updating apparatus that sets and updates a registration priority threshold based on the registration priority of the output excluded word, and selects the unregistered word using the registration priority threshold as the predetermined reference.

A speech recognition dictionary update method executed by a speech recognition dictionary update device including a recognition dictionary storage unit that stores a recognition dictionary and an excluded word storage unit that stores an excluded word,
An unregistered word management step for extracting and storing an unregistered word that appears in the related document and is not registered in the recognition dictionary, with an input of the related document and triggered by an operator's word addition instruction; ,
The unregistered word is selected based on a predetermined criterion, the selected unregistered word is presented to the operator, and an operator instruction is given to exclude the selected unregistered word. A registration management step of updating the recognition dictionary stored in the recognition dictionary storage unit as a new recognition dictionary using an extended recognition dictionary in which unregistered words other than the exclusion word are added to the recognition dictionary as a new recognition dictionary; ,
Triggered by the operator's word addition instruction, an unregistered word correction step of deleting a word that matches the excluded word among the unregistered words newly stored in the unregistered word management step;
A speech recognition dictionary update method including:

A speech recognition dictionary update method executed by a speech recognition dictionary update device including a recognition dictionary storage unit that stores a recognition dictionary and an excluded word storage unit that stores an excluded word,
An unregistered word management step for extracting and storing an unregistered word that appears in the related document and is not registered in the recognition dictionary, with an input of the related document and triggered by an operator's word addition instruction; ,
The unregistered word is selected based on a predetermined criterion, the selected unregistered word is presented to the operator, and an operator instruction is given to exclude the selected unregistered word. A registration management step of updating the recognition dictionary stored in the recognition dictionary storage unit as a new recognition dictionary using an extended recognition dictionary in which unregistered words other than the exclusion word are added to the recognition dictionary as a new recognition dictionary; ,
An unnecessary word that deletes a word that matches the excluded word and a word that is similar to the excluded word among unregistered words newly stored in the unregistered word management step, triggered by the operator's word addition instruction An estimated removal step;
A speech recognition dictionary update method including:

A program for causing a computer to function as the speech recognition dictionary updating apparatus according to any one of claims 1 to 5.