JP2016102866A

JP2016102866A - False recognition correction device and program

Info

Publication number: JP2016102866A
Application number: JP2014240350A
Authority: JP
Inventors: 富久白石; Tomihisa Shiraishi; 敦洋岩崎; Atsuhiro Iwasaki
Original assignee: I Business Center Corp
Current assignee: I Business Center Corp
Priority date: 2014-11-27
Filing date: 2014-11-27
Publication date: 2016-06-02

Abstract

PROBLEM TO BE SOLVED: To provide a false recognition correction device and program which automatically correct false recognition even when the false recognition has occurred to a result of voice recognition processing.SOLUTION: With a syllable character string obtained by voice recognition as a processing object, among a plurality of key words registered with a data pool, a key word having a syllable character string matching a syllable character string part included in a voice recognition syllable character string or a syllable character string similar to the same is searched for, and by a found key word, a corresponding syllable character string part included in the voice recognition syllable character string is replaced. Key word search is performed by creating a search pattern including a regular expression with respect to each of key words registered with the data pool and searching the voice recognition syllable character strings for a syllable character string part matching the search pattern including the regular expression.SELECTED DRAWING: Figure 13

Description

この発明は誤認識修正装置およびプログラムに関する。 The present invention relates to an erroneous recognition correction apparatus and program.

音声データに含まれる言葉を音声認識し，それを文字列に変換する処理が知られている。音声データは辞書データにあらかじめ記憶されている単語に置き換えられることで文字列に変換される。変換後の誤りを少なくする（認識精度を高くする）ために，発音（音データ）とその発音に対応する文字列の組合せを記憶させておくことなどが行われている（たとえば，特許文献１）。 Processing for recognizing words contained in speech data and converting them into character strings is known. The voice data is converted into a character string by being replaced with words stored in the dictionary data in advance. In order to reduce errors after conversion (increase recognition accuracy), a combination of pronunciation (sound data) and a character string corresponding to the pronunciation is stored (for example, Patent Document 1). ).

特許第５２０７６４２号公報Japanese Patent No. 5207642

しかしながら，音声データに雑音が混じると，それだけで変換後の文字列には誤りが生じやすくなる。発音が悪い場合も誤認識は生じやすい。音声データを文字列に変換する音声認識処理では誤認識が全く生じないことはなく，誤認識の発生は避けることができない。 However, if noise is mixed in voice data, an error is likely to occur in the converted character string. Misrecognition is likely to occur even when pronunciation is bad. In the speech recognition processing for converting speech data into a character string, no erroneous recognition occurs at all, and the occurrence of erroneous recognition cannot be avoided.

この発明は，音声認識処理した結果に誤認識が発生していても，それを自動的に修正できるようにすることを目的とする。 An object of the present invention is to make it possible to automatically correct a misrecognition occurring as a result of a speech recognition process.

この発明による誤認識修正装置は，与えられる音声を音節文字列に変換する音声認識処理手段，複数の単語およびその音節文字列を対応づけて記憶した単語プール，上記単語プールに記憶されている複数の単語のうち，上記音声認識音節文字列に含まれる音節文字列部分に合致する音節文字列またはこれに類似する音節文字列を有する単語を検索する単語検索手段，ならびに単語検索手段によって見つかった単語によって，上記音声認識音節文字列に含まれる，対応する音節文字列部分を置換する単語置換手段を備え，上記単語検索手段が，上記単語プールに記憶されている単語のそれぞれについて，正規表現を含む検索パターンを作成する第１の検索パターン作成手段，および上記第１の検索パターン作成手段によって作成された検索パターンに合致する音節文字列部分を，上記音声認識音節文字列の中から検索する手段を含むことを特徴とするものである。「音節」とは日本語の場合はかな文字の一文字に相当し，「音節文字列」はかな文字の組合せ（羅列）に相当する。音節文字列は「読み」と表現することもできる。 The erroneous recognition correcting apparatus according to the present invention includes a speech recognition processing means for converting a given speech into a syllable character string, a plurality of words and a word pool storing the syllable character strings in association with each other, and a plurality of words stored in the word pool. Word search means for searching for a word having a syllable character string matching or similar to the syllable character string part included in the speech recognition syllable character string, and a word found by the word search means The word search means includes a regular expression for each of the words stored in the word pool, with word replacement means for replacing the corresponding syllable character string portion included in the speech recognition syllable character string. First search pattern creation means for creating a search pattern, and a search pattern created by the first search pattern creation means Syllable string portion that matches the emission, is characterized in that it includes means for retrieving from the above speech recognition syllable strings. “Syllable” corresponds to a single kana character in Japanese, and “syllable character string” corresponds to a combination of kana characters. The syllable string can also be expressed as “reading”.

この発明はコンピュータを誤認識修正装置として機能させるためのプログラムも提供する。このプログラムは，与えられる音声を認識して音節文字列に変換し，あらかじめ用意される，複数の単語およびその音節文字列を対応づけて記憶した単語プールに記憶されている単語のうち，上記音声認識音節文字列に含まれる音節文字列部分に合致する音節文字列またはこれに類似する音節文字列を有する単語を検索し，見つかった単語によって，上記音声認識音節文字列に含まれる，対応する音節文字列部分を置換し，ここで上記単語の検索が，上記単語プールに記憶されている単語のそれぞれについて，正規表現を含む検索パターンを作成し，作成した検索パターンに合致する音節文字列部分を，上記音声認識音節文字列の中から検索することを特徴とする。 The present invention also provides a program for causing a computer to function as a misrecognition correcting device. This program recognizes the given speech, converts it into a syllable string, and prepares the above-mentioned speech from among a plurality of words stored in a word pool in which a plurality of words and their syllable strings are stored in association with each other. A word having a syllable character string matching the syllable character string part included in the recognized syllable character string or a syllable character string similar thereto is searched, and the corresponding syllable included in the speech recognition syllable character string by the found word. Replace the character string part, where the search for the word creates a search pattern including a regular expression for each word stored in the word pool, and the syllable character string part that matches the created search pattern. , And searching from the speech recognition syllable character string.

与えられる音声が音声認識処理手段によって音節文字列に変換される。変換された音節文字列（音声認識音節文字列）を対象にして，誤認識修正装置による修正処理は行われる。 The given voice is converted into a syllable character string by the voice recognition processing means. Correction processing by the erroneous recognition correction device is performed on the converted syllable character string (speech recognition syllable character string).

あらかじめ用意される単語プールに記憶される単語は，漢字表記，漢字と仮名の両方を含む表記，仮名表記等によって記憶される。単語プールには単語の音節文字列（読み）も併せて記憶されており，いわば辞書データである。 Words stored in a word pool prepared in advance are stored in kanji notation, notation including both kanji and kana, kana notation, and the like. The word pool also stores a syllable character string (reading) of the word, which is so-called dictionary data.

この発明によると，単語プールに記憶されている単語のうち，音声認識音節文字列に含まれる音節文字列部分に合致する音節文字列またはこれに類似する音節文字列を有する単語が検索され，検索によって見つかった単語によって，対応する音節文字列部分が置換される。ここで単語検索処理では，単語プールに記憶されている複数の単語を検索対象とするのではなく，逆に上記音声認識音節文字列が検索対象とされる。すなわち，上記単語プールに記憶されている単語のそれぞれについて正規表現を含む検索パターンを作成し，作成された検索パターンに合致する音節文字列部分が上記音声認識音節文字列に含まれているかどうかを判断することで，単語検索は行われる。検索パターンが正規表現を含むので，音声認識音節文字列中に日本語として正しくない音節文字列部分（たとえば辞書に掲載されていない音節文字列部分）が含まれているとしても，その音節文字列部分を，日本語として正しい単語（辞書に掲載されている単語）（単語プールに記憶されている単語）に置換することができる。もちろん，日本語として正しい音節文字列部分は単語プールに記憶されている可能性が高いから正しい単語に置換される。このように，音声認識処理手段によって誤認識が発生し，音声認識音節文字列が日本語として意味をなさない文字列部分を含むとしても，それを日本語として意味をなす文字列部分に修正することができる。 According to the present invention, among words stored in the word pool, a word having a syllable character string that matches a syllable character string part included in the speech recognition syllable character string or a syllable character string similar thereto is searched and searched. The corresponding syllable string part is replaced by the word found by. Here, in the word search process, instead of searching for a plurality of words stored in the word pool, the speech recognition syllable character string is set as a search target. That is, a search pattern including a regular expression is created for each of the words stored in the word pool, and whether or not a syllable character string portion that matches the created search pattern is included in the speech recognition syllable character string. By determining, word search is performed. Since the search pattern includes a regular expression, even if the speech recognition syllable string contains a syllable string part that is not correct as Japanese (for example, a syllable string part not listed in the dictionary), the syllable string The portion can be replaced with a correct Japanese word (words listed in the dictionary) (words stored in the word pool). Of course, a syllable character string portion that is correct as Japanese is likely to be stored in the word pool, and is thus replaced with a correct word. In this way, even if a recognition error occurs by the speech recognition processing means and the speech recognition syllable character string includes a character string portion that does not make sense as Japanese, it is corrected to a character string portion that makes sense as Japanese. be able to.

好ましい実施態様では，誤認識修正装置は，上記音声認識音節文字列をかな漢字変換するかな漢字変換手段，およびかな漢字変換された文字列の中に上記単語プールに記憶されている単語が含まれるかどうかを判断する手段を備え，かな漢字変換された文字列の中に上記単語プールに記憶されている単語が含まれていることが判断された場合に，その単語を除いた音声認識音節文字列部分が上記単語検索手段および上記単語置換手段によって処理される。日本語として正しい文字列が生成されやすくなる。 In a preferred embodiment, the misrecognition correcting apparatus performs kana-kanji conversion means for converting the speech recognition syllable character string into kana-kanji and whether or not the word stored in the word pool is included in the kana-kanji converted character string. A means for determining, and when it is determined that a word stored in the word pool is included in the character string converted to Kana-Kanji, the speech recognition syllable character string portion excluding the word is Processed by the word search means and the word replacement means. It becomes easy to generate a correct string for Japanese.

一実施態様では，上記検索パターンは，上記単語の音節文字列のうちのいずれか１文字が欠けた音節文字列を検索可能な検索パターンである。音声認識処理においてある単語の正しい音節文字列（読み）のうちの一音節が欠けた状態で音声認識されたとしても，それを元の正しい単語に修正する（置換する）ことができる。 In one embodiment, the search pattern is a search pattern capable of searching for a syllable character string lacking any one of the syllable character strings of the word. Even if speech recognition is performed in a state where one syllable of a correct syllable character string (reading) of a word is missing in the speech recognition processing, it can be corrected (replaced) to the original correct word.

他の実施態様では，上記検索パターンは，上記単語の音節文字列のうちのいずれか１文字が他の文字で置き換えられた文字列を検索可能な検索パターンである。音声認識処理においてある単語の正しい音節文字列（読み）のうちの一音節が誤った音節によって認識されたとしても，それを元の単語に修正する（置換する）ことができる。 In another embodiment, the search pattern is a search pattern that can search for a character string in which any one of the syllable character strings of the word is replaced with another character. Even if one syllable of a correct syllable character string (reading) of a word is recognized by an incorrect syllable in the speech recognition process, it can be corrected (replaced) to the original word.

さらに他の実施態様では，上記検索パターンは，上記単語の音節文字列に含まれる文字の間に別の文字が挿入された文字列を検索可能な検索パターンである。音声認識処理においてある単語の正しい音節文字列の並びの中に誤った音節が加わって認識されたとしても，それを元の単語に修正する（置換する）ことができる。 In still another embodiment, the search pattern is a search pattern capable of searching for a character string in which another character is inserted between characters included in the syllable character string of the word. Even if a wrong syllable is added to the correct syllable string sequence of a word and recognized in the speech recognition process, it can be corrected (replaced) to the original word.

好ましくは，誤認識自動修正装置は，上記単語プールに記憶されている複数の単語のそれぞれについての共起単語およびその音節文字列を対応づけて記憶した共起単語プール，上記単語検索手段によって見つかった単語をターゲットキーワードとして選択するキーワード選択手段，上記共起単語プールに記憶されている共起単語の中から，上記ターゲットキーワードについての共起単語を特定する共起単語特定手段，共起単語特定手段によって特定された共起単語の中から，上記音声認識音節文字列から上記ターゲットキーワードを除いた音声認識音節文字列部分に類似する音節文字列を有する共起単語を検索する共起単語検索手段，ならびに上記共起単語検索手段によって見つかった共起単語によって，上記音声認識音節文字列に含まれる，対応する音節文字列部分を置換する共起単語置換手段を備えている。単語検索手段によって見つけることができなかった誤認識された文字列部分が音声認識音節文字列に未だ存在する場合に，その文字列部分を共起単語によって置き換えることができる。共起単語は，上記単語検索手段によって見つかった単語（ターゲットキーワード）と一緒に用いられることが多い単語である。単語検索手段によって見つけることができなかった誤認識された文字列部分を，日本語として意味をなす文字列に修正する（置換する）ことができる。 Preferably, the automatic misrecognition correcting device finds the co-occurrence word pool for each of a plurality of words stored in the word pool and the co-occurrence word pool stored in association with the syllable character string, and the word search means. Keyword selection means for selecting a target word as a target keyword, co-occurrence word specifying means for specifying a co-occurrence word for the target keyword from co-occurrence words stored in the co-occurrence word pool, co-occurrence word specification Co-occurrence word search means for searching for a co-occurrence word having a syllable character string similar to the speech recognition syllable character string portion obtained by removing the target keyword from the voice recognition syllable character string from the co-occurrence words specified by the means , And the co-occurrence words found by the co-occurrence word search means, are included in the speech recognition syllable string. And a co-occurrence word replacing means for replacing the corresponding syllable string portion. If a misrecognized character string portion that could not be found by the word search means still exists in the speech recognition syllable character string, the character string portion can be replaced with a co-occurrence word. A co-occurrence word is a word that is often used together with a word (target keyword) found by the word search means. The misrecognized character string portion that could not be found by the word search means can be corrected (replaced) with a character string that makes sense as Japanese.

好ましくは，上記共起単語検索手段は，上述した単語検索手段と同様に，上記共起単語プールに記憶されている共起単語のそれぞれについて，正規表現を含む検索パターンを作成する第２の検索パターン作成手段，および上記第２の検索パターン作成手段によって作成された検索パターンに合致する音節文字列部分を，上記ターゲットキーワードを除いた音声認識音節文字列部分の中から検索する手段を含む。 Preferably, the co-occurrence word search unit generates a search pattern including a regular expression for each of the co-occurrence words stored in the co-occurrence word pool, similar to the word search unit described above. Pattern generation means, and means for searching for a syllable character string portion matching the search pattern generated by the second search pattern generation means from the speech recognition syllable character string portion excluding the target keyword.

一実施態様では，誤認識修正装置は，特定分野において頻繁に用いられる単語のみをそれぞれ記憶した複数の分野別単語プールを備え，上記単語検索手段による単語検索処理および上記単語置換手段による音節文字列部分の置換処理が，複数の分野別単語プールのそれぞれを用いて実行される。日本語として正しい処理結果が生成されやすくなる。好ましくは，上記共起単語プールについても，上記複数の分野別単語プールのそれぞれに対応する複数の分野別共起単語プールが用意され，上記共起単語特定手段による共起単語特定処理，上記共起単語検索手段による共起単語検索処理および上記共起単語置換手段により音節文字列部分の置換処理が，複数の分野別共起単語プールのそれぞれを用いて実行される。複数の分野別単語プールおよび複数の分野別共起単語プールのそれぞれを用いることで，分野ごとの処理結果（誤認識修正結果）が生成されることになる。生成される複数の処理結果のすべてをユーザに提示するようにしてもよいし，複数の処理結果のうち，より確からしい処理結果を所定のルールにしたがって自動的に選択してそれをユーザに提示してもよい。 In one embodiment, the misrecognition correcting apparatus includes a plurality of field-specific word pools each storing only words frequently used in a specific field, and includes word search processing by the word search means and syllable character strings by the word replacement means. The partial replacement process is executed using each of the plurality of field-specific word pools. It is easy to generate correct processing results for Japanese. Preferably, also for the co-occurrence word pool, a plurality of field co-occurrence word pools corresponding to each of the plurality of field word pools are prepared, and the co-occurrence word specifying process by the co-occurrence word specifying means, The co-occurrence word search process by the word occurrence search means and the syllable character string portion replacement process by the co-occurrence word replacement means are executed using each of the plurality of field-specific co-occurrence word pools. By using each of a plurality of field-specific word pools and a plurality of field-specific co-occurrence word pools, processing results (misrecognition correction results) for each field are generated. All of the generated multiple processing results may be presented to the user, or a more probable processing result among the multiple processing results is automatically selected according to a predetermined rule and presented to the user. May be.

誤認識自動修正システムを構成するスマートフォンの外観を示す。The external appearance of the smart phone which comprises a misrecognition automatic correction system is shown. 誤認識自動修正システムを構成するサーバ・コンピュータの電気的構成を示すブロック図である。It is a block diagram which shows the electric constitution of the server computer which comprises a misrecognition automatic correction system. 記憶装置に記憶されるデータプールを示す。3 shows a data pool stored in a storage device. ターゲットキーワード・データプールの一例を示す。An example of a target keyword data pool is shown. 共起単語データプールの一例を示す。An example of a co-occurrence word data pool is shown. 誤認識修正処理を示すフローチャートである。It is a flowchart which shows a misrecognition correction process. 誤認識修正処理の一部（かな漢字データ利用処理）を詳細に示すフローチャートである。It is a flowchart which shows a part of misrecognition correction process (kana-kanji data utilization process) in detail. 誤認識修正処理の一部（音節文字列利用処理）を詳細に示すフローチャートである。It is a flowchart which shows a part of misrecognition correction process (syllabic character string utilization process) in detail. 誤認識修正処理の一部（音節文字列利用処理のうちの検索パターン作成処理）をさらに詳細に示すフローチャートである。It is a flowchart which shows a part in detail (the search pattern creation process of the syllable character string utilization processes) of the misrecognition correction process. 誤認識修正処理の一部（音節文字列利用処理のうちの検索パターン作成処理）をさらに詳細に示すフローチャートである。It is a flowchart which shows a part in detail (the search pattern creation process of the syllable character string utilization processes) of the misrecognition correction process. 誤認識修正処理の一部（音節文字列利用処理のうちの検索パターンを用いた文字列検索処理）をさらに詳細に示すフローチャートである。It is a flowchart which shows a part in detail (character string search process using the search pattern of syllable character string utilization processes) of a misrecognition correction process in more detail. （Ａ）は誤認識音節文字列をかな漢字変換した結果を，（Ｂ）はかな漢字変換された文字列を形態素解析した結果を，（Ｃ）はかな漢字変換文字列の一部の文字列がマーキングされている様子を，それぞれ示す。(A) shows the result of Kana-Kanji conversion of a misrecognized syllable character string, (B) shows the result of morphological analysis of the Kana-Kanji converted character string, and (C) shows a part of the Kana-Kanji conversion character string. Each is shown. 検索パターンの作成の様子を示す。This shows how the search pattern is created. 作成された検索パターンを示す。Indicates the created search pattern. ターゲットキーワード・データプールに登録されたターゲットキーワードのそれぞれについて，検索パターンが作成されている様子を示す。A search pattern is created for each target keyword registered in the target keyword / data pool. （Ａ）〜（Ｅ）は検索パターンを用いた文字列検索処理の様子を示す。(A)-(E) show the state of the character string search process using a search pattern. （Ａ）〜（Ｄ）は検索パターンを用いた文字列検索処理の様子を示す。(A)-(D) show the state of the character string search process using a search pattern. 誤認識修正処理の一部（共起単語利用処理）を詳細に示すフローチャートである。It is a flowchart which shows a part of misrecognition correction process (co-occurrence word utilization process) in detail. 共起単語のそれぞれについて，検索パターンが作成されている様子を示す。A search pattern is created for each co-occurrence word. カテゴリ決定処理の流れを示す。The flow of a category determination process is shown. 変形例の誤認識修正処理を示すフローチャートである。It is a flowchart which shows the misrecognition correction process of a modification.

１．誤認識自動修正システムのハードウエア構成
図１は誤認識自動修正システムにおいてクライアントとして用いられるスマートフォンの外観を示している。図２は誤認識自動修正システムにおいて上記クライアントと通信するサーバ・コンピュータ（以下，サーバという）の電気的構成を示すブロック図である。 1. Hardware Configuration of Misrecognition Automatic Correction System FIG. 1 shows an external appearance of a smartphone used as a client in a misrecognition automatic correction system. FIG. 2 is a block diagram showing an electrical configuration of a server computer (hereinafter referred to as a server) that communicates with the client in the automatic recognition correction system.

誤認識自動修正システムはクライアント−サーバ・モデルによって構成される。図１を参照して，クライアント（スマートフォン）１は，入力装置および表示装置として機能するタッチパネル２，音声を電気信号に変換するマイクロフォン３，サーバ等との間でデータを送受信する通信装置（図示略），クライアント１の動作を統括的に制御する処理装置（図示略），プログラムおよびデータを記憶する記憶装置（図示略）等を備える。タッチパネル２に表示されるマイクロフォンの形をしたアイコン４ａがクリックされると，クライアント１において音声入力を受付けるためのプログラムが起動し，音声入力ウインドウ４がタッチパネル２上に表示される。マイクロフォン３によって受信された音声データがクライアント１に入力される。 The erroneous recognition automatic correction system is constituted by a client-server model. Referring to FIG. 1, a client (smart phone) 1 includes a touch panel 2 that functions as an input device and a display device 2, a microphone 3 that converts voice into an electrical signal, a communication device (not shown) that transmits and receives data to and from a server, and the like. ), A processing device (not shown) for comprehensively controlling the operation of the client 1, a storage device (not shown) for storing programs and data, and the like. When the microphone-shaped icon 4 a displayed on the touch panel 2 is clicked, a program for accepting voice input is started in the client 1 and the voice input window 4 is displayed on the touch panel 2. Audio data received by the microphone 3 is input to the client 1.

入力された音声データは，クライアント１からネットワーク（インターネットなど）を通じてサーバ10に送信され，サーバ10の通信装置９によって受信される。図２を参照して，サーバ10は，サーバ10の動作を統括的に制御する中央演算処理装置（ＣＰＵ）５，入力装置および表示装置として機能するタッチパネル６，データを一時的に記憶するメモリ７，後述する誤認識自動修正処理を実行するためのプログラム，後述するデータプール等を記憶する記憶装置８，およびクライアント１との間でデータ通信を行うための通信装置９を備えている。後述するように，サーバ10によって誤認識自動修正処理が行われる。クライアント１から送信された音声データに基づいて認識される音節文字列（以下，読みデータという）が文法的または言語的に誤りを含むものであったとしてもそれが修正され（正され），修正された文字列データがクライアント１に送信される。 The input voice data is transmitted from the client 1 to the server 10 through a network (such as the Internet) and received by the communication device 9 of the server 10. Referring to FIG. 2, a server 10 includes a central processing unit (CPU) 5 that controls the operation of the server 10 in an integrated manner, a touch panel that functions as an input device and a display device 6, and a memory 7 that temporarily stores data. , A program for executing an erroneous recognition automatic correction process to be described later, a storage device 8 for storing a data pool and the like to be described later, and a communication device 9 for performing data communication with the client 1. As will be described later, the server 10 performs automatic recognition error correction processing. Even if the syllable character string (hereinafter referred to as reading data) recognized based on the voice data transmitted from the client 1 includes a grammatical or linguistic error, it is corrected (corrected) and corrected. The character string data thus transmitted is transmitted to the client 1.

図３から図５は，サーバ10において実行される誤認識自動修正処理に用いられるデータ（データプール）を示している。図３から図５に示すデータは，サーバ10が備える記憶装置８にあらかじめ記憶される。 3 to 5 show data (data pool) used for the erroneous recognition automatic correction process executed in the server 10. The data shown in FIGS. 3 to 5 are stored in advance in the storage device 8 provided in the server 10.

図３を参照して，記憶装置８には，カテゴリごとに，ターゲットキーワード・データプール11，13，15と，共起単語データプール12，14，16の組が記憶されている。一例を説明する。図４はグルメに関するカテゴリ１のターゲットキーワード・データプール11のデータ内容を，図５はグルメに関するカテゴリ１の共起単語データプール12のデータ内容を，それぞれ示している。 Referring to FIG. 3, the storage device 8 stores a set of target keyword data pools 11, 13, 15 and co-occurrence word data pools 12, 14, 16 for each category. An example will be described. 4 shows the data contents of the target keyword / data pool 11 of category 1 related to gourmet, and FIG. 5 shows the data contents of the co-occurrence word data pool 12 of category 1 related to gourmet, respectively.

図４を参照して，ターゲットキーワード・データプール11にはグルメに関連するターゲットキーワード（単語）が多数記憶（登録）されている。ターゲットキーワードのそれぞれは，識別符号（番号），原型，読みデータ（音節文字列）および品詞のデータを含む。「原型」は，漢字によってまたは漢字を含んで表記されることが一般的であれば漢字を使用して，カタカナによってまたはカタカナを含んで表記されることが一般的であればカタカナを使用して，グルメに関する単語を表す。「読みデータ」は音節文字列を表すデータであり，この明細書では読みデータ（音節文字列）をカタカナ表記する。「品詞」には，原型が表す単語に応じて，名詞，形容詞，形容動詞，動詞または副詞のいずれかが記憶される。 Referring to FIG. 4, a large number of target keywords (words) related to gourmet are stored (registered) in the target keyword data pool 11. Each of the target keywords includes identification code (number), prototype, reading data (syllable character string), and part of speech data. “Prototype” uses kanji if it is generally expressed by or including kanji, and katakana if it is generally expressed by or including katakana. , Represents a word about gourmet. “Reading data” is data representing a syllable character string. In this specification, the reading data (syllable character string) is expressed in katakana. “Part of speech” stores either a noun, an adjective, an adjective verb, a verb, or an adverb according to the word represented by the prototype.

図５を参照して，グルメに関するターゲットキーワード・データプール11と組である共起単語データプール12には，グルメに関するターゲットキーワード・データプール11に記憶されている多数のターゲットキーワードのそれぞれについて，同じドキュメント（ウェブページ）のなかで頻繁に一緒に使われる単語（共起単語）が多数記憶（登録）されている。識別符号（番号），ターゲットキーワード（原型），ターゲットキーワードに対する共起単語および共起単語の読みデータが一組のデータとして記憶される。一般には一つのターゲットキーワード（原型）について複数の共起単語が記憶される。 Referring to FIG. 5, the co-occurrence word data pool 12 paired with the gourmet target keyword data pool 11 is the same for each of the many target keywords stored in the gourmet target keyword data pool 11. Many words (co-occurrence words) that are frequently used together in a document (web page) are stored (registered). The identification code (number), the target keyword (prototype), the co-occurrence word for the target keyword and the co-occurrence word reading data are stored as a set of data. In general, a plurality of co-occurrence words are stored for one target keyword (prototype).

上述したグルメに関するターゲットキーワード・データプール11および共起単語データプール12の組を用いることで，以下に詳述するように，一つの処理結果（誤認識修正処理後の文字列）が生成されて出力される。同様にして，観光（カテゴリ２）に関するターゲットキーワード・データプール13および共起単語データプール14の組（図３参照），宿泊施設（カテゴリＮ）に関するターゲットキーワード・データプール15および共起単語データプール16（図３参照）を用いることでも，それぞれ処理結果が生成されて出力される。すなわち，誤認識自動修正システムではカテゴリごとに処理結果が生成される。カテゴリごとの複数の処理結果はいずれも同じアルゴリズムによって生成されるので，以下の実施例では，グルメに関するターゲットキーワード・データプール11（図４）および共起単語データプール12（図５）の組を用いたときのサーバ10の処理（誤認識自動修正処理）を説明する。 By using the target keyword / data pool 11 and the co-occurrence word data pool 12 related to gourmet, one processing result (character string after erroneous recognition correction processing) is generated as described in detail below. Is output. Similarly, a set of target keyword data pool 13 and co-occurrence word data pool 14 relating to tourism (category 2) (see FIG. 3), target keyword data pool 15 relating to accommodation facility (category N) and co-occurrence word data pool By using 16 (see FIG. 3), the processing results are also generated and output. That is, the erroneous recognition automatic correction system generates a processing result for each category. Since a plurality of processing results for each category are all generated by the same algorithm, in the following embodiment, a set of a target keyword data pool 11 (FIG. 4) and a co-occurrence word data pool 12 (FIG. 5) relating to gourmet is created. Processing of the server 10 when used (error recognition automatic correction processing) will be described.

２．誤認識自動修正処理の全体の流れ
図６は，サーバ10において実行される誤認識自動修正処理の全体的な流れを示すフローチャートである。はじめに図６を参照してサーバ10において実行される誤認識自動修正処理を概略的に説明する。 2. Overall Flow of False Recognition Automatic Correction Processing FIG. 6 is a flowchart showing the overall flow of false recognition automatic correction processing executed in the server 10. First, the erroneous recognition automatic correction processing executed in the server 10 will be schematically described with reference to FIG.

上述したように，クライアント１から入力された音声データは，ネットワークを介してサーバ10に向けて送信され，サーバ10に入力（受信）される（ステップ21）。 As described above, the audio data input from the client 1 is transmitted to the server 10 via the network and input (received) to the server 10 (step 21).

音声認識処理，すなわち音声データを音節文字列データ（読みデータ）に変換する処理が実行される（ステップ22）。音声認識処理にはたとえば市販されている音声認識ソフトウエアを利用することができる。 Speech recognition processing, that is, processing for converting speech data into syllable character string data (reading data) is executed (step 22). For example, commercially available voice recognition software can be used for the voice recognition processing.

音声認識処理によって音データが変換されて生成される音節文字列データは，日本語として文法的または言語的に誤りがあることがある。この実施例では，「これからうなぎをたべたい」と発話したものの，音声認識処理の結果である音節文字列が「コカラウネギヲタベタイ」であったこと，すなわち誤認識されたことを前提とする。上述した誤認識は，発話をした者の発音が悪いことが原因であることもあるし，音声データに多くの雑音が含まれていることが原因であることもある。いずれにしても，日本語として正しくない（意味の通じない）音節文字列が音声認識処理によって生成された状態を例にして，サーバ10における誤認識修正処理を説明する。 The syllable character string data generated by converting the sound data by the speech recognition processing may have a grammatical or linguistic error as Japanese. In this embodiment, it is assumed that the syllable character string that is the result of the speech recognition processing is “cocaraune wotabaitai”, that is, that it has been misrecognized, although “I want to eat eel from now on” is spoken. . The above-described misrecognition may be caused by poor pronunciation of a person who speaks, or may be caused by a lot of noise included in voice data. In any case, the misrecognition correction process in the server 10 will be described by taking as an example a state in which a syllable character string that is not correct in Japanese (not meaningful) is generated by the speech recognition process.

音声認識処理の結果（「コカラウネギヲタベタイ」の音節文字列）（以下，「誤認識音節文字列」という）が誤認識修正処理の対象のデータである。誤認識音節文字列に対して，次の２つの処理が行われる。 The result of the speech recognition process (the syllable character string of “Coca Raunegi Otabaitai”) (hereinafter referred to as “misrecognized syllable character string”) is the target data for the erroneous recognition correction process. The following two processes are performed on the misrecognized syllable character string.

その一つは，誤認識音節文字列をかな漢字変換処理し，かな漢字変換処理によって生成された漢字交じりの文字列（「個から畝議を食べたい」）の中に上述したターゲットキーワード・データプール11に登録されているターゲットキーワードが含まれているかどうかを検索し，見つかった文字列部分をマーキングする処理である（ステップ23〜25）。以下，かな漢字変換データを利用するステップ23〜25の処理を「かな漢字変換データ利用処理」という。たとえば「コカラウネギヲタベタイ」の音節文字列から「個から畝議を食べたい」の漢字交じりの文字列が生成され，このうち「食べ」（食べる）がターゲットキーワード・データプール11に登録されている場合，「食べ」の文字列部分に，かな漢字変換データ利用処理によって見つかったターゲットキーワードであることがマーキングされる。この処理は最終的に決定される処理結果の精度を高めるために行われる（詳細は後述する）。 One of them is a kana-kanji conversion process on a misrecognized syllable string, and the target keyword data pool 11 described above is included in the kanji-mixed string generated by the kana-kanji conversion process. Is a process of searching whether or not the target keyword registered in is included and marking the found character string portion (steps 23 to 25). Hereinafter, the processing of steps 23 to 25 using the kana-kanji conversion data is referred to as “kana-kanji conversion data use processing”. For example, a character string of “I want to eat a request from an individual” is generated from a syllable character string of “Coca Raunegiota Tabaitai”, and “Eat” (eat) is registered in the target keyword / datapool 11 If it is, the character string portion of “eat” is marked as a target keyword found by the kana-kanji conversion data use processing. This process is performed in order to improve the accuracy of the process result finally determined (details will be described later).

他の一つは，ターゲットキーワード・データプール11に登録されているターゲットキーワードのうち，誤認識音節文字列（「コカラウネギヲタベタイ」）に含まれる一部の音節文字列部分に合致する，またはこれに類似する音節文字列部分（読み）を有するターゲットキーワードを検索し，見つかったターゲットキーワード（原型）によって，誤認識音節文字列中の対応する音節文字列部分を置換する処理である（ステップ26，27）。以下，音節文字列を利用するステップ26，27の処理を「音節文字列利用処理」と呼ぶ。音節文字列利用処理によって，誤認識音節文字列「コカラウネギヲタベタイ」から「コカラ鰻ヲ食べタイ」の文字列が生成される。また，音節文字列利用処理によって置換された文字列部分，ここでは「鰻」および「食べ」には，音節文字列利用処理によって置換された文字列部分（ターゲットキーワード）であることがマーキングされる。音節文字列利用処理の詳細も後述する。 The other one matches the partial syllable character string part included in the misrecognized syllable character string ("Coca Raunegiwo Tabaitai") among the target keywords registered in the target keyword data pool 11. Alternatively, a target keyword having a syllable character string portion (reading) similar to this is searched, and the corresponding syllable character string portion in the misrecognized syllable character string is replaced with the found target keyword (prototype) (step) 26, 27). Hereinafter, the processing of steps 26 and 27 using the syllable character string is referred to as “syllabic character string use processing”. Through the syllable character string use processing, a character string of “Kokara-wo-eaten-tai” is generated from the misrecognized syllable character string “Koca Raunegi wota-tai”. In addition, the character string portion replaced by the syllable character string use processing, here “鰻” and “eat” is marked as the character string portion (target keyword) replaced by the syllable character string use processing. . Details of the syllable character string use processing will also be described later.

音節文字列利用処理によって生成された文字列（コカラ鰻ヲ食べタイ）に含まれる文字列部分に，音節文字列利用処理で置換された文字列部分であって，かつかな漢字変換データ利用処理においてもマーキングされた文字列部分であるものが存在するかどうかが判断され，そのような文字列部分については，かな漢字変換データ利用処理で見つかったものであることがラベリングされる（ステップ28）。たとえば音節文字列利用処理によって生成された文字列「コカラ鰻ヲ食べタイ」のうち，「食べ」の文字列部分が音節文字列利用処理で置換された文字列部分であり，かつかな漢字変換データ利用処理においてもマーキングされた文字列部分であれば，「食べ」の文字列部分はかな漢字変換データ利用処理で見つかったものであることがラベリングされる。この処理も，最終的に決定される処理結果の精度を高めるために行われる（詳細は後述する）。 The character string part included in the character string generated by the syllabic character string use processing (Kokora 鰻 eat tie) is replaced by the syllable character string use processing, and in the kana-kanji conversion data use processing It is determined whether or not there is a marked character string part, and it is labeled that such a character string part is found in the kana-kanji conversion data utilization processing (step 28). For example, in the character string “Kokhara-wo-eaten-tai” generated by syllable character string use processing, the character string portion of “eat” is replaced by syllable character string use processing, and kana-kanji conversion data is used. If the character string portion is marked in the process, the character string portion of “eat” is labeled as found in the kana-kanji conversion data use processing. This process is also performed in order to improve the accuracy of the finally determined process result (details will be described later).

ここまでの処理で生成された文字列のうち，ターゲットキーワード以外の音節文字列部分について，共起単語データプール12に登録されている共起単語の中から，その音節文字列部分に合致する，またはこれに類似する音節文字列（読み）を有する共起単語を検索し，見つかった共起単語（原型）によって，対応する音節文字列部分を置換する処理が行われる（ステップ29，30）。データ検索に用いられるデータプールは異なるが，アルゴリズムは上述した音節文字列利用処理（ステップ26，27）と同じである。この処理によって，「コカラ」の音節文字列部分が共起単語「これから」によって置換され，「これから鰻ヲ食べタイ」の文字列が生成される。 Of the character strings generated so far, the syllable character string part other than the target keyword matches the syllable character string part from the co-occurrence words registered in the co-occurrence word data pool 12. Alternatively, a co-occurrence word having a syllable character string (reading) similar to this is searched, and the corresponding syllable character string part is replaced with the found co-occurrence word (prototype) (steps 29 and 30). The data pool used for data retrieval is different, but the algorithm is the same as the above-described syllable character string use processing (steps 26 and 27). By this processing, the syllable character string portion of “Kokhara” is replaced with the co-occurrence word “from now on”, and a character string “from now on” is generated.

最後に残存する音節文字列部分をそのまま確定する（そのままの表現を用いる）ことで誤認識自動修正処理は終了する（ステップ31）。「これから鰻を食べたい」の文字列が生成される。 When the last remaining syllable character string portion is confirmed as it is (using the expression as it is), the erroneous recognition automatic correction processing is ended (step 31). The character string “I want to eat rice cake from now on” is generated.

以上の処理を経ることで，誤認識音節文字列「コカラウネギヲタベタイ」から「これから鰻を食べたい」の処理結果（誤認識修正文字列）が生成される。生成される誤認識修正文は，誤認識音節文字列を単にかな漢字変換することでは得ることができない，日本語として正しい文章となる。生成された誤認識修正文を表す文字列データがサーバ10からクライント１に送信される。クライアント１では，サーバ10から送信された文字列が用いられて，その後の処理（たとえば，検索エンジンを用いたウェブ・ページの検索処理など）に進む。 Through the above processing, the processing result (misrecognition correction character string) of “I want to eat potatoes from now” is generated from the misrecognized syllable character string “Coca Raunegiwo Tabaitai”. The generated misrecognition correction sentence is a sentence correct in Japanese that cannot be obtained by simply kana-kanji conversion of the misrecognized syllable character string. Character string data representing the generated erroneous recognition correction sentence is transmitted from the server 10 to the client 1. In the client 1, the character string transmitted from the server 10 is used, and the process proceeds to subsequent processing (for example, web page search processing using a search engine).

以下，図７から図１１および図１８のフローチャートおよび図１２から図１７および図１９に示す具体例を参照して，上記処理を詳細に説明する。 Hereinafter, the above processing will be described in detail with reference to the flowcharts of FIGS. 7 to 11 and 18 and the specific examples shown in FIGS. 12 to 17 and FIG.

３．かな漢字変換データ利用処理
図７は，図６のステップ24の処理を詳細に示すフローチャートである。ここでは「コカラウネギヲタベタイ」の誤認識音節文字列に対してかな漢字変換処理が行われ，「個から畝議を食べたい」の漢字交じりの文字列が生成されているものとする。図１２（Ａ）〜（Ｃ）を参照して図７に示す処理を説明する。 3. Kana-Kanji Conversion Data Utilization Processing FIG. 7 is a flowchart showing in detail the processing of step 24 in FIG. Here, it is assumed that a kana-kanji conversion process is performed on the misrecognized syllable character string of “Kokaraunegiwo Tabaitai”, and a character string mixed with kanji “I want to eat debate from an individual” is generated. The process shown in FIG. 7 will be described with reference to FIGS.

はじめに，かな漢字変換処理後の漢字交じりの文字列に対して形態素解析処理が行われる（ステップ41）。「個から畝議を食べたい」の漢字交じりの文字列（図１２（Ａ））の場合，「個」，「から」，「畝」，「議」，「を」，「食べ」および「たい」の形態素（最小単位）に分割され，それぞれの読み，原型および品詞が判別される（図１２（Ｂ））。形態素解析処理には形態素解析システム「Chasen」などを利用することができる。 First, a morphological analysis process is performed on a character string mixed with kanji after the kana-kanji conversion process (step 41). In the case of a character string mixed with kanji “I want to eat an applause from an individual” (FIG. 12A), “Individual”, “From”, “畝”, “Meeting”, “O”, “Eat” and “ Is divided into morphemes (minimum units), and each reading, prototype, and part of speech are determined (FIG. 12B). For the morphological analysis processing, a morphological analysis system “Chasen” or the like can be used.

分割された形態素のうち，名詞，形容詞，形容動詞，動詞および副詞の品詞を持つ形態素が選択される（ステップ42）。上述の例では，「個」，「畝」，「議」および「食べ」の４つの形態素が選択される。選択された形態素のそれぞれについて，その形態素（単語）が，グルメに関するターゲットキーワード・データプール11（図４参照）にターゲットキーワードとして登録されているかどうかが判断される（ステップ43）。 Of the divided morphemes, morphemes having nouns, adjectives, adjective verbs, verbs and adverbs are selected (step 42). In the above example, four morphemes of “individual”, “mochi”, “discussion” and “eat” are selected. For each selected morpheme, it is determined whether the morpheme (word) is registered as a target keyword in the gourmet target keyword data pool 11 (see FIG. 4) (step 43).

選択された形態素の中にターゲットキーワードが含まれている場合，その形態素（ターゲットキーワード）に対して，上述したように，その形態素（ターゲットキーワード）がかな漢字変換データ利用処理を用いて見つかったターゲットキーワードであることを示すマーキングが行われる（図６，ステップ25）。たとえば，「個」，「畝」，「議」および「食べ」の４つの形態素のうち，「食べ」（原型「食べる」）がターゲットキーワード・データプール11にターゲットキーワードとして登録されている場合，文字列部分「食べ」について，それがかな漢字変換データ利用処理を用いて見つかったターゲットキーワードであることを示すマーキングが行われる。この様子を図１２（Ｃ）に模式的に示す。 If the target keyword is included in the selected morpheme, the target keyword for which the morpheme (target keyword) was found using the Kana-Kanji conversion data utilization process as described above Marking is performed (FIG. 6, step 25). For example, among four morphemes of “individual”, “mochi”, “discussion”, and “eat”, “eat” (prototype “eat”) is registered in the target keyword / data pool 11 as a target keyword. The character string portion “eat” is marked to indicate that it is a target keyword found using the kana-kanji conversion data utilization processing. This is schematically shown in FIG.

選択された形態素のすべてがターゲットキーワード・データプール11に登録されていない場合には，特段の処理は行われない。 If not all of the selected morphemes are registered in the target keyword / data pool 11, no special processing is performed.

４．音節文字列利用処理（ステップ26，27）
図８は，図６のステップ26の処理を詳細に示すフローチャートである。図９から図１１は図８に示す処理をさらに詳細に示すフローチャートである。図１３から図１７に示す具体例を参照して，図８に示す処理を説明する。 4). Syllable string usage processing (steps 26 and 27)
FIG. 8 is a flowchart showing in detail the processing of step 26 in FIG. 9 to 11 are flowcharts showing the process shown in FIG. 8 in more detail. The process shown in FIG. 8 will be described with reference to specific examples shown in FIGS.

はじめに，グルメに関するターゲットキーワード・データプール11（図４）に記憶されている複数のターゲットキーワードのそれぞれについて，検索パターン（検索式）を作成する処理が行われる（ステップ51）。この検索パターン作成処理は，誤認識音節文字列（コカラウネギヲタベタイ）の中に，ターゲットキーワードに完全に合致する文字列部分が含まれているかを判断するのみならず，これに類似する文字列部分が含まれているかを判断するために行われる処理である。この処理を経ることで，図１５に示すように，ターゲットキーワード・データプール11に登録されているターゲットキーワードのそれぞれについて，一または複数の検索パターンが作成される。 First, processing for creating a search pattern (search formula) is performed for each of a plurality of target keywords stored in the target keyword / data pool 11 related to gourmet (FIG. 4) (step 51). This search pattern creation process not only determines whether the character string part that completely matches the target keyword is included in the misrecognized syllable character string (cocaraunegiotatabai), but also similar characters. This is a process performed to determine whether or not a column part is included. Through this process, as shown in FIG. 15, one or a plurality of search patterns are created for each target keyword registered in the target keyword / data pool 11.

図９および図１０を参照して，図８のステップ51の検索パターンの作成処理を詳細に説明する。 With reference to FIGS. 9 and 10, the search pattern creation processing in step 51 of FIG. 8 will be described in detail.

はじめに，ターゲットキーワードのそれぞれを特定するためのカウンタｉがインクリメントされ（ステップ60），ターゲットキーワード・データプール11に記憶されている複数のターゲットキーワードのうち，一のターゲットキーワードがメモリ７に読み出される（ステップ61）。たとえば，図４に示すターゲットキーワード・データプール11に記憶されている複数のターゲットキーワードの中の先頭（No.１）のターゲットキーワード（原型「鰻」）が，ターゲットキーワード・データプール11からメモリ７に読み出される。 First, a counter i for specifying each target keyword is incremented (step 60), and one target keyword is read out to the memory 7 among a plurality of target keywords stored in the target keyword / data pool 11 ( Step 61). For example, the first (No. 1) target keyword (prototype “鰻”) among a plurality of target keywords stored in the target keyword / data pool 11 shown in FIG. Is read out.

読み出されたターゲットキーワードの読みデータの文字数Ｔの半分の数が算出され，その数が有効ビット数Ｂとされる（ステップ62）。有効ビット数Ｂについての詳細は後述する。ターゲットキーワードの読みデータの文字数Ｔが奇数であれば，四捨五入した整数が有効ビット数Ｂとされる。たとえば，読み出されたターゲットキーワードの読みデータが「ウナギ」である場合には，文字数が「３」であるから，有効ビット数Ｂとして「２」が算出される。なお，読み出されたターゲットキーワードの読みデータの文字数Ｔが２文字である場合には，例外的に有効ビット数Ｂは２文字とされる。また読み出されたターゲットキーワードの読みデータの文字数Ｔが１文字であれば，そのターゲットキーワードは無視する（そのターゲットキーワードを読み出さない）ようにしてもよい。 Half the number of characters T of the read data of the read target keyword is calculated, and the number is set as the effective bit number B (step 62). Details of the effective bit number B will be described later. If the number of characters T in the reading data of the target keyword is an odd number, the rounded integer is set as the effective bit number B. For example, when the read data of the read target keyword is “eel”, since the number of characters is “3”, “2” is calculated as the effective bit number B. When the number of characters T of the read data of the read target keyword is 2 characters, the effective bit number B is exceptionally 2 characters. If the number T of characters read from the read target keyword is 1, the target keyword may be ignored (the target keyword is not read).

最大処理回数Ｐが算出される（ステップ63）。後述するように，検索パターンの作成を試みる処理は，読みデータの文字数Ｔに依存する数だけ繰り返し行われる。最大処理回数Ｐは２^Ｔ−１によって算出される。たとえば読出されたターゲットキーワードの読みデータが「ウナギ」であれば，読みデータの文字数Ｔ＝３であるから，最大処理回数Ｐとして「７」（＝２^３−１）が算出される。 The maximum processing number P is calculated (step 63). As will be described later, the process of trying to create a search pattern is repeatedly performed by the number depending on the number of characters T of the read data. The maximum number of processing times P is calculated by 2 ^T −1. For example, if the read data of the read target keyword is “eel”, since the number of characters T of the read data is 3, “7” (= 2 ³ −1) is calculated as the maximum processing count P.

メモリ７中に，上述した最大処理回数分の検索パターンを記憶する領域が確保され，初期化処理される（ステップ64）。 An area for storing the search patterns for the maximum number of processing times described above is secured in the memory 7 and is initialized (step 64).

処理カウンタＣがインクリメントされ（ステップ65），カウンタＣが最大処理回数Ｐよりも小さいかどうかが判断される（ステップ66）。 The processing counter C is incremented (step 65), and it is determined whether the counter C is smaller than the maximum processing count P (step 66).

カウンタＣが最大処理回数Ｐよりも小さい場合（ステップ66でYES ），カウンタＣの数値が２進数化される（図10，ステップ70）。たとえば最初の処理であればカウンタＣ＝１であり，この２進数は「００１」である（図１３参照）。なお，２進数化カウンタＣのビット数は，処理対象のターゲットキーワードの読みデータの文字数Ｔに合わせられる。 If the counter C is smaller than the maximum processing count P (YES in step 66), the value of the counter C is converted to a binary number (FIG. 10, step 70). For example, in the case of the first processing, the counter C = 1, and this binary number is “001” (see FIG. 13). The number of bits of the binary digitizing counter C is matched with the number of characters T of the read data of the target keyword to be processed.

２進数化カウンタＣにおけるビット「１」の数Ｎが取得され（ステップ71），ビット「１」の数Ｎが上述した有効ビット数Ｂ以上であるかが判断される（ステップ72）。ビット「１」の数Ｎが有効ビット数Ｂ以上である場合，検索パターンの作成処理に進む（ステップ72でYES ）。ビット「１」の数Ｎが有効ビット数Ｂ未満のときには検索パターンの作成処理に進まず，カウンタＣがインクリメントされる（ステップ72でNO，ステップ76）。これは，後述するように作成される検索パターン（図１３）において，たとえば１文字だけ合致すればヒットされることになる検索パターンの作成を避けるための処理である。たとえば，２進数化カウンタＣが「００１」であれば，これに含まれるビット「１」の数Ｎは「１」である。読みデータの文字数Ｔを用いて算出される有効ビット数Ｂ＝２とすると，ビット「１」の数Ｎ（＝１）＜有効ビット数Ｂ（＝２）であるから，検索パターンの作成処理に進まずにカウンタＣがインクリメントされることになる。 The number N of bits “1” in the binary digitizing counter C is acquired (step 71), and it is determined whether the number N of bits “1” is equal to or greater than the effective bit number B described above (step 72). If the number N of bits “1” is equal to or greater than the number of valid bits B, the process proceeds to search pattern creation processing (YES in step 72). When the number N of bits “1” is less than the effective bit number B, the process does not proceed to the search pattern creation process, and the counter C is incremented (NO in step 72, step 76). This is a process for avoiding creation of a search pattern that will be hit if only one character matches in a search pattern (FIG. 13) created as described later. For example, if the binary digitizing counter C is “001”, the number N of bits “1” included therein is “1”. If the number of effective bits B = 2 calculated using the number of characters T of the read data is N (= 1) <the number of effective bits B (= 2), the number of bits “1” is smaller than the number of effective bits B (= 2). The counter C is incremented without progressing.

カウンタＣがインクリメントされた後，ステップ66からの処理が繰り返される。カウンタＣ＝２の場合，その２進数化カウンタＣは「０１０」である。この場合もビット「１」の数Ｎ（＝１）＜有効ビット数Ｂ（＝２）であるから，検索パターンの作成処理に進まずにカウンタＣがさらにインクリメントされる（ステップ72でNO，ステップ76，ステップ66，図１３）。 After the counter C is incremented, the processing from step 66 is repeated. When the counter C = 2, the binarization counter C is “010”. Also in this case, since the number N (= 1) of bits “1” <the number of effective bits B (= 2), the counter C is further incremented without proceeding to the search pattern creation process (NO in step 72, step 76, step 66, FIG. 13).

カウンタＣ＝３の場合，その２進数カウンタＣは「０１１」である。ビット「１」の数Ｎ＝２であり，ビット「１」の数Ｎ（＝２）≧有効ビット数Ｂ（＝２）となるので，検索パターンの作成処理に進む（ステップ72でYES）。 When the counter C = 3, the binary counter C is “011”. Since the number of bits “1” N = 2 and the number of bits “1” N (= 2) ≧ the number of effective bits B (= 2), the process proceeds to search pattern creation processing (YES in step 72).

２進数化カウンタＣを用いて文字パターンが作成される（ステップ73）。図１３を参照して具体的に説明する。処理対象のターゲットキーワードの読みデータが「ウナギ」であり，２進数化カウンタＣが「０１１」であるとする。この場合には「○ナギ」が文字パターンとして作成される。すなわち，２進数化カウンタＣ「０１１」に含まれるビット０について，その位置に「○」を対応づけ，ビット１についてその位置の読みデータ（音節）をそのまま対応づけたものが，作成される文字パターンである。上述したように，２進数化カウンタＣはターゲットキーワードの読みデータの文字数Ｔと同じビット数を持つので，２進数化カウンタＣの値（フラグのオン／オフと言ってもよい）に応じて，文字パターンが作成されることになる。 A character pattern is created using the binary digitizing counter C (step 73). This will be specifically described with reference to FIG. It is assumed that the reading data of the target keyword to be processed is “eel” and the binary counter C is “011”. In this case, “○ Nagi” is created as a character pattern. That is, for the bit 0 included in the binary digitizing counter C “011”, “0” is associated with the position, and the read data (syllable) at that position is directly associated with the bit 1 as the character to be created. It is a pattern. As described above, the binary digitizing counter C has the same number of bits as the number of characters T of the read data of the target keyword. Therefore, depending on the value of the binary digitizing counter C (which may be referred to as flag ON / OFF), A character pattern will be created.

作成された文字パターンを用いて次に検索パターンが作成される（ステップ74）。検索パターンは次のようにして行われる。 Next, a search pattern is created using the created character pattern (step 74). The search pattern is performed as follows.

（１）文字パターンに含まれる○を無視する。
（２）○以外の文字が連続している場合，文字と文字との間に正規表現（.*?）を加える。
（３）（２）の処理で正規表現(.*?)が連続する場合には１つの正規表現(.*?)とする。 (1) Ignore ○ included in the character pattern.
(2) If characters other than ○ are consecutive, add a regular expression (. *?) Between the characters.
(3) If regular expressions (. *?) Continue in the process of (2), one regular expression (. *?) Is assumed.

正規表現(.*?)の意味は後述する。 The meaning of regular expressions (. *?) Will be described later.

図１３を参照して，たとえば文字パターンが「○ナギ」であれば，「ナ(.*?)ギ」が作成される検索パターンである。作成された検索パターンはメモリ７に記憶され（ステップ75），その後カウンタＣがインクリメントされて（ステップ76），ステップ66からの処理が繰り返される。 Referring to FIG. 13, for example, if the character pattern is “○ Nagi”, “Na (. *?) G” is created. The created search pattern is stored in the memory 7 (step 75), then the counter C is incremented (step 76), and the processing from step 66 is repeated.

図１３を参照して，読みデータが「ウナギ」であるターゲットキーワードを処理対象にして上述の処理を行うと，上述したように，カウンタＣ＝１およびカウンタＣ＝２のときの処理では検索パターンは作成されず，カウンタＣ＝３のときの処理で検索パターン「ナ(.*?)ギ」が作成される。カウンタＣ＝４のときの処理では検索パターンは作成されず，カウンタＣ＝５，６および７のときの処理では，検索パターン「ウ(.*?)ギ」，「ウ(.*?)ナ」および「ウ(.*?)ナ(.*?)ギ」がそれぞれ作成される。 Referring to FIG. 13, when the above processing is performed with the target keyword whose reading data is “eel” as the processing target, as described above, in the processing when counter C = 1 and counter C = 2, the search pattern Is not created, and the search pattern “NA (. *?) GI” is created in the process when the counter C = 3. In the processing when the counter C = 4, the search pattern is not created. In the processing when the counter C = 5, 6 and 7, the search patterns “U (. *?)”, “U (. *?) "And" U (. *?) Na (. *?) Gi "are created respectively.

カウンタＣがさらにインクリメントされてＣ＝８になると，カウンタＣが読みデータの文字数Ｔに応じて算出された最大処理回数Ｐ（＝７）を超える（ステップ66でNO）。ここまでの処理で作成された検索パターン（図１３）が，正規表現(.*?) 以外の文字数の多い順番に並び替えられ（ステップ67），これによりターゲットキーワード「鰻」についての処理が終了する。並び替えを終えた検索パターンは，ターゲットキーワードの原型，読みデータ，読みデータの文字数，検索パターンに含まれる文字数とともに，メモリ７に記憶される（図１４）。 When the counter C is further incremented to C = 8, the counter C exceeds the maximum number of processing times P (= 7) calculated according to the number of characters T of the read data (NO in step 66). The search pattern created in the process so far (Fig. 13) is rearranged in the descending order of the number of characters other than the regular expression (. *?) (Step 67), thereby completing the process for the target keyword "鰻". To do. The search pattern after the rearrangement is stored in the memory 7 together with the target keyword prototype, reading data, the number of characters of the reading data, and the number of characters included in the search pattern (FIG. 14).

次にカウンタｉがインクリメントされて，ターゲットキーワード・データプール11に登録されている次のターゲットキーワードに対する処理に進む（ステップ68，ステップ69でNO）。ターゲットキーワード・データプール11に登録されているすべてのターゲットキーワードについて上述の処理を終えると，検索パターンの作成処理は終了する（ステップ69でYES）。 Next, the counter i is incremented, and the process proceeds to the process for the next target keyword registered in the target keyword / data pool 11 (NO in step 68 and step 69). When the above process is completed for all target keywords registered in the target keyword / data pool 11, the search pattern creation process ends (YES in step 69).

図１５は，図４に示すターゲットキーワード・データプール11に登録されているターゲットキーワードについて作成される検索パターンを示している。 FIG. 15 shows search patterns created for the target keywords registered in the target keyword / data pool 11 shown in FIG.

図８に戻って，次に，作成した検索パターンが用いられて，これに合致する文字列を誤認識音節文字列の中から検索する処理が行われる（ステップ52）。 Returning to FIG. 8, the created search pattern is used, and a process of searching for a character string matching the search pattern from the erroneously recognized syllable character strings is performed (step 52).

図１１は検索パターンを用いた文字列検索処理の詳細を示すフローチャートである。 FIG. 11 is a flowchart showing details of character string search processing using a search pattern.

はじめに，上述した正規表現(.*?)を含む検索パターンについて説明する。 First, a search pattern including the regular expression (. *?) Described above will be described.

正規表現(.*?)はパターンマッチ文字列を表すもので，(.*?)の部分に文字は無くてもよいこと，および(.*?) の部分は何文字でもよい（ただし文字長が最も短い部分が選ばれる）ことを表すものである。 A regular expression (. *?) Represents a pattern match character string. There can be no characters in the (. *?) Part, and the (. *?) Part can be any number of characters (however, the character length Represents the shortest part).

たとえば，検索パターン「ウ(.*?)ナ(.*?)ギ」は，検索対象の音節文字列に「ウナギ」が含まれていればその部分がヒットする。また，検索対象の文字列が「アソノウマノナキゴエニギョーテンスル」（阿蘇の馬の鳴き声に仰天する）であれば，「ウマノナキゴエニギ」の部分がヒットする。同様にして，検索パターン「ナ(.*?)ギ」は，検索対象の文字列に「ナギ」，「ナイスギフト」（ヒット部分は「ナイスギ」），「ナミノリギャル」（ヒット部分は「ナミノリギ」）などが含まれているとヒットする。 For example, if the search pattern “u (. *?) Na (. *?) Gi” includes “eel” in the search target syllable character string, that portion is hit. In addition, if a character string that is the target of a search is "a Sono Umano Naki Goe two-Gyo Ten Sul" (astounded to cries of Aso of the horse), part of the "c Mano Na Kigoeni formate" is a hit. Similarly, the search pattern “NA (. *?) GI” is “Nagi”, “Nice gift” (hit part is “Nice cedar”), “Naminorigal” (hit part is “Namino rigi”). )) Etc., it hits.

図１１を参照して，検索パターンを用いた文字列検索処理は，ターゲットキーワード・データプール11に登録されている複数のターゲットキーワードのそれぞれについて作成された検索パターンを用いて並列的に行われる。 Referring to FIG. 11, the character string search process using the search pattern is performed in parallel using the search pattern created for each of the plurality of target keywords registered in the target keyword / data pool 11.

図１１のフローチャートの一番左の処理を参照して，この処理では，ターゲットキーワード・データプール11（図４）に最初に登録されているターゲットキーワード「鰻」から作成された検索パターンを用いて検索処理が行われ，次にターゲットキーワード・データプール11において２番目に登録されているターゲットキーワード「鯛」から作成された検索パターンを用いて検索処理が行われ，最終的に最後（Ｍ番目）に登録されているターゲットキーワード（図４に示すターゲットキーワード・データプール11の場合，１２番目の「減る」）から作成された検索パターンを用いて検索処理が行われる。図１１の左から２番目の処理では，はじめにターゲットキーワード・データプール11において２番目に登録されているターゲットキーワード「鯛」から作成された検索パターンを用いて検索処理が行われ，次に３番目に登録されているターゲットキーワード「ネギ焼き」から作成された検索パターンを用いて検索処理が行われ，最終的に１番目に登録されているターゲットキーワード「鰻」から作成された検索パターンを用いて検索処理が行われる。 Referring to the leftmost process in the flowchart of FIG. 11, this process uses a search pattern created from the target keyword “鰻” that is first registered in the target keyword data pool 11 (FIG. 4). The search process is performed, and then the search process is performed using the search pattern created from the second target keyword “鯛” registered in the target keyword / data pool 11, and finally the last (Mth) The search processing is performed using the search pattern created from the target keywords registered in (in the case of the target keyword / data pool 11 shown in FIG. 4, the 12th “decrease”). In the second process from the left in FIG. 11, first, a search process is performed using a search pattern created from the target keyword “鯛” registered second in the target keyword / data pool 11, and then the third process. The search process is performed using the search pattern created from the target keyword “Negiyaki” registered in, and finally the search pattern created from the target keyword “キーワード” registered first is used. Search processing is performed.

並列処理の一つ（図１１の一番左の処理）を説明する。カウンタｊがインクリメントされ（ステップ70），ターゲットキーワード・データプール11の１番目に登録されているターゲットキーワード「鰻」から作成された検索パターンのそれぞれ（図１４，図１５参照）が用いられて，誤認識音節文字列「コカラウネギヲタベタイ」を処理対象とする文字列検索が行われる。ターゲットキーワード「鰻」からは，「ウ(.*?)ナ(.*?)ギ」，「ナ(.*?)ギ」，「ウ(.*?)ギ」および「ウ(.*?)ナ」の正規表現を用いた４つの検索パターンが作成されている。４つの検索パターンのすべてについて，検索パターンに合致する文字列部分が誤認識音節文字列に含まれるかどうかが判断される（ステップ71ａ，72ａ）。 One of the parallel processes (the leftmost process in FIG. 11) will be described. The counter j is incremented (step 70), and each of the search patterns (see FIGS. 14 and 15) created from the target keyword “鰻” registered first in the target keyword / data pool 11 is used. A character string search is performed on the erroneously recognized syllable character string “Coca Raunegi wotabetai”. From the target keyword “鰻”, “U (. *?) Na (. *?) Gi”, “Na (. *?) Gi”, “U (. *?) Gi” and “U (. *? Four search patterns using the regular expression “)” have been created. For all four search patterns, it is determined whether or not the character string portion matching the search pattern is included in the misrecognized syllable character string (steps 71a and 72a).

図１６（Ａ）〜（Ｅ）を参照して具体的に説明する。誤認識音節文字列「コカラウネギヲタベタイ」については，「ウ(.*?)ナ(.*?)ギ」，「ナ(.*?)ギ」および「ウ(.*?)ナ」の３つの検索パターンではヒットせず，検索パターン「ウ(.*?)ギ」によって「ウネギ」の部分がヒットする（図１６（Ａ））。 A specific description will be given with reference to FIGS. For the misrecognized syllable character string "Kokalauunegiwotabaitai", "U (. *?) Na (. *?) Gi", "Na (. *?) Gi" and "U (. *?) Na" The search pattern “U (. *?) GI” hits the “UNIGI” part (FIG. 16A).

ヒットした文字列部分が動詞の後ろに続くかどうかが判断される（ステップ73ａ）。ヒットした文字列部分の前方の文字列部分が動詞であるかどうかは，たとえば辞書データを用いて判断される。ヒットした文字列部分が動詞の後ろに続く文字列部分でない場合には，ヒットした文字列部分に対応するターゲットキーワード（上述の場合「鰻」）がメモリ７に保存され，かつヒットした文字列部分が誤認識音節文字列から除去される（ステップ74ａ）。カウンタｊがインクリメントされて，次のターゲットキーワード「鯛」から作成された検索パターンを用いた処理に進む（ステップ75ａ，76ａでNO）。検索パターンに合致する文字列部分が見つからなかった場合にも，カウンタｊがインクリメントされて次のターゲットキーワードから作成された検索パターンを用いた処理に進む（ステップ72ａでNO，ステップ75ａ，ステップ76ａでNO，ステップ71ａ）。 It is determined whether or not the hit character string portion follows the verb (step 73a). Whether the character string portion in front of the hit character string portion is a verb is determined using, for example, dictionary data. If the hit character string portion is not the character string portion following the verb, the target keyword (“鰻” in the above case) corresponding to the hit character string portion is stored in the memory 7 and the hit character string portion Is removed from the misrecognized syllable character string (step 74a). The counter j is incremented, and the process proceeds to the process using the search pattern created from the next target keyword “鯛” (NO in steps 75a and 76a). Even when the character string portion matching the search pattern is not found, the counter j is incremented and the process proceeds to the process using the search pattern created from the next target keyword (NO in step 72a, and in steps 75a and 76a). NO, step 71a).

誤認識音節文字列「コカラウネギヲタベタイ」からヒットした文字列部分「ウネギ」を除去した文字列部分「コカラ」および「ヲタベタイ」のそれぞれが処理される（図１６（Ｂ））。「ヲタベタイ」に含まれる「タイ」の部分が，ターゲットキーワード「鯛」から作成される検索パターン「タ(.*?)イ」によってヒットする。しかしながら，ヒットした文字列部分「タイ」は動詞「タベ」に続く文字列部分であるので，ターゲットキーワード「鯛」はメモリ７に保存されずに，除去もされないことになる（ステップ73ａでYES ）。同様の処理が，次のターゲットキーワード「ネギ焼き」，さらに次のターゲットキーワード「ラーメン」などから作成された検索パターンを用いて行われる（ステップ75ａ，ステップ76ａでNO，ステップ71ａ）。 Each of the character string portions “kokara” and “wotabeitai” obtained by removing the hit character string portion “unigi” from the misrecognized syllable character string “cocaraunegiwotabaitai” is processed (FIG. 16B). The part of “Thailand” included in “Otabetai” is hit by the search pattern “Ta (. *?) I” created from the target keyword “鯛”. However, since the hit character string portion “Thai” is a character string portion following the verb “Tab”, the target keyword “鯛” is not saved in the memory 7 and is not removed (YES in Step 73a). . A similar process is performed using a search pattern created from the next target keyword “green onion” and the next target keyword “ramen” (step 75a, NO at step 76a, step 71a).

「ヲタベタイ」に含まれる「タベ」の文字列部分は，ターゲットキーワード「食べる」から作成される検索パターン「タ(.*?)ベ」によってヒットする（図１６（Ｃ））。ヒットした文字列部分に対応するターゲットキーワード（上述の場合「食べ」）がメモリ７に保存される。また，ヒットした文字列部分が誤認識音節文字列から除去される（ステップ74ａ）。 The character string portion of “Tabe” included in “Otabetai” is hit by the search pattern “Tabe (. *?) Be” created from the target keyword “Eat” (FIG. 16C). The target keyword (“eat” in the above case) corresponding to the hit character string portion is stored in the memory 7. In addition, the hit character string portion is removed from the misrecognized syllable character string (step 74a).

「タベ」の文字列部分がさらに除去されることで，残りは「コカラ」，「ヲ」および「タイ」となる（図１６（Ｄ））。これらの文字列部分のそれぞれについて，残りのターゲットキーワードから作成された検索パターンを用いた検索処理が行われる（ステップ75ａ，76ａでNO，71ａ）。全く文字列がヒットしなかった場合，そこで検索パターンを用いた検索処理は終了する（ステップ76ａでYES ）。結局，「鰻」と「食べる」の２つのターゲットキーワードがメモリ７に記憶される（図１６（Ｅ））。 The character string portion of “tab” is further removed, and the rest becomes “kokara”, “wo”, and “tie” (FIG. 16D). A search process using a search pattern created from the remaining target keywords is performed for each of these character string portions (NO in steps 75a and 76a, 71a). If no character string is hit, the search process using the search pattern ends there (YES in step 76a). Eventually, the two target keywords “鰻” and “eat” are stored in the memory 7 (FIG. 16E).

並列処理のそれぞれは同じ処理結果とならないことがある。図１７（Ａ）〜（Ｄ）は，ターゲットキーワード「鰻」よりも先にターゲットキーワード「ネギ焼き」から作成された検索パターンを用いた文字列検索が行われたときの処理の流れを示している。ターゲットキーワード「ネギ焼き」から作成された検索パターンを用いた検索処理が行われると，誤認識音節文字列「コカラウネギヲタベタイ」における「ネギ」の文字列部分がヒットする。これに続く処理を示す図１７（Ｂ），（Ｃ）は，上述した図１６（Ｃ），（Ｂ）とそれぞれ同じになる。結局，「ネギ焼き」と「食べる」の２つのターゲットキーワードがメモリ７に記憶される（図１７（Ｄ））。 Each parallel processing may not produce the same processing result. FIGS. 17A to 17D show the flow of processing when a character string search is performed using a search pattern created from the target keyword “green onion” prior to the target keyword “鰻”. Yes. When a search process using a search pattern created from the target keyword “green onion” is performed, the character string portion of “green onion” in the misrecognized syllable character string “Kokalau Negiwo Tabai” is hit. 17B and 17C showing the subsequent processing are the same as FIGS. 16C and 16B described above. Eventually, two target keywords of “green onion” and “eat” are stored in the memory 7 (FIG. 17D).

並列処理によって上述のように２つ以上の検索結果が生成された場合には，次のルールにしたがって，誤認識音節文字列中の文字列を置換するために用いるべきターゲットキーワードが選ばれる（図８，ステップ53）。 When two or more search results are generated by the parallel processing as described above, the target keyword to be used for replacing the character string in the misrecognized syllable character string is selected according to the following rule (see FIG. 8, Step 53).

（１）ターゲットキーワードの数が多い検索結果を採用する。
２つ以上の検索結果におけるターゲットキーワード数が同数である場合，
（２）同じターゲットキーワードが含まれていれば，そのターゲットキーワードを，置換に用いるターゲットキーワードに決定する。図１６（Ｅ）に示す検索結果と図１７（Ｄ）に示す検索結果には，同じターゲットキーワード「食べる」が含まれている。ターゲットキーワード「食べる」は置換すべきターゲットキーワードに決定される。
（３）異なるターゲットキーワードが含まれている場合には，ヒットした検索パターンに含まれる文字数が多いターゲットキーワードを，置換すべきターゲットキーワードに決定する。
（４）ヒットした検索パターンに含まれる文字数が同じであれば，ターゲットキーワードの読みの数が少ない方を，置換すべきターゲットキーワードに決定する。図１６（Ｅ）に示す検索結果と図１７（Ｄ）に示す検索結果であれば，ターゲットキーワード「ネギ焼き」ではなく，ターゲットキーワード「鰻」が，置換すべきターゲットキーワードに決定される。 (1) Use search results with a large number of target keywords.
If the number of target keywords in two or more search results is the same,
(2) If the same target keyword is included, the target keyword is determined as a target keyword used for replacement. The search result shown in FIG. 16 (E) and the search result shown in FIG. 17 (D) include the same target keyword “eat”. The target keyword “eat” is determined as the target keyword to be replaced.
(3) When different target keywords are included, a target keyword having a large number of characters included in the hit search pattern is determined as a target keyword to be replaced.
(4) If the number of characters included in the searched search pattern is the same, the target keyword to be replaced is determined to have the smaller number of target keyword readings. In the search result shown in FIG. 16E and the search result shown in FIG. 17D, the target keyword “ターゲット” is determined as the target keyword to be replaced instead of the target keyword “green onion”.

図６に戻って，誤認識音節文字列「コカラウネギヲタベタイ」のうち，決定されたターゲットキーワードによって対応する文字列部分が置換される（ステップ27）。上述の例では，「コカラ鰻ヲ食べタイ」の文字列が作成されることになる。上述したように，置換されたターゲットキーワードには，音節文字列利用処理によって置換された文字列部分であることがマーキングされる。 Returning to FIG. 6, the character string portion corresponding to the determined target keyword in the erroneously recognized syllable character string “Coca Raunegiwo Tabaitai” is replaced (step 27). In the above example, a character string of “Kokhara 鰻食べ eat Thai” is created. As described above, the replaced target keyword is marked as a character string portion replaced by the syllable character string use processing.

上述したラベリング処理（図６のステップ28）を経て，次に，ここまでの処理で生成された文字列のうちの，ターゲットキーワード以外の音節文字列部分に対する処理に進む。 After the above-described labeling process (step 28 in FIG. 6), the process proceeds to the process for the syllable character string part other than the target keyword in the character string generated by the process so far.

５．共起単語利用処理（ステップ29，30）
図１８は，図６のステップ29の処理を詳細に示すフローチャートである。 5. Co-occurrence word processing (steps 29 and 30)
FIG. 18 is a flowchart showing in detail the processing of step 29 in FIG.

はじめに共起単語データプール12（図３，図５）に登録されている複数の共起単語のうち，上述ようにして決定されたターゲットキーワード「鰻」および「食べる」についての共起単語が読み出される（ステップ81）。なお，決定されたターゲットキーワードと同一の共起単語および重複した共起単語は読み出されずに削除される（ステップ82）。 First, among the plurality of co-occurrence words registered in the co-occurrence word data pool 12 (FIGS. 3 and 5), the co-occurrence words for the target keywords “鰻” and “eat” determined as described above are read out. (Step 81). Note that the same co-occurrence words and duplicate co-occurrence words with the determined target keyword are deleted without being read (step 82).

読み出された共起単語のそれぞれについて検索パターンが作成される（ステップ83）。検索パターンの作成処理のアルゴリズムは，上述したターゲットキーワードのそれぞれについての検索パターンの作成処理のアルゴリズムと同じである（図９，図１０参照）。図１９に示すように，共起単語のそれぞれについて，一または複数の検索パターンが生成される。 A search pattern is created for each of the read co-occurrence words (step 83). The search pattern creation processing algorithm is the same as the search pattern creation processing algorithm for each of the target keywords described above (see FIGS. 9 and 10). As shown in FIG. 19, one or a plurality of search patterns are generated for each co-occurrence word.

次に，誤認識音節文字列からターゲットキーワードを除いた残りの文字列部分に対して，共起単語から作成された検索パターンを用いて文字列検索処理が行われる（ステップ84）。この処理も，上述したターゲットキーワードから作成される検索パターンを用いた文字列検索処理のアルゴリズムと同じアルゴリズムで行われる（図１１参照）。読み出された共起単語のそれぞれから作成された検索パターンを用いた検索処理は，並列的に行われる。 Next, a character string search process is performed on the remaining character string portion obtained by removing the target keyword from the misrecognized syllable character string using a search pattern created from the co-occurrence words (step 84). This process is also performed by the same algorithm as that of the character string search process using the search pattern created from the target keyword described above (see FIG. 11). Search processing using a search pattern created from each of the read co-occurrence words is performed in parallel.

「コカラ鰻ヲ食べタイ」のうち，ターゲットキーワード「鰻」および「食べ」を除いた文字列部分は，「コカラ」，「ヲ」および「タイ」である。これらの文字列部分に合致する，またはこれらの文字列部分に類似する共起単語が検索されることになる。 The character string portion of the “Kokhara 鰻 wo eat Thai” excluding the target keywords “鰻” and “eat” is “Kokhara”, “wo” and “Thai”. Co-occurrence words that match or are similar to these character string portions are searched.

たとえば，共起単語「これから」から作成される複数の検索パターンのうち，「コ(.*?)カ(.*?)ラ」の検索パターンによって「コカラ」の文字列部分がヒットする。ヒットした文字列部分「コカラ」が，共起単語「これから」によって置換されることになる（図６，ステップ30）。 For example, among the plurality of search patterns created from the co-occurrence word “from now on”, the character string portion of “Kokhara” is hit by the search pattern of “ko (. *?) Ka (. *?) La”. The hit character string portion “Kokhara” is replaced by the co-occurrence word “from now on” (FIG. 6, step 30).

ここまでの処理を経て置換されていない文字列部分はそのままとされる（図６，ステップ31）。すなわち，置換されていない文字列部分「ヲ」および「タイ」はそのまま用いられる。これにより「これから鰻を食べたい」の文字列が完成する。 The character string portion that has not been replaced through the processing so far is left as it is (FIG. 6, step 31). That is, the non-replaced character string parts “wo” and “tie” are used as they are. As a result, the character string “I want to eat rice cake” is completed.

カテゴリ「グルメ」に関するターゲットキーワードが格納されたターゲットキーワード・データプール11および共起単語データプール12を用いた誤認識修正処理に加えて，上述したように，カテゴリ「観光」に関するターゲットキーワードが格納されたターゲットキーワード・データプール13および共起単語データプール14を用いた誤認識修正処理，カテゴリ「宿泊施設」に関するターゲットキーワードが格納されたターゲットキーワード・データプール15および共起単語データプール16を用いた誤認識修正処理も，平行して行われる（図３参照）。 In addition to the misrecognition correction process using the target keyword data pool 11 and the co-occurrence word data pool 12 in which the target keyword related to the category “gourmet” is stored, the target keyword related to the category “tourism” is stored as described above. Misrecognition correction processing using the target keyword / data pool 13 and the co-occurrence word data pool 14, and the target keyword / data pool 15 and the co-occurrence word data pool 16 storing the target keywords related to the category “accommodation facility” were used. The erroneous recognition correction processing is also performed in parallel (see FIG. 3).

図２０は，カテゴリごとに誤認識修正処理が行われることで作成される，複数の処理後文字列（解析結果，誤認識修正結果）のうち，最終結果として選択すべき文字列（カテゴリ）を決定する処理を示している。 FIG. 20 shows a character string (category) to be selected as a final result among a plurality of post-processing character strings (analysis result, erroneous recognition correction result) created by performing the erroneous recognition correction process for each category. The process to determine is shown.

カテゴリ決定処理（ステップ91）では，以下のアルゴリズムによって最終結果として選択すべき文字列（カテゴリ）が決定される。 In the category determination process (step 91), the character string (category) to be selected as the final result is determined by the following algorithm.

（１）漢字変換データ利用処理でマーキングされた文字列部分の数が多いものを，最終結果として選択する。
（２）漢字変換データ利用処理でマーキングされた文字列部分の数が同数であれば，音節文字列利用処理において置換されたターゲットキーワードの数が多いものを，最終結果として選択する。
（３）音節文字列利用処理において置換されたターゲットキーワードの数も同数であれば，共起単語利用処理において置換された共起単語の数が多いものを，最終結果として選択する。 (1) A character string portion marked in the kanji conversion data utilization process is selected as the final result.
(2) If the number of character string parts marked in the Kanji conversion data use processing is the same, the one with the large number of target keywords replaced in the syllable character string use processing is selected as the final result.
(3) If the number of target keywords replaced in the syllable character string use processing is also the same, the one having a large number of co-occurrence words replaced in the co-occurrence word use processing is selected as the final result.

一般にグルメに関する言葉を含む音声データであれば，グルメに関するデータプール11，12を用いて得られた文字列が最終結果として選択されやすい。もっとも，カテゴリごとの処理後文字列（誤認識修正結果）のすべてをユーザに提示するようにしてもよい。 In general, in the case of audio data including words relating to gourmet, the character strings obtained using the data pools 11 and 12 relating to gourmet are easily selected as the final result. However, all of the processed character strings (misrecognition correction results) for each category may be presented to the user.

上述した実施例では，誤認識修正処理の中で検索パターン（ターゲットキーワードについての検索パターンおよび共起単語についての検索パターン）を作成しているが，あらかじめ検索パターンを作成しておき，データプール11，12等に記憶させてもよい。 In the embodiment described above, a search pattern (search pattern for target keywords and search pattern for co-occurrence words) is created in the erroneous recognition correction process. , 12 etc. may be stored.

図２１は，サーバ10において実行される誤認識自動修正処理の変形例を示すフローチャートである。図６に示すフローチャートと同一の処理には同一の符号を付し，重複した説明を避ける。 FIG. 21 is a flowchart showing a modified example of the erroneous recognition automatic correction process executed in the server 10. The same processes as those in the flowchart shown in FIG. 6 are denoted by the same reference numerals to avoid redundant description.

上述した実施例の誤認識自動修正処理では，かな漢字変換利用処理はマーキング（図６のステップ25）およびカテゴリの決定（図２０のステップ91）を行うために用いられ，音節文字列利用処理（ステップ26，27）は音声認識処理（ステップ22）によって得られた誤認識音節文字列全体を処理対象にしている。図２１に示す処理では，かな漢字変換利用処理でターゲットキーワードが見つかった場合，見つかったターゲットキーワードを最終的な処理結果に常に含ませ，残りの音節文字列部分を音節文字列利用処理（ステップ26，27）とすることが行われる。 In the erroneous recognition automatic correction process of the above-described embodiment, the kana-kanji conversion use process is used for marking (step 25 in FIG. 6) and determining a category (step 91 in FIG. 20). 26 and 27) target the entire erroneously recognized syllable character string obtained by the speech recognition process (step 22). In the process shown in FIG. 21, when a target keyword is found in the Kana-Kanji conversion use process, the found target keyword is always included in the final process result, and the remaining syllable character string portion is processed as a syllable character string use process (step 26, 27) is performed.

誤認識音節文字列「コカラウネギヲタベタイ」をかな漢字変換することで「個から畝議を食べたい」が生成され，このかな漢字変換データのうち，ターゲットキーワード・データプール11（図４参照）に「食べ」が記憶されている場合，文字列部分「食べ」は，その時点で最終的な処理結果に用いることが確定される。文字列部分「食べ」以外の部分が読みデータに戻される（ステップ95）。文字列部分「食べ」を除いて，「コカラウネギヲ」の読みデータと「タイ」の読みデータとが，音節文字列利用処理（ステップ26，27）の処理対象とされる。音節文字列利用処理の負担を軽くすることができる。 The kana-kanji conversion of the misrecognized syllable character string “Koca Raunegiwo Tabaitai” generates “I want to eat a debate from the individual”. When “eat” is stored, the character string portion “eat” is determined to be used as the final processing result at that time. The portion other than the character string portion “eat” is returned to the reading data (step 95). Except for the character string portion “eat”, the reading data of “Coca Raunegiwo” and the reading data of “Thai” are the processing target of the syllable character string use processing (steps 26 and 27). The burden of syllable character string use processing can be reduced.

１クライアント（スマートフォン）
３マイクロフォン
５ＣＰＵ（音声認識処理手段，単語検索手段，単語置換手段，第１および第２の検索パターン作成手段，キーワード選択手段，共起単語検索手段，共起単語置換手段）
７メモリ
８記憶装置
10 サーバ・コンピュータ
11，13，15 ターゲットキーワード・データプール（単語プール）
12，14，16 共起単語データプール（共起単語プール） 1 Client (smartphone)
3 Microphone 5 CPU (voice recognition processing means, word search means, word replacement means, first and second search pattern creation means, keyword selection means, co-occurrence word search means, co-occurrence word replacement means)
7 Memory 8 Storage device
10 Server computer
11, 13, 15 Target keyword data pool (word pool)
12, 14, 16 Co-occurrence word data pool (co-occurrence word pool)

Claims

Speech recognition processing means for converting a given speech into a syllable string;
A word pool in which multiple words and their syllable strings are stored in association with each other,
A word search means for searching for a word having a syllable character string that matches a syllable character string portion included in the speech recognition syllable character string or a syllable character string similar to the syllable character string part of the plurality of words stored in the word pool , And word replacement means for replacing the corresponding syllable character string part included in the speech recognition syllable character string by the word found by the word search means,
The word search means
First search pattern creation means for creating a search pattern including a regular expression for each word stored in the word pool, and syllable characters that match the search pattern created by the first search pattern creation means Including means for searching for the sequence part from the speech recognition syllable character string.
False recognition correction device.

Kana-kanji conversion means for converting the speech recognition syllable character string into kana-kanji characters, and means for determining whether or not the words stored in the word pool are included in the kana-kanji converted character string,
When it is determined that the word stored in the word pool is included in the character string converted into Kana-Kanji, the speech recognition syllable character string portion excluding the word is the word search means and the word Processed by replacement means,
The erroneous recognition correction apparatus according to claim 1.

The above search pattern is
A search pattern that can search for a syllable character string that lacks any one of the syllable character strings of a word.
The erroneous recognition correction apparatus according to claim 1 or 2.

The above search pattern is
A search pattern that can search for a syllable string in which any one of the syllable strings of a word is replaced with another character.
The misrecognition correction apparatus according to any one of claims 1 to 3.

The above search pattern is
A search pattern that can search for a syllable string in which another character is inserted between characters included in the syllable string of a word.
The misrecognition correction apparatus according to any one of claims 1 to 4.

A co-occurrence word pool in which a co-occurrence word and a syllable character string of each of a plurality of words stored in the word pool are stored in association with each other;
Keyword selection means for selecting a word found by the word search means as a target keyword;
A co-occurrence word specifying means for specifying a co-occurrence word for the target keyword from co-occurrence words stored in the co-occurrence word pool;
Co-occurrence searching for a co-occurrence word having a syllable character string similar to the speech recognition syllable character string part obtained by removing the target keyword from the speech recognition syllable character string from the co-occurrence words specified by the co-occurrence word specifying means Word search means, and co-occurrence word replacement means for replacing the corresponding syllable character string portion included in the speech recognition syllable character string by the co-occurrence word found by the co-occurrence word search means,
The erroneous recognition automatic correction device according to any one of claims 1 to 5.

The co-occurrence word search means is
For each of the co-occurrence words stored in the co-occurrence word pool, a second search pattern creation means for creating a search pattern including a regular expression, and a search pattern created by the second search pattern creation means Including means for searching for a matching syllable string part from the speech recognition syllable string part excluding the target keyword,
The misrecognition correcting apparatus according to claim 6.

A plurality of field word pools each storing only words frequently used in a specific field,
The word search processing by the word search means and the replacement processing of the syllable character string portion by the word replacement means are executed using each of a plurality of field-specific word pools.
The misrecognition correction apparatus according to any one of claims 1 to 7.

A plurality of field co-occurrence word pools corresponding to each of the plurality of field word pools are provided.
The co-occurrence word specifying process by the co-occurrence word specifying means, the co-occurrence word searching process by the co-occurrence word searching means, and the replacement process of the syllable character string part by the co-occurrence word replacing means are a plurality of field-specific co-occurrence word pool Are executed using each of the
The misrecognition correcting device according to claim 8.

A program for causing a computer to function as a misrecognition correcting device,
Recognizes the given speech and converts it to a syllable string,
A syllable character string that matches a syllable character string part included in the speech recognition syllable character string among words stored in a word pool in which a plurality of words and their syllable character strings are stored in association with each other, or Search for words with similar syllable strings,
The found word replaces the corresponding syllable string part included in the speech recognition syllable string,
Here, for the word search, for each word stored in the word pool, a search pattern including a regular expression is created, and the syllable character string portion matching the created search pattern is defined as the speech recognition syllable character string. Searching from within,
program.