JPH09237322A

JPH09237322A - Candidate correcting system for recognizing device

Info

Publication number: JPH09237322A
Application number: JP8344865A
Authority: JP
Inventors: Masahiko Hamanaka; 雅彦濱中
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1995-12-28
Filing date: 1996-12-25
Publication date: 1997-09-09
Anticipated expiration: 2016-12-25
Also published as: JP2894305B2

Abstract

PROBLEM TO BE SOLVED: To improve a recognition rate by utilizing operator's recognizing result correcting information for later recognizing processing. SOLUTION: A recognizing means 2 compares an inputted character image with the standard feature of each category of a recognizing dictionary 3 to obtain plural high-ordered recognizing candidate categories of which distance value being a difference between both of them is small. When a pair including an erroneous reading category coinciding with a first recognizing candidate category is stored in a correcting table 5, a candidate correcting means 4 subtracts a value corresponding to information for correction in the pair from the distance value of the recognizing candidate category coinciding with a right answer category in the pair and after then, sort it by the distance value to change a candidate order. A result correcting means 6 outputs a first-order candidate after this change to a display means 6 as a recognizing result. When the recognizing result is wrong and the operator inputs the right answer category, a table updating means 9 additionally registers the right answer category and the pair of the distance difference of them in the correcting table 5.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は帳票上に書かれた文
字を認識する文字認識装置等のデータ認識装置に関し、
より詳細には、認識結果に誤りがあった為にオペレータ
が正しい認識結果に訂正した場合に、その訂正情報を以
降の認識処理に活用して認識精度を高めるようにした認
識装置の候補修正方式に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a data recognition device such as a character recognition device for recognizing characters written on a form,
More specifically, when an operator corrects a correct recognition result due to an error in the recognition result, the correction information is utilized in the subsequent recognition processing to improve the recognition accuracy, so that the recognition accuracy is improved. Regarding

【０００２】[0002]

【従来の技術】文字認識装置、特に手書文字認識装置に
おいては認識率を１００％にすることが極めて困難であ
る。このため、認識結果を表示し、オペレータが目視で
チェックして、誤読している文字は正しい文字に置き換
えるような半自動式が一般的に採用されている。そし
て、このようなオペレータによる認識結果訂正作業を支
援するために従来より種々の技術が提案されている。2. Description of the Related Art It is extremely difficult to achieve a recognition rate of 100% in a character recognition device, especially a handwritten character recognition device. For this reason, a semi-automatic system is generally adopted in which the recognition result is displayed, the operator visually checks it, and replaces the erroneously read character with a correct character. Various techniques have been conventionally proposed to support the recognition result correction work by such an operator.

【０００３】図１４は従来の文字認識装置の構成例を示
すブロック図であり、特開昭６１−７４０８１号公報
（以下、文献１と称す）に記載された文字認識装置の構
成を機能ブロック図化したものである。FIG. 14 is a block diagram showing an example of the configuration of a conventional character recognition device, and is a functional block diagram of the configuration of the character recognition device described in Japanese Patent Laid-Open No. 61-74081 (hereinafter referred to as Document 1). It has been transformed.

【０００４】図１４において、帳票上などに書かれた文
字はスキャナ等のデータ入力手段１０によって文字イメ
ージ画像として入力され、認識手段２２に与えられる。
認識辞書２１には文字カテゴリ毎の文字パターンの標準
特徴が記憶されており、認識手段２２は、文字イメージ
画像から認識用特徴を抽出し、これと認識辞書２１中の
各文字カテゴリ毎の標準特徴とを比較して例えば第５位
候補までの認識候補カテゴリを求め、候補追加手段５０
に出力する。In FIG. 14, a character written on a form or the like is input as a character image image by the data input means 10 such as a scanner and given to the recognition means 22.
The recognition dictionary 21 stores standard features of character patterns for each character category, and the recognition unit 22 extracts recognition features from a character image image, and the standard features for each character category in the recognition dictionary 21. And the recognition candidate categories up to the fifth candidate are calculated, and the candidate adding means 50 is added.
Output to

【０００５】候補追加手段５０は、必要に応じて後述す
る候補追加処理を施したのち、全ての認識候補カテゴリ
を結果訂正手段４０に出力し、結果訂正手段４０は、そ
の第１候補の認識候補カテゴリを認識結果として表示手
段４１に表示する。The candidate adding means 50 performs a candidate adding process, which will be described later, if necessary, and then outputs all recognition candidate categories to the result correcting means 40, and the result correcting means 40 recognizes the first candidate recognition candidate. The category is displayed on the display unit 41 as a recognition result.

【０００６】オペレータは、表示手段４１に表示された
認識結果を見て、帳票上等の文字が正しく認識されたか
否かをチェックする。そして、誤読している場合、キー
ボード等の訂正情報入力手段４２から誤読している旨を
入力する。結果訂正手段４０は、これに応じて候補追加
手段５０から出力されていた第２候補以降の認識候補カ
テゴリを表示手段４１に表示する。The operator looks at the recognition result displayed on the display means 41 and checks whether or not the characters on the form are correctly recognized. Then, in the case of misreading, the fact that the misreading is performed is input from the correction information input means 42 such as a keyboard. In response to this, the result correcting means 40 displays on the display means 41 the recognition candidate categories after the second candidate outputted from the candidate adding means 50.

【０００７】従って、例えば、帳票上に書かれた文字が
「み」であったのにかかわらず、認識手段２２で決定さ
れた認識候補が上位より順に「巾」，「け」，「サ」，
「Ｈ」，「や」であり、候補追加手段５０で新たな候補
の追加がなかった場合、表示手段４１には認識結果とし
て第１候補の「巾」が表示され、オペレータが誤読であ
る旨を指示すると、残りの候補「け」，「サ」，
「Ｈ」，「や」が表示手段４１に表示される。Therefore, for example, although the character written on the form is "mi", the recognition candidates determined by the recognition means 22 are "width", "ke", and "sa" in order from the top. ，
If the candidate addition means 50 does not add a new candidate, the width of the first candidate is displayed on the display means 41 as a recognition result, and the operator misreads. , The remaining candidates “ke”, “sa”,
“H” and “ya” are displayed on the display means 41.

【０００８】オペレータは、この表示された候補中に正
解が含まれていれば、それを選択することにより結果訂
正手段４０に認識結果をその選択した候補に変更する処
理を行わせるわけであるが、前述のように正解が候補中
にない場合には、例えばかな漢字変換等により訂正情報
入力手段４２から正解の「み」を入力して、訂正を指示
する。結果訂正手段４０は、正解が入力されると、認識
結果をその正解に変更する処理を行うと共に、誤読カテ
ゴリ（第１候補であった「巾」のカテゴリ）と正解カテ
ゴリ（オペレータにより入力された「み」のカテゴリ）
とをテーブル更新手段５２に送る。テーブル更新手段５
２は、この誤読カテゴリと正解カテゴリとの組を、それ
が未登録ならカテゴリテーブル５１に追加登録する。If the displayed candidate includes a correct answer, the operator selects the correct answer to cause the result correcting means 40 to change the recognition result to the selected candidate. As described above, when the correct answer is not in the candidates, the correct answer "mi" is input from the correction information input means 42 by, for example, kana-kanji conversion, and the correction is instructed. When the correct answer is input, the result correcting means 40 performs a process of changing the recognition result to the correct answer, and at the same time, the misreading category (the category of “width” which was the first candidate) and the correct answer category (input by the operator). "Mi" category)
And are sent to the table updating means 52. Table updating means 5
2 additionally registers the set of the misread category and the correct answer category in the category table 51 if it is not registered.

【０００９】さて、このようなカテゴリテーブル５１へ
の登録が行われると、その後に再び同様な書体の「み」
の認識が試みられ、例えば先と同様に認識手段２２が上
位より順に「巾」，「け」，「サ」，「Ｈ」，「や」の
認識候補を出力して候補追加手段５０に出力すると、候
補追加手段５０による候補追加処理で「み」が新たな候
補に追加される。即ち、候補追加手段５０は、認識手段
２２から送られてきた第１候補のカテゴリ「巾」と同一
カテゴリである誤読カテゴリ「巾」をカテゴリテーブル
５１から検索し、見つけるとその誤読カテゴリ「巾」と
対を成す正解カテゴリ「み」を第２候補として候補に追
加する。従って、上述の場合、新たな認識候補は、
「巾」，「み」，「け」，…，となり、最初に表示手段
４１に表示される認識結果は「巾」で前回と変わらない
が、オペレータが誤読を指示したときには、前回表示さ
れなかった「み」が認識候補として表示手段４１に表示
されることになり、正解への訂正は表示された「み」を
選択するだけで済む。[0009] By the way, when such a category table 51 is registered, after that, a similar typeface "mi" is again displayed.
The recognition means 22 outputs the recognition candidates of “width”, “ke”, “sa”, “H”, and “ya” in order from the higher order and outputs them to the candidate addition means 50 as in the above. Then, "mi" is added to the new candidate by the candidate addition processing by the candidate addition means 50. That is, the candidate adding means 50 searches the category table 51 for a misreading category “width” that is the same category as the category “width” of the first candidate sent from the recognizing means 22, and if found, the misreading category “width”. The correct answer category "mi" paired with is added to the candidates as the second candidate. Therefore, in the above case, the new recognition candidate is
“Width”, “mi”, “ke”, ..., The recognition result first displayed on the display means 41 is “width”, which is the same as the previous time, but is not displayed last time when the operator gives an error reading. The "mi" will be displayed on the display means 41 as a recognition candidate, and the correct answer can be corrected simply by selecting the displayed "mi".

【００１０】なお、図１４に示した従来技術と同様の技
術は特開平１−１００６８６号公報（以下、文献２と称
す）にも記載されている。A technique similar to the conventional technique shown in FIG. 14 is also described in Japanese Patent Application Laid-Open No. 1-100686 (hereinafter referred to as Document 2).

【００１１】他方、特開昭５９−２７３７５号公報（以
下、文献３と称す），特開昭６３−２０８１８０号公報
（以下、文献４と称す）および特開平５−７３７０９号
公報（以下、文献５と称す）には、認識結果として表示
された一連の文字列において或る１つの文字の認識結果
の訂正を行った場合、それより以降の認識結果中に同様
の訂正を行うべき箇所が存在するか否かを自動的に検出
し、オペレータによる一度の訂正操作で同種の複数の誤
読部分の訂正を一括して行えるようにした技術が記載さ
れている。On the other hand, JP-A-59-27375 (hereinafter referred to as Document 3), JP-A-63-208180 (hereinafter referred to as Document 4) and JP-A-5-73709 (hereinafter referred to as Document 3). 5)), when the recognition result of a certain character is corrected in the series of character strings displayed as the recognition result, there is a place where the same correction should be performed in the subsequent recognition results. There is described a technology that automatically detects whether or not to do so and can collectively correct a plurality of misread portions of the same type by a single correction operation by an operator.

【００１２】[0012]

【発明が解決しようとする課題】このように従来の文字
認識装置においては、オペレータによる認識結果訂正作
業を支援するために種々の技術が提案されているが、以
下のような問題点があった。As described above, in the conventional character recognition device, various techniques have been proposed to assist the operator in correcting the recognition result, but there are the following problems. .

【００１３】文献１および文献２に示される技術では、
カテゴリテーブル５１に登録された正解カテゴリを固定
の順位（前述の例では第２位）の認識候補として追加す
るだけであるため、最初に表示される認識結果は常に認
識手段が第１位候補としたものに限られる。つまり、オ
ペレータによる認識結果訂正情報は第１位候補に影響を
全く与えない。従って、候補中に正解が含まれる正解含
有率は改善されても、第１位候補が正解となる認識率は
向上させることができない。このため、オペレータによ
る訂正回数自体を削減することは難しい。According to the techniques disclosed in Document 1 and Document 2,
Since the correct answer category registered in the category table 51 is only added as a recognition candidate having a fixed rank (second rank in the above example), the recognition result displayed first is always the recognition means being the first rank candidate. Limited to what you have done. That is, the recognition result correction information by the operator does not affect the first-ranked candidate at all. Therefore, even if the correct answer content rate in which the correct answer is included in the candidates is improved, the recognition rate in which the first-ranked candidate is the correct answer cannot be improved. Therefore, it is difficult to reduce the number of corrections performed by the operator.

【００１４】また文献３乃至文献５に示される技術は、
一連の認識結果に対する訂正作業の効率化は実現される
が、その訂正作業においてオペレータが入力した認識結
果訂正情報は今回の認識結果の訂正だけに利用されるも
のに過ぎないため、以降の新たな認識対象文字列に対す
る認識では、再び同じ結果をもたらす。即ち、オペレー
タの認識結果訂正情報を活用して認識率を向上させるこ
とができていない。Further, the techniques disclosed in Documents 3 to 5 are
Although the efficiency of the correction work for a series of recognition results is realized, the recognition result correction information input by the operator in the correction work is used only for the correction of the recognition result this time. The recognition of the recognition target character string produces the same result again. That is, the recognition rate cannot be improved by utilizing the recognition result correction information of the operator.

【００１５】本発明はこのような従来の問題点を解決し
たもので、その目的は、オペレータの認識結果訂正情報
を以降の認識処理に活用し、候補中に正解が含まれる正
解含有率だけでなく、第１位候補が正解となる認識率を
も向上させ得るようにすることにある。The present invention has solved such a conventional problem, and its purpose is to utilize the recognition result correction information of the operator for the subsequent recognition processing, and to determine the correct answer content rate in which the correct answer is included in the candidates. In other words, it is to improve the recognition rate in which the first candidate is the correct answer.

【００１６】[0016]

【課題を解決するための手段】本発明は、文字イメージ
画像などの認識対象データから認識用特徴を抽出し、認
識辞書に予め登録されている各カテゴリ毎の標準特徴と
比較して、認識用特徴と標準特徴との差である距離値が
小さい上位複数個の認識候補カテゴリ及びそれらの前記
距離値を求める認識手段と、誤読カテゴリと正解カテゴ
リと修正用情報との組を格納する修正用テーブルと、前
記認識手段で求められた認識候補カテゴリ群における第
１位の認識候補カテゴリと一致する誤読カテゴリを含む
組が前記修正用テーブルに格納されている場合に、その
組中の正解カテゴリと一致する認識候補カテゴリの距離
値をその組中の修正用情報に応じた値だけ減じる候補修
正手段と、該候補修正手段による処理後の認識候補カテ
ゴリ群における第１位の認識候補カテゴリを認識結果と
して出力すると共に、訂正情報入力手段から入力される
訂正情報に従って前記認識結果を訂正する結果訂正手段
と、該結果訂正手段による訂正時に、誤読カテゴリと正
解カテゴリと修正用情報との組を前記修正用テーブルに
追加するテーブル更新手段とを備えている。According to the present invention, a recognition feature is extracted from recognition target data such as a character image image and compared with a standard feature for each category registered in a recognition dictionary in advance to recognize the recognition feature. A plurality of recognition candidate categories having a small distance value which is a difference between the feature and the standard feature, a recognition means for obtaining the distance values thereof, and a correction table for storing a set of misreading category, correct category and correction information. And a set including a misreading category that matches the first-ranked recognition candidate category in the recognition candidate category group obtained by the recognition means is stored in the correction table, it matches the correct category in the set. Candidate correction means for reducing the distance value of the recognition candidate category by a value according to the correction information in the set, and the first candidate in the recognition candidate category group after processing by the candidate correction means. Position recognition candidate category is output as a recognition result, and the result correction means corrects the recognition result in accordance with the correction information inputted from the correction information input means, and a misread category, a correct answer category and a correction category are corrected at the time of correction by the result correction means. Table updating means for adding a combination with the usage information to the correction table.

【００１７】また、前記候補修正手段は、前記正解カテ
ゴリと一致する認識候補カテゴリが前記認識手段で求め
られた認識候補カテゴリ群に存在しないときは、前記正
解カテゴリと一致する、所定の距離値を付与した認識候
補カテゴリを前記認識候補カテゴリ群に追加した上で前
記距離値の修正を行う構成を有している。Further, when the recognition candidate category matching the correct answer category does not exist in the recognition candidate category group obtained by the recognizing means, the candidate correcting means sets a predetermined distance value that matches the correct answer category. The distance value is corrected after adding the given recognition candidate category to the recognition candidate category group.

【００１８】前記テーブル更新手段が前記修正用テーブ
ルに追加する組中の修正用情報としては、例えば、（１）誤読カテゴリの距離値と正解カテゴリの距離値と
の差である距離差。（２）誤読カテゴリの距離値と正解カテゴリの距離値と
の差である距離差、および認識対象データから抽出され
た認識用特徴。が用いられる。The correction information in the set added to the correction table by the table updating means is, for example, (1) a distance difference which is a difference between the distance value of the misread category and the distance value of the correct category. (2) A distance difference, which is the difference between the distance value of the misread category and the distance value of the correct category, and the recognition feature extracted from the recognition target data. Is used.

【００１９】前記（１）のような修正用情報を用いる場
合、前記候補修正手段は、例えば次式によって距離値
を修正する。When the correction information as in the above (1) is used, the candidate correction means corrects the distance value by the following equation, for example.

【００２０】Ｄ’＝Ｄ−ｗ・ｓ … ここで、Ｄ’は修正後の距離値，Ｄは修正前の距離値，
ｗは重み係数，ｓは修正情報である距離差である。D ′ = D−w · s where D ′ is the corrected distance value, D is the uncorrected distance value,
w is a weighting coefficient, and s is a distance difference which is correction information.

【００２１】また前記（２）のような修正用情報を用い
る場合、前記候補修正手段は、例えば次式によって距
離値を修正する。Further, when the correction information as in the above (2) is used, the candidate correction means corrects the distance value by the following equation, for example.

【００２２】Ｄ’＝Ｄ−ｗ・ｓ・ａ／ｄ … ここで、Ｄ’は修正後の距離値，Ｄは修正前の距離値，
ｗは重み係数，ｓは修正情報中の距離差，ｄは修正用情
報中の認識用特徴と今回の認識対象データの認識用特徴
との距離値，ａは距離値ｄを正規化する定数である基準
距離値である。また、前記候補修正手段は、前記認識手
段で求められた認識候補カテゴリ群における第１位の認
識候補カテゴリが前記修正用テーブルを用いた候補修正
により変更された場合に、新しい第１位候補カテゴリを
用いて手再度候補修正を行う構成を有している。D ′ = D−w · s · a / d where D ′ is the corrected distance value, D is the uncorrected distance value,
w is a weighting coefficient, s is a distance difference in the correction information, d is a distance value between the recognition feature in the correction information and the recognition feature of the current recognition target data, and a is a constant for normalizing the distance value d. It is a certain reference distance value. Further, the candidate correction means is a new first-ranked candidate category when the first-ranked recognition candidate category in the recognition candidate category group obtained by the recognition means is changed by the candidate correction using the correction table. Has a configuration in which the candidate is corrected again by using.

【００２３】上述のように構成された本発明にあって
は、誤読時にオペレータが正しいカテゴリに訂正する
と、その正解カテゴリと誤読カテゴリ（装置が第１位候
補として出力したカテゴリ）と修正用情報との組が修正
用テーブルに登録される。そして、以降、その誤読カテ
ゴリと同じカテゴリが第１位の認識候補カテゴリとして
求められた場合、その誤読カテゴリに対応する正解カテ
ゴリと同じカテゴリの認識候補カテゴリの距離値が減じ
られる。従って、減じられる値の大きさによっては、第
１位候補に躍り出る場合がある。これが前記文献１及び
文献２に記載された技術と大きく相違するところであ
り、最初に表示される認識結果は常に認識手段が第１位
候補としたものに限られず、以前に正解カテゴリとして
入力したカテゴリが認識結果として出力され得る場合が
ある。このような作用により、第１位候補が正解となる
認識率を向上させることができる。In the present invention configured as described above, when the operator corrects the correct category at the time of misreading, the correct category, the misreading category (the category output by the device as the first candidate) and the correction information are provided. Is registered in the correction table. After that, when the same category as the misreading category is obtained as the first recognition candidate category, the distance value of the recognition candidate category of the same category as the correct answer category corresponding to the misreading category is reduced. Therefore, depending on the size of the value to be reduced, it may jump to the first place candidate. This is a big difference from the techniques described in Documents 1 and 2, and the recognition result displayed first is not limited to the one in which the recognition means is always the first candidate, and is the category previously input as the correct category. May be output as a recognition result. With such an action, it is possible to improve the recognition rate in which the first candidate is the correct answer.

【００２４】[0024]

【発明の実施の形態】次に本発明の実施の形態の例につ
いて図面を参照して詳細に説明する。BEST MODE FOR CARRYING OUT THE INVENTION Next, an example of an embodiment of the present invention will be described in detail with reference to the drawings.

【００２５】図１は本発明を適用した文字認識装置の一
実施例のブロック図である。同図に示すように本実施例
の文字認識装置は、データ入力手段１と、認識手段２
と、認識辞書３と、候補修正手段４と、修正用テーブル
５と、結果訂正手段６と、表示手段７と、訂正情報入力
手段８と、テーブル更新手段９とを備えている。FIG. 1 is a block diagram of an embodiment of a character recognition device to which the present invention is applied. As shown in the figure, the character recognition device of this embodiment includes a data input means 1 and a recognition means 2.
The recognition dictionary 3, the candidate correction means 4, the correction table 5, the result correction means 6, the display means 7, the correction information input means 8 and the table update means 9 are provided.

【００２６】また、図２は図１の実施例の動作フローチ
ャートである。以下、図１および図２を参照して本実施
例の各部の機能ならびに動作を説明する。FIG. 2 is an operation flowchart of the embodiment shown in FIG. Hereinafter, the function and operation of each unit of this embodiment will be described with reference to FIGS. 1 and 2.

【００２７】データ入力手段１は、例えばスキャナ等で
構成され、例えば図示しない帳票上に書かれた手書き文
字を走査して各文字毎の文字イメージ画像を生成し、認
識手段２に出力する。The data input means 1 is composed of, for example, a scanner, and scans, for example, handwritten characters written on a form (not shown) to generate a character image image for each character and outputs it to the recognition means 2.

【００２８】認識辞書３には、文字カテゴリ毎の標準文
字パターンの認識用特徴（標準特徴）が予め記憶されて
いる。認識手段２は、データ入力手段１から入力された
文字イメージ画像から認識用特徴を抽出し、認識辞書３
に予め記憶されている文字カテゴリ毎の標準特徴と比較
して、距離値が小さい上位複数個（例えば第５位までの
５個）の認識候補カテゴリと、それら各々の距離値とを
求める（Ｓ１）。この求められた認識候補カテゴリ群と
各距離値は候補修正手段４に出力される。ここで、距離
値とは認識用特徴と標準特徴との差であり、類似度が高
いほど値が小さくなる。認識対象文字の認識用特徴と認
識辞書中の標準特徴との距離値を求めて文字認識を行う
手法としては従来より各種のものが提案されており、本
発明はその任意の手法を用いることができる。例えば、
このような手法を記載した文献として、『方向パタンマ
ッチング法の改良と手書き漢字認識への応用』（１９９
０年６月，電子情報通信学会研究会技報，ＰＲＵ９０−
２０）がある。The recognition dictionary 3 stores in advance the recognition characteristics (standard characteristics) of the standard character pattern for each character category. The recognition unit 2 extracts the recognition feature from the character image image input from the data input unit 1, and the recognition dictionary 3
In comparison with the standard features stored in advance in each character category, a plurality of recognition candidate categories with a small distance value (for example, five up to the fifth place) and their respective distance values are obtained (S1). ). The obtained recognition candidate category group and each distance value are output to the candidate correction means 4. Here, the distance value is the difference between the recognition feature and the standard feature, and the smaller the similarity, the smaller the value. Various methods have been conventionally proposed as a method for performing character recognition by obtaining the distance value between the recognition feature of the recognition target character and the standard feature in the recognition dictionary, and the present invention may use any of the techniques. it can. For example,
As a document describing such a method, "Improvement of Directional Pattern Matching Method and Application to Handwritten Kanji Recognition" (199)
Jun. 0, IEICE Technical Report, PRU90-
There is 20).

【００２９】修正用テーブル５は、誤読カテゴリ５１と
正解カテゴリ５２と距離差５３との組５０を格納するた
めのテーブルであり、例えば２００個の組を格納し得る
容量を有している。ここで、本実施例では、修正用情報
として誤読カテゴリ５１にかかる距離値と正解カテゴリ
５２にかかる距離値との差である距離差５３を用いてい
る。この修正用テーブル５の初期状態は空状態、つまり
１つの組も格納されていない状態である。また、この修
正用テーブル５は利用者毎あるいは認識対象の種類（帳
票の種類等）毎に交換可能である。The correction table 5 is a table for storing the set 50 of the misread category 51, the correct answer category 52, and the distance difference 53, and has a capacity capable of storing, for example, 200 sets. In this embodiment, the distance difference 53, which is the difference between the distance value of the misread category 51 and the distance value of the correct category 52, is used as the correction information. The initial state of the correction table 5 is an empty state, that is, a state in which no set is stored. The correction table 5 can be exchanged for each user or for each type of recognition target (type of form, etc.).

【００３０】候補修正手段４は、修正用テーブル５を参
照して、認識手段２から入力された認識候補カテゴリ群
（第１位か第５位までの５個の認識候補カテゴリ）に新
たな認識候補カテゴリを追加したり、所定の認識候補カ
テゴリの距離値を減じた後に距離値でソートして、新た
な認識候補カテゴリ群を生成する手段であり、図２のス
テップＳ２〜Ｓ１０の処理を実行する。The candidate correction means 4 refers to the correction table 5 and newly recognizes the recognition candidate category group (five recognition candidate categories from the first rank to the fifth rank) input from the recognition means 2. It is a means for adding a candidate category or reducing the distance value of a predetermined recognition candidate category and then sorting by the distance value to generate a new recognition candidate category group, and executes the processing of steps S2 to S10 of FIG. To do.

【００３１】先ず、候補修正手段４は、修正用テーブル
５中の組５０における誤読カテゴリ５１を例えばテーブ
ルの先頭に近い組から順に１つずつ検索して、その誤読
カテゴリが認識手段２から入力された認識候補カテゴリ
群における第１位候補のカテゴリと同じか否かを調べて
いく（Ｓ２，Ｓ３，Ｓ４）。そして、第１位候補と同じ
カテゴリの誤読カテゴリを見つける毎に（Ｓ４でＹＥ
Ｓ）、下記のような処理を実行する。First, the candidate correction means 4 searches for the misreading categories 51 in the set 50 in the correction table 5 one by one, for example, in order from the set closest to the beginning of the table, and the misreading categories are input from the recognition means 2. It is checked whether the category is the same as the category of the first candidate in the recognition candidate category group (S2, S3, S4). Then, every time a misreading category of the same category as the first candidate is found (YE in S4
S), the following processing is executed.

【００３２】まず、その見つけた誤読カテゴリを含む組
５０中の正解カテゴリ５２と同じカテゴリの認識候補カ
テゴリが、認識手段２から入力された認識候補カテゴリ
群に含まれるか否かを、認識候補カテゴリ群から認識候
補カテゴリを１つずつ順に検索して調べる（Ｓ５，Ｓ
６，Ｓ７）。First, it is determined whether or not the recognition candidate category of the same category as the correct category 52 in the set 50 including the found misreading category is included in the recognition candidate category group input from the recognition means 2. One by one searching the recognition candidate categories from the group (S5, S
6, S7).

【００３３】そして、正解カテゴリ５２と同じカテゴリ
の認識候補カテゴリが存在していた場合（Ｓ７でＹＥ
Ｓ）、その認識候補カテゴリの距離値を次式により修正
する（Ｓ９）。Then, when there is a recognition candidate category of the same category as the correct answer category 52 (YE in S7).
S), the distance value of the recognition candidate category is corrected by the following equation (S9).

【００３４】Ｄ’＝Ｄ−ｗ・ｓ … ここで、Ｄ’は修正後の距離値，Ｄは修正前の距離値，
ｗは重み係数，ｓは正解カテゴリ５２を含む組５０中の
修正用情報である距離差５３である。ここで、重み係数
ｗは例えば１に固定化しても良く、また、当該組５０の
修正用テーブル５への記憶が古いものに対しては１に比
べて小さくなるような値にしてもよい。D ′ = D−w · s Here, D ′ is the corrected distance value, D is the uncorrected distance value,
w is a weighting coefficient, and s is a distance difference 53 which is correction information in the set 50 including the correct answer category 52. Here, the weighting coefficient w may be fixed to, for example, 1 or may be set to a value smaller than 1 for an old storage of the set 50 in the correction table 5.

【００３５】他方、正解カテゴリ５２と同じカテゴリの
認識候補カテゴリが存在しなかった場合（Ｓ６でＹＥ
Ｓ）、正解カテゴリ５２と同じカテゴリの認識候補カテ
ゴリを認識候補カテゴリ群に追加する（Ｓ８）。このと
き、追加した認識候補カテゴリの距離値としては、例え
ば、認識手段２から入力された認識候補カテゴリ群にお
ける最下位の認識候補カテゴリの距離値に予め定められ
た定数α（例えば２００）を足した値とする。そして、
ステップＳ９に進み、この追加した認識候補カテゴリの
距離値を前記の式により修正する。On the other hand, when there is no recognition candidate category of the same category as the correct answer category 52 (YE in S6)
S), a recognition candidate category of the same category as the correct answer category 52 is added to the recognition candidate category group (S8). At this time, as the distance value of the added recognition candidate category, for example, a predetermined constant α (for example, 200) is added to the distance value of the lowest recognition candidate category in the recognition candidate category group input from the recognition unit 2. Value. And
Proceeding to step S9, the distance value of the added recognition candidate category is corrected by the above equation.

【００３６】候補修正手段４は、以上のような処理後、
認識候補カテゴリ群中の認識候補カテゴリを、それらの
距離値で昇順にソートする（Ｓ１０）。このとき、距離
値が修正されていると、候補順位が認識手段２から出力
された時点と相違する場合がある。このように新たに順
序付けた認識候補カテゴリ群のうち上位複数個（例えば
第５位までの５個）の認識候補カテゴリとそれらの距離
値は結果訂正手段６に出力される。After the above processing, the candidate correction means 4
The recognition candidate categories in the recognition candidate category group are sorted in ascending order by their distance values (S10). At this time, if the distance value is corrected, the candidate rank may be different from the time when the candidate rank is output. Among the newly ordered recognition candidate category groups, a plurality of high-ranking recognition candidate categories (for example, 5 ranks up to the fifth rank) and their distance values are output to the result correcting means 6.

【００３７】結果訂正手段６は、候補修正手段４から入
力された認識候補カテゴリ群における第１位の認識候補
カテゴリを認識結果として、例えばＣＲＴ等で構成され
る表示手段７に表示する（Ｓ１１）。また、誤読があっ
た場合の訂正にそなえて第２位以下の認識候補カテゴリ
を表示手段７の別の表示領域に表示する。なお、この第
２位以下の認識候補カテゴリの表示はオペレータから誤
読している旨の指示があった時点で表示するようにして
も良い。The result correcting means 6 displays the first recognition candidate category in the recognition candidate category group input from the candidate correcting means 4 as the recognition result on the display means 7 composed of, for example, a CRT (S11). . In addition, the second and subsequent recognition candidate categories are displayed in another display area of the display means 7 for correction in the case of misreading. The display of the second and lower recognition candidate categories may be performed at the time when the operator gives an instruction to misread.

【００３８】オペレータは、表示手段７に表示された認
識結果をチェックし、誤読している場合には、キーボー
ド等の如き訂正情報入力手段８から訂正情報を入力して
訂正を行わせる。このとき、表示手段７に表示された第
２位以下の認識候補カテゴリ中に正解カテゴリが存在す
る場合には、その正解カテゴリを選択する情報を訂正情
報入力手段８から与えると（Ｓ１２でＹＥＳ）、結果訂
正手段６は、認識結果を上記選択された正解カテゴリに
訂正する（Ｓ１３）。また、第２位以下の認識候補カテ
ゴリ中に正解カテゴリが存在しない場合、オペレータは
訂正情報入力手段８を通じて例えばかな漢字変換などに
よって正解カテゴリを入力して訂正を行わせる。このと
き結果訂正手段６は、入力された訂正カテゴリで認識結
果を訂正する（Ｓ１３）。The operator checks the recognition result displayed on the display means 7, and if it is misread, inputs the correction information from the correction information input means 8 such as a keyboard to make a correction. At this time, when the correct answer category is present in the second or lower recognition candidate categories displayed on the display means 7, the information for selecting the correct answer category is given from the correction information input means 8 (YES in S12). The result correction means 6 corrects the recognition result into the selected correct answer category (S13). When the correct answer category does not exist in the second or lower recognition candidate categories, the operator inputs the correct answer category by the kana-kanji conversion or the like through the correction information input means 8 and corrects it. At this time, the result correction means 6 corrects the recognition result with the input correction category (S13).

【００３９】そして、結果訂正手段６は、認識結果を訂
正した場合、訂正前のカテゴリ（誤読カテゴリ）及びそ
の距離値と、訂正後のカテゴリ（正解カテゴリ）及びそ
の距離値とをテーブル更新手段９に通知し、修正用テー
ブルの更新を行わせる（Ｓ１４）。なお、正解カテゴリ
が認識候補カテゴリ群に存在しないときは、例えば認識
候補カテゴリ群における最下位の認識候補カテゴリの距
離値に予め定められた定数α（例えば２００）を加算し
た値を、正解カテゴリの距離値とする。Then, when the recognition result is corrected, the result correcting means 6 updates the table (correct reading category) and its distance value before correction, and the corrected category (correct category) and its distance value to the table updating means 9 To update the correction table (S14). When the correct answer category does not exist in the recognition candidate category group, for example, a value obtained by adding a predetermined constant α (for example, 200) to the distance value of the lowest recognition candidate category in the recognition candidate category group is set to the correct answer category. The distance value.

【００４０】テーブル更新手段９は、先ず、結果訂正手
段６から通知された誤読カテゴリの距離値から、正解カ
テゴリの距離値を減算して、距離差を求める。そして、
この距離差と前記正解カテゴリと前記誤読カテゴリとを
含む組５０を、修正用テーブル５に追加登録する。The table updating means 9 first subtracts the distance value of the correct answer category from the distance value of the misread category notified from the result correcting means 6 to obtain the distance difference. And
A set 50 including the distance difference, the correct answer category, and the misread category is additionally registered in the correction table 5.

【００４１】このとき、テーブル更新手段９は、例えば
修正用テーブル５の先頭から順に新たな組５０を登録す
るようにし、テーブルの最後尾まで組５０が登録されて
満杯になっていた場合（つまり、先の例では２００個の
組が登録されていた場合）には、再びテーブルの先頭に
戻り最も古く登録された組５０を上書きするかたちで追
加登録する。このように登録する組を例えば２００個程
度に限定し、２００個登録した後はテーブル５の領域を
循環的に使用することで、修正用テーブル５に必要なメ
モリの増大を防ぐことができる。At this time, the table updating means 9 registers a new set 50 in order, for example, from the beginning of the correction table 5, and when the set 50 is registered up to the end of the table and is full (that is, (In the previous example, when 200 sets have been registered), the registration is returned to the top of the table again and the oldest registered set 50 is overwritten for additional registration. By limiting the number of groups to be registered in this way to, for example, about 200 and cyclically using the area of the table 5 after registering 200, it is possible to prevent an increase in the memory required for the correction table 5.

【００４２】またテーブル更新手段９は、本実施例にお
いては、今回登録する組の誤読カテゴリおよび正解カテ
ゴリと同じ誤読カテゴリおよび正解カテゴリを含む組が
修正用テーブル５に既に登録されている場合には、その
既登録の距離差に今回の距離差を加算するようにしてい
る。Further, in the present embodiment, the table updating means 9 determines that a set including the same misreading category and correct answer category as that of the set registered this time is already registered in the correction table 5. , The current distance difference is added to the registered distance difference.

【００４３】次に、具体例を挙げて本実施例を更に詳細
に説明する。Next, this embodiment will be described in more detail with reference to specific examples.

【００４４】修正用テーブル５の内容が図３（ａ）に示
すように空である初期の状態において、帳票等に手書き
された或る文字の認識が行われ、認識手段２から図３
（ｂ）に示すような認識候補カテゴリ群および距離値が
出力されたとする。即ち、第１位候補が「巾」，第２位
候補が「ゆ」，第３位候補が「け」，第４位候補が
「や」，第５位候補が「サ」であり、それぞれの距離値
が１３００，１５００，１６００，１７００，１８００
であったとする。このとき、候補修正手段４では、修正
用テーブル５に１つも組が登録されていないので、図２
のステップＳ３で直ちに検索終了と判定されるために距
離値の修正は行われない。従って、ステップＳ１０にお
いて距離値でソートした新たな認識候補カテゴリ群およ
び距離値は図３（ｃ）に示すように図３（ｂ）と同じに
なる。このため、表示手段７には第１位候補の「巾」が
認識結果として表示され、また第２位以下の候補が表示
される（Ｓ１１）。ここで本来の入力文字が「ゆ」であ
った為にオペレータが誤読と判断し、表示手段７に表示
された第２位候補の「ゆ」を選択すると、結果訂正手段
６により認識結果が「巾」から「ゆ」に訂正される（Ｓ
１３）。また、誤読カテゴリ「巾」及びその距離値１３
００と、正解カテゴリ「ゆ」及びその距離値１５００と
がテーブル更新手段９に通知される。テーブル更新手段
９では、正解カテゴリ「ゆ」の距離値１５００から誤読
カテゴリ「巾」の距離値１３００を引いた２００を距離
差５３とし、それに誤読カテゴリ「巾」，正解カテゴリ
「ゆ」を加えた組５０を修正用テーブル５に登録する
（Ｓ１４）。この結果、修正用テーブル５の内容は図３
（ｄ）のようになる。In the initial state in which the contents of the correction table 5 are empty as shown in FIG. 3A, a certain character handwritten on a form or the like is recognized, and the recognition means 2 detects the character.
It is assumed that the recognition candidate category group and the distance value as shown in (b) are output. That is, the first candidate is “width”, the second candidate is “yu”, the third candidate is “ke”, the fourth candidate is “ya”, and the fifth candidate is “sa”. Distance values of 1300, 1500, 1600, 1700, 1800
Assume that At this time, in the candidate correction means 4, no set is registered in the correction table 5, so that FIG.
The distance value is not corrected because the search is immediately determined to be completed in step S3. Therefore, the new recognition candidate category group sorted by the distance value in step S10 and the distance value are the same as those in FIG. 3B as shown in FIG. Therefore, the "width" of the first rank candidate is displayed on the display means 7 as the recognition result, and the candidates of the second rank and below are displayed (S11). When the operator judges that the original input character is "Yu" because it is an erroneous reading and selects the second-ranked candidate "Yu" displayed on the display unit 7, the recognition result is " Corrected from "width" to "yu" (S
13). Also, the misread category "width" and its distance value 13
00, the correct category “Yu” and the distance value 1500 thereof are notified to the table updating means 9. In the table updating means 9, 200, which is obtained by subtracting the distance value 1300 of the misreading category “width” from the distance value 1500 of the correct answer category “yu”, is set as the distance difference 53, and the misreading category “width” and the correct answer category “yu” are added. The set 50 is registered in the correction table 5 (S14). As a result, the contents of the correction table 5 are shown in FIG.
(D).

【００４５】次に、修正用テーブル５の内容が図４
（ａ）に示す状態（図３（ｄ）と同じ）であるときに、
別の或る文字の認識が行われ、認識手段２から図４
（ｂ）に示すような認識候補カテゴリ群および距離値が
出力されたとする。即ち、第１位候補が「巾」，第２位
候補が「け」，第３位候補が「サ」，第４位候補が
「Ｈ」，第５位候補が「ゆ」であり、それぞれの距離値
が１１００，１１５０，１２００，１４００，１４００
であったとする。このとき、候補修正手段４では、第１
位候補「巾」と同じカテゴリの誤読カテゴリ「巾」を含
む組が修正用テーブル５に１つ存在するので、その組の
正解カテゴリ「ゆ」と距離差２００とに従って、認識候
補「ゆ」の距離値を１４００から１２００に変更する。
従って、ステップＳ１０で距離値でソートした新たな認
識候補カテゴリ群および距離値は図４（ｃ）に示すよう
になる。次に、表示手段７は第１位候補の「巾」を認識
結果として表示し、また第２位以下の候補も表示する
（Ｓ１１）。ここで本来の入力文字が「み」であった為
にオペレータが誤読と判断し、訂正情報入力手段８から
「み」を入力すると、結果訂正手段６は認識結果を
「巾」から「み」に訂正する（Ｓ１３）。また、誤読カ
テゴリ「巾」及びその距離値１１００と、正解カテゴリ
「み」及びその距離値とをテーブル更新手段９に通知す
る。ここで、正解カテゴリ「み」は認識候補カテゴリ群
に存在しないため、最下位候補「ゆ」の距離値１４００
に２００（＝α）を足した１６００が正解カテゴリ
「み」の距離値とされる。テーブル更新手段９は、正解
カテゴリ「み」の距離値１６００から誤読カテゴリ
「巾」の距離値１１００を引いた５００を距離差５３と
し、それに誤読カテゴリ「巾」，正解カテゴリ「み」を
加えた組５０を修正用テーブル５に登録する（Ｓ１
４）。この結果、修正用テーブル５の内容は図４（ｄ）
のようになる。Next, the contents of the correction table 5 are shown in FIG.
When in the state shown in (a) (same as in FIG. 3D),
Recognition of another certain character is performed, and the recognition means 2 to FIG.
It is assumed that the recognition candidate category group and the distance value as shown in (b) are output. That is, the first candidate is “width”, the second candidate is “ke”, the third candidate is “sa”, the fourth candidate is “H”, and the fifth candidate is “yu”. Distance values of 1100, 1150, 1200, 1400, 1400
Assume that At this time, the candidate correction means 4 uses the first
There is one set in the correction table 5 that includes the misreading category “width” of the same category as the rank candidate “width”. Therefore, according to the correct answer category “yu” and the distance difference 200 of that set, the recognition candidate “yu” is selected. Change the distance value from 1400 to 1200.
Therefore, the new recognition candidate category group and distance values sorted by distance value in step S10 are as shown in FIG. 4 (c). Next, the display means 7 displays the "width" of the first rank candidate as a recognition result, and also displays the second rank and lower rank candidates (S11). Here, since the original input character is "mi", the operator judges that it is a misreading, and inputs "mi" from the correction information input means 8, the result correcting means 6 recognizes the recognition result from "width" to "mi". (S13). Further, the table updating means 9 is notified of the misreading category “width” and its distance value 1100, and the correct answer category “mi” and its distance value. Here, since the correct answer category “mi” does not exist in the recognition candidate category group, the distance value 1400 of the lowest candidate “yu” is calculated.
Is added to 200 (= α) to obtain 1600 as the distance value of the correct answer category “mi”. The table updating means 9 subtracts the distance value 1100 of the misreading category “width” from the distance value 1600 of the correct answer category “Mi” to obtain a distance difference 53 of 500, and adds the misreading category “width” and the correct answer category “Mi”. The set 50 is registered in the correction table 5 (S1
4). As a result, the contents of the correction table 5 are shown in FIG.
become that way.

【００４６】次に、修正用テーブル５の内容が図５
（ａ）に示す状態（図４（ｄ）と同じ）であるときに、
別の或る文字（ここでは、「ゆ」，それも前回の「ゆ」
に比べてより「ゆ」らしく手書きされたものを想定して
いる）の認識が行われ、認識手段２から図５（ｂ）に示
すような認識候補カテゴリ群および距離値が出力された
とする。即ち、第１位候補が「巾」，第２位候補が
「ゆ」，第３位候補が「け」，第４位候補が「サ」，第
５位候補が「や」であり、それぞれの距離値が１２０
０，１３５０，１５００，１８００，１９００であった
とする。このとき、候補修正手段４では、第１位候補
「巾」と同じカテゴリの誤読カテゴリ「巾」を含む組が
修正用テーブル５に２つ存在するので、距離値の修正を
行う。先ず、１つの組の正解カテゴリ「ゆ」と距離差２
００とに従って、認識候補「ゆ」の距離値を１３５０か
ら１１５０に変更する。次に、残りの１つの組の正解カ
テゴリ「み」と距離差５００とに従って、認識候補
「み」を追加し、その距離値を１６００（＝１９００＋
２００−５００）とする。従って、ステップＳ１０で距
離値でソートした新たな認識候補カテゴリ群および距離
値は図５（ｃ）に示すようになる。次に、表示手段７は
第１位候補の「ゆ」を認識結果として表示する。Next, the contents of the correction table 5 are shown in FIG.
When in the state shown in (a) (the same as in FIG. 4D),
Another certain character (here, "Yu", which is also the previous "Yu")
It is assumed that the handwriting is more likely to be “smooth” as compared with the above), and the recognition unit 2 outputs the recognition candidate category group and the distance value as shown in FIG. 5B. That is, the first candidate is “width”, the second candidate is “yu”, the third candidate is “ke”, the fourth candidate is “sa”, and the fifth candidate is “ya”. Distance value of 120
It is assumed that they are 0, 1350, 1500, 1800, and 1900. At this time, the candidate correction means 4 corrects the distance value because there are two sets in the correction table 5 including the misreading category “width” of the same category as the first-ranked candidate “width”. First, one set of correct category "Yu" and distance difference 2
00, the distance value of the recognition candidate “Yu” is changed from 1350 to 1150. Next, the recognition candidate “mi” is added according to the remaining one set of correct category “mi” and the distance difference 500, and the distance value thereof is set to 1600 (= 1900 +).
200-500). Therefore, the new recognition candidate category group and distance values sorted by distance value in step S10 are as shown in FIG. 5 (c). Next, the display means 7 displays the first candidate "Yu" as the recognition result.

【００４７】このように、距離差が２００のように小さ
い条件で誤読し、同じカテゴリの組「巾，ゆ」が同様に
距離差が１５０のように小さい条件で再度生じた場合
は、以前に正解として訂正されたカテゴリ「ゆ」が上位
にくる。換言すれば、組の登録時に認識対象とした文字
に比べ、よりその文字に似ている文字であれば第１位候
補となり得ることを示している。As described above, when the misreading is performed under the condition that the distance difference is as small as 200, and the pair "width, yu" of the same category occurs again under the condition that the distance difference is as small as 150, it is possible to use The category "Yu", which was corrected as the correct answer, is ranked higher. In other words, it indicates that a character that is more similar to a character that is a recognition target at the time of registering a set can be a first-ranked candidate.

【００４８】次に、修正用テーブル５の内容が図６
（ａ）に示す状態（図４（ｄ），図５（ａ）と同じ）で
あるときに、別の或る文字（ここでは、「巾」を想定し
ている）の認識が行われ、認識手段２から図６（ｂ）に
示すような認識候補カテゴリ群および距離値が出力され
たとする。即ち、第１位候補が「巾」，第２位候補が
「け」，第３位候補が「サ」，第４位候補が「ゆ」，第
５位候補が「や」であり、それぞれの距離値が１２０
０，１３５０，１４００，１６５０，１８００であった
とする。このとき、候補修正手段４では、第１位候補
「巾」と同じカテゴリの誤読カテゴリ「巾」を含む組が
修正用テーブル５に２つ存在するので、距離値の修正を
行う。先ず、１つの組の正解カテゴリ「ゆ」と距離差２
００とに従って、認識候補「ゆ」の距離値を１６５０か
ら１４５０に変更する。次に、残りの１つの組の正解カ
テゴリ「み」と距離値５００とに従って、認識候補
「み」を追加し、その距離値を１５００（＝１８００＋
２００−５００）とする。従って、ステップＳ１０で距
離値でソートした新たな認識候補カテゴリ群および距離
値は図６（ｃ）に示すようになる。次に、表示手段７は
第１位候補の「巾」を認識結果として表示する。Next, the contents of the correction table 5 are shown in FIG.
In the state shown in (a) (same as in FIG. 4 (d) and FIG. 5 (a)), another certain character (here, "width" is assumed) is recognized, It is assumed that the recognition means 2 outputs the recognition candidate category group and the distance value as shown in FIG. 6B. That is, the first candidate is “width”, the second candidate is “ke”, the third candidate is “sa”, the fourth candidate is “yu”, and the fifth candidate is “ya”. Distance value of 120
It is assumed that they are 0, 1350, 1400, 1650 and 1800. At this time, the candidate correction means 4 corrects the distance value because there are two sets in the correction table 5 including the misreading category “width” of the same category as the first-ranked candidate “width”. First, one set of correct category "Yu" and distance difference 2
00, the distance value of the recognition candidate “Yu” is changed from 1650 to 1450. Next, a recognition candidate “mi” is added according to the remaining one set of correct answer category “mi” and the distance value 500, and the distance value is set to 1500 (= 1800 +).
200-500). Therefore, the new recognition candidate category group and distance values sorted by distance value in step S10 are as shown in FIG. 6 (c). Next, the display means 7 displays the “width” of the first candidate as the recognition result.

【００４９】このように、距離差が２００のように小さ
い条件で誤読し、同じカテゴリの組「巾，ゆ」が距離差
が４５０のように大きい条件で再度生じた場合は、認識
手段２による認識結果が維持される。In this way, when the misreading is performed under the condition that the distance difference is as small as 200, and the pair "width, yu" of the same category occurs again under the condition that the distance difference is as large as 450, the recognizing means 2 determines. The recognition result is maintained.

【００５０】次に、修正用テーブル５の内容が図７
（ａ）に示す状態（図４（ｄ）と同じ）であるときに、
別の或る文字（ここでは、「み」、それも前回の「み」
に比べて多少「み」らしく手書きされたものを想定して
いる）の認識が行われ、認識手段２から図７（ｂ）に示
すような認識候補カテゴリ群および距離値が出力された
とする。即ち、第１位候補が「巾」，第２位候補が
「け」，第３位候補が「サ」，第４位候補が「Ｈ」，第
５位候補が「や」であり、それぞれの距離値が１２０
０，１２５０，１３００，１４００，１４５０であった
とする。このとき、候補修正手段４では、第１位候補
「巾」と同じカテゴリの誤読カテゴリ「巾」を含む組が
修正用テーブル５に２つ存在するので、距離値の修正を
行う。先ず、１つの組の正解カテゴリ「ゆ」と距離差２
００とに従って、認識候補「ゆ」を追加し、その距離値
を１４５０（＝１４５０＋２００−２００）とする。次
に、残りの１つの組の正解カテゴリ「み」と距離値５０
０とに従って、認識候補「み」を追加し、その距離値を
１１５０（＝１４５０＋２００−５００）とする。従っ
て、ステップＳ１０で距離値でソートした新たな認識候
補カテゴリ群および距離値は図７（ｃ）に示すようにな
る。次に、表示手段７は第１位候補の「み」を認識結果
として表示する。Next, the contents of the correction table 5 are shown in FIG.
When in the state shown in (a) (the same as in FIG. 4D),
Another certain character (here, "mi", which is also the previous "mi"
It is assumed that the handwriting is somewhat “like” compared to the above) and the recognition means 2 outputs the recognition candidate category group and the distance value as shown in FIG. 7B. That is, the first candidate is “width”, the second candidate is “ke”, the third candidate is “sa”, the fourth candidate is “H”, and the fifth candidate is “ya”. Distance value of 120
It is assumed that they are 0, 1250, 1300, 1400, and 1450. At this time, the candidate correction means 4 corrects the distance value because there are two sets in the correction table 5 including the misreading category “width” of the same category as the first-ranked candidate “width”. First, one set of correct category "Yu" and distance difference 2
00, the recognition candidate “Yu” is added, and the distance value is set to 1450 (= 1450 + 200−200). Next, the remaining one set of correct category “M” and distance value 50
In accordance with 0, the recognition candidate “mi” is added, and the distance value is set to 1150 (= 1450 + 200−500). Therefore, the new recognition candidate category group and distance value sorted by distance value in step S10 are as shown in FIG. 7 (c). Next, the display means 7 displays the first candidate "mi" as a recognition result.

【００５１】このように、距離差が５００のように大き
い条件で誤読し、同じカテゴリの組「巾，み」が再度出
現した場合は、例え距離差が大きくても下位の候補が上
位にくることもあり得る。As described above, when the misreading is performed under the condition that the distance difference is large, such as 500, and the group "width, mi" of the same category appears again, even if the distance difference is large, the lower candidates come to the upper position. It is possible.

【００５２】次に、修正用テーブル５の内容が図８
（ａ）に示す状態（図４（ｄ）と同じ）であるときに、
別の或る文字（ここでは、「ゆ」、それも前記訂正時の
「ゆ」に比べて更に「ゆ」らしくない手書きされたもの
を想定している）の認識が行われ、認識手段２から図８
（ｂ）に示すような認識候補カテゴリ群および距離値が
出力されたとする。即ち、第１位候補が「巾」，第２位
候補が「ゆ」，第３位候補が「け」，第４位候補が
「や」，第５位候補が「サ」であり、それぞれの距離値
が１３００，１５５０，１６００，１７００，１８００
であったとする。このとき、候補修正手段４では、第１
位候補「巾」と同じカテゴリの誤読カテゴリ「巾」を含
む組が修正用テーブル５に２つ存在するので、距離値の
修正を行う。先ず、１つの組の正解カテゴリ「ゆ」と距
離差２００とに従って、認識候補「ゆ」の距離値を１５
５０から１３５０に変更する。次に、残りの１つの組の
正解カテゴリ「み」と距離値５００とに従って、認識候
補「み」を追加し、その距離値を１５００（＝１８００
＋２００−５００）とする。従って、ステップＳ１０で
距離値でソートした新たな認識候補カテゴリ群および距
離値は図８（ｃ）に示すようになる。次に、表示手段７
は第１位候補の「巾」を認識結果として表示する。ここ
で本来の文字が「ゆ」であった為にオペレータが誤読と
判断し、表示手段７に表示された第２位候補の「ゆ」を
選択すると、結果訂正手段６により認識結果が「巾」か
ら「ゆ」に訂正される（Ｓ１３）。また、誤読カテゴリ
「巾」及びその距離値１３００と、正解カテゴリ「ゆ」
及びその距離値１３５０とがテーブル更新手段９に通知
される。テーブル更新手段９では、正解カテゴリ「ゆ」
の距離値１３５０から誤読カテゴリ「巾」の距離値１３
００を引いた５０を距離差５３とし、それに誤読カテゴ
リ「巾」，正解カテゴリ「ゆ」を加えた組５０を修正用
テーブル５に登録する（Ｓ１４）。このとき、修正用テ
ーブル５には同じ誤読カテゴリ「巾」，同じ正解カテゴ
リ「ゆ」を持つ既登録の組｛巾，ゆ，２００｝が存在す
るので、その距離差２００に今回の距離差５０が加算さ
れる。この結果、修正用テーブル５の内容は図８（ｄ）
のようになる。なお、同じ誤読カテゴリ，正解カテゴリ
を持つ既登録の組は削除せずに、それとは別に今回のも
のを登録するようにしてもよい。即ち、前述の例で言え
ば、組｛巾，ゆ，２００｝を残したまま、組｛巾，ゆ，
５０｝を追加登録するようにしてもよい。Next, the contents of the correction table 5 are shown in FIG.
When in the state shown in (a) (the same as in FIG. 4D),
Another certain character (here, "yu", which is also assumed to be handwritten that does not look like "yu" more than "yu" at the time of correction) is recognized, and the recognition means 2 From Figure 8
It is assumed that the recognition candidate category group and the distance value as shown in (b) are output. That is, the first candidate is “width”, the second candidate is “yu”, the third candidate is “ke”, the fourth candidate is “ya”, and the fifth candidate is “sa”. Distance values of 1300, 1550, 1600, 1700, 1800
Assume that At this time, the candidate correction means 4 uses the first
Since there are two sets in the correction table 5 including the misreading category “width” of the same category as the position candidate “width”, the distance value is corrected. First, the distance value of the recognition candidate “Yu” is set to 15 according to one set of correct category “Yu” and the distance difference 200.
Change from 50 to 1350. Next, a recognition candidate "mi" is added according to the remaining one set of correct category "mi" and the distance value 500, and the distance value is set to 1500 (= 1800).
+ 200-500). Therefore, the new recognition candidate category group and distance values sorted by distance value in step S10 are as shown in FIG. 8 (c). Next, the display means 7
Displays the "width" of the first candidate as the recognition result. When the operator judges that the original character is "Yu" because the original character is "Yu" and the second candidate "Yu" displayed on the display unit 7 is selected, the recognition result is "width" by the result correction unit 6. Is corrected to "Yu" (S13). Also, the misread category "width" and its distance value 1300, and the correct category "yu"
And its distance value 1350 are notified to the table updating means 9. In the table updating means 9, the correct answer category "Yu"
From the distance value 1350 of the misreading category "width" to the distance value 13
50 obtained by subtracting 00 is set as the distance difference 53, and the set 50 in which the misreading category “width” and the correct answer category “Yu” are added is registered in the correction table 5 (S14). At this time, since the registered table {width, yu, 200} having the same misread category “width” and the same correct answer category “yu” exists in the correction table 5, the distance difference 200 is the distance difference 50 of this time. Is added. As a result, the contents of the correction table 5 are shown in FIG.
become that way. The registered set having the same misread category and correct answer category may not be deleted, but the current one may be registered separately. That is, in the case of the above example, the set {width, yu, 200} is left while the set {width, yu, 200} is left.
50} may be additionally registered.

【００５３】このように、距離差が２００のように小さ
い条件で誤読し、同じカテゴリの組「巾，ゆ」がそれよ
り大きい距離差で現れた場合は、以前に正解として訂正
されたカテゴリ「ゆ」が必ずしも第１位候補にはなり得
ない。しかし、再度訂正することにより修正用テーブル
５中の距離差が変更されるため、吸収できる距離差を徐
々に増大していくことができる。従って、次の例に示す
ように、先の「ゆ」に比べて多少なりとも「ゆ」らしい
「ゆ」の読み取りでは、再び「ゆ」が認識結果として出
力される。As described above, when the misreading is performed under the condition that the distance difference is as small as 200, and the group "width, yu" of the same category appears with a larger distance difference, the category "corrected" is corrected. "Yu" cannot necessarily be the first candidate. However, since the distance difference in the correction table 5 is changed by correcting again, the distance difference that can be absorbed can be gradually increased. Therefore, as shown in the following example, when reading "yu" which is more "yu" than "yu", "yu" is output again as a recognition result.

【００５４】即ち、修正用テーブル５の内容が図９
（ａ）に示す状態（図８（ｄ）と同じ）であるときに、
図８で想定した「ゆ」に比べて多少なりとも「ゆ」らし
い「ゆ」の認識が行われ、認識手段２から図９（ｂ）に
示すような認識候補カテゴリ群および距離値が出力され
たとする。即ち、第１位候補が「巾」，第２位候補が
「ゆ」，第３位候補が「け」，第４位候補が「や」，第
５位候補が「サ」であり、それぞれの距離値が１３０
０，１５００，１６００，１７００，１８００であった
とする。このとき、候補修正手段４では、第１位候補
「巾」と同じカテゴリの誤読カテゴリ「巾」を含む組が
修正用テーブル５に２つ存在するので、距離値の修正を
行う。先ず、１つの組の正解カテゴリ「ゆ」と距離値２
５０とに従って、認識候補「ゆ」の距離値を１５００か
ら１２５０に変更する。次に、残りの１つの組の正解カ
テゴリ「み」と距離値５００とに従って、認識候補
「み」を追加し、その距離値を１５００（＝１８００＋
２００−５００）とする。従って、ステップＳ１０で距
離値でソートした新たな認識候補カテゴリ群および距離
値は図９（ｃ）に示すようになり、第１位候補の「ゆ」
が認識結果として表示される。That is, the contents of the correction table 5 are shown in FIG.
When in the state shown in FIG. 8A (same as FIG. 8D),
"Yu" that is more or less "Yu" is recognized as compared with "Yu" assumed in FIG. 8, and the recognition means 2 outputs the recognition candidate category group and the distance value as shown in FIG. 9B. Suppose That is, the first candidate is “width”, the second candidate is “yu”, the third candidate is “ke”, the fourth candidate is “ya”, and the fifth candidate is “sa”. Has a distance value of 130
It is assumed that they are 0, 1500, 1600, 1700 and 1800. At this time, the candidate correction means 4 corrects the distance value because there are two sets in the correction table 5 including the misreading category “width” of the same category as the first-ranked candidate “width”. First, a set of correct category "Yu" and distance value 2
50, the distance value of the recognition candidate “Yu” is changed from 1500 to 1250. Next, a recognition candidate “mi” is added according to the remaining one set of correct answer category “mi” and the distance value 500, and the distance value is set to 1500 (= 1800 +).
200-500). Therefore, the new recognition candidate category group and distance values sorted by distance value in step S10 are as shown in FIG. 9C, and the first candidate "Yu"
Is displayed as the recognition result.

【００５５】図１０は図１の実施例の別の動作フローチ
ャートである。この動作フローチャートが図２のものと
相違するところは、ステップＳ１５において第１位候補
カテゴリを保持し、図２のステップＳ２〜Ｓ１０の処理
（ステップＳ１６）の後、第１位候補カテゴリがステッ
プＳ１５において保持した候補カテゴリと異なるかチェ
ックし（ステップＳ１７）、異なる場合（Ｓ１７でＹＥ
Ｓ）には再度ステップＳ２〜Ｓ１０の処理を行う（ステ
ップＳ１８）点にある。図１の候補修正手段４は、図１
０のステップＳ１５〜Ｓ１８の処理を実行する。FIG. 10 is another operation flowchart of the embodiment shown in FIG. This operation flowchart differs from that of FIG. 2 in that the first-ranked candidate category is held in step S15, and after the processing of steps S2 to S10 (step S16) in FIG. 2, the first-ranked candidate category is changed to step S15. It is checked whether it is different from the candidate category held in step S17 (step S17). If it is different (YES in step S17)
S) is that the processes of steps S2 to S10 are performed again (step S18). The candidate correction means 4 shown in FIG.
The processing of steps S15 to S18 of 0 is executed.

【００５６】次に、具体例を挙げて更に詳細に説明す
る。Next, a specific example will be described in more detail.

【００５７】修正用テーブル５の内容が図１１（ａ）に
示す状態（図４（ａ）と同じ）であるときに、別の或る
文字（ここでは「巾」を想定している）の認識が行わ
れ、認識手段２から図１１（ｂ）に示すような認識候補
カテゴリ群および距離値が出力されたとする。即ち、第
１位候補が「巾」，第２位候補が「ゆ」，第３位候補が
「サ」，第４位候補が「Ｈ」，第５位候補が「や」であ
り、それぞれの距離値が１１００，１１５０，１２０
０，１４００，１４００であったとする。まず、候補修
正手段４では、第１位候補「巾」を記憶しておく（ステ
ップＳ１５）。次に、第１位候補「巾」と同じカテゴリ
の誤読カテゴリ「巾」を含む組が修正用テーブル５に１
つ存在するので、その組の正解カテゴリ「ゆ」と距離差
２００とに従って、認識候補「ゆ」の距離値を１１５０
から９５０に変更する。従って、新たな認識候補カテゴ
リ群および距離値は図１１（ｃ）に示すようになる（ス
テップＳ１６）。次に、新しい第１位候補「ゆ」が記憶
した候補（この場合「巾」）と異なるかチェックする
（ステップＳ１７）。この場合異なるため、第１位候補
「ゆ」と同じカテゴリの誤読カテゴリを含む組を修正用
テーブル５に探すが、存在しないため修正を行わない
（ステップＳ１８）。次に、表示手段７は第１位候補の
「ゆ」を認識結果として表示し、また第２位以下の候補
も表示する（Ｓ１１）。ここで本来の入力文字が「巾」
であった為にオペレータが誤読と判断し、訂正情報入力
手段８から「巾」を入力すると、結果訂正手段６は認識
結果を「ゆ」から「巾」に訂正する（Ｓ１３）。テーブ
ル更新手段９は、誤読カテゴリ「ゆ」，正解カテゴリ
「巾」，距離差１５０（＝１１００−９５０）を修正用
テーブル５に登録する（Ｓ１４）。この結果、修正用テ
ーブル５の内容は図１１（ｄ）のようになる。When the content of the correction table 5 is in the state shown in FIG. 11A (same as FIG. 4A), another certain character (here, "width" is assumed) is displayed. It is assumed that the recognition is performed and the recognition means 2 outputs the recognition candidate category group and the distance value as shown in FIG. That is, the first candidate is “width”, the second candidate is “yu”, the third candidate is “sa”, the fourth candidate is “H”, and the fifth candidate is “ya”. Distance values of 1100, 1150, 120
It is assumed that they are 0, 1400, and 1400. First, the candidate correction means 4 stores the first-ranked candidate "width" (step S15). Next, a set including the misreading category “width” in the same category as the first-ranked candidate “width” is set to 1 in the correction table 5.
Therefore, the distance value of the recognition candidate “Yu” is set to 1150 according to the correct category “Yu” and the distance difference 200 of the set.
Change from 950 to 950. Therefore, the new recognition candidate category group and the distance value are as shown in FIG. 11C (step S16). Next, it is checked whether the new first-ranked candidate "Yu" is different from the stored candidate (in this case, "width") (step S17). In this case, since it is different, a set including a misreading category of the same category as the first-ranked candidate “Yu” is searched for in the correction table 5, but it is not corrected because it does not exist (step S18). Next, the display means 7 displays the first-ranked candidate "Yu" as the recognition result, and also displays the second-ranked candidates and below (S11). Here the original input character is "width"
Therefore, when the operator judges that the reading is erroneous and inputs "width" from the correction information input means 8, the result correction means 6 corrects the recognition result from "yu" to "width" (S13). The table updating means 9 registers the misreading category "Yu", the correct answer category "Width", and the distance difference 150 (= 1100-950) in the correction table 5 (S14). As a result, the contents of the correction table 5 are as shown in FIG.

【００５８】次に、修正用テーブル５の内容が図１２
（ａ）に示す状態（図１１（ｄ）と同じ）であるとき
に、別の或る文字（ここでは、「巾」を想定している）
の認識が行われ、認識手段２から図１２（ｂ）に示すよ
うな認識候補カテゴリ群および距離値が出力されたとす
る。即ち、第１位候補が「巾」，第２位候補が「ゆ」，
第３位候補が「け」，第４位候補が「サ」，第５位候補
が「や」であり、それぞれの距離値が１２００，１３５
０，１５００，１８００，１９００であったとする。ま
ず、候補修正手段４では、第１位候補「巾」を記憶して
おく（ステップＳ１５）。次に、第１位候補「巾」と同
じカテゴリの誤読カテゴリ「巾」を含む組が修正用テー
ブル５に１つ存在するので、距離差２００に従って認識
候補「ゆ」の距離値を１３５０から１１５０に変更す
る。従って、新たな認識候補カテゴリ群および距離値は
図１２（ｃ）に示すようになる（ステップＳ１６）。次
に、新しい第１位候補「ゆ」が記憶した候補（この場合
「巾」）と異なるかチェックする（ステップＳ１７）。
この場合異なるため、第１位候補「ゆ」と同じカテゴリ
の誤読カテゴリを含む組を修正用テーブル５に探し、１
つ存在するので、距離差１５０に従って認識候補「巾」
の距離値を１２００から１０５０に変更する。従って、
新たな認識候補カテゴリ群および距離値は図１２（ｄ）
に示すようになる（ステップＳ１８）。Next, the contents of the correction table 5 are shown in FIG.
In the state shown in FIG. 11A (same as FIG. 11D), another certain character (here, "width" is assumed).
Is recognized, and the recognition means 2 outputs a recognition candidate category group and a distance value as shown in FIG. That is, the first candidate is "width", the second candidate is "yu",
The third candidate is “ke”, the fourth candidate is “sa”, the fifth candidate is “ya”, and the distance values are 1200 and 135, respectively.
It is assumed that they are 0, 1500, 1800 and 1900. First, the candidate correction means 4 stores the first-ranked candidate "width" (step S15). Next, since there is one set in the correction table 5 that includes the misreading category “width” of the same category as the first-ranked candidate “width”, the distance value of the recognition candidate “yu” is changed from 1350 to 1150 according to the distance difference 200. Change to. Therefore, the new recognition candidate category group and the distance value are as shown in FIG. 12C (step S16). Next, it is checked whether the new first-ranked candidate "Yu" is different from the stored candidate (in this case, "width") (step S17).
In this case, since it is different, a set including the misread category of the same category as the first-ranked candidate “Yu” is searched for in the correction table 5 and 1
Since there are two, there is a recognition candidate “width” according to the distance difference 150.
Change the distance value of from 1200 to 1050. Therefore,
The new recognition candidate category group and distance value are shown in FIG.
(Step S18).

【００５９】このように、誤読しやすい組「ゆ」「巾」
の両方を何度も認識する場合は、１回の修正処理では互
いに逆の文字に誤読されやすくなるが、２回修正処理を
行うと、最初の認識処理の結果が重視され、認識率を逆
に低下させてしまう現象を防ぐことができる。As described above, the group "Yu" and "Width" which are easily misread
If both are recognized many times, it is easy for the correction process to be misread by the opposite characters, but if the correction process is performed twice, the result of the first recognition process is emphasized and the recognition rate is reversed. It is possible to prevent the phenomenon that it lowers.

【００６０】図１３は本発明を適用した文字認識装置の
別の実施例のブロック図である。この実施例の文字認識
装置が図１の実施例のものと相違するところは、修正用
情報として、距離差５３と認識用特徴５４とを用いる点
にある。この相違に伴って、認識手段２’，候補修正手
段４’，修正用テーブル５’，テーブル更新手段９’は
図１に示される対応する手段に対して、以下のように変
更されている。FIG. 13 is a block diagram of another embodiment of the character recognition device to which the present invention is applied. The character recognition device of this embodiment is different from that of the embodiment of FIG. 1 in that a distance difference 53 and a recognition feature 54 are used as correction information. Due to this difference, the recognition means 2 ', the candidate correction means 4', the correction table 5 ', and the table update means 9'are changed from the corresponding means shown in FIG. 1 as follows.

【００６１】修正用テーブル５’は、誤読カテゴリ５１
と正解カテゴリ５２と距離差５３と認識用特徴５４とを
含む組５０’を複数組格納し得る容量を有している。The correction table 5'has a misreading category 51.
Has a capacity capable of storing a plurality of sets 50 ′ including the correct answer category 52, the distance difference 53, and the recognition feature 54.

【００６２】認識手段２’は、データ入力手段１から入
力された文字イメージ画像から認識用特徴を抽出し、認
識辞書３に予め記憶されている文字カテゴリ毎の標準特
徴と比較して、距離値が小さい上位複数個（例えば第５
位までの５個）の認識候補カテゴリと、それら各々の距
離値とを求めて候補修正手段４’に出力すると共に、文
字イメージ画像から抽出した認識特徴をテーブル更新手
段９’に出力する。The recognizing means 2'extracts the recognizing features from the character image image inputted from the data inputting means 1, compares the recognizing features with the standard features stored in the recognizing dictionary 3 in advance for each character category, and calculates the distance value. Multiple higher ranks (eg, 5th
The recognition candidate categories of up to five) and their respective distance values are obtained and output to the candidate correction means 4 ', and the recognition features extracted from the character image image are output to the table update means 9'.

【００６３】候補修正手段４’は、前記式に代えて、
下記の式により、距離値を修正する。The candidate correction means 4 ', instead of the above equation,
Correct the distance value using the following formula.

【００６４】Ｄ’＝Ｄ−ｗ・ｓ・ａ／ｄ … ここで、Ｄ’は修正後の距離値，Ｄは修正前の距離値，
ｗは重み係数，ｓは修正情報中の距離差，ｄは修正用情
報中の認識用特徴と今回の文字イメージ画像から抽出し
た認識用特徴との距離値，ａは距離値ｄを正規化する定
数である基準距離値である。認識用特徴間の距離値ｄの
計算は、認識手段２’における認識辞書３の各カテゴリ
の標準特徴と入力文字イメージ画像から得られる認識用
特徴との距離計算と同じでよく、例えば、ユークリッド
距離や市街区距離を用いることができる。D ′ = D−w · s · a / d ... where D ′ is the corrected distance value, D is the uncorrected distance value,
w is a weighting coefficient, s is a distance difference in the correction information, d is a distance value between the recognition feature in the correction information and the recognition feature extracted from the current character image, and a normalizes the distance value d. It is a reference distance value that is a constant. The calculation of the distance value d between the recognition features may be the same as the calculation of the distance between the standard feature of each category of the recognition dictionary 3 in the recognition means 2 ′ and the recognition feature obtained from the input character image image, for example, the Euclidean distance. Or city block distance can be used.

【００６５】テーブル更新手段９’は、結果訂正手段６
から誤読カテゴリ及びその距離値と正解カテゴリ及びそ
の距離値とが通知されたとき、図１のテーブル更新手段
９と同様に距離差５３を求め、この求めた距離差５３
に、認識手段２’から入力されている認識用特徴を追加
して修正用情報を作成し、これに更に誤読カテゴリと正
解カテゴリとを付加して、修正用テーブル５’に登録す
べき組５０’を生成する。The table updating means 9'is the result correcting means 6
When the misreading category and its distance value and the correct answer category and its distance value are notified from, the distance difference 53 is calculated in the same manner as the table updating means 9 in FIG.
, The recognition feature input from the recognition means 2 ′ is added to create correction information, and the misread category and the correct answer category are further added to this, and the set 50 to be registered in the correction table 5 ′ is added. 'Is generated.

【００６６】なお、修正用テーブル５’に登録する認識
用特徴は、認識用特徴を圧縮した圧縮特徴にしてもよ
い。こうすることにより、修正用テーブル５’の必要容
量を削減することができる。The recognition feature registered in the correction table 5'may be a compression feature obtained by compressing the recognition feature. By doing so, the required capacity of the correction table 5'can be reduced.

【００６７】本実施例の文字認識装置のその他の機能お
よび動作は図１の文字認識装置と同じである。本実施例
では、修正用テーブル５’の認識用特徴５４と入力文字
から得られる認識用特徴が近く、その結果距離値ｄが小
さい場合、距離値Ｄが大きく減少するため、対応する候
補が上位に来やすくなる。逆に、修正用テーブル５’の
認識用特徴５４と入力文字から得られる認識用特徴とが
遠く、その結果距離値ｄが大きい場合は、距離値Ｄはあ
まり減少しないため、対応する候補の順位はあまり上が
らない特性を示す。Other functions and operations of the character recognition device of this embodiment are the same as those of the character recognition device of FIG. In the present embodiment, when the recognition feature 54 of the correction table 5 ′ and the recognition feature obtained from the input character are close to each other, and as a result, the distance value d is small, the distance value D is greatly reduced, and the corresponding candidate is ranked higher. Easier to come to. On the contrary, when the recognition feature 54 of the correction table 5 ′ and the recognition feature obtained from the input character are distant and the distance value d is large as a result, the distance value D does not decrease so much, and therefore the rank of the corresponding candidate is reduced. Indicates a characteristic that does not rise so much.

【００６８】以上本発明の実施例について説明したが、
本発明は以上の実施例にのみ限定されずその他各種の付
加変更が可能である。例えば、データ入力手段で得られ
る入力データは、スキャナ等により入力される文字イメ
ージ画像としたが、タブレットにより入力されるオンラ
イン文字データや、マイクにより入力される音声データ
であってもよい。また、修正用テーブルに記憶する誤読
カテゴリは、認識辞書のテンプレート番号でもよい。テ
ンプレート番号にすると、認識辞書に同一カテゴリのテ
ンプレートを複数保持する場合にテンプレートを区別す
ることができる。The embodiments of the present invention have been described above.
The present invention is not limited to the above embodiments, and various other additions and changes are possible. For example, although the input data obtained by the data input means is a character image image input by a scanner or the like, it may be online character data input by a tablet or voice data input by a microphone. Further, the misread category stored in the correction table may be the template number of the recognition dictionary. If the template number is used, the templates can be distinguished when the recognition dictionary holds a plurality of templates in the same category.

【００６９】[0069]

【発明の効果】以上説明したように本発明によれば、オ
ペレータの認識結果訂正情報を以降の認識処理時におけ
る距離値の修正に利用し、一度誤読したことのあるカテ
ゴリが認識手段の出力に第１位候補として出現した場合
に、修正用テーブルに登録された正解カテゴリと一致す
る認識候補カテゴリの距離値を適正に減じて、第１位候
補に躍り出る可能性を残したので、第１位候補が正解と
なる認識率を従来のものに比べて向上することができ
る。As described above, according to the present invention, the recognition result correction information of the operator is used for the correction of the distance value in the subsequent recognition processing, and the category that has been misread once becomes the output of the recognition means. If it appears as a first-ranked candidate, the distance value of the recognition candidate category that matches the correct category registered in the correction table is appropriately reduced, leaving the possibility of jumping to the first-ranked candidate. It is possible to improve the recognition rate in which the candidate is the correct answer as compared with the conventional one.

【００７０】また、修正処理により第１位候補が修正さ
れた場合に再度修正処理を行うことにより、誤読カテゴ
リと正解カテゴリが相互に登録された文字の組に対して
は、元の認識結果を重視して認識率の低下が生じる可能
性を少なくしたので、認識率を従来に比べて向上させる
ことができる。When the first-ranked candidate is corrected by the correction process, the correction process is performed again, so that the original recognition result is returned to the character set in which the misread category and the correct answer category are mutually registered. Since the possibility of a decrease in the recognition rate is emphasized and reduced, the recognition rate can be improved as compared with the conventional case.

[Brief description of drawings]

【図１】本発明を適用した文字認識装置の一実施例のブ
ロック図である。FIG. 1 is a block diagram of an embodiment of a character recognition device to which the present invention is applied.

【図２】図１の実施例の文字認識装置の動作手順を示す
フローチャートである。FIG. 2 is a flowchart showing an operation procedure of the character recognition device of the embodiment of FIG.

【図３】図１の実施例の動作説明図であって、誤読の訂
正が行われたことにより初期状態の修正用テーブルに修
正用情報を含む１つ目の組が登録された状況を示す図で
ある。FIG. 3 is an operation explanatory diagram of the embodiment of FIG. 1, showing a situation in which a first set including correction information is registered in a correction table in an initial state due to correction of misreading. It is a figure.

【図４】図１の実施例の動作説明図であって、再び誤読
の訂正が行われたことにより修正用テーブルに修正用情
報を含む２つ目の組が登録される状況を示す図である。FIG. 4 is a diagram for explaining the operation of the embodiment of FIG. 1, showing a situation in which a second set including correction information is registered in the correction table due to correction of misreading again. is there.

【図５】図１の実施例の動作説明図であって、修正用テ
ーブルに登録された修正用情報を含む組によって正解カ
テゴリが導き出せた例を示す図である。5 is an operation explanatory diagram of the embodiment in FIG. 1, and is a diagram showing an example in which a correct category is derived by a set including correction information registered in a correction table.

【図６】図１の実施例の動作説明図であって、修正用テ
ーブルに登録された修正用情報を含む組による影響を受
けずに正解カテゴリが導き出せた例を示す図である。6 is an operation explanatory diagram of the embodiment in FIG. 1, and is a diagram showing an example in which a correct category is derived without being affected by a set including correction information registered in a correction table.

【図７】図１の実施例の動作説明図であって、修正用テ
ーブルに登録された修正用情報を含む組によって正解カ
テゴリが導き出せた他の例を示す図である。FIG. 7 is an operation explanatory diagram of the embodiment of FIG. 1, and is a diagram showing another example in which a correct category is derived by a set including correction information registered in a correction table.

【図８】図１の実施例の動作説明図であって、修正用テ
ーブルに登録された修正用情報を含む組によっても正解
カテゴリが導き出せなかった例と、その為に再び訂正を
行って新たな修正用情報を含む組を修正用テーブルに登
録した状況を示す図である。FIG. 8 is an explanatory diagram of the operation of the embodiment of FIG. 1, in which the correct category cannot be derived even by the set including the correction information registered in the correction table, and a correction is made again for that. It is a figure which shows the condition which registered the group containing various corrective information in the corrective table.

【図９】図１の実施例の動作説明図であって、再度の訂
正によって、正解カテゴリが導き出せた例を示す図であ
る。FIG. 9 is a diagram for explaining the operation of the embodiment of FIG. 1, and is a diagram showing an example in which a correct answer category has been derived by another correction.

【図１０】図１の実施例の文字認識装置の動作手順を示
す別のフローチャートである。10 is another flowchart showing an operation procedure of the character recognition device in the embodiment of FIG.

【図１１】図１の実施例の図１０のフローによる動作説
明図であって、誤読カテゴリと正解カテゴリが逆になる
組が修正用テーブルに登録される状況を示す図である。FIG. 11 is an operation explanatory diagram according to the flow of FIG. 10 of the embodiment of FIG. 1, and is a diagram showing a situation in which a set in which the misread category and the correct answer category are reversed is registered in the correction table.

【図１２】図１の実施例の図１０のフローによる動作説
明図であって、２回の修正処理によって正解カテゴリが
導き出せた例を示す図である。12 is an explanatory diagram of an operation according to the flow of FIG. 10 of the embodiment of FIG. 1, and is a diagram showing an example in which a correct category is derived by two correction processes.

【図１３】本発明を適用した文字認識装置の別の実施例
のブロック図である。FIG. 13 is a block diagram of another embodiment of the character recognition device to which the present invention is applied.

【図１４】従来の文字認識装置の構成例を示すブロック
図である。FIG. 14 is a block diagram showing a configuration example of a conventional character recognition device.

[Explanation of symbols]

１データ入力手段２，２’ 認識手段３認識辞書４，４’ 候補修正手段５，５’ 修正用テーブル５０，５０’ 組５１誤読カテゴリ５２正解カテゴリ５３距離差５４認識用特徴６結果訂正手段７表示手段８訂正情報入力手段９’ テーブル更新手段 1 data input means 2, 2'recognition means 3 recognition dictionary 4, 4 'candidate correction means 5, 5' correction table 50, 50 'set 51 misreading category 52 correct category 53 distance difference 54 recognition feature 6 result correction means 7 Display means 8 Correction information input means 9'Table updating means

Claims

[Claims]

1. A feature for recognition is extracted from data to be recognized,
Recognition that obtains a plurality of high-ranking recognition candidate categories having small distance values, which are the differences between the recognition characteristics and the standard characteristics, and the distance values thereof, by comparing with the standard characteristics of each category registered in advance in the recognition dictionary Means, a correction table for storing a set of misreading category, correct answer category, and correction information, and a set including a misreading category that matches the first-ranked recognition candidate category in the recognition candidate category group obtained by the recognition means. Is stored in the correction table, candidate correction means for reducing the distance value of the recognition candidate category matching the correct category in the set by a value according to the correction information in the set, and the candidate correction means. The first-ranked recognition candidate category in the recognition candidate category group after processing by the means is output as a recognition result, and according to the correction information input from the correction information input means. And a table updating unit for adding a set of a misread category, a correct category, and correction information to the correction table when the result correction unit corrects the recognition result. Recognizing device candidate correction method.

2. The candidate correcting means sets a predetermined distance value that matches the correct answer category when a recognition candidate category matching the correct answer category does not exist in the recognition candidate category group obtained by the recognizing means. 2. The configuration is such that the distance value is corrected after adding the given recognition candidate category to the recognition candidate category group.
Candidate correction method for the described recognition device.

3. The table updating means uses, as the correction information, a distance difference which is a difference between a distance value of a misreading category and a distance value of a correct answer category.
Candidate correction method for the described recognition device.

4. The candidate correcting means sets the corrected distance value D ′ as D ′ = D, where D is a distance value before correction, w is a weighting coefficient, and s is a distance difference which is correction information. 4. The candidate correction method for a recognition device according to claim 3, wherein the candidate correction method is obtained by −w · s.

5. The table updating means uses, as the correction information, a distance difference which is a difference between a distance value of a misreading category and a distance value of a correct answer category, and a recognition feature extracted from recognition target data. The candidate correction method for a recognition device according to claim 2, wherein:

6. The candidate correction means recognizes a distance value before correction, D, a weighting coefficient, w, a distance difference in the correction information, s, a recognition feature in the correction information, and recognition target data at this time. If the distance value to the feature for use is d, and the reference distance value that is a constant for normalizing the distance value d is a, the corrected distance value D ′ is given by D ′ = D−w · s · a / d The candidate correction method of the recognition device according to claim 5, wherein the candidate correction method is obtained.

7. The candidate correction means, when the first recognition candidate category in the recognition candidate category group obtained by the recognition means is changed by the candidate correction using the correction table, a new first correction candidate 7. The candidate correction method for a recognition apparatus according to claim 1, wherein the candidate correction method is configured to perform the candidate correction again by using the position candidate category.