JP2894305B2

JP2894305B2 - Recognition device candidate correction method

Info

Publication number: JP2894305B2
Application number: JP8344865A
Authority: JP
Inventors: 雅彦濱中
Original assignee: Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1995-12-28
Filing date: 1996-12-25
Publication date: 1999-05-24
Anticipated expiration: 2016-12-25
Also published as: JPH09237322A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は帳票上に書かれた文
字を認識する文字認識装置等のデータ認識装置に関し、
より詳細には、認識結果に誤りがあった為にオペレータ
が正しい認識結果に訂正した場合に、その訂正情報を以
降の認識処理に活用して認識精度を高めるようにした認
識装置の候補修正方式に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a data recognition device such as a character recognition device for recognizing characters written on a form.
More specifically, when an operator corrects a recognition result to an incorrect one because of an error in the recognition result, the correction information is used in a subsequent recognition process to improve a recognition accuracy of the recognition device. About.

【０００２】[0002]

【従来の技術】文字認識装置、特に手書文字認識装置に
おいては認識率を１００％にすることが極めて困難であ
る。このため、認識結果を表示し、オペレータが目視で
チェックして、誤読している文字は正しい文字に置き換
えるような半自動式が一般的に採用されている。そし
て、このようなオペレータによる認識結果訂正作業を支
援するために従来より種々の技術が提案されている。2. Description of the Related Art It is extremely difficult for a character recognition device, especially a handwritten character recognition device, to achieve a recognition rate of 100%. For this reason, a semi-automatic method of displaying a recognition result, visually checking by an operator, and replacing misread characters with correct characters is generally adopted. Various techniques have conventionally been proposed to assist such an operator in correcting the recognition result.

【０００３】図１４は従来の文字認識装置の構成例を示
すブロック図であり、特開昭６１−７４０８１号公報
（以下、文献１と称す）に記載された文字認識装置の構
成を機能ブロック図化したものである。FIG. 14 is a block diagram showing an example of the configuration of a conventional character recognition device. The functional block diagram of the configuration of the character recognition device described in Japanese Patent Application Laid-Open No. 61-74081 (hereinafter referred to as Reference 1) is shown in FIG. It is a thing.

【０００４】図１４において、帳票上などに書かれた文
字はスキャナ等のデータ入力手段１０によって文字イメ
ージ画像として入力され、認識手段２２に与えられる。
認識辞書２１には文字カテゴリ毎の文字パターンの標準
特徴が記憶されており、認識手段２２は、文字イメージ
画像から認識用特徴を抽出し、これと認識辞書２１中の
各文字カテゴリ毎の標準特徴とを比較して例えば第５位
候補までの認識候補カテゴリを求め、候補追加手段５０
に出力する。In FIG. 14, a character written on a form or the like is input as a character image image by a data input means 10 such as a scanner and is given to a recognition means 22.
The recognition dictionary 21 stores standard features of character patterns for each character category, and the recognition unit 22 extracts recognition features from the character image image, and extracts the recognition features and the standard features for each character category in the recognition dictionary 21. , For example, the recognition candidate category up to the fifth candidate is obtained, and the candidate adding means 50
Output to

【０００５】候補追加手段５０は、必要に応じて後述す
る候補追加処理を施したのち、全ての認識候補カテゴリ
を結果訂正手段４０に出力し、結果訂正手段４０は、そ
の第１候補の認識候補カテゴリを認識結果として表示手
段４１に表示する。[0005] The candidate adding means 50 performs a candidate adding process described later as necessary, and then outputs all recognition candidate categories to the result correcting means 40. The result correcting means 40 outputs the first candidate recognition candidate. The category is displayed on the display means 41 as a recognition result.

【０００６】オペレータは、表示手段４１に表示された
認識結果を見て、帳票上等の文字が正しく認識されたか
否かをチェックする。そして、誤読している場合、キー
ボード等の訂正情報入力手段４２から誤読している旨を
入力する。結果訂正手段４０は、これに応じて候補追加
手段５０から出力されていた第２候補以降の認識候補カ
テゴリを表示手段４１に表示する。The operator looks at the recognition result displayed on the display means 41 and checks whether the characters on the form or the like have been correctly recognized. In the case of misreading, the fact that the misreading is performed is input from the correction information input means 42 such as a keyboard. The result correction unit 40 displays the second and subsequent recognition candidate categories output from the candidate addition unit 50 on the display unit 41 in response to this.

【０００７】従って、例えば、帳票上に書かれた文字が
「み」であったのにかかわらず、認識手段２２で決定さ
れた認識候補が上位より順に「巾」，「け」，「サ」，
「Ｈ」，「や」であり、候補追加手段５０で新たな候補
の追加がなかった場合、表示手段４１には認識結果とし
て第１候補の「巾」が表示され、オペレータが誤読であ
る旨を指示すると、残りの候補「け」，「サ」，
「Ｈ」，「や」が表示手段４１に表示される。Accordingly, for example, regardless of whether the character written on the form is "mi", the recognition candidates determined by the recognizing means 22 are "width", "ke", "sa" in descending order. ,
If “H” or “Y”, and no new candidate has been added by the candidate adding unit 50, the display unit 41 displays the “width” of the first candidate as a recognition result, indicating that the operator misread the message. And the remaining candidates "ke", "sa",
“H” and “ya” are displayed on the display means 41.

【０００８】オペレータは、この表示された候補中に正
解が含まれていれば、それを選択することにより結果訂
正手段４０に認識結果をその選択した候補に変更する処
理を行わせるわけであるが、前述のように正解が候補中
にない場合には、例えばかな漢字変換等により訂正情報
入力手段４２から正解の「み」を入力して、訂正を指示
する。結果訂正手段４０は、正解が入力されると、認識
結果をその正解に変更する処理を行うと共に、誤読カテ
ゴリ（第１候補であった「巾」のカテゴリ）と正解カテ
ゴリ（オペレータにより入力された「み」のカテゴリ）
とをテーブル更新手段５２に送る。テーブル更新手段５
２は、この誤読カテゴリと正解カテゴリとの組を、それ
が未登録ならカテゴリテーブル５１に追加登録する。If the displayed candidate contains a correct answer, the operator selects the correct answer to cause the result correcting means 40 to change the recognition result to the selected candidate. If the correct answer is not among the candidates as described above, the correct answer "mi" is input from the correction information input means 42 by, for example, kana-kanji conversion or the like, and the correction is instructed. When the correct answer is input, the result correcting means 40 performs a process of changing the recognition result to the correct answer, and performs a misreading category (a category of “width” which was the first candidate) and a correct answer category (input by the operator). "Mi" category)
Is sent to the table updating means 52. Table updating means 5
2 additionally registers the set of the misread category and the correct answer category in the category table 51 if it is not registered.

【０００９】さて、このようなカテゴリテーブル５１へ
の登録が行われると、その後に再び同様な書体の「み」
の認識が試みられ、例えば先と同様に認識手段２２が上
位より順に「巾」，「け」，「サ」，「Ｈ」，「や」の
認識候補を出力して候補追加手段５０に出力すると、候
補追加手段５０による候補追加処理で「み」が新たな候
補に追加される。即ち、候補追加手段５０は、認識手段
２２から送られてきた第１候補のカテゴリ「巾」と同一
カテゴリである誤読カテゴリ「巾」をカテゴリテーブル
５１から検索し、見つけるとその誤読カテゴリ「巾」と
対を成す正解カテゴリ「み」を第２候補として候補に追
加する。従って、上述の場合、新たな認識候補は、
「巾」，「み」，「け」，…，となり、最初に表示手段
４１に表示される認識結果は「巾」で前回と変わらない
が、オペレータが誤読を指示したときには、前回表示さ
れなかった「み」が認識候補として表示手段４１に表示
されることになり、正解への訂正は表示された「み」を
選択するだけで済む。[0009] When such registration in the category table 51 is performed, the same typeface “mi” is again used.
For example, as described above, the recognition unit 22 outputs recognition candidates of “width”, “ke”, “sa”, “H”, and “ya” in order from the top and outputs the candidates to the candidate addition unit 50. Then, “mi” is added to a new candidate in the candidate adding process by the candidate adding unit 50. That is, the candidate adding unit 50 searches the category table 51 for the misread category “width” that is the same category as the category “width” of the first candidate sent from the recognizing unit 22. Is added as a second candidate to the candidates. Therefore, in the above case, the new recognition candidate is
.., And the first recognition result displayed on the display means 41 is “width”, which is the same as the previous one, but is not displayed last time when the operator instructs misreading. "Mi" is displayed on the display means 41 as a recognition candidate, and correction to the correct answer can be made only by selecting the displayed "mi".

【００１０】なお、図１４に示した従来技術と同様の技
術は特開平１−１００６８６号公報（以下、文献２と称
す）にも記載されている。A technique similar to the prior art shown in FIG. 14 is also described in Japanese Patent Application Laid-Open No. 1-168686 (hereinafter referred to as Document 2).

【００１１】他方、特開昭５９−２７３７５号公報（以
下、文献３と称す），特開昭６３−２０８１８０号公報
（以下、文献４と称す）および特開平５−７３７０９号
公報（以下、文献５と称す）には、認識結果として表示
された一連の文字列において或る１つの文字の認識結果
の訂正を行った場合、それより以降の認識結果中に同様
の訂正を行うべき箇所が存在するか否かを自動的に検出
し、オペレータによる一度の訂正操作で同種の複数の誤
読部分の訂正を一括して行えるようにした技術が記載さ
れている。On the other hand, JP-A-59-27375 (hereinafter referred to as Reference 3), JP-A-63-208180 (hereinafter referred to as Reference 4) and JP-A-5-73709 (hereinafter referred to as Reference 3). 5), when a recognition result of a certain character is corrected in a series of character strings displayed as a recognition result, there is a portion where the same correction should be performed in subsequent recognition results. A technique is described that automatically detects whether or not to perform the same operation and corrects a plurality of erroneously read portions of the same type collectively by a single correction operation by an operator.

【００１２】[0012]

【発明が解決しようとする課題】このように従来の文字
認識装置においては、オペレータによる認識結果訂正作
業を支援するために種々の技術が提案されているが、以
下のような問題点があった。As described above, in the conventional character recognition apparatus, various techniques have been proposed to assist the operator in correcting the recognition result, but have the following problems. .

【００１３】文献１および文献２に示される技術では、
カテゴリテーブル５１に登録された正解カテゴリを固定
の順位（前述の例では第２位）の認識候補として追加す
るだけであるため、最初に表示される認識結果は常に認
識手段が第１位候補としたものに限られる。つまり、オ
ペレータによる認識結果訂正情報は第１位候補に影響を
全く与えない。従って、候補中に正解が含まれる正解含
有率は改善されても、第１位候補が正解となる認識率は
向上させることができない。このため、オペレータによ
る訂正回数自体を削減することは難しい。[0013] In the techniques described in Documents 1 and 2,
Since the correct category registered in the category table 51 is only added as a recognition candidate of a fixed rank (the second rank in the above example), the first displayed recognition result always indicates that the recognition means is the first rank candidate. Limited to That is, the recognition result correction information by the operator has no effect on the first candidate. Therefore, even if the correct answer content rate in which the correct answer is included in the candidates is improved, the recognition rate at which the first candidate becomes the correct answer cannot be improved. For this reason, it is difficult to reduce the number of corrections themselves by the operator.

【００１４】また文献３乃至文献５に示される技術は、
一連の認識結果に対する訂正作業の効率化は実現される
が、その訂正作業においてオペレータが入力した認識結
果訂正情報は今回の認識結果の訂正だけに利用されるも
のに過ぎないため、以降の新たな認識対象文字列に対す
る認識では、再び同じ結果をもたらす。即ち、オペレー
タの認識結果訂正情報を活用して認識率を向上させるこ
とができていない。The techniques disclosed in References 3 to 5 are:
Although the efficiency of the correction work for a series of recognition results can be improved, the recognition result correction information input by the operator in the correction work is used only for correcting the current recognition result. Recognition of the recognition target character string brings the same result again. That is, the recognition rate cannot be improved by utilizing the recognition result correction information of the operator.

【００１５】本発明はこのような従来の問題点を解決し
たもので、その目的は、オペレータの認識結果訂正情報
を以降の認識処理に活用し、候補中に正解が含まれる正
解含有率だけでなく、第１位候補が正解となる認識率を
も向上させ得るようにすることにある。The present invention has solved such a conventional problem. The purpose of the present invention is to utilize the operator's recognition result correction information in the subsequent recognition processing, and to determine only the correct answer content ratio in which the correct answer is included in the candidates. Instead, it is to improve the recognition rate at which the first candidate is the correct answer.

【００１６】[0016]

【課題を解決するための手段】本発明は、文字イメージ
画像などの認識対象データから認識用特徴を抽出し、認
識辞書に予め登録されている各カテゴリ毎の標準特徴と
比較して、認識用特徴と標準特徴との差である距離値が
小さい上位複数個の認識候補カテゴリ及びそれらの前記
距離値を求める認識手段と、誤読カテゴリと正解カテゴ
リと修正用情報との組を格納する修正用テーブルと、前
記認識手段で求められた認識候補カテゴリ群における第
１位の認識候補カテゴリと一致する誤読カテゴリを含む
組が前記修正用テーブルに格納されている場合に、その
組中の正解カテゴリと一致する認識候補カテゴリの距離
値をその組中の修正用情報に応じた値だけ減じる候補修
正手段と、該候補修正手段による処理後の認識候補カテ
ゴリ群における第１位の認識候補カテゴリを認識結果と
して出力すると共に、訂正情報入力手段から入力される
訂正情報に従って前記認識結果を訂正する結果訂正手段
と、該結果訂正手段による訂正時に、誤読カテゴリと正
解カテゴリと修正用情報との組を前記修正用テーブルに
追加するテーブル更新手段とを備えている。According to the present invention, a feature for recognition is extracted from recognition target data such as a character image and compared with standard features for each category registered in a recognition dictionary in advance. A plurality of upper recognition candidate categories having a small distance value, which is a difference between a feature and a standard feature, a recognition unit for obtaining the distance value, and a correction table for storing a set of a misread category, a correct category, and correction information. And when a set including an erroneous reading category that matches the first-ranked recognition candidate category in the group of recognition candidate categories determined by the recognition unit is stored in the correction table, the set matches the correct category in the set. Correction means for reducing the distance value of the recognition candidate category to be performed by a value corresponding to the correction information in the set; A result correction means for outputting the recognition candidate category of the rank as a recognition result, and correcting the recognition result in accordance with the correction information input from the correction information input means; Table updating means for adding a set with the correction information to the correction table.

【００１７】また、前記候補修正手段は、前記正解カテ
ゴリと一致する認識候補カテゴリが前記認識手段で求め
られた認識候補カテゴリ群に存在しないときは、前記正
解カテゴリと一致する、所定の距離値を付与した認識候
補カテゴリを前記認識候補カテゴリ群に追加した上で前
記距離値の修正を行う構成を有している。Further, the candidate correcting means, when the recognition candidate category matching the correct answer category does not exist in the recognition candidate category group obtained by the recognition means, sets a predetermined distance value matching the correct answer category. The configuration is such that the distance value is corrected after adding the given recognition candidate category to the recognition candidate category group.

【００１８】前記テーブル更新手段が前記修正用テーブ
ルに追加する組中の修正用情報としては、例えば、（１）誤読カテゴリの距離値と正解カテゴリの距離値と
の差である距離差。（２）誤読カテゴリの距離値と正解カテゴリの距離値と
の差である距離差、および認識対象データから抽出され
た認識用特徴。が用いられる。The correction information in the set added by the table updating means to the correction table includes, for example, (1) a distance difference which is a difference between a distance value of the misread category and a distance value of the correct category. (2) A distance difference, which is a difference between the distance value of the misread category and the distance value of the correct category, and a recognition feature extracted from the recognition target data. Is used.

【００１９】前記（１）のような修正用情報を用いる場
合、前記候補修正手段は、例えば次式によって距離値
を修正する。When using the correction information as in the above (1), the candidate correction means corrects the distance value by, for example, the following equation.

【００２０】Ｄ’＝Ｄ−ｗ・ｓ … ここで、Ｄ’は修正後の距離値，Ｄは修正前の距離値，
ｗは重み係数，ｓは修正情報である距離差である。D ′ = D−w · s where D ′ is a distance value after correction, D is a distance value before correction,
w is a weight coefficient, and s is a distance difference as correction information.

【００２１】また前記（２）のような修正用情報を用い
る場合、前記候補修正手段は、例えば次式によって距
離値を修正する。When the correction information as described in (2) is used, the candidate correction means corrects the distance value by the following equation, for example.

【００２２】Ｄ’＝Ｄ−ｗ・ｓ・ａ／ｄ … ここで、Ｄ’は修正後の距離値，Ｄは修正前の距離値，
ｗは重み係数，ｓは修正情報中の距離差，ｄは修正用情
報中の認識用特徴と今回の認識対象データの認識用特徴
との距離値，ａは距離値ｄを正規化する定数である基準
距離値である。また、前記候補修正手段は、前記認識手
段で求められた認識候補カテゴリ群における第１位の認
識候補カテゴリが前記修正用テーブルを用いた候補修正
により変更された場合に、新しい第１位候補カテゴリを
用いて手再度候補修正を行う構成を有している。D ′ = D−w · s · a / d where D ′ is a distance value after correction, D is a distance value before correction,
w is a weighting factor, s is the distance difference in the correction information, d is the distance value between the recognition feature in the correction information and the recognition feature of the data to be recognized this time, and a is a constant for normalizing the distance value d. This is a certain reference distance value. In addition, the candidate correcting unit is configured to generate a new first candidate category when the first recognition candidate category in the group of recognition candidate categories obtained by the recognition unit is changed by candidate correction using the correction table. Is used to correct a candidate again.

【００２３】上述のように構成された本発明にあって
は、誤読時にオペレータが正しいカテゴリに訂正する
と、その正解カテゴリと誤読カテゴリ（装置が第１位候
補として出力したカテゴリ）と修正用情報との組が修正
用テーブルに登録される。そして、以降、その誤読カテ
ゴリと同じカテゴリが第１位の認識候補カテゴリとして
求められた場合、その誤読カテゴリに対応する正解カテ
ゴリと同じカテゴリの認識候補カテゴリの距離値が減じ
られる。従って、減じられる値の大きさによっては、第
１位候補に躍り出る場合がある。これが前記文献１及び
文献２に記載された技術と大きく相違するところであ
り、最初に表示される認識結果は常に認識手段が第１位
候補としたものに限られず、以前に正解カテゴリとして
入力したカテゴリが認識結果として出力され得る場合が
ある。このような作用により、第１位候補が正解となる
認識率を向上させることができる。In the present invention configured as described above, when the operator corrects a correct category at the time of erroneous reading, the correct category, the erroneous reading category (the category output as the first candidate by the apparatus), the correction information, Are registered in the correction table. Thereafter, when the same category as the misread category is obtained as the first recognition candidate category, the distance value of the recognition candidate category of the same category as the correct category corresponding to the misread category is reduced. Therefore, depending on the magnitude of the value to be reduced, there is a case where it jumps to the first candidate. This is a major difference from the techniques described in the above-mentioned Documents 1 and 2. The recognition result displayed first is not limited to the one in which the recognition means is always set as the first candidate, but the category previously input as the correct answer category. May be output as a recognition result. By such an operation, the recognition rate at which the first-rank candidate becomes a correct answer can be improved.

【００２４】[0024]

【発明の実施の形態】次に本発明の実施の形態の例につ
いて図面を参照して詳細に説明する。DESCRIPTION OF THE PREFERRED EMBODIMENTS Next, embodiments of the present invention will be described in detail with reference to the drawings.

【００２５】図１は本発明を適用した文字認識装置の一
実施例のブロック図である。同図に示すように本実施例
の文字認識装置は、データ入力手段１と、認識手段２
と、認識辞書３と、候補修正手段４と、修正用テーブル
５と、結果訂正手段６と、表示手段７と、訂正情報入力
手段８と、テーブル更新手段９とを備えている。FIG. 1 is a block diagram of an embodiment of a character recognition apparatus to which the present invention is applied. As shown in FIG. 1, the character recognition device according to the present embodiment includes a data input unit 1 and a recognition unit 2.
, A recognition dictionary 3, a candidate correction unit 4, a correction table 5, a result correction unit 6, a display unit 7, a correction information input unit 8, and a table update unit 9.

【００２６】また、図２は図１の実施例の動作フローチ
ャートである。以下、図１および図２を参照して本実施
例の各部の機能ならびに動作を説明する。FIG. 2 is an operation flowchart of the embodiment of FIG. Hereinafter, the function and operation of each unit of the present embodiment will be described with reference to FIGS.

【００２７】データ入力手段１は、例えばスキャナ等で
構成され、例えば図示しない帳票上に書かれた手書き文
字を走査して各文字毎の文字イメージ画像を生成し、認
識手段２に出力する。The data input means 1 is constituted by, for example, a scanner or the like. For example, the data input means 1 scans handwritten characters written on a form (not shown) to generate a character image image for each character and outputs it to the recognizing means 2.

【００２８】認識辞書３には、文字カテゴリ毎の標準文
字パターンの認識用特徴（標準特徴）が予め記憶されて
いる。認識手段２は、データ入力手段１から入力された
文字イメージ画像から認識用特徴を抽出し、認識辞書３
に予め記憶されている文字カテゴリ毎の標準特徴と比較
して、距離値が小さい上位複数個（例えば第５位までの
５個）の認識候補カテゴリと、それら各々の距離値とを
求める（Ｓ１）。この求められた認識候補カテゴリ群と
各距離値は候補修正手段４に出力される。ここで、距離
値とは認識用特徴と標準特徴との差であり、類似度が高
いほど値が小さくなる。認識対象文字の認識用特徴と認
識辞書中の標準特徴との距離値を求めて文字認識を行う
手法としては従来より各種のものが提案されており、本
発明はその任意の手法を用いることができる。例えば、
このような手法を記載した文献として、『方向パタンマ
ッチング法の改良と手書き漢字認識への応用』（１９９
０年６月，電子情報通信学会研究会技報，ＰＲＵ９０−
２０）がある。The recognition dictionary 3 stores in advance the recognition characteristics (standard characteristics) of the standard character pattern for each character category. The recognition means 2 extracts a recognition feature from the character image image input from the data input means 1 and
In comparison with the standard features for each character category stored in advance, a plurality of recognition candidate categories (e.g., five to the fifth place) having smaller distance values and their respective distance values are obtained (S1). ). The obtained recognition candidate category group and each distance value are output to the candidate correcting means 4. Here, the distance value is a difference between the recognition feature and the standard feature, and the value decreases as the similarity increases. Various methods have been proposed for performing character recognition by obtaining a distance value between a feature for recognition of a recognition target character and a standard feature in a recognition dictionary, and the present invention may use any of the methods. it can. For example,
A document describing such a technique is described in "Improvement of Direction Pattern Matching Method and Application to Handwritten Kanji Character Recognition" (199).
June 2000, IEICE Technical Report, PRU90-
20).

【００２９】修正用テーブル５は、誤読カテゴリ５１と
正解カテゴリ５２と距離差５３との組５０を格納するた
めのテーブルであり、例えば２００個の組を格納し得る
容量を有している。ここで、本実施例では、修正用情報
として誤読カテゴリ５１にかかる距離値と正解カテゴリ
５２にかかる距離値との差である距離差５３を用いてい
る。この修正用テーブル５の初期状態は空状態、つまり
１つの組も格納されていない状態である。また、この修
正用テーブル５は利用者毎あるいは認識対象の種類（帳
票の種類等）毎に交換可能である。The correction table 5 is a table for storing a set 50 of a misread category 51, a correct answer category 52, and a distance difference 53, and has a capacity to store, for example, 200 sets. Here, in the present embodiment, a distance difference 53 which is a difference between the distance value relating to the misread category 51 and the distance value relating to the correct category 52 is used as the correction information. The initial state of the correction table 5 is an empty state, that is, a state in which no set is stored. The correction table 5 can be exchanged for each user or each type of recognition target (type of form, etc.).

【００３０】候補修正手段４は、修正用テーブル５を参
照して、認識手段２から入力された認識候補カテゴリ群
（第１位か第５位までの５個の認識候補カテゴリ）に新
たな認識候補カテゴリを追加したり、所定の認識候補カ
テゴリの距離値を減じた後に距離値でソートして、新た
な認識候補カテゴリ群を生成する手段であり、図２のス
テップＳ２〜Ｓ１０の処理を実行する。The candidate correction means 4 refers to the correction table 5 and newly performs a new recognition on the recognition candidate category group (the first or fifth highest recognition candidate category) inputted from the recognition means 2. This is a means for adding a candidate category or subtracting a distance value of a predetermined recognition candidate category and then sorting by distance values to generate a new recognition candidate category group, and executes the processing of steps S2 to S10 in FIG. I do.

【００３１】先ず、候補修正手段４は、修正用テーブル
５中の組５０における誤読カテゴリ５１を例えばテーブ
ルの先頭に近い組から順に１つずつ検索して、その誤読
カテゴリが認識手段２から入力された認識候補カテゴリ
群における第１位候補のカテゴリと同じか否かを調べて
いく（Ｓ２，Ｓ３，Ｓ４）。そして、第１位候補と同じ
カテゴリの誤読カテゴリを見つける毎に（Ｓ４でＹＥ
Ｓ）、下記のような処理を実行する。First, the candidate correction unit 4 searches the misread categories 51 in the set 50 in the correction table 5 one by one in order from, for example, the set closest to the head of the table, and the misread category is input from the recognition unit 2. It is checked whether it is the same as the category of the first candidate in the recognized candidate category group (S2, S3, S4). Each time a misread category of the same category as the first candidate is found (YE in S4)
S), the following processing is executed.

【００３２】まず、その見つけた誤読カテゴリを含む組
５０中の正解カテゴリ５２と同じカテゴリの認識候補カ
テゴリが、認識手段２から入力された認識候補カテゴリ
群に含まれるか否かを、認識候補カテゴリ群から認識候
補カテゴリを１つずつ順に検索して調べる（Ｓ５，Ｓ
６，Ｓ７）。First, it is determined whether or not a recognition candidate category of the same category as the correct answer category 52 in the set 50 including the found misread category is included in the recognition candidate category group input from the recognition means 2. Search and examine the recognition candidate categories one by one from the group one by one (S5, S5
6, S7).

【００３３】そして、正解カテゴリ５２と同じカテゴリ
の認識候補カテゴリが存在していた場合（Ｓ７でＹＥ
Ｓ）、その認識候補カテゴリの距離値を次式により修正
する（Ｓ９）。Then, if a recognition candidate category of the same category as the correct answer category 52 exists (YE in S7)
S), the distance value of the recognition candidate category is corrected by the following equation (S9).

【００３４】Ｄ’＝Ｄ−ｗ・ｓ … ここで、Ｄ’は修正後の距離値，Ｄは修正前の距離値，
ｗは重み係数，ｓは正解カテゴリ５２を含む組５０中の
修正用情報である距離差５３である。ここで、重み係数
ｗは例えば１に固定化しても良く、また、当該組５０の
修正用テーブル５への記憶が古いものに対しては１に比
べて小さくなるような値にしてもよい。D ′ = D−w · s where D ′ is a distance value after correction, D is a distance value before correction,
w is a weight coefficient, and s is a distance difference 53 that is correction information in the set 50 including the correct answer category 52. Here, the weight coefficient w may be fixed to, for example, 1, or may be set to a value that is smaller than 1 when the set 50 in the correction table 5 is old.

【００３５】他方、正解カテゴリ５２と同じカテゴリの
認識候補カテゴリが存在しなかった場合（Ｓ６でＹＥ
Ｓ）、正解カテゴリ５２と同じカテゴリの認識候補カテ
ゴリを認識候補カテゴリ群に追加する（Ｓ８）。このと
き、追加した認識候補カテゴリの距離値としては、例え
ば、認識手段２から入力された認識候補カテゴリ群にお
ける最下位の認識候補カテゴリの距離値に予め定められ
た定数α（例えば２００）を足した値とする。そして、
ステップＳ９に進み、この追加した認識候補カテゴリの
距離値を前記の式により修正する。On the other hand, when there is no recognition candidate category of the same category as the correct answer category 52 (YE in S6)
S), a recognition candidate category of the same category as the correct answer category 52 is added to the recognition candidate category group (S8). At this time, as the distance value of the added recognition candidate category, for example, a predetermined constant α (for example, 200) is added to the distance value of the lowest recognition candidate category in the recognition candidate category group input from the recognition means 2. Value. And
Proceeding to step S9, the distance value of the added recognition candidate category is corrected by the above equation.

【００３６】候補修正手段４は、以上のような処理後、
認識候補カテゴリ群中の認識候補カテゴリを、それらの
距離値で昇順にソートする（Ｓ１０）。このとき、距離
値が修正されていると、候補順位が認識手段２から出力
された時点と相違する場合がある。このように新たに順
序付けた認識候補カテゴリ群のうち上位複数個（例えば
第５位までの５個）の認識候補カテゴリとそれらの距離
値は結果訂正手段６に出力される。After the above processing, the candidate correcting means 4
The recognition candidate categories in the recognition candidate category group are sorted in ascending order by their distance values (S10). At this time, if the distance value has been corrected, the candidate order may be different from the time when the recognition unit 2 outputs the candidate order. Out of the newly ordered recognition candidate category groups, the upper plurality (for example, five to the fifth) of the recognition candidate categories and their distance values are output to the result correction means 6.

【００３７】結果訂正手段６は、候補修正手段４から入
力された認識候補カテゴリ群における第１位の認識候補
カテゴリを認識結果として、例えばＣＲＴ等で構成され
る表示手段７に表示する（Ｓ１１）。また、誤読があっ
た場合の訂正にそなえて第２位以下の認識候補カテゴリ
を表示手段７の別の表示領域に表示する。なお、この第
２位以下の認識候補カテゴリの表示はオペレータから誤
読している旨の指示があった時点で表示するようにして
も良い。The result correction means 6 displays the first recognition candidate category in the recognition candidate category group input from the candidate correction means 4 as a recognition result on the display means 7 constituted by, for example, a CRT (S11). . In addition, the second and lower recognition candidate categories are displayed in another display area of the display means 7 in preparation for correction in the case of misreading. The display of the second or lower recognition candidate category may be displayed when the operator instructs that the reading is erroneous.

【００３８】オペレータは、表示手段７に表示された認
識結果をチェックし、誤読している場合には、キーボー
ド等の如き訂正情報入力手段８から訂正情報を入力して
訂正を行わせる。このとき、表示手段７に表示された第
２位以下の認識候補カテゴリ中に正解カテゴリが存在す
る場合には、その正解カテゴリを選択する情報を訂正情
報入力手段８から与えると（Ｓ１２でＹＥＳ）、結果訂
正手段６は、認識結果を上記選択された正解カテゴリに
訂正する（Ｓ１３）。また、第２位以下の認識候補カテ
ゴリ中に正解カテゴリが存在しない場合、オペレータは
訂正情報入力手段８を通じて例えばかな漢字変換などに
よって正解カテゴリを入力して訂正を行わせる。このと
き結果訂正手段６は、入力された訂正カテゴリで認識結
果を訂正する（Ｓ１３）。The operator checks the recognition result displayed on the display means 7 and, if erroneous reading is performed, inputs correction information from the correction information input means 8 such as a keyboard to make correction. At this time, if a correct answer category exists in the second or lower recognition candidate categories displayed on the display means 7, information for selecting the correct answer category is given from the correction information input means 8 (YES in S12). Then, the result correcting means 6 corrects the recognition result to the selected correct answer category (S13). If the correct category does not exist in the second and lower recognition candidate categories, the operator inputs the correct category through the correction information input means 8 by, for example, kana-kanji conversion or the like, and causes the correction to be performed. At this time, the result correcting means 6 corrects the recognition result with the input correction category (S13).

【００３９】そして、結果訂正手段６は、認識結果を訂
正した場合、訂正前のカテゴリ（誤読カテゴリ）及びそ
の距離値と、訂正後のカテゴリ（正解カテゴリ）及びそ
の距離値とをテーブル更新手段９に通知し、修正用テー
ブルの更新を行わせる（Ｓ１４）。なお、正解カテゴリ
が認識候補カテゴリ群に存在しないときは、例えば認識
候補カテゴリ群における最下位の認識候補カテゴリの距
離値に予め定められた定数α（例えば２００）を加算し
た値を、正解カテゴリの距離値とする。When the result of the recognition is corrected, the result correction means 6 updates the category before correction (misread category) and its distance value and the corrected category (correct answer category) and its distance value in the table updating means 9. To update the correction table (S14). When the correct category does not exist in the recognition candidate category group, for example, a value obtained by adding a predetermined constant α (for example, 200) to the distance value of the lowest recognition candidate category in the recognition candidate category group is used as the correct category. The distance value.

【００４０】テーブル更新手段９は、先ず、結果訂正手
段６から通知された誤読カテゴリの距離値から、正解カ
テゴリの距離値を減算して、距離差を求める。そして、
この距離差と前記正解カテゴリと前記誤読カテゴリとを
含む組５０を、修正用テーブル５に追加登録する。The table updating means 9 first obtains a distance difference by subtracting the distance value of the correct answer category from the distance value of the misread category notified from the result correcting means 6. And
A set 50 including the distance difference, the correct answer category, and the misread category is additionally registered in the correction table 5.

【００４１】このとき、テーブル更新手段９は、例えば
修正用テーブル５の先頭から順に新たな組５０を登録す
るようにし、テーブルの最後尾まで組５０が登録されて
満杯になっていた場合（つまり、先の例では２００個の
組が登録されていた場合）には、再びテーブルの先頭に
戻り最も古く登録された組５０を上書きするかたちで追
加登録する。このように登録する組を例えば２００個程
度に限定し、２００個登録した後はテーブル５の領域を
循環的に使用することで、修正用テーブル５に必要なメ
モリの増大を防ぐことができる。At this time, the table updating means 9 registers, for example, new sets 50 in order from the top of the correction table 5, and when the sets 50 are registered up to the end of the table and becomes full (ie, (In the above example, when 200 sets are registered), the table is returned to the top of the table again and the oldest registered set 50 is overwritten and additionally registered. In this way, the number of sets to be registered is limited to, for example, about 200, and after the 200 sets are registered, the area of the table 5 is cyclically used, thereby preventing an increase in the memory required for the correction table 5.

【００４２】またテーブル更新手段９は、本実施例にお
いては、今回登録する組の誤読カテゴリおよび正解カテ
ゴリと同じ誤読カテゴリおよび正解カテゴリを含む組が
修正用テーブル５に既に登録されている場合には、その
既登録の距離差に今回の距離差を加算するようにしてい
る。In the present embodiment, the table updating means 9 determines whether a group including the same misreading category and correct answer category as the currently registered set of misreading and correct answer categories has already been registered in the correction table 5. The current distance difference is added to the registered distance difference.

【００４３】次に、具体例を挙げて本実施例を更に詳細
に説明する。Next, this embodiment will be described in more detail with reference to specific examples.

【００４４】修正用テーブル５の内容が図３（ａ）に示
すように空である初期の状態において、帳票等に手書き
された或る文字の認識が行われ、認識手段２から図３
（ｂ）に示すような認識候補カテゴリ群および距離値が
出力されたとする。即ち、第１位候補が「巾」，第２位
候補が「ゆ」，第３位候補が「け」，第４位候補が
「や」，第５位候補が「サ」であり、それぞれの距離値
が１３００，１５００，１６００，１７００，１８００
であったとする。このとき、候補修正手段４では、修正
用テーブル５に１つも組が登録されていないので、図２
のステップＳ３で直ちに検索終了と判定されるために距
離値の修正は行われない。従って、ステップＳ１０にお
いて距離値でソートした新たな認識候補カテゴリ群およ
び距離値は図３（ｃ）に示すように図３（ｂ）と同じに
なる。このため、表示手段７には第１位候補の「巾」が
認識結果として表示され、また第２位以下の候補が表示
される（Ｓ１１）。ここで本来の入力文字が「ゆ」であ
った為にオペレータが誤読と判断し、表示手段７に表示
された第２位候補の「ゆ」を選択すると、結果訂正手段
６により認識結果が「巾」から「ゆ」に訂正される（Ｓ
１３）。また、誤読カテゴリ「巾」及びその距離値１３
００と、正解カテゴリ「ゆ」及びその距離値１５００と
がテーブル更新手段９に通知される。テーブル更新手段
９では、正解カテゴリ「ゆ」の距離値１５００から誤読
カテゴリ「巾」の距離値１３００を引いた２００を距離
差５３とし、それに誤読カテゴリ「巾」，正解カテゴリ
「ゆ」を加えた組５０を修正用テーブル５に登録する
（Ｓ１４）。この結果、修正用テーブル５の内容は図３
（ｄ）のようになる。In the initial state where the contents of the correction table 5 are empty as shown in FIG. 3 (a), certain characters handwritten on a form or the like are recognized, and
It is assumed that a recognition candidate category group and a distance value as shown in FIG. That is, the first candidate is “width”, the second candidate is “yu”, the third candidate is “ke”, the fourth candidate is “ya”, and the fifth candidate is “sa”. Distance values of 1300, 1500, 1600, 1700, 1800
Assume that At this time, in the candidate correction unit 4, since no group is registered in the correction table 5, FIG.
Since the search is immediately determined to be completed in step S3, the distance value is not corrected. Therefore, the new recognition candidate category group and the distance value sorted by the distance value in step S10 become the same as those in FIG. 3B as shown in FIG. 3C. For this reason, the "width" of the first candidate is displayed on the display means 7 as a recognition result, and the second and lower candidates are displayed (S11). Here, since the original input character is "Y", the operator determines that the reading is erroneous, and selects the second candidate "Y" displayed on the display means 7, and the result correction means 6 changes the recognition result to "Y". Is corrected from "width" to "yu" (S
13). In addition, the misread category “width” and its distance value 13
00, the correct answer category “Y” and its distance value 1500 are notified to the table updating means 9. In the table updating means 9, 200 obtained by subtracting the distance value 1300 of the misread category “width” from the distance value 1500 of the correct answer category “yu” is set as a distance difference 53, and the misread category “width” and the correct answer category “yu” are added thereto. The set 50 is registered in the correction table 5 (S14). As a result, the contents of the correction table 5 are shown in FIG.
(D).

【００４５】次に、修正用テーブル５の内容が図４
（ａ）に示す状態（図３（ｄ）と同じ）であるときに、
別の或る文字の認識が行われ、認識手段２から図４
（ｂ）に示すような認識候補カテゴリ群および距離値が
出力されたとする。即ち、第１位候補が「巾」，第２位
候補が「け」，第３位候補が「サ」，第４位候補が
「Ｈ」，第５位候補が「ゆ」であり、それぞれの距離値
が１１００，１１５０，１２００，１４００，１４００
であったとする。このとき、候補修正手段４では、第１
位候補「巾」と同じカテゴリの誤読カテゴリ「巾」を含
む組が修正用テーブル５に１つ存在するので、その組の
正解カテゴリ「ゆ」と距離差２００とに従って、認識候
補「ゆ」の距離値を１４００から１２００に変更する。
従って、ステップＳ１０で距離値でソートした新たな認
識候補カテゴリ群および距離値は図４（ｃ）に示すよう
になる。次に、表示手段７は第１位候補の「巾」を認識
結果として表示し、また第２位以下の候補も表示する
（Ｓ１１）。ここで本来の入力文字が「み」であった為
にオペレータが誤読と判断し、訂正情報入力手段８から
「み」を入力すると、結果訂正手段６は認識結果を
「巾」から「み」に訂正する（Ｓ１３）。また、誤読カ
テゴリ「巾」及びその距離値１１００と、正解カテゴリ
「み」及びその距離値とをテーブル更新手段９に通知す
る。ここで、正解カテゴリ「み」は認識候補カテゴリ群
に存在しないため、最下位候補「ゆ」の距離値１４００
に２００（＝α）を足した１６００が正解カテゴリ
「み」の距離値とされる。テーブル更新手段９は、正解
カテゴリ「み」の距離値１６００から誤読カテゴリ
「巾」の距離値１１００を引いた５００を距離差５３と
し、それに誤読カテゴリ「巾」，正解カテゴリ「み」を
加えた組５０を修正用テーブル５に登録する（Ｓ１
４）。この結果、修正用テーブル５の内容は図４（ｄ）
のようになる。Next, the contents of the correction table 5 are shown in FIG.
In the state shown in FIG. 3A (the same as FIG. 3D),
Recognition of another certain character is performed.
It is assumed that a recognition candidate category group and a distance value as shown in FIG. That is, the first candidate is “width”, the second candidate is “ke”, the third candidate is “sa”, the fourth candidate is “H”, and the fifth candidate is “yu”. Distance values of 1100, 1150, 1200, 1400, 1400
Assume that At this time, the candidate correction means 4
Since there is one set in the correction table 5 containing the misread category “width” of the same category as the rank candidate “width”, the recognition candidate “yu” is determined according to the correct answer category “yu” and the distance difference 200 of that set. The distance value is changed from 1400 to 1200.
Therefore, the new recognition candidate category group and the distance values sorted by the distance values in step S10 are as shown in FIG. Next, the display means 7 displays the "width" of the first candidate as a recognition result, and also displays the second and lower candidates (S11). Here, since the original input character is "mi", the operator determines that the reading is erroneous and inputs "mi" from the correction information input means 8, and the result correction means 6 changes the recognition result from "width" to "mi". (S13). Further, the table updating unit 9 is notified of the misread category “width” and its distance value 1100 and the correct answer category “mi” and its distance value. Here, since the correct category “mi” does not exist in the recognition candidate category group, the distance value 1400 of the lowest candidate “yu” is set.
Is added to 200 (= α), and 1600 is set as the distance value of the correct answer category “mi”. The table updating means 9 subtracts the distance value 1100 of the misread category "width" from the distance value 1600 of the correct answer category "mi" to obtain 500 as the distance difference 53, and adds the misread category "width" and the correct answer category "mi". The set 50 is registered in the correction table 5 (S1).
4). As a result, the contents of the correction table 5 are as shown in FIG.
become that way.

【００４６】次に、修正用テーブル５の内容が図５
（ａ）に示す状態（図４（ｄ）と同じ）であるときに、
別の或る文字（ここでは、「ゆ」，それも前回の「ゆ」
に比べてより「ゆ」らしく手書きされたものを想定して
いる）の認識が行われ、認識手段２から図５（ｂ）に示
すような認識候補カテゴリ群および距離値が出力された
とする。即ち、第１位候補が「巾」，第２位候補が
「ゆ」，第３位候補が「け」，第４位候補が「サ」，第
５位候補が「や」であり、それぞれの距離値が１２０
０，１３５０，１５００，１８００，１９００であった
とする。このとき、候補修正手段４では、第１位候補
「巾」と同じカテゴリの誤読カテゴリ「巾」を含む組が
修正用テーブル５に２つ存在するので、距離値の修正を
行う。先ず、１つの組の正解カテゴリ「ゆ」と距離差２
００とに従って、認識候補「ゆ」の距離値を１３５０か
ら１１５０に変更する。次に、残りの１つの組の正解カ
テゴリ「み」と距離差５００とに従って、認識候補
「み」を追加し、その距離値を１６００（＝１９００＋
２００−５００）とする。従って、ステップＳ１０で距
離値でソートした新たな認識候補カテゴリ群および距離
値は図５（ｃ）に示すようになる。次に、表示手段７は
第１位候補の「ゆ」を認識結果として表示する。Next, the contents of the correction table 5 are shown in FIG.
In the state shown in FIG. 4A (the same as FIG. 4D),
Another certain character (here, “Yu”, which is also the previous “Yu”)
(It is assumed that the handwriting is more hand-drawn compared to the above.), And the recognition unit 2 outputs a recognition candidate category group and a distance value as shown in FIG. That is, the first candidate is “width”, the second candidate is “yu”, the third candidate is “ke”, the fourth candidate is “sa”, and the fifth candidate is “ya”. Distance value of 120
0, 1350, 1500, 1800, and 1900. At this time, the candidate correction unit 4 corrects the distance value because two sets including the misread category “width” of the same category as the first candidate “width” exist in the correction table 5. First, one set of correct answer category “Y” and distance difference 2
00, the distance value of the recognition candidate “Y” is changed from 1350 to 1150. Next, a recognition candidate "mi" is added according to the remaining one set of the correct answer category "mi" and the distance difference 500, and the distance value is set to 1600 (= 1900 +
200-500). Therefore, the new recognition candidate category group and the distance values sorted by the distance values in step S10 are as shown in FIG. Next, the display means 7 displays the first candidate “Y” as a recognition result.

【００４７】このように、距離差が２００のように小さ
い条件で誤読し、同じカテゴリの組「巾，ゆ」が同様に
距離差が１５０のように小さい条件で再度生じた場合
は、以前に正解として訂正されたカテゴリ「ゆ」が上位
にくる。換言すれば、組の登録時に認識対象とした文字
に比べ、よりその文字に似ている文字であれば第１位候
補となり得ることを示している。As described above, when the misreading is performed under the condition that the distance difference is as small as 200, and the group of the same category “width, yu” is again generated under the condition that the distance difference is as small as 150, The category "Yu" corrected as the correct answer comes to the top. In other words, it indicates that any character that is more similar to the character that was recognized when the set was registered may be the first candidate.

【００４８】次に、修正用テーブル５の内容が図６
（ａ）に示す状態（図４（ｄ），図５（ａ）と同じ）で
あるときに、別の或る文字（ここでは、「巾」を想定し
ている）の認識が行われ、認識手段２から図６（ｂ）に
示すような認識候補カテゴリ群および距離値が出力され
たとする。即ち、第１位候補が「巾」，第２位候補が
「け」，第３位候補が「サ」，第４位候補が「ゆ」，第
５位候補が「や」であり、それぞれの距離値が１２０
０，１３５０，１４００，１６５０，１８００であった
とする。このとき、候補修正手段４では、第１位候補
「巾」と同じカテゴリの誤読カテゴリ「巾」を含む組が
修正用テーブル５に２つ存在するので、距離値の修正を
行う。先ず、１つの組の正解カテゴリ「ゆ」と距離差２
００とに従って、認識候補「ゆ」の距離値を１６５０か
ら１４５０に変更する。次に、残りの１つの組の正解カ
テゴリ「み」と距離値５００とに従って、認識候補
「み」を追加し、その距離値を１５００（＝１８００＋
２００−５００）とする。従って、ステップＳ１０で距
離値でソートした新たな認識候補カテゴリ群および距離
値は図６（ｃ）に示すようになる。次に、表示手段７は
第１位候補の「巾」を認識結果として表示する。Next, the contents of the correction table 5 are shown in FIG.
In the state shown in (a) (the same as in FIGS. 4 (d) and 5 (a)), another certain character (here, "width" is assumed) is recognized, Suppose that a group of recognition candidate categories and distance values as shown in FIG. That is, the first candidate is “width”, the second candidate is “ke”, the third candidate is “sa”, the fourth candidate is “yu”, and the fifth candidate is “ya”, respectively. Distance value of 120
0, 1350, 1400, 1650, 1800. At this time, the candidate correction unit 4 corrects the distance value because two sets including the misread category “width” of the same category as the first candidate “width” exist in the correction table 5. First, one set of correct answer category “Y” and distance difference 2
In accordance with 00, the distance value of the recognition candidate “Y” is changed from 1650 to 1450. Next, a recognition candidate "mi" is added according to the remaining one set of the correct answer category "mi" and the distance value 500, and the distance value is set to 1500 (= 1800 +
200-500). Accordingly, the new recognition candidate category group and the distance values sorted by the distance values in step S10 are as shown in FIG. Next, the display means 7 displays the “width” of the first candidate as a recognition result.

【００４９】このように、距離差が２００のように小さ
い条件で誤読し、同じカテゴリの組「巾，ゆ」が距離差
が４５０のように大きい条件で再度生じた場合は、認識
手段２による認識結果が維持される。As described above, when the erroneous reading is performed under the condition that the distance difference is as small as 200, and the pair “width, yu” of the same category occurs again under the condition that the distance difference is as large as 450, the recognition means 2 The recognition result is maintained.

【００５０】次に、修正用テーブル５の内容が図７
（ａ）に示す状態（図４（ｄ）と同じ）であるときに、
別の或る文字（ここでは、「み」、それも前回の「み」
に比べて多少「み」らしく手書きされたものを想定して
いる）の認識が行われ、認識手段２から図７（ｂ）に示
すような認識候補カテゴリ群および距離値が出力された
とする。即ち、第１位候補が「巾」，第２位候補が
「け」，第３位候補が「サ」，第４位候補が「Ｈ」，第
５位候補が「や」であり、それぞれの距離値が１２０
０，１２５０，１３００，１４００，１４５０であった
とする。このとき、候補修正手段４では、第１位候補
「巾」と同じカテゴリの誤読カテゴリ「巾」を含む組が
修正用テーブル５に２つ存在するので、距離値の修正を
行う。先ず、１つの組の正解カテゴリ「ゆ」と距離差２
００とに従って、認識候補「ゆ」を追加し、その距離値
を１４５０（＝１４５０＋２００−２００）とする。次
に、残りの１つの組の正解カテゴリ「み」と距離値５０
０とに従って、認識候補「み」を追加し、その距離値を
１１５０（＝１４５０＋２００−５００）とする。従っ
て、ステップＳ１０で距離値でソートした新たな認識候
補カテゴリ群および距離値は図７（ｃ）に示すようにな
る。次に、表示手段７は第１位候補の「み」を認識結果
として表示する。Next, the contents of the correction table 5 are shown in FIG.
In the state shown in FIG. 4A (the same as FIG. 4D),
Another certain character (here, "mi", which is also the previous "mi"
(It is assumed that the handwriting is slightly more "compared to handwriting"), and a recognition candidate category group and a distance value as shown in FIG. That is, the first candidate is "width", the second candidate is "ke", the third candidate is "sa", the fourth candidate is "H", and the fifth candidate is "ya", respectively. Distance value of 120
0, 1250, 1300, 1400, 1450. At this time, the candidate correction unit 4 corrects the distance value because two sets including the misread category “width” of the same category as the first candidate “width” exist in the correction table 5. First, one set of correct answer category “Y” and distance difference 2
00, the recognition candidate “Y” is added, and the distance value is set to 1450 (= 1450 + 200−200). Next, the remaining one set of the correct answer category “mi” and the distance value 50
According to 0, the recognition candidate “mi” is added, and the distance value is set to 1150 (= 1450 + 200−500). Therefore, the new recognition candidate category group and the distance values sorted by the distance values in step S10 are as shown in FIG. 7C. Next, the display unit 7 displays the first candidate “mi” as a recognition result.

【００５１】このように、距離差が５００のように大き
い条件で誤読し、同じカテゴリの組「巾，み」が再度出
現した場合は、例え距離差が大きくても下位の候補が上
位にくることもあり得る。As described above, when the misreading is performed under the condition that the distance difference is as large as 500 and the pair of the same category “width, mi” appears again, even if the distance difference is large, the lower candidate becomes higher. It is possible.

【００５２】次に、修正用テーブル５の内容が図８
（ａ）に示す状態（図４（ｄ）と同じ）であるときに、
別の或る文字（ここでは、「ゆ」、それも前記訂正時の
「ゆ」に比べて更に「ゆ」らしくない手書きされたもの
を想定している）の認識が行われ、認識手段２から図８
（ｂ）に示すような認識候補カテゴリ群および距離値が
出力されたとする。即ち、第１位候補が「巾」，第２位
候補が「ゆ」，第３位候補が「け」，第４位候補が
「や」，第５位候補が「サ」であり、それぞれの距離値
が１３００，１５５０，１６００，１７００，１８００
であったとする。このとき、候補修正手段４では、第１
位候補「巾」と同じカテゴリの誤読カテゴリ「巾」を含
む組が修正用テーブル５に２つ存在するので、距離値の
修正を行う。先ず、１つの組の正解カテゴリ「ゆ」と距
離差２００とに従って、認識候補「ゆ」の距離値を１５
５０から１３５０に変更する。次に、残りの１つの組の
正解カテゴリ「み」と距離値５００とに従って、認識候
補「み」を追加し、その距離値を１５００（＝１８００
＋２００−５００）とする。従って、ステップＳ１０で
距離値でソートした新たな認識候補カテゴリ群および距
離値は図８（ｃ）に示すようになる。次に、表示手段７
は第１位候補の「巾」を認識結果として表示する。ここ
で本来の文字が「ゆ」であった為にオペレータが誤読と
判断し、表示手段７に表示された第２位候補の「ゆ」を
選択すると、結果訂正手段６により認識結果が「巾」か
ら「ゆ」に訂正される（Ｓ１３）。また、誤読カテゴリ
「巾」及びその距離値１３００と、正解カテゴリ「ゆ」
及びその距離値１３５０とがテーブル更新手段９に通知
される。テーブル更新手段９では、正解カテゴリ「ゆ」
の距離値１３５０から誤読カテゴリ「巾」の距離値１３
００を引いた５０を距離差５３とし、それに誤読カテゴ
リ「巾」，正解カテゴリ「ゆ」を加えた組５０を修正用
テーブル５に登録する（Ｓ１４）。このとき、修正用テ
ーブル５には同じ誤読カテゴリ「巾」，同じ正解カテゴ
リ「ゆ」を持つ既登録の組｛巾，ゆ，２００｝が存在す
るので、その距離差２００に今回の距離差５０が加算さ
れる。この結果、修正用テーブル５の内容は図８（ｄ）
のようになる。なお、同じ誤読カテゴリ，正解カテゴリ
を持つ既登録の組は削除せずに、それとは別に今回のも
のを登録するようにしてもよい。即ち、前述の例で言え
ば、組｛巾，ゆ，２００｝を残したまま、組｛巾，ゆ，
５０｝を追加登録するようにしてもよい。Next, the contents of the correction table 5 are shown in FIG.
In the state shown in FIG. 4A (the same as FIG. 4D),
Recognition of another certain character (here, “Y”, which is also assumed to be handwritten that does not seem more “Y” than the “Y” at the time of the correction) is performed, and the recognition unit 2 From FIG. 8
It is assumed that a recognition candidate category group and a distance value as shown in FIG. That is, the first candidate is “width”, the second candidate is “yu”, the third candidate is “ke”, the fourth candidate is “ya”, and the fifth candidate is “sa”. Distance values of 1300, 1550, 1600, 1700, 1800
Assume that At this time, the candidate correction means 4
Since two sets including the misread category “width” of the same category as the rank candidate “width” exist in the correction table 5, the distance value is corrected. First, the distance value of the recognition candidate “Y” is set to 15 according to one set of the correct answer category “Y” and the distance difference 200.
Change from 50 to 1350. Next, a recognition candidate "mi" is added according to the remaining one set of the correct category "mi" and the distance value 500, and the distance value is set to 1500 (= 1800).
+ 200-500). Therefore, the new recognition candidate category group and the distance value sorted by the distance value in step S10 are as shown in FIG. 8C. Next, display means 7
Displays the "width" of the first candidate as a recognition result. Here, since the original character is “Y”, the operator determines that the reading is erroneous, and selects the second candidate “Y” displayed on the display means 7. Is corrected to "Y" (S13). In addition, the misread category “width” and its distance value 1300 and the correct answer category “yu”
And the distance value 1350 are notified to the table updating means 9. In the table updating means 9, the correct answer category "Y"
Distance value 13 of misread category "width" from distance value 1350 of
A set 50 in which 50 obtained by subtracting 00 is added to the distance difference 53 and the misread category “width” and the correct answer category “yu” is added to the correction table 5 (S14). At this time, since the registered table {width, yu, 200} having the same misread category "width" and the same correct category "yu" exists in the correction table 5, the distance difference 200 is added to the current distance difference 50. Is added. As a result, the contents of the correction table 5 are as shown in FIG.
become that way. Note that a registered group having the same misread category and correct category may not be deleted, but the current group may be registered separately. That is, in the above example, the pair {width, yu, 200} is left while leaving the pair {width, yu, 200}.
50 ° may be additionally registered.

【００５３】このように、距離差が２００のように小さ
い条件で誤読し、同じカテゴリの組「巾，ゆ」がそれよ
り大きい距離差で現れた場合は、以前に正解として訂正
されたカテゴリ「ゆ」が必ずしも第１位候補にはなり得
ない。しかし、再度訂正することにより修正用テーブル
５中の距離差が変更されるため、吸収できる距離差を徐
々に増大していくことができる。従って、次の例に示す
ように、先の「ゆ」に比べて多少なりとも「ゆ」らしい
「ゆ」の読み取りでは、再び「ゆ」が認識結果として出
力される。As described above, if the misreading is performed under the condition that the distance difference is as small as 200, and the same category pair “width, yu” appears with a larger distance difference, the category “corrected as a correct answer previously” “Yu” cannot necessarily be the first candidate. However, since the distance difference in the correction table 5 is changed by performing the correction again, the distance difference that can be absorbed can be gradually increased. Therefore, as shown in the following example, in the case of reading “Y” that is more or less “Y” compared to the previous “Y”, “Y” is output again as a recognition result.

【００５４】即ち、修正用テーブル５の内容が図９
（ａ）に示す状態（図８（ｄ）と同じ）であるときに、
図８で想定した「ゆ」に比べて多少なりとも「ゆ」らし
い「ゆ」の認識が行われ、認識手段２から図９（ｂ）に
示すような認識候補カテゴリ群および距離値が出力され
たとする。即ち、第１位候補が「巾」，第２位候補が
「ゆ」，第３位候補が「け」，第４位候補が「や」，第
５位候補が「サ」であり、それぞれの距離値が１３０
０，１５００，１６００，１７００，１８００であった
とする。このとき、候補修正手段４では、第１位候補
「巾」と同じカテゴリの誤読カテゴリ「巾」を含む組が
修正用テーブル５に２つ存在するので、距離値の修正を
行う。先ず、１つの組の正解カテゴリ「ゆ」と距離値２
５０とに従って、認識候補「ゆ」の距離値を１５００か
ら１２５０に変更する。次に、残りの１つの組の正解カ
テゴリ「み」と距離値５００とに従って、認識候補
「み」を追加し、その距離値を１５００（＝１８００＋
２００−５００）とする。従って、ステップＳ１０で距
離値でソートした新たな認識候補カテゴリ群および距離
値は図９（ｃ）に示すようになり、第１位候補の「ゆ」
が認識結果として表示される。That is, the contents of the correction table 5 are shown in FIG.
In the state shown in FIG. 8A (the same as FIG. 8D),
Recognition of “Y” that is more or less “Y” compared to “Y” assumed in FIG. 8 is performed, and a recognition candidate category group and a distance value as shown in FIG. Suppose. That is, the first candidate is “width”, the second candidate is “yu”, the third candidate is “ke”, the fourth candidate is “ya”, and the fifth candidate is “sa”. Is 130
It is assumed that they are 0, 1500, 1600, 1700, and 1800. At this time, the candidate correction unit 4 corrects the distance value because two sets including the misread category “width” of the same category as the first candidate “width” exist in the correction table 5. First, one set of correct answer category “Y” and distance value 2
According to 50, the distance value of the recognition candidate “Y” is changed from 1500 to 1250. Next, a recognition candidate "mi" is added according to the remaining one set of the correct answer category "mi" and the distance value 500, and the distance value is set to 1500 (= 1800 +
200-500). Accordingly, the new recognition candidate category group and the distance value sorted by the distance value in step S10 are as shown in FIG. 9C, and the first candidate "Y"
Is displayed as a recognition result.

【００５５】図１０は図１の実施例の別の動作フローチ
ャートである。この動作フローチャートが図２のものと
相違するところは、ステップＳ１５において第１位候補
カテゴリを保持し、図２のステップＳ２〜Ｓ１０の処理
（ステップＳ１６）の後、第１位候補カテゴリがステッ
プＳ１５において保持した候補カテゴリと異なるかチェ
ックし（ステップＳ１７）、異なる場合（Ｓ１７でＹＥ
Ｓ）には再度ステップＳ２〜Ｓ１０の処理を行う（ステ
ップＳ１８）点にある。図１の候補修正手段４は、図１
０のステップＳ１５〜Ｓ１８の処理を実行する。FIG. 10 is another operation flowchart of the embodiment of FIG. This operation flowchart differs from that of FIG. 2 in that the first-rank candidate category is held in step S15, and after the processing of steps S2 to S10 in FIG. 2 (step S16), the first-rank candidate category is changed to step S15. It is checked whether it is different from the candidate category held in (step S17), and if different (YE in S17)
The point of S) is that the processes of steps S2 to S10 are performed again (step S18). The candidate correcting means 4 of FIG.
0, the processes of steps S15 to S18 are executed.

【００５６】次に、具体例を挙げて更に詳細に説明す
る。Next, a more specific example will be described.

【００５７】修正用テーブル５の内容が図１１（ａ）に
示す状態（図４（ａ）と同じ）であるときに、別の或る
文字（ここでは「巾」を想定している）の認識が行わ
れ、認識手段２から図１１（ｂ）に示すような認識候補
カテゴリ群および距離値が出力されたとする。即ち、第
１位候補が「巾」，第２位候補が「ゆ」，第３位候補が
「サ」，第４位候補が「Ｈ」，第５位候補が「や」であ
り、それぞれの距離値が１１００，１１５０，１２０
０，１４００，１４００であったとする。まず、候補修
正手段４では、第１位候補「巾」を記憶しておく（ステ
ップＳ１５）。次に、第１位候補「巾」と同じカテゴリ
の誤読カテゴリ「巾」を含む組が修正用テーブル５に１
つ存在するので、その組の正解カテゴリ「ゆ」と距離差
２００とに従って、認識候補「ゆ」の距離値を１１５０
から９５０に変更する。従って、新たな認識候補カテゴ
リ群および距離値は図１１（ｃ）に示すようになる（ス
テップＳ１６）。次に、新しい第１位候補「ゆ」が記憶
した候補（この場合「巾」）と異なるかチェックする
（ステップＳ１７）。この場合異なるため、第１位候補
「ゆ」と同じカテゴリの誤読カテゴリを含む組を修正用
テーブル５に探すが、存在しないため修正を行わない
（ステップＳ１８）。次に、表示手段７は第１位候補の
「ゆ」を認識結果として表示し、また第２位以下の候補
も表示する（Ｓ１１）。ここで本来の入力文字が「巾」
であった為にオペレータが誤読と判断し、訂正情報入力
手段８から「巾」を入力すると、結果訂正手段６は認識
結果を「ゆ」から「巾」に訂正する（Ｓ１３）。テーブ
ル更新手段９は、誤読カテゴリ「ゆ」，正解カテゴリ
「巾」，距離差１５０（＝１１００−９５０）を修正用
テーブル５に登録する（Ｓ１４）。この結果、修正用テ
ーブル５の内容は図１１（ｄ）のようになる。When the contents of the correction table 5 are in the state shown in FIG. 11A (the same as FIG. 4A), another certain character (here, "width" is assumed) is used. It is assumed that the recognition has been performed, and a recognition candidate category group and a distance value as shown in FIG. That is, the first candidate is “width”, the second candidate is “yu”, the third candidate is “sa”, the fourth candidate is “H”, and the fifth candidate is “ya”. Distance values of 1100, 1150, 120
It is assumed that the values are 0, 1400, and 1400. First, the candidate correcting means 4 stores the first candidate "width" (step S15). Next, a set including the misread category “width” of the same category as the first candidate “width” is added to the correction table 5 by one.
Therefore, the distance value of the recognition candidate “Y” is set to 1150 according to the correct answer category “Y” of the group and the distance difference 200.
To 950. Therefore, the new recognition candidate category group and the distance value are as shown in FIG. 11C (step S16). Next, it is checked whether the new first-place candidate “Y” is different from the stored candidate (in this case, “width”) (step S17). In this case, because the difference is different, a set including the misread category of the same category as the first candidate “Y” is searched in the correction table 5, but is not corrected because it does not exist (step S18). Next, the display means 7 displays the first candidate "Y" as a recognition result, and also displays the second and lower candidates (S11). Here, the original input character is "width"
Therefore, when the operator determines that the reading is erroneous and inputs "width" from the correction information input means 8, the result correcting means 6 corrects the recognition result from "y" to "width" (S13). The table updating means 9 registers the misread category “Y”, the correct answer category “width”, and the distance difference 150 (= 1100−950) in the correction table 5 (S14). As a result, the contents of the correction table 5 are as shown in FIG.

【００５８】次に、修正用テーブル５の内容が図１２
（ａ）に示す状態（図１１（ｄ）と同じ）であるとき
に、別の或る文字（ここでは、「巾」を想定している）
の認識が行われ、認識手段２から図１２（ｂ）に示すよ
うな認識候補カテゴリ群および距離値が出力されたとす
る。即ち、第１位候補が「巾」，第２位候補が「ゆ」，
第３位候補が「け」，第４位候補が「サ」，第５位候補
が「や」であり、それぞれの距離値が１２００，１３５
０，１５００，１８００，１９００であったとする。ま
ず、候補修正手段４では、第１位候補「巾」を記憶して
おく（ステップＳ１５）。次に、第１位候補「巾」と同
じカテゴリの誤読カテゴリ「巾」を含む組が修正用テー
ブル５に１つ存在するので、距離差２００に従って認識
候補「ゆ」の距離値を１３５０から１１５０に変更す
る。従って、新たな認識候補カテゴリ群および距離値は
図１２（ｃ）に示すようになる（ステップＳ１６）。次
に、新しい第１位候補「ゆ」が記憶した候補（この場合
「巾」）と異なるかチェックする（ステップＳ１７）。
この場合異なるため、第１位候補「ゆ」と同じカテゴリ
の誤読カテゴリを含む組を修正用テーブル５に探し、１
つ存在するので、距離差１５０に従って認識候補「巾」
の距離値を１２００から１０５０に変更する。従って、
新たな認識候補カテゴリ群および距離値は図１２（ｄ）
に示すようになる（ステップＳ１８）。Next, the contents of the correction table 5 are shown in FIG.
In the state shown in FIG. 11A (same as FIG. 11D), another certain character (here, “width” is assumed)
Is recognized, and a recognition candidate category group and a distance value as shown in FIG. That is, the first candidate is “width”, the second candidate is “yu”,
The third candidate is “ke”, the fourth candidate is “sa”, the fifth candidate is “ya”, and the respective distance values are 1200 and 135.
It is assumed that they are 0, 1500, 1800, and 1900. First, the candidate correcting means 4 stores the first candidate "width" (step S15). Next, since there is one set including the misread category “width” of the same category as the first candidate “width” in the correction table 5, the distance value of the recognition candidate “yu” is changed from 1350 to 1150 according to the distance difference 200. Change to Accordingly, the new recognition candidate category group and the distance value are as shown in FIG. 12C (step S16). Next, it is checked whether the new first-place candidate “Y” is different from the stored candidate (in this case, “width”) (step S17).
In this case, since a difference is found, a set including an erroneous reading category of the same category as the first candidate “Y” is searched in the correction table 5, and 1
Since there are two, the recognition candidate “width” according to the distance difference 150
Is changed from 1200 to 1050. Therefore,
The new recognition candidate category group and the distance value are shown in FIG.
(Step S18).

【００５９】このように、誤読しやすい組「ゆ」「巾」
の両方を何度も認識する場合は、１回の修正処理では互
いに逆の文字に誤読されやすくなるが、２回修正処理を
行うと、最初の認識処理の結果が重視され、認識率を逆
に低下させてしまう現象を防ぐことができる。As described above, the set “yu” and “width” that are easily misread.
If both are recognized many times, it is easy for misreading to be the opposite character in one correction process. However, when the correction process is performed twice, the result of the first recognition process is emphasized and the recognition rate is reversed. Can be prevented.

【００６０】図１３は本発明を適用した文字認識装置の
別の実施例のブロック図である。この実施例の文字認識
装置が図１の実施例のものと相違するところは、修正用
情報として、距離差５３と認識用特徴５４とを用いる点
にある。この相違に伴って、認識手段２’，候補修正手
段４’，修正用テーブル５’，テーブル更新手段９’は
図１に示される対応する手段に対して、以下のように変
更されている。FIG. 13 is a block diagram of another embodiment of the character recognition apparatus to which the present invention is applied. The character recognition device of this embodiment differs from that of the embodiment of FIG. 1 in that a distance difference 53 and a recognition feature 54 are used as correction information. Along with this difference, the recognition means 2 ', the candidate correction means 4', the correction table 5 ', and the table updating means 9' are modified as follows with respect to the corresponding means shown in FIG.

【００６１】修正用テーブル５’は、誤読カテゴリ５１
と正解カテゴリ５２と距離差５３と認識用特徴５４とを
含む組５０’を複数組格納し得る容量を有している。The correction table 5 'contains the misread category 51
, A correct category 52, a distance difference 53, and a recognition feature 54.

【００６２】認識手段２’は、データ入力手段１から入
力された文字イメージ画像から認識用特徴を抽出し、認
識辞書３に予め記憶されている文字カテゴリ毎の標準特
徴と比較して、距離値が小さい上位複数個（例えば第５
位までの５個）の認識候補カテゴリと、それら各々の距
離値とを求めて候補修正手段４’に出力すると共に、文
字イメージ画像から抽出した認識特徴をテーブル更新手
段９’に出力する。The recognizing means 2 ′ extracts a feature for recognition from the character image image input from the data input means 1, compares it with standard features for each character category stored in advance in the recognition dictionary 3, Are smaller (for example, 5th
(5) to the recognition candidate categories and their respective distance values are output to the candidate correcting means 4 ', and the recognition features extracted from the character image are output to the table updating means 9'.

【００６３】候補修正手段４’は、前記式に代えて、
下記の式により、距離値を修正する。The candidate correcting means 4 'replaces the above equation with
The distance value is corrected by the following equation.

【００６４】Ｄ’＝Ｄ−ｗ・ｓ・ａ／ｄ … ここで、Ｄ’は修正後の距離値，Ｄは修正前の距離値，
ｗは重み係数，ｓは修正情報中の距離差，ｄは修正用情
報中の認識用特徴と今回の文字イメージ画像から抽出し
た認識用特徴との距離値，ａは距離値ｄを正規化する定
数である基準距離値である。認識用特徴間の距離値ｄの
計算は、認識手段２’における認識辞書３の各カテゴリ
の標準特徴と入力文字イメージ画像から得られる認識用
特徴との距離計算と同じでよく、例えば、ユークリッド
距離や市街区距離を用いることができる。D ′ = D−w · s · a / d where D ′ is the distance value after correction, D is the distance value before correction,
w is a weighting factor, s is the distance difference in the correction information, d is the distance value between the recognition feature in the correction information and the recognition feature extracted from the current character image, and a is the distance value d. This is a reference distance value that is a constant. The calculation of the distance value d between the features for recognition may be the same as the calculation of the distance between the standard feature of each category of the recognition dictionary 3 and the features for recognition obtained from the input character image image in the recognition means 2 ', for example, the Euclidean distance. Or city block distance.

【００６５】テーブル更新手段９’は、結果訂正手段６
から誤読カテゴリ及びその距離値と正解カテゴリ及びそ
の距離値とが通知されたとき、図１のテーブル更新手段
９と同様に距離差５３を求め、この求めた距離差５３
に、認識手段２’から入力されている認識用特徴を追加
して修正用情報を作成し、これに更に誤読カテゴリと正
解カテゴリとを付加して、修正用テーブル５’に登録す
べき組５０’を生成する。The table updating means 9 'is provided with the result correcting means 6
Is notified of the misread category and its distance value and the correct answer category and its distance value, the distance difference 53 is obtained in the same manner as the table updating means 9 in FIG.
In addition, the information for recognition input from the recognizing means 2 'is added to create correction information, and the misreading category and the correct answer category are further added to the correction information. 'Is generated.

【００６６】なお、修正用テーブル５’に登録する認識
用特徴は、認識用特徴を圧縮した圧縮特徴にしてもよ
い。こうすることにより、修正用テーブル５’の必要容
量を削減することができる。The recognition feature registered in the correction table 5 'may be a compressed feature obtained by compressing the recognition feature. By doing so, the required capacity of the correction table 5 'can be reduced.

【００６７】本実施例の文字認識装置のその他の機能お
よび動作は図１の文字認識装置と同じである。本実施例
では、修正用テーブル５’の認識用特徴５４と入力文字
から得られる認識用特徴が近く、その結果距離値ｄが小
さい場合、距離値Ｄが大きく減少するため、対応する候
補が上位に来やすくなる。逆に、修正用テーブル５’の
認識用特徴５４と入力文字から得られる認識用特徴とが
遠く、その結果距離値ｄが大きい場合は、距離値Ｄはあ
まり減少しないため、対応する候補の順位はあまり上が
らない特性を示す。Other functions and operations of the character recognition device of this embodiment are the same as those of the character recognition device of FIG. In the present embodiment, when the recognition feature 54 of the correction table 5 'and the recognition feature obtained from the input character are close to each other and the distance value d is small, the distance value D greatly decreases. It is easy to come to. Conversely, when the recognition feature 54 of the correction table 5 ′ is far from the recognition feature obtained from the input character, and as a result the distance value d is large, the distance value D does not decrease so much. Indicates a characteristic that does not increase very much.

【００６８】以上本発明の実施例について説明したが、
本発明は以上の実施例にのみ限定されずその他各種の付
加変更が可能である。例えば、データ入力手段で得られ
る入力データは、スキャナ等により入力される文字イメ
ージ画像としたが、タブレットにより入力されるオンラ
イン文字データや、マイクにより入力される音声データ
であってもよい。また、修正用テーブルに記憶する誤読
カテゴリは、認識辞書のテンプレート番号でもよい。テ
ンプレート番号にすると、認識辞書に同一カテゴリのテ
ンプレートを複数保持する場合にテンプレートを区別す
ることができる。The embodiments of the present invention have been described above.
The present invention is not limited to the above embodiments, and various other additions and changes are possible. For example, the input data obtained by the data input unit is a character image image input by a scanner or the like, but may be online character data input by a tablet or voice data input by a microphone. The misread category stored in the correction table may be a template number of the recognition dictionary. If the template number is used, the template can be distinguished when a plurality of templates of the same category are stored in the recognition dictionary.

【００６９】[0069]

【発明の効果】以上説明したように本発明によれば、オ
ペレータの認識結果訂正情報を以降の認識処理時におけ
る距離値の修正に利用し、一度誤読したことのあるカテ
ゴリが認識手段の出力に第１位候補として出現した場合
に、修正用テーブルに登録された正解カテゴリと一致す
る認識候補カテゴリの距離値を適正に減じて、第１位候
補に躍り出る可能性を残したので、第１位候補が正解と
なる認識率を従来のものに比べて向上することができ
る。As described above, according to the present invention, the recognition result correction information of the operator is used for correcting the distance value in the subsequent recognition processing, and the category which has been misread once is output to the output of the recognition means. When appearing as the first candidate, the distance value of the recognition candidate category that matches the correct category registered in the correction table is appropriately reduced, leaving the possibility of jumping to the first candidate. The recognition rate at which the candidate becomes a correct answer can be improved as compared with the conventional one.

【００７０】また、修正処理により第１位候補が修正さ
れた場合に再度修正処理を行うことにより、誤読カテゴ
リと正解カテゴリが相互に登録された文字の組に対して
は、元の認識結果を重視して認識率の低下が生じる可能
性を少なくしたので、認識率を従来に比べて向上させる
ことができる。When the first candidate is corrected by the correction processing, the correction processing is performed again, so that the original recognition result is obtained for the character set in which the misread category and the correct category are mutually registered. Since the possibility of lowering the recognition rate is reduced with emphasis, the recognition rate can be improved as compared with the conventional case.

[Brief description of the drawings]

【図１】本発明を適用した文字認識装置の一実施例のブ
ロック図である。FIG. 1 is a block diagram of an embodiment of a character recognition device to which the present invention is applied.

【図２】図１の実施例の文字認識装置の動作手順を示す
フローチャートである。FIG. 2 is a flowchart showing an operation procedure of the character recognition device of the embodiment of FIG.

【図３】図１の実施例の動作説明図であって、誤読の訂
正が行われたことにより初期状態の修正用テーブルに修
正用情報を含む１つ目の組が登録された状況を示す図で
ある。FIG. 3 is an explanatory diagram of the operation of the embodiment of FIG. 1 and shows a situation in which a first set including correction information is registered in a correction table in an initial state due to correction of misreading; FIG.

【図４】図１の実施例の動作説明図であって、再び誤読
の訂正が行われたことにより修正用テーブルに修正用情
報を含む２つ目の組が登録される状況を示す図である。FIG. 4 is an operation explanatory diagram of the embodiment of FIG. 1, showing a situation where a second set including correction information is registered in the correction table due to correction of misreading again; is there.

【図５】図１の実施例の動作説明図であって、修正用テ
ーブルに登録された修正用情報を含む組によって正解カ
テゴリが導き出せた例を示す図である。FIG. 5 is an explanatory diagram of the operation of the embodiment of FIG. 1, showing an example in which a correct category can be derived by a set including correction information registered in a correction table.

【図６】図１の実施例の動作説明図であって、修正用テ
ーブルに登録された修正用情報を含む組による影響を受
けずに正解カテゴリが導き出せた例を示す図である。FIG. 6 is an explanatory diagram of the operation of the embodiment of FIG. 1, showing an example in which a correct answer category can be derived without being affected by a set including correction information registered in a correction table.

【図７】図１の実施例の動作説明図であって、修正用テ
ーブルに登録された修正用情報を含む組によって正解カ
テゴリが導き出せた他の例を示す図である。FIG. 7 is an operation explanatory diagram of the embodiment of FIG. 1, showing another example in which a correct answer category can be derived by a set including correction information registered in a correction table.

【図８】図１の実施例の動作説明図であって、修正用テ
ーブルに登録された修正用情報を含む組によっても正解
カテゴリが導き出せなかった例と、その為に再び訂正を
行って新たな修正用情報を含む組を修正用テーブルに登
録した状況を示す図である。FIG. 8 is an explanatory diagram of the operation of the embodiment of FIG. 1, in which the correct category cannot be derived even by the set including the correction information registered in the correction table, FIG. 8 is a diagram showing a situation in which a set including important correction information is registered in a correction table.

【図９】図１の実施例の動作説明図であって、再度の訂
正によって、正解カテゴリが導き出せた例を示す図であ
る。9 is an explanatory diagram of the operation of the embodiment of FIG. 1, showing an example in which a correct category can be derived by another correction.

【図１０】図１の実施例の文字認識装置の動作手順を示
す別のフローチャートである。FIG. 10 is another flowchart showing an operation procedure of the character recognition device of the embodiment of FIG. 1;

【図１１】図１の実施例の図１０のフローによる動作説
明図であって、誤読カテゴリと正解カテゴリが逆になる
組が修正用テーブルに登録される状況を示す図である。11 is an explanatory diagram of the operation of the embodiment of FIG. 1 according to the flow of FIG. 10, and shows a situation in which a set in which the misread category and the correct category are reversed is registered in a correction table.

【図１２】図１の実施例の図１０のフローによる動作説
明図であって、２回の修正処理によって正解カテゴリが
導き出せた例を示す図である。12 is an operation explanatory diagram of the embodiment of FIG. 1 according to the flow of FIG. 10, and is a diagram showing an example in which a correct answer category can be derived by two correction processes.

【図１３】本発明を適用した文字認識装置の別の実施例
のブロック図である。FIG. 13 is a block diagram of another embodiment of the character recognition device to which the present invention is applied.

【図１４】従来の文字認識装置の構成例を示すブロック
図である。FIG. 14 is a block diagram illustrating a configuration example of a conventional character recognition device.

[Explanation of symbols]

１データ入力手段２，２’ 認識手段３認識辞書４，４’ 候補修正手段５，５’ 修正用テーブル５０，５０’ 組５１誤読カテゴリ５２正解カテゴリ５３距離差５４認識用特徴６結果訂正手段７表示手段８訂正情報入力手段９’ テーブル更新手段 DESCRIPTION OF SYMBOLS 1 Data input means 2, 2 'Recognition means 3 Recognition dictionary 4, 4' Candidate correction means 5, 5 'Correction table 50, 50' Set 51 Misread category 52 Correct answer category 53 Distance difference 54 Recognition feature 6 Result correction means 7 Display means 8 Correction information input means 9 'Table updating means

Claims

(57) [Claims]

1. A feature for recognition is extracted from data to be recognized,
Recognition for obtaining a plurality of upper-ranked recognition candidate categories having a smaller distance value, which is a difference between the recognition feature and the standard feature, compared with the standard feature for each category registered in advance in the recognition dictionary, and the distance values thereof Means, a correction table storing a set of a misread category, a correct answer category, and correction information; and a set including a misread category that matches the first-ranked recognition candidate category in the recognition candidate category group obtained by the recognition means. Is stored in the correction table, the candidate correction means for reducing the distance value of the recognition candidate category that matches the correct answer category in the set by a value corresponding to the correction information in the set; Output the first-ranked recognition candidate category in the recognition-candidate category group after processing by the means as a recognition result, and follow the correction information input from the correction information input means. And a table updating means for adding a set of a misread category, a correct answer category, and correction information to the correction table at the time of correction by the result correction means. Method for correcting candidate recognition devices.

2. The method according to claim 1, wherein when the recognition candidate category matching the correct answer category does not exist in the recognition candidate category group obtained by the recognition means, the predetermined distance value matching the correct answer category is set. 2. The apparatus according to claim 1, further comprising a step of correcting the distance value after adding the given recognition candidate category to the recognition candidate category group.
The candidate correction method for the described recognition device.

3. The method according to claim 2, wherein the table updating means uses a distance difference between the distance value of the misread category and the distance value of the correct category as the correction information.
The candidate correction method for the described recognition device.

4. When the distance value before correction is D, the weighting factor is w, and the distance difference as correction information is s, the candidate correction means calculates the corrected distance value D ′ by D ′ = D The candidate correction method for a recognition device according to claim 3, wherein the candidate correction method is obtained by −w · s.

5. The table updating means uses, as the correction information, a distance difference between a distance value of a misread category and a distance value of a correct category, and a recognition feature extracted from recognition target data. 3. The candidate correction method for a recognition device according to claim 2, wherein:

6. The candidate correcting means recognizes a distance value before correction as D, a weighting coefficient as w, a distance difference in correction information as s, a recognition feature in the correction information and a current recognition target data. Assuming that the distance value to the feature for use is d and the reference distance value which is a constant for normalizing the distance value d is a, the corrected distance value D ′ is represented by D ′ = D−w · s · a / d 6. The method according to claim 5, wherein the candidate is corrected.

7. The method according to claim 1, wherein the first correction candidate category in the group of recognition candidate categories determined by the recognition means is changed by a candidate correction using the correction table. 7. The method according to claim 1, further comprising the step of manually correcting the candidate using the rank candidate category.