JPH04104384A

JPH04104384A - Character recognizing device

Info

Publication number: JPH04104384A
Application number: JP2221023A
Authority: JP
Inventors: Hiroaki Ikeda; 裕章池田
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1990-08-24
Filing date: 1990-08-24
Publication date: 1992-04-06

Abstract

PURPOSE:To automatically correct erroneous recognition between analogous characters by specifying the position and dimension of a character to be recognized from among picture data and comparing these information with prescribed reference information. CONSTITUTION:After a character original 1 s read by a photoelectric conversion device 2, it is amplified by an amplifier 4, converted into binary digital data by a binarization circuit 5, and taken out the dimensional and position information for each character with the aid of a character segmenting part 6 to perform normalization and characteristic extraction of character. Then, a character is recognized by a dictionary 53 for identification stored in a ROM 9. Next, the decision is made and the result of the identification is corrected by a decision reference information storage part 64 of a RAM 10. The result of correction displayed on a display part 14 is corrected by hand, and the judgement of the correction is reflected on the decision reference information and utilized to the processing after that. Thus, the erroneous recognition of character picture after normalization can be automatically corrected.

Description

【発明の詳細な説明】［産業上の利用分野］本発明は文字認識装置に関し、特に、例えば、誤認識さ
れた文字を自動修正する文字認識装置に関するものであ
る。DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to a character recognition device, and particularly to a character recognition device that automatically corrects, for example, erroneously recognized characters.

［従来の技術］従来の一般的な文字認識装置における文字認識処理を示
すフローチャートを第４図に示す。このような文字認識
装置では、イメージスキャナなどの光電変換装置を利用
し原稿を読み取り、二値画像データに変換する（Ｓ４１
）。続いて、ステップＳ４２で二値画像に変換された入
力画像から１文字分の文字画像を切り出す。この切り出
しは、まず行方向の画素分布より行の抽出をし、次に行
に垂直な方向の画素分布を調べることで行なわれる。次
に、切り出された文字画像の大きさの変動を吸収するた
め、ステップＳ４６で正規化を行なう。これに続いて、
ステップＳ４７で正規化された文字画像の特徴抽出が行
なわれる。このようにして特徴抽出が行われた正規化画
像は、ステップＳ４８で、予め用意されている識別用辞
書５３を参照しつつ類似度の計算を行ない、最も類似度
の大きい文字を認識結果として選択し、最後にステップ
Ｓ５１で認識結果を表示する。[Prior Art] FIG. 4 is a flowchart showing character recognition processing in a conventional general character recognition device. Such a character recognition device uses a photoelectric conversion device such as an image scanner to read a document and convert it into binary image data (S41
). Subsequently, in step S42, a character image for one character is cut out from the input image converted into a binary image. This cutting is performed by first extracting a row from the pixel distribution in the row direction, and then examining the pixel distribution in the direction perpendicular to the row. Next, in order to absorb variations in the size of the cut out character images, normalization is performed in step S46. Following this,
Feature extraction of the normalized character image is performed in step S47. For the normalized image from which features have been extracted in this way, in step S48, the degree of similarity is calculated while referring to the identification dictionary 53 prepared in advance, and the character with the highest degree of similarity is selected as the recognition result. Finally, the recognition result is displayed in step S51.

［発明が解決しようとする課題］しかしながら上記従来例では、正規化により、例えば、
第５図に示すような°“　°°（句読点）と０　　（オ
ー）、また°“、　　（コンマ）と°°゛°。[Problem to be solved by the invention] However, in the above conventional example, due to normalization, for example,
°“ °° (punctuation mark) and 0 (o), as well as °“, (comma) and °°゛°, as shown in Figure 5.

（アポストロフィ）など正規化後の文字画像がきわめて
類似する文字（記号を含む）が結果として生じるため、
そのような類似文字どうしの誤認識が生じてしまうとい
う欠点があった。(apostrophe) and other characters (including symbols) whose character images are extremely similar after normalization are generated as a result.
There is a drawback that such similar characters may be misrecognized.

本発明は上記従来例に鑑みてなされたもので、類似した
文字とおしの誤認識を自動的に修正できる文字認識装置
を提供することを目的とする。The present invention has been made in view of the above conventional example, and an object of the present invention is to provide a character recognition device that can automatically correct misrecognition of similar characters.

［課題を解決するための手段］上記目的を達成するために本発明の文字認識装置は以下
の様な出力からなる。即ち、文字が描かれた原稿を入力し、前記文字を画像データと
して読み取り、前記画像データを文字として認識する文
字認識装置であって、前記画像データの中から文字の位
置と大きさを特定する特定手段と、前記特定手段から出
力される装置と大きさ情報を、所定の基準情報と比較す
る比較手段と、前記比較手段からの比較結果に基づいて
、前記画像データの中から認識された文字を修正する修
正手段と、前記所定の基準情報を更新する更新手段とを
有することを特徴とする文字認識装置を備える。[Means for Solving the Problems] In order to achieve the above object, the character recognition device of the present invention has the following output. That is, a character recognition device inputs a document with characters drawn on it, reads the characters as image data, recognizes the image data as characters, and identifies the position and size of the characters from the image data. a character recognized from the image data based on a comparison result from the comparison means; A character recognition device characterized in that it has a modification means for modifying the predetermined reference information, and an updating means for updating the predetermined reference information.

［作用］以上の出力により本発明は、画像データの中から認識さ
れる文字の位置と大きさを特定し、それらの情報を所定
の基準情報と比較することにより、文字を特定し修正す
るよう動作する。[Operation] With the above output, the present invention specifies the position and size of the character recognized from the image data, and compares this information with predetermined reference information to identify and correct the character. Operate.

［実施例］以下添付図面を参照して本発明の好適な実施例を詳細に
説明する。第１図は本発明の代表的な実施例である誤認
識文字の自動修正が可能な文字認識装置の出力を表わす
ブロック図である。第１図において、文字認識装置は画
像読取部と文字検出・修正部から成り立っている。さら
に、画像読取部はイメージスキャナ等の光電変換装置２
、スキャン制御器３及びアンプ４から出力されている。[Embodiments] Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings. FIG. 1 is a block diagram showing the output of a character recognition device capable of automatically correcting erroneously recognized characters, which is a typical embodiment of the present invention. In FIG. 1, the character recognition device consists of an image reading section and a character detection/correction section. Furthermore, the image reading section includes a photoelectric conversion device 2 such as an image scanner.
, the scan controller 3 and the amplifier 4.

また、文字検出・修正部は、二値化回路５、文字切り出
し部６、正規化部７、ＣＰＵ８、ＲＯＭ９、ＲＡＭｌ０
１入出力制御部１１、特徴抽出部１２、手動入力部１３
、表示部１４及びＣＰＵバス１５から出力されている。Further, the character detection/correction section includes a binarization circuit 5, a character extraction section 6, a normalization section 7, a CPU 8, a ROM 9, and a RAM 10.
1 input/output control section 11, feature extraction section 12, manual input section 13
, are output from the display section 14 and the CPU bus 15.

ここで、ＲＯＭ９には、識別用辞書５３の他に、基準文
字サイズ計算、大きさ・位置情報取り出し、識別計算、
修正判定及び自動修正など後述の各処理を行う処理プロ
グラムが収容されている。Here, in addition to the identification dictionary 53, the ROM 9 includes standard character size calculation, size/position information retrieval, identification calculation,
A processing program that performs various processes such as correction determination and automatic correction, which will be described later, is stored therein.

このような文字認識装置において、文字原稿１はイメー
ジスキャナ等の光電変換装置２で読み取られた後、その
出力がアンプ４で増幅され、二値化回路５によって、ア
ナログ信号から二値のデジタルデータに変換される。次
に、文字切り出し部６により１文字毎に文字が切り出さ
れ、後に詳述する文字の大きさ・位置情報を取り出した
後、正規化、文字の特徴抽出を行う。続いて、ＲＯＭ９
に格納された識別用辞書を参照しながら正規化された文
字に最も類似する文字を選択することにより、文字が認
識される。In such a character recognition device, a character document 1 is read by a photoelectric conversion device 2 such as an image scanner, the output thereof is amplified by an amplifier 4, and a binarization circuit 5 converts an analog signal into binary digital data. is converted to Next, each character is cut out by the character cutting unit 6, and after extracting character size and position information, which will be described in detail later, normalization and character feature extraction are performed. Next, ROM9
The character is recognized by selecting the character most similar to the normalized character while referring to the identification dictionary stored in the .

次に、第２図に示すフローチャートを用いながら、本実
施例の文字認識及び誤認識文字の自動修正処理について
説明する。ただし、第２図に示すフローチャートにおい
て、従来例と同じ処理ステップは同じ工程番号を付し、
かつ従来技術によるものとして説明を省略する。Next, character recognition and automatic correction processing for misrecognized characters in this embodiment will be explained using the flowchart shown in FIG. However, in the flowchart shown in Figure 2, the same processing steps as in the conventional example are given the same process numbers,
Further, the explanation will be omitted as it is based on the prior art.

まず、ステップ８１０〜１４において、イメージスキャ
ナ等で読み込まれ、アンプにより増幅され、二値化回路
５で二値化された画像データが、文字切り出し部６で１
文字毎に切り出された後、ステップＳ１８で、基準文字
サイズ計算を行う。First, in steps 810 to 14, image data read by an image scanner or the like, amplified by an amplifier, and binarized by the binarization circuit 5 is converted into a digitized image by the character cutting unit 6.
After each character is cut out, standard character size calculation is performed in step S18.

基準文字サイズ計算とは、例えば、入力され二値化され
た画像データに含まれる文字１行中で文字高が最大とな
る文字を選ぶことである。続いて、ステップＳ２０でそ
の文字の文字高を基準値（Ｈ）とし、１行毎にＨの値を
求め、認識対象の各文字画像の文字高をｈとし、ｈ／Ｈ
の値を計算する。この結果はＲＡＭ１Ｏの大きさ情報格
納部６０に格納する。さらにステップＳ２２で、第３図
に示すように文字切り出し部６で切り出された切り出し
枠７０の領域データと各文字画像の文字高を用い、上部
から文字画像上部までの距離（１）と、切り出し枠下部
から文字画像下部までの距離（ｂ）を求め、ｔ／Ｈ及び
ｂ／Ｈの値を計算し、その結果をＲＡＭｌ０の位置情報
格納部６２に格納する。Calculating the standard character size means, for example, selecting a character with the maximum character height in one line of characters included in the input and binarized image data. Next, in step S20, the character height of the character is set as a reference value (H), the value of H is calculated for each line, the character height of each character image to be recognized is set to h, and h/H is calculated.
Calculate the value of . This result is stored in the size information storage section 60 of the RAM 1O. Further, in step S22, as shown in FIG. 3, the distance (1) from the top of the character image to the top of the character image is calculated using the area data of the clipping frame 70 clipped by the character clipping unit 6 and the character height of each character image. The distance (b) from the bottom of the frame to the bottom of the character image is determined, the values of t/H and b/H are calculated, and the results are stored in the position information storage section 62 of RAM10.

次に、正規化及び特徴抽出がなされた画像データに対し
て識別用辞書５３を用いて、最も類似する文字を識別し
た後、ステップＳ３０で、その文字が修正を必要とする
かどうかを判定する。ここで、修正判定には識別用辞書
５３から選択された最も類似する文字、大きさ情報格納
部６０に格納されている大きさ情報、位置情報格納部６
２に格納されている位置情報及びＲＡＭｌ０の判定基準
情報格納部６４に格納されている判定基準情報が用いら
れ、次のような判定を行う。Next, after identifying the most similar character using the identification dictionary 53 for the normalized and feature-extracted image data, it is determined in step S30 whether the character requires modification. . Here, the modification determination includes the most similar character selected from the identification dictionary 53, the size information stored in the size information storage section 60, and the position information storage section 6.
The position information stored in 2 and the determination criterion information stored in the determination criterion information storage section 64 of RAM10 are used to perform the following determination.

例えば、識別用辞書５３から選択された類似度が最も大
きい文字（以下第１候補とする）が、（１）その大きさ
だけが異なる類似文字がある場合（例：°“や°°と°
°や°゛）、第１候補文字の大きさに関する判定基準情報である閾値
Ｕと、大きさ情報格納部６０に格納したｈ／Ｈの値とを
比較し、ｈ／Ｈ＜　　　Ｕならば小文字、Ｕ　≦　ｈ／Ｈならば、大文字と判定する。For example, if the character with the highest degree of similarity selected from the identification dictionary 53 (hereinafter referred to as the first candidate) has (1) similar characters that differ only in size (e.g.
° or °゛), the threshold value U, which is the criterion information regarding the size of the first candidate character, is compared with the value of h/H stored in the size information storage unit 60, and if h/H<U, the character is lowercase. , if U≦h/H, it is determined to be an uppercase letter.

そして、引き続くステップＳ３２において、もし第１候
補が大文字で、判定結果が小文字となった場合、識別結
果を小文字に修正する。その後、その修正された認識結
果を入出力制御部１１を経て表示部１４に出力する。Then, in the subsequent step S32, if the first candidate is an uppercase letter and the determination result is a lowercase letter, the identification result is corrected to a lowercase letter. Thereafter, the corrected recognition result is output to the display section 14 via the input/output control section 11.

（２）その位置だけが異なる類似文字がある場合（（列
　：””’（アポストロフィー）と　“’、”（コンマ
）　）　、位置情報格納部６２に格納したｔ／Ｈ及びｂ
／Ｈの値よりｐ＝　（ｔ／Ｈ）−（ｂ／Ｈ）を計算し、位置に関する判定基準情報である基準値Ｐ　
（Ｐ＞０）及びＱ　（Ｑ＜Ｏ）と比較しｐ＞Ｐなら文字切り出し枠７０の下部に、ｐ＜Ｑなら文字切り出し枠７０の上部に、そして、Ｑ≦ｐ≦Ｐなら文字切り出し枠７０の中部に文字が存在すると判定
する。(2) If there are similar characters that differ only in their positions ((column: “”’ (apostrophe) and “’,” (comma)), t/H and b stored in the position information storage unit 62
From the value of /H, p = (t/H) - (b/H) is calculated, and the reference value P, which is the determination reference information regarding the position, is calculated.
(P>0) and Q (Q<O), if p>P, it will be placed at the bottom of the character cutting frame 70, if p<Q, it will be placed at the top of the character cutting frame 70, and if Q≦p≦P, it will be placed in the character cutting frame 70. It is determined that a character exists in the middle part of 70.

そして、引き続（ステップＳ３２において、もし第１候
補が下部文字で、判定結果が下部文字以外になった場合
、識別結果を判定結果で修正する。その後、その修正さ
れた認識結果を入出力制御部１１を経て表示部１４に出
力する。Then, in step S32, if the first candidate is a lower character and the determination result is other than the lower character, the identification result is corrected by the determination result.Then, the revised recognition result is used for input/output control. It is output to the display section 14 via the section 11.

さらに、表示部１４に修正された認識結果を表示後、ス
テップ８３６において、ステップ３３０〜Ｓ３２でなさ
れた修正判定に基づ（自動修正後の認識結果の誤りを手
動で修正する。このとき、利用者は表示部１４に表示さ
れた認識結果を見ながら、手動入力部１３からの人力に
より、小文字と大文字の修正や、位置の違いによる類似
文字どうしの修正などを行う。このようにして手動修正
で修正された修正結果は再び表示部１４に表示される。Furthermore, after displaying the corrected recognition result on the display unit 14, in step 836, errors in the recognition result after automatic correction are manually corrected based on the correction determinations made in steps 330 to S32. While looking at the recognition results displayed on the display unit 14, the operator uses manual input from the manual input unit 13 to correct lowercase and uppercase letters, correct similar characters due to differences in position, etc. Manual corrections are made in this way. The modified results are displayed on the display section 14 again.

ここで手動でなされた小文字と大文字の修正や、位置の
違いによる類似文字どうしの修正の判断は、判定基準情
報の値であるＵ、Ｐ、Ｑなどに反映され以後の処理に利
用される。The manual correction of lowercase and uppercase letters and the correction of similar characters due to differences in position are reflected in the values of the criterion information, such as U, P, and Q, and are used in subsequent processing.

従って本実施例に従えば、仮名や英字のように文字の大
きさ、あるいは、位置のみが異なるような類似文字ばか
りではなく、°“、゛と°“　°゛などのような位置も
大きさも異なるが正規化後の文字画像が類似している文
字についても、誤認識を自動的に修正することができる
。Therefore, according to this embodiment, not only similar characters that differ only in character size or position, such as kana and alphabetic characters, but also characters that differ in position and size, such as °", ゛ and °" °゛, can be used. Misrecognition can also be automatically corrected for characters that are different but have similar character images after normalization.

また、本実施例においては、誤認識を修正するための文
字情報として、文字の高さ方向の大きさや位置情報を用
いた場合について説明したが、本発明はこれに限定され
るものではない。例えば、文字の高さ方向の情報に加え
、文字幅の情報を用いても、本発明を適用することがで
きる。この場合、本実施例の基準文字サイズ計算におい
て、１行中で文字幅が最大となる文字を基準値（Ｗ）と
して選択し、“−°゛と“−゛などの文字高だけでは判
別しきれない文字について、その文字幅（ｗ）に対する
ｗ　／　Ｗの値と判定基準情報の閾値により修正判定を
行なうことができる。さらに例えば、文字高（ｈ）と文
字幅（ｗ）の比や文字外接矩形面積ｈＸｗを基準にする
ことによっても本発明を適用することが可能である。こ
のことにより、半角文字と全角文字などの文字幅の違い
によるものについても自動修正が可能となる。Further, in this embodiment, a case has been described in which the size and position information in the height direction of a character are used as character information for correcting misrecognition, but the present invention is not limited to this. For example, the present invention can be applied using information on character width in addition to information on character height. In this case, in the standard character size calculation of this example, the character with the maximum character width in one line is selected as the standard value (W), and it is not possible to distinguish only by the character height such as "-°゛" and "-゛". For characters that cannot be filled out, correction determination can be made based on the value of w/W for the character width (w) and the threshold value of the determination criterion information. Further, for example, the present invention can also be applied by using the ratio of character height (h) to character width (w) or the area of a character circumscribed rectangle hXw as a reference. This makes it possible to automatically correct characters due to differences in character width, such as half-width characters and full-width characters.

さらに本実施例においては、基準文字サイズを各行の文
字高が最大ものと定義して説明したが、本発明はこれに
限定されるものではない。例えば、基準文字サイズを得
るための文字をマウスなどの手動入力部からの外部入力
により指定するようにすることもできる。Further, in this embodiment, the standard character size is defined as the maximum character height in each line, but the present invention is not limited to this. For example, a character for obtaining a reference character size may be specified by external input from a manual input unit such as a mouse.

［発明の効果］以上説明したように本発明によれば、類似した文字どう
しの誤認識を自動的に修正できる効果がある。また、手
動修正による修正結果が以後の自動文字修正に反映され
、文字自動修正能力が向上するので、手作業であった誤
認識の修正作業を軽減されるという利点も有する。[Effects of the Invention] As explained above, according to the present invention, there is an effect that misrecognition of similar characters can be automatically corrected. Furthermore, the result of manual correction is reflected in subsequent automatic character corrections, improving the ability to automatically correct characters, which also has the advantage of reducing the manual work of correcting erroneous recognition.

[Brief explanation of drawings]

第１図は本発明の代表的な実施例である文字認識装置の
出力を示すブロック図、第２図は文字認識及び誤認識文字の自動修正処理につい
て示すフローチャート、第３図は大きさ情報や位置情報についての説明図、第４図は従来例による文字認識装置の処理フローチャー
ト、そして、第５図は従来例による正規化についての説明図である。図中、１・・・文字原稿、２・・・光電変換装置、３・
・・スキャン制御器、４・・・アンプ、５・・・二値化
回路、６・・・文字切り出し部、７・・・正規化部、８
・・・ＣＰＵ、９・・・ＲＯＭ、１０・・・ＲＡＭ、１
１・・・入出力制御部、１２・・・特徴抽出部、１３・
・・手動入力部、１４・・・判定基準情報、１５・・・
ＣＰＵバス、５３・・・識別用辞書、６０・・・大きさ
情報格納部、６２・・・位置情報格納部、６４・・・判
定基準情報格納部である。Fig. 1 is a block diagram showing the output of a character recognition device that is a typical embodiment of the present invention, Fig. 2 is a flowchart showing character recognition and automatic correction processing of misrecognized characters, and Fig. 3 is a block diagram showing the output of a character recognition device that is a typical embodiment of the present invention. FIG. 4 is a processing flowchart of a conventional character recognition device; FIG. 5 is an explanatory diagram of normalization according to a conventional example. In the figure, 1... text manuscript, 2... photoelectric conversion device, 3...
...Scan controller, 4...Amplifier, 5...Binarization circuit, 6...Character cutting section, 7...Normalization section, 8
...CPU, 9...ROM, 10...RAM, 1
1... Input/output control unit, 12... Feature extraction unit, 13.
...Manual input section, 14...Judgment criteria information, 15...
CPU bus, 53...Identification dictionary, 60...Size information storage section, 62...Position information storage section, 64...Judgment criterion information storage section.

Claims

[Scope of Claims] A character recognition device that inputs a document on which characters are drawn, reads the characters as image data, and recognizes the image data as characters, the device comprising: identifying means for identifying the position and size information outputted from the identifying means, comparing means for comparing the position and size information outputted from the identifying means with predetermined reference information, and based on the comparison result from the comparing means, from among the image data. A character recognition device comprising: a modification means for modifying a recognized character; and an updating means for updating the predetermined reference information.