JPH1069522A

JPH1069522A - Character recognition device, character recognition method and recording medium recording character recognition method

Info

Publication number: JPH1069522A
Application number: JP8227066A
Authority: JP
Inventors: Minoru Takakura; 穂高倉; Ichiro Nakao; 一郎中尾; Mariko Takenouchi; 磨理子竹之内
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1996-08-28
Filing date: 1996-08-28
Publication date: 1998-03-10

Abstract

PROBLEM TO BE SOLVED: To judge the state of a character string image like thinning or inter-character contact and the difference of the character kind of English sentences/Japanese sentences, to accurately segment character images for respective character strings and to improve a character recognition rate. SOLUTION: A character string state estimation part 104 estimates whether or not the character string image is an abnormal thinned image or the like based on the width of the circumscribing rectangle of a character element image or the like and a first character kind estimation part 105 estimates whether the character string is the English sentence or the Japanese sentence based on the distribution of the interval of the circumscribing rectangle. An English sentence character segmentation part 106 segments the character image of the English sentence character string, a Japanese sentence character segmentation part 108 segments the character element image corresponding to a reference character width decided by a Japanese sentence reference character width decision part 107 and a character recognition part 109 recognizes the character image. A first character segmentation error judgement part 110 judges the correct/error of character segmentation based on the evaluated value of a recognized result and a second character segmentation error judgement part 112 judges it based on the character kind of the character by the estimation of the English sentence or the Japanese sentence by a second character kind estimation part 111. The character recognition part 109 recognizes a segmentation error character again.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、文書画像から文字
画像を切り出して文字を認識する文字認識装置に関し、
もっと詳しくは、名刺や広告文書に見られるような文字
列ごとに和文、英文の混在した文書画像から正確に文字
画像を切り出して文字を認識する文字認識装置に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition apparatus for recognizing characters by cutting out a character image from a document image.
More specifically, the present invention relates to a character recognition device that accurately cuts out a character image from a document image in which Japanese and English texts are mixed for each character string as seen in business cards and advertisement documents and recognizes characters.

【０００２】[0002]

【従来の技術】近年、文字認識の適用分野が拡大してき
ており、従来の専用帳票や固定フォーマット文書以外
の、一般文書から文字認識のための文字切出しが必要と
なっている。以下、図面を参照しながら、従来の文字認
識装置の一例である特開昭６３−１６３９１号公報記載
の技術を説明する。2. Description of the Related Art In recent years, the field of application of character recognition has been expanding, and it has become necessary to extract characters for character recognition from general documents other than conventional dedicated forms and fixed format documents. Hereinafter, a technique described in Japanese Patent Application Laid-Open No. 63-16391, which is an example of a conventional character recognition device, will be described with reference to the drawings.

【０００３】図１５は、この文字認識装置の構成図を示
す。文字列検出部１５０１は、入力された文書画像から
認識対象となる文字列を検出する。文字要素検出部１５
０２は、検出された文字列画像から文字要素を検出す
る。基準文字幅決定部１５０３は、各文字要素の幅ある
いは高さから基準文字幅を決定する。文字切り出し部１
５０４は、基準文字幅に基づき、文字の切り出しを行
う。文字認識部１５０５は、切り出された文字の認識を
行う。FIG. 15 shows a configuration diagram of this character recognition device. The character string detection unit 1501 detects a character string to be recognized from the input document image. Character element detector 15
02 detects a character element from the detected character string image. The reference character width determination unit 1503 determines a reference character width from the width or height of each character element. Character cutout part 1
504 cuts out characters based on the reference character width. The character recognizing unit 1505 recognizes the cut out character.

【０００４】この装置の具体的な動作について、図１６
を用いて説明する。文字列検出部１５０１は、入力画像
から、図１６（ａ）に示すような認識対象となる文字列
１６０１の存在領域を検出する。ここで文字列１６０１
の高さＹ、幅Ｗが求められる。文字要素検出部１５０２
は、文字列検出部１５０１で検出された各文字列１６０
１等に対し、図１６（ｂ）に示すように、文字列方向に
垂直な方向へ文字部を射影して、射影が連続する範囲ｗ
₁，ｗ₂，…ｗ₁₁と射影が途切れる範囲ｄ₁，ｄ₂，…
ｄ₁₀とを求める。射影が連続する範囲を１つの文字要素
とし、図１６（ｃ）に示すように、個々の文字要素Ｐｓ
₁，Ｐｓ₂，…Ｐｓ₁₁を検出する。FIG. 16 shows the specific operation of this device.
This will be described with reference to FIG. The character string detection unit 1501 detects an existing area of the character string 1601 to be recognized as shown in FIG. 16A from the input image. Here, the character string 1601
Height Y and width W are required. Character element detection unit 1502
Are the character strings 160 detected by the character string detection unit 1501.
For example, as shown in FIG. 16 (b), the character portion is projected in a direction perpendicular to the character string direction, and a range w where projection is continuous is performed.
_1, w _2, ... range d _1, d ₂ to w ₁₁ and the projection is interrupted, ...
seek and d _10. The range in which the projections are continuous is defined as one character element, and as shown in FIG.
_1, Ps _2, to detect the ... Ps _11.

【０００５】次に、基準文字幅決定部１５０３は、文字
列１５０１の文字列方向に垂直方向の長さ（横書きの場
合には高さ）Ｙと文字列１５０１中の各文字要素Ｐｓ_i
の外接矩形の文字列方向の長さ（横書きの場合には幅）
ｗ_iの最大値とを比較し、より大きい方を基準文字幅Ａ
とする。図１６（ｂ）から幅ｗ８が基準文字幅Ａとなる
ことがわかる。[0005] Next, the reference character width determination unit 1503 determines the length Y (height in the case of horizontal writing) Y in the character string direction of the character string 1501 and each character element Ps _i in the character string 1501.
Length of the circumscribed rectangle in the character string direction (width for horizontal writing)
w _i is compared with the maximum value, and the larger value is used as the reference character width A.
And FIG. 16B shows that the width w8 is the reference character width A.

【０００６】文字切り出し部１５０４は、隣接するｎ個
の文字要素を文字列方向に結合した場合の矩形長ＣｗをＣｗ＝｜Σｗｉ＋Σ（ａ・ｄｉ）｜で定義する。ここ
で、ａは定数である。Ｃｗ≦Ａの場合に、隣接するｎ個
の文字要素を結合して１つの文字とする。このようにし
て図１６（ｄ）に示すように文字Ｐ１，Ｐ２…Ｐ５が検
出される。The character cutout unit 1504 defines a rectangular length Cw when n adjacent character elements are combined in the character string direction as Cw = | {wi +} (a · di) |. Here, a is a constant. When Cw ≦ A, adjacent n character elements are combined into one character. Thus, the characters P1, P2,..., P5 are detected as shown in FIG.

【０００７】文字認識部１５０５は、文字切出し部１５
０４で検出された個々の文字Ｐ１，Ｐ２…Ｐ５の認識を
行う。The character recognizing unit 1505 includes a character extracting unit 15
The individual characters P1, P2... P5 detected at step 04 are recognized.

【０００８】[0008]

【発明が解決しようとする課題】しかしながら、上記の
ような文字認識装置では、文字同士の接触があるときに
は、正しい基準文字幅を得ることができずに文字切り出
し部１５０４が一文字ずつ文字を切り出せないので正し
い文字認識ができない。同様に英文文字列と和文文字列
との区別ができないため、文字により文字幅の変動の大
きな英文文字列の場合に、十分な文字切り出し精度が得
られない。However, in the character recognition apparatus as described above, when there is contact between characters, the character reference unit 1504 cannot obtain a correct reference character width and cannot extract characters one by one. Therefore, correct character recognition cannot be performed. Similarly, it is not possible to distinguish between an English character string and a Japanese character string. Therefore, in the case of an English character string in which the character width varies greatly depending on the character, sufficient character extraction accuracy cannot be obtained.

【０００９】更に、文書の印字状態や画像読み取り時の
条件により、画像のかすれが起きることがよくあるが。
そのような場合に、上記と同様、推定した基準文字幅が
実際の文字幅と大きくずれてしまうので、正しい文字切
り出しができない。本発明は、文書画像が掠れや文字同
士の接触のような異常な状態にある場合にも、また、和
文・英文の文字列が混在している場合にも、正しい文字
画像の切り出しを行い、文字認識の精度が向上した文字
認識装置及びその方法を提供することを目的とする。Furthermore, the image often fades depending on the printing condition of the document and the conditions at the time of reading the image.
In such a case, the estimated reference character width greatly deviates from the actual character width in the same manner as described above, so that correct character extraction cannot be performed. The present invention, even when the document image is in an abnormal state such as blurring or contact between characters, and also when a mixture of Japanese and English character strings, cut out the correct character image, An object of the present invention is to provide a character recognition device and a method thereof with improved character recognition accuracy.

【００１０】[0010]

【課題を解決するための手段】上記課題を解決するため
に、本発明に係る文字認識装置は、文書画像から文字列
画像とその外接矩形の位置とを抽出する文字列画像抽出
手段と、上記文字列画像から文字要素画像とその外接矩
形の位置とを抽出する文字要素画像抽出手段と、上記文
字要素画像の外接矩形の文字列画像に平行な第１長さ
（即ち横書き文字列の幅）と、上記文字列画像の文字列
画像に直角方向の第２長さ（即ち横書き文字列の高さ）
に基づいて定められるしきい値で上記文字列画像の状態
が正常か異常かを分類する文字列状態分類手段と、前記
文字列状態分類手段で分類した上記文字列画像の状態に
よって文字列画像から文字画像を切り出す文字列画像に
平行な基準長さ（基準文字幅）を決定する基準文字幅決
定手段と、前記基準文字幅決定手段で決定された基準長
さに従い文字要素画像を統合、分離して文字画像を切り
出す文字画像切出手段と、切り出された文字画像の特徴
と認識辞書の標準特徴とを照合し、文字画像を相当する
文字コードとして認識する文字認識手段とを備えること
としている。これによって、字要素画像の切り出しため
の基準文字幅が正常な画像と異常な画像とで区別される
ので、文字要素画像の切り出し精度が向上するので、文
字認識の認識効率が高まる。In order to solve the above problems, a character recognition apparatus according to the present invention comprises: a character string image extracting means for extracting a character string image and a position of a circumscribed rectangle from a document image; Character element image extraction means for extracting a character element image and a position of a circumscribed rectangle from the character string image, and a first length parallel to the character string image of the circumscribed rectangle of the character element image (ie, a width of a horizontal writing character string) And a second length in a direction perpendicular to the character string image of the character string image (that is, the height of the horizontal writing character string)
A character string state classifying means for classifying whether the state of the character string image is normal or abnormal at a threshold value determined based on the character string image according to the state of the character string image classified by the character string state classifying means Reference character width determining means for determining a reference length (reference character width) parallel to the character string image from which the character image is cut out, and character element images are integrated and separated according to the reference length determined by the reference character width determining means. And a character recognizing means for recognizing a character image as a corresponding character code by collating a characteristic of the character image with a standard characteristic of a recognition dictionary. Thus, the reference character width for cutting out the character element image is distinguished between a normal image and an abnormal image, so that the accuracy of cutting out the character element image is improved, and the recognition efficiency of character recognition is increased.

【００１１】[0011]

【発明の実施の形態】以下、本発明に係る文字認識装置
を図面を用いて説明する。図１は、本発明に係る文字認
識装置の一実施の形態の構成図である。この文字認識装
置は、画像入力部１０１と、文字列検出部１０２と、文
字要素検出部１０３と、文字列状態推定部１０４と、第
１字種推定部１０５と、英文文字切り出し部１０６と、
和文基準文字幅決定部１０７と、和文文字切り出し部１
０８と、文字認識部１０９と、第１文字切り出し誤り判
定部１１０と、第２字種推定部１１１と、第２文字切り
出し誤り判定部１１２と、基準文字幅修正部１１３と、
修正文字切り出し部１１４と、認識結果出力部１１５と
を備えている。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, a character recognition device according to the present invention will be described with reference to the drawings. FIG. 1 is a configuration diagram of an embodiment of a character recognition device according to the present invention. The character recognition device includes an image input unit 101, a character string detection unit 102, a character element detection unit 103, a character string state estimation unit 104, a first character type estimation unit 105, an English character extraction unit 106,
Japanese character width determining unit 107 and Japanese character extracting unit 1
08, a character recognition unit 109, a first character segmentation error determination unit 110, a second character type estimation unit 111, a second character segmentation error determination unit 112, a reference character width correction unit 113,
A modified character cutout unit 114 and a recognition result output unit 115 are provided.

【００１２】画像入力部１０１は、スキャナ等からな
り、オペレータにより用意された文字を含む原稿をＬ／
Ｅ（Light/Electric）変換等して、２値データで構成さ
れる画素（文字画像部分に対応する黒画素と背景部分に
対応する白画素）の集合である文書画像データを文字列
検出部１０２に通知する。文字列検出部１０２は、画像
入力部１０１から文書画像データの通知を受けると、そ
の文書画像データを記憶するとともに、文書画像データ
を走査して黒画素の存在領域から文字列画像を検出し
て、その第１列目の文字列画像の座標を求める。また、
文字認識部１０９から文字列画像の検出の指示を受ける
と、第２列目以降の文字列画像の座標を求める。求めた
文字列座標を文字要素検出部１０３と文字列状態推定部
１０４とに通知する。An image input unit 101 is composed of a scanner or the like, and converts a document including characters prepared by an operator into an L / L.
By performing E (Light / Electric) conversion or the like, the document image data, which is a set of pixels (black pixels corresponding to the character image portion and white pixels corresponding to the background portion) formed of binary data, is converted to the character string detection unit 102. Notify. Upon receiving the notification of the document image data from the image input unit 101, the character string detection unit 102 stores the document image data and scans the document image data to detect the character string image from the black pixel existing area. , The coordinates of the character string image in the first column are obtained. Also,
Upon receiving an instruction to detect a character string image from the character recognition unit 109, the coordinates of the character string images in the second and subsequent columns are obtained. The obtained character string coordinates are notified to the character element detection unit 103 and the character string state estimation unit 104.

【００１３】具体的には、図２（ａ）〜（ｄ）に示すよ
うに、文字列画像Ｌ_1,Ｌ_2,Ｌ_3,Ｌ₄を外接する矩形の左
上端点２０１、左下端点２０２等のそれぞれの座標（ｘ
_s，ｙ_s），（ｘ_e，ｙ_e）を求めている。図２（ａ）
〜（ｄ）に付記されている文字列幅Ｗ_iと文字列高さＹ
_iとは、後述する文字列状態推定部１０４で座標
（ｘ_s，ｙ_s），（ｘ_e，ｙ_e）の値から求められる。[0013] Specifically, as shown in FIG. 2 (a) ~ (d) , the rectangle circumscribing a character string image _{_{_{L 1, L 2, L 3}}} , L 4 upper left end point 201, such as a lower left end point 202 Each coordinate (x
_{_s,} y _s), seeking (x _{_e,} y _e). FIG. 2 (a)
To the character string width _Wi and the character string height Y
_i and the coordinate string state estimating unit 104 described later (x _{_s,} y _s), determined from the value of (x _{_e,} y _e).

【００１４】文字要素検出部１０３は、文字列座標（ｘ
_s，ｙ_s），（ｘ_e，ｙ_e）の通知を受けると、文字列
検出部１０２に記憶されている文書画像データの文字列
画像Ｌ_1,Ｌ₂，…を外接する矩形ごとにそれぞれ走査し
て、黒画素の連接する黒画素連結部を文字要素画像とし
て検出する。これによって、文字要素画像のラン情報
（走査線ごとの黒画素の開始位置と終了位置の座標）
と、この文字要素画像の外接矩形の左上端点と右下端点
との座標（ｘ_si，ｙ_si），（ｘ_ei，ｙ_ei）とを求める。
ここで添字ｉは、文字列画像ごとの文字要素画像の先頭
から付した番号である。The character element detecting unit 103 calculates the character string coordinates (x
_s , y _s ) and (x _e , y _e ), the character string images L _1, L ₂ ,... of the document image data stored in the character string detection unit 102 are respectively circumscribed. Scanning is performed to detect a black pixel connection portion where black pixels are connected as a character element image. Thereby, the run information of the character element image (the coordinates of the start position and the end position of the black pixel for each scanning line)
And the coordinates (x _si , y _si ) and (x _ei , y _ei ) of the upper left and lower right points of the circumscribed rectangle of the character element image.
Here, the subscript i is a number assigned from the top of the character element image for each character string image.

【００１５】図２（ａ）に示した文字列画像Ｌ₁から
は、図３（ａ）に示すような文字要素画像Ｓ_1,Ｓ_2,…Ｓ
₅の各ラン情報と、各外接矩形の左上端点３０１、右下
端点３０２等の座標（ｘ_si，ｙ_si），（ｘ_ei，ｙ_ei）と
が求められ、文字列状態推定部１０４に通知される。同
様に図２（ｂ）〜（ｄ）に示した文字列画像Ｌ₂〜Ｌ₄
からは、図３（ｂ）〜（ｄ）にそれぞれ示すような文字
要素画像が求められる。ここで文字要素画像の下方の
（）内に示された数値は、外接矩形の座標（ｘ_si，ｙ
_si），（ｘ_ei，ｙ_ei）から後述する文字列状態推定部１
０４で求められる文字要素画像の幅を示している。[0015] Figure 2 from the character string image L ₁ shown in (a), the 3 character component image S ₁ as shown (a), the S _2, ... S
₅ and the coordinates (x _si , y _si ) and (x _ei , y _ei ) of the upper left corner 301 and the lower right corner 302 of each circumscribed rectangle are obtained, and notified to the character string state estimation unit 104. Is done. Character string image L ₂ ~L ₄ shown similarly in FIG. 2 (b) ~ (d)
, Character element images as shown in FIGS. 3B to 3D are obtained. Here, the numerical values shown in parentheses below the character element image are the coordinates (x _si , y
_si ), (x _ei , y _ei )
04 shows the width of the character element image obtained.

【００１６】文字列状態推定部１０４は、文字列検出部
１０２から各文字列画像の外接矩形の座標（ｘ_s，
ｙ_s），（ｘ_e，ｙ_e）の通知を受け、文字要素検出部
１０３から各文字要素画像のラン情報と外接矩形の座標
（ｘ_si，ｙ_si），（ｘ_ei，ｙ_ei）との通知を受けると、
各文字列画像の高さＹを座標値ｙ_s，ｙ_eから求める。
また、各文字要素画像の幅ｗｉを座標値ｘ_si，ｙ_eiとか
ら求める。The character string state estimating unit 104 receives the coordinates (x _s ,
y _s), (x _e, notified of y _e), run information and bounding rectangles of coordinates of each character element image from the character element detection unit _{_{103 (x si, y si)}} , and (x _{_ei,} y _ei) Is notified,
The height Y of each character string image is determined from the coordinate values y _s and y _e .
Also, determine the width wi of each character element image coordinates x _si, and a y _ei.

【００１７】次に、文字列画像に含まれる文字要素画像
が幅細文字要素であるか幅太文字要素であるかを判定す
るための第１しきい値Ｗ１と第２しきい値Ｗ２とを式
（数１）、（数２）により求める。第１しきい値Ｗ１
は、式（数１）Next, a first threshold value W1 and a second threshold value W2 for determining whether the character element image included in the character string image is a thin character element or a wide character element are set. It is determined by the equations (Equation 1) and (Equation 2). First threshold value W1
Is the formula (Equation 1)

【００１８】[0018]

【数１】で求められる。α１は係数であり、０．１〜０．５の範
囲で選ばれ、本実施の形態ではα１＝０．２とする。式
（数１）では、文字列画像の高さＹが非常に大きいとき
に対応するため、Ｗｍｉｎの値が導入されて、いずれか
小さい方の値が第１しきい値Ｗ１とされる。ここでＷｍ
ｉｎは実験により決めた値で、対象とする画像の性質
や、スキャナの性能により異なるが、ほぼ０．５ｍｍ〜
１ｍｍの範囲が適当であり、本実施の形態では、１ｍｍ
とする。なお、外接矩形の座標値は、画素単位で表示し
ているので、Ｗｍｉｎ＝１６としている。(Equation 1) Is required. α1 is a coefficient, which is selected in the range of 0.1 to 0.5, and α1 = 0.2 in the present embodiment. In the formula (Equation 1), the value of Wmin is introduced and the smaller value is set as the first threshold value W1 in order to cope with the case where the height Y of the character string image is very large. Where Wm
in is a value determined by an experiment, and varies depending on the properties of the target image and the performance of the scanner.
A range of 1 mm is appropriate, and in this embodiment, 1 mm
And Since the coordinate values of the circumscribed rectangle are displayed in pixel units, Wmin = 16.

【００１９】第２しきい値Ｗ２は、式（数２）The second threshold value W2 is given by the following equation (Equation 2).

【００２０】[0020]

【数２】で求められる。ここでα２は係数であり、文字の縦横比
がほぼ１に近い文字の多い和文を基準に１．０〜１．５
程度の範囲で選ばれ、本実施の形態ではα２＝１．２と
する。次に文字列状態推定部１０４は、各文字列画像ご
とに、文字要素画像を外接矩形の幅ｗｉの値によって、
第１しきい値Ｗ１以下の幅細文字要素画像と第２しきい
値Ｗ２以上の幅太文字要素画像と、通常の文字要素画像
とに分類する。分類した幅細文字要素画像の数Ｎ１と幅
太文字要素画像の数Ｎ２とをそれぞれ求め、文字要素画
像の総数Ｎに示す割合をそれぞれ求める。幅細文字要素
画像の割合が予め定めた割合Ｒ１以上のときは、文字列
画像が掠れていると判断し、幅太文字要素画像の割合が
予め定めた割合Ｒ２以上のときは、文字列画像に文字間
接触が多いと判断し、それ以外のときは通常の文字列画
像と判断する。(Equation 2) Is required. Here, α2 is a coefficient, and is 1.0 to 1.5 based on a Japanese sentence having many characters in which the aspect ratio of the characters is almost unity.
Α2 = 1.2 in the present embodiment. Next, the character string state estimating unit 104 divides the character element image by the value of the width wi of the circumscribed rectangle for each character string image.
It is classified into a thin character element image having a first threshold W1 or less, a wide character element image having a second threshold W2 or more, and a normal character element image. The number N1 of the classified narrow-width character element images and the number N2 of the wide-width character element images are respectively obtained, and the ratio indicated by the total number N of the character element images is obtained. When the ratio of the narrow character element images is equal to or more than the predetermined ratio R1, it is determined that the character string image is blurred. When the ratio of the wide character element images is equal to or more than the predetermined ratio R2, the character string image is determined. It is determined that there is much inter-character contact, and otherwise, it is determined to be a normal character string image.

【００２１】ここで、割合Ｒ１、Ｒ２は０．５程度以下
の範囲で選ばれ、本実施の形態ではＲ１＝０．３３、Ｒ
２＝０．１７としている。例えば、文字列画像Ｌ₁（図
２（ａ）、図３（ａ））の場合には、文字列画像の高さ
Ｙ₁＝５０なので、第１しきい値Ｗ１＝１０、第２しき
い値Ｗ２＝６０と計算され、幅細文字要素画像の数Ｎ１
＝０、幅太文字要素画像の数Ｎ２＝１となり通常の文字
列画像と判断される。Here, the ratios R1 and R2 are selected within a range of about 0.5 or less. In the present embodiment, R1 = 0.33 and R1
2 = 0.17. For example, in the case of the character string image L ₁ (FIGS. 2A and 3A), since the height Y ₁ of the character string image is 50, the first threshold value W1 = 10 and the second threshold value The value W2 is calculated as 60, and the number N1 of narrow character element images is calculated.
= 0, the number N2 of wide character element images becomes 1, and it is determined that the image is a normal character string image.

【００２２】文字列画像Ｌ₂（図２（ｂ）、図３
（ｂ））の場合には、文字列画像の高さＹ ₂＝４３なの
で、第１しきい値Ｗ１＝８、第２しきい値Ｗ２＝５１と
計算され、幅細文字要素画像の数Ｎ１＝１３で全文字要
素画像の数の０．５となるので、文字列画像Ｌ₂は掠れ
ていると判断される。文字列画像Ｌ₃（図２（ｃ）、図
３（ｃ））の場合には、文字列画像の高さＹ ₃＝３５な
ので、第１しきい値Ｗ１＝７、第２しきい値Ｗ２＝４２
と計算され、幅太文字要素画像の数Ｎ２＝４で全文字要
素画像の０．２１となるので、文字列画像Ｌ₃は文字間
接触が多いと判断される。Character string image L_Two(FIG. 2B, FIG. 3
In the case of (b)), the height Y of the character string image _Two= 43
And the first threshold value W1 = 8 and the second threshold value W2 = 51.
Calculated and required for all characters with the number of narrow character element images N1 = 13
Since the number of elementary images is 0.5, the character string image L_TwoIs shaved
Is determined to be. Character string image L_Three(FIG. 2 (c), FIG.
3 (c)), the height Y of the character string image _Three= 35
Therefore, the first threshold value W1 = 7 and the second threshold value W2 = 42
Is calculated, and all characters are required when the number of wide character element images is N2 = 4.
Since the original image is 0.21, the character string image L_ThreeIs between letters
It is determined that there is a lot of contact.

【００２３】文字列画像Ｌ₄（図２（ｄ）、図３
（ｄ））の場合は、通常の文字列画像と判断される。文
字列画像が掠れていると判断したとき、又は文字列画像
が文字間接触が多いと判断したときは、和文基準文字幅
決定部１０７にその旨と、文字列画像の高さＹと文字要
素画像の外接矩形の座標とラン情報とを通知する。The character string image L ₄ (FIG. 2D, FIG. 3)
In the case of (d)), it is determined that the image is a normal character string image. If it is determined that the character string image is blurred, or if the character string image is determined to have a lot of character-to-character contact, the Japanese sentence reference character width determination unit 107 notifies the fact, the height Y of the character string image and the character element. The coordinates of the circumscribed rectangle of the image and the run information are notified.

【００２４】文字列画像が通常の文字列画像と判断した
ときは、第１字種推定部１０５に文字列画像の座標（ｘ
_s，ｙ_s），（ｘ_e，ｙ_e）と文字要素画像の座標（ｘ
_si，ｙ_si），（ｘ_ei，ｙ_ei）とラン情報とを通知する。
第１字種推定部１０５は、文字列状態推定部１０４から
通常の文字列画像である旨等の通知を受けると、各文字
要素画像の隣接する外接矩形間の距離Ｄ_i＝Ｘ _S(i+1)−
Ｘ_eiのヒストグラムを作成し、この矩形間の距離Ｄ_iの
短い側から最初に現れるピークの範囲Ｄ_Rを求める。Ｄ
１≦Ｄ_R≦Ｄ２となるとき、外接矩形間の距離Ｄ_iがこ
の範囲Ｄ_Rに属する矩形間の間隔の数Ｎ３を求める。The character string image is determined to be a normal character string image
At this time, the coordinates (x
_s, Y_s), (X_e, Y_e) And the coordinates (x
_si, Y_si), (X_ei, Y_ei) And run information.
The first character type estimating unit 105 outputs the character string state estimating unit 104
When you receive a notification such as a normal character string image,
Distance D between adjacent circumscribed rectangles of the element image_i= X _{S (i + 1)}−
X_eiAnd a distance D between the rectangles_iof
Range D of peak first appearing from short side_RAsk for. D
1 ≦ D_R≤ D2, the distance D between the circumscribed rectangles_iGinger
Range D_RThe number N3 of the intervals between the rectangles belonging to.

【００２５】次に、第３しきい値Ｄを文字列画像の座標
値ｙ_s，ｙ_eの差である高さＹより式（数３）Next, the third threshold value D is calculated from the height Y which is the difference between the coordinate values y _s and y _e of the character string image according to the following equation (Equation 3).

【００２６】[0026]

【数３】を用いて求める。ここで係数α３は０．３３程度以下の
範囲で選ばれ、本実施の形態ではα３＝０．３３とす
る。ピークの範囲の最大値Ｄ２の値がこの第３しきい値
Ｄ以下であり、かつ、この範囲Ｄ_Rに属する外接矩形間
の間隔の数Ｎ３の文字要素画像数Ｎ−１（外接矩形間の
間隔の総数）に対する割合が予め定めた定数Ｒ３以上で
あるか否かを判定する。なお、定数Ｒ３は、０．８程度
以上の範囲で選ばれ、本実施の形態ではＲ３＝０．８と
する。肯定のときは、文字列画像が英文であると判定
し、否定のときには和文であると判定する。英文と判定
したときは、文字要素画像の外接矩形の座標等を英文文
字切り出し部１０６に通知し、和文であると判定したと
きには、通常の文字列画像である旨と、文字要素画像の
座標とラン情報とを和文基準文字幅決定部１０７に通知
する。(Equation 3) Is determined using Here, the coefficient α3 is selected in a range of about 0.33 or less, and in the present embodiment, α3 = 0.33. The value of the maximum value D2 of the peak range is equal to or less than the third threshold value D, and the number N-1 of character element images of the number N3 of the intervals between the circumscribed rectangles belonging to this range D _R (N-1 It is determined whether or not the ratio to the total number of intervals is equal to or greater than a predetermined constant R3. Note that the constant R3 is selected in a range of about 0.8 or more, and R3 = 0.8 in the present embodiment. If affirmative, it is determined that the character string image is in English, and if negative, it is determined that it is in Japanese. When it is determined to be an English sentence, the coordinates of the circumscribed rectangle of the character element image and the like are notified to the English character extraction unit 106, and when it is determined to be a Japanese sentence, the fact that the image is a normal character string image and the coordinates of the character element image The run information is notified to the Japanese-language reference character width determination unit 107.

【００２７】例えば、文字列画像Ｌ₁の場合には、図４
（ａ）に示すヒストグラムが得られ、和文と判定され
る。文字列画像Ｌ４の場合には、文字列画像の高さＹ₄
＝３１（図２（ｄ））なので第３しきい値Ｄ＝１０であ
り、図４（ｂ）に示すようにヒストグラムの第１ピーク
の範囲４≦Ｄ_R≦７に属する隣接文字要素画像の間隔の
数Ｎ３は１５であり、間隔の総数に対する割合は０．９
４となり、定数Ｒ３以上であるので、英文であると判定
される。[0027] For example, in the case of the character string image L ₁ is 4
The histogram shown in (a) is obtained, and is determined to be a Japanese sentence. In the case of the character string image L4, the height Y _{4 of the} character string image
= 31 (FIG. 2 (d)), the third threshold value D = 10, and as shown in FIG. 4 (b), the value of the adjacent character element image belonging to the first peak range 4 ≦ D _R ≦ 7 of the histogram. The number N3 of intervals is 15, and the ratio to the total number of intervals is 0.9.
4, which is equal to or greater than the constant R3, and is therefore determined to be an English sentence.

【００２８】英文文字切出部１０６は、第１字種推定部
１０５から文字要素画像の外接矩形の座標とラン情報と
の通知を受けると、文字列検出部１０２に記憶されてい
る文書画像データを読み出し、アルファベットに特有の
文字画像の切り出しを行い、その文字画像を文字要素画
像の外接矩形の座標とラン情報とともに文字認識部１０
９に通知する。この文字画像の切り出しについての詳細
は、特開昭６２−２１６８０１号公報に記載されてい
る。Upon receiving the notification of the coordinates of the circumscribed rectangle of the character element image and the run information from the first character type estimation unit 105, the English character extraction unit 106 stores the document image data stored in the character string detection unit 102. Is read out, a character image specific to the alphabet is cut out, and the character image is read out along with the coordinates of the circumscribed rectangle of the character element image and the run information.
Notify 9. The details of cutting out the character image are described in JP-A-62-216801.

【００２９】和文基準文字幅決定部１０７は、文字列状
態推定部１０４から文字列画像の高さＹと文字要素画像
の外接矩形の座標とラン情報との通知を受けると、式
（数４）を用いて、When the Japanese reference character width determining unit 107 receives the notification of the height Y of the character string image, the coordinates of the circumscribed rectangle of the character element image, and the run information from the character string state estimating unit 104, the expression (Equation 4) is obtained. Using,

【００３０】[0030]

【数４】文字列画像の高さＹに予め定めた係数αを乗じた値を基
準文字幅Ａ１を計算する。ここでαは、和文文字の場
合、文字の縦横比がほぼ１に近い文字が多いので０．５
〜１．５程度の範囲で選択され、本実施の形態では１と
する。このように掠れた文字要素画像や文字間接触の多
い文字列画像に対しては、個々の文字要素画像の幅をも
とにして基準文字幅を決定しても精度が高くならないの
で、文字列画像の高さＹをもとに基準文字幅Ａ１を決定
する。決定した基準文字幅と文字要素画像の外接矩形の
座標とラン情報とを和文文字切出部１０８に通知する。(Equation 4) The reference character width A1 is calculated from a value obtained by multiplying the height Y of the character string image by a predetermined coefficient α. Here, α is 0.5 for Japanese characters because many characters have an aspect ratio close to 1 in many cases.
It is selected in the range of about 1.5 to 1.5, and is set to 1 in the present embodiment. For character image images that have been slashed in this way or character string images that have a lot of character contact, even if the reference character width is determined based on the width of each character element image, the accuracy does not increase. The reference character width A1 is determined based on the height Y of the image. The determined reference character width, coordinates of the circumscribed rectangle of the character element image, and run information are notified to the Japanese character extraction unit 108.

【００３１】上述の文字列画像Ｌ₂（図２（ｂ））の場
合には、文字列画像の高さＹ₂＝４３なので、基準文字
幅Ａ１＝４３と決定される。文字列画像Ｌ３（図２
（ｃ））の場合には、文字列画像の高さＹ₃＝３５なの
で、基準文字幅Ａ１＝３５と決定される。また、和文基
準文字幅決定部１０７は、第１字種推定部１０５から文
字要素画像の外接矩形の座標とラン情報との通知を受け
ると、外接矩形の幅ｗｉを式（数）In the case of the character string image L ₂ (FIG. 2B), since the height Y ₂ of the character string image is 43, the reference character width A1 is determined to be 43. The character string image L3 (FIG. 2)
In the case of (c)), since the height Y ₃ of the character string image is 35, the reference character width A1 is determined to be 35. Further, when receiving the notification of the coordinates of the circumscribed rectangle of the character element image and the run information from the first character type estimating unit 105, the Japanese character reference character width determination unit 107 calculates the width wi of the circumscribed rectangle by the formula (number).

【００３２】[0032]

【数５】を用いて計算し、外接矩形の幅ｗｉのヒストグラムを作
成する。得られたヒストグラムの最多頻度の文字要素画
像の幅が属するピークの範囲Ｗ_R1を求める。Ｗ３≦Ｗ_R1≦Ｗ４この範囲Ｗ_R1の最大値Ｗ４を基準文字幅Ａ２と決定す
る。決定した基準文字幅Ａ２と文字要素画像の外接矩形
の座標とラン情報とを和文文字切出し部１０８に通知す
る。(Equation 5) And a histogram having a width wi of the circumscribed rectangle is created. The range W _R1 of the peak to which the width of the most frequent character element image of the obtained histogram belongs is determined. W3 ≦ W _R1 ≦ W4 The maximum value W4 of this range W _R1 is determined as the reference character width A2. The determined reference character width A2, the coordinates of the circumscribed rectangle of the character element image, and run information are notified to the Japanese character extraction unit 108.

【００３３】上述の文字列画像Ｌ₁（図２（ａ））の場
合には、図５に示すようなヒストグラムが得られ、５０
≦Ｗ_R1≦５３となり、基準文字幅Ａ２＝５３と決定され
る。和文文字切出部１０８は、和文基準文字幅決定部１
０７から基準文字幅Ａ１又はＡ２等の通知を受けると、
文字列検出部１０２に記憶されている文書画像データを
読み出し、各文字列画像ごとに基準文字幅Ａ１又はＡ２
を用いて、文字要素画像を文字画像に統合して、文字認
識部１０９にその文字画像を外接矩形の座標とラン情報
とともに通知する。なお、この文字画像の統合は、上記
従来の技術で説明した特開昭６３−１６３９１号公報記
載の式（数６）In the case of the above character string image L ₁ (FIG. 2A), a histogram as shown in FIG.
≦ _WR1 ≦ 53, and the reference character width A2 = 53 is determined. The Japanese character extracting section 108 is a Japanese character width determining section 1.
When the notification of the reference character width A1 or A2 is received from 07,
The document image data stored in the character string detection unit 102 is read, and the reference character width A1 or A2
Is used to integrate the character element image into the character image, and notifies the character recognition unit 109 of the character image together with the coordinates of the circumscribed rectangle and the run information. The integration of the character images is performed by the equation (Equation 6) described in Japanese Patent Application Laid-Open No. 63-16391 described in the above prior art.

【００３４】[0034]

【数６】を用いて、Ｃｗ≦Ａ１又はＣｗ≦Ａ２となるように隣接
するｎ個の文字要素画像を統合する。以上のようにし
て、文字列画像Ｌ１〜Ｌ４から切り出された文字画像
は、図６（ａ）〜（ｄ）にそれぞれ示されている。(Equation 6) Is used to integrate adjacent n character element images such that Cw ≦ A1 or Cw ≦ A2. The character images cut out from the character string images L1 to L4 as described above are shown in FIGS. 6A to 6D, respectively.

【００３５】文字列画像Ｌ₁（図６（ａ））の場合に
は、通常の画像であるとしているので、文字画像Ｃ１
は、実際には文字間で接触している「情報」が一文字画
像とされている。また、文字列画像Ｌ₃（図６（ｃ））
の場合には、２文字であるべき「阪支」が文字間接触の
ために異常な２つの文字画像Ｃ２、Ｃ３に切り出されて
いる。In the case of the character string image L ₁ (FIG. 6A), it is assumed that the image is a normal image.
In practice, "information" in which characters are in contact with each other is regarded as a one-character image. The character string image L ₃ (FIG. 6C)
In the case of, “Hanji”, which should be two characters, is cut out into two abnormal character images C2 and C3 due to contact between characters.

【００３６】文字認識部１０９は、認識辞書（図示せ
ず）を有し、英文文字切出部１０６または和文文字切出
部１０８から通知された文字画像に相当する文字コード
を認識する。認識辞書は、標準文字のＮ次元の標準特徴
ベクトルＦｉｊと文字コードとを組にして登録してい
る。文字認識部１０９は通知された文字画像からＮ次元
の特徴ベクトルｆｊを抽出し、式（数７）を用いて市街
地距離ｖｉを計算する。The character recognition unit 109 has a recognition dictionary (not shown), and recognizes a character code corresponding to the character image notified from the English character extraction unit 106 or the Japanese character extraction unit 108. The recognition dictionary registers an N-dimensional standard feature vector Fij of standard characters and a character code as a set. The character recognizing unit 109 extracts the N-dimensional feature vector fj from the notified character image, and calculates the city area distance vi using Expression (Formula 7).

【００３７】[0037]

【数７】ここで、ｉは辞書番号であり、ｊ＝１，２，…Ｎであ
る。この市街地距離ｖｉが最小の辞書番号ｉの文字コー
ドを認識結果とするとともに、この市街地距離ｖの値を
評価値ｖとして、文字コードと文字画像の外接矩形の座
標とともに第１文字切り出し誤り判定部１１０に通知す
る。(Equation 7) Here, i is a dictionary number, and j = 1, 2,... N. The character code of the dictionary number i with the smallest city area distance vi is used as a recognition result, and the value of this city area distance v is used as an evaluation value v, along with the character code and the coordinates of the circumscribed rectangle of the character image. Notify 110.

【００３８】この評価値は、認識結果の文字コードの確
からしさの尺度となる。即ち、認識対象の文字画像の特
徴ベクトルｆｊと標準特徴ベクトルＦｉｊとが同一であ
れば市街地距離ｖｉ＝０となり、評価値は０となる。評
価値が小さい程、認識結果の文字コードが確からしいこ
とになる。このようにして、各文字列画像Ｌ１〜Ｌ４の
文字画像Ｃ₁，Ｃ₂… を文字コードとして認識した結
果が図６（ａ）〜（ｄ）の文字画像の下に記載してい
る。なお、図６では、文字コードの表す文字で文字コー
ドを示している。その文字コードの下には、その認識結
果の評価値を示している。This evaluation value is a measure of the likelihood of the character code of the recognition result. That is, if the feature vector fj of the character image to be recognized is the same as the standard feature vector Fij, the urban area distance vi = 0, and the evaluation value is 0. The smaller the evaluation value, the more likely the character code of the recognition result is. The results of recognizing the character images C ₁ , C _2, ... Of the character string images L1 to L4 as character codes in this manner are described below the character images in FIGS. 6 (a) to 6 (d). In FIG. 6, a character code is indicated by a character represented by the character code. Below the character code, an evaluation value of the recognition result is shown.

【００３９】また、文字認識部１０９は、修正文字切り
出し部１１４から通知された文字画像に相当する文字コ
ードを上記と同様に認識する。この場合、修正文字切り
出し部１１４から和文文字であるか英文文字コードへの
修正であるかの通知を受けている場合には、そのコード
に対応する認識辞書の標準特徴ベクトルとのみ照合す
る。認識結果である文字コードが認識されると、先に英
文文字切出部１０６又は和文文字切出部１０８から通知
された修正のない文字画像と修正文字切り出し部１１４
で修正された文字画像との文字コードとその外接矩形の
座標と評価値とが認識結果出力部１１５に出力される。
文字認識部１０９は、一の文字列画像の文字認識が終了
すると、文字列検出部１０２に次の文字列画像を検出す
るよう指示する。The character recognizing unit 109 recognizes a character code corresponding to the character image notified from the corrected character extracting unit 114 in the same manner as described above. In this case, if the correction character cutout unit 114 has received a notification as to whether it is a Japanese character or a correction to an English character code, it is compared only with the standard feature vector of the recognition dictionary corresponding to the code. When the character code as the recognition result is recognized, the character image without correction and the corrected character cutout unit 114 previously notified from the English character cutout unit 106 or the Japanese character cutout unit 108
The character code of the character image and the coordinates of the circumscribed rectangle and the evaluation value of the corrected character image are output to the recognition result output unit 115.
When the character recognition of one character string image is completed, the character recognition unit 109 instructs the character string detection unit 102 to detect the next character string image.

【００４０】第１文字切り出し誤り判定部１１０は、評
価値のしきい値Ｒを予め記憶している。文字認識部１０
９から文字コード等とともに通知された市街地距離の値
である評価値ｖがしきい値Ｒを超えるか否かを判定し、
肯定のときには文字切り出し誤りであると判断し、誤り
である旨とその文字コードと文字画像の外接矩形の座標
とを記憶する。否定のときには文字切り出しが誤りでな
いと判断し、その文字コードと文字画像の外接矩形の座
標とを記憶する。文字切り出しが誤りでない文字コード
と文字画像の外接矩形の座標とを第２字種推定部１１１
に順次通知する。The first character segmentation error determination section 110 stores a threshold value R of the evaluation value in advance. Character recognition unit 10
It is determined whether or not the evaluation value v, which is the value of the city distance notified from 9 with the character code, exceeds the threshold value R,
If the result is affirmative, it is determined that a character segmentation error has occurred, and the fact of the error, the character code thereof, and the coordinates of the circumscribed rectangle of the character image are stored. If the result is negative, it is determined that character extraction is not an error, and the character code and the coordinates of the circumscribed rectangle of the character image are stored. The second character type estimating unit 111 calculates a character code in which character extraction is not incorrect and coordinates of a circumscribed rectangle of the character image.
Notify sequentially.

【００４１】ここで、しきい値Ｒ＝８００として文字切
り出し誤りであるか否かを判定すると、図６に示したよ
うに、文字列画像Ｌ１の文字画像Ｃ₁「情報」、文字列
画像Ｌ２の文字画像Ｃ₄「西」、文字列画像Ｌ３の文字
画像Ｃ₂，Ｃ₃が文字切り出し誤りと判断される。な
お、このしきい値Ｒは市街地距離ｖｉを計算するための
認識辞書標準特徴ベクトルと抽出された特徴ベクトルの
数値によって適当に選ばれる。Here, when it is determined whether or not a character segmentation error has occurred with the threshold value R = 800, as shown in FIG. 6, the character image C ₁ “information” of the character string image L ₁ and the character string image L 2 The character image C ₄ “west” and the character images C ₂ and C _{3 of} the character string image L3 are determined to be character extraction errors. The threshold value R is appropriately selected based on the recognition dictionary standard feature vector for calculating the city distance vi and the numerical value of the extracted feature vector.

【００４２】第２字種推定部１１１は、文字コードと文
字種（漢字、平仮名、カタカナ等）との一覧と、予め定
めている定数Ｒ４とを記憶している。第１文字切り出し
誤り判定部１１０から通知される文字コードを文字種別
に分類し、通知された文字コードの数Ｎ０を計数する。
一文字列画像の処理が終了すると、次に分類した文字種
のうち、和文専用文字（漢字、平仮名、カタカナ）数Ｎ
４を計算し、Ｎ４／Ｎ０≧Ｒ４か否かを判定する。肯定
のときは、この一文字列画像が和文文字列であると推定
し、第２文字切り出し誤り判定部１１２に通知する。否
定のときは、この一文字列画像が英文文字列であると推
定し、第２文字切り出し誤り判定部１１２に通知する。The second character type estimating section 111 stores a list of character codes and character types (Kanji, Hiragana, Katakana, etc.) and a predetermined constant R4. The character codes notified from the first character segmentation error determination unit 110 are classified into character types, and the number N0 of the notified character codes is counted.
When the processing of one character string image is completed, the number N of Japanese-only characters (kanji, hiragana, katakana) among the character types classified next is N
4 to determine whether N4 / N0 ≧ R4. When the result is affirmative, the one character string image is estimated to be a Japanese character string, and the second character extraction error determination unit 112 is notified. If the result is negative, the one character string image is estimated to be an English character string, and the second character cutout error determination unit 112 is notified.

【００４３】ここで、定数Ｒ４の値は０．５程度以上の
範囲で選ばれ、本実施の形態ではＲ４＝０．９とする。
また、和文文字列と英文文字列との別の通知とともに、
第１文字切り出し誤り判定部１１０から通知された文字
コードと文字画像の外接矩形の座標とを第２文字切り出
し誤り判定部１１２に通知する。Here, the value of the constant R4 is selected in a range of about 0.5 or more, and in the present embodiment, R4 = 0.9.
In addition, along with another notification of Japanese and English strings,
The character code and the coordinates of the circumscribed rectangle of the character image notified from the first character segmentation error determination unit 110 are notified to the second character segmentation error determination unit 112.

【００４４】第２文字切り出し誤り判定部１１２は、第
２字種推定部１１１と同様に、文字コードと対応する文
字の文字種との一覧を記憶している。第２字種推定部１
１１から英文文字列である旨と文字コードと文字画像の
外接矩形の座標との通知を受けると、文字コードの対応
する文字の文字種が和文専用文字種であるか否かを検索
し、和文専用文字種であるときには、その認識結果の文
字コードが文字切り出し誤りの結果であるとして、その
文字コードとその文字画像の外接矩形の座標と英文文字
コードに修正する旨とを記憶する。和文専用文字種でな
いときは、その認識結果の文字コードが正しいと判定す
る。The second character segmentation error determination unit 112 stores a list of character codes and corresponding character types, similarly to the second character type estimation unit 111. Second character type estimation unit 1
11 receives a notification that the character string is an English character string, the character code, and the coordinates of the circumscribed rectangle of the character image, and searches whether or not the character type of the character corresponding to the character code is a Japanese character type. , The character code of the recognition result is determined to be the result of a character cutout error, and the character code, the coordinates of the circumscribed rectangle of the character image, and the fact that the character code is to be corrected are stored. If it is not a Japanese-only character type, it is determined that the character code of the recognition result is correct.

【００４５】また、第２字種推定部１１１から和文文字
列である旨と文字コードとその文字画像の外接矩形の座
標との通知を受けると、文字コードに対応する文字の文
字種が英文専用文字種であるか否かを検索し、英文専用
文字種であるときは、更に前後の文字コードに対応する
文字の文字種が和文専用文字種であるか否かを検索し、
そうであれば通知された文字コードが文字切り出し誤り
の結果であるとしてその文字コードとその文字画像の外
接矩形の座標と和文文字コードに修正する旨とを記憶す
る。一文字列画像の処理が終了すると、基準文字幅修正
部１１３を起動する。When the second character type estimating unit 111 notifies that the character string is a Japanese character string, the character code and the coordinates of the circumscribed rectangle of the character image, the character type of the character corresponding to the character code is set to the English character type. Whether the character type corresponding to the preceding and following character codes is a Japanese-only character type.
If so, the notified character code is determined to be the result of a character extraction error, and the character code, the coordinates of the circumscribed rectangle of the character image, and the fact that the character code is to be corrected to the Japanese character code are stored. When the processing of one character string image is completed, the reference character width correction unit 113 is activated.

【００４６】基準文字幅修正部１１３は、第２字種推定
部１１１から起動されると、第１文字切り出し誤り判定
部１１０で文字切り出し誤りである旨とともに記憶され
ている文字コードに対応する文字画像の外接矩形の座標
又は第２文字切り出し誤り判定部１１２に記憶されてい
る文字コードに対応する文字画像の外接矩形の座標を読
み出す。次にこれらの文字コード以外の文字コード（文
字切り出し誤りでないと第１文字切り出し誤り判定部１
１０及び第２文字切り出し誤り判定部１１２で判定され
た文字画像の文字コード）に対応する外接矩形の座標を
第１文字切り出し誤り判定部１１０から読み出す。この
外接矩形の座標から文字画像の外接矩形の幅のヒストグ
ラムを上記和文基準文字幅決定部１０７と同様の手法で
作成する。得られたヒストグラムの最多頻度の文字画像
の幅が属するピークの範囲Ｗ_R2を求める。When activated by the second character type estimating unit 111, the reference character width correcting unit 113 determines that the first character segmentation error determination unit 110 has detected a character segmentation error and stores the character corresponding to the character code stored therein. The coordinates of the circumscribed rectangle of the image or the coordinates of the circumscribed rectangle of the character image corresponding to the character code stored in the second character segmentation error determination unit 112 are read. Next, a character code other than these character codes (the first character cutout error determining unit 1 determines that there is no character cutout error).
10 and the coordinates of the circumscribed rectangle corresponding to the character code of the character image determined by the second character extraction error determination unit 112 are read from the first character extraction error determination unit 110. From the coordinates of the circumscribed rectangle, a histogram of the width of the circumscribed rectangle of the character image is created by a method similar to that of the Japanese text reference character width determination unit 107. The range W _R2 of the peak to which the width of the most frequent character image of the obtained histogram belongs is determined.

【００４７】Ｗ５≦Ｗ_R2≦Ｗ６この範囲Ｗ_R2の最大値Ｗ６を修正基準文字幅Ａ３と決定
する。決定した修正基準文字幅Ａ３と文字切り出し誤り
であるとして読み出した文字画像の外接矩形の座標とを
修正文字切り出し部１１４に通知する。図７（ａ）は、
文字列画像Ｌ₁で切り出し誤りでないと判定された文字
画像の外接矩形の幅のヒストグラムである。このヒスト
グラムから修正基準文字幅Ａ３＝５３が決定される。W5 ≦ W _R2 ≦ W6 The maximum value W6 of this range W _R2 is determined as the modified reference character width A3. The corrected character cutout unit 114 is notified of the determined correction reference character width A3 and the coordinates of the circumscribed rectangle of the character image read out as a character cutout error. FIG. 7 (a)
Is a histogram of the circumscribed rectangle of the width of the character string image L ₁ in cutout character image is determined not to be erroneous. The correction reference character width A3 = 53 is determined from this histogram.

【００４８】同様に、図７（ｂ）は文字列画像Ｌ₂のヒ
ストグラムであり、図７（ｃ）は、文字列画像Ｌ₃のヒ
ストグラムである。このヒストグラムから修正基準文字
幅Ａ３が文字列画像Ｌ₂ではＡ３＝４１と文字列画像Ｌ
₃ではＡ３＝３８とそれぞれ決定される。修正文字切り
出し部１１４は、基準文字幅修正部１１３から修正基準
文字幅Ａ３の通知を受けると、同時に通知された文字画
像の外接矩形の座標から得られるその文字画像の存在領
域（幅）と修正基準文字幅Ａ３と比較する。修正基準文
字幅Ａ３が大きいときには、掠れ文字であるとして連続
する文字画像を統合して修正基準文字幅Ａ３に一致する
文字画像を文字列検出部１０２に記憶されている文書画
像データを参照して切り出し、そのラン情報と外接矩形
の座標とを文字認識部１０９に通知する。[0048] Similarly, FIG. 7 (b) is a histogram of the character string image L _2, FIG. 7 (c) is a histogram of the character string image L _3. In modified from the histogram reference character width A3 string image L ₂ A3 = 41 and the character string image L
_In A3, it is determined that A3 = 38. Upon receiving the notification of the correction reference character width A3 from the reference character width correction unit 113, the correction character cutout unit 114 and the correction area and the correction area (width) of the character image obtained from the coordinates of the circumscribed rectangle of the character image notified at the same time. Compare with the reference character width A3. When the corrected reference character width A3 is large, continuous character images are determined to be blurred characters, and a character image that matches the corrected reference character width A3 is referred to the document image data stored in the character string detection unit 102. The cut-out information and the coordinates of the circumscribed rectangle are notified to the character recognition unit 109.

【００４９】また、修正基準文字幅Ａ３が小さいときに
は、文字間接触がある文字であるとして、文字画像の存
在領域（幅）を修正基準文字幅Ａ３で除算して文字数を
出し、その文字数に対応した文字画像を文字列検出部１
０２に記憶されている文書画像データを参照して切り出
し、そのラン情報と外接矩形の座標とを文字認識部１０
９に通知する。この際、第２文字切り出し誤り判定部１
１２を参照して、文字切り出し誤りが和文文字又は英文
文字コードへの修正である旨が記憶されているときに
は、その旨も通知する。When the modified reference character width A3 is small, it is determined that the character has a character-to-character contact, and the area (width) of the character image is divided by the modified reference character width A3 to obtain the number of characters. Character image detected by character string detector 1
02, referring to the document image data stored in the character recognition unit 10 based on the run information and the coordinates of the circumscribed rectangle.
Notify 9. At this time, the second character segmentation error determination unit 1
Referring to FIG. 12, when it is stored that the character segmentation error is a correction to a Japanese character or English character code, the fact is also notified.

【００５０】この結果、上記した文字認識部１０９にお
いて、図８（ａ）に示すように文字列画像Ｌ₁の文字画
像Ｃ₁（図６（ａ））は、正しい文字画像「情」Ｃ_C1と
「報」Ｃ_C2とに認識される。また、図８（ｂ）に示すよ
うに、切り出し誤りとされた文字列画像Ｌ₂の文字画像
Ｃ（図６（ｂ））は、再度認識評価値ｄｉがしきい値Ｒ
よりも大きな値である文字画像「西」Ｃ_C4と認識され
る。これは文字画像Ｃ₄の掠れのために評価値が大きな
値となったものであり、認識結果が誤りとなったわけで
はないからである。図８（ｃ）に示すように文字列画像
Ｌ₃の文字画像Ｃ ₂、Ｃ₃（図６（ｃ））は、正しい文
字画像「阪」Ｃ_C2と「支」Ｃ_C3とに認識される。As a result, the character recognition unit 109 described above
And a character string image L as shown in FIG.₁Character painting
Statue C₁(FIG. 6 (a)) shows a correct character image "information" C_C1When
"Report" C_C2And will be recognized. Also, as shown in FIG.
As described above, the character string image L determined to have been cut out error._TwoCharacter image
C (FIG. 6B) indicates that the recognition evaluation value di is again equal to the threshold R
Character image "west" C with a value larger than_C4Recognized
You. This is character image C_FourEvaluation value is large due to
Value, and the recognition result was incorrect.
Because there is no. A character string image as shown in FIG.
L_ThreeCharacter image C _Two, C_Three(FIG. 6 (c)) shows the correct sentence
Character image "Saka" C_C2And "support" C_C3And will be recognized.

【００５１】認識結果出力部１１５は、文字認識部１０
９から通知された文字コードを相当する標準文字画像に
変換して表示画面に表示する。次に、本実施の形態の動
作を図９〜図１２のフローチャートを用いて説明する。
先ず、画像入力部１０１は、オペレータから文書原稿の
入力を受けると、文字画像に対応する黒画素と背景部分
に対応する白画素とからなる文書画像データに変換して
文字列検出部１０２に通知する（Ｓ９０２）。The recognition result output unit 115 is provided for the character recognition unit 10.
9 is converted to a corresponding standard character image and displayed on the display screen. Next, the operation of the present embodiment will be described with reference to the flowcharts of FIGS.
First, when an input of a document original is received from an operator, the image input unit 101 converts the input into document image data including black pixels corresponding to a character image and white pixels corresponding to a background part, and notifies the character string detection unit 102 of the image data. (S902).

【００５２】文字列検出部１０２は、その文書画像デー
タを記憶し、文書画像データから文字列画像を検出し、
文字列画像を文字要素検出部１０３に通知する（Ｓ９０
４）。文字列状態推定部１０４は、文字列検出部１０２
から文字列画像の通知を受けると、文字列画像の高さＹ
を算出する（Ｓ９０６）。The character string detecting section 102 stores the document image data, detects a character string image from the document image data,
The character string image is notified to the character element detection unit 103 (S90
4). The character string state estimating unit 104
When the notification of the character string image is received from
Is calculated (S906).

【００５３】文字要素検出部１０３は、文字列検出部１
０２から文字列画像の通知を受けると、黒画素連結部を
文字要素画像として検出するとともに文字要素画像数Ｎ
を計数する（Ｓ９０８）。文字列状態推定部１０４は、
式（数１）を用いて第１しきい値Ｗ１を算出する。この
第１しきい値Ｗ１は、文字要素画像が幅細文字要素画像
であるか否かを画する値である（Ｓ９１０）。式（数
２）を用いて第２しきい値Ｗ２を算出する。この第２し
きい値Ｗ２は文字要素画像が幅太文字要素画像であるか
否かを画する値である（Ｓ９１２）。次に、文字列状態
推定部１０４は、文字要素画像の外接矩形の幅が第１し
きい値Ｗ１以下の文字要素画像の数Ｎ１を計数し（Ｓ９
１４）、同様に外接矩形の幅が第２しきい値Ｗ２以上の
文字要素画像の数Ｎ２を計数する（Ｓ９１６）。Ｎ１／
Ｎの値が所定の割合Ｒ１以上であるか否かを判定し（Ｓ
９１８）、以上であれば掠れ文字が多いと判断し、Ｓ９
２２に移る。未満であれば、Ｎ２／Ｎの値が所定の割合
Ｒ２以上であるか否かを判定し（Ｓ９２０）、以上であ
れば、文字間接触が多いと判断し、Ｓ９２２に移る。The character element detection unit 103 includes the character string detection unit 1
When the notification of the character string image is received from No. 02, the black pixel connection part is detected as a character element image and the number N of character element images
Is counted (S908). The character string state estimating unit 104
The first threshold value W1 is calculated using the equation (Equation 1). The first threshold value W1 is a value indicating whether or not the character element image is a narrow character element image (S910). The second threshold value W2 is calculated using the equation (Equation 2). The second threshold value W2 is a value indicating whether or not the character element image is a bold character element image (S912). Next, the character string state estimation unit 104 counts the number N1 of character element images in which the width of the circumscribed rectangle of the character element image is equal to or smaller than the first threshold value W1 (S9).
14) Similarly, the number N2 of character element images in which the width of the circumscribed rectangle is equal to or larger than the second threshold value W2 is counted (S916). N1 /
It is determined whether the value of N is equal to or greater than a predetermined ratio R1 (S
918) If above, it is determined that there are many blurred characters, and S9
Move to 22. If less than, it is determined whether or not the value of N2 / N is equal to or greater than a predetermined ratio R2 (S920). If it is more than that, it is determined that there is much inter-character contact, and the flow proceeds to S922.

【００５４】Ｓ９２２において、和文基準文字幅決定部
１０９は、基準文字幅Ａ１を式（数４）を用いて計算
し、Ｓ１０２２に移る。Ｓ９２０において、未満と判定
されたときには、通常の文字列画像であると判断し、第
１字種推定部１０５は、隣接する文字要素画像の外接矩
形間の間隔のヒストグラムを作成し（Ｓ１００２）、そ
のヒストグラムの第１ピークの範囲を求め（Ｓ１００
４）、その範囲内に入る文字要素画像の数Ｎ３を計数す
る（Ｓ１００６）。In S922, the Japanese sentence reference character width determination unit 109 calculates the reference character width A1 using the equation (Equation 4), and proceeds to S1022. If it is determined in S920 that the character image is less than the normal character string image, the first character type estimation unit 105 creates a histogram of the interval between circumscribed rectangles of adjacent character element images (S1002), The range of the first peak of the histogram is obtained (S100
4) Count the number N3 of character element images falling within the range (S1006).

【００５５】次に、式（数３）を用いて第３しきい値Ｄ
を計算し（Ｓ１００８）、第１ピークの外接矩形間の間
隔Ｄ_Rの最大値Ｄ₂が第３しきい値Ｄ以下で、かつ、Ｎ
３／Ｎの値が所定の割合Ｒ３以上であるか否かを判定す
る（Ｓ１０１０）。これによって、文字列画像が英文で
あるか和文であるかを判定する。肯定のときは、文字列
画像が英文であると判定され、英文文字切出部１０６
は、文字要素画像のラン情報を基に、微小文字要素画像
の統合処理をして（Ｓ１０１２）、文字画像を切り出し
（Ｓ１０１４）、Ｓ１１０２に移る。Next, the third threshold value D is calculated using the equation (Equation 3).
Was calculated (S1008), the maximum value D ₂ distance D _R between the circumscribed rectangle of the first peak following the third threshold value D, and, N
It is determined whether the value of 3 / N is equal to or greater than a predetermined ratio R3 (S1010). Thereby, it is determined whether the character string image is an English sentence or a Japanese sentence. If affirmative, it is determined that the character string image is in English, and the English character extraction unit 106
Performs the integration process of the minute character element image based on the run information of the character element image (S1012), cuts out the character image (S1014), and proceeds to S1102.

【００５６】Ｓ１０１０で否定を判定されたときは、文
字列画像が和文であると判定され、和文基準文字幅決定
部１０７は、文字要素画像の外接矩形幅のヒストグラム
を作成し（Ｓ１０１６）、その最多頻度の外接矩形幅が
属するピークの範囲Ｗ_R1を求める（Ｓ１０１８）。その
範囲Ｗ_R1の最大値Ｗ４を基準文字幅Ａ２に決定する（Ｓ
１０２０）。If a negative determination is made in S1010, the character string image is determined to be a Japanese sentence, and the Japanese sentence reference character width determination unit 107 creates a histogram of the circumscribed rectangle width of the character element image (S1016). The peak range W _R1 to which the most frequent circumscribed rectangle width belongs is determined (S1018). The maximum value W4 of the range W _R1 is determined as the reference character width A2 (S
1020).

【００５７】和文文字切り出し部１０８は、和文基準文
字幅決定部１０７から通知された基準文字幅Ａ１又はＡ
２以下の外接矩形幅Ｃ_Wの文字画像となるように式（数
６）を用いて文字要素画像を統合して切り出す（Ｓ１０
２２）。文字認識部１０９は、英文文字切出部１０６又
は和文文字切り出し部１０８から切り出された文字画像
の通知を受けると、特徴ベクトルを抽出し、認識辞書の
全ての標準特徴ベクトルとの市街地距離ｖｉを式（数
７）を用いて計算し、その値ｖｉの最小となる標準特徴
ベクトルの文字コードを認識結果とし、評価値ｖととも
に第１文字切り出し誤り判定部１１０に通知する（Ｓ１
１０２）。The Japanese character extraction unit 108 outputs the reference character width A1 or A
The character element images are integrated and cut out using Expression (Equation 6) so as to obtain a character image having a circumscribed rectangular width _CW of 2 or less (S10).
22). When the character recognition unit 109 receives the notification of the character image cut out from the English character cutout unit 106 or the Japanese character cutout unit 108, the character recognition unit 109 extracts the feature vector and calculates the city distance vi with all the standard feature vectors in the recognition dictionary. It is calculated using the equation (Equation 7), and the character code of the standard feature vector having the minimum value vi is used as the recognition result, and is notified to the first character segmentation error determination unit 110 together with the evaluation value v (S1).
102).

【００５８】第１文字切り出し誤り判定部１１０は、通
知された評価値ｖの値が予め記憶されているしきい値Ｒ
を超えるか否かを判定し（Ｓ１１０４）、肯定のときは
Ｓ１１１０に移る。否定のときは、文字認識が正しいと
して、第２字種推定部１１１は、第１文字切り出し誤り
判定部１１０から通知された文字コードを字種別（漢字
・平仮名・カタカナ・その他）に分類し（Ｓ１１０
６）、通知された文字コードの数Ｎ０を計数する（Ｓ１
１０８）。Ｓ１１１０において、一文字列画像の処理が
終了していなければ、Ｓ１１０２に戻り、終了すると和
文専用文字数Ｎ４を計算し（Ｓ１１１２）、Ｎ４／Ｎ０
の値が予め記憶している定数Ｒ４以上であるか否かを判
断し（Ｓ１１１４）、肯定のときはＳ１２０２に移る。The first character segmentation error determination section 110 stores the value of the notified evaluation value v in a threshold R
Is determined (S1104), and if affirmative, the process moves to S1110. If the result is negative, it is determined that the character recognition is correct, and the second character type estimating unit 111 classifies the character codes notified from the first character segmentation error determining unit 110 into character types (Kanji, Hiragana, Katakana, and others) ( S110
6) Count the number N0 of the notified character codes (S1)
108). If the processing of one character string image has not been completed in S1110, the process returns to S1102. When the processing is completed, the number N4 of Japanese-only characters is calculated (S1112), and N4 / N0 is obtained.
Is determined to be greater than or equal to the constant R4 stored in advance (S1114), and if affirmative, the process moves to S1202.

【００５９】否定のときは、この文字列画像が英文文字
列であるとして、第２文字切り出し誤り判定部１１２
は、第２字種推定部１１１を介して通知された文字コー
ドが和文専用文字種であるか否かを判定し（Ｓ１１１
６）、肯定のときは文字切り出し誤りとして記憶し（Ｓ
１１１８）、否定のときは、一文字列画像の処理が終了
していなければ（Ｓ１１２０）、Ｓ１１１６に戻り、終
了していればＳ１２１０に移る。If not, it is determined that the character string image is an English character string, and the second character segmentation error determination unit 112
Determines whether the character code notified via the second character type estimating unit 111 is a Japanese character only character type (S111).
6) If affirmative, store it as a character segmentation error (S
1118) If not, if the processing of the one character string image has not been completed (S1120), the process returns to S1116, and if completed, the process proceeds to S1210.

【００６０】Ｓ１２０２において、第２文字切り出し誤
り判定部１１２は、文字列画像が和文文字列であるとし
て、第２字種推定部を介して通知された文字コードが英
文専用文字種（アルファベット・数字・記号）であるか
否かを判定し（Ｓ１２０２）、否定のときはＳ１２０８
に移り、肯定のときは、前後の文字コードのいずれかが
和文専用文字種であるか否かを判定し（Ｓ１２０４）否
定のときはＳ１２０８に移り、肯定のときには通知され
た文字コードが文字切り出し誤りであるとして記憶する
（Ｓ１２０６）。Ｓ１２０８において、一文字列画像の
処理が終了したか否かを判定し、終了していなければＳ
１２０２に戻り、終了していればＳ１２１０に移る。In step S1202, the second character segmentation error determination unit 112 determines that the character string image is a Japanese character string, and sets the character code notified via the second character type estimation unit to the English character type (alphabetic, numeric, (S1202), and if not, S1208.
If it is affirmative, it is determined whether any of the preceding and following character codes is a Japanese character type (S1204). If the result is negative, the process proceeds to S1208. Is stored (S1206). In step S1208, it is determined whether the processing of the one character string image has been completed.
It returns to 1202, and if it has been completed, it will transfer to S1210.

【００６１】Ｓ１２１０において、基準文字幅修正部１
１３は、切り出し誤りでないと判断された文字画像の外
接矩形幅のヒストグラムを作成し（Ｓ１２１０）、その
ヒストグラムの最多頻度の外接矩形幅が属するピークの
範囲Ｗ_R2を求める（Ｓ１２１２）。この範囲Ｗ_R2の最大
値Ｗ６を修正基準文字幅Ａ３として、修正文字切り出し
部１１４に通知する（Ｓ１２１４）。At S1210, reference character width correction unit 1
In step S1210, a histogram of the circumscribed rectangle width of the character image determined not to be an extraction error is created (S1210), and a peak range W _R2 to which the most frequent circumscribed rectangle width of the histogram belongs is obtained (S1212). The maximum value W6 of the range W _R2 is notified to the corrected character cutout unit 114 as the corrected reference character width A3 (S1214).

【００６２】修正文字切り出し部１１４は、通知された
修正基準文字幅Ａ３に基づいて、文字切り出し誤りとさ
れた文字画像を修正して切り出し、文字認識部１０９に
通知する（Ｓ１２１６）。文字認識部１２１８は、通知
された文字画像を認識辞書を照合して文字コードを決定
する。この際、英文文字種又は和文文字種である旨の通
知を受けているときは、その対象字種内で照合する（Ｓ
１２１８）。一文字列画像の処理が終了しているか否か
を判定し（Ｓ１２２０）、終了していなければＳ１２１
６に戻り、終了しているときは全ての文字列画像の処理
が終了しているか否かを判定し、終了していないときは
Ｓ９０６に戻り、終了しているときは、認識結果出力部
１１５が認識結果である文字コードに対応する文字画像
を出力して処理を終了する（Ｓ１２２２）。The corrected character cutout unit 114 corrects and cuts out a character image in which a character cutout error has occurred based on the notified correction reference character width A3, and notifies the character recognition unit 109 (S1216). The character recognition unit 1218 determines the character code by comparing the notified character image with the recognition dictionary. At this time, if a notification that the character type is an English character type or a Japanese character type is received, the collation is performed within the target character type (S
1218). It is determined whether the processing of the one character string image has been completed (S1220).
6, if it has been completed, it is determined whether or not processing of all character string images has been completed. If it has not been completed, the process returns to S906. If it has been completed, the recognition result output unit 115 Outputs a character image corresponding to the character code that is the recognition result, and ends the process (S1222).

【００６３】なお、本実施の形態では、横書き文字列を
例に取り説明したけれども、縦書き文字列について適応
できるのは勿論である。この場合には、文字列方向の長
さを「高さ」、文字列に直角方向の長さを「幅」として
取り扱う。なお、本実施の形態では、図１に示したよう
な構成で本発明に係る文字認識装置を実現したけれど
も、本発明はプログラムによって実現し、これをフロッ
ピーディスク等の記録媒体に記録して移送することによ
り、独立した他のコンピュータ・システムで容易に実施
することができる。図１３は、これをフロッピーディス
クで実施する場合を説明する図である。Although this embodiment has been described by taking a horizontally written character string as an example, it is needless to say that the present invention can be applied to a vertically written character string. In this case, the length in the character string direction is treated as “height”, and the length in the direction perpendicular to the character string is treated as “width”. In this embodiment, the character recognition device according to the present invention is realized by the configuration shown in FIG. 1, but the present invention is realized by a program, which is recorded on a recording medium such as a floppy disk and transferred. By doing so, it can be easily implemented on another independent computer system. FIG. 13 is a diagram for explaining a case where this is carried out using a floppy disk.

【００６４】記録媒体本体であるフロッピーディスク１
３０１の物理フォーマットは、同心円状に外周から内周
に向かってトラック１、２、…、８０を作成し、角度方
向に１６のセクタに分割している。このように割り当て
られた領域に従って、プログラムを記録する。このフロ
ッピーディスク１３０１は、ケース１３０２に収納さ
れ、これによって、ディスクを埃や外部からの衝撃から
守り、安全に移送することができる。Floppy disk 1 as a recording medium body
In the physical format 301, tracks 1, 2,..., 80 are created concentrically from the outer periphery to the inner periphery, and are divided into 16 sectors in the angular direction. The program is recorded according to the allocated area. The floppy disk 1301 is housed in a case 1302, so that the disk can be protected from dust and external impact and can be transported safely.

【００６５】図１４は、フロッピーディスク１３０１に
プログラムの記録再生を行うことを説明する図である。
図示のようにコンピュータ・システム１４０３にフロッ
ピーディスクドライブ１４０１を接続することにより、
ディスク１３０１に対してプログラムを記録再生するこ
とが可能となる。ディスク１３０１はフロッピーディス
クドライブ１４０１に、挿入口１４０２を介して組込
み、および取り出しがなされる。記録する場合はコンピ
ュータ・システム１４０３からプログラムをフロッピー
ディスクドライブ１４０１によってディスク１３０１に
記録する。再生する場合は、フロッピーディスクドライ
ブ１４０１がプログラムをディスク１３０１から読み出
し、コンピュータ・システム１４０３に転送する。FIG. 14 is a diagram for explaining recording and reproduction of a program on the floppy disk 1301.
By connecting the floppy disk drive 1401 to the computer system 1403 as shown,
The program can be recorded and reproduced on the disk 1301. The disk 1301 is inserted into and removed from a floppy disk drive 1401 via an insertion slot 1402. When recording, the program is recorded on the disk 1301 by the floppy disk drive 1401 from the computer system 1403. For reproduction, the floppy disk drive 1401 reads the program from the disk 1301 and transfers it to the computer system 1403.

【００６６】なお、この実施の形態においては、記録媒
体としてフロッピーディスクを用いて説明を行ったが、
光ディスクを用いても同様に行うことができる。また記
録媒体はこれらに限られず、ＩＣカード、ＲＯＭカセッ
ト等、プログラムを記録できるものであれば、同様に実
施することができる。In this embodiment, a description has been given using a floppy disk as a recording medium.
The same can be done using an optical disk. The recording medium is not limited to these, but can be similarly implemented as long as the program can be recorded thereon, such as an IC card or a ROM cassette.

【００６７】[0067]

【発明の効果】以上のように本発明によれば、文書画像
から文字列画像とその外接矩形の位置とを抽出する文字
列画像抽出手段と、上記文字列画像から文字要素画像と
その外接矩形の位置とを抽出する文字要素画像抽出手段
と、上記文字要素画像の外接矩形の文字列画像に平行な
第１長さと、上記文字列画像の文字列画像に直角方向の
第２長さに基づいて定められるしきい値で上記文字列画
像の状態が正常か異常かを分類する文字列状態分類手段
と、前記文字列状態分類手段で分類した上記文字列画像
の状態によって文字列画像から文字画像を切り出す文字
列画像に平行な基準長さを決定する基準文字幅決定手段
と、前記基準文字幅決定手段で決定された基準長さに従
い文字要素画像を統合、分離して文字画像を切り出す文
字画像切出手段と、切り出された文字画像の特徴と認識
辞書の標準特徴とを照合し、文字画像を相当する文字コ
ードとして認識する文字認識手段とを備えることとして
いるので、文字列状態分類手段が文字列状態が正常か異
常かを分類し、基準文字幅決定手段は、その分類に従い
文字画像の切り出し幅を決定できる。従って、文字画像
の切り出し精度が向上し、文字認識効率が高まる。As described above, according to the present invention, a character string image extracting means for extracting a character string image and a position of a circumscribed rectangle from a document image, a character element image and a circumscribed rectangle from the character string image Character element image extracting means for extracting the position of the character element image, a first length parallel to the character string image of the circumscribed rectangle of the character element image, and a second length perpendicular to the character string image of the character string image. A character string state classifying means for classifying whether the state of the character string image is normal or abnormal with a threshold value determined by the threshold value, and a character image from the character string image according to the state of the character string image classified by the character string state classifying means. Character width determining means for determining a reference length parallel to the character string image to be cut out, and a character image for integrating and separating the character element images according to the reference length determined by the reference character width determining means to cut out the character image Cutting means and The character string state classifying means determines that the character string state is normal because the character string state classifying means includes character recognition means for recognizing the character of the cut-out character image with the standard characteristic of the recognition dictionary and recognizing the character image as a corresponding character code. The reference character width determination means can determine the cutout width of the character image according to the classification. Therefore, the accuracy of extracting a character image is improved, and the efficiency of character recognition is improved.

【００６８】また、前記文字列状態分類手段は、文字列
画像単位で文字要素画像の総数Ｎを計数する総数計数部
と、文字列画像単位で各文字要素画像の第１長さを計算
する第１長さ計算部と、文字列画像の外接矩形の第２長
さＹを基準に第１しきい値Ｗ１を計算し、前記第１長さ
計算部で計算された各文字要素画像の外接矩形の第１長
さと第１しきい値Ｗ１とを比較し、第１長さが第１しき
い値Ｗ１以下の幅細文字要素の数Ｎ１を計数し、Ｎ１／
Ｎの値が予め定めた割合Ｒ１以上のとき文字列画像を異
常な掠れ画像と判定する掠れ画像判定部と、文字列画像
の外接矩形の第２長さＹを基準に第２しきい値Ｗ２を計
算し、前記第１長さ計算部で計算された各文字要素画像
の外接矩形の第１長さと第２しきい値Ｗ２とを比較し、
第１長さが第２しきい値Ｗ２以上の幅太文字要素の数Ｎ
２を計数し、Ｎ２／Ｎの値が予め定めた割合Ｒ２以上の
とき文字列画像を異常な文字間接触の多い画像と判定す
る接触画像判定部と、前記掠れ画像判定部又は接触画像
判定部で異常な画像と判定されないとき、文字列画像を
正常な画像と判定する正常画像判定部とを有することと
しているので、文字列画像の外接矩形の高さから計算式
を用いて計算した第１、第２しきい値を基準に、文字要
素画像抽出手段で抽出された文字要素画像を幅細又は幅
太文字要素に分類するので、文字列画像が正常か異常か
を容易に分類できる。The character string state classifying means counts the total number N of character element images for each character string image, and calculates a first length of each character element image for each character string image. A first length calculation unit, and a first threshold value W1 calculated based on a second length Y of a circumscribed rectangle of the character string image, and a circumscribed rectangle of each character element image calculated by the first length calculation unit Is compared with the first threshold value W1, the number N1 of narrow character elements whose first length is equal to or less than the first threshold value W1 is counted, and N1 /
A blurred image determination unit that determines a character string image as an abnormal blurred image when the value of N is equal to or greater than a predetermined ratio R1, and a second threshold value W2 based on a second length Y of a circumscribed rectangle of the character string image And comparing the first length of the circumscribed rectangle of each character element image calculated by the first length calculation unit with a second threshold value W2,
Number N of bold character elements whose first length is greater than or equal to second threshold value W2
2, a contact image determining unit that determines a character string image to be an image having many abnormal character-to-character contacts when the value of N2 / N is equal to or greater than a predetermined ratio R2, and the blurred image determining unit or the contact image determining unit And a normal image determining unit that determines the character string image to be a normal image when the image is not determined to be an abnormal image, so that the first image calculated using the calculation formula from the height of the circumscribed rectangle of the character string image Since the character element images extracted by the character element image extracting means are classified into narrow or wide character elements based on the second threshold value, it is possible to easily classify whether the character string image is normal or abnormal.

【００６９】また、前記文字列状態分類手段で正常な画
像と分類された文字列画像の文字要素画像の外接矩形の
間隔の分布に基づいて文字列画像が英文であるか和文で
あるかを推定する第１字種推定手段を備え、前記文字画
像切出手段は、前記第１字種推定手段で英文と推定され
たときには、文字要素画像に分離した文字要素画像を統
合して文字画像を切り出す英文文字切出部と、前記第１
字種推定手段で和文と推定されたときには、前記基準文
字幅決定手段で決定された基準文字幅に従い文字要素画
像を統合し又は分離して文字画像を切り出す和文文字切
出部とを備えることとしているので、文字要素画像の外
接矩形の間隔の分布から文字列画像が英文であるか和文
であるかを推定して、文字画像を切り出すことができ
る。従って、英文と和文とが文字列画像単位で混在して
いても文字画像の切り出し精度が更に向上し、文字認識
の効率が高まる。Also, based on the distribution of the intervals of the circumscribed rectangles of the character element images of the character string image classified as a normal image by the character string state classifying means, it is estimated whether the character string image is English or Japanese. A first character type estimating unit that extracts a character image by integrating a character element image separated into a character element image when the first character type estimating unit estimates that the character is an English sentence An English character extracting section and the first
When the character type is estimated by the character type estimating means, a Japanese character extracting section is provided which integrates or separates character element images according to the reference character width determined by the reference character width determining means and cuts out the character image. Therefore, it is possible to estimate whether the character string image is an English sentence or a Japanese sentence from the distribution of the intervals of the circumscribed rectangles of the character element image, and cut out the character image. Therefore, even if English sentences and Japanese sentences are mixed in a character string image unit, the accuracy of extracting a character image is further improved, and the efficiency of character recognition is increased.

【００７０】また、前記第１字種推定手段は、隣接する
文字要素画像の外接矩形間の間隔のヒストグラムを作成
し、ヒストグラムの間隔の短い側から最初のピークの範
囲の最大値とその範囲に属する矩形間の間隔の数Ｎ３を
求める間隔数計数部と、前記間隔数計数部で求めた最初
のピークの範囲の最大値が上記第２長さＹに基づいて計
算される第３しきい値Ｄ以下で、かつ、上記間隔の数Ｎ
３の値と隣接する全矩形間の間隔の数Ｎ−１との比が予
め定めた定数Ｒ以上であるときに英文と、そうでないと
きに和文と推定する英和文推定部とを有することとして
いるので、文字列画像が英文であるか和文であるかの判
定の精度が向上する。従って、文字画像の切り出しが正
確になる。Further, the first character type estimating means creates a histogram of the interval between circumscribed rectangles of adjacent character element images, and calculates the maximum value of the range of the first peak from the shorter side of the interval of the histogram and the maximum value. An interval number counting section for obtaining the number N3 of intervals between the belonging rectangles, and a third threshold value for calculating the maximum value of the range of the first peak obtained by the interval number counting section based on the second length Y D or less and the number of intervals N
An English-Japanese sentence estimating unit that estimates an English sentence when the ratio of the value of 3 to the number N−1 of intervals between all adjacent rectangles is equal to or greater than a predetermined constant R, and otherwise estimates the sentence as a Japanese sentence Therefore, the accuracy of determining whether the character string image is English or Japanese is improved. Therefore, the cutout of the character image becomes accurate.

【００７１】また、前記基準文字幅決定手段は、前記文
字列状態分類手段で異常な文字列画像に分類されたとき
に、文字列画像の外接矩形の第２長さＹを基準に第１基
準長さＡ１を計算する第１基準文字幅計算部と、ここ
で、Ｙは文字列画像の直角方向の長さであり、αは１に
近い所定の係数であり、前記文字列状態分類手段で正常
な文字列画像と分類され、かつ、前記英和文推定部で和
文と推定されたときに、文字要素画像の外接矩形の第１
長さのヒストグラムを作成し、最多頻度の幅が属するピ
ークの最大値を第２基準長さ幅Ａ２に決定する第２基準
文字幅決定部とを有し、前記和文文字切出部は、第１基
準長さＡ１又は第２基準長さＡ２に従い文字画像を切り
出すこととしているので、異常な状態の文字列画像でも
第１基準文字幅が求められ、正常な状態の文字列画像で
は、更に精度の高い第２基準文字幅が求められる。従っ
て、文字画像の切り出し精度が更に向上する。Further, when the character string state classifying means classifies the character string image into an abnormal character string image, the reference character width determining means sets the first character string image based on the second length Y of the circumscribed rectangle of the character string image. A first reference character width calculating unit for calculating the length A1, wherein Y is the length of the character string image in the perpendicular direction, α is a predetermined coefficient close to 1, and the character string state classifying means When it is classified as a normal character string image and is estimated as a Japanese sentence by the English-Japanese sentence estimation unit, the first circumscribed rectangle of the character element image
A second reference character width determining unit that creates a length histogram and determines the maximum value of the peak to which the most frequent width belongs as the second reference length width A2; Since the character image is cut out in accordance with the first reference length A1 or the second reference length A2, the first reference character width is obtained even in a character string image in an abnormal state, and the accuracy is further improved in a character string image in a normal state. Is required for the second reference character width. Therefore, the cutout accuracy of the character image is further improved.

【００７２】また、前記文字認識手段は、文字画像に相
当する文字コードを認識したとき、その認識の確からし
さを表す評価値を出力する評価値出力部を有し、評価値
を予め定めた値と比較して文字コードの認識が誤りか否
かを判定する認識誤り判定手段と、前記認識誤り判定手
段で誤りでないと判定された文字コードの文字総数Ｎ０
を計数し、和文専用の文字種に属する文字コードの文字
数Ｎ４を計数し、値Ｎ０に対する値Ｎ４の割合が所定値
以上のときその文字列画像を和文文字列と推定し、未満
のときに英文文字列と推定する第２字種推定手段と、前
記第２字種推定手段で和文文字列であるとされた文字列
画像中に単独で英文字とされた文字画像を切り出し誤り
と判定し、英文文字列であるとされた文字列画像中に和
文字とされた文字画像を切り出し誤りと判定する切り出
し誤り判定手段と、前記認識誤り判定手段又は切り出し
誤り判定手段で認識又は切り出しが誤りと判定された文
字画像を単独又は統合して、修正した文字画像を切り出
す修正文字画像切出手段と、前記修正文字画像切出手段
で切り出された文字画像を相当する文字コードとして再
認識する文字再認識手段とを備えることとしているの
で、文字認識の確からしさを表す評価値の出力により、
その認識誤りを判定でき、文字列画像をその文字種から
英文文字列か和文文字列かを推定し、各文字列に異なる
文字種と認識された文字画像があれば文字画像の切り出
し誤りと判定し、文字画像を修正して切り出し再認識で
きる。従って、文字画像の切り出し精度と認識精度が更
に向上することができる。Further, the character recognizing means has an evaluation value output section for outputting an evaluation value indicating the certainty of the recognition when the character code corresponding to the character image is recognized, and the evaluation value is set to a predetermined value. A recognition error judging means for judging whether or not the recognition of the character code is erroneous, and a total number of characters N0 of the character code judged to be non-error by the recognition error judging means.
Is counted, and the number of characters N4 of the character code belonging to the character type dedicated to Japanese text is counted. When the ratio of the value N4 to the value N0 is equal to or more than a predetermined value, the character string image is estimated as a Japanese character string, and when the ratio is less than the English character, A second character type estimating means for estimating the character string, and extracting a character image which is solely regarded as an English character from a character string image determined to be a Japanese character character string by the second character type estimating means, to determine an error, and A cutout error determination unit that determines a cutout error in a character image that is a Japanese character in a character string image that is determined to be a character string, and recognition or cutout is determined to be incorrect by the recognition error determination unit or the cutout error determination unit. Character image cutting means for cutting out a corrected character image by alone or integrating the corrected character images, and a character recognition device for re-recognizing the character image cut out by the corrected character image cutting means as a corresponding character code. Since the possible and means, the output of the evaluation value representing the likelihood of the character recognition,
The recognition error can be determined, the character string image is estimated from the character type as an English character string or a Japanese character string, and if there is a character image recognized as a different character type in each character string, it is determined as a character image cutout error, The character image can be corrected and cut out and re-recognized. Therefore, the cutout accuracy and the recognition accuracy of the character image can be further improved.

【００７３】また、前記修正文字画像切出手段は、和文
文字列の文字列画像であって、前記認識誤り手段又は切
り出し誤り判定手段で認識又は切り出しが誤りと判定さ
れた文字画像以外の文字画像の外接矩形の文字列画像に
平行な長さのヒストグラムを作成し、そのヒストグラム
の最多頻度の長さが属するピークの最大値を修正基準文
字幅Ａ３と決定する修正基準文字幅決定部と、前記修正
基準文字幅決定部で決定された修正基準文字幅Ａ３を基
準に前記認識誤り判定手段又は切り出し誤り判定手段で
認識又は切り出しが誤りと判定された文字画像の領域か
ら文字数を求め、その文字数に応じた文字画像を切り出
す修正切り出し部とを有することとしているので、文字
画像の認識が誤りと判定された文字画像を修正された基
準文字幅で求められた文字数分の文字画像として再切り
出しできる。従って、文字画像の切り出しの精度が更に
向上する。The modified character image extracting means is a character string image of a Japanese character string, and is a character image other than the character image whose recognition or extraction is determined to be incorrect by the recognition error means or the extraction error determining means. A correction reference character width determination unit that creates a histogram having a length parallel to the character string image of the circumscribed rectangle of the above and determines the maximum value of the peak to which the most frequent length of the histogram belongs as the correction reference character width A3; The number of characters is determined from the area of the character image where the recognition or cutout is determined to be incorrect by the recognition error determination means or the cutout error determination means based on the correction reference character width A3 determined by the correction reference character width determination unit. And a modified cut-out unit for cutting out the character image corresponding to the character image. And it can be re-cut as the number of characters of the character image. Therefore, the accuracy of cutting out the character image is further improved.

【００７４】また、前記文字再認識手段は、前記修正文
字画像切出手段で切り出された文字画像を相当する文字
コードとして再認識する場合に、英文文字列の文字列画
像のときには、文字画像の特徴を認識辞書の英文専用文
字種の標準特徴とのみ照合し、和文文字列の文字列画像
のときには、文字画像の特徴を認識辞書の和文専用文字
種の標準特徴とのみ照合することを特徴としているの
で、英文文字列の文字画像の認識には英文字専用の文字
種を、和文文字列の文字画像の認識には和文字専用の文
字種を照合対象とすることで認識精度を更に向上するこ
とができる。The character re-recognizing means re-recognizes the character image cut out by the corrected character image cutting-out means as a corresponding character code. Since the feature is compared only with the standard feature of the English-only character type of the recognition dictionary, and in the case of a character string image of a Japanese character string, the feature of the character image is compared only with the standard characteristic of the Japanese-only character type of the recognition dictionary. The recognition accuracy can be further improved by using a character type dedicated to English characters for recognition of a character image of an English character string and a character type dedicated to Japanese characters for recognition of a character image of a Japanese character string.

【００７５】また、文書画像から文字列画像とその外接
矩形の位置とを抽出する文字列画像抽出ステップと、上
記文字列画像から文字要素画像とその外接矩形の位置と
を抽出する文字要素画像抽出ステップと、上記文字要素
画像の外接矩形の文字列画像に平行な第１長さと、上記
文字列画像の文字列画像の直角方向の第２長さに基づい
て定められるしきい値で上記文字列画像の状態が正常か
異常かを分類する文字列状態分類ステップと、前記文字
列状態分類ステップで分類した上記文字列画像の状態に
よって文字列画像から文字画像を切り出す文字列画像に
平行な基準長さを決定する基準文字幅決定ステップと、
前記基準文字幅決定ステップで決定された基準長さに従
い文字要素画像を統合、分離して文字画像を切り出す文
字画像切出ステップと、切り出された文字画像の特徴と
認識辞書の標準特徴とを照合し、文字画像を相当する文
字コードとして認識する文字認識ステップとを有して実
行することとしているので、上記文字認識装置と同様の
効果が得られる。A character string image extracting step of extracting a character string image and a position of a circumscribed rectangle from the document image, and a character element image extracting step of extracting a character element image and a position of the circumscribed rectangle from the character string image The character string with a threshold value determined based on a step, a first length parallel to the character string image of the circumscribed rectangle of the character element image, and a second length in a direction perpendicular to the character string image of the character string image A character string state classifying step of classifying whether the state of the image is normal or abnormal, and a reference length parallel to the character string image that cuts out the character image from the character string image according to the state of the character string image classified in the character string state classification step. A reference character width determining step of determining the
Integrating and separating the character element images according to the reference length determined in the reference character width determining step to cut out the character image, and comparing the features of the cut-out character image with the standard features of the recognition dictionary. The character recognition step is performed with a character recognition step of recognizing a character image as a corresponding character code, so that the same effects as those of the character recognition device can be obtained.

【００７６】更に、文字認識方法が記録された記録媒体
であって、認識辞書と以下の特定ステップとを実行する
プログラムとが記録され、特定ステップには、文書画像
から文字列画像とその外接矩形の位置とを抽出する文字
列画像抽出ステップと、上記文字列画像から文字要素画
像とその外接矩形の位置とを抽出する文字要素画像抽出
ステップと、上記文字要素画像の外接矩形の文字列画像
に平行な第１長さと、上記文字列画像の文字列画像の直
角方向の第２長さに基づいて定められるしきい値で上記
文字列画像の状態が正常か異常かを分類する文字列状態
分類ステップと、前記文字列状態分類ステップで分類し
た上記文字列画像の状態によって文字列画像から文字画
像を切り出す文字列画像に平行な基準長さを決定する基
準文字幅決定ステップと、前記基準文字幅決定ステップ
で決定された基準長さに従い文字要素画像を統合、分離
して文字画像を切り出す文字画像切出ステップと、切り
出された文字画像の特徴と認識辞書の標準特徴とを照合
し、文字画像を相当する文字コードとして認識する文字
認識ステップとを含むこととしているので、文字認識機
能を有しない他のパーソナルコンピュータと相俟って、
上記文字認識装置と同様の効果が得られる。Further, the present invention is a recording medium on which a character recognition method is recorded, wherein a recognition dictionary and a program for executing the following specific steps are recorded, and the specific steps include a character string image from a document image and a circumscribed rectangle thereof. A character string image extracting step of extracting a character element image and a position of a circumscribed rectangle from the character string image, and a character string image of a circumscribed rectangle of the character element image. Character string state classification for classifying whether the state of the character string image is normal or abnormal based on a first parallel length and a threshold value determined based on a second length in a direction perpendicular to the character string image of the character string image And a reference character width determining step of determining a reference length parallel to the character string image for extracting a character image from the character string image based on the state of the character string image classified in the character string state classification step. A character image extracting step of integrating and separating character element images according to the reference length determined in the reference character width determining step, and extracting a character image; characteristics of the extracted character image and standard characteristics of the recognition dictionary. And a character recognition step of recognizing a character image as a corresponding character code, so that, in combination with another personal computer having no character recognition function,
The same effects as those of the above character recognition device can be obtained.

[Brief description of the drawings]

【図１】本発明に係る文字認識装置の一実施の形態の構
成図である。FIG. 1 is a configuration diagram of an embodiment of a character recognition device according to the present invention.

【図２】上記実施の形態の文字検出部で検出された文字
列画像を示す図である。FIG. 2 is a diagram illustrating a character string image detected by a character detection unit according to the embodiment.

【図３】上記実施の形態の文字要素検出部で検出された
文字要素画像を示す図である。FIG. 3 is a diagram illustrating a character element image detected by a character element detection unit according to the embodiment.

【図４】上記実施の形態の第１字種推定部で作成された
文字要素画像の外接矩形間の間隔のヒストグラムであ
る。FIG. 4 is a histogram of intervals between circumscribed rectangles of a character element image created by a first character type estimating unit of the embodiment.

【図５】上記実施の形態の和文基準文字幅決定部で作成
された文字要素画像の外接矩形の幅のヒストグラムであ
る。FIG. 5 is a histogram of the width of a circumscribed rectangle of a character element image created by the Japanese sentence reference character width determination unit of the embodiment.

【図６】上記実施の形態の英文文字切り出し部と和文文
字切り出し部とで切り出された文字画像と文字認識部で
最初に認識された文字とその評価値との説明図である。FIG. 6 is an explanatory diagram of a character image cut out by an English character cutout unit and a Japanese character cutout unit in the above embodiment, a character first recognized by a character recognition unit, and an evaluation value thereof.

【図７】上記実施の形態の基準文字修正部で作成された
文字切り出しの正しい文字画像の外接矩形の幅のヒスト
グラムである。FIG. 7 is a histogram of the width of a circumscribed rectangle of a character image with correct character cutout created by the reference character correction unit of the embodiment.

【図８】上記実施の形態の修正文字切り出し部で切り出
された文字画像と文字認識部で最終的に認識された文字
とその評価値との説明図である。FIG. 8 is an explanatory diagram of a character image cut out by a corrected character cutout unit according to the embodiment, a character finally recognized by a character recognition unit, and its evaluation value.

【図９】上記実施の形態の動作を説明するフローチャー
トである。FIG. 9 is a flowchart illustrating the operation of the embodiment.

【図１０】上記実施の形態の動作を説明するフローチャ
ートである。FIG. 10 is a flowchart illustrating an operation of the embodiment.

【図１１】上記実施の形態の動作を説明するフローチャ
ートである。FIG. 11 is a flowchart illustrating an operation of the embodiment.

【図１２】上記実施の形態の動作を説明するフローチャ
ートである。FIG. 12 is a flowchart illustrating an operation of the embodiment.

【図１３】上記実施の形態で説明した文字認識方法を記
録した記録媒体の説明図である。FIG. 13 is an explanatory diagram of a recording medium on which the character recognition method described in the above embodiment is recorded.

【図１４】上記記録媒体のコンピュータシステムへの装
着を説明する図である。FIG. 14 is a diagram for explaining mounting of the recording medium on a computer system.

【図１５】従来の文字認識装置の構成図である。FIG. 15 is a configuration diagram of a conventional character recognition device.

【図１６】従来の文字認識装置での文字切り出しを説明
する図である。FIG. 16 is a diagram illustrating character segmentation in a conventional character recognition device.

[Explanation of symbols]

１０１画像入力部１０２文字列検出部１０３文字要素検出部１０４文字列状態推定部１０５第１字種推定部１０６英文文字切り出し部１０７和文基準文字幅決定部１０８和文文字切り出し部１０９文字認識部１１０第１文字切り出し誤り判定部１１１第２字種推定部１１２第２文字切り出し誤り判定部１１３基準文字幅修正部１１４修正文字切り出し部１１５認識結果出力部１３０１フロッピーディスク１４０１フロッピーディスクドライブ１４０３コンピュータシステム Reference Signs List 101 Image input unit 102 Character string detecting unit 103 Character element detecting unit 104 Character string state estimating unit 105 First character type estimating unit 106 English character extracting unit 107 Japanese standard character width determining unit 108 Japanese character extracting unit 109 Character recognizing unit 110 1 character extraction error determination unit 111 second character type estimation unit 112 second character extraction error determination unit 113 reference character width correction unit 114 modified character extraction unit 115 recognition result output unit 1301 floppy disk 1401 floppy disk drive 1403 computer system

Claims

[Claims]

1. A character string image extracting means for extracting a character string image and a position of a circumscribed rectangle from a document image, and a character element image extraction for extracting a character element image and a position of a circumscribed rectangle from the character string image Means, and a first parallel to a character string image of a circumscribed rectangle of the character element image
A length and a second perpendicular to the character string image of the character string image
A character string state classifying means for classifying whether the state of the character string image is normal or abnormal with a threshold value determined based on the length, and a character string according to the state of the character string image classified by the character string state classifying means A reference character width determining unit that determines a reference length parallel to a character string image that cuts out a character image from an image; and a character image that integrates and separates character element images according to the reference length determined by the reference character width determining unit. A character image extracting means for extracting a character image, and a character recognizing means for recognizing a character image as a corresponding character code by comparing a characteristic of the extracted character image with a standard characteristic of a recognition dictionary. apparatus.

2. The character string state classifying means includes: a total number counting unit that counts the total number N of character element images in units of character string images; and a second unit that calculates a first length of each character element image in units of character string images. A first length calculation unit, a first threshold value W1 is calculated based on a second length Y of a circumscribed rectangle of the character string image, and a circumscribed rectangle of each character element image calculated by the first length calculation unit Is compared with the first threshold value W1, the number N1 of narrow character elements whose first length is equal to or less than the first threshold value W1 is counted, and the value of N1 / N is a predetermined ratio. A blurred image determination unit that determines a character string image as an abnormal blurred image when the value is equal to or greater than R1, a second threshold value W2 is calculated based on a second length Y of a circumscribed rectangle of the character string image, and the first length is calculated. The first length of the circumscribed rectangle of each character element image calculated by the length calculation unit is compared with a second threshold value W2, and the first length is determined by a second threshold. A contact image determination unit that counts the number N2 of the bold character elements having the value W2 or more, and determines that the character string image is an image having many abnormal inter-character contacts when the value of N2 / N is a predetermined ratio R2 or more; 2. The character recognition device according to claim 1, further comprising a normal image determination unit that determines a character string image as a normal image when the blurred image determination unit or the contact image determination unit does not determine an abnormal image.

3. A method for estimating whether a character string image is English or Japanese based on a distribution of intervals between circumscribed rectangles of character element images of a character string image classified as a normal image by the character string state classifying means. A first character type estimating unit that extracts a character image by integrating a character element image separated into a character element image when the first character type estimating unit estimates that the character is an English sentence An English character extracting section, and when the first character type estimating section estimates a Japanese character, integrates or separates the character element images according to the reference character width determined by the reference character width determining section and cuts out the character image. The character recognition device according to claim 1, further comprising a Japanese character extraction unit.

4. The first character type estimating means creates a histogram of the interval between circumscribed rectangles of adjacent character element images, and calculates the maximum value of the range of the first peak from the shorter side of the interval of the histogram and the range thereof. An interval number counting unit for obtaining the number N3 of intervals between the rectangles belonging to the third threshold value for calculating the maximum value of the range of the first peak obtained by the interval number counting unit based on the second length Y D
If the ratio between the value of the number N3 of the intervals and the number N-1 of the intervals between all adjacent rectangles is equal to or greater than a predetermined constant R, it is estimated that the sentence is an English sentence. The character recognition device according to claim 3, further comprising an English-Japanese sentence estimating unit.

5. The reference character width determining means, when classified into an abnormal character string image by the character string state classification means, based on a second length Y of a circumscribed rectangle of the character string image. A first reference character width calculating unit for calculating the length A1, wherein Y is the length of the character string image in the perpendicular direction, α is a predetermined coefficient close to 1, and When the image is classified as a normal character string image and is estimated as a Japanese sentence by the English-Japanese sentence estimation unit, a histogram of a first length of a circumscribed rectangle of the character element image is created, and a peak to which the most frequent width belongs is created. Second maximum
A second reference character width determining unit that determines a reference length width A2, wherein the Japanese character cutout unit cuts out a character image according to the first reference length A1 or the second reference length A2. The character recognition device according to claim 4, wherein

6. The character recognizing means, when recognizing a character code corresponding to a character image, has an evaluation value output unit for outputting an evaluation value representing the likelihood of the recognition, wherein the evaluation value is a predetermined value. A recognition error judging unit for judging whether or not the recognition of the character code is an error by comparing the number of characters N0 of the character code judged to be no error by the recognition error judging unit, and belonging to a character type dedicated to Japanese text The number of characters N4 of the character code is counted, and when the ratio of the value N4 to the value N0 is a predetermined value or more, the character string image is estimated as a Japanese character string, and when the ratio is less than the predetermined value, the character string image is estimated as an English character string.
The character type estimating means, and a character image that is solely regarded as an English character in the character string image determined to be a Japanese character string by the second character type estimating means is determined to be an error, and is determined to be an English character string. A cut-out error determining unit that determines a cut-out error in a character image that is a Japanese character in the extracted character string image, and a character image whose recognition or cut-out is determined to be incorrect by the recognition error determining unit or the cut-out error determining unit, alone or It is characterized by comprising a corrected character image cutout unit that cuts out a corrected character image by integrating, and a character re-recognition unit that re-recognizes the character image cut out by the corrected character image cutout unit as a corresponding character code. The character recognition device according to claim 1, 2, 3, 4, or 5.

7. The modified character image cutout means is a character string image of a Japanese character string, and is a character image other than the character image whose recognition or cutout is determined to be incorrect by the recognition error means or cutout error determination means. A modified reference character width determination unit that creates a histogram having a length parallel to the character string image of the circumscribed rectangle, and determines the maximum value of the peak to which the most frequent length of the histogram belongs as the modified reference character width A3; The number of characters is determined from the area of the character image where the recognition or cutout is determined to be incorrect by the recognition error determination means or the cutout error determination means based on the correction reference character width A3 determined by the correction reference character width determination unit. 7. The character recognition device according to claim 6, further comprising a correction cutout unit that cuts out a corresponding character image.

8. The character re-recognition unit, when re-recognizing a character image cut out by the corrected character image cutting-out unit as a corresponding character code, when the character image is an English character string, The feature is compared only with the standard feature of the English-only character type of the recognition dictionary, and in the case of a character string image of a Japanese character string, the feature of the character image is compared only with the standard characteristic of the Japanese-only character type of the recognition dictionary. Item 7. The character recognition device according to Item 7.

9. A character string image extracting step of extracting a character string image and a position of a circumscribed rectangle from a document image, and extracting a character element image and a position of a circumscribed rectangle from the character string image And a first step parallel to the character string image of the circumscribed rectangle of the character element image
The length and the second in the direction perpendicular to the character string image of the character string image
A character string state classification step of classifying whether the state of the character string image is normal or abnormal with a threshold value determined based on the length, and a character string according to the state of the character string image classified in the character string state classification step A reference character width determining step of determining a reference length parallel to a character string image from which a character image is cut out of the image; and a character image integrating and separating the character element images according to the reference length determined in the reference character width determining step. And a character recognition step of collating a feature of the cut-out character image with a standard feature of a recognition dictionary and recognizing the character image as a corresponding character code. Character recognition method.

10. A recording medium on which a character recognition method is recorded, in which a recognition dictionary and a program for executing the following specific steps are recorded, and in the specific steps, a character string image and a circumscribed rectangle thereof are converted from a document image. A character string image extracting step of extracting a character element image and a position of a circumscribed rectangle from the character string image; and a character string image of a circumscribed rectangle of the character element image. Parallel first
The length and the second in the direction perpendicular to the character string image of the character string image
A character string state classification step of classifying whether the state of the character string image is normal or abnormal with a threshold value determined based on the length, and a character string according to the state of the character string image classified in the character string state classification step A reference character width determining step of determining a reference length parallel to a character string image from which a character image is cut out of the image; and a character image integrating and separating the character element images according to the reference length determined in the reference character width determining step. A character image extracting step of extracting a character image, and a character recognition step of collating a characteristic of the extracted character image with a standard characteristic of a recognition dictionary and recognizing the character image as a corresponding character code. .