JPH06348911A

JPH06348911A - English character recognition device

Info

Publication number: JPH06348911A
Application number: JP5135785A
Authority: JP
Inventors: Michiaki Nobuoka; 道明信岡
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1993-06-07
Filing date: 1993-06-07
Publication date: 1994-12-22

Abstract

PURPOSE: To provide an English character recognition device that improves recognition accuracy and shortens processing time by automatically determining the style of the document to be recognized. CONSTITUTION: The device comprises a circumscribed rectangle detection unit 3 that detects circumscribed rectangles in binary image data; a line rectangle cutout unit 4 that cuts out line rectangles from the distribution of the circumscribed rectangles; a character rectangle cutout unit 5 that cuts out character rectangles by integrating the circumscribed rectangles on the basis of the line rectangles and the like; a graphic feature extraction unit 6 that extracts graphic features from each character rectangle; a character recognition unit 8 that recognizes each character by comparing its graphic features with a recognition dictionary 7 and stores the result in a recognition result storage unit 9; a document style determination unit 14 that determines the style of the target document from the distance between the center lines of adjacent character rectangles; a word detection unit 15 that detects words from this determination result and the intervals between character rectangles; and a misrecognized character correction unit 12 that compares each detected word with a word dictionary 11 and corrects the recognition result in the recognition result storage unit 9.

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates to an English character recognition device capable of recognizing English characters such as the printed type of newspapers and novels and dot-matrix characters.

[0002]

[Prior Art] In recent years, with the arrival of an internationalized society, opportunities to consult documents written in alphabets such as those of English and German (hereinafter called English characters) have increased, and documents are being digitized to make that work more efficient. English character recognition devices have come to be used in place of the keyboard as the input device to the computer for this purpose.

[0003] A conventional English character recognition device is described below. FIG. 6 is a functional block diagram of a conventional English character recognition device. In FIG. 6, 1 is an image input unit that takes in the document to be recognized as an image and outputs binary image data; 2 is an image storage unit that stores the binary image data output from the image input unit 1; 3 is a circumscribed rectangle detection unit that detects, in the binary image data stored in the image storage unit 2, the circumscribed rectangles of runs of connected black pixels; 4 is a line rectangle cutout unit that detects as lines the portions where the distribution of the horizontal-direction positions of the circumscribed rectangles detected by the circumscribed rectangle detection unit 3 is high, and cuts out a line rectangle circumscribing each line; 5 is a character rectangle cutout unit that integrates the circumscribed rectangles detected by the circumscribed rectangle detection unit 3 on the basis of the line rectangles cut out by the line rectangle cutout unit 4, and cuts out a character rectangle circumscribing each single character in the binary image data; 6 is a graphic feature extraction unit that extracts, as a graphic feature, the distribution of black pixels within each character rectangle cut out by the character rectangle cutout unit 5; 7 is a recognition dictionary that stores the graphic features of all characters to be recognized; 8 is a character recognition unit that compares the graphic feature of each character rectangle extracted by the graphic feature extraction unit 6 with the graphic features of the characters to be recognized in the recognition dictionary 7, and outputs as the recognition result a character code, such as an ASCII code, corresponding to the character with the highest similarity; 9 is a recognition result storage unit that stores the recognition results output from the character recognition unit 8; 10 is a word detection unit that detects the individual words in the binary image data on the basis of the horizontal intervals between the character rectangles cut out by the character rectangle cutout unit 5; 11 is a word dictionary that stores the correct spelling of all words; 12 is a misrecognized character correction unit that checks, by referring to the word dictionary 11, whether each word detected by the word detection unit 10 is spelled correctly, judges that a misspelled word contains a misrecognized character, and corrects that character in the recognition result storage unit 9; and 13 is a recognition result output unit that outputs the recognition results stored in the recognition result storage unit 9 after correction by the misrecognized character correction unit 12.
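As a rough illustration of what the circumscribed rectangle detection unit 3 does, the sketch below finds the bounding rectangle of each group of connected black pixels in a small binary image. The patent does not state the connectivity rule or the data layout, so 8-connectivity, the list-of-lists image, the function name `circumscribed_rectangles`, and the `(left, top, right, bottom)` tuple are all assumptions made for illustration.

```python
from collections import deque


def circumscribed_rectangles(image):
    """Return one (left, top, right, bottom) rectangle, with inclusive
    coordinates, per group of 8-connected black pixels.

    image is a list of rows; each row is a list of 0 (white) or 1 (black).
    The layout and the connectivity rule are assumptions, not from the patent.
    """
    height = len(image)
    width = len(image[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    rects = []
    for y in range(height):
        for x in range(width):
            if image[y][x] != 1 or seen[y][x]:
                continue
            # Breadth-first flood fill from this unvisited black pixel.
            left = right = x
            top = bottom = y
            queue = deque([(x, y)])
            seen[y][x] = True
            while queue:
                cx, cy = queue.popleft()
                left, right = min(left, cx), max(right, cx)
                top, bottom = min(top, cy), max(bottom, cy)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        nx, ny = cx + dx, cy + dy
                        if (0 <= nx < width and 0 <= ny < height
                                and image[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((nx, ny))
            rects.append((left, top, right, bottom))
    return rects


# Hypothetical 7x4 sample: an 'i'-like glyph (dot plus stem) and a 'c'-like glyph.
rows = [
    ".#...##",
    ".....#.",
    ".#...##",
    ".#.....",
]
image = [[1 if c == "#" else 0 for c in row] for row in rows]
rects = circumscribed_rectangles(image)
print(rects)  # → [(1, 0, 1, 0), (5, 0, 6, 2), (1, 2, 1, 3)]
```

In this sample the dot and the stem of the 'i'-like glyph come out as two separate rectangles, which suggests why a later integration step such as the character rectangle cutout unit 5 is needed before recognition.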

[0004] The operation of the conventional English character recognition device configured as above is described below. First, the image input unit 1 converts the document to be recognized into a binary image, inputs it, and stores it in the image storage unit 2. Next, the circumscribed rectangle detection unit 3 detects the circumscribed rectangles of runs of connected black pixels in the binary image data stored in the image storage unit 2. Next, the line rectangle cutout unit 4 obtains the distribution of the horizontal-direction positions of the circumscribed rectangles detected by the circumscribed rectangle detection unit 3, cuts out the portions where the distribution is high as lines, and detects a line rectangle circumscribing each line. Next, the character rectangle cutout unit 5 integrates the circumscribed rectangles detected by the circumscribed rectangle detection unit 3 on the basis of the sizes of the circumscribed rectangles, the distances between them, the line rectangles cut out by the line rectangle cutout unit 4, and the like, and cuts out a character rectangle circumscribing each single character in the binary image data. Next, the graphic feature extraction unit 6 extracts, as a graphic feature, the distribution of black pixels of the character image within each character rectangle cut out by the character rectangle cutout unit 5. Next, the character recognition unit 8 compares the graphic feature of the character image in each character rectangle, extracted by the graphic feature extraction unit 6, with the graphic features of the characters to be recognized stored in the recognition dictionary 7, outputs as the recognition result a character code, such as an ASCII code, corresponding to the character with the highest similarity, and stores it in the recognition result storage unit 9. Next, the word detection unit 10 detects the individual words by treating each place where the horizontal interval between the character rectangles cut out by the character rectangle cutout unit 5 exceeds a word-break threshold as a word break. Next, the misrecognized character correction unit 12 checks, by referring to the word dictionary 11, whether each word detected by the word detection unit 10 is spelled correctly; when, in a word judged to be misspelled, the recognition confidence of the character suspected of being misrecognized was low, it corrects that character to the correct character inferred from the word dictionary 11. Finally, the recognition result output unit 13 outputs the recognition results stored in the recognition result storage unit 9 as corrected by the misrecognized character correction unit 12.
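The correction step of the misrecognized character correction unit 12 can be sketched as follows. The text only says that a low-confidence character in a misspelled word is replaced by the character inferred from the word dictionary; it does not say how the inference works. The sketch assumes a single-character substitution against a dictionary word of the same length, a confidence scale of 0 to 1, and a hypothetical cut-off `min_confidence`. The function name `correct_word` is likewise illustrative.

```python
def correct_word(chars, confidences, dictionary, min_confidence=0.8):
    """Return the recognized word, corrected when exactly one low-confidence
    character separates it from a dictionary word of the same length.

    chars        recognized characters of one detected word
    confidences  per-character recognition confidence (assumed range 0..1)
    dictionary   set of correctly spelled words
    """
    word = "".join(chars)
    if word in dictionary:
        return word  # spelled correctly, nothing to correct
    # sorted() only makes the choice deterministic when several words qualify.
    for candidate in sorted(dictionary):
        if len(candidate) != len(word):
            continue
        diffs = [i for i, (a, b) in enumerate(zip(word, candidate)) if a != b]
        # Correct only when the single differing character was recognized
        # with low confidence, mirroring the condition described above.
        if len(diffs) == 1 and confidences[diffs[0]] < min_confidence:
            return candidate
    return word  # no safe correction found; keep the recognition result


dictionary = {"advice", "device", "devise"}
print(correct_word("dev1ce", [0.9, 0.9, 0.9, 0.3, 0.9, 0.9], dictionary))  # → device
print(correct_word("dev1ce", [0.9] * 6, dictionary))                       # → dev1ce
```

Leaving a confidently recognized character untouched keeps the dictionary from overwriting text that is merely absent from it, such as proper nouns.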

[0005]

[Problem to Be Solved by the Invention] In the conventional configuration described above, however, English documents to be recognized come in two styles, mono-space and proportional, and the intervals between character circumscribed rectangles differ between mono-space documents and proportional documents. The word detection unit may therefore detect word breaks incorrectly, which lowers recognition accuracy and makes the device unreliable, and the processing time for word detection increases, which makes the device less efficient to work with.

[0006] The present invention solves the above problems of the prior art. Its object is to provide an English character recognition device that automatically determines whether the style of the document to be recognized is mono-space or proportional, and that thereby improves recognition accuracy for better reliability and shortens processing time for better workability.

[0007]

[Means for Solving the Problem] To achieve this object, the English character recognition device of the present invention comprises: an image storage unit that stores binary image data obtained from a document to be recognized; a circumscribed rectangle detection unit that detects the circumscribed rectangles of runs of connected black pixels in the binary image data stored in the image storage unit; a line rectangle cutout unit that detects as lines the portions where the distribution of the horizontal-direction positions of the circumscribed rectangles detected by the circumscribed rectangle detection unit is high, and cuts out line rectangles and the like circumscribing each line; a character rectangle cutout unit that integrates the circumscribed rectangles detected by the circumscribed rectangle detection unit on the basis of the line rectangles and the like cut out by the line rectangle cutout unit, and cuts out a character rectangle circumscribing one character; a graphic feature extraction unit that extracts, as a graphic feature, the distribution of black pixels within each character rectangle cut out by the character rectangle cutout unit; a recognition dictionary that stores the graphic features of all characters to be recognized; a character recognition unit that performs character recognition by comparing the graphic feature of each character rectangle extracted by the graphic feature extraction unit with the graphic features of the characters to be recognized in the recognition dictionary; a recognition result storage unit that stores the recognition results of the character recognition unit; a document style determination unit that determines, from the distance between the center lines of adjacent character rectangles among the character rectangles cut out by the character rectangle cutout unit, whether the style of the document to be recognized is mono-space or proportional; a word detection unit that detects individual words on the basis of the determination result of the document style determination unit and the horizontal intervals and the like between the character rectangles cut out by the character rectangle cutout unit; a word dictionary that stores the correct spelling of all words; and a misrecognized character correction unit that compares the words detected by the word detection unit with the word dictionary, detects misrecognized characters, and corrects the recognition results in the recognition result storage unit.

[0008]

[Operation] With this configuration, the document style determination unit automatically determines the style of the document to be recognized on the basis of the distance between the center lines of adjacent character rectangles among the character rectangles cut out by the character rectangle cutout unit, and the word detection unit detects individual words according to the determined style. Words can therefore be detected accurately, which improves recognition accuracy and shortens processing time.

[0009]

[Embodiment] An English character recognition device according to one embodiment of the present invention is described below with reference to the drawings. FIG. 1 is a functional block diagram of the English character recognition device according to one embodiment of the present invention, and FIG. 2 is a device block diagram of the same device. 1 is an image input unit, 2 is an image storage unit, 3 is a circumscribed rectangle detection unit, 4 is a line rectangle cutout unit, 5 is a character rectangle cutout unit, 6 is a graphic feature extraction unit, 7 is a recognition dictionary, 8 is a character recognition unit, 9 is a recognition result storage unit, 11 is a word dictionary, 12 is a misrecognized character correction unit, and 13 is a recognition result output unit; these are the same as in the conventional example, so they are given the same reference numerals and their description is omitted. 14 is a document style determination unit that determines whether the style of the document to be recognized is mono-space or proportional on the basis of the interval between the center lines of adjacent character rectangles among the character rectangles cut out by the character rectangle cutout unit 5. 15 is a word detection unit that detects individual words according to the determination result of the document style determination unit 14: for a mono-space document it treats places where the distance between the center lines of adjacent character rectangles is large as word breaks, and for a proportional document it treats places where the space between adjacent character rectangles is large as word breaks. In FIG. 2, 16 is a central processing unit (hereinafter CPU) that controls the entire English character recognition device; 17 is a read-only memory (hereinafter ROM) having a control program area 18 that stores the program containing the instructions for the CPU 16, a recognition dictionary area 19 that stores the recognition dictionary 7, and a word dictionary area 20 that stores the word dictionary 11; 21 is a random access memory (hereinafter RAM) having a binary image data area 22 that constitutes the image storage unit 2, a line rectangle data area 23 that stores the data of the line rectangles cut out by the line rectangle cutout unit 4, a character rectangle data area 24 that stores the data of the character rectangles cut out by the character rectangle cutout unit 5, a word data area 25 that stores the data of the individual words detected by the word detection unit 15, and a recognition result data area 26 that constitutes the recognition result storage unit 9; 27 is a scanner that constitutes the image input unit 1 and scans the document to be recognized to obtain binary image data; 28 is a keyboard for giving the English character recognition device instructions such as the instruction to start character recognition; 29 is an output device that constitutes the recognition result output unit 13 and outputs the recognition results; and 30 is a bus that connects the above devices.

[0010] The operation of the English character recognition device according to one embodiment of the present invention, configured as above, is described below. FIG. 3 is an overall flowchart of the English character recognition device according to one embodiment of the present invention, FIG. 4 is a detailed flowchart of its document style determination process, FIG. 5(a) shows an example of a proportional document, and FIG. 5(b) shows an example of a mono-space document. d1 and d2 are character rectangle center-line distances, that is, the distances between the center lines of adjacent character rectangles.

[0011] First, the image input unit 1 converts the document to be recognized into a binary image, inputs it, and stores it in the image storage unit 2 (S1). Next, the circumscribed rectangle detection unit 3 obtains the circumscribed rectangles of runs of connected black pixels in the binary image data stored in the image storage unit 2 (S2). Next, the line rectangle cutout unit 4 obtains the distribution of the horizontal-direction positions of the circumscribed rectangles detected in S2, cuts out the portions where the distribution is high as lines, detects a line rectangle circumscribing each line, and stores the data in the line rectangle data area 23 (S3). Next, the character rectangle cutout unit 5 integrates the circumscribed rectangles on the basis of the line rectangles detected in S3 and the sizes of, and distances between, the circumscribed rectangles detected in S2, cuts out a character rectangle circumscribing each single character in the binary image data, and stores it in the character rectangle data area 24 (S4). Next, the graphic feature extraction unit 6 extracts the character image for one character from the binary image data in the image storage unit 2, using the position coordinates of the character rectangle stored in the character rectangle data area 24, and obtains the distribution of black pixels of this character image to extract its graphic feature (S5). Next, the character recognition unit 8 compares, in the same way as in the conventional example, the graphic feature of the character image extracted in S5 with the graphic features of the characters to be recognized in the recognition dictionary 7, outputs as the recognition result a character code, such as an ASCII code, corresponding to the character with the highest similarity, and stores it in the recognition result storage unit 9 (S6). Next, the document style determination unit 14 determines, on the basis of the positional relationship of the character rectangles cut out in S4, whether the style of the document to be recognized is mono-space or proportional (S7). This step is described in detail later. Next, the word detection unit 15 detects the individual words and stores them in the word data area 25 (S8): when the style determined in S7 is mono-space, it treats places where the distance between the center lines of adjacent character rectangles is large as word breaks; when the style is proportional, it treats places where the horizontal space between adjacent character rectangles is large as word breaks. Next, the misrecognized character correction unit 12 checks, by referring to the word dictionary 11, whether the words detected in S8 are spelled correctly; when, in a word judged to be misspelled, the recognition confidence determined in S6 for the character suspected of being misrecognized was low, it corrects that character to the correct character inferred from the word dictionary 11 (S9). Finally, the recognition result output unit 13 outputs, from the output device 29, the recognition results stored in the recognition result storage unit 9 as corrected in S9, and all processing ends (S10).
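The style-dependent word detection of S8 can be sketched as below. The text says only that a "large" center-line distance (mono-space) or a "large" horizontal space (proportional) marks a word break; how large is not specified. The sketch therefore assumes that the smallest center-line distance in the line approximates the character pitch and that a break is any distance above `pitch_factor` times that pitch, and it uses a fixed pixel `gap_threshold` for proportional text. Both parameters, the `(left, right)` rectangle layout, and the function name `detect_words` are hypothetical.

```python
def detect_words(rects, style, pitch_factor=1.5, gap_threshold=4):
    """Group the character rectangles of one line into words.

    rects  list of (left, right) x-extents, ordered left to right
    style  "mono-space" or "proportional" (the result of S7)
    Returns a list of words, each a list of indices into rects.
    """
    if not rects:
        return []
    if style == "mono-space":
        # Compare distances between the center lines of adjacent rectangles.
        centers = [(left + right) / 2 for left, right in rects]
        dists = [b - a for a, b in zip(centers, centers[1:])]
        pitch = min(dists) if dists else 0  # assumed estimate of the pitch
        is_break = [d > pitch_factor * pitch for d in dists]
    else:
        # Compare the empty space between adjacent rectangles.
        gaps = [nxt[0] - cur[1] for cur, nxt in zip(rects, rects[1:])]
        is_break = [g > gap_threshold for g in gaps]
    words = [[0]]
    for index, brk in enumerate(is_break, start=1):
        if brk:
            words.append([index])
        else:
            words[-1].append(index)
    return words


# Hypothetical mono-space line with a 20-pixel pitch: "ilm im".
mono = [(8, 12), (28, 32), (41, 59), (88, 92), (101, 119)]
print(detect_words(mono, "mono-space"))    # → [[0, 1, 2], [3, 4]]
print(detect_words(mono, "proportional"))  # → [[0], [1], [2], [3], [4]]
```

The second call applies the gap rule to the mono-space sample and splits every character into its own word, because narrow letters such as 'i' and 'l' leave wide gaps inside a fixed cell. This is the kind of false detection the style determination is meant to avoid.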

[0012] Next, the document style determination process in S7 is described in detail. In FIG. 4, the array variable Table[] is first initialized as a frequency distribution table (S11). Next, the variable Spcno is cleared to zero as a counter of the number of inter-character spaces (S12). Next, the distance between the center lines of adjacent character rectangles cut out in S4 is assigned to the variable d (S13). For example, between 'c' and 'i' in a proportional document such as that shown in FIG. 5(a), d1 is assigned to d, and likewise in a mono-space document such as that shown in FIG. 5(b), d2 is assigned to d. Next, the d assigned in S13 is divided by 4, and the quotient rounded to the nearest integer is assigned to the variable d' (S14). Next, the d'-th entry of the frequency distribution table (hereinafter Table[d']) is incremented by one (S15). Next, the inter-character space counter Spcno is incremented by one (S16). Next, it is checked whether all character rectangles have been processed (S17). If No, processing jumps to S13 to handle the next character rectangle. If Yes, the position x at which the frequency Table[x] in the frequency distribution table is largest is found and assigned to the variable maxno (S18). Next, Table[maxno], that is, the maximum frequency, is assigned to the variable max (S19). Next, it is checked whether the value of maxno is less than or equal to the touching-character consideration threshold (S20). If No, processing jumps to S23. If Yes, it is checked whether max divided by the inter-character space counter Spcno is greater than the style determination threshold (S21). If No, processing jumps to S25; if Yes, the style of the document to be recognized is determined to be mono-space and the document style determination process ends (S22). In S23, to allow for the possibility that adjacent characters in the document to be recognized touch each other, the value of Table[maxno × 2] is added to max. Next, it is checked whether max divided by the inter-character space counter Spcno is greater than the touching-character-adjusted style determination threshold (S24). If Yes, processing jumps to S22; if No, the style of the document to be recognized is determined to be proportional and the document style determination process ends (S25).
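Steps S11 to S25 can be transcribed fairly directly into code. The sketch below follows the flowchart as described, with the step numbers in comments. The patent gives no numeric values for the three thresholds, so the defaults here are placeholders chosen only so the example runs. The input layout (one list of center-line x-coordinates per text line), the returned strings, and the function name `judge_document_style` are likewise assumptions.

```python
from collections import Counter


def judge_document_style(lines, touch_threshold=8,
                         style_threshold=0.5, touch_style_threshold=0.5):
    """Return "mono-space" or "proportional" following S11 to S25.

    lines  list of text lines; each is a list of center-line x-coordinates
           of consecutive character rectangles, ordered left to right
    The threshold defaults are placeholders, not values from the patent.
    """
    table = Counter()                       # S11: frequency distribution table
    spcno = 0                               # S12: inter-character space counter
    for centers in lines:
        for left, right in zip(centers, centers[1:]):
            d = right - left                # S13: center-line distance
            d_quantized = int(d / 4 + 0.5)  # S14: divide by 4, round half up
            table[d_quantized] += 1         # S15
            spcno += 1                      # S16
    if spcno == 0:
        raise ValueError("need at least two character rectangles in a line")
    maxno = max(table, key=table.get)       # S18: position of the peak
    peak = table[maxno]                     # S19: "max" in the text
    if maxno <= touch_threshold:            # S20: Yes
        if peak / spcno > style_threshold:  # S21
            return "mono-space"             # S22
        return "proportional"               # S25
    peak += table[maxno * 2]                # S23: allow for touching characters
    if peak / spcno > touch_style_threshold:  # S24
        return "mono-space"                 # S22
    return "proportional"                   # S25


# Hypothetical center positions: fixed 20-pixel pitch with one word space,
# and irregular proportional spacing.
mono_line = [0, 20, 40, 60, 80, 120, 140, 160]
prop_line = [0, 9, 25, 33, 52, 80, 91, 115]
print(judge_document_style([mono_line]))  # → mono-space
print(judge_document_style([prop_line]))  # → proportional
```

In the mono-space sample six of the seven quantized distances fall in the same bin, so the peak share is well above the placeholder threshold. In the proportional sample the distances spread over six bins and the peak share stays low. Dividing by 4 in S14 coarsens the bins so that small pixel jitter does not split a single pitch across neighboring entries.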

[0013]

[Effects of the Invention] As described above, in the present invention the document style determination unit automatically determines the style of the document to be recognized on the basis of the distance between the center lines of adjacent character rectangles among the character rectangles cut out by the character rectangle cutout unit, and the word detection unit detects individual words according to the determined style. Words can therefore be detected accurately, so the invention realizes an English character recognition device that improves recognition accuracy for better reliability and shortens processing time for better workability.

[Brief description of drawings]

FIG. 1 is a functional block diagram of an English character recognition device according to an embodiment of the present invention.

FIG. 2 is a device block diagram of an English character recognition device according to an embodiment of the present invention.

FIG. 3 is an overall flowchart of an English character recognition device according to an embodiment of the present invention.

FIG. 4 is a detailed flowchart of the document style determination process of the English character recognition device according to an embodiment of the present invention.

FIG. 5(a) is a diagram showing an example of a proportional document. FIG. 5(b) is a diagram showing an example of a mono-space document.

FIG. 6 is a functional block diagram of a conventional English character recognition device.

[Explanation of symbols]

1: image input unit
2: image storage unit
3: circumscribed rectangle detection unit
4: line rectangle cutout unit
5: character rectangle cutout unit
6: graphic feature extraction unit
7: recognition dictionary
8: character recognition unit
9: recognition result storage unit
10: word detection unit
11: word dictionary
12: misrecognized character correction unit
13: recognition result output unit
14: document style determination unit
15: word detection unit
16: central processing unit (CPU)
17: read-only memory (ROM)
18: control program area
19: recognition dictionary area
20: word dictionary area
21: random access memory (RAM)
22: binary image data area
23: line rectangle data area
24: character rectangle data area
25: word data area
26: recognition result data area
27: scanner
28: keyboard
29: output device
30: bus

Claims

[Claims]

1. An English character recognition device characterized by comprising: an image storage unit that stores binary image data obtained from a document to be recognized; a circumscribed rectangle detection unit that detects the circumscribed rectangles of runs of connected black pixels in the binary image data stored in the image storage unit; a line rectangle cutout unit that detects as lines the portions where the distribution of the horizontal-direction positions of the circumscribed rectangles detected by the circumscribed rectangle detection unit is high, and cuts out a line rectangle circumscribing each line; a character rectangle cutout unit that integrates the circumscribed rectangles detected by the circumscribed rectangle detection unit on the basis of the line rectangles cut out by the line rectangle cutout unit, and cuts out a character rectangle circumscribing one character; a graphic feature extraction unit that extracts, as a graphic feature, the distribution of black pixels within each character rectangle cut out by the character rectangle cutout unit; a recognition dictionary that stores the graphic features of all characters to be recognized; a character recognition unit that performs character recognition by comparing the graphic feature of each character rectangle extracted by the graphic feature extraction unit with the graphic features of the characters to be recognized in the recognition dictionary; a recognition result storage unit that stores the recognition results of the character recognition unit; a document style determination unit that determines, from the distance between the center lines of adjacent character rectangles among the character rectangles cut out by the character rectangle cutout unit, whether the style of the document to be recognized is mono-space or proportional; a word detection unit that detects individual words on the basis of the determination result of the document style determination unit and the horizontal intervals between the character rectangles cut out by the character rectangle cutout unit; a word dictionary that stores the correct spelling of all words; and a misrecognized character correction unit that compares the words detected by the word detection unit with the word dictionary, detects misrecognized characters, and corrects the recognition results in the recognition result storage unit.