JP2000259765A

JP2000259765A - Character recognition device

Info

Publication number: JP2000259765A
Application number: JP11061909A
Authority: JP
Inventors: Takeshi Furuto; 健古戸
Original assignee: Sumitomo Electric Industries Ltd
Current assignee: Sumitomo Electric Industries Ltd
Priority date: 1999-03-09
Filing date: 1999-03-09
Publication date: 2000-09-22

Abstract

PROBLEM TO BE SOLVED: To improve the recognition rate without requiring a long processing time and a large-scale memory by inverting and outputting the color of a pixel of interest in an input image when a pixel matrix having the pixel of interest in the center satisfies a specific expression. SOLUTION: A preprocess part 2 performs a filter process while scanning the input image and sends an output image to a segmentation part 3 when finishing the filter process for the whole image or a desirable specified area. The segmentation part 3 segments a character image of a parts where characters are prsent from the whole preprocessed image and sends it to a recognition part 4. The recognition part 4 recognizes the character image as characters and outputs the result in the form of character codes. Here, the filter used by the preprocess part 2 inverts and outputs the color of the pixel of interest in the input image when the pixel matrix X (i, j) of three rows and three columns having the pixel of interest in the center satisfies a conditional expression, etc. In the expression, 0 represents a white pixel, 1 represents a black pixel, and * is an arbitrary pixel whose color is not important.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】この発明はＦＡＸ（ｉ，ｊ）
の標準モードのように縦横の解像度比が１：２で読みと
られた後に歪みを補正された画像を認識する文字認識装
置に関する。TECHNICAL FIELD The present invention relates to a facsimile (i, j)
And a character recognition device for recognizing an image in which distortion has been corrected after the image is read at a vertical / horizontal resolution ratio of 1: 2 as in the standard mode.

【０００２】[0002]

【従来の技術】ＦＡＸで通常用いられている標準モード
では、縦方向の解像度はおよそ１００dpi(dot per inc
h)、横方向の解像度は２００dpiで、縦横の解像度比が
１：２となる。したがって、ここで得られた白黒の画素
を等ピッチで並べると、図６のように縦方向に縮んだ画
像となる。このような画像は視覚的にも不自然であるか
ら、画像を出力する前に、情報の不足を補完して、画像
の縦横比が１：１になるように補正する必要がある。こ
の補正は各行を２度ずつ連続して描画することで近似的
に行われる。図７に補正の結果得られる画像を示す。2. Description of the Related Art In a standard mode usually used for FAX, a vertical resolution is approximately 100 dpi (dot per inc.).
h), the horizontal resolution is 200 dpi, and the vertical / horizontal resolution ratio is 1: 2. Therefore, if the obtained black and white pixels are arranged at an equal pitch, an image contracted in the vertical direction as shown in FIG. Since such an image is visually unnatural, it is necessary to compensate for the lack of information and output the image so that the image has an aspect ratio of 1: 1 before outputting the image. This correction is approximately performed by successively drawing each line twice. FIG. 7 shows an image obtained as a result of the correction.

【０００３】一方、文字認識のアルゴリズムは標準的な
字形に合わせて開発されているから、図６のように縮ん
だ文字を認識することはできない。したがってＦＡＸ画
像を認識対象とする場合も、図７のような補正された画
像を対象に文字認識を行う。文字認識の手法としては、
種々のアルゴリズムが考案されているが、例えば、「文
字認識における特徴量の次元数と変数変換に関する考
察」（若林他、信学技報ＰＲＵ９２−１１５、１９９３
年１月）、他で紹介されている、図形の特徴量を用いた
統計的文字認識手法は文字の変形に強く、認識率が優れ
ているため、広く用いられている。On the other hand, since a character recognition algorithm has been developed in accordance with a standard character shape, it is not possible to recognize a contracted character as shown in FIG. Therefore, even when a FAX image is to be recognized, character recognition is performed on the corrected image as shown in FIG. As a method of character recognition,
Various algorithms have been devised. For example, for example, “Consideration on Dimensionality and Variable Conversion of Feature in Character Recognition” (Wakabayashi et al., IEICE Technical Report PRU92-115, 1993)
Statistical character recognition method using the feature amount of a figure, which is introduced elsewhere, is widely used because it is strong in character deformation and has an excellent recognition rate.

【０００４】この統計的文字認識手法で文字の特徴量の
基本となるのが、方向指数ヒストグラムである。図８は
先述の文献（若林他）における方向指数ヒストグラムの
例である。図８の例では、切り出された１文字の画像で
ある図８(a)から、まず輪郭線画像を抽出し図８(b)を得
る。それを７×７の小ブロックに分割し、例えば図８
(c)のようにブロックにおける輪郭画像の隣接画素の連
結状態を、正逆の方向は問わず、０、４５、９０、１３
５度の４方向成分に分類して集計し、方向指数ヒストグ
ラムを求める。こうして各ブロック毎に得られた方向指
数ヒストグラムの全体としての分布である図８(d)が文
字の特徴量となる。この先述の文献の認識手法に限ら
ず、他の多くの認識手法でも隣接画素間の縦、横、斜め
各方向の連結成分の分布を重要な文字の特徴量の基本要
素としている。[0004] The direction index histogram is the basis of the character feature amount in this statistical character recognition method. FIG. 8 is an example of a direction index histogram in the aforementioned document (Wakabayashi et al.). In the example of FIG. 8, a contour image is first extracted from FIG. 8A, which is an image of a cut-out single character, to obtain FIG. 8B. It is divided into 7 × 7 small blocks.
As shown in (c), the connection state of the adjacent pixels of the contour image in the block is 0, 45, 90, 13 regardless of the forward or reverse direction.
The data is classified into five-degree four-direction components and tabulated, and a direction index histogram is obtained. FIG. 8D, which is the distribution of the direction index histogram obtained as a whole for each block in this manner, is the character feature amount. Not only the recognition method described in the above-mentioned document, but also in many other recognition methods, the distribution of connected components in the vertical, horizontal, and diagonal directions between adjacent pixels is used as a basic element of an important character feature amount.

【０００５】[0005]

【発明が解決しようとする課題】図７の縦横比を補正さ
れたＦＡＸ画像を見ると、画像の輪郭の斜め成分の多く
は、黒または白画素が縦方向に２画素連続し次の行で２
画素横に延びるパターンとなっている。これは、もとの
画像の各行を単純に２度ずつ重ねたため、本来ならば斜
め輪郭成分であるべきところがすべて縦または横の成分
となってしまったことによるもので、いわゆるガクガク
した画像になっているのである。視覚的にも、図７の画
像は滑らかさを欠いており不自然であるが、文字認識に
とってはさらに問題が深刻である。この画像では、本来
の文字画像の輪郭の連結成分の分布が大きく崩れてしま
っているために、統計的文字認識手法やその他多くの認
識手法を適用することができず、連結成分を参照しな
い、あまり認識率の良くない認識手法を適用せざるを得
ない。Looking at the FAX image in which the aspect ratio has been corrected in FIG. 7, most of the oblique components of the outline of the image are such that two black or white pixels continue in the vertical direction in the vertical direction, and 2
The pattern extends in the horizontal direction of the pixel. This is because each line of the original image is simply superimposed twice, so that all the parts that should have been oblique contour components have become vertical or horizontal components, resulting in a so-called jerky image. -ing Visually, the image of FIG. 7 lacks smoothness and is unnatural, but the problem is even more serious for character recognition. In this image, the distribution of the connected components of the contour of the original character image has been greatly distorted, so that the statistical character recognition method and many other recognition methods cannot be applied, and the connected components are not referred to. A recognition method with a poor recognition rate must be applied.

【０００６】画像の輪郭をなめらかにするための、一般
的なスムージング手法としては、例えば隣接画素の加重
平均をとる線形空間フィルタがあるが、これを適用する
と、計算時間がかかるだけでなく、文字の中の細かい部
分がつぶれてしまうため、かえって認識率が低下する。
本願発明は、ＦＡＸ（ｉ，ｊ）の標準モードのような画
像に対して、一般的に完成度が高く、認識率の優れてい
ることで定評のある統計的文字認識手法を適用するため
に、簡単な処理で画像の輪郭の連結成分の歪みを改善す
る前処理を提供することを課題とするものである。As a general smoothing method for smoothing the outline of an image, for example, there is a linear spatial filter that takes a weighted average of adjacent pixels. Since the small portion in the image is crushed, the recognition rate is rather lowered.
The present invention is to apply a statistical character recognition method which is generally well-established and has a high recognition rate to an image such as the standard mode of FAX (i, j). It is another object of the present invention to provide a pre-process for improving the distortion of a connected component of an outline of an image by a simple process.

【０００７】[0007]

【課題を解決するための手段】本願請求項１記載の文字
認識装置は、少なくとも画像入力部と、画像を修正する
前処理部と、画像から文字画像を切り出す切り出し部
と、切り出された文字画像を認識する文字認識部とを有
し、該前処理部は、画素Ｐｉ，ｊ（ただし、２≦ｉ≦Ｍ
−１、２≦ｊ≦Ｎ−１）を中心とする３行３列の画素行
列Ｘ（ｉ，ｊ）が、１は黒画素、０は白画素、＊は任意
の画素とするとき、下記の式１、式２、式３、式４のい
ずれかを満足すれば、Ｐｉ，ｊ＝１とすることを特徴と
している。According to a first aspect of the present invention, there is provided a character recognition apparatus including at least an image input unit, a preprocessing unit for correcting an image, a cutout unit for cutting out a character image from an image, and a cutout character image. And a character recognition unit for recognizing pixels Pi, j (where 2 ≦ i ≦ M
−1, 2 ≦ j ≦ N−1), a three-row, three-column pixel matrix X (i, j), where 1 is a black pixel, 0 is a white pixel, and * is any pixel, If any one of Expressions 1, 2, 3, and 4 is satisfied, Pi, j = 1 is set.

【０００８】[0008]

【数１】 (Equation 1)

【０００９】[0009]

【数２】 (Equation 2)

【００１０】[0010]

【数３】 (Equation 3)

【００１１】[0011]

【数４】 (Equation 4)

【００１２】本願請求項２記載の文字認識装置は、少な
くとも画像入力部と、画像を修正する前処理部と、画像
から文字画像を切り出す切り出し部と、切り出された文
字画像を認識する文字認識部とを有し、該前処理部は、
画素Ｐｉ，ｊ（ただし、２≦ｉ≦Ｍ−１、２≦ｊ≦Ｎ−
１）を中心とする３行３列の画素行列Ｘ（ｉ，ｊ）が、
１は黒画素、０は白画素、＊は任意の画素とするとき、
下記の式５、式６、式７、式８のいずれかを満足すれ
ば、Ｐｉ，ｊ＝１とすることを特徴としている。According to a second aspect of the present invention, at least an image input unit, a preprocessing unit for correcting an image, a cutout unit for cutting out a character image from an image, and a character recognition unit for recognizing the cutout character image And the pre-processing unit comprises:
Pixel Pi, j (where 2 ≦ i ≦ M−1, 2 ≦ j ≦ N−
A 3 × 3 pixel matrix X (i, j) centered on 1) is
When 1 is a black pixel, 0 is a white pixel, and * is an arbitrary pixel,
If any of the following Expressions 5, 6, 7, and 8 is satisfied, Pi, j = 1 is set.

【００１３】[0013]

【数５】 (Equation 5)

【００１４】[0014]

【数６】 (Equation 6)

【００１５】[0015]

【数７】 (Equation 7)

【００１６】[0016]

【数８】 (Equation 8)

【００１７】本願請求項３記載の文字認識装置は、少な
くとも画像入力部と、画像を修正する前処理部と、画像
から文字画像を切り出す切り出し部と、切り出された文
字画像を認識する文字認識部とを有し、該前処理部は、
画素Ｐｉ，ｊ（ただし、２≦ｉ≦Ｍ−１、２≦ｊ≦Ｎ−
１）を中心とする３行３列の画素行列Ｘ（ｉ，ｊ）が、
１は黒画素、０は白画素、＊は任意の画素とするとき、
前記の式１、式３のいずれかを満足すれば、Ｐｉ，ｊ＝
１とし、また前記の式６、式８のいずれかを満足すれ
ば、Ｐｉ，ｊ＝０とすることを特徴としている。According to a third aspect of the present invention, at least an image input unit, a preprocessing unit for correcting an image, a cutout unit for cutting out a character image from an image, and a character recognition unit for recognizing the cutout character image are provided. And the pre-processing unit comprises:
Pixel Pi, j (where 2 ≦ i ≦ M−1, 2 ≦ j ≦ N−
A 3 × 3 pixel matrix X (i, j) centered on 1) is
When 1 is a black pixel, 0 is a white pixel, and * is an arbitrary pixel,
If either of the above equations 1 and 3 is satisfied, Pi, j =
It is characterized in that Pi, j = 0 if any one of Expressions 6 and 8 is satisfied.

【００１８】本願請求項４記載の文字認識装置は、少な
くとも画像入力部と、画像を修正する前処理部と、画像
から文字画像を切り出す切り出し部と、切り出された文
字画像を認識する文字認識部とを有し、該前処理部は画
素Ｐｉ，ｊ（ただし、２≦ｉ≦Ｍ−１、２≦ｊ≦Ｎ−
１）を中心とする３行３列の画素行列Ｘ（ｉ，ｊ）が、
１は黒画素、０は白画素、＊は任意の画素とするとき、
前記の式２、式４のいずれかを満足すれば、Ｐｉ，ｊ＝
１とし、また前記の式５、式７のいずれかを満足すれ
ば、Ｐｉ，ｊ＝０とすることを特徴としている。According to a fourth aspect of the present invention, there is provided a character recognition apparatus including at least an image input unit, a preprocessing unit for correcting an image, a cutout unit for cutting out a character image from an image, and a character recognition unit for recognizing the cutout character image. And the pre-processing unit includes a pixel Pi, j (where 2 ≦ i ≦ M−1, 2 ≦ j ≦ N−
A 3 × 3 pixel matrix X (i, j) centered on 1) is
When 1 is a black pixel, 0 is a white pixel, and * is an arbitrary pixel,
If either of Equations 2 and 4 is satisfied, Pi, j =
It is characterized in that Pi, j = 0 if any one of Equations 5 and 7 is satisfied.

【００１９】本願請求項５記載の文字認識装置は、画像
入力部と、画像を修正する前処理部と、画像から文字画
像を切り出す切り出し部と、切り出された文字画像を認
識する文字認識部とを有し、該前処理部は、画素Ｐｉ，
ｊ（ただし、２≦ｉ≦Ｍ−１、２≦ｊ≦Ｎ−１）を中心
とする３行３列の画素行列Ｘ（ｉ，ｊ）が、１は黒画
素、０は白画素、＊は任意の画素とするとき、前記の式
１、式４のいずれかを満足すれば、Ｐｉ，ｊ＝１とし、
また前記の式６、式７のいずれかを満足すれば、Ｐｉ，
ｊ＝０とすることを特徴としている。According to a fifth aspect of the present invention, there is provided a character recognition apparatus comprising: an image input unit; a preprocessing unit for correcting an image; a cutout unit for cutting out a character image from an image; and a character recognition unit for recognizing the cutout character image. And the pre-processing unit includes a pixel Pi,
j (however, 2 ≦ i ≦ M−1, 2 ≦ j ≦ N−1), a 3 × 3 pixel matrix X (i, j), where 1 is a black pixel, 0 is a white pixel, and * When any pixel is satisfied, Pi, j = 1 is satisfied if any of Expressions 1 and 4 is satisfied.
If any of the above equations 6 and 7 is satisfied, Pi,
It is characterized in that j = 0.

【００２０】本願請求項６記載の文字認識装置は、縦横
の解像度比の違いを補正したＭ行Ｎ列の白または黒の画
素から構成される二値化画像を認識する文字認識装置で
あって、少なくとも画像入力部と、画像を修正する前処
理部と、画像から文字画像を切り出す切り出し部と、切
り出された文字画像を認識する文字認識部とを有し、該
前処理部は、画素Ｐｉ，ｊ（ただし、２≦ｉ≦Ｍ−１、
２≦ｊ≦Ｎ−１）を中心とする３行３列の画素行列Ｘ
（ｉ，ｊ）が、１は黒画素、０は白画素、＊は任意の画
素とするとき、前記の式２、式３のいずれかを満足すれ
ば、Ｐｉ，ｊ＝１とし、また前記の式５、式８のいずれ
かを満足すれば、Ｐｉ，ｊ＝０とすることを特徴として
いる。According to a sixth aspect of the present invention, there is provided a character recognition apparatus for recognizing a binarized image composed of M rows and N columns of white or black pixels in which the difference in the vertical and horizontal resolution ratios has been corrected. , At least an image input unit, a preprocessing unit that corrects the image, a cutout unit that cuts out a character image from the image, and a character recognition unit that recognizes the cutout character image, wherein the preprocessing unit includes a pixel Pi , J (where 2 ≦ i ≦ M−1,
3 ≦ 3 pixel matrix X centered on 2 ≦ j ≦ N−1)
When (i, j) is 1 for a black pixel, 0 for a white pixel, and * for any pixel, if either of the above equations 2 and 3 is satisfied, then Pi, j = 1, and If any of Expressions 5 and 8 is satisfied, Pi, j = 0 is set.

【００２１】以上、本願請求項１〜６に記載された文字
認識装置は、縦横の解像度比の違いを補正した結果、輪
郭の本来斜め成分であるべきところが縦または横の成分
となってしまったような二値化画像に対して、既知の文
字認識手法にとって適切なスムージング処理を加え、再
び輪郭の斜め成分を再現することによって、認識率を大
幅に改善することを可能とする。なおスムージング処理
の内容はそれぞれの請求項に記載された発明毎に異なっ
ている。As described above, in the character recognition device according to the first to sixth aspects of the present invention, as a result of correcting the difference in the vertical and horizontal resolution ratios, the original oblique component of the contour becomes a vertical or horizontal component. Such a binarized image is subjected to a smoothing process appropriate for a known character recognition method, and the oblique component of the contour is reproduced again, thereby making it possible to greatly improve the recognition rate. The content of the smoothing process differs for each invention described in each claim.

【００２２】[0022]

【発明の実施の形態】図１は本願発明の文字認識装置の
処理構成を示す。画像入力部１は外部からＦＡＸ等で読
み込まれた画像を入力する。ＦＡＸ標準モードで縦横の
解像度が異なるときは、縦横比の歪みを従来技術の欄で
述べたような手法で補正された画像が入力され、前処理
部２に渡される。前処理部２では入力画像を走査しなが
ら後述するフィルタ処理を行い、画像全域あるいは指定
された所望の領域に対してフィルタ処理を終了すると、
出力画像を切り出し部３に送る。FIG. 1 shows a processing configuration of a character recognition device according to the present invention. The image input unit 1 inputs an image read from an external device by facsimile or the like. When the vertical and horizontal resolutions are different in the FAX standard mode, an image in which the distortion of the aspect ratio is corrected by the method described in the section of the related art is input and passed to the preprocessing unit 2. The preprocessing unit 2 performs a filtering process described below while scanning the input image, and ends the filtering process on the entire image or a designated desired region.
The output image is sent to the cutout unit 3.

【００２３】切り出し部３では前処理を終えた全体画像
の中から文字が存在する部分の文字画像を切り出して認
識部４に送る。認識部４では文字画像を文字として認識
し、その結果を文字コードとして出力する。テキスト出
力部は要求に応じた形式で装置から認識結果を外部に出
力する機能を有するもので、種々のコードセットやイン
タフェースに対応するものが用いられている。The cutout unit 3 cuts out a character image where a character exists from the whole image that has been subjected to the preprocessing, and sends it to the recognition unit 4. The recognition unit 4 recognizes the character image as a character and outputs the result as a character code. The text output unit has a function of outputting a recognition result from a device to the outside in a format according to a request, and a unit corresponding to various code sets and interfaces is used.

【００２４】前処理部２は必ずしも切り出し部３の前で
ある必要はないが、画像が細かく切り出される前にフィ
ルタ処理を行う方が処理としては簡単である。ただしフ
ィルタ処理に要する時間を考えると、切り出した後の画
像に対してのみフィルタ処理を行う方が時間的には短く
なるので、図１において、前処理部２と切り出し部３の
順序を入れ替えても良い。以下の実施例では理解を簡単
にするため、先に１文字に切り出された画像を対象に、
前処理を行う場合について説明する。切り出しを前処理
の後にしても、複数の文字画像全体について一度に前処
理を加えるだけで本質的な差異は無い。なお、入力画像
に含まれるノイズ除去等の従来から行われている前処理
は、画像入力部１で行っても良いし、前処理部２で行っ
ても良い。The pre-processing unit 2 does not necessarily need to be before the clipping unit 3, but it is simpler to perform a filtering process before the image is finely clipped. However, considering the time required for the filtering process, it is shorter in terms of time to perform the filtering process only on the image after the clipping. Therefore, in FIG. 1, the order of the preprocessing unit 2 and the clipping unit 3 is changed. Is also good. In the following embodiment, for the sake of simplicity of understanding, an image cut out into one character first is targeted.
The case where the preprocessing is performed will be described. Even if the clipping is performed after the pre-processing, there is no essential difference only by performing the pre-processing on all the character images at once. Conventional preprocessing such as removal of noise included in an input image may be performed by the image input unit 1 or the preprocessing unit 2.

【００２５】以下では本願発明の前処理部２で用いるフ
ィルタについて説明する。なお、以下の説明で用いる式
において、０は白画素、１は黒画素を表し、＊は画素の
色を問わないこととする。本願発明で用いるフィルタ
は、入力画像中の注目画素を中心とする３行３列の画素
行列Ｘ（ｉ，ｊ）が、前記の式１、式２、式３、式４、
式５、式６、式７、式８のうち特定の条件式を満足した
とき、その注目画素の色を反転して出力するものであ
る。Hereinafter, a filter used in the preprocessing unit 2 of the present invention will be described. In the equations used in the following description, 0 represents a white pixel, 1 represents a black pixel, and * represents any pixel color. In the filter used in the present invention, the pixel matrix X (i, j) of 3 rows and 3 columns centering on the pixel of interest in the input image is obtained by using the above formulas 1, 2, 3, 4,
When a specific conditional expression among Expressions 5, 6, 7, and 8 is satisfied, the color of the pixel of interest is inverted and output.

【００２６】図２は本願発明で用いる各フィルタの作用
を白画素、黒画素の配置で示した図である。図において
も＊は画素の色を問わないことを示す。図２において、
ａ１、ａ２、ａ３、ａ４、ｂ１、ｂ２、ｂ３、ｂ４はそ
れぞれがその名称のフィルタの作用を説明している。入
力画像から得られた３行３列の画素行列Ｘ（ｉ，ｊ）
が、矢印の左側に示された配列に一致したとき、中心の
注目画素Ｐｉ，ｊを反転する。すなわち、矢印の左側の
画素配列は、各フィルタが用いる条件式に相当し、入力
画素行列が左側の画素配列に一致するということは条件
式を満たすことに他ならない。３行３列の入力画素行列
Ｘ（ｉ，ｊ）が左側の配列に一致したとき、その中心の
注目画素Ｐｉ，ｊは反転し、その結果、画素行列Ｘ
（ｉ，ｊ）は矢印の右に示される配列に変わる。FIG. 2 is a diagram showing the operation of each filter used in the present invention in the arrangement of white pixels and black pixels. Also in the figure, * indicates that the color of the pixel is not considered. In FIG.
a1, a2, a3, a4, b1, b2, b3, b4 each describe the operation of the filter with that name. Pixel matrix X (i, j) of 3 rows and 3 columns obtained from the input image
, The target pixel Pi, j at the center is inverted. That is, the pixel array on the left side of the arrow corresponds to the conditional expression used by each filter, and the fact that the input pixel matrix matches the pixel array on the left side is nothing but satisfying the conditional expression. When the input pixel matrix X (i, j) of 3 rows and 3 columns matches the left array, the pixel of interest Pi, j at its center is inverted, and as a result, the pixel matrix X
(I, j) changes to the array shown to the right of the arrow.

【００２７】フィルタａ１、ａ２、ａ３、ａ４は、それ
ぞれ式１、式２、式３、式４を条件式として用いるフィ
ルタである。これらの式１〜式４を条件式とするフィル
タは、いずれも注目画素が白の場合に限って入力画像の
画素行列Ｘ（ｉ，ｊ）が条件を満足する可能性があり、
条件が満足されると、注目画素を白から黒に反転する。
これらａ１〜ａ４の４つのフィルタをまとめてａ群と称
する。例えばフィルタａ１は、中心の注目画素が白画素
であり、注目画素の右上、右、下、左下の画素が黒画素
であり、注目画素の上、左の画素が白画素であるとき、
注目画素を黒画素に変換して出力するフィルタである。The filters a1, a2, a3, and a4 are filters using Equations 1, 2, 3, and 4 as conditional expressions, respectively. In any of the filters using conditional expressions 1 to 4, there is a possibility that the pixel matrix X (i, j) of the input image satisfies the condition only when the target pixel is white.
When the condition is satisfied, the target pixel is inverted from white to black.
These four filters a1 to a4 are collectively referred to as a group a. For example, the filter a1 is such that when the center pixel of interest is a white pixel, the upper right, right, lower, and lower left pixels of the pixel of interest are black pixels, and the upper and left pixels of the pixel of interest are white pixels,
This is a filter that converts a target pixel into a black pixel and outputs the converted pixel.

【００２８】これに対して、フィルタｂ１、ｂ２、ｂ
３、ｂ４は、それぞれ式５、式６、式７、式８を条件式
として用いるフィルタである。これらの、条件式として
式５、式６、式７、式８のいずれかを用いるフィルタ
は、注目画素が黒である場合に限って、入力画像の画素
行列Ｘ（ｉ，ｊ）が条件式を満足する可能性があり、も
し満足すれば注目画素を白に変換して出力する。これら
ｂ１〜ｂ４の４つのフィルタをまとめてｂ群とする。例
えばフィルタｂ１は、中心の注目画素が黒画素であり、
注目画素の右上、上、左、左下の画素が白画素であり、
注目画素の下、右の画素が黒画素であるとき、注目画素
を白画素に変換して出力するフィルタである。On the other hand, filters b1, b2, b
Reference numerals 3 and b4 denote filters using Expression 5, Expression 6, Expression 7, and Expression 8, respectively, as conditional expressions. These filters using any one of Expression 5, Expression 6, Expression 7, and Expression 8 as the conditional expressions only allow the pixel matrix X (i, j) of the input image to be a conditional expression if the target pixel is black. May be satisfied, and if so, the pixel of interest is converted to white and output. These four filters b1 to b4 are collectively referred to as a group b. For example, in the filter b1, the center pixel of interest is a black pixel,
The upper right, upper, left and lower left pixels of the target pixel are white pixels,
When the right pixel under the target pixel is a black pixel, the filter converts the target pixel into a white pixel and outputs the converted white pixel.

【００２９】整理すると、ａ群に属するフィルタａ１は
入力画素行列Ｘ（ｉ，ｊ）が式１を満足するとき行列中
央の注目画素を白から黒に反転し、フィルタａ２は式２
を満足するとき同様に注目画素を黒に反転し、以下同様
にフィルタａ３は式３を満足するとき注目画素を黒に反
転し、フィルタａ４は式４を満足するとき注目画素を黒
に反転するものである。またｂ群に属するフィルタｂ１
はＸ（ｉ，ｊ）が式５を満足するとき中央の注目画素を
黒から白に反転し、以下同様に、フィルタｂ２は式６を
満足するとき白に反転し、フィルタｂ３は式７を満足す
るとき白に反転し、フィルタｂ４は式８を満足するとき
白に反転するものである。In summary, when the input pixel matrix X (i, j) satisfies the expression 1, the filter a1 belonging to the group a inverts the target pixel at the center of the matrix from white to black, and the filter a2 expresses the expression 2
Is satisfied, the target pixel is similarly inverted to black. Similarly, the filter a3 inverts the target pixel to black when the expression 3 is satisfied, and the filter a4 inverts the target pixel to black when the expression 4 is satisfied. Things. Filter b1 belonging to group b
Inverts the center pixel of interest from black to white when X (i, j) satisfies Equation 5, and so on, filter b2 inverts to white when Equation 6 is satisfied, and filter b3 calculates Equation 7 The filter b4 is inverted to white when satisfied, and the filter b4 is inverted to white when Expression 8 is satisfied.

【００３０】一言で言えば、これらのフィルタは、注目
画素を中心とする３行３列の画素行列Ｘ（ｉ，ｊ）が条
件式を満足すれば、行列中央の注目画素を反転し、条件
式を満足しなければ注目画素を反転せずにそのまま出力
する作用を有する。なお、注目画素が入力画像の端の行
または列にあるときは、注目画素を中心とする３行３列
の画素行列Ｘ（ｉ，ｊ）は存在しないので、フィルタ処
理の対象とはならない。しかし文字画像が完全に入力画
像の中に入っていればこのような状態は起こり得ず、逆
に文字の一部が入力画像からはみ出して文字が欠けてい
る場合にはフィルタ処理を施しても認識できない場合が
ほとんどであるので、実用上の問題はない。In short, these filters invert the target pixel at the center of the matrix if the pixel matrix X (i, j) of three rows and three columns centering on the target pixel satisfies the conditional expression, If the conditional expression is not satisfied, there is an effect that the target pixel is output as it is without being inverted. When the target pixel is located at the end row or column of the input image, there is no pixel matrix X (i, j) of three rows and three columns centering on the target pixel, and therefore, is not subjected to the filtering process. However, if the character image is completely inside the input image, such a situation cannot occur, and conversely, if a part of the character protrudes from the input image and the character is missing, even if filter processing is performed There are almost no practical problems because they cannot be recognized in most cases.

【００３１】ａ群に属するａ１、ａ２、ａ３、ａ４の４
つのフィルタはいずれも白画素を黒画素に変換するもの
であるとともに、ａ１は画像の輪郭線のうち右上から左
下に向かう画像の上辺に当たる輪郭線を処理するもの
で、以下同様にａ２は右上から左下に向かう画像の下辺
に当たる輪郭線、ａ３は左上から右下に向かう上辺の輪
郭線、ａ４は左上から右下に向かう下辺の輪郭線を処理
するものである。また、ｂ群に属するｂ１、ｂ２、ｂ
３、ｂ４の４つのフィルタはいずれも黒画素を白画素に
変換するものであると同時に、ａ群と同様に、ｂ１は画
像の輪郭線のうち右上から左下に向かう画像上辺の輪郭
線を処理するもので、以下同様にｂ２は右上から左下に
向かう画像下辺の輪郭線、ｂ３は左上から右下に向かう
上辺の輪郭線、ｂ４は左上から右下に向かう下辺の輪郭
線を処理するものである。A1, a2, a3 and a4 of group a
Each of the three filters converts a white pixel into a black pixel, and a1 processes an outline corresponding to the upper side of the image from the upper right to the lower left among the outlines of the image. A contour line corresponding to the lower side of the image going to the lower left, a3 is a contour line for the upper side going from upper left to lower right, and a4 is a contour line for the lower side going from upper left to lower right. B1, b2, b belonging to group b
Each of the four filters 3 and b4 converts a black pixel into a white pixel, and at the same time, similarly to the group a, b1 processes the contour of the upper edge of the image from the upper right to the lower left among the contours of the image. In the same manner, b2 processes the contour of the lower side of the image from upper right to lower left, b3 processes the contour of the upper side from upper left to lower right, and b4 processes the contour of the lower side from upper left to lower right. is there.

【００３２】ここで、ａ群、ｂ群のそれぞれのフィルタ
の役割を整理すると、ａ群は白画素を黒画素に変換し、
ｂ群は黒画素を白画素に変換するが、それと同時に、ａ
群、ｂ群のいずれにおいても、１番目のフィルタは右上
から左下に向かう斜線の画像上辺の輪郭に作用し、２番
目のフィルタは同じ方向の斜線の画像下辺の輪郭に作用
する。また３番目のフィルタは左上から右下に向かう斜
線の上辺の輪郭に作用し、４番目のフィルタは同じ方向
の斜線の下辺の輪郭に作用する。このことは、図２の各
フィルタの白画素、黒画素の配置を、例えば図７の入力
画像における斜め線の輪郭と対比してみれば容易に理解
できる。Here, when the roles of the filters of the groups a and b are arranged, the group a converts white pixels into black pixels,
The b group converts black pixels into white pixels, but at the same time, a
In each of the groups b and b, the first filter acts on the contour of the upper side of the diagonally shaded image from upper right to lower left, and the second filter acts on the contour of the lower side of the diagonally shaded image in the same direction. The third filter acts on the contour of the upper side of the oblique line from the upper left to the lower right, and the fourth filter acts on the contour of the lower side of the oblique line in the same direction. This can be easily understood by comparing the arrangement of the white pixels and the black pixels of each filter in FIG. 2 with, for example, the outline of the oblique line in the input image in FIG.

【００３３】各フィルタの役割を考慮すると、基本的に
は、右上−左下方向と左上−右下方向のそれぞれについ
て、上辺輪郭と下辺輪郭が処理されるような組み合わせ
であれば、ａ群とｂ群の間でどのような組み合わせも可
能であることが分かる。また４通りの役割について、ａ
群またはｂ群からフィルタを選ぶわけであるから、すべ
ての組み合わせは、２の４乗で１６通りになることも分
かる。１６通りのフィルタセット（フィルタの組み合わ
せ）の一覧を表１に示す。Considering the role of each filter, basically, if the combination is such that the upper side contour and the lower side contour are processed in each of the upper right and lower left directions and the upper left and lower right directions, the groups a and b It can be seen that any combination between groups is possible. For the four roles, a
Since a filter is selected from the group or the group b, it can be seen that all combinations have 16 powers of 2 4. Table 1 shows a list of 16 filter sets (filter combinations).

【００３４】[0034]

【表１】 [Table 1]

【００３５】本願発明の前処理部においては、入力画像
の中の注目画素を中心とする３行３列の画素行列に対し
て、表１の１６通りのフィルタの組み合わせのうち好ま
しい組み合わせのフィルタセットを適用して注目画素の
変換を試み、３行３列の画素行列を構成できない画像の
端の行あるいは列の画素を除くすべての画素に対する処
理が終わるまで、順次処理を繰り返していく。以下で
は、このフィルタの組み合わせの選択について説明す
る。表１の組み合わせのうち、ａ群のフィルタのみを組
み合わせて用いるフィルタセットＳ１は、画素行列Ｘ
（ｉ，ｊ）が式１、式２、式３、式４のいずれかの条件
を満足するとき、注目画素Ｐｉ，ｊを白から黒に反転さ
せる。In the preprocessing unit of the present invention, a filter set of a preferable combination among 16 combinations of filters shown in Table 1 is applied to a pixel matrix of 3 rows and 3 columns centering on a target pixel in an input image. Is applied to attempt the conversion of the pixel of interest, and the processing is sequentially repeated until the processing on all the pixels except for the pixels at the end row or column of the image in which a three-row, three-column pixel matrix cannot be formed. Hereinafter, selection of this filter combination will be described. Among the combinations in Table 1, a filter set S1 using only a filter of group a is a pixel matrix X
When (i, j) satisfies any of the conditions of Expressions 1, 2, 3, and 4, the target pixel Pi, j is inverted from white to black.

【００３６】図７の補正入力画像に表１のＳ１、すなわ
ちａ１、ａ２、ａ３、ａ４のフィルタからなるフィルタ
セットを適用して得られる出力画像を図３に示す。図３
の文字画像の輪郭には本来の斜め連結成分が再現されて
おり、文字認識でも良好な認識結果を得ることが期待で
きる。フィルタセットＳ１に属するａ１、ａ２、ａ３、
ａ４のａ群のフィルタはいずれも白画素を黒画素に変換
するものであるから、フィルタセットＳ１を用いて得ら
れた出力画像である図３と元の画像である図７とを比較
すると、図３の斜め線は元の画像に比べて太くなってい
る。FIG. 3 shows an output image obtained by applying S1 in Table 1, that is, a filter set including filters a1, a2, a3, and a4 to the corrected input image of FIG. FIG.
The original diagonal connected component is reproduced in the outline of the character image, and it can be expected that a good recognition result can be obtained even in character recognition. A1, a2, a3 belonging to the filter set S1,
Since all the filters in a group a4 convert white pixels into black pixels, comparing FIG. 3 which is an output image obtained using the filter set S1 with FIG. 7 which is the original image, The oblique line in FIG. 3 is thicker than the original image.

【００３７】ｂ群のフィルタのみを組み合わせて用いる
フィルタセットＳ２は、画素行列Ｘ（ｉ，ｊ）が式５、
式６、式７、式８のいずれかの条件を満足するとき、注
目画素Ｐｉ，ｊを黒から白に反転させる。図７の補正入
力画像に表１のＳ２、すなわちｂ１、ｂ２、ｂ３、ｂ４
のフィルタからなるフィルタセットを適用して得られる
出力画像を図４に示す。図４も、先程の図３と同様に、
斜め連結成分が良く再現されている。フィルタセットＳ
２を構成するｂ１、ｂ２、ｂ３、ｂ４のｂ群のフィルタ
はいずれも黒画素を白画素に変換するものであるから、
このフィルタセットＳ２を用いて得られた図４の画像の
斜め線は元の画像に比べて細くなっている。A filter set S2 using only the b group of filters in combination has a pixel matrix X (i, j) expressed by the following equation (5).
When any of the conditions of Expressions 6, 7, and 8 is satisfied, the target pixel Pi, j is inverted from black to white. The correction input image shown in FIG. 7 includes S2 in Table 1, that is, b1, b2, b3, and b4.
FIG. 4 shows an output image obtained by applying the filter set including the filters (1) to (3). FIG. 4 is also similar to FIG.
The oblique connected components are well reproduced. Filter set S
B1, b2, b3, and b4 of the b group constituting filter 2 convert a black pixel into a white pixel.
The oblique line of the image of FIG. 4 obtained using this filter set S2 is thinner than the original image.

【００３８】線の太り、細りは認識方法によってはあま
り影響を受けないが、線の太さの影響を受ける認識方法
もある。したがって、線の太さをあまり変えないような
フィルタの工夫も必要である。元の画像における斜め線
の太さを維持するためには、ａ群に属するフィルタと、
ｂ群に属するフィルタとを適当に組み合わせる必要があ
る。例えば右上から左下に向かう斜め線の画像の上側の
輪郭では白画素を黒画素に変換するようにａ群のフィル
タを選び、同じ斜線の画像の下側の輪郭では黒画素を白
画素に変換するようにｂ群のフィルタを選べば、斜線の
太さは変わらない。同様に、左上から右下に向かう斜め
線の画像についてもａ群とｂ群のフィルタのペアを選べ
ば、どちら方向の斜線も元の太さを維持できるから、水
平線や垂直線の太さとのバランスが崩れることもなく、
文字画像の特徴が保存される。Thickness and thinness of the line are not so affected by the recognition method, but there are also recognition methods affected by the thickness of the line. Therefore, it is necessary to devise a filter that does not change the thickness of the line so much. In order to maintain the thickness of the oblique line in the original image, a filter belonging to group a,
It is necessary to appropriately combine filters belonging to group b. For example, a filter of group a is selected so as to convert white pixels into black pixels in the upper contour of an oblique line image from upper right to lower left, and black pixels are converted into white pixels in the lower contour of the same oblique line image. If the filter of the b group is selected as described above, the thickness of the diagonal line does not change. Similarly, with respect to the image of the diagonal line from the upper left to the lower right, if the pair of the filters of the group a and the group b is selected, the original thickness of the diagonal lines in both directions can be maintained, so that the thickness of the horizontal line and the vertical line Without losing balance,
Character image features are saved.

【００３９】表１において、Ｓ３からＳ６の４通りのフ
ィルタセットは、どちら方向の斜線についても、上辺側
と下辺側で必ずａ群とｂ群から一つずつフィルタを選択
しているため、線幅は変化しない。フィルタセットＳ３
はフィルタａ１、ｂ２、ａ３、ｂ４の組み合わせから成
る。すなわち画素行列Ｘ（ｉ，ｊ）が式１または式３を
満足すれば中央の注目画素を白から黒に反転し、式６ま
たは式８を満足すれば中央の注目画素を黒から白に反転
する。In Table 1, the four types of filter sets S3 to S6 always select one filter from the groups a and b on the upper side and the lower side for oblique lines in either direction. The width does not change. Filter set S3
Is composed of a combination of filters a1, b2, a3 and b4. That is, if the pixel matrix X (i, j) satisfies the expression 1 or 3, the central pixel of interest is inverted from white to black, and if the expression 6 or 8 is satisfied, the central pixel of interest is inverted from black to white. I do.

【００４０】Ｓ３では、ａ１とｂ２とが対になってい
て、右上−左下方向の輪郭のうち画像上辺の輪郭ではａ
１の作用で黒画素が増え、画像下辺の輪郭ではｂ２の作
用で白画素が増えるため、全体として右上−左下方向斜
線の線幅は変わらず、この方向の斜線の重心は上方に移
動する。またａ３とｂ４のフィルタも対になっていて、
左上−右下方向の輪郭のうち上辺の輪郭ではａ３の作用
で黒画素が増え、下辺の輪郭ではｂ４の作用で白画素が
増えるため、左上−右下方向の斜線の線幅も変化しな
い。フィルタセットＳ３を用いて処理した画像の例を図
５に示す。図３あるいは図４と比較すると、図５では元
の斜線の幅が維持されていることが分かる。In S3, a1 and b2 are paired, and among the contours in the upper right and lower left directions, a
Since the number of black pixels increases by the action of 1 and the number of white pixels increases by the action of b2 in the outline of the lower side of the image, the line width of the oblique line in the upper right and lower left directions does not change as a whole, and the center of gravity of the oblique line in this direction moves upward. The filters a3 and b4 are also paired,
Of the contours in the upper left and lower right directions, the number of black pixels increases by the action of a3 in the upper side contour and the number of white pixels increases by the action of b4 in the lower side contour, so that the line width of the oblique lines in the upper left and lower right directions does not change. FIG. 5 shows an example of an image processed using the filter set S3. Compared to FIG. 3 or FIG. 4, it can be seen that the original hatched width is maintained in FIG.

【００４１】フィルタセットＳ４はｂ１、ａ２、ｂ３、
ａ４のフィルタからなり、画素行列Ｘ（ｉ，ｊ）が式２
または式４を満足すれば注目画素を白から黒に反転し、
式５または式７を満足すれば注目画素を黒から白に反転
する。すなわち、右上−左下方向および左上−右下方向
のいずれの方向についても、上辺の輪郭では白画素が増
え、下辺の輪郭では黒画素が増えるため、斜線の線幅は
変わらない。The filter set S4 includes b1, a2, b3,
a4 filter, and the pixel matrix X (i, j) is expressed by Equation 2
Alternatively, if Expression 4 is satisfied, the pixel of interest is inverted from white to black,
If Expression 5 or Expression 7 is satisfied, the target pixel is inverted from black to white. That is, in any of the upper right direction and the lower left direction and the upper left direction and the lower right direction, the number of white pixels increases in the outline of the upper side and the number of black pixels increases in the outline of the lower side, so that the line width of the oblique line does not change.

【００４２】フィルタセットＳ５はａ１、ｂ２、ｂ３、
ａ４のフィルタからなり、画素行列Ｘ（ｉ，ｊ）が式１
または式４を満足すれば注目画素を白から黒に反転し、
式６または式７を満足すれば注目画素を黒から白に反転
する。すなわち、右上−左下方向については、上辺の輪
郭では黒画素が増え下辺の輪郭では白画素が増えるが、
逆に左上−右下方向の方向については、上辺の輪郭では
白画素が増え下辺の輪郭では黒画素が増える。いずれに
しても斜線の線幅は変わらない。The filter set S5 includes a1, b2, b3,
a4, and the pixel matrix X (i, j) is expressed by the following equation 1.
Alternatively, if Expression 4 is satisfied, the pixel of interest is inverted from white to black,
If Expression 6 or Expression 7 is satisfied, the target pixel is inverted from black to white. That is, in the upper right-lower left direction, the number of black pixels increases in the outline of the upper side and the number of white pixels increases in the outline of the lower side.
Conversely, in the direction from the upper left to the lower right, white pixels increase in the outline of the upper side and black pixels increase in the outline of the lower side. In any case, the line width of the oblique line does not change.

【００４３】フィルタセットＳ６はｂ１、ａ２、ａ３、
ｂ４のフィルタからなり、画素行列Ｘ（ｉ，ｊ）が式２
または式３を満足すれば注目画素を白から黒に反転し、
式５または式８を満足すれば注目画素を黒から白に反転
する。すなわち、右上−左下方向については、上辺の輪
郭では白画素が増え下辺の輪郭では黒画素が増えるが、
逆に左上−右下方向の方向については、上辺の輪郭では
黒画素が増え下辺の輪郭では白画素が増える。これはフ
ィルタセットＳ５とは対称的な作用であるが、このフィ
ルタセット６においても、やはりいずれの方向について
も斜線の線幅は変わらない。The filter set S6 includes b1, a2, a3,
b4, and the pixel matrix X (i, j) is expressed by the following equation 2.
Alternatively, if Expression 3 is satisfied, the pixel of interest is inverted from white to black,
If Expression 5 or Expression 8 is satisfied, the target pixel is inverted from black to white. That is, in the upper right-lower left direction, white pixels increase in the outline of the upper side and black pixels increase in the outline of the lower side.
Conversely, in the direction from the upper left to the lower right, the number of black pixels increases in the outline of the upper side, and the number of white pixels increases in the outline of the lower side. This is a symmetrical operation with respect to the filter set S5, but the line width of the diagonal lines does not change in any direction in the filter set 6 as well.

【００４４】以上のことから推察されるように、Ｓ３か
らＳ６のフィルタセットでは、斜め線の線幅は保存され
る代わりに、斜線の重心は１／２画素だけ上下左右いず
れかの方向に移動する。これはフィルタ処理の結果、斜
線の片側の黒画素が増え反対側の黒画素が減ることによ
るもので、２方向それぞれの斜線でどちら側の黒画素が
増えるかによって、Ｓ３では上、Ｓ４では下、Ｓ５では
左、Ｓ６では右方向に文字画像の重心が移動する。As can be inferred from the above, in the filter set from S3 to S6, the line width of the diagonal line is not preserved, but the center of gravity of the diagonal line is moved in the up, down, left, or right direction by 1/2 pixel. I do. This is because the number of black pixels on one side of the diagonal line increases and the number of black pixels on the opposite side decreases as a result of the filter processing. Depending on which black pixel increases in each diagonal line in two directions, the upper part in S3 and the lower part in S4. , S5, and the center of gravity of the character image moves rightward in S6.

【００４５】残るＳ７以降のセットを用いた場合は、例
えば左上−右下斜線は太り、右上−左下斜線は細るとい
ったように線幅のバランスを欠く結果となるが、斜線の
スムージング効果は得られるので、これらのフィルタセ
ットを採用することも可能である。Ｓ７とＳ８は一方の
斜め線は太り、他方の斜め線は細る組み合わせである。
またＳ９〜Ｓ１２は一方の斜め線は太り、他方は線幅が
変わらない組み合わせである。またＳ１３〜Ｓ１６は一
方の斜め線は細り、他方は線幅が変わらないという組み
合わせである。When the remaining sets after S7 are used, for example, the upper left and lower right diagonal lines become thicker and the upper right and lower left diagonal lines become thin, resulting in an imbalance in line width. However, a diagonal line smoothing effect is obtained. Therefore, it is also possible to employ these filter sets. S7 and S8 are combinations in which one diagonal line is thick and the other diagonal line is thin.
S9 to S12 are combinations in which one oblique line is thick and the other is unchanged in line width. In addition, S13 to S16 are combinations in which one oblique line is thin and the other line width does not change.

【００４６】[0046]

【発明の効果】請求項１の文字認識装置は、簡単なフィ
ルタ処理で文字認識に必要な斜線の輪郭を平滑化できる
ため、処理時間をかけず、また大規模なメモリを必要と
せずに、認識率の改善ができる。また全体を一様にスム
ージングするのではなく斜線で輪郭のギザギザが大きい
部分にのみスムージング処理が施されるため、文字のつ
ぶれを生じず、かつファインモードの画像に適用しても
余計な悪影響を与えない。さらに、請求項１の文字認識
装置は斜線の線幅を太らせながら平滑化するので、かす
れ気味の画像に適用すれば、線幅の改善効果も奏する。According to the character recognition device of the present invention, the outline of the diagonal lines required for character recognition can be smoothed by simple filter processing, so that processing time is not required and a large-scale memory is not required. The recognition rate can be improved. Also, since smoothing processing is applied only to the parts where the outline is jagged with diagonal lines instead of smoothing the whole uniformly, character collapse does not occur and even if applied to images in fine mode, there is no adverse effect. Do not give. Furthermore, since the character recognition device according to the first aspect smoothes while increasing the line width of the oblique lines, if the image recognition apparatus is applied to a faint image, the effect of improving the line width is also exhibited.

【００４７】請求項２の文字認識装置は、請求項１が斜
線の線幅を太らせるの対して、斜線の線幅を細らせなが
ら平滑化するので、濃すぎる傾向のある画像に適用すれ
ば線幅を改善することができる。これに対して請求項３
〜６の文字認識装置は、斜線の線幅を変えずに平滑化す
るため、線幅を情報な重要として用いる文字認識の手法
に適している。According to the character recognition device of the second aspect, since the smoothing is performed while reducing the line width of the diagonal lines, while the line width of the diagonal lines is widened in the first aspect, the character recognition apparatus can be applied to an image that tends to be too dark. For example, the line width can be improved. Claim 3
Since the character recognition devices of Nos. 1 to 6 perform smoothing without changing the line width of oblique lines, they are suitable for a character recognition method in which the line width is important as information.

【００４８】[0048]

[Brief description of the drawings]

【図１】本発明の文字認識装置の処理構成を示す図であ
る。FIG. 1 is a diagram showing a processing configuration of a character recognition device of the present invention.

【図２】本発明のフィルタの作用を示す図である。FIG. 2 is a diagram showing the operation of the filter of the present invention.

【図３】本発明のフィルタセットＳ１を用いたときの出
力画像を示す図である。FIG. 3 is a diagram showing an output image when a filter set S1 of the present invention is used.

【図４】本発明のフィルタセットＳ２を用いたときの出
力画像を示す図である。FIG. 4 is a diagram showing an output image when a filter set S2 of the present invention is used.

【００４９】[0049]

【図５】本発明のフィルタセットＳ３を用いたときの出
力画像を示す図である。FIG. 5 is a diagram showing an output image when a filter set S3 of the present invention is used.

【図６】縦方向が横方向の１／２の解像度で入力された
画像を示す図である。FIG. 6 is a diagram showing an image input in a vertical direction at a resolution of 横 of a horizontal direction.

【図７】図６の画像の縦横比を改善した画像を示す図で
ある。7 is a diagram showing an image in which the aspect ratio of the image in FIG. 6 is improved.

【図８】文字認識方法における方向指数ヒストグラムの
求め方を示す図である。FIG. 8 is a diagram showing how to obtain a direction index histogram in the character recognition method.

【００５０】[0050]

[Explanation of symbols]

１：画像入力部２：前処理部３：切り出し部４：認識部５：テキスト出力部 1: Image input unit 2: Preprocessing unit 3: Cutout unit 4: Recognition unit 5: Text output unit

───────────────────────────────────────────────────── フロントページの続きＦターム(参考） 5B029 AA01 EE08 EE16 EE17 EE19 5B057 AA11 CA02 CA06 CA12 CA16 CB02 CB06 CB12 CB16 CC02 CE05 CE06 5B064 AA01 AA08 CA12 CA13 CA15 5C077 LL08 LL09 LL10 LL19 MP05 PP02 ──────────────────────────────────────────────────続き Continued on the front page F-term (reference)

Claims

[Claims]

1. A character recognition device for recognizing a binarized image composed of white or black pixels in M rows and N columns in which the difference in the vertical and horizontal resolution ratios has been corrected, comprising at least an image input unit and an image input unit. It has a pre-processing unit for correction, a cut-out unit for cutting out a character image from an image, and a character recognition unit for recognizing the cut-out character image. The pre-processing unit includes pixels Pi, j (where
3 rows 3 centered on 2 ≦ i ≦ M−1, 2 ≦ j ≦ N−1)
When the pixel matrix X (i, j) of the column is 1 as a black pixel, 0 as a white pixel, and * as an arbitrary pixel, any one of the following formulas 1, 2, 3, and 4 is satisfied. For example, Pi, j = 1. (Equation 1) (Equation 2) (Equation 3) (Equation 4)

2. A character recognition apparatus for recognizing a binarized image composed of M rows and N columns of white or black pixels in which a difference in the resolution ratio between the vertical and horizontal directions has been corrected. It has a pre-processing unit for correction, a cut-out unit for cutting out a character image from an image, and a character recognition unit for recognizing the cut-out character image. The pre-processing unit includes pixels Pi, j (where
3 rows 3 centered on 2 ≦ i ≦ M−1, 2 ≦ j ≦ N−1)
When the pixel matrix X (i, j) of the column is 1 as a black pixel, 0 as a white pixel, and * as an arbitrary pixel, any of the following formulas 5, 6, 7, and 8 is satisfied. For example, Pi, j = 1. (Equation 5) (Equation 6) (Equation 7) (Equation 8)

3. A character recognition device for recognizing a binarized image composed of M rows and N columns of white or black pixels in which a difference in resolution ratio between height and width is corrected, comprising at least an image input unit and an image input unit. It has a pre-processing unit for correction, a cut-out unit for cutting out a character image from an image, and a character recognition unit for recognizing the cut-out character image. The pre-processing unit includes pixels Pi, j (where
3 rows 3 centered on 2 ≦ i ≦ M−1, 2 ≦ j ≦ N−1)
When 1 is a black pixel, 0 is a white pixel, and * is an arbitrary pixel, X, (i, j) of a column, if either of the above equations 1 and 3 is satisfied, Pi, j = The character recognition device is characterized in that Pi, j = 0 if the above expression is 1, and if any of the above expressions 6 and 8 is satisfied.

4. A character recognition device for recognizing a binarized image composed of M rows and N columns of white or black pixels in which the difference between the vertical and horizontal resolution ratios has been corrected, comprising: an image input unit; It has a pre-processing unit for correction, a cut-out unit for cutting out a character image from an image, and a character recognition unit for recognizing the cut-out character image.
≦ i ≦ M−1, 2 ≦ j ≦ N−1), a 3 × 3 pixel matrix X (i, j), where 1 is a black pixel, 0 is a white pixel,
* Represents an arbitrary pixel, and if either of the above equations 2 and 4 is satisfied, Pi, j = 1, and the above equation 5,
A character recognition device characterized by setting Pi, j = 0 if any one of Expressions 7 is satisfied.

5. A character recognition apparatus for recognizing a binarized image composed of M rows and N columns of white or black pixels in which a difference in the resolution ratio between the height and width is corrected, comprising: an image input unit; And a character recognizing unit for recognizing the cut-out character image. The pre-processing unit includes a pixel Pi, j (where 2 ≦ i ≦ M
−1, 2 ≦ j ≦ N−1), a three-row, three-column pixel matrix X (i, j), where 1 is a black pixel, 0 is a white pixel, and * is an arbitrary pixel, A character characterized by setting Pi, j = 1 if any of Expressions 1 and 4 are satisfied, and setting Pi, j = 0 if any of Expressions 6 and 7 are satisfied. Recognition device.

6. A character recognition device for recognizing a binarized image composed of M rows and N columns of white or black pixels in which the difference between the vertical and horizontal resolution ratios has been corrected. It has a pre-processing unit for correction, a cut-out unit for cutting out a character image from an image, and a character recognition unit for recognizing the cut-out character image. The pre-processing unit includes pixels Pi, j (where
3 rows 3 centered on 2 ≦ i ≦ M−1, 2 ≦ j ≦ N−1)
When the pixel matrix X (i, j) of a column is 1 for a black pixel, 0 for a white pixel, and * for an arbitrary pixel, if either of the above equations 2 and 3 is satisfied, Pi, j = 1; and if any of Equations 5 and 8 is satisfied, Pi, j = 0.