JP2000235620A

JP2000235620A - Character recognizing device

Info

Publication number: JP2000235620A
Application number: JP11037107A
Authority: JP
Inventors: Hiroyasu Miyahara; 景泰宮原; Fumio Yoda; 文夫依田
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1999-02-16
Filing date: 1999-02-16
Publication date: 2000-08-29
Anticipated expiration: 2019-02-16
Also published as: JP3813373B2

Abstract

PROBLEM TO BE SOLVED: To provide a character recognizing device which obtains a high quality grayscale image for recognition from one input image inputted by using a single plate color image sensor without depending upon a character or background colors. SOLUTION: This device is provided with an image inputting means 1 which obtains an input image by a single plate color image pickup device D constructed by arranging plural pixels in a multiple matrix, an image generating means 2 for character detection which defines a pixel corresponding to a specific single color in an input image as a reference pixel and prepares a grayscale image for character detection on the basis of the pixel value of the reference pixel, a character detecting means 3 which detects a character area from the grayscale image for character detection, an image producing means 4 for character recognition which produces a grayscale image for recognition by using a pixel value in a character area about the character area in the input image and a character recognizing means 5 which recognizes a character from the grayscale image for recognition.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】この発明は、画像中の文字を
認識する文字認識装置に関するものであり、特に単板の
カラーイメージセンサを用いて入力した画像の文字を認
識する文字認識装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition device for recognizing characters in an image, and more particularly to a character recognition device for recognizing characters in an input image using a single-plate color image sensor. is there.

【０００２】[0002]

【従来の技術】図１３は例えば特開平９−１８７０２１
号公報に記載された従来の文字認識装置を示す概略構成
図である。図において１０１はカラー入力画像を入力す
る画像入力手段、３は得られた画像から文字を検出する
文字検出手段、５は得られた文字領域から文字を認識す
る文字認識手段である。2. Description of the Related Art FIG.
FIG. 1 is a schematic configuration diagram showing a conventional character recognition device described in Japanese Unexamined Patent Publication (Kokai) No. H10-15095. In the figure, reference numeral 101 denotes an image input unit for inputting a color input image, 3 denotes a character detection unit for detecting characters from the obtained image, and 5 denotes a character recognition unit for recognizing characters from the obtained character area.

【０００３】画像入力手段１０１の詳細な構成要素とし
て、Ａは光学レンズ、Ｂは光軸移動手段、Ｃは光軸移動
制御手段、Ｄは単版カラーイメージセンサ、ＥはＡ／Ｄ
変換手段、Ｆは第１の記憶手段、Ｈは第２の記憶手段、
Ｉはカラー／白黒指定手段、Ｊは輝度成分検出手段、Ｋ
は白黒画像生成手段である。As detailed components of the image input means 101, A is an optical lens, B is an optical axis moving means, C is an optical axis moving control means, D is a single color image sensor, and E is an A / D.
Conversion means, F is first storage means, H is second storage means,
I is color / monochrome designating means, J is luminance component detecting means, K
Is a monochrome image generating means.

【０００４】次に動作を説明する。なお、このような従
来装置では、撮像対象がカラー画像か白黒画像かをユー
ザが設定し、このユーザ設定に基づいたカラー／白黒指
定手段Ｉの指示により動作が異なるが、本説明において
は、文字認識を前提とした白黒（グレースケール）画像
取得の場合について説明する。Next, the operation will be described. In such a conventional apparatus, the user sets whether the imaging target is a color image or a black-and-white image, and the operation differs depending on the instruction of the color / black-and-white specifying means I based on the user setting. A case of obtaining a black and white (gray scale) image on the premise of recognition will be described.

【０００５】まず光学レンズＡを通って入射する被写体
からの光学像（反射光）を、光軸移動手段Ｂが光軸移動
制御手段Ｃの制御の元で光軸を移動させて単版カラーイ
メージセンサＤに当て、単版カラーイメージセンサＤに
より生成された画像信号をＡ／Ｄ変換手段ＥがＡ／Ｄ変
換してデジタル画像を作成し、第１の記憶手段Ｆに記憶
する。さらに、前記カラー／白黒指定手段Ｉによる指示
が白黒画像のため、画素ずらししないで撮像された画像
に加え、光軸移動手段Ｂにより画素位置が左、右、下方
向に半画素ずつずれた画像を含む合計４画面を得る。な
お、本例で用いる単板カラーイメージセンサＤは、図１
４に示すように光の波長帯（色）毎の感度が異なる４画
素１組の補色系の画素構造となっている。[0005] First, an optical image (reflected light) from a subject incident through an optical lens A is moved by an optical axis moving means B under the control of an optical axis movement control means C to form a single-plate color image. The A / D converter E converts the image signal generated by the single-color image sensor D into a digital image by applying the image signal to the sensor D, and stores the digital image in the first storage unit F. Further, since the instruction by the color / monochrome designating means I is a black and white image, in addition to the image taken without shifting the pixels, the image whose pixel position is shifted by half a pixel to the left, right and down by the optical axis moving means B To obtain a total of four screens. Note that the single-plate color image sensor D used in this example is the same as that shown in FIG.
As shown in FIG. 4, a complementary color pixel structure of a set of four pixels having different sensitivities for each wavelength band (color) of light is provided.

【０００６】次に輝度成分検出手段Ｊは、前記カラー／
白黒指定手段Ｉによる白黒画像指示に基づいて、前記４
画面に対して、フィルタの各色成分から独立に輝度成分
を求める。輝度成分は、各色に対応した画素毎にあらか
じめ正規化関数を求めておき、撮像された画素値にこの
正規化関数を適用することで求められる。本動作により
得られた４画面分の輝度成分を、白黒画像生成手段Ｋ
が、ずらし位置に対応した第２の記憶手段Ｈ上のメモリ
に格納して一つのグレースケール画像とする。Next, the luminance component detecting means J detects the color / color
Based on the black and white image instruction by the black and white specifying means I,
For the screen, a luminance component is obtained independently from each color component of the filter. The luminance component is obtained by obtaining a normalization function in advance for each pixel corresponding to each color, and applying this normalization function to the imaged pixel value. The luminance components for the four screens obtained by this operation are converted into black and white image generating means K.
Are stored in a memory on the second storage means H corresponding to the shift position to form one grayscale image.

【０００７】その後、本グレースケール画像を対象とし
て、文字検出手段３が個々の文字を検出し、文字認識手
段５が検出された文字パターンを認識する。文字検出と
文字認識に関しては、従来手法、例えば三菱電機技報
Ｖｏｌ．６２Ｎｏ．２ｐｐ．９−１２「ナンバープ
レート認識技術」に記載の方法を用いる。Thereafter, the character detecting means 3 detects individual characters in the grayscale image, and the character recognizing means 5 recognizes the detected character pattern. Regarding character detection and character recognition, conventional methods such as Mitsubishi Electric Technical Report
Vol. 62 No. 2 pp. The method described in 9-12 “License plate recognition technology” is used.

【０００８】[0008]

【発明が解決しようとする課題】単板カラーイメージセ
ンサＤを用いた撮像装置を画像入力手段とした従来の文
字認識装置では、あらかじめ定めた正規化関数により各
画素の輝度成分を求めるため、対象物が白黒の場合（例
えば、文字が白、背景が黒の場合）や、あらかじめ文字
と背景の色が分かっている場合には、その色に応じた正
規化関数を準備しておくことが可能であるが、文字や背
景がどのような色か分からない場合には対応できず、高
品質なグレースケール画像を得られない。そのため、認
識精度が低くなるという課題があった。また、画素ずら
しによる高解像度化は複数回撮像した画像を合成するた
め、静止した対象物では有効であるが、対象物が移動し
ている場合には適用できないという課題もあった。In a conventional character recognition device using an image pickup device using a single-chip color image sensor D as image input means, the luminance component of each pixel is obtained by a predetermined normalization function. If the object is black and white (for example, the character is white and the background is black), or if the color of the character and background is known in advance, it is possible to prepare a normalization function corresponding to that color However, it is not possible to cope with the case where the color of the character or the background is unknown, and a high-quality grayscale image cannot be obtained. Therefore, there is a problem that recognition accuracy is reduced. In addition, since high resolution by pixel shift combines images captured a plurality of times, it is effective for a stationary target, but cannot be applied when the target is moving.

【０００９】この発明は、前記のような課題を解決する
ためになされたものであり、文字検出用グレースケール
画像を、予め定めた画素値のみを用いて作成し、検出さ
れた個々の文字領域内の画素値を用いて認識用グレース
ケール画像を作成し直すことで、文字や背景の色によら
ず、単板カラーイメージセンサを用いて入力した１個の
入力画像から高品質な認識用グレースケール画像を得る
文字認識装置を提供することを目的としている。SUMMARY OF THE INVENTION The present invention has been made to solve the above-described problem. A gray scale image for character detection is created using only predetermined pixel values, and each detected character area is detected. By re-creating the grayscale image for recognition using the pixel values in the image, a high-quality grayscale image for recognition can be obtained from a single input image input using a single-chip color image sensor, regardless of the color of characters and background. It is an object of the present invention to provide a character recognition device for obtaining a scale image.

【００１０】[0010]

【課題を解決するための手段】この発明に係る文字認識
手段は、以下の手段を備えている。（ａ）複数の画素が多数マトリクス状に配列されてなる
単板カラー撮像素子にて入力画像を得る画像入力手段、
（ｂ）入力画像中の特定の単色に対応した画素を基準画
素とし、基準画素の画素値に基づいて文字検出用グレー
スケール画像を作成する文字検出用画像作成手段、
（ｃ）文字検出用グレースケール画像から文字領域を検
出する文字検出手段、（ｄ）入力画像中の文字領域につ
いて、文字領域内の画素値を用いて認識用グレースケー
ル画像を作成する文字認識用画像作成手段、（ｅ）認識
用グレースケール画像から文字を認識する文字認識手
段。The character recognition means according to the present invention comprises the following means. (A) image input means for obtaining an input image with a single-chip color imaging device in which a plurality of pixels are arranged in a matrix
(B) a character detection image creating means for creating a gray scale image for character detection based on a pixel value of the reference pixel, using a pixel corresponding to a specific single color in the input image as a reference pixel;
(C) character detection means for detecting a character area from a gray scale image for character detection, and (d) character recognition for generating a gray scale image for recognition using a pixel value in the character area for the character area in the input image. Image creation means; (e) character recognition means for recognizing characters from a grayscale image for recognition.

【００１１】また、文字検出用画像作成手段は、入力画
像の基準画素位置の画素値のみを使用して文字検出用グ
レースケール画像を作成する。The character detecting image generating means generates a character detecting gray scale image using only the pixel value at the reference pixel position of the input image.

【００１２】また、文字検出用画像作成手段は、入力画
像において、各非基準画素に隣接する周辺の基準画素の
画素値から算出した値を該非基準画素の画素値とし、こ
の値と基準画素の画素値とに基づいて文字検出用グレー
スケール画像を作成する。Further, the character detecting image creating means sets a value calculated from a pixel value of a peripheral reference pixel adjacent to each non-reference pixel in the input image as a pixel value of the non-reference pixel. A gray scale image for character detection is created based on the pixel values.

【００１３】また、文字認識用画像作成手段は、文字領
域毎に、非基準画素の濃淡補正比率を算出し、基準画素
の画素値と、濃淡補正比率にて補正した非基準画素の画
素値とに基づいて認識用グレースケール画像を作成す
る。The character recognizing image creating means calculates, for each character area, a gray-scale correction ratio of the non-reference pixel, and calculates a pixel value of the reference pixel and a pixel value of the non-reference pixel corrected by the gray-scale correction ratio. A grayscale image for recognition is created based on.

【００１４】また、文字認識用画像作成手段は、文字領
域毎に、非基準画素の濃淡補正比率および濃淡補正比率
にて補正した参照値を算出し、さらに各非基準画素に対
し、周辺の基準画素の画素値から異なる演算によって複
数の画素値を算出し、該複数の画素値から参照値に最も
近い値を該非基準画素の画素値とし、この画素値と、基
準画素の画素値とに基づいて認識用グレースケール画像
を作成する。The character recognizing image creating means calculates, for each character area, a gray-scale correction ratio of the non-reference pixel and a reference value corrected by the gray-scale correction ratio. A plurality of pixel values are calculated by different calculations from the pixel values of the pixels, and a value closest to the reference value is determined as the pixel value of the non-reference pixel from the plurality of pixel values, based on the pixel value and the pixel value of the reference pixel. To create a grayscale image for recognition.

【００１５】また、演算によって算出される複数の画素
値は、所定の非基準画素の左右の基準画素の画素値の平
均値、および上下の基準画素の画素値の平均値である。The plurality of pixel values calculated by the calculation are the average value of the pixel values of the left and right reference pixels of the predetermined non-reference pixel and the average value of the pixel values of the upper and lower reference pixels.

【００１６】また、文字認識用画像作成手段は、文字領
域毎に、各色毎の平均画素値を求め、平均画素値の基準
画素に対する非基準画素の比率を濃淡補正比率とする。The character recognizing image creating means obtains an average pixel value for each color for each character area, and sets a ratio of the non-reference pixel to the reference pixel in the average pixel value as a gray level correction ratio.

【００１７】また、文字認識用画像作成手段は、文字領
域毎に、全画素を色と画素値の大小に応じて複数のグル
ープに分け、グループ毎に平均画素値を求め、この平均
画素値の基準画素に対する非基準画素の比率を各グルー
プ毎の濃淡補正比率とする。The image generating means for character recognition divides all the pixels into a plurality of groups according to the color and the magnitude of the pixel value for each character area, obtains an average pixel value for each group, and calculates an average pixel value of the group. The ratio of the non-reference pixel to the reference pixel is defined as the density correction ratio for each group.

【００１８】さらに、単板カラー撮像素子は、原色系の
カラーイメージセンサであり、基準画素は緑色に対応し
た画素である。Further, the single-chip color image sensor is a color image sensor of a primary color system, and the reference pixel is a pixel corresponding to green.

【００１９】[0019]

【発明の実施の形態】実施の形態１．図１はこの発明の
文字認識装置を示す概略構成図である。図１において従
来と同一のものは、同一符号を付して説明を省略する。
１は単板カラー撮像素子としての単板カラーイメージセ
ンサＤを用いて画像を入力する画像入力手段、２は文字
検出用グレースケール画像を作成する文字検出用画像作
成手段、４は検出された文字位置に対応した入力画像中
の文字領域について認識用グレースケール画像を作成す
る文字認識用画像作成手段である。DESCRIPTION OF THE PREFERRED EMBODIMENTS Embodiment 1 FIG. 1 is a schematic configuration diagram showing a character recognition device of the present invention. In FIG. 1, the same components as those in the related art are denoted by the same reference numerals, and description thereof is omitted.
1 is an image input means for inputting an image using a single-chip color image sensor D as a single-chip color image sensor, 2 is a character detection image creating means for creating a gray scale image for character detection, and 4 is a detected character. This is a character recognition image creating unit that creates a recognition grayscale image for a character region in the input image corresponding to the position.

【００２０】画像入力手段１の詳細な構成要素として、
Ａは光学レンズ、Ｄは上述の単版カラーイメージセン
サ、ＥはＡ／Ｄ変換手段、Ｆは記憶手段である。As detailed components of the image input means 1,
A is an optical lens, D is the single color image sensor described above, E is A / D conversion means, and F is storage means.

【００２１】次に動作を図により説明する。図２はこの
発明の処理フロー図である。ここでは図３から図７まで
を用いて動作を説明する。図３は単板カラーイメージセ
ンサの画素並び、図４は入力画像、図５は文字検出用グ
レースケール画像、図６は文字検出用グレースケール画
像中の検出文字領域を示す図、図７は入力画像中の検出
文字領域を示す図である。Next, the operation will be described with reference to the drawings. FIG. 2 is a processing flowchart of the present invention. Here, the operation will be described with reference to FIGS. FIG. 3 shows a pixel array of a single-chip color image sensor, FIG. 4 shows an input image, FIG. 5 shows a gray scale image for character detection, FIG. 6 shows a detected character area in the gray scale image for character detection, and FIG. FIG. 4 is a diagram illustrating a detected character area in an image.

【００２２】まず、単板カラーイメージセンサＤを用い
た画像入力手段１が、画像を入力してデジタル化し、入
力画像を作成（ステップＳ１）する。単板カラーイメー
ジセンサとは、特定の色（光の波長帯）に感度を持つ画
素が複数配列されて構成されているものであり、例え
ば、画像電子学会誌第２７巻第２号（１９９８）ｐ
ｐ．８３〜８９の「２．ディジタルスチルカメラ用撮像
素子」の表３に記載されているように、原色系と補色系
の２種類がある。本実施の形態では、原色系を使用した
例を説明する。原色系では、図３に示す４画素を1組と
し、これが多数並んだ構成となっている。図３におい
て、６と９はＧ（緑色）成分に感度を持つ画素、７はＲ
（赤色）成分に感度を持つ画素、８はＢ（青色）成分に
感度を持つ画素である。First, an image input means 1 using a single-chip color image sensor D inputs and digitizes an image to create an input image (step S1). The single-chip color image sensor is configured by arranging a plurality of pixels having sensitivity to a specific color (wavelength band of light). For example, the Journal of the Institute of Image Electronics Engineers of Japan Vol. 27, No. 2, (1998) p
p. As described in Table 3 of “2. Imaging device for digital still camera” of 83 to 89, there are two types of primary color system and complementary color system. In this embodiment, an example using a primary color system will be described. The primary color system has a configuration in which the four pixels shown in FIG. In FIG. 3, 6 and 9 are pixels having sensitivity to the G (green) component, and 7 is R
A pixel having sensitivity to the (red) component and a pixel 8 having sensitivity to the B (blue) component.

【００２３】また、図４は画像入力手段１により作成さ
れた入力画像である。図４において、入力画像は幅ｗ画
素、高さｈ画素であり、各画素は対応する色成分と座標
値で表現している。例えばＧ（２，４）は、Ｘ座標が
２、Ｙ座標が４の位置にあるＧ成分の画素を表わす。FIG. 4 shows an input image created by the image input means 1. In FIG. 4, the input image has a width of w pixels and a height of h pixels, and each pixel is represented by a corresponding color component and a coordinate value. For example, G (2, 4) represents a G component pixel at a position where the X coordinate is 2 and the Y coordinate is 4.

【００２４】次に文字検出用画像作成手段２は、図４の
入力画像から文字検出用グレースケール画像を作成（ス
テップＳ２）する。本ステップにおいては、入力画像の
うち特定色成分の画素を基準画素として用いる。本実施
の形態ではＧ成分（緑）の画素を基準画素とした例で説
明する。まず入力画像のうち隣接する縦２ラインをライ
ンペアとし、このラインペアからＧ成分の画素値のみを
抽出して一つの縦ラインとする。この縦ラインの画素を
合成し、文字検出用グレースケール画像とする。Next, the character detecting image forming means 2 forms a character detecting gray scale image from the input image of FIG. 4 (step S2). In this step, pixels of a specific color component in the input image are used as reference pixels. In the present embodiment, an example will be described in which a G component (green) pixel is used as a reference pixel. First, two adjacent vertical lines in the input image are set as a line pair, and only the G component pixel value is extracted from this line pair to form one vertical line. The pixels of this vertical line are combined to form a gray scale image for character detection.

【００２５】図５は図４の入力画像から作成した文字検
出用グレースケール画像を示す。図５において、各画素
は入力画像中の対応する画素を示している。画像サイズ
は、幅がｐ画素、高さがｈ画素であり、画像幅ｐは入力
画像の幅ｗの半分である。入力画像との対応関係として
は、図４の入力画像におけるＸ座標１と２の縦ラインが
図５のＸ座標１の縦ラインに、入力画像におけるＸ座標
３と４の縦ラインが図５のＸ座標２の縦ラインに対応し
ている。FIG. 5 shows a gray scale image for character detection created from the input image of FIG. In FIG. 5, each pixel indicates a corresponding pixel in the input image. The image size is p pixels in width and h pixels in height, and the image width p is half the width w of the input image. As for the correspondence relationship with the input image, the vertical lines of X coordinates 1 and 2 in the input image of FIG. 4 correspond to the vertical lines of X coordinate 1 in FIG. 5, and the vertical lines of X coordinates 3 and 4 in the input image correspond to those in FIG. This corresponds to the vertical line at X coordinate 2.

【００２６】次に文字検出手段３は、従来と同様の手順
で、文字検出用グレースケール画像から文字を検出（ス
テップＳ３）する。Next, the character detecting means 3 detects characters from the gray scale image for character detection in the same procedure as in the prior art (step S3).

【００２７】その後、検出された個々の文字領域を対象
として、文字認識用画像作成手段４が認識用グレースケ
ール画像を作成（ステップＳ４）する。まず、文字検出
用グレースケール画像における検出位置の座標を入力画
像における座標に変換し、入力画像における文字領域を
求める。図６は文字検出用グレースケール画像での検出
文字領域の例を示す図であり、１０が当該文字領域であ
る。図７は図６における検出文字領域１０に対応した入
力画像中の文字領域を示し、１１が入力画像での文字領
域である。本実施の形態における文字検出用グレースケ
ール画像は、入力画像と高さが同じで幅が半分の画像で
あるため、検出位置のＹ座標は同じであるが、Ｘ座標は
異なっている。Thereafter, the character recognition image creating means 4 creates a recognition gray scale image for each detected character area (step S4). First, the coordinates of the detection position in the gray scale image for character detection are converted into coordinates in the input image, and a character area in the input image is obtained. FIG. 6 is a diagram showing an example of a detected character area in a gray scale image for character detection, and 10 is the character area. FIG. 7 shows a character area in the input image corresponding to the detected character area 10 in FIG. 6, and 11 is a character area in the input image. The gray scale image for character detection in the present embodiment is an image having the same height and half the width as the input image, so that the Y coordinate of the detection position is the same, but the X coordinate is different.

【００２８】文字認識用画像作成手段４は、この入力画
像中の文字領域を対象として、色成分別に平均画素値を
算出（ステップＳ４１）する。本動作により、Ｇ成分
（緑）、Ｒ成分（赤）、Ｂ成分（青）それぞれの平均画
素値が得られる。次に文字認識用画像作成手段４は、非
基準画素（基準画素以外）の色成分の平均画素値につい
て、基準画素の平均画素値との比を計算して補正比率を
求める（ステップＳ４２）。本実施の形態では基準画素
はＧ成分の画素であるため、式１の算式に従ってＲ成分
及びＢ成分の補正比率を求める。The character recognizing image creating means 4 calculates an average pixel value for each color component for the character area in the input image (step S41). By this operation, an average pixel value of each of the G component (green), the R component (red), and the B component (blue) is obtained. Next, the character recognizing image creating means 4 calculates the ratio of the average pixel value of the color component of the non-reference pixel (other than the reference pixel) to the average pixel value of the reference pixel to obtain a correction ratio (step S42). In the present embodiment, since the reference pixel is a G component pixel, the correction ratio of the R component and the B component is obtained according to the equation (1).

【００２９】[0029]

【数１】 (Equation 1)

【００３０】さらに文字認識用画像作成手段４は、入力
画像中の文字領域について画素値を補正（ステップＳ４
３）して認識用グレースケール画像を作成する。この画
素値補正では、基準画素の値は補正せずにそのまま用
い、非基準画素については、対応する色成分に応じた補
正比率により式２に従ってグレースケール値を求める。Further, the character recognition image creating means 4 corrects the pixel value for the character area in the input image (step S4).
3) to create a grayscale image for recognition. In this pixel value correction, the value of the reference pixel is used without correction, and for the non-reference pixel, a gray scale value is obtained according to Equation 2 using a correction ratio corresponding to the corresponding color component.

【００３１】[0031]

【数２】 (Equation 2)

【００３２】次に文字認識手段５は、従来と同様の手順
で、認識用グレースケール画像から文字を認識（ステッ
プＳ５）する。以上のステップＳ４〜ステップＳ５の動
作を検出された文字の数だけ繰り返し、全ての検出文字
の認識が終了した時点で処理終了とする（ステップＳ
６）。Next, the character recognizing means 5 recognizes a character from the gray scale image for recognition in the same procedure as in the prior art (step S5). The operations in steps S4 to S5 are repeated by the number of detected characters, and the process ends when the recognition of all the detected characters is completed (step S5).
6).

【００３３】この実施の形態では、以上の工程を経るこ
とで、文字と背景の色が不明な場合でも高品質なグレー
スケール画像で認識が行え、高い認識精度を得ることが
できる。また、一つの撮像画像だけを使用するため、移
動している対象物にも対応することが可能となる。さら
に、入力画像全体を対象とする文字検出用グレースケー
ル画像作成は処理量の少ない簡単な方法とし、高品質化
のため処理量の多くなる認識用グレースケール画像作成
では検出された文字領域だけを対象とすることにより、
処理量を押さえつつ認識精度の高い、コストパフォーマ
ンスに優れた文字認識装置が実現できる。In this embodiment, by performing the above-described steps, a high-quality grayscale image can be recognized even when the colors of the characters and the background are unknown, and high recognition accuracy can be obtained. Further, since only one captured image is used, it is possible to cope with a moving object. Furthermore, creating a grayscale image for character detection for the entire input image is a simple method with a small amount of processing, and creating a grayscale image for recognition, which requires a large amount of processing for high quality, uses only the detected character area. By targeting
It is possible to realize a character recognition device that has high recognition accuracy while suppressing the processing amount and is excellent in cost performance.

【００３４】実施の形態２．以下、この発明の実施の形
態２を図について説明する。上記実施の形態１と同様に
概略構成図は図１である。次に動作を図により説明す
る。処理フロー図は図８である。ここでは図４と図９を
用いて動作を説明する。図９は文字検出用グレースケー
ル画像である。Embodiment 2 Hereinafter, a second embodiment of the present invention will be described with reference to the drawings. FIG. 1 is a schematic configuration diagram similar to the first embodiment. Next, the operation will be described with reference to the drawings. FIG. 8 is a processing flowchart. Here, the operation will be described with reference to FIGS. FIG. 9 is a gray scale image for character detection.

【００３５】まず、実施の形態１と同様の手順で、画像
入力手段１が画像を入力して入力画像を作成（ステップ
Ｔ１）する。次に文字検出用画像作成手段２は、入力画
像から文字検出用グレースケール画像を作成（ステップ
Ｔ２）する。本実施の形態では、基準画素は実施の形態
１と同じＧ成分画素とするが、文字検出用グレースケー
ル画像は入力画像と幅・高さが同じものとし、基準画素
の値はそのまま用い、非基準画素の値を周辺の基準画素
値から決定して文字検出用グレースケール画像とする。First, in the same procedure as in the first embodiment, the image input means 1 inputs an image to create an input image (step T1). Next, the character detection image creation means 2 creates a character detection grayscale image from the input image (step T2). In the present embodiment, the reference pixel is the same G component pixel as in the first embodiment, but the gray scale image for character detection has the same width and height as the input image, and the value of the reference pixel is used as it is. The value of the reference pixel is determined from the surrounding reference pixel values to obtain a gray scale image for character detection.

【００３６】図９は図４の入力画像に対応して作成した
文字検出用グレースケール画像の例であり、図９におい
て、Ｇは入力画像における基準画素に対応し、この画素
値は入力画像での画素値と同じである。一方Ｇ’は入力
画像での非基準画素を示し、この画素値は周辺の基準画
素値から算出する。この算出法については、例えば、左
右に隣接する基準画素値の平均値とする方法、上下に隣
接する基準画素値の平均値とする方法、上下左右に隣接
する基準画素値の平均値とする方法、などがあるが、本
実施の形態では、左右に隣接する基準画素値の平均値と
する方法で説明する。この場合、例えば図９のＧ’
（３，２）の画素値は、Ｇ（２，２）とＧ（４，２）の
画素値の平均値となる。FIG. 9 is an example of a gray scale image for character detection created corresponding to the input image of FIG. 4. In FIG. 9, G corresponds to a reference pixel in the input image, and this pixel value is the input image. Is the same as the pixel value. On the other hand, G ′ indicates a non-reference pixel in the input image, and this pixel value is calculated from peripheral reference pixel values. For example, a method of calculating the average value of the reference pixel values adjacent to the left and right, a method of calculating the average value of the reference pixel values of the upper and lower sides, and a method of calculating the average value of the reference pixel values of the upper and lower sides are used. However, in the present embodiment, a method will be described in which the average value of the reference pixel values adjacent to the left and right is used. In this case, for example, G ′ in FIG.
The pixel value of (3,2) is an average value of the pixel values of G (2,2) and G (4,2).

【００３７】この後は、実施の形態１と同様の手順で、
文字検出を行い（ステップＴ３）、検出された全文字領
域について、認識用グレースケール画像作成と文字認識
を行う（ステップＴ４〜Ｔ６）。Thereafter, the same procedure as in the first embodiment is used.
Character detection is performed (step T3), and a grayscale image for recognition and character recognition are performed for all detected character regions (steps T4 to T6).

【００３８】この実施の形態では、以上の工程を経るこ
とで、解像度の高い文字検出用グレースケール画像を作
成でき、文字検出精度を高めることができる。なお、本
実施の形態では、非基準画素に対応した文字検出用グレ
ースケール画像の画素値を左右に隣接する基準画素値の
平均値としたが、これは上下左右に隣接する基準画素値
の平均値とする方法など、他の方法を用いても良い。In this embodiment, through the above steps, a high-resolution gray scale image for character detection can be created, and the character detection accuracy can be improved. In this embodiment, the pixel value of the gray scale image for character detection corresponding to the non-reference pixel is the average value of the reference pixel values adjacent to the left and right. Other methods such as a method of setting a value may be used.

【００３９】実施の形態３．以下、この発明の実施の形
態３を図について説明する。上記実施の形態１と同様に
概略構成図は図１である。次に動作を図により説明す
る。処理フロー図は図１０である。ここでは図１１を用
いて動作を説明する。図１１は画素値補正を説明するた
めの図である。Embodiment 3 Hereinafter, a third embodiment of the present invention will be described with reference to the drawings. FIG. 1 is a schematic configuration diagram similar to the first embodiment. Next, the operation will be described with reference to the drawings. FIG. 10 is a processing flowchart. Here, the operation will be described with reference to FIG. FIG. 11 is a diagram for explaining pixel value correction.

【００４０】まず、実施の形態１と同様の手順で、画像
入力手段１が画像を入力して入力画像を作成（ステップ
Ｕ１）し、文字検出用画像作成手段２が入力画像から文
字検出用グレースケール画像を作成（ステップＵ２）し
て、文字検出手段３が文字検出用グレースケール画像か
ら文字を検出（ステップＵ３）する。First, in the same procedure as in the first embodiment, the image input means 1 inputs an image to create an input image (step U1), and the character detecting image creating means 2 uses the character detecting grayscale from the input image. A scale image is created (step U2), and the character detection means 3 detects a character from the gray scale image for character detection (step U3).

【００４１】次に文字認識用画像作成手段４は、入力画
像中の検出文字領域を対象として、認識用グレースケー
ル画像を作成（ステップＵ４）する。本動作において
も、実施の形態１と同様に、まず色成分別に平均画素値
を算出（ステップＵ４１）し、非基準画素の補正比率を
色成分別に算出（ステップＵ４２）する。さらに文字認
識用画像作成手段４は、非基準画素の値を上記補正比率
で補正するが、本実施の形態では、ここで補正した値を
そのまま認識用グレースケール画像の画素値とはせず、
画素値補完のための参照値とする（ステップＵ４３）。Next, the character recognition image creating means 4 creates a recognition gray scale image for the detected character area in the input image (step U4). In this operation, as in the first embodiment, first, the average pixel value is calculated for each color component (step U41), and the correction ratio of the non-reference pixel is calculated for each color component (step U42). Further, the character recognition image creating means 4 corrects the value of the non-reference pixel with the correction ratio, but in the present embodiment, the corrected value is not directly used as the pixel value of the recognition grayscale image.
A reference value for pixel value complement is set (step U43).

【００４２】上記参照値を用いた非基準画素の画素値補
完（ステップＵ４４）について、以下で説明する。本補
完においては、周辺の基準画素の値から補完対象画素の
補完値を決定するが、複数の異なる方法で算出した複数
の補完値のうち最も参照値に近い値を選択して当該画素
の補完値とし、これを認識用グレースケール画像の画素
値とする。上記複数の補完方法には種々の方法がある
が、本実施の形態では、上下に隣接する基準画素の値の
平均値をとる方法と、左右に隣接する基準画素の平均値
をとる方法の二つを用いる場合で説明する。図１１はこ
の画素値補完を説明するための図であり、１２は補完対
象の非基準画素、１３〜１６は非基準画素１２に隣接す
る基準画素を示し、図中の基準画素１３〜１６における
数値は入力画像における各画素の値を示している。ここ
で、まず文字認識用画像作成手段４は、第１の補完値と
して上下に隣接する基準画素の値の平均値（１２０＋１
５０）÷２＝１３５と、第２の補完値として左右に隣接
する基準画素の値の平均値（５２＋６８）÷２＝６０を
算出する。次にステップＵ４３で得た非基準画素１２の
参照値と上記二つの補完値とで差の絶対値を計算し、こ
の値が最小となる補完値、すなわち参照値と最も近い補
完値を選択する。例えば、非基準画素１２の参照値が１
２０とすると、第１の補完値との差の絶対値がＡＢＳ
（１２０−１３５）＝１５、第２の補完値との差の絶対
値がＡＢＳ（１２０−６０）＝６０となり、第１の補完
値が選択される。なお、上記説明において、ＡＢＳは絶
対値をとる演算子を示す。The pixel value complement of the non-reference pixel using the reference value (step U44) will be described below. In this complementation, the complementation value of the pixel to be complemented is determined from the value of the peripheral reference pixel, and the value closest to the reference value is selected from among a plurality of complementation values calculated by a plurality of different methods to complement the pixel. Value, and this is used as the pixel value of the grayscale image for recognition. There are various methods for the plurality of complementing methods. In the present embodiment, there are two methods: a method of averaging the values of vertically adjacent reference pixels and a method of averaging the values of left and right adjacent reference pixels. The case where one is used will be described. FIG. 11 is a diagram for explaining this pixel value complementation. Reference numeral 12 denotes a non-reference pixel to be complemented, reference numerals 13 to 16 denote reference pixels adjacent to the non-reference pixel 12, and reference pixels 13 to 16 in FIG. Numerical values indicate the value of each pixel in the input image. Here, first, the character-recognition-image creating unit 4 determines the average value (120 + 1) of the values of the reference pixels vertically adjacent to each other as the first complement value.
50) ÷ 2 = 135, and calculate the average value of the values of the reference pixels adjacent to the left and right (52 + 68) ÷ 2 = 60 as the second complement value. Next, the absolute value of the difference between the reference value of the non-reference pixel 12 obtained in step U43 and the two complementary values is calculated, and the complementary value that minimizes this value, that is, the complementary value closest to the reference value, is selected. . For example, the reference value of the non-reference pixel 12 is 1
Assuming that the absolute value of the difference from the first complement value is ABS
(120-135) = 15, the absolute value of the difference from the second complement value is ABS (120-60) = 60, and the first complement value is selected. In the above description, ABS indicates an operator that takes an absolute value.

【００４３】その後は、実施の形態１と同様の手順で文
字認識を行う（ステップＵ５）。さらに、認識処理が為
されていない文字領域が無くなるまで、上記認識用グレ
ースケール画像作成と文字認識を繰り返す（ステップＵ
４〜Ｕ６）。Thereafter, character recognition is performed in the same procedure as in the first embodiment (step U5). Further, the above-described generation of the grayscale image for recognition and the character recognition are repeated until there is no longer any character area that has not been subjected to the recognition processing (step U
4-U6).

【００４４】この実施の形態では、以上の工程を経るこ
とで、特定の色成分の平均画素値が小さく、補正比率に
よる補正では誤差が大きい場合でも高品質な認識用グレ
ースケール画像を作成できる。なお、本実施の形態で
は、文字認識用グレースケース画像作成における非基準
画素の補完方法として、上下に隣接する基準画素の値の
平均値をとる方法と、左右に隣接する基準画素の平均値
をとる方法の二つで説明したが、これは別の方法でも良
く、また三つ以上の方法を適用しても良い。In this embodiment, through the above-described steps, a high-quality gray scale image for recognition can be created even when the average pixel value of a specific color component is small and the correction by the correction ratio has a large error. In the present embodiment, as a method of complementing non-reference pixels in creating a grace case image for character recognition, a method of taking an average value of values of reference pixels adjacent vertically and a method of taking an average value of reference pixels adjacent to left and right are used. Although two methods have been described, this may be another method, or three or more methods may be applied.

【００４５】実施の形態４．以下、この発明の実施の形
態４を図について説明する。上記実施の形態１と同様に
概略構成図は図１である。次に動作を図により説明す
る。処理フロー図は図１２である。Embodiment 4 Hereinafter, a fourth embodiment of the present invention will be described with reference to the drawings. FIG. 1 is a schematic configuration diagram similar to the first embodiment. Next, the operation will be described with reference to the drawings. FIG. 12 is a processing flowchart.

【００４６】まず、実施の形態１と同様の手順で、画像
入力手段１が画像を入力して入力画像を作成（ステップ
Ｖ１）し、文字検出用画像作成手段２が入力画像から文
字検出用グレースケール画像を作成（ステップＶ２）し
て、文字検出手段３が文字検出用グレースケール画像か
ら文字を検出（ステップＶ３）する。First, in the same procedure as in the first embodiment, the image input means 1 inputs an image to create an input image (step V1), and the character detection image creating means 2 uses the character detection grayscale from the input image. A scale image is created (Step V2), and the character detecting means 3 detects a character from the gray scale image for character detection (Step V3).

【００４７】次に文字認識用画像作成手段４は、入力画
像中の文字領域を対象として、認識用グレースケール画
像を作成（ステップＶ４）する。本ステップでは、まず
画素値に応じて色成分別に画素のグループ化を行う（ス
テップＶ４１）。このグループ化では、例えば、特定の
色成分の全画素を対象として２値化レベルを求め、この
２値化レベルより大きな値を持つ画素を当該色成分のグ
ループ１に属すものとし、２値化レベル以下の値の画素
は当該色成分のグループ２に属すものとして、二つのグ
ループに分けることでグループ化する。この動作をＲ成
分、Ｇ成分、Ｂ成分それぞれに対して行うことで、合計
六つのグループが得られる。なお、上記２値化レベルの
決定においては、例えば、電子情報通信学会論文誌Ｖ
ｏｌ．Ｊ６３−ＤＮｏ．４ｐｐ３４９−３５６の
「判別および最小２乗基準に基づく自動しきい値選定
法」に記載の方法を用いる。Next, the character recognition image creating means 4 creates a recognition gray scale image for the character area in the input image (step V4). In this step, first, pixels are grouped for each color component according to the pixel value (step V41). In this grouping, for example, a binarization level is obtained for all pixels of a specific color component, and a pixel having a value larger than this binarization level belongs to the color component group 1 and binarization is performed. Pixels having a value equal to or lower than the level belong to the color component group 2 and are divided into two groups. By performing this operation for each of the R, G, and B components, a total of six groups can be obtained. In determining the binarization level, for example, the IEICE Transactions V
ol. J63-D No. 4 pp 349-356, “Automatic Threshold Selection Method Based on Discrimination and Least Square Criterion”.

【００４８】次に文字認識用画像作成手段４は、グルー
プ別に平均画素値を算出（ステップＶ４２）し、さらに
グループ別に補正比率を算出する（ステップＶ４３）。
補正比率算出においては、例えば式３に従って非基準画
素の各グループの補正比率を求める。Next, the image generating means 4 for character recognition calculates an average pixel value for each group (step V42), and further calculates a correction ratio for each group (step V43).
In calculating the correction ratio, the correction ratio of each group of the non-reference pixels is obtained according to, for example, Equation 3.

【００４９】[0049]

【数３】 (Equation 3)

【００５０】さらに文字認識用画像作成手段４は、非基
準画素について、属するグループの補正比率に当該画素
の値を掛けて参照値を算出する（ステップＶ４４）。以
降は実施の形態３と同様に、非基準画素について周辺の
基準画素値と参照値から補完値を求め（ステップＶ４
５）、認識用グレースケール画像を作成した後、文字認
識手段５が文字認識を行う（ステップＶ５）。Further, the character recognition image creating means 4 calculates a reference value for the non-reference pixel by multiplying the correction ratio of the group to which the pixel belongs by the value of the pixel (step V44). Thereafter, in the same manner as in the third embodiment, a complement value is obtained from the peripheral reference pixel value and the reference value for the non-reference pixel (step V4).
5) After creating the gray scale image for recognition, the character recognition means 5 performs character recognition (step V5).

【００５１】この実施の形態では、以上の工程を経るこ
とで、文字部と背景部において色成分の割合が極度に異
なるような場合でも、高品質な認識用グレースケール画
像を作成できる。なお、本実施の形態では、画素のグル
ープ化を各色成分毎に２グループとしたが、これは３グ
ループ以上にしても良い。また、グループ分けの基準を
２値化レベルとしたが、これは他の方法を用いても良
い。In this embodiment, through the above steps, a high-quality grayscale image for recognition can be created even when the ratio of the color components is extremely different between the character portion and the background portion. In this embodiment, the pixels are grouped into two groups for each color component, but may be three or more groups. Further, although the criterion for grouping is the binarization level, other methods may be used.

【００５２】なお、以上の実施の形態１〜４では、カラ
ーイメージセンサを原色系のものとし、また基準画素を
Ｇ成分の画素としたが、これは原色系以外の単版カラー
イメージセンサでも良く、基準画素もＧ成分以外の画素
としても良い。また、動作フローとして、認識用グレー
スケール画像作成から文字認識までを検出文字領域の数
だけ繰り返すようにしたが、これは全文字領域について
認識用グレースケール画像を作成した後、まとめて文字
認識するようにしても良い。In the first to fourth embodiments, the color image sensor is of a primary color system and the reference pixel is a G component pixel. However, a single color image sensor other than the primary color system may be used. The reference pixel may be a pixel other than the G component. In addition, as an operation flow, the process from the generation of the grayscale image for recognition to the character recognition is repeated by the number of the detected character regions, but this is performed by creating the grayscale image for recognition for all the character regions and then performing character recognition collectively. You may do it.

【００５３】[0053]

【発明の効果】この発明に係る文字認識手段は、以下の
手段を備えている。（ａ）複数の画素が多数マトリクス状に配列されてなる
単板カラー撮像素子にて入力画像を得る画像入力手段、
（ｂ）入力画像中の特定の単色に対応した画素を基準画
素とし、基準画素の画素値に基づいて文字検出用グレー
スケール画像を作成する文字検出用画像作成手段、
（ｃ）文字検出用グレースケール画像から文字領域を検
出する文字検出手段、（ｄ）入力画像中の文字領域につ
いて、文字領域内の画素値を用いて認識用グレースケー
ル画像を作成する文字認識用画像作成手段、（ｅ）認識
用グレースケール画像から文字を認識する文字認識手
段。そのため、文字検出用グレースケール画像は予め定
めた画素の画素値を用いて作成するが、検出された個々
の文字領域内の画素値を用いて認識用グレースケール画
像を作成し直すことで、文字や背景の色によらず、単板
カラーイメージセンサによる入力画像から高品質な認識
用グレースケール画像を得ることができ、高い認識精度
を実現できる。The character recognition means according to the present invention includes the following means. (A) image input means for obtaining an input image with a single-chip color imaging device in which a plurality of pixels are arranged in a matrix
(B) a character detection image creating means for creating a gray scale image for character detection based on a pixel value of the reference pixel, using a pixel corresponding to a specific single color in the input image as a reference pixel;
(C) character detection means for detecting a character area from a gray scale image for character detection, and (d) character recognition for generating a gray scale image for recognition using a pixel value in the character area for the character area in the input image. Image creation means; (e) character recognition means for recognizing characters from a grayscale image for recognition. Therefore, the grayscale image for character detection is created using the pixel values of the predetermined pixels.However, by recreating the grayscale image for recognition using the pixel values in the detected individual character regions, the character A high-quality grayscale image for recognition can be obtained from an image input by a single-chip color image sensor regardless of the background color or the color of the background, and high recognition accuracy can be realized.

【００５４】文字検出用画像作成手段は、入力画像の基
準画素位置の画素値のみを使用して文字検出用グレース
ケール画像を作成する。そのため、入力画像中の基準画
素を抽出して文字検出用グレースケール画像を作成する
ことで、簡単に文字検出グレースケール画像が作成で
き、また画像サイズの縮小に伴い高速な文字検出が可能
となる。The character detecting image forming means forms a character detecting gray scale image using only the pixel values at the reference pixel positions of the input image. Therefore, by extracting a reference pixel from the input image to create a grayscale image for character detection, a grayscale image for character detection can be easily created, and high-speed character detection can be performed as the image size is reduced. .

【００５５】また、文字認識用画像作成手段は、文字領
域毎に、非基準画素の濃淡補正比率および濃淡補正比率
にて補正した参照値を算出し、さらに各非基準画素に対
し、周辺の基準画素の画素値から異なる演算によって複
数の画素値を算出し、該複数の画素値から参照値に最も
近い値を該非基準画素の画素値とし、この画素値と、基
準画素の画素値とに基づいて認識用グレースケール画像
を作成する。そのため、基準画素の値はそのまま用い、
非基準画素の値は当該画素周辺の基準画素の値から決定
して文字検出用グレースケール画像とすることにより、
入力画像の解像度を保持した画像で文字検出ができる。The character recognizing image creating means calculates, for each character area, a gray-scale correction ratio of the non-reference pixel and a reference value corrected by the gray-scale correction ratio. A plurality of pixel values are calculated by different calculations from the pixel values of the pixels, and a value closest to the reference value is determined as the pixel value of the non-reference pixel from the plurality of pixel values, based on the pixel value and the pixel value of the reference pixel. To create a grayscale image for recognition. Therefore, the value of the reference pixel is used as it is,
By determining the value of the non-reference pixel from the value of the reference pixel around the pixel and using it as a character detection grayscale image,
Character detection can be performed on an image that maintains the resolution of the input image.

【００５６】また、文字認識用画像作成手段は、文字領
域毎に、非基準画素の濃淡補正比率を算出し、基準画素
の画素値と、濃淡補正比率にて補正した非基準画素の画
素値とに基づいて認識用グレースケール画像を作成す
る。そのため、各文字領域において非基準画素の値を補
正して認識用グレースケール画像を作成することによ
り、文字部と背景部の画素値レベルが均一な高品質な認
識用グレースケール画像を得ることができる。Further, the character recognizing image creating means calculates, for each character area, the gray-scale correction ratio of the non-reference pixel, and calculates the pixel value of the reference pixel and the pixel value of the non-reference pixel corrected by the gray-scale correction ratio. A grayscale image for recognition is created based on. Therefore, by correcting the value of the non-reference pixel in each character region to create a recognition grayscale image, it is possible to obtain a high quality recognition grayscale image in which the pixel value levels of the character portion and the background portion are uniform. it can.

【００５７】また、文字認識用画像作成手段は、文字領
域毎に、非基準画素の濃淡補正比率および濃淡補正比率
にて補正した参照値を算出し、さらに周辺の基準画素の
画素値から異なる演算によって算出した複数の画素値を
算出し、複数の画素値から参照値に最も近い値を隣接す
る非基準画素の画素値とし、この画素値と、基準画素の
画素値とに基づいて認識用グレースケール画像を作成す
る。そのため、非基準画素について、その濃淡補正値を
参照値として用い、当該画素周辺の基準画素の画素値か
ら認識用グレースケール値を決定することにより、濃淡
補正による誤差が大きい場合でも、文字部と背景部の画
素値レベルが均一な高品質な認識用グレースケール画像
を得ることができる。The character recognizing image generating means calculates, for each character area, a gray-scale correction ratio of the non-reference pixel and a reference value corrected by the gray-scale correction ratio, and further calculates a different value from the pixel value of the peripheral reference pixel. The pixel value of the non-reference pixel adjacent to the reference value is calculated as the pixel value of the adjacent non-reference pixel from the plurality of pixel values, and the gray value for recognition is determined based on the pixel value and the pixel value of the reference pixel. Create a scale image. Therefore, for a non-reference pixel, the grayscale correction value is used as a reference value, and the recognition grayscale value is determined from the pixel values of the reference pixels around the pixel. A high-quality grayscale image for recognition with a uniform pixel value level in the background can be obtained.

【００５８】また、演算によって算出される複数の画素
値は、所定の非基準画素の左右の基準画素の画素値の平
均値、および上下の基準画素の画素値の平均値である。
そのため、非基準画素について、その濃淡補正値を参照
値として用い、左右の基準画素の平均値と上下の基準画
素の平均値のうち、最も参照値に近い値を選択して当該
画素の認識用グレースケール値とすることにより、基準
画素値による補正が簡単な近傍演算で実現できる。The plurality of pixel values calculated by the calculation are the average value of the pixel values of the left and right reference pixels of the predetermined non-reference pixel and the average value of the pixel values of the upper and lower reference pixels.
Therefore, for a non-reference pixel, the grayscale correction value is used as a reference value, and a value closest to the reference value is selected from among the average value of the left and right reference pixels and the average value of the upper and lower reference pixels, and is used for recognition of the pixel. By using the gray scale value, correction using the reference pixel value can be realized by a simple neighborhood operation.

【００５９】また、文字認識用画像作成手段は、文字領
域毎に、各色毎の平均画素値を求め、平均画素値の基準
画素に対する非基準画素の比率を濃淡補正比率とする。
そのため、各文字領域において、対応色別の平均画素値
を求め、平均画素値の比率を各色に対応した画素の濃淡
補正比率とすることにより、文字部と背景部の画素値レ
ベルが均一な高品質な認識用グレースケール画像を得る
ことができる。Further, the character recognizing image generating means obtains an average pixel value for each color for each character area, and sets the ratio of the non-reference pixel to the reference pixel in the average pixel value as the gray level correction ratio.
Therefore, in each character area, the average pixel value for each corresponding color is determined, and the ratio of the average pixel value is set as the density correction ratio of the pixel corresponding to each color, so that the pixel value level of the character portion and the background portion is uniform. A high-quality grayscale image for recognition can be obtained.

【００６０】また、文字認識用画像作成手段は、文字領
域毎に、全画素を色と画素値の大小に応じて複数のグル
ープに分け、グループ毎に平均画素値を求め、この平均
画素値の基準画素に対する非基準画素の比率を各グルー
プ毎の濃淡補正比率とする。そのため、各文字領域にお
いて、全画素を対応色と画素値に応じて複数のグループ
に分けてグループ毎に平均画素値を求め、この平均画素
値の比率を各グループに属する画素の濃淡補正比率とす
ることにより、文字部と背景部において色成分の割合が
極度に異なるような場合でも、高品質な認識用グレース
ケール画像を得ることができる。Further, the character recognizing image creating means divides all the pixels into a plurality of groups according to the color and the magnitude of the pixel value for each character area, obtains an average pixel value for each group, and obtains the average pixel value. The ratio of the non-reference pixel to the reference pixel is defined as the density correction ratio for each group. Therefore, in each character area, all the pixels are divided into a plurality of groups according to the corresponding color and the pixel value, and an average pixel value is obtained for each group, and the ratio of the average pixel value is determined by the density correction ratio of the pixels belonging to each group. By doing so, a high-quality grayscale image for recognition can be obtained even when the ratio of the color components is extremely different between the character portion and the background portion.

【００６１】さらに、単板カラー撮像素子は、原色系の
カラーイメージセンサであり、基準画素は緑色に対応し
た画素である。そのため、画像入力手段は原色系のカラ
ーイメージセンサを用いて画像を入力し、文字検出用画
像作成手段は入力画像中の緑色に対応した画素を基準画
素として文字検出用グレースケール画像を作成すること
により、色信号の感度は良いものの輝度信号の感度が悪
い原色系カラーイメージセンサを用いた場合でも、高品
質な認識用グレースケール画像を得ることができる。Further, the single-chip color imaging device is a color image sensor of a primary color system, and the reference pixel is a pixel corresponding to green. Therefore, the image input means inputs an image using a primary color color image sensor, and the character detection image creation means creates a character detection grayscale image using a pixel corresponding to green in the input image as a reference pixel. Accordingly, a high-quality grayscale image for recognition can be obtained even when a primary-color-based color image sensor having good sensitivity for color signals but poor sensitivity for luminance signals is used.

[Brief description of the drawings]

【図１】この発明の文字認識装置を示す概略構成図で
ある。FIG. 1 is a schematic configuration diagram showing a character recognition device of the present invention.

【図２】この発明の文字認識装置の処理フロー図であ
る。FIG. 2 is a processing flowchart of the character recognition device of the present invention.

【図３】単板カラーイメージセンサの画素並びを示す
図である。FIG. 3 is a diagram showing a pixel arrangement of a single-chip color image sensor.

【図４】単板カラーイメージセンサの入力画像を示す
図である。FIG. 4 is a diagram showing an input image of a single-chip color image sensor.

【図５】文字検出用グレースケール画像を示す図であ
る。FIG. 5 is a diagram showing a gray scale image for character detection.

【図６】文字検出用グレースケール画像中の検出文字
領域を示す図である。FIG. 6 is a diagram illustrating a detected character area in a gray scale image for character detection.

【図７】入力画像中の検出文字領域を示す図である。FIG. 7 is a diagram illustrating a detected character area in an input image.

【図８】この発明の文字認識装置の他の例を示す処理
フロー図である。FIG. 8 is a processing flowchart showing another example of the character recognition device of the present invention.

【図９】文字検出用グレースケール画像を示す図であ
る。FIG. 9 is a diagram showing a gray scale image for character detection.

【図１０】この発明の文字認識装置の他の例を示す処
理フロー図である。FIG. 10 is a processing flowchart showing another example of the character recognition device of the present invention.

【図１１】画素値補正を説明するための図である。FIG. 11 is a diagram for explaining pixel value correction.

【図１２】この発明の文字認識装置の他の例を示す処
理フロー図である。FIG. 12 is a processing flowchart showing another example of the character recognition device of the present invention.

【図１３】従来の文字認識装置を示す概略構成図であ
る。FIG. 13 is a schematic configuration diagram showing a conventional character recognition device.

【図１４】４画素１組の補色系の画素構造を示す図で
ある。FIG. 14 is a diagram illustrating a complementary-color pixel structure of a set of four pixels.

[Explanation of symbols]

１画像入力手段、２文字検出用画像作成手段、３
文字検出手段、４文字認識用画像作成手段、５文字
認識手段、Ｄ単板カラーイメージセンサ（単板カラー
撮像素子）。1 image input means, 2 character detection image creation means, 3
Character detecting means, 4 character recognizing image creating means, 5 character recognizing means, D single-chip color image sensor (single-chip color image sensor).

───────────────────────────────────────────────────── フロントページの続きＦターム(参考） 5B029 AA01 BB02 CC21 DD09 5C065 AA07 BB30 CC01 CC04 DD01 EE12 GG26 GG32 5C066 AA01 BA01 CA27 DC06 EC12 GA04 GB01 HA02 HA03 JA01 KD01 KD06 ──────────────────────────────────────────────────続き Continued on the front page F term (reference) 5B029 AA01 BB02 CC21 DD09 5C065 AA07 BB30 CC01 CC04 DD01 EE12 GG26 GG32 5C066 AA01 BA01 CA27 DC06 EC12 GA04 GB01 HA02 HA03 JA01 KD01 KD06

Claims

[Claims]

1. A character recognition device comprising the following means. (A) image input means for obtaining an input image with a single-chip color imaging device in which a plurality of pixels are arranged in a matrix, (b) pixels corresponding to a specific single color in the input image are set as reference pixels, (C) character detection means for detecting a character area from the character detection grayscale image based on the pixel value of the reference pixel; (d) the input; (E) character recognition means for recognizing a character from the recognition grayscale image for the character region in the image using a pixel value in the character region to generate a recognition grayscale image; .

2. The character detecting image generating unit according to claim 1, wherein the character detecting image generating unit generates the character detecting grayscale image using only the pixel value of the input image at the reference pixel position. Character recognition device.

3. The image detection means for character detection, comprising: a pixel value at a predetermined reference pixel position in the input image;
2. The character detection grayscale image is created based on a pixel value at a non-reference pixel position calculated from a pixel value of a reference pixel adjacent to the non-reference pixel.
Character recognition device according to the description.

4. The character recognition image creating means calculates, for each of the character areas, a gray-scale correction ratio of a non-reference pixel, and a pixel value of the reference pixel and a non-standard pixel corrected by the gray-scale correction ratio. The character recognition device according to claim 1, wherein a recognition grayscale image is created based on the pixel value of the character.

5. The image generating means for character recognition calculates, for each of the character regions, a gray-scale correction ratio of a non-reference pixel and a reference value corrected with the gray-scale correction ratio, and further calculates, for each non-reference pixel, A plurality of pixel values are calculated by different calculations from the pixel values of the surrounding reference pixels, and a value closest to the reference value is determined as the pixel value of the non-reference pixel from the plurality of pixel values. 4. A grayscale image for recognition is created based on the values.
The character recognition device according to any one of the above.

6. The plurality of pixel values calculated by the calculation are an average value of pixel values of left and right reference pixels of a predetermined non-reference pixel and an average value of pixel values of upper and lower reference pixels. The character recognition device according to claim 5, wherein

7. The image generating means for character recognition calculates an average pixel value for each color for each character area, and sets a ratio of the non-reference pixel to a reference pixel of the average pixel value as the density correction ratio. The character recognition device according to any one of claims 4 to 6, wherein:

8. The image generating means for character recognition divides all pixels into a plurality of groups for each character area according to the color and the magnitude of the pixel value, and calculates an average pixel value for each group. 7. The character recognition device according to claim 4, wherein a ratio of a non-reference pixel to a reference pixel of a value is set as the gray-scale correction ratio for each group.

9. The character according to claim 1, wherein the single-chip color imaging device is a primary color color image sensor, and the reference pixel is a pixel corresponding to green. Recognition device.