JP2001184511A

JP2001184511A - Device, system and method for processing image and storage medium

Info

Publication number: JP2001184511A
Application number: JP37160699A
Authority: JP
Inventors: Yukari Toda; ゆかり戸田
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1999-12-27
Filing date: 1999-12-27
Publication date: 2001-07-06

Abstract

PROBLEM TO BE SOLVED: To provide an image processor capable of performing the optimal and efficient area division of a multilevel image. SOLUTION: A binarizing means 105 acquires plural binary images from the multilevel image. An area extracting means 107 extracts an area (black area), which includes a pixel group (the set of the groups of black pixels) having a prescribed value, from the plural binary images provided by the binarizing means 105. An area dividing means 110 divides the area provided by the area extracting means 107 on the basis of the crowding state of starting and ending pixels of a pixel group existent inside the relevant area. An area analyzing means 109 analyzes an attribute concerning all the areas provided by the area extracting means 107 and the area dividing means 110.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、例えば、カラー文
書を読み取って得られた多値画像を領域分割する、画像
処理装置、画像処理システム、画像処理方法、及びそれ
を実施するための処理ステップをコンピュータが読出可
能に格納した記憶媒体に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to, for example, an image processing apparatus, an image processing system, an image processing method, and a processing step for performing a region division of a multivalued image obtained by reading a color document. In a computer-readable storage medium.

【０００２】[0002]

【従来の技術】近年では、スキャナの普及により、文書
の電子化が進んでいる。そこで、例えば、文書をスキャ
ナで読み取り、その読取画像上で領域分割を行った場
合、次にようなメリットがある。2. Description of the Related Art In recent years, with the spread of scanners, documents have been digitized. Thus, for example, when a document is read by a scanner and area division is performed on the read image, there are the following merits.

【０００３】例えば、スキャナにより、文字領域部と画
像領域部を含む文書Ａ４サイズのカラー文書を３００ｄ
ｐｉでフルカラー２４ｂｉｔ（多値）で読み取った場
合、その読取画像（電子化された文書）の情報量は、約
２４Ｍバイトとなるが、当該読取画像を文字領域部と画
像領域部に分割し、文字領域部については、２値化して
ＭＭＲ符号化し、画像領域部については、ＪＰＥＧ符号
化等をすることによって、当該読取画像の画質を保ちつ
つ、その情報量を縮小することが可能となる。例えば、
画像によっては、その情報量を１／１００程度に縮小す
ることが可能となる。また、文字領域部については、Ｏ
ＣＲを行うことによってコード化すれば、後々のファイ
ルの検索も可能となり、他のテキストエディット型アプ
リケーションでの再利用も可能となる。For example, a scanner converts a document A4 size color document including a character area portion and an image area portion into a 300d document.
When reading in full color 24 bits (multi-valued) with pi, the information amount of the read image (digitized document) is about 24 Mbytes, but the read image is divided into a character area portion and an image area portion, By performing binarization and MMR encoding for the character area portion and performing JPEG encoding and the like for the image area portion, it is possible to reduce the amount of information while maintaining the image quality of the read image. For example,
Depending on the image, the information amount can be reduced to about 1/100. As for the character area, O
If the encoding is performed by performing the CR, the file can be searched later, and the file can be reused in another text editing type application.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、従来の
画像領域分割方法では、スキャナ等で得られたカラー文
書の読取画像等の多値画像に対して領域分割を行う場
合、多値画像を１つの閾値で二値化して、その二値化画
像に対して領域分割を行っていた。このため、二値化結
果の精度が悪いと、それが領域分割に影響してしまい、
正確な領域分割を行うのが困難であった。また、カラー
文書では、下地や文字領域部等、様々な色部の組み合わ
せが１つの文書内に存在することが考えらるので、１つ
の二値化画像からは、良い領域分割が行えなかった。However, in the conventional image area dividing method, when multi-valued images such as a read image of a color document obtained by a scanner or the like are divided, one multi-valued image is divided into one. Binarization was performed using a threshold, and region division was performed on the binarized image. For this reason, if the accuracy of the binarization result is poor, it will affect the region division,
It has been difficult to perform accurate region division. Further, in a color document, it is considered that various combinations of color portions, such as a background and a character region portion, may exist in one document. Therefore, good region division cannot be performed from one binarized image. .

【０００５】具体的には例えば、図１９（ａ）に示すよ
うな文書９００があるとする。文書９００において、色
ｂの領域Ｂ上には、”あいうえお”という文字Ａが色ａ
で書かれており、色ｄの領域Ｄ上には、”かきくけこ”
という文字Ｃが色ｃで書かれており、これらの文字Ａ、
領域Ｂ、文字Ｃ、領域Ｄの各輝度レベルは、上記図１９
（ｂ）に示すように、文字Ａ、領域Ｂ、文字Ｃ、領域Ｄ
の順で高輝度レベルの方向に分布しているものとする。[0005] Specifically, for example, it is assumed that there is a document 900 as shown in FIG. In the document 900, on the area B of the color b, the character A of "
In the area D of the color d, "Kakikukeko"
Is written in color c, and these letters A,
The brightness levels of the area B, the character C, and the area D are as shown in FIG.
As shown in (b), character A, region B, character C, region D
Are distributed in the direction of the high luminance level in this order.

【０００６】このような文書９００をスキャナで読み取
り、その読取画像を、ある閾値で二値化した場合、その
閾値（二値化閾値）によって、二値化結果が、上記図１
９（ｃ）〜（ｅ）に示すような結果となる。When such a document 900 is read by a scanner and the read image is binarized by a certain threshold, the binarization result is obtained by the threshold (binarization threshold) shown in FIG.
9 (c) to 9 (e).

【０００７】上記図１９（ｃ）は、二値化閾値を文字Ａ
の輝度レベルと文字Ｃの輝度レベルの間の値（同図
（ｂ）中””で示す）とした場合に得られる二値化画
像９０１を示したものである。上記図１９（ｄ）は、二
値化閾値を文字Ｃの輝度レベルと領域Ｂの輝度レベルの
間の値（同図（ｂ）中””で示す）とした場合に得ら
れる二値化画像９０２を示したものである。上記図１９
（ｅ）は、二値化閾値を領域Ｂの輝度レベルと領域Ｄの
輝度レベルの間の値（同図（ｂ）中””で示す）とし
た場合に得られる二値化画像９０３を示したものであ
る。したがって、上記図１９（ｂ）中””で示す値を
二値化閾値として得られた、同図（ｄ）に示す二値化画
像９０２でないと、全ての文字を検出できる領域分割は
望めない。FIG. 19 (c) shows that the binarization threshold is a character A
This figure shows a binarized image 901 obtained when a value between the luminance level of the character C and the luminance level of the character C is indicated (indicated by "" in FIG. 3B). FIG. 19D shows a binarized image obtained when the binarization threshold is set to a value between the luminance level of the character C and the luminance level of the area B (indicated by “” in FIG. 19B). 902. FIG. 19 above
(E) shows a binarized image 903 obtained when the binarization threshold is a value between the luminance level of the region B and the luminance level of the region D (indicated by “” in FIG. It is a thing. Therefore, unless the binarized image 902 shown in FIG. 19D is obtained, in which the value indicated by "" in FIG. 19B is obtained as the binarization threshold, the region division in which all characters can be detected cannot be expected. .

【０００８】しかしながら、文字Ｃの輝度レベルと領域
Ｂの輝度レベルが、図２０に示すように、文字Ｃの輝度
レベル＜領域Ｂの輝度レベルであった場合、１つの二値
化閾値では、領域分割に最適な二値化画像は得られない
ことになる。However, if the luminance level of the character C and the luminance level of the area B are such that the luminance level of the character C <the luminance level of the area B, as shown in FIG. An optimal binarized image for division cannot be obtained.

【０００９】そこで、上記の問題を解決するために、領
域によって二値化閾値を変化させて二値化を行うアダプ
ティブ閾値二値化方式があるが、例えば、図２１（ａ）
に示すような、黒っぽい色ｄ’（低輝度）の領域Ｄ上
に、”かきくけこ”という文字Ｃが白っぽい色ｃ’（高
輝度）で書かれた文書９０４に対しては、最適な領域分
割が行えない。すなわち、文字Ａ、領域Ｂ、文字Ｃ、領
域Ｄの輝度レベルは、上記図２１（ｂ）に示すように、
文字Ａ、領域Ｄ、領域Ｂ、文字Ｃ、の順で高輝度レベル
の方向に分布しており、この状態で、二値化閾値を文字
Ａの輝度レベルと領域Ｄの輝度レベルの間の値（同図
（ｂ）中””で示す）を二値化閾値とした場合、同図
（ｃ）に示す二値化画像９０５が得られ、二値化閾値を
領域Ｄの輝度レベルと領域Ｂの輝度レベルの間の値（同
図（ｂ）中””で示す）とした場合、同図（ｄ）に示
す二値化画像９０６が得られ、二値化閾値を領域Ｂの輝
度レベルと文字Ｃの輝度レベルの間の値（同図（ｂ）
中””で示す）とした場合、同図（ｅ）に示す二値化
画像９０７が得られることになり、上記アダプティブ閾
値二値化方式であっても、領域分割に最適な二値化画像
は得られない。In order to solve the above problem, there is an adaptive threshold binarization method in which the binarization is changed by changing the binarization threshold depending on the area. For example, FIG.
For a document 904 in which the character C of “Kakikokeko” is written in a whitish color c ′ (high brightness) on an area D of a dark color d ′ (low brightness) as shown in FIG. Cannot divide. That is, the luminance levels of the character A, the region B, the character C, and the region D are as shown in FIG.
Characters A, D, B, and C are distributed in the direction of the high luminance level in this order. In this state, the binarization threshold is set to a value between the luminance level of the character A and the luminance level of the region D. When (indicated by “” in FIG. 2B) is a binarization threshold, a binarized image 905 shown in FIG. 2C is obtained, and the binarization threshold is set to the luminance level of the area D and the area B. (Indicated by “” in FIG. 2B), a binarized image 906 shown in FIG. 2D is obtained, and the binarization threshold is set to the luminance level of the area B. A value between the luminance levels of the character C ((b) in FIG.
In the case of “medium”, the binarized image 907 shown in FIG. 11E is obtained, and even with the adaptive threshold binarization method, the binarized image optimal for the region division is obtained. Cannot be obtained.

【００１０】また、分割された領域の処理（２値化処理
等）は、利便性のため外接矩形をとって、その形状での
処理がなされていた。このため、例えば、図２２に示す
ような、黒下地に「ＡＢＣＤＥＦＧ」及び「イロハニホ
ヘト」の白文字が存在する領域９１０と、白下地に「１
２３」、「４５６」、「７９８」等の黒文字が存在する
領域９１１とからなる画像であり、特に、領域９１１
が、領域９１０と外接するような形状（外接矩形の形
状）で存在する画像である場合、この外接矩形を単位に
して、例えば、ＯＣＲのために２値化処理を実行する
と、図２３に示すように、「１２３」、「４５６」、
「７８９」等の黒文字が白下地に存在する領域９１１
が、通常領域（白下地に黒文字が存在する領域）である
にも関わらず反転されてしまう。In the processing of the divided areas (binary processing, etc.), a circumscribed rectangle is used for convenience, and processing is performed in the shape of the circumscribed rectangle. Therefore, for example, as shown in FIG. 22, a region 910 where white characters “ABCDEFG” and “Irohanihohet” exist on a black background and “1” on a white background
23 "," 456 "," 798 ", and other areas in which black characters are present.
Is an image existing in a shape circumscribing the region 910 (shape of a circumscribed rectangle), and performing a binarization process for OCR in units of the circumscribed rectangle, as shown in FIG. So, "123", "456",
An area 911 where a black character such as “789” exists on a white background
Is inverted in spite of being a normal area (an area where a black character exists on a white background).

【００１１】このときの二値化処理を、外接矩形を単位
にした処理でなく、自由形での処理とすれば、上記の問
題は起こらないが、自由形での処理は、それをサポート
する関数等が必要となり、その分、処理が複雑になって
しまう。これに対して、外接矩形を単位にした処理は、
簡単な構成で効率よく処理することができるが、上記の
問題が起きてしまう。If the binarization process at this time is not a process in units of circumscribed rectangles but a process in free form, the above problem does not occur, but the process in free format supports it. Functions and the like are required, which complicates the processing. On the other hand, the processing in units of the circumscribed rectangle is
Although the processing can be efficiently performed with a simple configuration, the above problem occurs.

【００１２】そこで、本発明は、上記の欠点を除去する
ために成されたもので、多値画像の領域分割を最適に且
つ効率的に行える、画像処理装置、画像処理システム、
画像処理方法、及びそれを実施するための処理ステップ
をコンピュータが読出可能に格納した記憶媒体を提供す
ることを目的とする。SUMMARY OF THE INVENTION The present invention has been made to eliminate the above-mentioned disadvantages, and has been made in view of the above circumstances.
It is an object of the present invention to provide an image processing method and a storage medium in which processing steps for executing the image processing method are readable by a computer.

【００１３】[0013]

【課題を解決するための手段】斯かる目的下において、
第１の発明は、多値画像の領域分割を行う画像処理装置
であって、上記多値画像から複数の量子化画像を取得す
る量子化手段と、上記二値化手段により得られた量子化
画像から、所定値を有する画素群を含む領域を抽出する
領域抽出手段と、上記領域抽出手段により得られた領域
を、当該領域内に存在する画素群の開始画素及び終了画
素の密集状態に基づいて分割する領域分割手段と、上記
領域抽出手段及び上記領域分割手段により得られた各領
域についての属性を解析する領域解析手段を備えること
を特徴とする。For such a purpose,
According to a first aspect, there is provided an image processing apparatus for performing region division of a multi-valued image, comprising: a quantization unit configured to acquire a plurality of quantized images from the multi-valued image; A region extraction unit that extracts a region including a pixel group having a predetermined value from an image, and a region obtained by the region extraction unit is determined based on a density of start pixels and end pixels of the pixel group existing in the region. And a region analyzing unit that analyzes attributes of each region obtained by the region extracting unit and the region dividing unit.

【００１４】第２の発明は、上記第１の発明において、
上記量子化手段は、上記量子化画像として二値化画像を
取得することを特徴とする。According to a second aspect, in the first aspect,
The quantizing means acquires a binarized image as the quantized image.

【００１５】第３の発明は、上記第１の発明において、
上記量子化手段は、上記多値画像における、上記領域抽
出手段の領域抽出結果に基づいた領域部分の量子化画像
を取得することを特徴とする。According to a third aspect, in the first aspect,
The quantizing means acquires a quantized image of an area portion of the multi-valued image based on an area extraction result of the area extracting means.

【００１６】第４の発明は、上記第１の発明において、
上記量子化手段は、上記多値画像のヒストグラムから得
られた複数の閾値に基づいて、上記複数の量子化画像を
取得することを特徴とする。According to a fourth aspect, in the first aspect,
The quantizing means acquires the plurality of quantized images based on a plurality of thresholds obtained from a histogram of the multi-valued image.

【００１７】第５の発明は、上記第１の発明において、
上記領域抽出手段は、上記量子化画像から、上記所定値
を有する画素が一次元的に連続する画素群を検出する検
出手段と、上記検出手段により検出された画素群の座標
情報を保持する第１の保持手段と、上記検出手段により
既に検出された画素群の座標情報を保持する第２の保持
手段と、上記第１の保持手段内の情報と上記第２の保持
手段内の情報の結合処理を行う結合手段とを含むことを
特徴とする。According to a fifth aspect, in the first aspect,
The region extracting unit includes a detecting unit configured to detect a pixel group in which the pixels having the predetermined value are one-dimensionally continuous from the quantized image, and a second unit that stores coordinate information of the pixel group detected by the detecting unit. (1) holding means, second holding means for holding the coordinate information of the pixel group already detected by the detection means, and combining information in the first holding means and information in the second holding means. And coupling means for performing processing.

【００１８】第６の発明は、上記第５の発明において、
上記検出手段は、上記量子化画像上において、Ｍライン
毎に上記領域の検出を行うことを特徴とする。According to a sixth aspect based on the fifth aspect,
The detection means detects the area on the quantized image for every M lines.

【００１９】第７の発明は、上記第１の発明において、
上記領域抽出手段にて抽出される領域は、一次元的に連
続する画素群を含む外接矩形の領域を含み、上記領域分
割手段は、上記領域抽出手段にて抽出された領域内に含
まれる画素群の開始画素及び終了画素のヒストグラムか
ら、当該開始画素及び終了画素の密集点を検出する第１
の検出手段と、上記第１の検出手段での検出結果に基づ
き得られる分割領域内に存在する画素群の開始画素及び
終了画素について、上記一次元方向とは異なる次元方向
の広がりを検出する第２の検出手段と、上記第２の検出
手段での検出結果に基づいて、領域分割を実行する分割
手段とを含むことを特徴とする。According to a seventh aspect, in the first aspect,
The region extracted by the region extracting unit includes a circumscribed rectangular region including a one-dimensionally continuous pixel group, and the region dividing unit includes a pixel included in the region extracted by the region extracting unit. A first method for detecting a dense point of the start pixel and the end pixel from the histogram of the start pixel and the end pixel of the group.
Detecting means for detecting a spread in a dimensional direction different from the one-dimensional direction with respect to a start pixel and an end pixel of a pixel group present in a divided area obtained based on a detection result obtained by the first detection means. 2 and a dividing means for performing region division based on the detection result of the second detecting means.

【００２０】第８の発明は、複数の機器が通信可能に接
続されてなる画像処理システムであって、上記複数の機
器のうち少なくとも１つの機器は、請求項１〜７の何れ
かに記載の画像処理装置の機能を有することを特徴とす
る。According to an eighth aspect of the present invention, there is provided an image processing system in which a plurality of devices are communicably connected, wherein at least one device among the plurality of devices is according to any one of claims 1 to 7. It has a function of an image processing device.

【００２１】第９の発明は、多値画像の領域分割を行う
ための画像処理方法であって、上記多値画像からから少
なくとも１つ以上の量子化画像を作成し、その量子化画
像から、黒画素の固まりの領域を取得する領域抽出ステ
ップと、上記領域抽出ステップにより得られた黒領域
を、当該黒領域内に存在する黒画素の固まりの開始画素
及び終了画素の密集状態に基づいて分割する領域分割ス
テップと、上記領域抽出ステップ及び領域分割ステップ
により得られた全ての領域に対して、その属性を解析す
る領域解析ステップとを含むことを特徴とする。A ninth aspect of the present invention is an image processing method for dividing a multi-valued image into regions, wherein at least one or more quantized images are created from the multi-valued image, and An area extracting step of obtaining an area of a cluster of black pixels, and dividing the black area obtained by the area extracting step on the basis of the density of the start pixels and end pixels of the cluster of black pixels existing in the black area. And a region analyzing step of analyzing attributes of all regions obtained by the region extracting step and the region dividing step.

【００２２】第１０の発明は、上記第９の発明におい
て、上記領域抽出ステップは、上記量子化画像として二
値化画像を作成するステップを含むことを特徴とする。In a tenth aspect based on the ninth aspect, the area extracting step includes a step of creating a binarized image as the quantized image.

【００２３】第１１の発明は、上記第９の発明におい
て、上記少なくとも１つ以上の量子化画像の一部は、上
記多値画像の全面から得られた量子化画像ではないこと
を特徴とする。In an eleventh aspect based on the ninth aspect, a part of the at least one or more quantized images is not a quantized image obtained from the entire surface of the multi-valued image. .

【００２４】第１２の発明は、上記第９の発明におい
て、上記領域抽出ステップは、上記量子化画像から、一
次元的に連続する黒画素の固まり検出する黒検出ステッ
プと、上記黒検出ステップにより検出された黒画素の固
まりの座標情報を保持する第１の保持ステップと、上記
黒検出ステップにより既に検出された黒画素の固まりの
座標情報を保持する第２の保持ステップと、上記第１の
保持ステップによる保持情報と上記第２の保持ステップ
による保持情報の結合処理を行う結合ステップを含むこ
とを特徴とする。In a twelfth aspect based on the ninth aspect, the region extracting step includes a black detecting step of detecting a cluster of one-dimensionally continuous black pixels from the quantized image, and a black detecting step. A first holding step of holding the detected coordinate information of the cluster of black pixels, a second holding step of holding the coordinate information of the cluster of black pixels already detected by the black detection step, and the first holding step; The method includes a combining step of combining the held information in the holding step and the held information in the second holding step.

【００２５】第１３の発明は、上記第１２の発明におい
て、上記黒検出ステップは、上記量子化画像上において
Ｍライン毎に上記検出処理を実行するステップを含むこ
とを特徴とする。In a thirteenth aspect based on the twelfth aspect, the black detecting step includes a step of executing the detecting process for each M lines on the quantized image.

【００２６】第１４の発明は、上記第９の発明におい
て、上記領域抽出ステップにより抽出される黒領域は、
一次元的に連続する黒画素の固まりを含む外接矩形の領
域を含み、上記領域分割ステップは、上記領域抽出ステ
ップにより抽出された黒領域内に含まれる黒画素の固ま
りの開始画素及び終了画素のヒストグラムから、当該開
始画素及び終了画素の密集点を検出する第１の検出ステ
ップと、上記第１の検出ステップでの検出結果に基づい
て上記黒領域を分割し、その分割領域内に存在する画素
群の開始画素及び終了画素について、上記一次元方向と
は異なる次元方向の広がりを検出する第２の検出ステッ
プと、上記第２の検出ステップでの検出結果に基づい
て、上記黒領域の分割を実行する分割ステップとを含む
ことを特徴とする。In a fourteenth aspect based on the ninth aspect, the black area extracted in the area extracting step is:
A circumscribed rectangular area including a cluster of one-dimensionally continuous black pixels is included, and the area dividing step includes a start pixel and an end pixel of a cluster of black pixels included in the black area extracted by the area extracting step. A first detection step of detecting a dense point of the start pixel and the end pixel from the histogram, and dividing the black region based on a detection result in the first detection step; A second detection step of detecting a spread in a dimensional direction different from the one-dimensional direction for the start pixel and the end pixel of the group, and dividing the black area based on a detection result in the second detection step. And a dividing step to be executed.

【００２７】第１５の発明は、請求項１〜７の何れかに
記載の画像処理装置の機能、又は請求項８記載の画像処
理システムの機能を実施するための処理プログラムを、
コンピュータが読出可能に格納した記憶媒体であること
を特徴とする。According to a fifteenth aspect, a processing program for implementing the function of the image processing apparatus according to any one of claims 1 to 7 or the function of the image processing system according to claim 8 is provided.
It is a storage medium that is readable by a computer.

【００２８】第１６の発明は、請求項９〜１４の何れか
に記載の画像処理方法の処理ステップを、コンピュータ
が読出可能に格納した記憶媒体であることを特徴とす
る。According to a sixteenth aspect of the present invention, a computer readable storage medium stores the processing steps of the image processing method according to any one of the ninth to fourteenth aspects.

【００２９】[0029]

【発明の実施の形態】以下、本発明の実施の形態につい
て図面を用いて説明する。Embodiments of the present invention will be described below with reference to the drawings.

【００３０】本発明は、例えば、図１に示すような画像
処理装置１００に適用される。この画像処理装置１００
は、上記図１に示すように、入力画像情報（多値画像情
報）から輝度情報（輝度値）を取得する輝度変換部１０
１と、輝度変換部１０１により得られた輝度情報を１画
像分保持する多値画像メモリ１０２と、多値画像メモリ
１０２内の輝度情報に基づきヒストグラムを生成するス
トグラム演算部１０３と、ヒストグラム演算部１０３に
て得られたヒストグラムに基づき複数の閾値を決定する
閾値決定部１０４と、ヒストグラム演算部１０３にて得
られたヒストグラムに基づき最適二値化閾値等を取得す
る領域解析部１０９と、閾値決定部１０４や領域解析部
１０９にて得られた閾値に基づき二値化画像を作成する
二値化部１０５と、二値化部１０５にて得られた二値化
画像を保持する二値画像メモリ１０６と、二値画像メモ
リ１０６内の二値化画像から黒領域の座標情報を取得す
る黒領域抽出部１０７と、黒領域抽出部１０７にて得ら
れた黒領域を解析して必要に応じて当該黒領域を分割し
て複数の黒領域の情報を取得する黒領域分割部１１０
と、黒領域分割部１１０及び領域解析部１０９にて得ら
れた情報を保持する黒領域座標保持部１０８とを備えて
おり、ヒストグラム演算部１０３及び二値化部１０５
は、黒領域座標保持部１０８内の情報にも基づきそれぞ
れの処理を実行するようになされている。The present invention is applied to, for example, an image processing apparatus 100 as shown in FIG. This image processing apparatus 100
As shown in FIG. 1, the luminance conversion unit 10 acquires luminance information (luminance value) from input image information (multi-valued image information).
1, a multi-valued image memory 102 for holding the luminance information obtained by the luminance converting unit 101 for one image, a stogram calculating unit 103 for generating a histogram based on the luminance information in the multi-valued image memory 102, and a histogram calculating unit A threshold determination unit 104 for determining a plurality of thresholds based on the histogram obtained in 103; an area analysis unit 109 for obtaining an optimal binarization threshold or the like based on the histogram obtained in the histogram calculation unit 103; A binarizing unit 105 for generating a binarized image based on a threshold value obtained by the unit 104 or the region analyzing unit 109; and a binary image memory for holding the binarized image obtained by the binarizing unit 105 106, a black region extracting unit 107 for obtaining coordinate information of the black region from the binary image in the binary image memory 106, and analyzing the black region obtained by the black region extracting unit 107 Obtaining information of a plurality of black regions by dividing the black area as needed Te black area dividing unit 110
And a black region coordinate storage unit 108 for storing information obtained by the black region division unit 110 and the region analysis unit 109. The histogram calculation unit 103 and the binarization unit 105
Are executed based on the information in the black area coordinate storage unit 108.

【００３１】上述のような画像処理装置１００は、例え
ば、図２のフローチャートに従って、次のように動作す
る。The image processing apparatus 100 operates as follows according to, for example, the flowchart of FIG.

【００３２】ステップＳ２０１：輝度変換部１０１に対
して入力される多値画像情報（以下、「対象画像」と言
う）は、ＲＧＢの三原色の情報からなるカラー画像情報
である。輝度変換部１０１は、ＲＧＢで入力された対象
画像を構成する各画素についての輝度情報（輝度値Ｙ）
を、Ｙ＝０．２９９Ｒ＋０．５８７Ｇ＋０．１１４Ｂなる式に従って算出する。多値画像メモリ１０２は、輝
度変換部１０１にて得られた輝度値を、１画像（対象画
像）を構成する画素分保持する。Step S201: The multi-valued image information (hereinafter referred to as "target image") input to the luminance conversion unit 101 is color image information composed of information of three primary colors of RGB. The luminance conversion unit 101 generates luminance information (luminance value Y) for each pixel constituting the target image input in RGB.
Is calculated according to the following equation: Y = 0.299R + 0.587G + 0.114B. The multi-valued image memory 102 holds the luminance values obtained by the luminance conversion unit 101 for the pixels constituting one image (target image).

【００３３】ステップＳ２０２：ヒストグラム演算部１
０３は、多値画像メモリ１０２から、全画像又は一部画
像を構成する画素分の輝度値Ｙを読み出し、輝度につい
てのヒストグラムを生成する。ここでは、多値画像メモ
リ１０２に保持されている対象画像を構成する全ての画
素の輝度値のヒストグラムを生成する。このとき、画像
読取時の解像度及び画像サイズによって、全画素数が変
わってため、画像読取時の解像度及び画像サイズに基づ
き正規化を行った方が好ましい。これは、後述する閾値
決定部１０４において、複数の閾値を決定する際に、ヒ
ストグラム演算部１０３にて得られるヒストグラムの遷
移の様子を利用しており、画素数の変化の傾きや画素数
そのものをパラメータとしているためである。Step S202: Histogram calculation section 1
In step 03, the luminance values Y of the pixels constituting the entire image or a partial image are read from the multi-value image memory 102, and a histogram of the luminance is generated. Here, a histogram of the luminance values of all the pixels constituting the target image held in the multi-valued image memory 102 is generated. At this time, since the total number of pixels changes depending on the resolution and image size at the time of image reading, it is preferable to perform normalization based on the resolution and image size at the time of image reading. This is achieved by using the state of transition of the histogram obtained by the histogram calculation unit 103 when determining a plurality of threshold values in the threshold value determination unit 104 described later, and determining the inclination of the change in the number of pixels and the number of pixels themselves. This is because it is a parameter.

【００３４】ステップＳ２０３：閾値決定部１０４は、
ヒストグラム演算部１０３にて得られたヒストグラムに
基づき、複数の閾値を決定する。Step S203: The threshold value determining unit 104
A plurality of thresholds are determined based on the histogram obtained by the histogram calculator 103.

【００３５】具体的には例えば、対象画像として、図３
（ａ）に示すような文書の多値画像情報が入力され、そ
の輝度情報が画像メモリ１０２に保持されている場合、
ヒストグラム演算部１０３にて得られるヒストグラム
は、図４に示すようなヒストグラムとなる。Specifically, for example, as an object image, FIG.
When multivalued image information of a document as shown in FIG. 9A is input and the luminance information is held in the image memory 102,
The histogram obtained by the histogram calculator 103 is a histogram as shown in FIG.

【００３６】そこで、閾値決定部１０４は、先ず、上記
図４のヒストグラムにおいて、輝度値”２５５”（白）
から輝度値”０”（黒）へと向かって対応する画素数を
参照する。このときの参照点は、サンプリングして４間
隔づつ（”２５５”、”２５１”、”２４７”、・・
・）等のようにしても良い。Therefore, the threshold value determining unit 104 first determines the luminance value “255” (white) in the histogram of FIG.
From the pixel to the brightness value “0” (black). At this time, the reference points are sampled at intervals of four (“255”, “251”, “247”,...).
・), Etc.

【００３７】そして、閾値決定部１０４は、次のような
条件１〜条件６での「条件１を満足一＞条件２を満足一
＞条件３を満足」に該当する輝度値を閾値として検出す
る。条件１：参照点から”１０”逆上った点までの合計画素
数が閾値１より大きい。条件２：急激な減少が発生している（「傾き１より大」
が１回、又は「傾き２より大」の状態が連続して２回発
生している等）。条件３：緩い減少、或いは上昇が発生している（傾き３
より小さい状態）。条件４：参照点から”４０”逆上った点までの合計画素
数が閾値２より大きい。条件５：参照点の画素数が閾値３より小さい。条件６：参照点から輝度値が”２０”前の画素数が閾値
４より大きい。Then, the threshold value determination unit 104 detects a luminance value corresponding to “satisfies condition 1> satisfies condition 2> satisfies condition 3” in the following conditions 1 to 6 as a threshold value. . Condition 1: The total number of pixels from the reference point to the point that goes up by "10" is larger than the threshold value 1. Condition 2: A sharp decrease has occurred (“greater than slope 1”)
Once, or the state of “greater than slope 2” occurs twice in succession). Condition 3: A gradual decrease or increase has occurred (slope 3
Smaller state). Condition 4: The total number of pixels from the reference point to a point that is “40” up is larger than the threshold value 2. Condition 5: the number of pixels at the reference point is smaller than threshold value 3. Condition 6: the number of pixels whose luminance value is “20” before the reference point is larger than threshold value 4.

【００３８】したがって、上記図４に示すように、”４
０１”〜”４０４”で示す４つの閾値が検出されること
になる。これらの閾値４０１〜４０４において、例え
ば、閾値４０２については、「条件４を満足、且つ条件
５を満足、且つ条件６を満足」に該当する閾値として検
出される。すなわち、条件１〜条件３は、ばらつきの少
ない輝度のかたまりを抽出することを目的とした条件で
あり、条件４〜条件６は、ゆるやかに分布する輝度のか
たまりを抽出することを目的とした条件であり、これら
の条件のうち条件４〜条件６に該当する閾値として、閾
値４０２が検出される。Therefore, as shown in FIG.
Four thresholds indicated by 01 "to" 404 "are detected. Among these thresholds 401 to 404, for example, regarding the threshold 402," condition 4 is satisfied, condition 5 is satisfied, and condition 6 is satisfied. It is detected as a threshold value corresponding to “satisfied”. That is, the conditions 1 to 3 are conditions for extracting a cluster of luminance with little variation, and the conditions 4 to 6 are conditions for extracting a cluster of luminance that is slowly distributed. The threshold 402 is detected as a threshold corresponding to the conditions 4 to 6 among these conditions.

【００３９】その後、閾値決定部１０４は、後処理とし
て、上記図４のヒストグラムにおいて、輝度値”０”か
ら輝度値”２５５”へと向かって対応する画素数を参照
して、存在する画素の合計値を算出し、閾値４０１〜４
０４のうち、累積存在画素数が閾値５より小さいところ
に存在する輝度値に対応する閾値を消去する。この結
果、閾値４０１〜４０４のうち、閾値４０４が消去され
る。閾値決定部１０４は、残った閾値４０１〜４０３
を、二値化部１０５へ与える二値化閾値として決定す
る。Thereafter, as a post-processing, the threshold determination unit 104 refers to the number of pixels corresponding to the luminance value “0” to the luminance value “255” in the histogram of FIG. The total value is calculated, and the threshold values 401 to 4
Among 04, the threshold value corresponding to the luminance value existing where the cumulative number of pixels is smaller than the threshold value 5 is deleted. As a result, the threshold 404 among the thresholds 401 to 404 is deleted. The threshold determination unit 104 determines the remaining thresholds 401 to 403
Is determined as a binarization threshold value to be given to the binarization unit 105.

【００４０】尚、上述のような閾値決定部１０４におい
て、後処理での閾値の消去処理は必須の処理ではない。
また、条件１や条件４において、参照点から逆上る値に
ついても、”１０”や”４０”等に限られるものではな
く、任意の値（スキャナの特性に依存した値等）を用い
るようにしてもよい。また、閾値決定部１０４での処理
は、上記図４に示したようなヒストグラム上のノイズに
惑わされないように平均化しながら実行するようにして
もよい。また、条件１〜条件６に従って閾値を検出する
際、参照点において、条件１〜条件６がそれぞれ１回発
生したら、当該参照点の輝度値を閾値として決定するの
ではなく、例えば、２回連続して発生した場合に、当該
参照点の輝度値を閾値として決定するようにしてもよ
い。In the above-described threshold determination section 104, the erasure processing of the threshold in the post-processing is not an essential processing.
Also, in the conditions 1 and 4, the value going up from the reference point is not limited to “10” or “40”, but may be any value (a value depending on the characteristics of the scanner). You may. Further, the processing in the threshold value determination unit 104 may be executed while averaging so as not to be confused by noise on the histogram as shown in FIG. Further, when detecting the threshold value according to the conditions 1 to 6, if the conditions 1 to 6 occur once at the reference point, the luminance value of the reference point is not determined as the threshold value, but, for example, two consecutive times. When the occurrence occurs, the luminance value of the reference point may be determined as the threshold.

【００４１】ステップＳ２０４：二値化部（単純二値化
部）１０５は、閾値決定部１０４にて得られた複数の二
値化閾値、或いは領域解析部１０９にて得られた後述す
る閾値に基づいて、多値画像メモリ１０２に保持されて
いる対象画像（対象画像を構成する各画素の輝度情報）
を二値化して、二値化画像を生成する。二値メモリ１０
６は、二値化部１０５にて得られた二値化画像を保持す
る。Step S204: The binarization section (simple binarization section) 105 converts the plurality of binarization threshold values obtained by the threshold value determination section 104 or the threshold values described later obtained by the region analysis section 109. Based on the target image (the luminance information of each pixel forming the target image)
Is binarized to generate a binarized image. Binary memory 10
Reference numeral 6 holds the binarized image obtained by the binarization unit 105.

【００４２】したがって、例えば、対象画像として上記
図３（ａ）に示した文書の多値画像情報が入力され、二
値化部１０５において、閾値決定部１０４にて得られた
複数の二値化閾値４０１〜４０３に基づいた二値化画像
の生成が行われた場合、二値メモリ１０６には、上記図
３（ｂ）〜（ｃ）に示すような二値化画像が保持される
ことになる。上記図３（ｂ）は、二値化閾値４０１に基
づいた二値化により得られた二値化画像を示したもので
あり、同図（ｃ）は、二値化閾値４０２に基づいた二値
化により得られた二値化画像を示したものであり、同図
（ｄ）は、二値化閾値４０３に基づいた二値化により得
られた二値化画像を示したものである。Therefore, for example, the multi-valued image information of the document shown in FIG. 3A is input as the target image, and the binarization unit 105 outputs a plurality of binarized images obtained by the threshold value determination unit 104. When a binarized image is generated based on the thresholds 401 to 403, the binarized image as shown in FIGS. 3B to 3C is held in the binary memory 106. Become. FIG. 3B shows a binarized image obtained by binarization based on the binarization threshold 401, and FIG. 3C shows a binarized image based on the binarization threshold 402. FIG. 3D shows a binarized image obtained by binarization, and FIG. 4D shows a binarized image obtained by binarization based on the binarization threshold 403.

【００４３】ステップＳ２０５：黒領域抽出部１０７
は、二値メモリ１０６に保持された二値化画像から、黒
領域の座標情報を作成する。例えば、詳細は後述する
が、黒領域抽出部１０７は、上記図４に示した二値化閾
値４０１〜４０３のうち、”白”（輝度値”２５５”）
に対応する輝度値に一番近い閾値４０１に基づいた二値
化により得られた二値化画像（上記図３（ｂ）参照）を
二値画像メモリ２０６から読み出し、その二値化画像に
おいて、”黒”に対応する画素値を有する画素の固まり
の座標を抽出することで、黒領域の座標情報を作成す
る。また、他の閾値により得られた二値化画像について
も同様にして、黒領域の座標情報を作成する。黒領域分
割部１１０は、黒領域抽出部１０７にて得られた黒領域
を解析し、必要に応じて、当該黒領域をさらに分割して
複数の黒領域の情報を取得する。黒領域座標保持部１０
８は、黒領域分割部１１０にて得られた黒領域の座標情
報を保持する。Step S205: Black area extraction unit 107
Creates coordinate information of a black area from the binarized image held in the binary memory 106. For example, although the details will be described later, the black area extraction unit 107 selects “white” (luminance value “255”) among the binarization thresholds 401 to 403 shown in FIG.
The binarized image (see FIG. 3B) obtained by the binarization based on the threshold value 401 closest to the luminance value corresponding to is read out from the binary image memory 206, and in the binary image, By extracting the coordinates of a cluster of pixels having a pixel value corresponding to “black”, coordinate information of a black area is created. Similarly, coordinate information of a black region is created for a binarized image obtained by another threshold. The black region dividing unit 110 analyzes the black region obtained by the black region extracting unit 107, and further divides the black region as necessary to obtain information on a plurality of black regions. Black area coordinate holding unit 10
Numeral 8 holds the coordinate information of the black area obtained by the black area dividing unit 110.

【００４４】ステップＳ２０６：領域解析部１０９は、
詳細は後述するが、黒領域座標保持部１０８に保持され
た全ての黒領域の座標情報に対して、対象黒領域に対応
する画像情報（多値画像メモリ１０２に保持されている
輝度情報）のヒストグラムを、ヒストグラム演算部１０
３により取得し、そのヒストグラムから、対象黒領域が
文字領域であるか否か、文字領域であるならば反転領域
（黒っぽい下地に白っぽい文字が存在する領域）である
か否か、当該下地の平均画素値（平均下地色）、当該文
字の平均画素値（平均文字色）、二値化の際に用いる最
適な二値化閾値等の解析情報（黒領域の属性の解析情
報）を取得する。黒領域座標保持部１０８は、領域解析
部１０９にて得られた各種の解析情報を保持する。Step S206: The area analysis unit 109
As will be described in detail later, the coordinate information of all the black areas held in the black area coordinate holding unit 108 is compared with the image information (luminance information held in the multi-valued image memory 102) corresponding to the target black area. The histogram is calculated by the histogram calculation unit 10
3 to determine whether or not the target black area is a character area, and if it is a character area, whether or not the target black area is an inverted area (an area where a white character exists on a black background), Analysis information (analysis information of the attribute of the black area) such as a pixel value (average background color), an average pixel value (average character color) of the character, and an optimal binarization threshold used for binarization are acquired. The black area coordinate storage unit 108 stores various analysis information obtained by the area analysis unit 109.

【００４５】尚、上記図２に示す点線矢印で示す処理に
ついては後述する。また、本実施の形態では、ＲＧＢの
カラー画像情報を入力としているが、これに限られるこ
とはなく、例えば、ＣＭＹＫやＬａｂ等、如何なる色空
間の情報でもかまわない。また、ＲＧＢのカラー画像情
報を輝度情報に変換して、輝度１次元で二値化を行うよ
うにしているが、これに限られることはなく、例えば、
ＲＧＢ３次元空間を利用して二値化を行っても良い。The processing indicated by the dotted arrow shown in FIG. 2 will be described later. Further, in the present embodiment, RGB color image information is input. However, the present invention is not limited to this, and any color space information such as CMYK or Lab may be used. In addition, RGB color image information is converted into luminance information and luminance is one-dimensionally binarized. However, the present invention is not limited to this.
Binarization may be performed using an RGB three-dimensional space.

【００４６】図５は、上記図２に示したステップＳ２０
５の黒領域抽出処理を具体的に示したものである。すな
わち、黒領域抽出部１０７は、上記図５のフローチャー
トに従って、次のように動作する。FIG. 5 is a flowchart showing the operation of step S20 shown in FIG.
5 specifically illustrates the black region extraction processing of FIG. That is, the black region extraction unit 107 operates as follows according to the flowchart of FIG.

【００４７】ステップＳ５０１：先ず、例えば、上記図
４に示した二値化閾値４０１〜４０３のうち、”白”
（輝度値”２５５”）に対応する輝度値に一番近い二値
化閾値４０１に基づいた二値化により得られた二値化画
像（上記図３（ｂ）参照、以下、「対象二値化画像」と
言う）を二値画像メモリ２０６から読み出し、その対象
二値化画像に対して、予め設定されたサンプリングパラ
メータｓａｍｐｌｅ＿ｔａｔｅにより示されるライン数
毎に、本処理を実行するためのカウンタｊ（縦方向（副
走査方向）のカウンタ）を”０”にリセットする。Step S501: First, for example, among the binarization thresholds 401 to 403 shown in FIG.
(Binarized image obtained by binarization based on the binarization threshold 401 closest to the luminance value corresponding to (luminance value “255”) (see FIG. 3B, hereinafter, “target binary”). ) Is read out from the binary image memory 206, and a counter j () for executing the present process for the target binary image for each number of lines indicated by a preset sampling parameter sample_state The counter in the vertical direction (sub-scanning direction) is reset to “0”.

【００４８】ステップＳ５０２：カウンタｊが、予め設
定された値ｈｅｉｇｈｔ（対象二値化画像の縦幅）に達
したか否かを判別する。この判別の結果、「ｊ＜ｈｅｉ
ｇｈｔ」でない場合、対象二値化画像に対する本処理が
終了したと認識して本処理を終了する。一方、「ｊ＜ｈ
ｅｉｇｈｔ」である場合には、次のステップＳ５０３か
らの処理を実行する。Step S502: It is determined whether or not the counter j has reached a preset value height (the vertical width of the target binarized image). As a result of this determination, "j <hei
If it is not “ght”, it recognizes that this processing for the target binary image has been completed, and ends this processing. On the other hand, “j <h
If "right", the processing from the next step S503 is executed.

【００４９】ステップＳ５０３：ステップＳ５０２の判
別の結果、「ｊ＜ｈｅｉｇｈｔ」である場合、詳細は後
述するが、カウンタｊにより示される対象二値化画像上
の１ラインを構成する各画素において、”黒”に対応す
る画素値を有する画素（黒画素）の固まりの組を検出す
る。本ステップＳ５０３にて検出された黒画素の固まり
の組数を”ｌｉｎｅｎｕｍ”で示す。また、ある１つの
黒画素の固まりの組において、当該固まりの始めの画素
の座標をｘｓｔａｒｔ［ｘ］で示し、終わりの画素の座
標をｘｅｎｄ［ｘ］で示す。Step S503: If "j <height" is determined as a result of the determination in step S502, as will be described in detail later, in each pixel constituting one line on the target binary image indicated by the counter j, " A set of pixels (black pixels) having a pixel value corresponding to “black” is detected. The number of sets of black pixel clusters detected in step S503 is indicated by “linenum”. In a set of one black pixel block, the coordinates of the start pixel of the block are indicated by xstart [x], and the coordinates of the end pixel are indicated by xend [x].

【００５０】ステップＳ５０４：ステップＳ５０３にて
検出された黒画素の固まりの各組（”ｌｉｎｅｎｕｍ”
で示される数分の組）のそれぞれに対して、ステップＳ
５０５〜Ｓ５０９の処理を実行するためのカウンタｌ
を”０”にリセットする。Step S504: Each set ("linenum") of a set of black pixels detected in step S503
Step S)
Counter l for executing the processing of 505 to S509
Is reset to “0”.

【００５１】ステップＳ５０５：カウンタｌが、組数ｌ
ｉｎｅｎｕｍに達したか否かを判別する。この判別の結
果、「ｌ＜ｌｉｎｅｎｕｍ」でない場合、”ｌｉｎｅｎ
ｕｍ”で示される数分の全ての組に対して、ステップＳ
５０５〜Ｓ５０９の処理を実行し終えたと認識し、後述
するステップＳ５１０へと進む。一方、「ｌ＜ｌｉｎｅ
ｎｕｍ」である場合、次のステップＳ５０６からの処理
を実行する。Step S505: Counter l is set to l
It is determined whether or not inenum has been reached. As a result of this determination, if “l <linenum” is not satisfied, “linen
um ”for all sets of several minutes.
It recognizes that the processes of 505 to S509 have been executed, and proceeds to step S510 described later. On the other hand, "l <line
num ”, the processing from the next step S506 is executed.

【００５２】ステップＳ５０６：ステップＳ５０５の判
別の結果、「ｌ＜ｌｉｎｅｎｕｍ」である場合、カウン
タｌで示される黒画素の固まりの組の座標ｘｓｔａｒｔ
［ｌ］及び座標ｘｅｎｄ［ｌ］と、後述する既に存在す
る全ての構造体Ｇｒｏｕｐの座標とを比較して、座標ｘ
ｓｔａｒｔ［ｌ］及び座標ｘｅｎｄ［ｌ］により示され
る黒画素の固まりと、当該構造体Ｇｒｏｕｐとの結合処
理を行う。Step S506: If "l <linenum" as a result of the determination in step S505, the coordinates xstart of the set of black pixel clusters indicated by the counter l
[L] and the coordinates xend [l] are compared with the coordinates of all the existing structures Group described later, and the coordinates x
A connection process is performed between a group of black pixels indicated by start [l] and coordinates xend [l] and the structure Group.

【００５３】ステップＳ５０７：座標ｘｓｔａｒｔ
［ｌ］及び座標ｘｅｎｄ［ｌ］により示される黒画素の
固まりが、上記既に存在する全ての構造体Ｇｒｏｕｐの
何れとも結合されなかったか否かを判別する。この判別
の結果、座標ｘｓｔａｒｔ［ｌ］及び座標ｘｅｎｄ
［ｌ］により示される黒画素の固まりが、上記既に存在
する全ての構造体Ｇｒｏｕｐの何れかと結合された場合
には、次のステップＳ５０８は実行せずに、そのまま後
述するテップＳ５０９へと進む。Step S507: Coordinate xstart
It is determined whether or not the cluster of black pixels indicated by [l] and the coordinate xend [l] has not been combined with any of the above-mentioned existing structures Group. As a result of this determination, the coordinates xstart [l] and the coordinates xend
If the cluster of black pixels indicated by [1] is combined with any of the existing structural groups, the next step S508 is not executed, and the process directly proceeds to step S509 described later.

【００５４】ステップＳ５０８：ステップＳ５０７での
判別の結果、座標ｘｓｔａｒｔ［ｌ］及び座標ｘｅｎｄ
［ｌ］により示される黒画素の固まりが、上記既に存在
する全ての構造体Ｇｒｏｕｐの何れかとも結合されなか
った場合、座標ｘｓｔａｒｔ［ｌ］及び座標ｘｅｎｄ
［ｌ］により示される黒画素の固まりを、新規の構造体
Ｇｒｏｕｐへ登録する。この登録された構造体Ｇｒｏｕ
ｐが、上述したステップＳ５０６での結合処理にて用い
られる。尚、ここでの”構造体Ｇｒｏｕｐ”についての
詳細は後述する。Step S508: As a result of the judgment in step S507, the coordinates xstart [l] and the coordinates xend
If the cluster of black pixels indicated by [l] is not connected to any of the above-mentioned existing structures Group, the coordinates xstart [l] and the coordinates xend
The cluster of black pixels indicated by [l] is registered in a new structure Group. This registered structure Grou
p is used in the combining process in step S506 described above. The details of the “structure group” here will be described later.

【００５５】ステップＳ５０９：座標ｘｓｔａｒｔ
［ｌ］及び座標ｘｅｎｄ［ｌ］により示される黒画素の
固まりの次の固まりの組に対して、ステップＳ５０５か
らの処理を実行するために、カウンタｌをインクリメン
ト（ｌ＝ｌ＋１）して、ステップＳ５０５へと戻る。Step S509: coordinate xstart
The counter l is incremented (l = l + 1) to execute the processing from step S505 on the next set of black pixels indicated by [l] and the coordinates xend [l]. It returns to S505.

【００５６】ステップＳ５１０：上述のようにして、黒
画素の固まりの全ての組に対して、ステップＳ５０５〜
ステップＳ５０９の処理が実行し終えると、縦方向のカ
ウンタｊに対して、サンプリングパラメータｓａｍｐｌ
ｅ＿ｔａｔｅを加算して、ステップＳ５０２へと戻る。
これにより、サンプリングパラメータｓａｍｐｌｅ＿ｔ
ａｔｅが、例えば、”１”である場合には、対象二値化
画像に対して１ライン毎に本処理が実行され、”４”で
ある場合には、対象二値化画像に対して４ライン毎に本
処理が実行されることになる。Step S510: As described above, steps S505 to S505 are executed for all the sets of the black pixel clusters.
When the processing of step S509 is completed, the sampling parameter sampl is added to the vertical counter j.
e_state is added, and the process returns to step S502.
Thereby, the sampling parameter sample_t
If ate is, for example, “1”, this processing is executed for each line of the target binary image, and if “ate” is “4”, 4 is applied to the target binary image. This processing is executed for each line.

【００５７】図６は、上記図５に示したステップＳ５０
３の黒画素の固まりの組の検出処理を具体的に示したも
のである。FIG. 6 is a flowchart showing step S50 shown in FIG.
3 specifically illustrates a detection process of a set of a set of black pixels of No. 3;

【００５８】ステップＳ６０１：先ず、対象ライン（縦
方向のカウンタｊにより示される対象二値化画像におけ
るライン）を構成する全ての画素に対して、予め設定さ
れたサンプリングパラメータｓａｍｐｌｅ＿ｙｏｋｏに
より示される画素数毎に、本処理を実行するためのカウ
ンタｉ（横方向（主走査方向）のカウンタ）を”０”に
リセットする。また、後述するステップ処理にて用いる
フラグｓｔａｒｔｆｌａｇを”ＯＦＦ”にセットすると
共に、黒画素の固まりの組数を示す”ｌｉｎｅｎｕｍ”
を”０”にリセットする。Step S601: First, for every pixel constituting the target line (the line in the target binary image indicated by the vertical counter j), for each pixel number indicated by the preset sampling parameter sample_yoko Then, the counter i (the counter in the horizontal direction (main scanning direction)) for executing this processing is reset to “0”. In addition, a flag "startflag" used in the step processing described later is set to "OFF", and "linenum" indicating the number of groups of black pixels is set.
Is reset to “0”.

【００５９】ステップＳ６０２：カウンタｉが、予め設
定された値ｗｉｄｔｈ（対象二値化画像の横幅）に達し
たか否かを判別する。この判別の結果、「ｉ＜ｗｉｄｔ
ｈ」でない場合、後述するステップＳ６１３へと進み、
「ｉ＜ｗｉｄｔｈ」である場合、次のステップＳ６０３
へと進む。Step S602: It is determined whether or not the counter i has reached a predetermined value width (width of the target binarized image). As a result of this determination, “i <widt
h ”, the process proceeds to step S613 described below,
If “i <width”, the next step S603
Proceed to.

【００６０】ステップＳ６０３：ステップＳ６０２の判
別の結果、「ｉ＜ｗｉｄｔｈ」である場合、対象ライン
において、カウンタｉで示される画素を中心として、横
ｍ×縦ｎ画素のウィンドウエリアを設け、そのエリアに
対して、予め設定された横ｍ×縦ｎ画素のパターンデー
タによるパタンマッチングを行う。そして、上記ウィン
ドウエリアとパターンデータが一致しているか否かを判
別する。この判別の結果、一致している場合には次のス
テップＳ６０４へ進み、不一致である場合には後述する
ステップＳ６０７へ進む。Step S603: If “i <width” as a result of the discrimination in step S602, a window area of width m × n pixels is provided around the pixel indicated by the counter i in the target line, and the area is set. , Pattern matching is performed using preset pattern data of horizontal m × vertical n pixels. Then, it is determined whether or not the window area matches the pattern data. As a result of this determination, if they match, the process proceeds to the next step S604, and if they do not match, the process proceeds to step S607 described later.

【００６１】尚、ステップＳ６０３において、パタンマ
ッチングを行う横ｍ×縦ｎ画素のウィンドウエリアとし
ては、例えば、横７×縦１画素のように横方向のみ参照
するエリアであってもよいし、横３×縦３画素のように
上下参照するエリアであってもよい。また、パタンマッ
チングの代わりに、次のような条件式１や２等の条件を
用いて、ステップＳ６０４へ進むか、ステップＳ６０７
へ進むかを決定するようにしてもよい。条件式１：カウンタｉで示される画素（注目画素）が黒
画素である。条件式２：注目画素の隣あう２つの画素が黒画素であ
る。In step S603, the window area of horizontal m × vertical n pixels for performing pattern matching may be, for example, an area which is referred to only in the horizontal direction, such as horizontal 7 × vertical 1 pixel, or horizontal. It may be an area that is vertically referenced, such as 3 × 3 vertical pixels. In addition, instead of pattern matching, the process proceeds to step S604 using conditions such as the following conditional expressions 1 and 2, or proceeds to step S607.
It may be determined whether to proceed to. Conditional expression 1: The pixel indicated by the counter i (pixel of interest) is a black pixel. Conditional expression 2: Two pixels adjacent to the target pixel are black pixels.

【００６２】ステップＳ６０４：ステップＳ６０３の判
別の結果、上記ウィンドウエリアとパターンデータが一
致している場合、フラグｓｔａｒｔｆｌａｇが”ＯＦ
Ｆ”であるか否かを判別する。この判別の結果、「ｓｔ
ａｒｔｆｌａｇ＝ＯＦＦ」である場合、次のステップＳ
６０５へと進み、「ｓｔａｒｔｆｌａｇ＝ＯＦＦ」でな
い場合には、次のステップＳ６０５及びＳ６０６は実行
せずに、そのまま後述するステップＳ６１２へと進む。Step S604: If the window area and the pattern data match as a result of the determination in step S603, the flag startflag is set to "OF".
F "is determined. As a result of this determination," st "
If "artflag = OFF", the next step S
Proceeding to step 605, if not “startflag = OFF”, the next step S605 and S606 are not executed, and the process directly proceeds to step S612 described later.

【００６３】ステップＳ６０５：ステップＳ６０４の判
別の結果、「ｓｔａｒｔｆｌａｇ＝ＯＦＦ」である場
合、黒画素の固まりの組の始めの画素の座標を示すｘｓ
ｔａｒｔ［ｌｉｎｅｎｕｍ］に対して、カウンタｉを設
定する。すなわち、現在の”ｌｉｎｅｎｕｍ”の値で示
される黒画素の固まりの組（対象組）の始めの画素の座
標を示すｘｓｔａｒｔ［ｌｉｎｅｎｕｍ］に対して、カ
ウンタｉで示される画素（注目画素）の座標を設定す
る。また、フラグｓｔａｒｔｆｌａｇに対して、ｘｓｔ
ａｒｔ［ｌｉｎｅｎｕｍ］への設定が終了し、ｘｅｎｄ
［ｌｉｎｅｎｕｍ］への設定が未だ終了していないこと
を示すために、”ＯＮ”を設定する。Step S605: If “startflag = OFF” as a result of the determination in step S604, xs indicating the coordinates of the first pixel of the set of black pixels is set.
A counter i is set for start [linenum]. That is, with respect to xstart [linenum] indicating the coordinates of the first pixel of the set of black pixels (target set) indicated by the current value of “linenum”, the coordinates of the pixel (target pixel) indicated by the counter i Set. Also, for the flag startflag, xst
art [linenum] has been set and xend
“ON” is set to indicate that the setting to [linenum] has not been completed.

【００６４】ステップＳ６１２：カウンタｉ（横方向
（主走査方向）のカウンタ）画素に対して、サンプリン
グパラメータｓａｍｐｌｅ＿ｙｏｋｏを加算して、ステ
ップＳ６０２へと戻る。これにより、サンプリングパラ
メータｓａｍｐｌｅ＿ｙｏｋｏが、例えば、”１”であ
る場合には、対象ラインに対して１画素毎に本処理が実
行され、”４”である場合には、対象ラインに対して４
画素毎に本処理が実行されることになる。そして、ステ
ップＳ６０２において、「ｉ＜ｗｉｄｔｈ」でないと判
別されると、次のステップＳ６１３へと進む。Step S612: The sampling parameter sample_yoko is added to the pixel of the counter i (counter in the horizontal direction (main scanning direction)), and the process returns to step S602. Accordingly, when the sampling parameter sample_yoko is, for example, “1”, the present process is executed for each pixel on the target line, and when the sampling parameter sample_yoko is “4”, 4 for the target line.
This processing is executed for each pixel. If it is determined in step S602 that “i <width” is not satisfied, the process proceeds to the next step S613.

【００６５】ステップＳ６１３：ステップＳ６０２の判
別の結果、「ｉ＜ｗｉｄｔｈ」でない場合、フラグｓｔ
ａｒｔｆｌａｇが”ＯＮ”であるか否かを判別する。こ
の判別の結果、「ｓｔａｒｔｆｌａｇ＝ＯＮ」でない場
合、すなわち「ｉ＜ｗｉｄｔｈ」でなく、且つ「ｓｔａ
ｒｔｆｌａｇ＝ＯＮ」でない場合、対象ラインの黒画素
の固まりの全ての組の検出が終了したと認識し、本処理
を終了する。一方、「ｓｔａｒｔｆｌａｇ＝ＯＮ」であ
る場合には、後述するステップＳ６０８へと進む。Step S613: If the result of determination in step S602 is that "i <width" is not satisfied, the flag st
It is determined whether or not artflag is “ON”. If the result of this determination is not “startflag = ON”, that is, not “i <width” and “staflag = ON”,
If “rtflag = ON” is not satisfied, it is recognized that the detection of all the sets of black pixels on the target line has been completed, and this processing ends. On the other hand, if “startflag = ON”, the process proceeds to step S608 described below.

【００６６】ステップＳ６０７：また、上述したステッ
プＳ６０３において、対象ラインのカウンタｉで示され
る画素を中心とした横ｍ×縦ｎ画素のウィンドウエリア
と、予め設定された横ｍ×縦ｎ画素のパターンデータと
が一致しないと判別された場合、フラグｓｔａｒｔｆｌ
ａｇが”ＯＮ”であるか否かを判別する。この判別の結
果、「ｓｔａｒｔｆｌａｇ＝ＯＮ」でない場合、次のス
テップＳ６０８〜ステップＳ６１１の処理は実行せず
に、そのまま上述したステップＳ６１２へ進む。一方、
「ｓｔａｒｔｆｌａｇ＝ＯＮ」である場合には、次のス
テップＳ６０８へと進む。Step S607: Also, in the above-mentioned step S603, a window area of horizontal m × vertical n pixels centering on the pixel indicated by the counter i of the target line and a predetermined pattern of horizontal m × vertical n pixels If it is determined that the data does not match, the flag startfl
It is determined whether or not ag is “ON”. If the result of this determination is not “startflag = ON”, the processing proceeds to the above-described step S612 without executing the processing of the next steps S608 to S611. on the other hand,
If “startflag = ON”, the process proceeds to the next step S608.

【００６７】ステップＳ６０８：ステップＳ６１３又は
ステップＳ６０７にて、「ｓｔａｒｔｆｌａｇ＝ＯＮ」
であると判別された場合、黒画素の固まりの組の終わり
の画素の座標を示すｘｅｎｄ［ｌｉｎｅｎｕｍ］に対し
て、”カウンタｉ−１”を設定する。すなわち、”ｌｉ
ｎｅｎｕｍ”で示される黒画素の固まりの組（対象組）
の終わりの画素の座標を示すｘｅｎｄ［ｌｉｎｅｎｕ
ｍ］に対して、カウンタｉで示される画素（注目画素）
の前の画素の座標を設定する。Step S608: In step S613 or step S607, "startflag = ON"
When it is determined that “xend [linenum]” indicates the coordinates of the pixel at the end of the set of black pixels, “counter i−1” is set. That is, "li
set of black pixels indicated by “nenum” (target set)
Xend [linemenu indicating the coordinates of the pixel at the end of
m], the pixel indicated by the counter i (pixel of interest)
Set the coordinates of the pixel before.

【００６８】ステップＳ６０９：”ｘｅｎｄ［ｌｉｎｅ
ｎｕｍ］−ｘｓｔａｒｔ［ｌｉｎｅｎｕｍ］＋１”の結
果が、予め設定された、対象二値化画像の横幅の制限値
ｄｅｌ＿ｗｉｄｔｈより大きいか否かを判別する。この
判別の結果、「”ｘｅｎｄ［ｌｉｎｅｎｕｍ］−ｘｓｔ
ａｒｔ［ｌｉｎｅｎｕｍ］＋１”＞ｄｅｌ＿ｗｉｄｔ
ｈ」でない場合、次のステップＳ６１０の処理は実行せ
ずに、そのまま後述するステップＳ６１１へ進み、「”
ｘｅｎｄ［ｌｉｎｅｎｕｍ］−ｘｓｔａｒｔ［ｌｉｎｅ
ｎｕｍ］＋１”＞ｄｅｌ＿ｗｉｄｔｈ」である場合に、
次のステップＳ６１１へ進む。Step S609: "xend [line
It is determined whether or not the result of [num] -xstart [linenum] +1 "is greater than a preset limit value del_width of the target binarized image. As a result of this determination,""xend [linenum]- xst
art [linenum] +1 "> del_widt
If it is not "h", the process of the next step S610 is not executed, and the process directly proceeds to step S611, which will be described later.
xend [linenum] -xstart [line
num] +1 ">del_width",
The process proceeds to the next step S611.

【００６９】ステップＳ６１０：ステップＳ６０９の判
別の結果、「”ｘｅｎｄ［ｌｉｎｅｎｕｍ］−ｘｓｔａ
ｒｔ［ｌｉｎｅｎｕｍ］＋１”＞ｄｅｌ＿ｗｉｄｔｈ」
である場合、”ｘｓｔａｒｔ［ｌｉｎｅｎｕｍ］”及
び”ｘｅｎｄ［ｌｉｎｅｎｕｍ］”で示される画素の固
まりの組を、現在の”ｌｉｎｅｎｕｍ”の値で示される
黒画素の固まりの組として決定し、次の組の検出のため
に、”ｌｉｎｅｎｕｍ”をインクリメントする。Step S610: As a result of the determination in step S609, "" xend [linenum] -xsta
rt [linenum] +1 ">del_width"
, The set of pixels indicated by “xstart [linenum]” and “xend [linenum]” is determined as the set of black pixels indicated by the current value of “linenum”. "Linenum" is incremented for the detection of.

【００７０】ステップＳ６１１：フラグｓｔａｒｔｆｌ
ａｇを”ＯＦＦ”に設定して、上述したステップＳ６１
２へ進む。尚、ステップＳ６０９の判別の結果、「”ｘ
ｅｎｄ［ｌｉｎｅｎｕｍ］−ｘｓｔａｒｔ［ｌｉｎｅｎ
ｕｍ］＋１”＞ｄｅｌ＿ｗｉｄｔｈ」でない場合、ステ
ップＳ６０９は実行されないため、すなわち”ｌｉｎｅ
ｎｕｍ”をインクリメントされないため、現在の”ｘｓ
ｔａｒｔ［ｌｉｎｅｎｕｍ］”及び”ｘｅｎｄ［ｌｉｎ
ｅｎｕｍ］”の値は、ステップＳ６１１からの処理にて
上書きされる。すなわち、”ｘｓｔａｒｔ［ｌｉｎｅｎ
ｕｍ］”及び”ｘｅｎｄ［ｌｉｎｅｎｕｍ］”で示され
る画素の固まりの組は、黒画素の固まりの組として認識
されず、事実上その組は削除されることになる。Step S611: Flag startfl
ag is set to “OFF” and the above-described step S61 is performed.
Proceed to 2. Note that as a result of the determination in step S609, "" x
end [linenum] -xstart [linen
um] +1 ”> del_width”, the step S609 is not executed, ie, “line
num "is not incremented, so the current" xs
start [linenum] "and" xend [lin
enum] ”is overwritten in the processing from step S611. That is,“ xstart [linen]
um] "and" xend [linenum] "are not recognized as a set of black pixels, and are effectively deleted.

【００７１】図７は、上述のようにして得られた黒画素
の固まりの組の構造体の一例を示したものである。FIG. 7 shows an example of a structure of a set of a group of black pixels obtained as described above.

【００７２】上記図６に示した処理では、黒画素の固ま
りの組が、上記図７の”７０１”で示す構造体Ｐａｒｔ
ｓとして生成される。ここでは、開くウィンドウを１次
元として、Ｘ方向の座標のみを有するものとしている
が、これに限られることはなく、上記図７の”７０１
ａ”で示すように、ウィンドウを２次元として、構造体
Ｐａｒｔｓのメンバーにｙ方向の情報をも含ませるよう
にしてもよい。In the processing shown in FIG. 6, a set of black pixels is converted into a structure Part 701 indicated by "701" in FIG.
s. Here, the window to be opened is one-dimensional and has only the coordinates in the X direction. However, the present invention is not limited to this.
As shown by "a", the window may be two-dimensional, and the members of the structure Parts may include information in the y direction.

【００７３】構造体Ｐａｒｔｓは、上述したｘｓｔａｒ
ｔ［ｌｉｎｅｎｕｍ］及びｘｅｎｄ［ｌｉｎｅｎｕｍ］
に対応する黒画素の固まりの組の開始座標及び終了座標
を示す情報（メンバー）”ｓｔａｒｔｘ”及び”ｅｎｄ
ｘ”と、次の情報のアドレス情報”ｎｅｘｔ”とを含ん
でいる。アドレス情報”ｎｅｘｔ”には、次の情報がな
い場合には”ＮＵＬＬ”が設定される。The structure Parts is the xstar described above.
t [linenum] and xend [linenum]
(Members) "startx" and "end" indicating the start coordinates and end coordinates of a set of black pixels corresponding to
x "and address information" next "of the next information, and" NULL "is set in the address information" next "when there is no next information.

【００７４】上記図５に示した処理では、上記図６に示
した処理にて得られた黒画素の固まりの組（構造体Ｐａ
ｒｔｓ）が結合される、上記図７の”７０２”で示す構
造体Ｇｒｏｕｐが生成される。構造体Ｇｒｏｕｐは、開
始座標及び終了座標を２次元的に示す情報”ｓｘ”、”
ｓｙ”、”ｅｘ”、及び”ｅｙ”と、構造体Ｇｒｏｕｐ
を構築する構造体Ｐａｒｔｓの集まりの１番目の構造体
Ｐａｒｔｓのアドレスの情報”ｆｉｒｓｔ”と、構造体
Ｇｒｏｕｐを構築する構造体Ｐａｒｔｓの集まりの最後
の構造体Ｐａｒｔｓのアドレスの情報”ｌａｓｔとを含
み、これらの情報は、黒領域抽出部１０７で生成され
る。In the processing shown in FIG. 5, a set of black pixels (structure Pa) obtained by the processing shown in FIG.
rts) are combined, and a structure Group indicated by “702” in FIG. 7 is generated. The structure Group includes information “sx” and “sx” that indicate the start coordinate and the end coordinate two-dimensionally.
sy ”,“ ex ”, and“ ey ”, and the structure Group
The information “first” of the address of the first structure Parts of the collection of the structures Parts that constructs the structure “Parts”, and the information “last” of the address of the last structure Parts of the collection of the structures Parts that construct the structure Group These pieces of information are generated by the black region extraction unit 107.

【００７５】また、構造体Ｇｒｏｕｐは、構造体Ｇｒｏ
ｕｐが示す領域を二値化するのにふさわしい閾値の情
報”Ｔｈｒｅｓｈｏｌｄ”、構造体Ｇｒｏｕｐが示す領
域が文字領域であるか文字以外の領域（画像領域等）で
あるかを示すフラグ情報”ＣｈａｒａＦｌｇ”と、構造
体Ｇｒｏｕｐが示す領域が文字領域である場合に当該領
域が反転文字領域（下地が黒っぽく、文字部が白っぽい
領域）であるか否かを示すフラグ情報”ＩｎｖｅｒｔＦ
ｌｇ”と、構造体Ｇｒｏｕｐが示す領域の黒領域部分の
形状が四角であるか否かを示すフラグ情報”Ｓｈａｐｅ
Ｒｅｃｔａｎｇｌｅ”と、構造体Ｇｒｏｕｐが示す領域
が抽出された二値化画像を作成した閾値より１つ前の閾
値にて作成された二値化画像上にほぼ同じオブジェクト
が存在するか否かを示すフラグ情報”ＳａｍｅＡｓＭｏ
ｔｈｅｒ”と、次の構造体Ｇｒｏｕｐのアドレスを示す
ポインタ情報”ｎｅｘｔ”とを含み、これらの情報は、
領域解析部１０９にて生成される。ポインタ情報”ｎｅ
ｘｔ”には、次の構造体Ｇｒｏｕｐがない場合には”Ｎ
ＵＬＬ”が設定される。The structure Group is composed of the structure Gro
Threshold information “Threshold” suitable for binarizing the area indicated by “up”, and flag information “CharaFlg” indicating whether the area indicated by the structure Group is a character area or a non-character area (eg, an image area). And flag information "InvertF indicating whether or not the region indicated by the structure Group is a character region, whether or not the region is an inverted character region (a black background and a white character portion).
lg "and flag information" Shape indicating whether or not the shape of the black area portion of the area indicated by the structure Group is square.
Rectangle ”indicates whether or not substantially the same object exists on the binarized image created with the threshold value one immediately before the threshold value that created the binarized image in which the region indicated by the structure Group is extracted. Flag information "SameAsMo
ther "and pointer information" next "indicating the address of the next structure Group.
It is generated by the area analysis unit 109. Pointer information "ne
xt ”includes“ N ”when there is no next structure Group.
"ULL" is set.

【００７６】図８（ａ）〜（ｆ）は、黒領域抽出部１０
７での上記図５及び図６に示した処理の実行により、対
象二値化画像上では、どのような処理が実際に行われる
かを具体的に示したものである。FIGS. 8A to 8F show the black region extracting unit 10.
By executing the processing shown in FIGS. 5 and 6 in FIG. 7, what kind of processing is actually performed on the target binarized image is specifically shown.

【００７７】上記図８（ａ）参照：先ず、上記図５に示
したステップＳ５０３において、二値化画像上のｙ座標
＝”５”のライン（上記図５の処理でのカウンタｊが”
５”であるときの対象ライン）上の、ｘｓｔａｒｔ
［５］及びｘｅｎｄ［１５］で示される黒画素の固まり
の組（構造体Ｐａｒｔｓ）８０１と、ｘｓｔａｒｔ［５
０］及びｘｅｎｄ［６５］で示される黒画素の固まりの
組（構造体Ｐａｒｔｓ）８０２とが検出されたとする。Referring to FIG. 8A, first, in step S503 shown in FIG. 5, the line of the y coordinate = “5” on the binarized image (the counter j in the processing of FIG.
Xstart on the target line when it is 5 ")
A set (structure Parts) 801 of a group of black pixels indicated by [5] and xend [15], and xstart [5]
0] and xend [65], and a set (structure Parts) 802 of a group of black pixels is detected.

【００７８】このとき、既に存在する構造体Ｇｒｏｕｐ
は無い状態であるので、ステップＳ５０８において、新
規の２つの構造体Ｇｒｏｕｐが作成され、これらの構造
体Ｇｒｏｕｐのメンバーとして、構造体Ｐａｒｔｓ８０
１，８０２がそれぞれ登録（設定）される。At this time, the existing structure Group
In step S508, two new structure groups are created, and as a member of these structure groups, the structure Parts80 is created.
1, 802 are respectively registered (set).

【００７９】この場合、構造体Ｐａｒｔｓ８０１につい
ての、上記図７に示した構造体Ｐａｒｔｓ及び構造体Ｇ
ｒｏｕｐの各メンバーの値には、次のような値が設定さ
れる。ｓｔａｒｔｘ＝５（ｘｓｔａｒｔ［５］の値）ｅｎｄｘ＝１５（ｘｅｎｄ［１５］の値）ｓｘ＝５（ｘｓｔａｒｔ［５］の値）ｓｙ＝５（カウンタｊの値）ｅｘ＝１５（ｘｅｎｄ［１５］の値）ｅｙ＝６（ｓａｍｐｌｅ＿ｔａｔｅ＝１とし
て、”ｊ＋ｓａｍｐｌｅ＿ｔａｔｅ”の値）ｆｉｒｓｔ＝構造体Ｐａｒｔｓ８０１のアドレスｌａｓｔ＝構造体Ｐａｒｔｓ８０１のアドレスまた、構造体Ｐａｒｔｓ８０２についても、構造体Ｐａ
ｒｔｓ８０１の場合と同様にして、上記図７に示した構
造体Ｐａｒｔｓ及び構造体Ｇｒｏｕｐの各メンバーに該
当する値が設定される。尚、上記図８では、構造体Ｐａ
ｒｔｓを黒塗りで示し、構造体Ｇｒｏｕｐを白枠で示し
ている。In this case, with respect to the structure Parts 801, the structure Parts and the structure G shown in FIG.
The following values are set as the values of the members of the group. startx = 5 (value of xstart [5]) endx = 15 (value of xend [15]) sx = 5 (value of xstart [5]) sy = 5 (value of counter j) ex = 15 (xend [15] Ey = 6 (value of “j + sample_date” where sample_state = 1) first = address of structure Parts 801 last = address of structure Parts 801 Also, for structure Parts 802, structure Pa
As in the case of rts801, values corresponding to the members of the structure Parts and the structure Group shown in FIG. 7 are set. In FIG. 8, the structure Pa
rts is shown in black, and the structure Group is shown in a white frame.

【００８０】上記図８（ｂ）参照：次に、ステップＳ５
０３において、二値化画像上のｙ座標＝”５”のライン
の次のライン（カウンタｊがカウントアップされ”６”
であるときの対象ライン）上の、ｘｓｔａｒｔ［０］及
びｘｅｎｄ［１０］で示される黒画素の固まりの組（構
造体Ｐａｒｔｓ）８０３と、ｘｓｔａｒｔ［４５］及び
ｘｅｎｄ［７５］で示される黒画素の固まりの組（構造
体Ｐａｒｔｓ）８０４と、ｘｓｔａｒｔ［１００］及び
ｘｅｎｄ［１１０］で示される黒画素の固まりの組（構
造体Ｐａｒｔｓ）８０５との３つの組が検出されたとす
る。Referring to FIG. 8B: Next, at step S5
At 03, the line next to the line of the y coordinate = “5” on the binarized image (counter j is counted up to “6”)
, A set (structure Parts) 803 of black pixels indicated by xstart [0] and xend [10], and black pixels indicated by xstart [45] and xend [75] It is assumed that three sets of a set (structure Parts) 804 and a set of black pixels (structure Parts) 805 indicated by xstart [100] and xend [110] are detected.

【００８１】このとき、既に存在する構造体Ｇｒｏｕｐ
としては、構造体Ｐａｒｔｓ８０１，８０２の構造体Ｇ
ｒｏｕｐ８０１’，８０２’が存在するため、ステップ
Ｓ５０６にて、構造体Ｇｒｏｕｐ８０１’，８０２’に
対する構造体Ｐａｒｔｓ８０３〜８０５の結合処理（座
標比較による結合処理）が行われる。At this time, the already existing structure Group
As the structure G of the structures Parts 801 and 802
Since the groups 801 ′ and 802 ′ exist, in step S 506, the joining processing of the structural parts 803 to 805 (joining processing by coordinate comparison) to the structural groups 801 ′ and 802 ′ is performed.

【００８２】上記結合処理により、例えば、構造体Ｐａ
ｒｔｓ８０３は、構造体Ｇｒｏｕｐ８０１’と接触して
いるので、その構造体Ｇｒｏｕｐ８０１’に結合され
る。この結果、構造体Ｇｒｏｕｐ８０１’は、上記図８
（ｃ）の”８０６”に示す構造体Ｇｒｏｕｐに成長す
る。また、構造体Ｐａｒｔｓ８０４は、構造体Ｇｒｏｕ
ｐ８０２’と接触しているので、その構造体Ｇｒｏｕｐ
８０２’に結合される。この結果、構造体Ｇｒｏｕｐ８
０２’は、上記図８（ｃ）の”８０７”に示す構造体Ｇ
ｒｏｕｐに成長する。構造体Ｐａｒｔｓ８０５について
は、構造体Ｇｒｏｕｐ８０１’，８０２’の何れにも接
触していないので（結合できない）、ステップＳ５０８
において、新規の構造体Ｇｒｏｕｐが作成され、この構
造体Ｇｒｏｕｐのメンバーとして、構造体Ｐａｒｔｓ８
０５が登録（設定）される。By the above coupling process, for example, the structure Pa
Since the rts 803 is in contact with the structure Group 801 ′, it is coupled to the structure Group 801 ′. As a result, the structure Group 801 ′ corresponds to FIG.
The structure grows to the structure Group indicated by “806” in (c). In addition, the structure Parts 804 is a structure Grou.
Since it is in contact with p802 ', its structure Group
802 '. As a result, the structure Group8
02 ′ is the structure G shown in “807” in FIG.
grow into a loop. Since the structure Parts 805 does not come into contact with any of the structure groups 801 ′ and 802 ′ (cannot be combined), step S508 is performed.
, A new structure Group is created, and as a member of the structure Group, the structure Parts8
05 is registered (set).

【００８３】上記図８（ｃ）参照：次に、ステップＳ５
０３において、二値化画像上のｙ座標＝”６”のライン
の次のライン（カウンタｊがカウントアップされ”７”
であるときの対象ライン）上の、ｘｓｔａｒｔ［６５］
及びｘｅｎｄ［１０５］で示される黒画素の固まりの組
（構造体Ｐａｒｔｓ）８０８が検出されたとする。Referring to FIG. 8C: Next, at step S5
At 03, the line next to the line at y coordinate = “6” on the binarized image (counter j is counted up to “7”
Xstart [65] on the target line when
Suppose that a set (structure Parts) 808 of a group of black pixels indicated by xend [105] and xend [105] is detected.

【００８４】このとき、既に存在する構造体Ｇｒｏｕｐ
としては、構造体Ｇｒｏｕｐ８０６，８０７と、構造体
Ｐａｒｔｓ８０５の構造体Ｇｒｏｕｐ８０５’とが存在
するため、ステップＳ５０６にて、構造体Ｇｒｏｕｐ８
０６，８０７，８０５’に対する構造体Ｐａｒｔｓ８０
８の結合処理（座標比較による結合処理）が行われる。At this time, the already existing structure Group
Since there is a structure Group 806, 807 and a structure Group 805 ′ of the structure Parts 805, in Step S506, the structure Group 8
Structure Parts80 for 06,807,805 '
8 is performed (coupling processing by coordinate comparison).

【００８５】ここで、構造体Ｐａｒｔｓ８０８は、２つ
の構造体Ｇｒｏｕｐ８０７，８０５’に接触した状態に
ある。したがって、この場合、構造体Ｐａｒｔｓ８０８
は、２つの構造体Ｇｒｏｕｐ８０７，８０５’を合体さ
せた構造体Ｇｒｏｕｐに対して結合される。この結果、
２つの構造体Ｇｒｏｕｐ８０７，８０５’は、上記図８
（ｄ）の”８０９”に示す１つの構造体Ｇｒｏｕｐに成
長する。Here, the structure Parts 808 is in contact with the two structures Group 807 and 805 '. Therefore, in this case, the structure Parts 808
Is bonded to the structure Group that combines the two structures Group 807 and 805 ′. As a result,
The two structures Group 807, 805 'are shown in FIG.
It grows into one structure Group shown by “809” in (d).

【００８６】この場合、構造体Ｇｒｏｕｐ８０９として
は、構造体Ｇｒｏｕｐ８０７の情報（上記図７参照）が
用いられ、構造体Ｇｒｏｕｐ８０５’は消去される。但
し、構造体Ｇｒｏｕｐ８０７の情報において、ｎｅｘｔ＝構造体Ｇｒｏｕｐ８０５’の”ｆｉｒｓ
ｔ”により示される構造体Ｐａｒｔｓのアドレスｌａｓｔ＝構造体Ｇｒｏｕｐ８０５’の”ｌａｓ
ｔ”により示される構造体Ｐａｒｔｓのアドレスという、”ｎｅｘｔ”及び”ｌａｓｔ”の情報の書き換
えが行われ、その結果が、構造体Ｇｒｏｕｐ８０９の情
報とされる。In this case, information of the structure Group 807 (see FIG. 7) is used as the structure Group 809, and the structure Group 805 'is deleted. However, in the information of the structure Group 807, next = “firsts” of the structure Group 805 ′
address of the structure Parts indicated by “t” last = “las” of the structure Group 805 ′
The information of “next” and “last”, which is the address of the structure Parts indicated by “t”, is rewritten, and the result is the information of the structure Group 809.

【００８７】上記図８（ｄ），（ｆ）参照：したがっ
て、構造体Ｐａｒｔｓ８０１，８０３を含む構造体Ｇｒ
ｏｕｐ８０６と、構造体Ｐａｒｔｓ８０２，８０４，８
０５，８０８を含む構造体Ｇｒｏｕｐ８０６との２つの
構造体Ｇｒｏｕｐが生成される。8 (d) and (f): Therefore, the structure Gr including the structures Parts 801 and 803
up806 and the structures Parts802, 804, 8
Two structures Group are generated, including a structure Group 806 including 05,808.

【００８８】以上説明したような黒領域（構造体Ｇｒｏ
ｕｐにより示される領域）の抽出処理は、二値化部１０
５にて得られた複数の二値化画像（複数の二値化閾値に
基づき得られた二値化画像）に対してそれぞれ実行され
る。The black region (structure Gro) as described above
The extraction process of the region indicated by “up” is performed by the binarization unit 10.
5 is performed on each of the plurality of binarized images (binary images obtained based on the plurality of binarization thresholds).

【００８９】この結果、例えば、対象画像が、上記図３
（ａ）に示したような文書の多値画像であった場合、同
図（ｂ）に示すように、二値化閾値４０１に基づいた二
値化画像からは、黒領域３０１〜３０５が抽出される。
これらの黒領域３０１〜３０５を示す構造体Ｇｒｏｕｐ
では、黒領域部分の形状が四角であるか否かを示すフラ
グ情報”ＳｈａｐｅＲｅｃｔａｎｇｌｅ”が、”ＴＲＵ
Ｅ（黒領域部分の形状が四角である）”に設定されてお
り、構造体Ｇｒｏｕｐが示す領域が抽出された二値化画
像を作成した閾値より１つ前の閾値にて作成された二値
化画像上にほぼ同じオブジェクトが存在するか否かを示
すフラグ情報”ＳａｍｅＡｓＭｏｔｈｅｒ”が、”ＦＡ
ＬＳＥ（同じオブジェクトが存在しない）”に設定され
ている。As a result, for example, the target image is
In the case of a multi-valued image of a document as shown in (a), black regions 301 to 305 are extracted from the binarized image based on the binarization threshold 401 as shown in FIG. Is done.
Structure Group showing these black regions 301 to 305
Then, the flag information “ShapeRectangle” indicating whether or not the shape of the black area portion is a square is set to “TRU”.
E (the shape of the black area portion is a square) ", and the binary value created by the threshold value immediately before the threshold value used to create the binary image from which the area indicated by the structure Group is extracted. The flag information “SameAsMother” indicating whether or not substantially the same object exists on the structured image is “FA
LSE (the same object does not exist) "is set.

【００９０】また、上記図３（ｃ）に示すように、二値
化閾値４０２に基づいた二値化画像からは、黒領域３０
６〜３０９が抽出される。これらの黒領域３０６〜３０
９を示す構造体Ｇｒｏｕｐでは、フラグ情報”Ｓｈａｐ
ｅＲｅｃｔａｎｇｌｅ”が、”ＴＲＵＥ（黒領域部分の
形状が四角である）”に設定されている。フラグ情報”
ＳａｍｅＡｓＭｏｔｈｅｒ”については、黒領域３０６
を示す構造体Ｇｒｏｕｐのみが”ＴＲＵＥ（同じオブジ
ェクトが存在する）”に設定されており、他の黒領域３
０７〜３０９では”ＦＡＬＳＥ（同じオブジェクトが存
在しない）”に設定されている。Further, as shown in FIG. 3C, the black area 30 based on the binarized image
6 to 309 are extracted. These black areas 306 to 30
9, the flag information “Shap
eRectangle "is set to" TRUE (the shape of the black area is a square). "
For “SameAsMother”, the black area 306
Is set to “TRUE (the same object exists)” and the other black area 3
For 07 to 309, "FALSE (the same object does not exist)" is set.

【００９１】また、上記図３（ｄ）に示すように、二値
化閾値４０３に基づいた二値化画像からは、黒領域３１
０が抽出される。この黒領域３１０を示す構造体Ｇｒｏ
ｕｐでは、フラグ情報”ＳｈａｐｅＲｅｃｔａｎｇｌ
ｅ”が、”ＦＡＬＳＥ（黒領域部分の形状が四角でな
い）”に設定されており、フラグ情報”ＳａｍｅＡｓＭ
ｏｔｈｅｒ”が、”ＴＲＵＥ（同じオブジェクトが存在
する）”に設定されている。Further, as shown in FIG. 3D, the black area 31 based on the binary threshold
0 is extracted. The structure Gro indicating the black region 310
In the up, the flag information "ShapeRectangl
e "is set to" FALSE (the shape of the black area is not square) "and the flag information" SameAsM
"other" is set to "TRUE (the same object exists)".

【００９２】尚、黒領域の抽出の際、フラグ情報”Ｓｈ
ａｐｅＲｅｃｔａｎｇｌｅ”が、”ＴＲＵＥ”、又は所
定の大きさ以上の領域を、黒領域として抽出するように
してもよい。When the black area is extracted, the flag information “Sh
A region in which “aperRectangle” is “TRUE” or a region having a predetermined size or more may be extracted as a black region.

【００９３】黒領域抽出部１０７は、上述の処理（上記
図５及び図６に示した処理）を実施するための構成とし
て、例えば、図９に示すような構成を備えている。すな
わち、黒領域抽出部１０７は、上記図９に示すように、
二値メモリ１０６（上記図２参照）に保持された処理対
象となる二値化画像に対してウィンドウを設け、そのウ
ィンドウ内の黒画素の固まり（構造体Ｐａｒｔｓにより
示される黒画素の固まりの組）を検出する検出部１００
１と、検出部１００１にて検出された黒画素の固まりの
１次元的な座標情報（構造体Ｐａｒｔｓにより示される
黒画素の固まりの組の座標情報）を保持する座標保持部
１００２と、既に検出済みの黒画素の固まりの２次元的
な座標情報（構造体Ｇｒｏｕｐにより示される黒領域の
座標情報）を保持する２次元座標保持部１００４と、座
標保持部１００２内の座標情報と２次元座標保持部１０
０４内の座標情報を比較し、必要に応じて結合処理（構
造体Ｐａｒｔｓの構造体Ｇｒｏｕｐへの合体処理）を実
行した結果得られる２次元的な座標情報（構造体Ｇｒｏ
ｕｐにより示される黒領域の座標情報）を２次元座標保
持部１００４へ保持する比較部１００３とを備えてお
り、２次元座標保持部１００４へ保持された情報が、黒
領域分割部１１０（上記図２参照）へ与えられるように
なされている。The black area extraction unit 107 has, for example, a configuration as shown in FIG. 9 as a configuration for performing the above-described processing (the processing shown in FIGS. 5 and 6). That is, as shown in FIG.
A window is provided for the binarized image to be processed held in the binary memory 106 (see FIG. 2), and a set of black pixels in the window (a set of black pixels indicated by the structure Parts) ) Detecting section 100
1, a coordinate holding unit 1002 that holds one-dimensional coordinate information of a set of black pixels detected by the detection unit 1001 (coordinate information of a set of black pixels indicated by the structure Parts), A two-dimensional coordinate storage unit 1004 for storing two-dimensional coordinate information (coordinate information of a black area indicated by the structure Group) of a group of completed black pixels, and the coordinate information and two-dimensional coordinate storage in the coordinate storage unit 1002 Part 10
The two-dimensional coordinate information (structure Gro) obtained as a result of comparing the coordinate information in the data structure 04 and executing the combining process (the merging process of the structure Parts into the structure Group) as necessary
and a comparison unit 1003 for holding the coordinate information of the black area indicated by “up” in the two-dimensional coordinate holding unit 1004. The information held in the two-dimensional coordinate holding unit 1004 is stored in the black area dividing unit 110 (see FIG. 2).

【００９４】黒領域分割部１１０は、黒領域抽出部１０
７から黒領域の情報が与えられると、例えば、図１０に
示すフローチャートに従って、次のように動作する。The black area dividing section 110 includes the black area extracting section 10
When the information on the black area is given from 7, for example, the following operation is performed according to the flowchart shown in FIG.

【００９５】ステップＳ１１０１：先ず、黒領域抽出部
１０７からの、１つの二値化画像から得られた全ての黒
領域（構造体Ｇｒｏｕｐにより示される黒領域、以下、
単に「構造体Ｇｒｏｕｐ」とも言う）に対して、次のス
テップＳ１１０２からの処理を実行し終えたか否かを判
別する。この判別の結果、終了していた場合、本処理終
了とし、まだ未処理の構造体Ｇｒｏｕｐがある場合に
は、次のステップＳ１１０２へ進む。Step S1101: First, all black areas (black areas indicated by the structure Group, hereinafter referred to as “structures Group”) obtained from one binarized image from the black area extraction unit 107.
It is determined whether the processing from the next step S1102 has been completed on the “structure group”). If the result of this determination is that the processing has been completed, this processing is terminated. If there is an unprocessed structure Group, the flow proceeds to the next step S1102.

【００９６】ステップＳ１１０２：未処理の対象構造体
Ｇｒｏｕｐの情報（上記図７参照）を参照し、対象構造
体Ｇｒｏｕｐの大きさが予め設定された閾値より大きい
か、また、黒領域部分の形状が四角であるか否かを示す
フラグ情報”ＳｈａｐｅＲｅｃｔａｎｇｌｅ”が”ＦＡ
ＬＳＥ（黒領域部分の形状が四角でない）”か否かを判
別し、その判別の結果、対象黒領域の大きさが予め設定
された閾値より大きく、且つ、フラグ情報”Ｓｈａｐｅ
Ｒｅｃｔａｎｇｌｅ”が”ＦＡＬＳＥ”である場合、次
のステップＳ１１０３へ進み、そうでない場合には、そ
のまま後述するステップＳ１１１０へ進む。Step S1102: Referring to the information of the unprocessed target structure Group (see FIG. 7), whether the size of the target structure Group is larger than a preset threshold value or the shape of the black area portion The flag information “ShapeRectangle” indicating whether or not it is a square is “FA”
LSE (the shape of the black area is not square) ”is determined, and as a result of the determination, the size of the target black area is larger than a preset threshold value, and the flag information“ Shape ”
If "Rectangle" is "FALSE", the flow advances to the next step S1103; otherwise, the flow directly advances to step S1110 to be described later.

【００９７】ステップＳ１１０３：ステップＳ１１０２
での判別の結果、対象構造体Ｇｒｏｕｐが上記の条件に
該当した場合、対象構造体Ｇｒｏｕｐが有する構造体Ｐ
ａｒｔｓの情報（上記図７参照）を参照し、構造体Ｐａ
ｒｔｓにより示される黒画素の固まりの組の始めの画素
の座標を示す”ｓｔａｒｔｘ”のヒストグラム（以下、
「ヒストグラムｓｔａｒｔｈｉｓｔ」と言う）、及び当
該黒画素の固まりの終わりの画素の座標を示す”ｅｎｄ
ｘ”のヒストグラム（以下、「ヒストグラムｅｎｄｈｉ
ｓｔ」と言う）を作成する。Step S1103: Step S1102
When the target structure Group satisfies the above condition as a result of the determination in the above, the structure P included in the target structure Group
With reference to the information of the arts (see FIG. 7 above), the structure Pa
A histogram of “startx” (hereinafter, “startx”) indicating the coordinates of the first pixel of the set of black pixels indicated by rts
"End" indicating the coordinates of the pixel at the end of the cluster of black pixels.
x "histogram (hereinafter" histogram endhi
st ").

【００９８】ステップＳ１１０４：ステップＳ１１０３
で作成したヒストグラムｓｔａｒｔｈｉｓｔ及びヒスト
グラムｅｎｄｈｉｓｔの密集ポイントをそれぞれｍ点検
出する。Step S1104: Step S1103
M dense points of the histogram starthist and the histogram endhist created in step (1) are detected.

【００９９】ステップＳ１１０５：ステップＳ１１０４
で得た密集ポイントの情報を参照して、対象構造体Ｇｒ
ｏｕｐを分割すべきか否か、すなわち対象構造体Ｇｒｏ
ｕｐにより示される黒領域を分割すべきか否かを判別す
る。例えば、密集ポイントの位置及び数に対して任意の
条件を設け、その条件を満たしている場合には、領域分
割すると認識し、次のステップＳ１１０６へ進む。一
方、上記条件を満たしていない場合には、領域分割しな
いと認識し、そのまま後述するステップＳ１１１０へ進
む。Step S1105: Step S1104
With reference to the information of the crowded points obtained in the above, the target structure Gr
whether or not to split “up”, that is, the target structure Gro
It is determined whether or not the black area indicated by up should be divided. For example, an arbitrary condition is provided for the position and the number of the crowded points, and when the condition is satisfied, it is recognized that the area is divided, and the process proceeds to the next step S1106. On the other hand, when the above condition is not satisfied, it is recognized that the area is not divided, and the process proceeds to step S1110 to be described later.

【０１００】ステップＳ１１０６：ステップＳ１１０５
の判別の結果、領域分割する場合、ヒストグラムｓｔａ
ｒｔｈｉｓｔから検出されたｍ点の密集ポイント、及び
ヒストグラムｅｎｄｈｉｓｔから検出されたｍ点の密集
ポイントから、それぞれ１点を選択し、これらの２つの
点を、”ｓｔａｒｔｐｏｉｎｔ”及び”ｅｎｄｐｏｉｎ
ｔ”として設定する。ここでは、一度に分割できる領域
は、０〜ｓｔａｒｔｐｏｉｎｔｓｔａｒｔｐｏｉｎｔ〜ｅｎｄｐｏｉｎｔｅｎｄｐｏｉｎｔ〜最後までにより示される３つの領域とする。Step S1106: Step S1105
When the region is divided as a result of the
One point is selected from each of the m dense points detected from the rthist and the m dense points detected from the histogram endhist, and these two points are defined as “startpoint” and “endpoint”.
In this case, the area that can be divided at a time is three areas indicated by 0 to start point start point to end point end point to the end.

【０１０１】ステップＳ１１０７：ステップＳ１１０６
の分割点の設定により得られた〜の３つの領域のそ
れぞれに対して、構造体Ｐａｒｔｓにより示される黒画
素の固まり組の方向（ｘ方向）とは異なる、例えば、ｙ
方向の広がりを検出する。Step S1107: Step S1106
Is different from the direction (x direction) of the set of black pixels indicated by the structure Parts for each of the three regions obtained by setting the division point of, for example, y
Detects directional spread.

【０１０２】ステップＳ１１０８：ステップＳ１１０７
の検出結果により、再度、領域分割すべきか否かを判別
する。例えば、、、のそれぞれの領域に対して、
ｙ方向の分布が全体的に広がっているか否かを判別し、
その判別の結果、、、のそれぞれの領域が、ｙ方
向の分布が全体的に広がっている場合、領域分割すべき
でないと認識し、そのまま後述するステップＳ１１１０
へ進む。一方、、、のそれぞれの領域のｙ方向の
分布に偏りがある場合には、領域分割すべきと認識し、
次のステップＳ１１０９へ進む。Step S1108: Step S1107
, It is determined again whether or not to divide the area. For example, for each region of
determine whether the distribution in the y-direction is widening as a whole,
As a result of the discrimination, if the respective regions have a wide distribution in the y direction, it is recognized that the region should not be divided, and step S1110 to be described later is performed as it is.
Proceed to. On the other hand, if there is a bias in the y-direction distribution of each of the regions, it is recognized that the region should be divided,
Proceed to the next step S1109.

【０１０３】ステップＳ１１０９：ステップＳ１１０８
の判別の結果、領域分割する場合、ステップＳ１１０６
の分割点の設定により得られた〜の３つの領域に、
対象構造体Ｇｒｏｕｐにより示される黒領域を分割す
る。また、〜の３つの領域に対応する新規の構造体
Ｇｒｏｕｐを作成し、その新規の構造体Ｇｒｏｕｐに対
して、該当する新たな二次元の座標情報を設定する。そ
して、分割の元になった対象構造体Ｇｒｏｕｐを削除す
る。尚、分割の元になった対象構造体Ｇｒｏｕｐを削除
するのではなく、そのまま流用して、座標情報を更新す
るようにしてもよい。Step S1109: Step S1108
If the result of the determination is that the area is to be divided, step S1106
In the three areas of ~ obtained by setting the division point of
The black area indicated by the target structure Group is divided. Further, a new structure Group corresponding to the three regions is created, and the corresponding new two-dimensional coordinate information is set for the new structure Group. Then, the target structure Group that is the source of the division is deleted. The coordinate information may be updated by diverting the target structure Group from which the division is based, instead of deleting the target structure Group.

【０１０４】ステップＳ１１１０：ステップＳ１１０９
の処理後、分割の結果得られた新規の構造体Ｇｒｏｕｐ
を、黒領域座標保持部１０８（上記図２参照）へ保持
し、対象構造体Ｇｒｏｕｐを次の構造体Ｇｒｏｕｐとし
て（次の構造体Ｇｒｏｕｐへポインタを進める）、ステ
ップＳ１１０１へ戻り、以降の処理ステップを繰り返し
実行する。一方、ステップＳ１１０２，Ｓ１１０５，Ｓ
１１０８の各判別処理で”Ｎｏ”で分岐した場合に、本
ステップＳ１１１０を実行する場合、分割が行われなか
った対象構造体Ｇｒｏｕｐをそのまま黒領域座標保持部
１０８（上記図２参照）へ保持し、対象構造体Ｇｒｏｕ
ｐを次の構造体Ｇｒｏｕｐとして（次の構造体Ｇｒｏｕ
ｐへポインタを進める）、ステップＳ１１０１へ戻り、
以降の処理ステップを繰り返し実行する。したがって、
黒領域座標保持部１０８には、黒領域抽出部１０７で得
られた黒領域の情報と、必要に応じて当該黒領域を黒領
域分割部１１０で分割して得られた黒領域の情報とが保
持されることになる。Step S1110: Step S1109
, A new structure Group obtained as a result of the division
Is stored in the black area coordinate storage unit 108 (see FIG. 2), the target structure Group is set as the next structure Group (the pointer is advanced to the next structure Group), and the process returns to step S1101 to perform the subsequent processing steps. Is repeatedly executed. On the other hand, steps S1102, S1105, S
When the step S1110 is executed in the case of branching at “No” in each determination process of 1108, the target structure Group that has not been divided is stored in the black region coordinate storage unit 108 (see FIG. 2 above) as it is. , Target structure Grou
Let p be the next structure Group (next structure Group
p), the process returns to step S1101,
The subsequent processing steps are repeatedly executed. Therefore,
The black region coordinate holding unit 108 stores the information of the black region obtained by the black region extraction unit 107 and the information of the black region obtained by dividing the black region by the black region division unit 110 as necessary. Will be retained.

【０１０５】図１１（ａ）〜（ｃ）、図１２、図１３、
図１４は、黒領域分割部１１０での上記図１０に示した
処理の実行により、対象二値化画像上では、どのような
処理が実際に行われるかを具体的に示したものである。FIGS. 11 (a) to 11 (c), FIGS.
FIG. 14 specifically shows what processing is actually performed on the target binarized image by the execution of the processing shown in FIG. 10 in the black area dividing unit 110.

【０１０６】まず、図１１（ａ）に示す画像は、上記図
２２に示した画像と同様に、白下地に「１２３」、「４
５６」、「７９８」等の黒文字が存在する通常領域が、
黒下地に「ＡＢＣＤＥＦＧ」及び「イロハニホヘト」の
白文字が存在する反転領域と外接している状態で存在す
る画像である。このような画像が黒領域抽出部１０７で
処理されると、上記図１１（ａ）中の太線で示す領域を
構造体Ｐａｒｔｓとする黒領域（構造体Ｇｒｏｕｐで示
される領域）が抽出される。First, the image shown in FIG. 11A has “123” and “4” on a white background, similarly to the image shown in FIG.
Normal areas where black characters such as "56" and "798" exist
This is an image that exists in a state circumscribing a reverse area where white characters “ABCDEFG” and “Irohanihohet” are present on a black background. When such an image is processed by the black region extraction unit 107, a black region (region indicated by the structure Group) in which the region indicated by the thick line in FIG.

【０１０７】尚、上記図１１（ａ）は、構造体Ｐａｒｔ
ｓが存在する様子を表すためのイメージ図であり、実際
には、同図中の太線で示す数よりも多くの構造体Ｐａｒ
ｔｓが存在することになる。Note that FIG. 11A shows the structure Part
FIG. 14 is an image diagram for showing a state in which s exists, and in fact, there are more structures Par than the number indicated by a thick line in FIG.
ts will be present.

【０１０８】そこで、黒領域分割部１１０において、上
記図１０のステップＳ１１０３の処理が実行されると、
ヒストグラムｓｔａｒｔｈｉｓｔ及びヒストグラムｅｎ
ｄｈｉｓｔとして、上記図１１（ｂ）及び（ｃ）に示す
ようなヒストグラムが得られる。Therefore, when the processing of step S1103 in FIG. 10 is executed in the black area dividing section 110,
Histogram histogram and histogram en
A histogram as shown in FIGS. 11B and 11C is obtained as dhist.

【０１０９】次のステップＳ１１０４の処理が実行され
ると、上記図１１（ｂ）に示されるヒストグラムｓｔａ
ｒｔｈｉｓｔから密集ポイント１５０１が検出されると
共に、同図（ｃ）に示されるヒストグラムｅｎｄｈｉｓ
ｔから密集ポイント１５０２，１５０３が検出される。When the processing of the next step S1104 is executed, the histogram “sta” shown in FIG.
rhist, a dense point 1501 is detected, and a histogram endhis shown in FIG.
Congestion points 1502 and 1503 are detected from t.

【０１１０】次のステップＳ１１０５の処理では、ステ
ップＳ１１０４での密集ポイントを利用して、分割すべ
きか否かの判定（第１の判定）が行われるが、ここで
は、ヒストグラムｅｎｄｈｉｓｔの密集ポイント１５０
２，１５０３が離れた関係にあるので、分割する必要が
あると判定する。すなわち、密集ポイント１５０２，１
５０３が離れた関係にあるということは、構造体Ｐａｒ
ｔｓにより示される黒画素の固まりの組の終わりの画素
の座標を示す”ｅｎｄｘ”の密集が離れている状態であ
ることを意味するので、構造体Ｇｒｏｕｐで示される黒
領域を分割する必要があると判定する。In the next step S1105, a decision (first decision) is made as to whether or not to divide using the dense points in step S1104. Here, the dense points 150 of the histogram endhist are determined.
Since 2,1503 are in a distant relationship, it is determined that division is necessary. That is, the dense points 1502, 1
The fact that 503 is in a distant relationship means that the structure Par
This means that the density of “endx” indicating the coordinates of the pixel at the end of the set of black pixels indicated by ts is far away, so it is necessary to divide the black area indicated by the structure Group Is determined.

【０１１１】ステップＳ１１０５の判定結果により、次
のステップＳ１１０６の処理が実行されると、”ｓｔａ
ｒｔｐｏｉｎｔ”に”０”が設定され、”ｅｎｄｐｏｉ
ｎｔ”に密集ポイント１５０２が設定される。When the processing in the next step S1106 is executed based on the determination result in step S1105, "sta
“0” is set in “rtpoint”, and “endpoi” is set.
A dense point 1502 is set at “nt”.

【０１１２】次のステップＳ１１０７の処理では、ｙ方
向の広がりが検出される。ここでは、０〜ｓｔａｒｔｐｏｉｎｔｓｔａｒｔｐｏｉｎｔ〜ｅｎｄｐｏｉｎｔｅｎｄｐｏｉｎｔ〜最後までにより示される３つの領域において、の領域との領
域は同じ領域を示すことになるため、の領域（０〜ｅ
ｎｄｐｏｉｎｔ）との領域（ｅｎｄｐｏｉｎｔ〜最後
まで）についてのｙ方向の広がりを、構造体Ｐａｒｔｓ
のｙ方向の情報（上記図７の”７０１ａ”で示す情報）
を参照する等して検出する。In the next step S1107, the spread in the y direction is detected. Here, in the three areas indicated by 0 to start point start point to end point end point to the end, the area and the area indicate the same area.
ndpoint) and the area (endpoint to the end) in the y-direction are represented by the structure Parts
(Information indicated by “701a” in FIG. 7)
, Etc. to detect.

【０１１３】次のステップＳ１１０８の処理では、ステ
ップＳ１１０７でのｙ方向の広がりの検出の結果から、
再度、分割すべきか否かの判定（第２の判定）が行われ
る。ここでは、上記検出の結果により、の領域（０〜
ｅｎｄｐｏｉｎｔ）は、上記図１１（ａ）に示す”１５
０４”〜”１５０６”に分布しており、の領域（ｅｎ
ｄｐｏｉｎｔ〜最後まで）は、同図に示す”１５０４”
〜”１５０５”にしか分布していないことが認識できる
ため、分割すべきであると判定する。In the next step S1108, based on the result of detection of the spread in the y direction in step S1107,
Again, a determination as to whether or not to divide (a second determination) is made. Here, the region (0 to 0)
The “endpoint” is “15” shown in FIG.
04 "to" 1506 ", and the region (en
dpoint to the end) is “1504” shown in FIG.
Since it can be recognized that the data is distributed only to “1505”, it is determined that the data should be divided.

【０１１４】ステップＳ１１０８の判定結果により、次
のステップＳ１１０９の処理が実行されると、構造体Ｇ
ｒｏｕｐで示される黒領域が、の領域との領域の２
つの領域に分割される。すなわち、図１２に示すよう
に、黒下地に「ＡＢＣＤＥＦＧ」及び「イロハニホヘ
ト」の白文字が存在する反転領域（黒領域）が、「ＡＢ
ＣＤＥＦＧ」の領域（の領域）と、「イロハニホヘ
ト」の領域（の領域）との２つの領域に分割される。When the processing in the next step S1109 is executed based on the determination result in step S1108, the structure G
The black area indicated by the loop is the area 2
Divided into two areas. That is, as shown in FIG. 12, an inverted area (black area) in which white characters of “ABCDEFG” and “Irohanihoheto” are present on a black background is “AB
The area is divided into two areas, an area of (CDDEFG) and an area of (Irohanihohet).

【０１１５】尚、ステップＳ１１０９での領域分割にお
いて、上記図１２に示した分割に対して、状況によって
は、図１３に示すような分割を行う必要がある場合があ
る。この場合、例えば、元の多値画像の色情報を参照し
て、上記図１２に示すような分割を行うか、上記図１３
に示すような分割を行うかを選択するようにしてもよ
い。In the area division in step S1109, the division shown in FIG. 12 may need to be performed as shown in FIG. 13 depending on the situation. In this case, for example, with reference to the color information of the original multi-valued image, the division as shown in FIG.
It is also possible to select whether to perform the division as shown in FIG.

【０１１６】したがって、上記図１１に示した画像の黒
領域が、上記図１２（又は図１３）に示したように分割
されることによって、例えば、ＯＣＲのために２値化処
理を実行したとしても、従来のように、「１２３」、
「４５６」、「７８９」等の黒文字が白下地に存在する
領域が、通常領域（白下地に黒文字が存在する領域）で
あるにも関わらず反転されてしまう（上記図２３参照）
ことはなく、図１４に示すように、「ＡＢＣＤＥＦＧ」
及び「イロハニホヘト」の白文字が存在する黒領域のみ
が反転され、「ＡＢＣＤＥＦＧ」、「イロハニホヘ
ト」、「１２３」、「４５６」、「７８９」等の全ての
文字を良好にＯＣＲできる二値化が可能となる。Therefore, by dividing the black area of the image shown in FIG. 11 as shown in FIG. 12 (or FIG. 13), for example, if the binarization processing is executed for OCR, Also, as before, "123",
An area where black characters such as "456" and "789" are present on a white background is inverted despite being a normal area (an area where black characters are present on a white background) (see FIG. 23 above).
As shown in FIG. 14, "ABCDEFG"
Only the black area where the white characters of “Irohanihohet” are present is inverted, and binarization that can satisfactorily OCR all the characters such as “ABCDEFG”, “Irohanihohet”, “123”, “456”, and “789” is performed. It becomes possible.

【０１１７】そして、ステップＳ１１１０の処理が実行
されると、上述のようにして、構造体Ｇｒｏｕｐで示さ
れる黒領域を分割して得られた新たな黒領域を示す新規
の構造体Ｇｒｏｕｐが、黒領域座標保持部１０８へ保持
される。また、分割が行われなかった構造体Ｇｒｏｕｐ
については、そのまま黒領域座標保持部１０８へ保持さ
れる。When the process of step S1110 is executed, a new structure Group indicating a new black region obtained by dividing the black region indicated by the structure Group as described above is added to the black structure. It is held in the area coordinate holding unit 108. In addition, the structure Group that has not been divided
Is held in the black area coordinate holding unit 108 as it is.

【０１１８】領域解析部１０９は、上記図２に示したス
テップＳ２０６において、黒領域座標保持部１０８に保
持された全ての黒領域に対して、対象黒領域が文字領域
であるか否か、文字領域であるならば反転領域であるか
否か、当該下地の平均画素値（平均下地色）、当該文字
の平均画素値（平均文字色）、二値化の際に用いる最適
な二値化閾値等の解析情報を取得する。In step S206 shown in FIG. 2, the area analyzing unit 109 determines whether or not the target black area is a character area for all the black areas held in the black area coordinate holding unit 108. If it is a region, whether or not it is an inverted region, the average pixel value of the background (average background color), the average pixel value of the character (average character color), the optimal binarization threshold used for binarization And other analysis information.

【０１１９】例えば、黒領域座標保持部１０８に保持さ
れた黒領域の座標情報が、上記図３に示した黒領域３０
１〜３１０の座標情報である場合、領域解析部１０９
は、先ず、これらの黒領域３０１〜３１０のそれぞれに
対応するヒストグラムを、ヒストグラム演算部１０３に
より取得する。そして、領域解析部１０９は、黒領域３
０１〜３１０の対応する各ヒストグラムから黒領域３０
１〜３１０のそれぞれに対して、対象黒領域が文字領域
であるか否か、文字領域であるならば反転領域であるか
否か、反転領域であるならば反転すべきか否かを判断
し、さらに、対象黒領域を二値化する際の最適閾値を算
出し、これらの結果（解析結果）を、黒領域座標保持部
１０８に保持されている各種情報（上記図７参照）にお
ける、構造体Ｇｒｏｕｐが示す領域を二値化するのにふ
さわしい閾値の情報”Ｔｈｒｅｓｈｏｌｄ”、構造体Ｇ
ｒｏｕｐが示す領域が文字領域であるか文字以外の領域
（画像領域等）であるかを示すフラグ情報”Ｃｈａｒａ
Ｆｌｇ”、構造体Ｇｒｏｕｐが示す領域が文字領域であ
る場合に当該領域が反転文字領域（下地が黒っぽく、文
字部が白っぽい領域）であるか否かを示すフラグ情報”
ＩｎｖｅｒｔＦｌｇ”、構造体Ｇｒｏｕｐが示す領域が
抽出された二値化画像を作成した閾値より１つ前の閾値
にて作成された二値化画像上にほぼ同じオブジェクトが
存在するか否かを示すフラグ情報”ＳａｍｅＡｓＭｏｔ
ｈｅｒ”、及び次の構造体Ｇｒｏｕｐのアドレスを示す
ポインタ情報”ｎｅｘｔ”に対してそれぞれ設定する。For example, the coordinate information of the black area held in the black area coordinate holding unit 108 is the same as the black area 30 shown in FIG.
If the coordinate information is 1 to 310, the area analysis unit 109
First, the histogram calculation unit 103 acquires histograms corresponding to the black regions 301 to 310, respectively. Then, the region analyzing unit 109 determines that the black region 3
From the corresponding histograms of 01 to 310, the black area 30
For each of 1 to 310, it is determined whether the target black area is a character area, if it is a character area, whether it is an inverted area, and if it is an inverted area, whether it should be inverted, Further, an optimum threshold value for binarizing the target black area is calculated, and these results (analysis results) are stored in the structure in various information (see FIG. 7 described above) held in the black area coordinate holding unit 108. Threshold information “Threshold” suitable for binarizing the area indicated by the Group, the structure G
The flag information "Chara" indicating whether the area indicated by the group is a character area or a non-character area (eg, an image area).
Flg ", flag information indicating whether or not the region indicated by the structure Group is a character region when the region is an inverted character region (a black background and a white character portion).
"InvertFlg", a flag indicating whether or not substantially the same object exists on the binarized image created with the threshold value one immediately before the threshold value that created the binarized image in which the region indicated by the structure Group is extracted. Information "SameAsMot
her "and pointer information" next "indicating the address of the next structure Group.

【０１２０】ここで、ヒストグラム演算部１０３により
得られる黒領域３０１〜３１０に対応したヒストグラム
から、上述のような各種解析を行なう際、子どもの黒領
域の情報を省いたほうが望ましい。例えば、黒領域３０
３と黒領域３０７の関係は、図１５に示すように、黒領
域３０３の中に黒領域３０７（黒領域３０３を親とする
子の黒領域）が含まれる状態となっているが、黒領域３
０７についての情報（重複画素の情報）は用いないよう
にするほうが、よりよい解析結果を得ることができる。
このための処理方法としては、マップ画像を用いる方法
や、座標を用いる方法等がある。ここでは、その一例と
して、マップ画像を用いる方法について説明する。Here, it is desirable to omit the information on the child's black area from the histograms corresponding to the black areas 301 to 310 obtained by the histogram calculation unit 103 when performing the above-described various analyses. For example, the black area 30
As shown in FIG. 15, the black region 303 includes a black region 307 (a black region of a child having the black region 303 as a parent). 3
A better analysis result can be obtained by not using information on 07 (information on overlapping pixels).
As a processing method for this, there is a method using a map image, a method using coordinates, and the like. Here, a method using a map image will be described as an example.

【０１２１】マップ画像を利用した方法：この方法は、
ヒストグラム演算部１０３において、黒領域のヒストグ
ラムを生成する際、マップ画像を用いる方法である。例
えば、”０”に近い二値化閾値４０３で二値化した結果
（上記図３（ｄ）に示した二値化画像）から抽出された
黒領域３１０のヒストグラムを作成する場合、対象画像
（上記図３（ａ）に示した文書の多値画像）のマップ画
像を用意し、当該マップ画像を構成する全ての画素を”
０”で初期化しておく。そして、黒領域３１０のヒスト
グラムを作成するときにアクセスした画素部に対応する
マップ画像の画素部を”１”に設定する。これにより、
マップ画像では、アクセスされていない画素部は”０”
であり、アクセスされた画素部は”１”となるため、画
素値が”１”でない画素の情報を用いてヒストグラムを
作成すればよい。このような方法により、ヒストグラム
演算部１０３でのヒストグラムの作成が行われることに
よって、領域解析部１０９は、上記図１５に示したよう
な関係にある黒領域であっても、画素情報の重複のない
正確な当該黒領域のヒストグラムを取得することができ
るため、最適な解析処理を行なうことができる。A method using a map image:
The histogram calculation unit 103 uses a map image when generating a histogram of a black area. For example, when creating a histogram of the black region 310 extracted from the binarization result (the binarized image shown in FIG. 3D) with the binarization threshold 403 close to “0”, the target image ( A map image of the multi-value image of the document shown in FIG. 3A is prepared, and all the pixels constituting the map image are set to “
Then, the pixel portion of the map image corresponding to the pixel portion accessed when creating the histogram of the black region 310 is set to “1”.
In the map image, the pixel part not accessed is “0”
Since the accessed pixel portion is “1”, a histogram may be created using information on pixels whose pixel values are not “1”. By creating a histogram in the histogram calculation unit 103 according to such a method, the area analysis unit 109 can determine whether the pixel information overlaps even in the black area having the relationship shown in FIG. Since an accurate histogram of the black area can be obtained, an optimal analysis process can be performed.

【０１２２】尚、上述のマップ画像を利用した方法にお
いて、フラグ情報”ＳａｍｅＡｓＭｏｔｈｅｒ”が”Ｔ
ＲＵＥ（同じオブジェクトが存在する）”である黒領域
については、最終的な二値化処理時において重複二値化
をする時間を削減するために、二値化処理しないように
構成してもよい。この場合、当該黒領域については、領
域解析部１０９での解析処理についても行なわない、或
いは、解析処理を行ってもマップ画像での当該黒領域に
対応する画素部に対して”１”を設定しないようにす
る。In the above-described method using the map image, the flag information "SameAsMother" is set to "T
The black region that is “RUE (the same object exists)” may be configured not to be subjected to the binarization process in order to reduce the time for performing duplicate binarization in the final binarization process. In this case, for the black area, the analysis processing in the area analysis unit 109 is not performed, or even if the analysis processing is performed, “1” is set to the pixel portion corresponding to the black area in the map image. Do not set.

【０１２３】領域解析部１０９での解析処理の具体例を
示すと、次のような処理がある。例えば、ヒストグラム
演算部１０３により得られた対象黒領域のヒストグラム
の平均値ａｖｅｒａｇｅと、そのスキューｓｋｅｗを、A specific example of the analysis processing in the area analysis unit 109 is as follows. For example, the average value average of the histogram of the target black area obtained by the histogram calculation unit 103 and its skew skew are

【０１２４】[0124]

【数１】 (Equation 1)

【０１２５】なる演算により求め、この結果えられたス
キューｓｋｅｗの絶対値が、対象黒領域が存在する二値
化画像取得時の二値化閾値より大きい場合、対象黒領域
は文字領域であると認識し、対象黒領域を示す構造体Ｇ
ｒｏｕｐのフラグ情報”ＣｈａｒａＦｌｇ”を”ＴＲＵ
Ｅ”に設定する。また、この場合、さらに上記スキュー
ｓｋｅｗの値が「正」である場合、対象黒領域が反転文
字領域（下地が黒っぽく、文字部が白っぽい領域）であ
ると認識し、対象黒領域を示す構造体Ｇｒｏｕｐのフラ
グ情報”ＩｎｖｅｒｔＦｌｇ”を”ＴＲＵＥ”に設定す
る。また、この場合の対象黒領域を二値化するのにふさ
わしい閾値として、対象黒領域の下地部分が消去される
ような閾値を対象黒領域のヒストグラムから求め、その
結果を、対象黒領域を示す構造体Ｇｒｏｕｐの情報”Ｔ
ｈｒｅｓｈｏｌｄ”へ設定する。If the absolute value of the resulting skew skew is larger than the binarization threshold at the time of obtaining the binarized image in which the target black area exists, it is determined that the target black area is a character area. Structure G that recognizes and indicates the target black area
group information "CharaFlg" to "TRU"
E ”. In this case, if the value of the skew skew is“ positive ”, the target black area is recognized as an inverted character area (a black background and a white character area). The flag information “InvertFlg” of the structure Group indicating the black area is set to “TRUE”. In addition, as a threshold suitable for binarizing the target black region in this case, a threshold value at which the background portion of the target black region is erased is obtained from the histogram of the target black region, and the result indicates the target black region. Structure Group information "T
threshold ".

【０１２６】上述のようにして、領域解析部１０９は、
上記図２に示したステップＳ２０６において、黒領域座
標保持部１０８に保持された全ての黒領域についての解
析を行い、その解析結果を黒領域座標保持部１０８へ書
き込むが、構造体Ｇｒｏｕｐのフラグ情報”Ｉｎｖｅｒ
ｔＦｌｇ”が”ＴＲＵＥ”である黒領域については、さ
らに次のような処理を実行する。As described above, the area analysis unit 109
In step S206 shown in FIG. 2, analysis is performed on all the black areas held in the black area coordinate holding unit 108, and the analysis result is written in the black area coordinate holding unit 108. The flag information of the structure Group "Inver
The following processing is further performed on the black area in which tFlg ”is“ TRUE ”.

【０１２７】まず、構造体Ｇｒｏｕｐのフラグ情報”Ｉ
ｎｖｅｒｔＦｌｇ”が”ＴＲＵＥ”である黒領域（以
下、「対象黒領域」と言う）は、文字領域であり、且つ
反転文字領域（下地が黒っぽく、文字部が白っぽい領
域）である。そこで、本処理では、対象黒領域に対応す
る部分の多値画像（多値メモリ１９２に保持されている
画像情報）を反転して、その反転後の画像に対して、上
記図２に示したステップＳ２０２〜Ｓ２０６の処理を繰
り返し実行する（上記図２の点線矢印参照）。尚、この
ときの反転処理は、例えば、多値画像そのものを反転す
るようにしてもよいし、多値画像から得られたヒストグ
ラム（ヒストグラム演算１０３で得られたヒストグラ
ム）を反転するようにしてもよいし、その方法は限定さ
れることはない。First, the flag information “I” of the structure Group
A black area in which “nvertFlg” is “TRUE” (hereinafter, referred to as a “target black area”) is a character area and an inverted character area (a black background and a whitish character part). Therefore, in this processing, the multi-valued image (the image information held in the multi-valued memory 192) of the portion corresponding to the target black area is inverted, and the image after the inversion is shown in FIG. The processing of steps S202 to S206 is repeatedly executed (see the dotted arrow in FIG. 2). The inversion processing at this time may be, for example, to invert the multi-valued image itself or to invert the histogram obtained from the multi-valued image (the histogram obtained by the histogram calculation 103). Good, and the method is not limited.

【０１２８】具体的には例えば、上記図３（ａ）に示し
た多値画像から得られた黒領域３０１〜３１０（同図
（ｂ）〜（ｃ）参照）において、黒領域３０１，３０
６，３１０は、当該多値画像の反転文字領域に対応する
部分であるが、これらの黒領域３０１，３０６，３１０
のうち、黒領域３０６，３１０については、構造体Ｇｒ
ｏｕｐが示す領域（当該黒領域）が抽出された二値化画
像（同図（ｃ）及び（ｄ）に示す二値化画像）を作成し
た閾値より１つ前の閾値にて作成された二値化画像（同
図（ｂ）に示す二値化画像）上にほぼ同じオブジェクト
が存在するか否かを示すフラグ情報”ＳａｍｅＡｓＭｏ
ｔｈｅｒ”が”ＴＲＵＥ（存在する）”に設定されるた
め、黒領域３０１に対して、本処理が実行される。More specifically, for example, in the black areas 301 to 310 (see FIGS. 3B to 3C) obtained from the multi-valued image shown in FIG.
6, 310 are portions corresponding to the inverted character area of the multi-valued image, and these black areas 301, 306, 310
Of the black regions 306 and 310, the structure Gr
The bin created with the threshold value one immediately before the threshold at which the binarized image (the binarized image shown in FIGS. 3C and 3D) from which the region (the black region) indicated by “up” is extracted is created. Flag information "SameAsMo" indicating whether or not substantially the same object exists on the binarized image (binary image shown in FIG. 3B).
Since “other” is set to “TRUE (exists)”, the present process is executed on the black region 301.

【０１２９】黒領域３０１に対応する多値画像の領域を
反転したものを、図１６（ａ）に示す。この反転画像３
５０に対して、先ず、上記図２のステップＳ２０２にて
ヒストグラム演算部１０３により処理が実行されると、
図１７に示すようなヒストグラムが得られる。次のステ
ップＳ２０３では、閾値決定部１０４により、上記図１
７に示す２つの二値化閾値４５１，４５２が決定され
る。次のステップＳ２０４では、二値化部１０５によ
り、上記図１６（ｂ）及び（ｃ）に示すような二値化画
像が得られる。上記図１６（ｂ）に示す二値化画像は、
二値化閾値４５１に基づいた二値化により得られた画像
であり、同図（ｃ）に示す二値化画像は、二値化閾値４
５２に基づいた二値化により得られた画像である。FIG. 16A shows an inverted multi-valued image area corresponding to the black area 301. This inverted image 3
For 50, first, when processing is executed by the histogram calculation unit 103 in step S202 of FIG.
A histogram as shown in FIG. 17 is obtained. In the next step S203, the threshold determination unit 104 causes the above-described FIG.
7, two binarization thresholds 451 and 452 are determined. In the next step S204, the binarization unit 105 obtains a binarized image as shown in FIGS. 16B and 16C. The binarized image shown in FIG.
An image obtained by binarization based on the binarization threshold 451, and the binarized image shown in FIG.
52 is an image obtained by binarization based on No. 52.

【０１３０】そして、次のステップＳ２０５では、黒領
域抽出部１０７により、上記図１６（ｂ）に示す二値化
画像から黒領域３５１〜３５５が抽出される。これらの
黒領域３５１〜３５５のうち、黒領域３５１〜３５４に
ついては、当該領域が四角形状であるため、領域解析部
１０９は、これを認識し、これらの黒領域３５１〜３５
４を示す構造体Ｇｒｏｕｐの、黒領域部分の形状が四角
であるか否かを示すフラグ情報”ＳｈａｐｅＲｅｃｔａ
ｎｇｌｅ”を”ＴＲＵＥ（四角形状）”に設定する。こ
れにより、黒領域抽出部１０７では、上記図１６（ｃ）
に示す二値化画像からの黒領域抽出処理が行われる。こ
の結果、上記図１６（ｃ）に示す二値化画像からは、黒
領域３５６〜３５８が抽出される。In the next step S205, the black areas 351 to 355 are extracted from the binarized image shown in FIG. Of these black regions 351 to 355, the black regions 351 to 354 have a quadrangular shape. Therefore, the region analysis unit 109 recognizes this, and the black regions 351 to 355 are recognized.
The flag information “ShapeRecta” indicating whether or not the shape of the black area portion of the structure Group indicating the “4” is square.
ngle "is set to" TRUE (square shape) ".
A black region extraction process from the binarized image shown in FIG. As a result, black regions 356 to 358 are extracted from the binarized image shown in FIG.

【０１３１】黒領域３５６〜３５８は、その構造体Ｇｒ
ｏｕｐが示す領域（当該黒領域）が抽出された二値化画
像（上記図１６（ｃ）に示す二値化画像）を作成した閾
値より１つ前の閾値にて作成された二値化画像（同図
（ｂ）に示す二値化画像）上にほぼ同じオブジェクトが
存在するので、領域解析部１０９は、これを認識し、こ
れらの黒領域３５６〜３５８を示す構造体Ｇｒｏｕｐの
フラグ情報”ＳａｍｅＡｓＭｏｔｈｅｒ”を”ＴＲＵＥ
（存在する）”に設定する。The black regions 356 to 358 correspond to the structure Gr.
The binarized image created with the threshold value one before the threshold value at which the binarized image (the binarized image shown in FIG. 16C) from which the region indicated by “up” (the black region) is extracted is generated. Since almost the same object exists on the (binarized image shown in FIG. 9B), the area analysis unit 109 recognizes this and sets the flag information of the structure Group indicating these black areas 356 to 358. SameAsMother "to" TRUE
(Exists) ".

【０１３２】したがって、フラグ情報”ＳａｍｅＡｓＭ
ｏｔｈｅｒ”が”ＴＲＵＥ（存在する）”でない黒領域
３５１〜３５５に対して、再び本処理が実行され、領域
解析部１０９にて、構造体Ｇｒｏｕｐが示す領域が文字
領域であるか文字以外の領域（画像領域等）であるかを
示すフラグ情報”ＣｈａｒａＦｌｇ”、構造体Ｇｒｏｕ
ｐが示す領域が文字領域である場合に当該領域が反転文
字領域（下地が黒っぽく、文字部が白っぽい領域）であ
るか否かを示すフラグ情報”ＩｎｖｅｒｔＦｌｇ”、及
び構造体Ｇｒｏｕｐが示す領域を二値化するのにふさわ
しい閾値の情報”Ｔｈｒｅｓｈｏｌｄ”等への設定が行
われる。Therefore, the flag information "SameAsM"
This processing is executed again on the black areas 351 to 355 in which “other” is not “TRUE (existing)”, and the area analysis unit 109 determines whether the area indicated by the structure Group is a character area or a non-character area. (CharaFlg) indicating whether the image is an image region or the like, and a structure Grou
When the area indicated by p is a character area, the flag information “InvertFlg” indicating whether or not the area is an inverted character area (an area with a black base and a white character part) and an area indicated by the structure Group The threshold is set to information “Threshold” or the like suitable for conversion into a value.

【０１３３】上述のようにして、全ての黒領域が抽出さ
れ、それぞれの黒領域に対する二値化閾値や、黒領域に
対応する多値画像を反転するか否か等が決定される。As described above, all the black regions are extracted, and the binarization threshold value for each black region, whether to invert the multi-valued image corresponding to the black region, and the like are determined.

【０１３４】尚、黒領域部分の形状が四角であるか否か
を示すフラグ情報”ＳｈａｐｅＲｅｃｔａｎｇｌｅ”
は、例えば、対象黒領域が、図１８（ａ）に示すような
形状の領域（黒塗部分）である場合には、”ＴＲＵＥ
（四角形状）”に設定され、同図（ｂ）に示すような形
状の領域である場合には、”ＦＡＬＳＥ（四角形状でな
い）”に設定されるものとする。It should be noted that flag information "ShapeRectangle" indicating whether or not the shape of the black area is a square is shown.
For example, when the target black area is an area (black portion) having a shape as shown in FIG.
(Square shape) ", and in the case of an area having a shape as shown in FIG. 4B," FALSE (not square shape) "is set.

【０１３５】尚、上述した実施の形態では、二値メモリ
１０６を、二値化部１０５で得られる二値化画像分の容
量（閾値決定部１０４で得られる二値化閾値の数分の枚
数の画像を保持できる容量）を有するものとしている
が、このような構成の場合、多値画像を複数の二値化閾
値で二値化して、一度に複数の二値化画像を作成するこ
とができるため、処理高速の点では有効であるが、二値
化閾値の数分の枚数の画像を保持できる容量が必要とな
ってくる。しかしながら、次のような構成とすること
で、二値メモリ１０６を、１画像分の容量を有するもと
し、メモリの節約を行える。In the above-described embodiment, the binary memory 106 is used to store the capacity of the binary image obtained by the binarizing unit 105 (the number of binary images corresponding to the number of binary thresholds obtained by the threshold determining unit 104). In such a configuration, it is possible to binarize a multi-valued image with a plurality of binarization thresholds and create a plurality of binarized images at once. This is effective in terms of processing speed, but requires a capacity that can hold the number of images equal to the number of binarization thresholds. However, with the following configuration, the binary memory 106 can have the capacity for one image and can save the memory.

【０１３６】先ず、閾値決定部１０４は、最初の二値化
閾値を決定し、二値化部１０５は、当該二値化閾値での
二値化を実行し、その結果（二値化画像）を、１画像分
の容量を有する二値メモリ１０６へ保持する。次に、黒
領域抽出部１０７は、二値メモリ１０６内の二値化画像
に対する黒領域抽出処理をで実行する。その後、二値メ
モリ１０６内の二値化画像を破棄する。次に、閾値決定
部１０４は、最初の二値化閾値を決定する。二値化部１
０５は、当該二値化閾値での二値化を、黒領域抽出部１
０７での黒領域抽出結果に対応した領域のみに対して実
行する。そして、黒領域抽出部１０７は、上記黒領域抽
出結果に対応した領域の二値化画像に対して、黒領域抽
出処理を実行する。First, the threshold value decision unit 104 decides the first binarization threshold value, and the binarization unit 105 executes binarization using the binarization threshold value, and the result (binary image) Is stored in the binary memory 106 having a capacity for one image. Next, the black region extraction unit 107 executes a black region extraction process on the binarized image in the binary memory 106. After that, the binarized image in the binary memory 106 is discarded. Next, the threshold determination unit 104 determines an initial binarization threshold. Binarization unit 1
05 denotes binarization at the binarization threshold value by the black region extraction unit 1
The process is executed only for the region corresponding to the black region extraction result in 07. Then, the black region extraction unit 107 executes a black region extraction process on the binarized image of the region corresponding to the black region extraction result.

【０１３７】したがって、上述のような構成では、二値
メモリ１０６の容量は、１画像分の容量があればよく、
多量の二値メモリを所持できない場合に対応することが
できる。Therefore, in the configuration described above, the capacity of the binary memory 106 only needs to have a capacity for one image.
It is possible to cope with a case where a large amount of binary memory cannot be possessed.

【０１３８】また、上述した実施の形態では、対象とな
る多値画像を構成する全ての画素情報からヒストグラム
を作成し、そのヒストグラムから複数の二値化閾値を決
定し、それらの二値化閾値から複数の二値化画像を取得
するように構成したが、これに限られることはなく、例
えば、黒領域抽出部１０７により得られた黒領域につい
てのヒストグラムを作成し、そのヒストグラムから二値
化閾値を決定して二値化画像を取得するようにしてもよ
い。このような構成とすれば、さらに小領域への分割が
良好に行える利点がある。In the above-described embodiment, a histogram is created from all pieces of pixel information constituting a target multi-valued image, a plurality of binarization thresholds are determined from the histogram, and the binarization thresholds are determined. Is configured to acquire a plurality of binarized images from the image data. However, the present invention is not limited to this. For example, a histogram for a black region obtained by the black region A threshold may be determined to obtain a binarized image. With such a configuration, there is an advantage that division into smaller areas can be performed more favorably.

【０１３９】また、上述した実施の形態では、多値画像
から二値化画像を生成するように構成したが、これに限
られることはなく、３値化或いは４値化等の量子化画像
を生成して、その画像から最終的に領域分割する構成も
本発明に含まれる。In the above-described embodiment, the binarized image is generated from the multi-valued image. However, the present invention is not limited to this. The present invention includes a configuration in which the image is generated and the area is finally divided from the image.

【０１４０】また、本発明の目的は、上述した実施の形
態のホスト及び端末の機能を実現するソフトウェアのプ
ログラムコードを記憶した記憶媒体を、システム或いは
装置に供給し、そのシステム或いは装置のコンピュータ
（又はＣＰＵやＭＰＵ）が記憶媒体に格納されたプログ
ラムコードを読みだして実行することによっても、達成
されることは言うまでもない。この場合、記憶媒体から
読み出されたプログラムコード自体が本実施の形態の機
能を実現することとなり、そのプログラムコードを記憶
した記憶媒体は本発明を構成することとなる。プログラ
ムコードを供給するための記憶媒体としては、ＲＯＭ、
フロッピーディスク、ハードディスク、光ディスク、光
磁気ディスク、ＣＤ−ＲＯＭ、ＣＤ−Ｒ、磁気テープ、
不揮発性のメモリカード等を用いることができる。ま
た、コンピュータが読みだしたプログラムコードを実行
することにより、本実施の形態の機能が実現されるだけ
でなく、そのプログラムコードの指示に基づき、コンピ
ュータ上で稼動しているＯＳ等が実際の処理の一部又は
全部を行い、その処理によって本実施の形態の機能が実
現される場合も含まれることは言うまでもない。さら
に、記憶媒体から読み出されたプログラムコードが、コ
ンピュータに挿入された拡張機能ボードやコンピュータ
に接続された機能拡張ユニットに備わるメモリに書き込
まれた後、そのプログラムコードの指示に基づき、その
機能拡張ボードや機能拡張ユニットに備わるＣＰＵなど
が実際の処理の一部又は全部を行い、その処理によって
本実施の形態の機能が実現される場合も含まれることは
言うまでもない。Further, an object of the present invention is to supply a storage medium storing program codes of software for realizing the functions of the host and the terminal of the above-described embodiment to a system or an apparatus, and to provide a computer (a computer) of the system or the apparatus. It is needless to say that the present invention can also be achieved by a CPU or an MPU) reading and executing a program code stored in a storage medium. In this case, the program code itself read from the storage medium implements the functions of the present embodiment, and the storage medium storing the program code constitutes the present invention. ROM, as a storage medium for supplying the program code,
Floppy disk, hard disk, optical disk, magneto-optical disk, CD-ROM, CD-R, magnetic tape,
A non-volatile memory card or the like can be used. By executing the program code read out by the computer, not only the functions of the present embodiment are realized, but also the OS and the like running on the computer perform actual processing based on the instructions of the program code. It goes without saying that a part or all of the above is performed, and the processing realizes the function of the present embodiment. Further, after the program code read from the storage medium is written to a memory provided in an extension function board inserted into the computer or a function extension unit connected to the computer, the function extension is performed based on the instruction of the program code. It goes without saying that a CPU or the like provided in the board or the function expansion unit performs part or all of the actual processing, and the processing realizes the functions of the present embodiment.

【０１４１】[0141]

【発明の効果】以上説明したように本発明では、処理対
象となる多値画像（カラー文書を読み取って得られた画
像等）を領域分割する際、対象多値画像から、少なくと
も１つ以上の量子化画像（二値化画像等）を作成し、そ
れらの量子化画像から、所定値の画素群を含む領域（一
次元的に連続する黒画素の固まりを含む外接矩形の領域
等）を取得する。また、その領域内に存在する画素群の
始まりの画素（開始画素）及び終わりの画素（終了画
素）についてのヒストグラムをとり、これらの開始画素
及び終了画素の密集点を検出し、さらに、その密集点に
基づき分割して得られる領域内に存在する開始画素及び
終了画素の他次元方向の広がりを検出し、その検出結果
により、実際に分割を行うべきか否か判定して、領域分
割を実行するように構成した。そして、このようにして
取得した全ての領域についての属性、例えば、対象領域
が文字領域であるか否か、文字領域であるならば下地が
黒っぽく文字部が白っぽい反転領域であるか否か、下地
の平均画素値（平均下地色）、文字の平均画素値（平均
文字色）、対象領域を二値化する際に用いる最適な閾値
等の属性を解析するように構成した。これにより、処理
対象となる多値画像が、下地や文字領域部等、様々な色
部の組み合わせが存在する文書から得られたものであっ
ても、最適な領域分割を行うことができると共に、特
に、外接矩形を取ると他の余計な領域を含んでしまうよ
うな領域を有する画像であっても、良好に且つ効率的に
領域分割することが可能となり、その後の処理を矩形単
位で行うこともできる。As described above, in the present invention, when a multivalued image to be processed (such as an image obtained by reading a color document) is divided into regions, at least one Create a quantized image (such as a binarized image) and obtain an area including a pixel group of a predetermined value (a circumscribed rectangular area including a cluster of one-dimensionally continuous black pixels) from the quantized image. I do. Further, a histogram is obtained for the start pixel (start pixel) and the end pixel (end pixel) of the pixel group existing in the area, and the dense points of these start pixels and end pixels are detected. The start pixel and the end pixel existing in the region obtained by dividing based on the points are detected in other dimension directions, and based on the detection result, it is determined whether or not to actually perform the division, and the region is divided. It was configured to be. The attributes of all the regions obtained in this manner, for example, whether the target region is a character region, if the target region is a character region, whether the background is black and the character portion is a whitish inverted region, , The average pixel value (average background color), the average pixel value (average character color) of characters, and the optimal threshold value used when binarizing the target area are analyzed. Thereby, even if the multivalued image to be processed is obtained from a document in which various combinations of color parts such as a background and a character area part are present, optimal area division can be performed, and In particular, even if the image has an area that would include another unnecessary area if a circumscribed rectangle is taken, it is possible to divide the area satisfactorily and efficiently, and perform subsequent processing in units of rectangles. Can also.

[Brief description of the drawings]

【図１】本発明を適用した画像処理装置の構成を説明す
るための図である。FIG. 1 is a diagram illustrating a configuration of an image processing apparatus to which the present invention has been applied.

【図２】上記画像処理装置の動作を説明するためのフロ
ーチャートである。FIG. 2 is a flowchart for explaining the operation of the image processing apparatus.

【図３】上記画像処理装置の処理対象となる多値画像の
一例と、当該多値画像に対する二値化処理で発生する複
数の二値化閾値による二値画像を説明するための図であ
る。FIG. 3 is a diagram for explaining an example of a multi-value image to be processed by the image processing apparatus and a binary image based on a plurality of binarization thresholds generated in a binarization process on the multi-value image. .

【図４】上記多値画像から得られるヒストグラムを説明
するための図である。FIG. 4 is a diagram for explaining a histogram obtained from the multi-value image.

【図５】上記画像処理装置での黒領域抽出処理を説明す
るためのフローチャートである。FIG. 5 is a flowchart illustrating a black region extraction process in the image processing apparatus.

【図６】上記黒領域抽出処理での黒画素の固まりの組を
検出する処理を説明するためのフローチャートである。FIG. 6 is a flowchart for explaining a process of detecting a set of black pixels in the black region extraction process.

【図７】上記画像処理装置の黒領域座標保持部に保持さ
れる情報の一例を説明するための図である。FIG. 7 is a diagram illustrating an example of information stored in a black area coordinate storage unit of the image processing apparatus.

【図８】上記黒領域抽出処理を具体的に説明するための
図である。FIG. 8 is a diagram for specifically explaining the black region extraction processing.

【図９】上記黒領域抽出処理を実行する黒領域抽出部の
構成を示すブロック図である。FIG. 9 is a block diagram illustrating a configuration of a black area extraction unit that performs the black area extraction processing.

【図１０】上記画像処理装置での黒領域分割処理を説明
するためのフローチャートである。FIG. 10 is a flowchart illustrating a black area dividing process in the image processing apparatus.

【図１１】上記黒領域分割処理で対象となる二値化画像
の一例と、当該画像から得られる黒画素の固まりの組の
始めの画素及び終わりの画素のヒストグラムを説明する
ための図である。FIG. 11 is a diagram for explaining an example of a binarized image to be subjected to the black region division processing, and a histogram of the first pixel and the last pixel of a set of black pixels obtained from the image; .

【図１２】上記黒領域分割処理での領域分割の結果の一
例を説明するための図である。FIG. 12 is a diagram illustrating an example of a result of area division in the black area division processing.

【図１３】上記黒領域分割処理での領域分割の結果の他
の例を説明するための図である。FIG. 13 is a diagram for explaining another example of a result of area division in the black area division processing.

【図１４】上記黒領域分割処理での領域分割の結果に基
づいて、矩形単位でＯＣＲ用に二値化した結果を説明す
るための図である。FIG. 14 is a diagram for describing a result of binarization for OCR in a rectangular unit based on a result of area division in the black area division processing.

【図１５】上記黒領域抽出処理及び上記黒領域分割処理
にて得られた黒領域が入れ子になった状態を説明するた
めの図である。FIG. 15 is a diagram for explaining a state in which black areas obtained by the black area extraction processing and the black area division processing are nested.

【図１６】上記黒領域抽出処理での階層的な黒領域抽出
処理（反転処理による黒領域の抽出処理）を説明するた
めの図である。FIG. 16 is a diagram for explaining hierarchical black area extraction processing (black area extraction processing by inversion processing) in the black area extraction processing.

【図１７】上記黒領域抽出処理（反転処理による黒領域
の抽出処理）でのヒストグラムの一例を説明するための
図である。FIG. 17 is a diagram illustrating an example of a histogram in the above-described black area extraction processing (black area extraction processing by inversion processing).

【図１８】上記黒領域座標保持部に保持される情報の、
黒領域部分の形状が四角であるか否かを示すフラグ情
報”ＳｈａｐｅＲｅｃｔａｎｇｌｅ”について説明する
ための図である。FIG. 18 shows information stored in the black area coordinate storage unit.
It is a figure for explaining flag information "ShapeRectangle" which shows whether the shape of a black area part is a square.

【図１９】従来の画像領域分割方法を説明するための図
である。FIG. 19 is a diagram for explaining a conventional image area dividing method.

【図２０】上記画像領域分割方法での対象画像（二値化
が困難な多値画像）の輝度分布状態を説明するための図
である。FIG. 20 is a diagram for describing a luminance distribution state of a target image (a multi-valued image that is difficult to binarize) in the image area dividing method.

【図２１】アダプティブ閾値二値化方式での従来の画像
領域分割方法を説明するための図である。FIG. 21 is a diagram for explaining a conventional image area dividing method using an adaptive threshold binarization method.

【図２２】上記画像領域分割方法で対象となる画像の一
例を説明するための図である。FIG. 22 is a diagram for explaining an example of an image to be processed by the image area dividing method.

【図２３】上記画像に対して上記画像領域分割方法によ
る処理を行った結果に基づいて、矩形単位でのＯＣＲ用
に二値化した結果を説明するための図である。FIG. 23 is a diagram for describing a result of binarization for OCR in a rectangular unit based on a result of performing the image region dividing method on the image.

[Explanation of symbols]

１００画像処理装置１０１輝度変換部１０２多値メモリ１０３ヒストグラム１０４閾値決定部１０５二値化部１０６二値メモリ１０７黒領域抽出部１０８黒領域座標保持部１０９領域解析部１１０黒領域分割部 REFERENCE SIGNS LIST 100 Image processing device 101 Luminance conversion unit 102 Multi-valued memory 103 Histogram 104 Threshold value determination unit 105 Binarization unit 106 Binary memory 107 Black region extraction unit 108 Black region coordinate holding unit 109 Area analysis unit 110 Black region division unit

Claims

[Claims]

1. An image processing apparatus for dividing an area of a multi-valued image, comprising: a quantizing means for acquiring a plurality of quantized images from the multi-valued image; and a quantized image obtained by the binarizing means. A region extracting means for extracting a region including a pixel group having a predetermined value, and a region obtained by the region extracting unit, based on a density of start pixels and end pixels of a pixel group present in the region. An image processing apparatus comprising: an area dividing unit that divides an image; and an area analyzing unit that analyzes an attribute of each area obtained by the area extracting unit and the area dividing unit.

2. The image processing apparatus according to claim 1, wherein said quantization means acquires a binarized image as said quantized image.

3. The image processing apparatus according to claim 1, wherein said quantization means acquires a quantized image of an area portion of said multi-valued image based on an area extraction result of said area extraction means. .

4. The image processing apparatus according to claim 1, wherein said quantization means acquires said plurality of quantized images based on a plurality of thresholds obtained from a histogram of said multi-valued image. .

5. An area extracting means, comprising: a detecting means for detecting, from the quantized image, a pixel group in which pixels having the predetermined value are one-dimensionally continuous; and coordinates of the pixel group detected by the detecting means. First holding means for holding information; second holding means for holding coordinate information of a pixel group already detected by the detection means; information in the first holding means; and the second holding means 2. The image processing apparatus according to claim 1, further comprising a combining unit configured to perform a combining process of the information in the image data.

6. The image processing apparatus according to claim 5, wherein said detecting means detects said area for every M lines on said quantized image.

7. The area extracted by the area extracting means includes a circumscribed rectangular area including a one-dimensionally continuous pixel group, and the area dividing means includes an area extracted by the area extracting means. A first detection unit that detects a dense point of the start pixel and the end pixel from a histogram of the start pixel and the end pixel of the pixel group included in the pixel group; and a division obtained based on a detection result of the first detection unit. For a start pixel and an end pixel of a pixel group present in the region, a second detection unit that detects a spread in a dimensional direction different from the one-dimensional direction, based on a detection result by the second detection unit, 2. A dividing means for executing region division.
An image processing apparatus as described in the above.

8. An image processing system in which a plurality of devices are communicably connected, wherein at least one of the plurality of devices is the image processing apparatus according to any one of claims 1 to 7. An image processing system having a function.

9. An image processing method for dividing a multi-valued image into regions, wherein at least one or more quantized images are created from the multi-valued image, and a cluster of black pixels is formed from the quantized image. And an area dividing step of dividing the black area obtained by the area extracting step on the basis of the density of the start pixels and end pixels of the cluster of black pixels existing in the black area. And an area analysis step of analyzing attributes of all areas obtained by the area extraction step and the area division step.

10. The image processing method according to claim 9, wherein said region extracting step includes a step of creating a binarized image as said quantized image.

11. The image processing method according to claim 9, wherein a part of the at least one quantized image is not a quantized image obtained from the entire surface of the multi-valued image.

12. The region extracting step includes: a black detecting step of detecting a one-dimensionally continuous black pixel cluster from the quantized image; and a coordinate information of the black pixel cluster detected by the black detecting step. A first holding step of holding; a second holding step of holding coordinate information of a cluster of black pixels already detected by the black detection step; a holding information by the first holding step; and the second holding The image processing method according to claim 9, further comprising a combining step of performing a combining process of the held information by the steps.

13. The image processing method according to claim 12, wherein the black detection step includes a step of executing the detection processing for each M lines on the quantized image.

14. The black area extracted in the area extraction step includes a circumscribed rectangular area including a cluster of one-dimensionally continuous black pixels, and the area division step is extracted in the area extraction step. A first detection step of detecting a dense point of the start pixel and the end pixel from a histogram of a start pixel and an end pixel of a cluster of black pixels included in the black region; and a detection result of the first detection step. A second detection step of dividing the black region based on the first region and detecting a spread of a start pixel and an end pixel of a pixel group existing in the divided region in a dimensional direction different from the one-dimensional direction; 10. The image processing method according to claim 9, further comprising a dividing step of dividing the black region based on a detection result in the detecting step.

15. A computer readable storage of a processing program for implementing the function of the image processing apparatus according to claim 1 or the function of the image processing system according to claim 8. A storage medium characterized by the above-mentioned.

16. A storage medium, wherein the processing steps of the image processing method according to claim 9 are readable by a computer.