JP3020293B2

JP3020293B2 - Attribute determination method

Info

Publication number: JP3020293B2
Application number: JP3052842A
Authority: JP
Inventors: 直弘天本; 節正広垣; 明利塚本
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1991-03-18
Filing date: 1991-03-18
Publication date: 2000-03-15
Anticipated expiration: 2015-03-15
Also published as: JPH04288773A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、ファクシミリ等の通信
機器や文書画像データベース入力装置、光学的文字読取
り装置（ＯＣＲ）等において、文書画像中の写真領域及
び図表領域といった属性を判別する文書画像の属性判別
方法に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a document image for identifying attributes such as a photograph area and a chart area in a document image in a communication device such as a facsimile, a document image database input device, an optical character reading device (OCR), or the like. The attribute determination method.

【０００２】[0002]

【従来の技術】従来、この種の属性判別方法としては、
例えば特開昭６２−７１３７９号公報に記載されるもの
があった。2. Description of the Related Art Conventionally, this type of attribute discrimination method includes:
For example, there is one described in JP-A-62-71379.

【０００３】この文書画像の属性判別方法では、文書画
像データを入力し、主走査方向（例えば、横方向）に黒
画素を計数（カウント）して閾値を超えるラインを検出
し、該カウント値が閾値以下の白ラインが所定個数連続
する状態を判定して第１の領域切り出しを行う。この第
１の領域切り出し内で、副走査方向（例えば、縦方向）
に黒画素をカウントして該カウントが閾値を超える列を
検出し、該計カウントが閾値以下の白列が所定個数連続
する状態を判定して第２の領域切り出しを行う。In this document image attribute discriminating method, document image data is input, black pixels are counted (counted) in a main scanning direction (for example, a horizontal direction), and lines exceeding a threshold value are detected. A state in which a predetermined number of white lines equal to or less than the threshold value continues is determined, and the first area is cut out. In the first region cutout, the sub-scanning direction (for example, the vertical direction)
, A row in which the count exceeds the threshold is detected, and a state in which a predetermined number of white rows in which the total count is equal to or less than the threshold is determined to perform a second region cutout.

【０００４】さらに、第２の領域切り出し内で、第１の
領域切り出しと同様な処理により、第３の領域切り出し
を行い、この第３の領域切り出し内で、第２の領域切り
出しと同様な処理により、第４の領域切り出しを行う。
そこで、この第４の領域切り出しで検出された領域につ
いて、その領域のランレングス情報、及び黒画素率情報
により、文字領域、写真領域、図表領域等の属性を判別
している。Further, in the second area cutout, a third area cutout is performed by the same processing as the first area cutout, and the same processing as the second area cutout is performed in the third area cutout. Performs the fourth region cutout.
Therefore, for the area detected by the fourth area cutout, attributes such as a character area, a photograph area, and a chart area are determined based on the run length information and the black pixel ratio information of the area.

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、従来の
属性判別方法では、次のような課題があった。However, the conventional attribute discriminating method has the following problems.

【０００６】（ａ）従来の属性判別方法では、領域切り
出し処理において、対象に応じて様々な閾値を任意に設
定する必要がある。例えば、個々の論文誌に対して領域
切り出しを行う場合、それらの閾値を各論文誌に応じた
適切な値に設定し直さなければならず、その値の選定に
手間がかかる。そのため、領域切り出し処理後に行う属
性の判別処理の効率が低いという問題があった。(A) In the conventional attribute discriminating method, it is necessary to arbitrarily set various thresholds in accordance with a target in the region cutout processing. For example, when region segmentation is performed for individual journals, those thresholds must be reset to appropriate values according to each journal, and it takes time to select such values. Therefore, there is a problem that the efficiency of the attribute discrimination process performed after the region clipping process is low.

【０００７】（ｂ）前記（ａ）の問題を解決するため、
本願出願人は、先に特開平１−２６４６４９号明細書に
おいて属性判別方法を提案した。この提案では、最適２
値化処理により、文書画像を入力し、文字・フィールド
セパレータ領域は像域分離、図表・写真領域はディザ処
理（網点処理）を施し、それら両者の画像を合成する。
この最適２値化処理により２値化された画像を走査して
領域抽出した後、図表・写真候補と判定された外接矩形
について２値化された画像を走査し、白・黒反転回数に
よって写真領域と図表領域とを判別する。(B) To solve the problem (a),
The applicant of the present application has previously proposed an attribute discrimination method in Japanese Patent Application Laid-Open No. 1-264649. In this proposal, optimal 2
The document image is input by the binarization processing, the character / field separator area is subjected to image area separation, and the figure / table / photograph area is subjected to dither processing (halftone processing), and these two images are combined.
After scanning the image binarized by the optimal binarization processing to extract an area, the binarized image determined as a chart / photo candidate is scanned with a binarized image, and the image is scanned according to the number of white / black inversions. An area and a chart area are determined.

【０００８】ところが、この属性判別方法では、ディザ
処理した画像を縦方向及び横方向に走査する必要があ
り、処理時間がかかるという問題を有している。即ち、
先に提案した属性判別方法では、例えば、４００ｄｐｉ
（３４０８×４６２４画素）の画像を処理しており、し
かも領域抽出により得られた矩形内部の画像を必ず縦及
び横方向に各々一回ずつ全面走査する必要があり、この
処理のみでも１分近い時間を必要とする。従って、領域
分割全体として考えると、１枚の文書画像を処理するの
に、数分という時間を要してしまい、処理に時間がかか
りすぎる。However, this attribute discrimination method has a problem that it is necessary to scan a dither-processed image in a vertical direction and a horizontal direction, which takes a long processing time. That is,
In the attribute discrimination method proposed earlier, for example, 400 dpi
(3408 x 4624 pixels) image is processed, and the entire image inside the rectangle obtained by the region extraction must be completely scanned once each in the vertical and horizontal directions, and this processing alone is close to one minute. Needs time. Therefore, considering the whole area division, it takes several minutes to process one document image, and the processing takes too much time.

【０００９】このように、短い処理時間で、写真領域と
図表領域の属性を比較的簡単に判別することが困難であ
り、技術的に充分満足のゆく属性判別方法を得ることが
困難であった。As described above, it is difficult to determine the attributes of a photograph area and a chart area relatively easily in a short processing time, and it is difficult to obtain an attribute determination method that is technically sufficiently satisfactory. .

【００１０】本発明は、前記従来技術が持っていた課題
として、短い処理時間で比較的簡単に写真領域と図表領
域との属性を判別することが困難な点について解決した
属性判別方法を提供するのである。The present invention provides an attribute discriminating method which solves the problem of the prior art that it is difficult to discriminate the attributes of a photograph area and a chart area relatively easily with a short processing time. It is.

【００１１】[0011]

【課題を解決するための手段】前記課題を解決するため
に、第１の発明は、文書画像を２値化して２値画像を作
成し、前記２値画像の情報領域（例えば、黒画素領域）
を外接矩形化して外接矩形を検出し、前記外接矩形の写
真領域と図表領域の属性を判別する属性判別方法におい
て、次のような手段を講じている。According to a first aspect of the present invention, a document image is binarized to generate a binary image, and an information area (for example, a black pixel area) of the binary image is created. )
In the attribute discriminating method for detecting the circumscribed rectangle by converting the circumscribed rectangle into circumscribed rectangles and discriminating the attributes of the photograph area and the chart area of the circumscribed rectangle, the following means are taken.

【００１２】前記２値画像の全体を縮小して縮小画像を
作成し、前記外接矩形の特徴とその外接矩形内の２値画
像の特徴から、写真候補領域及び図表候補領域の第１の
外接矩形を抽出する。そして、前記第１の外接矩形か
ら、前記縮小画像に該当する第２の外接矩形を求め、前
記第２の外接矩形の内部を走査してその矩形内の全黒画
素数をカウントする。このカウントした全黒画素数が前
記第２の外接矩形の面積に占める割合と閾値とを比較す
ることにより、写真領域と図表領域を分離するようにし
ている。A reduced image is created by reducing the entirety of the binary image, and the first circumscribed rectangle of the photograph candidate area and the chart candidate area is obtained from the features of the circumscribed rectangle and the features of the binary image within the circumscribed rectangle. Is extracted. Then, a second circumscribed rectangle corresponding to the reduced image is obtained from the first circumscribed rectangle, and the inside of the second circumscribed rectangle is scanned to count the number of all black pixels in the rectangle. By comparing the ratio of the counted total number of black pixels to the area of the second circumscribed rectangle with the threshold value, the photographic region and the chart region are separated.

【００１３】第２の発明は、第１の発明と同様に、従来
の属性判別方法において、前記外接矩形の特徴とその外
接矩形内の２値画像の特徴から、写真候補領域及び図表
候補領域の第１の外接矩形を抽出し、前記第１の外接矩
形のみを縮小して縮小画像を作成する。そして、前記縮
小画像に該当する第２の外接矩形を求め、前記第２の外
接矩形の内部を走査してその矩形内の全黒画素数をカウ
ントし、そのカウントした黒画素数が前記第２の外接矩
形の面積に占める割合と閾値とを比較することにより、
写真領域と図表領域を分離するようにしている。According to a second aspect of the present invention, as in the first aspect, in the conventional attribute discriminating method, a photograph candidate area and a chart candidate area are determined from the characteristics of the circumscribed rectangle and the characteristics of the binary image in the circumscribed rectangle. A first circumscribed rectangle is extracted, and only the first circumscribed rectangle is reduced to create a reduced image. Then, a second circumscribing rectangle corresponding to the reduced image is obtained, the inside of the second circumscribing rectangle is scanned, and the number of all black pixels in the rectangle is counted. By comparing the percentage of the area of the circumscribed rectangle with the threshold value,
The photograph area and the chart area are separated.

【００１４】第３の発明は、第１又は第２の発明におい
て、前記２値画像を作成する方法として、前記文書画像
を多値画像データの形で読み取り、該多値画像データを
その特徴及び性質を損なうことなく最適２値化して求め
るようにしている。According to a third aspect, in the first or second aspect, as a method for creating the binary image, the document image is read in the form of multi-valued image data, and the multi-valued image data is read with its characteristics and The optimal binarization is performed without deteriorating the properties.

【００１５】第４の発明は、第１又は第２の発明におい
て、前記２値画像を作成する方法として、前記文書画像
を単純２値化、又はディザ処理のみにより２値化して求
めるようにしている。According to a fourth aspect of the present invention, in the first or second aspect, the document image is obtained by binarizing the document image by simple binarization or dither processing alone. I have.

【００１６】第５の発明は、第１又は第２の発明におい
て、画素間の論理和（ＯＲ）を取って前記縮小画像を作
成するようにしている。In a fifth aspect based on the first or second aspect, the reduced image is created by taking a logical sum (OR) between pixels.

【００１７】[0017]

【作用】第１の発明によれば、以上のように属性判別方
法を構成したので、文書画像から２値画像が作成され、
さらにその２値画像から外接矩形が検出されると、該２
値画像の全体が縮小されて縮小画像が作成される。さら
に、縮小前の外接矩形の特徴とその外接矩形内の２値画
像の特徴から、写真候補領域及び図表候補領域からなる
第１の外接矩形が抽出される。そして、第１の外接矩形
から、縮小画像に該当する第２の外接矩形が求められ、
その第２の外接矩形内の走査によってその矩形内の全黒
画素数がカウントされ、その全黒画素数が第２の外接矩
形の面積に占める割合によって写真領域と図表領域とが
分離される。これにより、縮小画像の走査でよいこと、
さらにその全面走査の回数も１回でよいため、処理の簡
単化と処理時間の短縮化が図れる。According to the first aspect of the present invention, since the attribute determining method is configured as described above, a binary image is created from a document image.
Further, when a circumscribed rectangle is detected from the binary image,
The entire value image is reduced to create a reduced image. Furthermore, a first circumscribed rectangle including a photograph candidate area and a chart candidate area is extracted from the features of the circumscribed rectangle before reduction and the features of the binary image in the circumscribed rectangle. Then, a second circumscribed rectangle corresponding to the reduced image is obtained from the first circumscribed rectangle,
By scanning the second circumscribed rectangle, the number of all black pixels in the rectangle is counted, and the photograph area and the chart area are separated by the ratio of the total number of black pixels to the area of the second circumscribed rectangle. Thereby, scanning of the reduced image is sufficient,
Further, since the number of times of the entire surface scan may be one, the processing can be simplified and the processing time can be shortened.

【００１８】第２の発明によれば、写真候補領域及び図
表候補領域からなる第１の外接矩形を抽出した後、該第
１の外接矩形のみを縮小して縮小画像を作成する。そし
て、この縮小画像に該当する第２の外接矩形を求め、第
１の発明と同様に、該第２の外接矩形内の全黒画素数を
カウントし、その全黒画素数が第２の外接矩形の面積に
占める割合によって写真領域と図表領域を分離してい
る。このように、画像の縮小領域を図表・写真領域に限
定することにより、より処理の高速化が図れる。第３の
発明によれば、最適２値化した２値画像データを用いて
属性の判別を行っているので、的確な属性の判別が行え
る。According to the second aspect, after extracting the first circumscribed rectangle including the photograph candidate area and the chart candidate area, only the first circumscribed rectangle is reduced to create a reduced image. Then, a second circumscribed rectangle corresponding to the reduced image is obtained, and the number of all black pixels in the second circumscribed rectangle is counted, as in the first invention. The photograph area and the chart area are separated according to the proportion of the rectangular area. By limiting the reduced area of the image to the chart / photograph area as described above, the processing speed can be further increased. According to the third aspect, the attribute is determined using the optimally binarized binary image data, so that the attribute can be accurately determined.

【００１９】第４の発明によれば、単純２値化あるいは
ディザ処理のみにより２値化した２値画像データを用い
て属性判別を行っているので、２値化処理の簡単化と、
その後の処理の簡単化により、属性判別処理のより高速
化が図れる。According to the fourth aspect, attribute determination is performed using binary image data binarized only by simple binarization or dither processing. Therefore, the binarization processing can be simplified.
By simplifying the subsequent processing, the speed of the attribute determination processing can be further increased.

【００２０】第５の発明によれば、疑似中間調・網点表
現等の画像データのＯＲを取って縮小画像を作成するこ
とにより、ほとんどの領域が黒画素で占められ、それに
よって縮小処理の簡単化が図れる。According to the fifth aspect of the present invention, a reduced image is created by ORing image data such as pseudo halftone and halftone dot representation, so that most of the area is occupied by black pixels. It can be simplified.

【００２１】[0021]

【実施例】図２は、本発明の第１の実施例を示す属性判
別装置の概略の機能ブロック図である。FIG. 2 is a schematic functional block diagram of an attribute discriminating apparatus showing a first embodiment of the present invention.

【００２２】この属性判別装置では、ＣＣＤ（Ｃｈａｒ
ｇｅＣｏｕｐｌｅｄＤｅｖｉｃｅ）等のイメージセ
ンサを有し、該イメージセンサにより文書画像を走査し
て多値画像データを読み取る読取部１を備えている。こ
の読取部１の出力側には、読み取られた多値画像データ
を格納する多値画像メモリ２と、最適２値化手段３とが
接続されている。最適２値化手段３は、読み取られた多
値画像データをその特徴及び性質を損なうことなく２値
画像に変換する機能を有し、その出力側には、変換され
た２値画像を格納する２値画像メモリ４と、外接矩形検
出手段５とが接続されている。In this attribute discriminating apparatus, a CCD (Char)
An image sensor such as a Ge Coupled Device) is provided, and a reading unit 1 that reads a multi-value image data by scanning a document image with the image sensor is provided. The output side of the reading unit 1 is connected to a multi-valued image memory 2 for storing the read multi-valued image data, and an optimum binarizing unit 3. The optimal binarizing means 3 has a function of converting the read multi-valued image data into a binary image without impairing its characteristics and properties, and stores the converted binary image on its output side. The binary image memory 4 and the circumscribed rectangle detecting means 5 are connected.

【００２３】外接矩形検出手段５は、最適２値化して得
られた２値画像の情報領域（例えば、黒画素領域）を外
接矩形化して抽出する機能を有し、その出力側には、抽
出された外接矩形の位置を保存する外接矩形位置メモリ
６、属性判別手段７、及び縮小２値画像メモリ８が接続
されている。The circumscribed rectangle detecting means 5 has a function of circumscribing and extracting an information area (for example, a black pixel area) of a binary image obtained by optimal binarization. A circumscribed rectangle position memory 6 for storing the position of the circumscribed rectangle thus determined, an attribute discriminating means 7, and a reduced binary image memory 8 are connected.

【００２４】属性判別手段７は、抽出された外接矩形の
２値画像データを全面縮小して縮小画像を作成し、その
縮小画像を最小２値化画像メモリ８に格納すると共に、
抽出された外接矩形及び縮小画像の特徴等から、外接矩
形をフィールドセパレータ領域、写真領域、図表領域、
及びその他の文字領域に分離する機能を有している。こ
の属性判別手段７の出力側には、分類されたフィールド
セパレータ領域の位置を保存するフィールドセパレータ
位置メモリ９、写真領域の位置を保存する写真領域位置
メモリ１０、図表領域の位置を保存する図表領域位置メ
モリ１１、及び文字統合手段１２が接続されている。The attribute discriminating means 7 reduces the entire area of the extracted binary image data of the circumscribed rectangle to create a reduced image, stores the reduced image in the minimum binary image memory 8, and
From the extracted circumscribed rectangle and the characteristics of the reduced image, etc., the circumscribed rectangle is defined as a field separator area, a photograph area, a chart area,
And another character area. On the output side of the attribute discriminating means 7, there are provided a field separator position memory 9 for storing the positions of the classified field separator regions, a photograph region position memory 10 for storing the positions of the photograph regions, and a chart region for storing the positions of the chart regions. The position memory 11 and the character integrating means 12 are connected.

【００２５】文字統合手段１２は、文字と判別された外
接矩形について行・文字ブロックの統合を行い、その統
合結果を文字領域として抽出する機能を有し、その出力
側には、文字領域の位置を保存する文字領域位置メモリ
１３が接続されている。The character integrating means 12 has a function of integrating a line and a character block with respect to a circumscribed rectangle determined to be a character, and extracting a result of the integration as a character area. Is connected to a character area position memory 13 for storing the.

【００２６】次に、図２の装置を用いた属性判別方法に
ついて説明する。Next, an attribute determining method using the apparatus shown in FIG. 2 will be described.

【００２７】文書画像が読取部１に入力されると、該読
取部１では、内部のイメージセンサによって入力文書画
像を走査し、多値画像データを読み取り、その多値画像
データを多値画像メモリ２に格納する。最適２値化手段
３では、多値画像メモリ２に格納された多値画像データ
を入力し、文字・フィールドセパレータ領域は像域分
離、図表・写真領域はディザ処理を施し、それら両者の
画像を合成する処理を行う。When a document image is input to the reading unit 1, the reading unit 1 scans the input document image by an internal image sensor, reads multi-value image data, and stores the multi-value image data in a multi-value image memory. 2 is stored. The optimal binarizing means 3 inputs the multi-valued image data stored in the multi-valued image memory 2, performs image area separation on the character / field separator area, performs dither processing on the chart / photograph area, and processes both images. Perform processing for combining.

【００２８】即ち、最適２値化手段３では、入力多値画
像データにおいて、注目画素を中心とした局所領域での
濃度差分値の絶対値の大きさから、非中間調画像部分と
中間調画像部分とに分類する。そして、非中間調画像部
分と判別された場合、注目画素を固定閾値によって２値
化し、一方、中間調画像部分と判定された場合、注目画
素をディザマトリクス閾値によって２値化する。That is, the optimal binarizing means 3 determines the non-halftone image portion and the halftone image portion in the input multi-valued image data from the magnitude of the absolute value of the density difference value in the local region centering on the target pixel. Classify into parts and If the pixel is determined to be a non-halftone image portion, the pixel of interest is binarized using a fixed threshold. If the pixel is determined to be a halftone image portion, the pixel of interest is binarized using a dither matrix threshold.

【００２９】このような最適２値化処理が行われると、
文字領域及び図表領域といった非中間調画像部分は、白
と黒の境のエッジ部分のくっきりとした画像となり、写
真領域等の中間調画像部分は、新聞等で見られるディザ
画像となり、これらの合成２値画像として得られる。こ
の合成２値画像のデータは、２値画像メモリ４に格納さ
れる。When such an optimal binarization process is performed,
Non-halftone image parts such as character areas and chart areas become sharp images at the edges of the boundary between white and black, and halftone image parts such as photographic areas become dither images seen in newspapers and the like. Obtained as a binary image. The data of the composite binary image is stored in the binary image memory 4.

【００３０】外接矩形検出手段５では、２値画像メモリ
４に格納された２値画像データを、例えば主走査方向
（横方向）Ｘ及び副走査方向（縦方向）Ｙに走査する。
そして、２値画像データの全領域において、横方向Ｘに
黒画素が１つでも存在する黒ラインを検出し、次に黒画
素が１つも存在しない白ラインが閾値ＴＨ以上連続する
までの領域を切り出す（ステップＳＴ１）。今度は、そ
の切り出された領域において、縦方向Ｙに黒画素が１つ
でも存在する黒カラムを検出し、黒画素が１つも存在し
ない白カラムが閾値ＴＨ以上連続するまでの領域を切り
出す（ステップＳＴ２）。The circumscribed rectangle detecting means 5 scans the binary image data stored in the binary image memory 4 in, for example, the main scanning direction (horizontal direction) X and the sub-scanning direction (vertical direction) Y.
Then, in all areas of the binary image data, a black line in which at least one black pixel exists in the horizontal direction X is detected, and an area until a white line in which no black pixel exists continues more than the threshold TH is determined. Cut out (step ST1). Next, a black column in which at least one black pixel exists in the vertical direction Y is detected in the cut-out region, and a region is cut out until a white column in which no black pixel exists continues beyond the threshold value TH (step ST2).

【００３１】さらに、その切り出された領域において、
前記と同様に、ステップＳＴ１とステップＳＴ２を繰り
返し実行し、これ以上の切り出しが行えないという所ま
で、繰り返し処理を続ける。そして、最終的に切り出さ
れた領域を外接矩形として検出し、その位置を示す始点
座標及び終点座標を外接矩形位置メモリ６に格納する。Further, in the cut area,
In the same manner as described above, the steps ST1 and ST2 are repeatedly executed, and the repetition processing is continued until further cutout cannot be performed. Then, the finally cut out area is detected as a circumscribed rectangle, and the start point coordinates and the end point coordinates indicating the position are stored in the circumscribed rectangle position memory 6.

【００３２】属性判別手段７では、２値画像メモリ４に
格納された２値画像を縮小して縮小画像を形成し、その
縮小画像を縮小２値画像メモリ８に格納する。さらに、
外接矩形位置メモリ６に格納された全ての外接矩形につ
いて、その外接矩形特徴とその矩形内の特徴から、フィ
ールドセパレータ領域と、写真・図形領域と、その他の
文字領域との３つの属性に分類し、さらに最小２値画像
メモリ８に格納された縮小画像を参照して写真・図表領
域から、写真領域と図表領域とを判別する。The attribute discriminating means 7 reduces the binary image stored in the binary image memory 4 to form a reduced image, and stores the reduced image in the reduced binary image memory 8. further,
All the circumscribed rectangles stored in the circumscribed rectangle position memory 6 are classified into three attributes of a field separator area, a photograph / graphic area, and other character areas from the circumscribed rectangle features and the features within the rectangle. Further, referring to the reduced image stored in the minimum binary image memory 8, a photograph area and a chart area are determined from the photograph and chart area.

【００３３】属性判別手段７によって分類されたフィー
ルドセパレータ領域、写真領域、及び図形領域の各位置
を示す始点座標及び終点座標は、それぞれフィールドセ
パレータ位置メモリ９、写真領域位置メモリ１０、及び
図表領域位置メモリ１１に格納される。さらに、文字領
域については、文字統合手段１２により、文字領域と判
別された外接矩形を行・文字ブロックに統合され、その
結果が文字領域として抽出されて文字領域位置メモリ１
３に格納される。The start point coordinates and the end point coordinates indicating the positions of the field separator area, the photograph area, and the graphic area classified by the attribute discriminating means 7 are stored in the field separator position memory 9, the photograph area position memory 10, and the chart area position, respectively. Stored in the memory 11. Further, with regard to the character area, the circumscribed rectangle determined as the character area is integrated into a line / character block by the character integration means 12, and the result is extracted as a character area and the character area position memory 1 is extracted.
3 is stored.

【００３４】図１は、図２中の属性判別手段７で処理さ
れる属性判別処理のフローチャートである。FIG. 1 is a flowchart of the attribute discriminating process performed by the attribute discriminating means 7 in FIG.

【００３５】この属性判別処理では、ステップＳ２０に
おいて、図２の２値画像メモリ４に格納された２値画像
データを入力し、ステップＳ２１で、該２値画像データ
を全面縮小する。この縮小の割合は、例えばプログラム
で実現するときには、４００ＤＰＩの画像で１／８（＝
５０ＤＰＩ）程度が適当である。次に、ステップＳ２２
において、２値画像メモリ４に格納された属性判別すべ
き領域の外接矩形の始点座標（ＸＳｉ，ＹＳｉ）及び終
点座標（ＸＥｉ，ＹＥｉ）を入力する。ここで、Ｘは主
走査方向（横方向）、Ｙは副走査方向（縦方向）であ
り、座標番号ｉはｉ＝１，２，…，ｎである。In this attribute discriminating process, in step S20, the binary image data stored in the binary image memory 4 of FIG. 2 is input, and in step S21, the binary image data is reduced in size. When the reduction ratio is realized by a program, for example, 1/8 (= 400 DPI image)
50 DPI) is appropriate. Next, step S22
, The start point coordinates (XSi, YSi) and the end point coordinates (XEi, YEi) of the circumscribed rectangle of the area whose attribute is to be determined and stored in the binary image memory 4 are input. Here, X is the main scanning direction (horizontal direction), Y is the sub-scanning direction (vertical direction), and the coordinate number i is i = 1, 2,..., N.

【００３６】ステップＳ２３において、入力した外接矩
形の縦ＨＩ（＝ＹＥｉ−ＹＳｉ＋１）と横ＷＤ（＝ＸＥ
ｉ−ＸＳｉ＋１）との縦横比ＷＤ／ＨＩが、ある固定閾
値ＴＨ１より大きいか否かを比較する。外接矩形の縦横
比ＷＤ／ＨＩが固定閾値ＴＨ１よりも大きければ、その
外接矩形をステップＳ２４でフィールドセパレータ領域
と判定し、その判定結果を図２のフィールドセパレータ
位置メモリ９へ格納する。In step S23, the input circumscribed rectangle has a vertical HI (= YEi-YSi + 1) and a horizontal WD (= XE
It is compared whether the aspect ratio WD / HI with (i-XSi + 1) is larger than a fixed threshold TH1. If the aspect ratio WD / HI of the circumscribed rectangle is larger than the fixed threshold value TH1, the circumscribed rectangle is determined as the field separator area in step S24, and the determination result is stored in the field separator position memory 9 in FIG.

【００３７】外接矩形の縦横比ＷＤ／ＨＩが固定閾値Ｔ
Ｈ１よりも小さければ、文字・写真・図表候補領域とす
る。この文字・写真・図表候補領域については、ステッ
プＳ２５で、その外接矩形領域の面積ＷＤ×ＨＩを求
め、該面積ＷＤ×ＨＩがある固定閾値ＴＨ２よりも大き
いか否かを比較する。大きければ、写真・図表候補領域
と判定し、ステップＳ２６へ進む。小さければ、ステッ
プＳ３１で文字領域と判定し、図２の文字統合手段１２
を介して文字領域位置メモリ１３に格納する。The aspect ratio WD / HI of the circumscribed rectangle is a fixed threshold T
If it is smaller than H1, it is set as a character / photo / diagram candidate area. For this character / photo / diagram candidate area, in step S25, the area WD × HI of the circumscribed rectangular area is obtained, and it is compared whether the area WD × HI is larger than a certain fixed threshold TH2. If it is larger, it is determined to be a photograph / chart candidate area, and the process proceeds to step S26. If it is smaller, it is determined in step S31 that the area is a character area.
Through the character area position memory 13.

【００３８】ステップＳ２５で写真・図表候補領域と判
定された場合には、ステップＳ２６により、ステップＳ
２１で求めた縮小画像に該当する外接矩形の始点座標
（ＸＳｉ／ｎ，ＹＳｉ／ｎ）及び終点座標（ＸＥｉ／
ｎ，ＹＥｉ／ｎ）を算出した後、ステップＳ２７で、該
外接矩形内について縮小画像を横方向Ｘ及び縦方向Ｙに
走査し、該外接矩形内部に占める全黒画素数ＴＯをカウ
ントする。そして、全黒画素数ＴＯが外接矩形の面積に
占める割合を計算し、その割合がある固定閾値ＴＨ３よ
りも大きいか否かを比較する。If it is determined in step S25 that the area is a photograph / chart candidate area, the process proceeds to step S26 where step S26 is performed.
The start point coordinates (XSi / n, YSi / n) and end point coordinates (XEi /
After calculating (n, YEi / n), in step S27, the reduced image is scanned in the horizontal direction X and the vertical direction Y within the circumscribed rectangle, and the total number of black pixels TO occupying the inside of the circumscribed rectangle is counted. Then, the ratio of the total number of black pixels TO to the area of the circumscribed rectangle is calculated, and it is compared whether or not the ratio is larger than a certain fixed threshold TH3.

【００３９】割合が固定閾値ＴＨ３よりも大きければ、
ステップＳ２９で、外接矩形領域を写真領域と判定し、
その判定結果を図２の写真領域位置メモリ１０に格納す
る。割合が固定閾値ＴＨ３よりも小さければ、ステップ
Ｓ３０で、外接矩形領域を図表領域と判定し、その判定
結果を図２の図表領域位置メモリ１１へ格納する。これ
により、属性判別処理を終了する。If the ratio is greater than the fixed threshold TH3,
In step S29, the circumscribed rectangular area is determined to be a photograph area,
The determination result is stored in the photograph area position memory 10 in FIG. If the ratio is smaller than the fixed threshold TH3, in step S30, the circumscribed rectangular area is determined to be a chart area, and the determination result is stored in the chart area position memory 11 of FIG. Thus, the attribute determination processing ends.

【００４０】以上のように、この第１の実施例では、図
１のステップＳ２１で、２値画像データを全面縮小し、
ステップＳ２７で、縮小画像を走査する等の処理によ
り、写真領域と図表領域とを判別するようにしている。
そのため、走査を行う処理も縮小画像に対して行い、し
かもその全面走査する回数も１回でよいため、写真領域
と図表領域との判別が簡単になり、その上、その属性の
判別の処理時間を大幅に短縮できるため、高速処理が可
能となる。As described above, in the first embodiment, the entire binary image data is reduced in step S21 in FIG.
In step S27, a photograph area and a chart area are determined by processing such as scanning a reduced image.
Therefore, the scanning process is performed on the reduced image, and the number of times the entire surface needs to be scanned is only one. Therefore, the discrimination between the photograph area and the chart area is simplified, and the processing time for discriminating the attribute is also increased. Can be greatly reduced, and high-speed processing can be performed.

【００４１】図３は、本発明の第２の実施例を示す属性
判別処理のフローチャートであり、図１のフローチャー
トと共通の要素には共通の符号が付されている。FIG. 3 is a flowchart of an attribute discriminating process according to a second embodiment of the present invention. Elements common to those in the flowchart of FIG. 1 are denoted by the same reference numerals.

【００４２】この属性判別処理のフローチャートでは、
画像の縮小領域を変更した場合の領域判別の処理内容が
示されている。In the flowchart of the attribute determination process,
The processing content of the area determination when the reduced area of the image is changed is shown.

【００４３】即ち、この属性判別処理では、図１のステ
ップＳ２１を省略し、ステップＳ２０で２値画像データ
を入力し、その２値画像データの縮小を行わずに、ステ
ップＳ２２で外接矩形を入力してステップＳ２３で、フ
ィールドセパレータ領域と文字・写真・図表候補領域と
の分類を行っている。That is, in this attribute discriminating process, step S21 in FIG. 1 is omitted, binary image data is input in step S20, and a circumscribed rectangle is input in step S22 without reducing the binary image data. Then, in step S23, the field separator area and the character / photograph / chart candidate area are classified.

【００４４】そして、ステップＳ２５で、文字・写真・
図表候補領域について外接矩形領域の面積を求め、写真
・図表候補領域と文字領域との分類を行っている。写真
・図表候補領域については、その外接矩形の始点座標
（ＸＳｉ，ＹＳｉ）及び終点座標（ＸＥｉ，ＹＥｉ）に
おける内部の２値画像を１／ｎ縮小する。次に、ステッ
プＳ２６で、図１と同様に、縮小画像に該当する外接矩
形の始点座標（ＸＳｉ／ｎ，ＹＳｉ／ｎ）及び終点座標
（ＸＥｉ／ｎ，ＹＥｉ／ｎ）を算出した後、ステップＳ
２７〜ステップＳ３０により、写真領域と図表領域とを
判別している。Then, in step S25, characters, photos,
The area of the circumscribed rectangular area is determined for the chart candidate area, and the photograph / chart candidate area and the character area are classified. For the photograph / chart candidate area, the internal binary image at the start point coordinates (XSi, YSi) and end point coordinates (XEi, YEi) of the circumscribed rectangle is reduced by 1 / n. Next, in step S26, as in FIG. 1, the start point coordinates (XSi / n, YSi / n) and the end point coordinates (XEi / n, YEi / n) of the circumscribed rectangle corresponding to the reduced image are calculated. S
Through the steps 27 to S30, the photograph area and the chart area are determined.

【００４５】この第２の実施例では、縮小画像を使用す
るのが、写真領域と図表領域を判別するときのみで、フ
ィールドセパレータ領域及び文字領域の判別のときには
使用せず、画像の縮小領域を図表・写真候補領域に限定
している。これにより、第１の実施例に比べて縮小処理
量を低減でき、さらに高速な属性判別処理が可能とな
る。In the second embodiment, the reduced image is used only when discriminating between the photograph area and the chart area, and is not used when discriminating the field separator area and the character area. It is limited to the chart / photo candidate area. As a result, the amount of reduction processing can be reduced as compared with the first embodiment, and higher-speed attribute discrimination processing can be performed.

【００４６】なお、本発明は上記実施例に限定されず、
種々の変形が可能である。その変形例としては、例えば
次のようなものがある。The present invention is not limited to the above embodiment,
Various modifications are possible. For example, there are the following modifications.

【００４７】（ｉ）図４（ａ），（ｂ）は２値化処理と
その縮小画像との関係の一例を示す図であり、同図
（ａ）は疑似中間調表現の２値画像データの縮小画像を
示し、同図（ｂ）は２値画像データが網点のときの縮小
画像を示す図である。(I) FIGS. 4 (a) and 4 (b) are diagrams showing an example of the relationship between the binarization processing and the reduced image, and FIG. 4 (a) shows the binary image data in pseudo halftone expression. FIG. 3B is a diagram showing a reduced image when the binary image data is a halftone dot.

【００４８】この図から明らかなように、写真領域は画
素間のＯＲを取って縮小することにより、２値化処理が
疑似中間調・網点表現等であっても、ほとんどの領域が
黒画素で占められることになる。従って、上記実施例で
は、図２の最適２値化手段３で多値画像データを最適２
値化しているが、これに変えて単純２値、もしくはディ
ザ処理のみで２値化した画像を使用しても、写真領域と
図表領域との判別が可能である。このように単純２値も
しくはディザ処理のみで２値化した画像を使用すること
により、上記実施例よりも、さらに処理の高速化を図る
ことができる。（ii）第１及び第２の実施例の属性判別方法を実行する
図２の属性判別装置は、他の構成に変更してもよい。例
えば、フィールドセパレータ領域、写真領域、及び図表
領域と判定された外接矩形の位置は、それぞれメモリ
９，１０，１１に保存され、その他の文字領域の外接矩
形位置は、外接矩形位置メモリ６にそのまま保存されて
いる。そのため、この外接矩形位置メモリ６で文字領域
位置メモリ１３の代わりを果たすことが可能であり、従
って該文字領域位置メモリ１３を省略することにより、
メモリの削減化が図れる。As is apparent from this figure, the photographic area is reduced by taking the OR between the pixels, so that even if the binarization processing is a pseudo-halftone / halftone expression, almost all areas are black pixels. Will be occupied by. Therefore, in the above embodiment, the multi-valued image data is optimally binarized by the optimal binarizing means 3 in FIG.
Although the image data is binarized, it is possible to discriminate between a photograph area and a chart area by using a simple binary image or an image binarized only by dither processing. As described above, by using an image binarized by simple binary or dither processing alone, the processing speed can be further increased as compared with the above embodiment. (Ii) The attribute discriminating apparatus of FIG. 2 that executes the attribute discriminating methods of the first and second embodiments may be changed to another configuration. For example, the positions of the circumscribed rectangles determined as the field separator area, the photograph area, and the chart area are stored in the memories 9, 10, and 11, respectively, and the circumscribed rectangle positions of the other character areas are stored in the circumscribed rectangle position memory 6 as they are. Has been saved. Therefore, the circumscribed rectangular position memory 6 can serve as the character area position memory 13, and thus, by omitting the character area position memory 13,
Memory can be reduced.

【００４９】[0049]

【発明の効果】以上詳細に説明したように、第１の発明
によれば、２値画像の全体を縮小して縮小画像を作成
し、その縮小画像を走査して該縮小画像における外接矩
形内に占める全黒画素数の割合によって写真領域と図表
領域とを判別するようにしている。そのため、縮小画像
を走査し、しかもその全面走査する回数も１回でよいた
め、写真領域と図表領域との区別が簡単に行え、しかも
属性判別の処理時間を大幅に短縮できる。As described in detail above, according to the first aspect, a reduced image is created by reducing the whole of a binary image, and the reduced image is scanned to define a circumscribed rectangle in the reduced image. The photograph area and the chart area are determined based on the ratio of the total number of black pixels to the image area. Therefore, since the reduced image is scanned and the entire surface is scanned only once, the photograph area and the chart area can be easily distinguished from each other, and the processing time for attribute determination can be greatly reduced.

【００５０】第２の発明によれば、図表・写真領域に限
定してその第１の外接矩形のみを縮小することにより、
縮小画像を作成するようにしているので、縮小処理等に
伴なう処理の簡単化と、それによる判別処理のより高速
化が図れる。According to the second aspect of the present invention, only the first circumscribed rectangle is reduced only in the chart / photograph area,
Since a reduced image is created, the processing involved in the reduction processing and the like can be simplified, and the speed of the discrimination processing can be further increased.

【００５１】第３の発明によれば、最適２値化した画像
を用いて属性の判別を行っているので、写真領域と図表
領域との判別を的確に行うことができる。According to the third aspect, since the attribute is determined using the optimal binarized image, it is possible to accurately determine the photographic area and the chart area.

【００５２】第４の発明によれば、単純２値もしくはデ
ィザ処理のみにより２値化した２値画像を用いて属性判
別を行うようにしているので、その判別処理のより簡単
化と高速化を図ることができる。According to the fourth aspect, attribute determination is performed using a binary image binarized by simple binary or dither processing alone, so that the determination processing is simplified and speeded up. Can be planned.

【００５３】第５の発明によれば、画素間のＯＲを取っ
て縮小画像を作成しているので、縮小画像の作成が簡単
になり、それによって属性判別処理をより高速化するこ
とができる。According to the fifth aspect, since the reduced image is created by ORing the pixels, the creation of the reduced image is simplified, and the attribute discrimination process can be further speeded up.

[Brief description of the drawings]

【図１】本発明の第１の実施例を示す属性判別処理のフ
ローチャートである。FIG. 1 is a flowchart of an attribute determination process according to a first embodiment of the present invention.

【図２】本発明の第１の実施例を示す属性判別装置の機
能ブロック図である。FIG. 2 is a functional block diagram of the attribute discriminating apparatus according to the first embodiment of the present invention.

【図３】本発明の第２の実施例を示す属性判別処理のフ
ローチャートである。FIG. 3 is a flowchart of an attribute determining process according to a second embodiment of the present invention.

【図４】図２における２値化処理とその縮小画像との関
係の一例を示す図である。FIG. 4 is a diagram illustrating an example of a relationship between a binarization process in FIG. 2 and a reduced image thereof.

[Explanation of symbols]

１読取部３最適２値化手段５外接矩形検出手段７属性判別手段８縮小２値画像メモリ REFERENCE SIGNS LIST 1 reading unit 3 optimal binarizing means 5 circumscribed rectangle detecting means 7 attribute discriminating means 8 reduced binary image memory

───────────────────────────────────────────────────── フロントページの続き (56)参考文献特開平３−126180（ＪＰ，Ａ) 特開平１−316883（ＪＰ，Ａ) 特開昭56−116182（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) H04N 1/40 - 1/409 G06T 1/00 ──────────────────────────────────────────────────続き Continuation of front page (56) References JP-A-3-126180 (JP, A) JP-A-1-316883 (JP, A) JP-A-56-116182 (JP, A) (58) Survey Field (Int.Cl. ⁷ , DB name) H04N 1/40-1/409 G06T 1/00

Claims

(57) [Claims]

1. A binary image is created by binarizing a document image, a circumscribed rectangle is formed in an information area of the binary image, a circumscribed rectangle is detected, and attributes of the photograph area and the chart area of the circumscribed rectangle are determined. In the attribute discriminating method, a reduced image is created by reducing the entirety of the binary image, and the first image of the photograph candidate area and the chart candidate area is determined from the features of the circumscribed rectangle and the features of the binary image within the circumscribed rectangle. From the first circumscribed rectangle, obtain a second circumscribed rectangle corresponding to the reduced image from the first circumscribed rectangle, scan the inside of the second circumscribed rectangle, and scan all the circumscribed rectangles within the second circumscribed rectangle. An attribute discriminating method comprising: counting the number of black pixels; and comparing a ratio of the total number of black pixels to the area of the second circumscribed rectangle with a threshold value to separate a photograph region and a chart region.

2. An attribute discriminating method for binarizing a document image to create a binary image, circumscribing an information area of the binary image to detect a circumscribed rectangle, and discriminating an attribute of the circumscribed rectangle. Based on the features of the circumscribed rectangle and the features of the binary image within the circumscribed rectangle, the first of the photograph candidate area and the chart candidate area
Is extracted, a reduced image is created by reducing only the first circumscribed rectangle, a second circumscribed rectangle corresponding to the reduced image is obtained, and the inside of the second circumscribed rectangle is scanned. Counting the number of all black pixels in the second circumscribed rectangle, and comparing the ratio of the total number of black pixels to the area of the second circumscribed rectangle with a threshold value to separate the photographic region and the chart region. An attribute determination method characterized by the following.

3. The attribute discriminating method according to claim 1, wherein the binary image is obtained by reading the document image in the form of multi-valued image data, and losing the multi-valued image data without losing its characteristics and properties. Attribute discrimination method obtained by optimal binarization.

4. The attribute discriminating method according to claim 1, wherein the binary image is obtained by binarizing the document image by simple binarization or dither processing alone.

5. The attribute determining method according to claim 1, wherein the reduced image is created by taking a logical sum between pixels.