JPH03126181A

JPH03126181A - Area dividing method for document image

Info

Publication number: JPH03126181A
Application number: JP1264649A
Authority: JP
Inventors: Kazuhiro Ishikawa; 和弘石川; Sadamasa Hirogaki; 広垣　節正; Yutaka Mazaki; 裕真崎; Shuichi Fujikura; 秀一藤倉
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1989-10-11
Filing date: 1989-10-11
Publication date: 1991-05-29

Abstract

PURPOSE:To improve attribute discrimination accuracy by repeating segmentation processing with a circumscribing rectangle detecting means, performing classification with an attribute discriminating means, and unifying a circumscribing rectangle to a block with a character unifying means. CONSTITUTION:The circumscribing rectangle detecting means 31, when optimally binarized image data is inputted, makes the information areas of a field separator, a photograph, a chart, and a character part, etc., in the data into the circumscribing rectangles, and detects the feature of the circumscribing rectangle and that of a binary image in the circumscribing rectangle. After that, the circumscribing rectangle discriminating means 33 discriminates the attribute of the field separator, the photograph, the chart, and the character part, etc., in the information area, and furthermore, automatically sets a unified threshold value from the image obtained by extracting only the character part out of the attribute corresponding to individual document image data. Following that, a character part unifying means 37 unifies the circumscribing rectangle of the character part to a character part block. Thereby, it is possible to obtain an area dividing method for document image with easy selection of the threshold value.

Description

【発明の詳細な説明】（産業上の利用分野）本発明は、ファクシミリやテレックス等の通信機器、画
１象データベース入力方法等に用いられるもので、画像
データ中に混在する文字、写真、図表等の情報領域を抽
出し、その領域の属性を１１１別する文書画像の領域分
割方法に関するものである。Detailed Description of the Invention (Industrial Application Field) The present invention is used for communication equipment such as facsimile and telex, and image database input method, etc. The present invention relates to a method for dividing a document image into regions by extracting information regions such as the following, and classifying the attributes of the regions into 111 categories.

（従来の技術）従来、このような分野の技術としては、■１．゛ｊ開昭
６２−７１３７９号公報、（を特開昭６１−１０３３７
２等に記載されるものがあった。以下、その構成を図を
用いて説明′ζる。(Prior art) Conventionally, technologies in this field include: ■1.゛j Publication No. 1987-71379, (Japanese Patent Publication No. 10337/1982
Some items were listed as 2nd prize. The configuration will be explained below using figures.

第２図は、前記文献■に記載された従来の文書画像の領
域分割方法を用いた画像処理装置の〜・構成例を示す構
成ブロック図である。この画像処理装置は、複数ライン
からなるラインメモリ１、レジスタと加算回路からなる
累加算回路２ａ、２ｂ。FIG. 2 is a block diagram illustrating an example of the configuration of an image processing apparatus using the conventional document image area division method described in the document (2). This image processing device includes a line memory 1 consisting of a plurality of lines, and cumulative addition circuits 2a and 2b consisting of registers and addition circuits.

２ｃ、予め定められた閾値と比較する判定回路３ａ、３
ｂ、３ｃ、３ｄ、３ｅ、３ｆ、複数段のシフトレジスタ
４、ゲート回１？！５、レジスタ回路と加算回路からな
る微分回Ｂ６ａ、６ｂ、ランレングス・カウンタ回路７
、ランレングスメモリ回路８、及び白ランレングス発生
数カラフト回８つで構成されている。この種の方法では
、画像データｇが入力されると、その画像データｇがラ
インメモリ１に記憶される。その後、累加算回路２ａに
より主走査方向に黒画素が計数される。その累加見回２
Ｍ２ａの出力が、判定回路３ａ、微分回路６ａ、及び判
定回路３ｂを順次経る過程で、計数閾値を越えるライン
が検出され、上記計数値が同値以下の白ラインが、所定
の個数連続する状態を判定して第１の領域切り出しが行
われる。2c, determination circuits 3a, 3 for comparison with a predetermined threshold value;
b, 3c, 3d, 3e, 3f, multi-stage shift register 4, gate time 1? ! 5. Differential circuit B6a, 6b consisting of a register circuit and an addition circuit, run-length counter circuit 7
, a run length memory circuit 8, and eight white run length generation number craft circuits. In this type of method, when image data g is input, the image data g is stored in the line memory 1. Thereafter, black pixels are counted in the main scanning direction by the cumulative addition circuit 2a. The cumulative viewing 2
In the process in which the output of M2a passes sequentially through the determination circuit 3a, the differentiation circuit 6a, and the determination circuit 3b, a line exceeding the count threshold is detected, and a state in which a predetermined number of consecutive white lines with the count value equal to or less than the above continues. After the determination, first region cutting is performed.

次に、第１の領域切り出し６において、判定回路３ｂの
出力がゲート回路５、シフトレジスタ４累加算回路２ｂ
、判定回Ｗ＠　３　ｃ　、微分回路６ｂ、及び判定回路
３ｄを順次経、るに従い、副走査方向に黒画素が計数さ
れ、その計数値が閾値を越える列が検出され、上記計数
値か閾値以下の自利が、所定の個数連続する状態を判定
して第２の領域切り出しが行われる。Next, in the first region extraction 6, the output of the determination circuit 3b is transferred to the gate circuit 5, the shift register 4, the cumulative addition circuit 2b
, the judgment circuit W@3c, the differentiation circuit 6b, and the judgment circuit 3d are sequentially passed through, black pixels are counted in the sub-scanning direction, a column whose count value exceeds the threshold value is detected, and the above count value or the threshold value is detected. The second area extraction is performed by determining a state in which a predetermined number of the following self-interests are continuous.

さらに、第２の領域切り出しと同様の処理により第３の
領域切り出しを行い、続いて第３の領域切り出しと同様
の処理により第４の領域切り出しを行う。そこで、第４
の領域切り出しで検出された領域について、その領域の
ランレングス情報および黒画素率情報により、文字部、
写真部、図表部の領域の属性を判別する方法°ζあった
。Furthermore, a third region is cut out using the same process as the second region cutout, and then a fourth region is cut out using the same process as the third region cutout. Therefore, the fourth
For the area detected by area extraction, the character part,
There was a method to determine the attributes of the photo and diagram areas.

（発明が解決しようとする課題）しかしながら、上記構成の文書画像の領域分割方法では
、次のような課題があった。(Problems to be Solved by the Invention) However, the method for dividing document images into regions having the above configuration has the following problems.

領域を分割する際に、総合間値ＴＨＸを設定するパラメ
ータを任意に設定する必要があり、例えば、論文誌等の
個々の文書画像データに対して領域分割処理を行う場合
、そのパラメータを設定し直さなければならなかった。When dividing a region, it is necessary to arbitrarily set parameters for setting the overall mean value THX. For example, when performing region division processing on individual document image data such as academic journals, it is necessary to set the parameters. I had to fix it.

さらに、個々の論文誌等に応じた適当なパラメータを選
定する際に、手間が掛かるという問題があった。Furthermore, there is a problem in that it takes time and effort to select appropriate parameters according to individual journals, etc.

本発明は、前記従来技術が持っていた課題として、領域
を分割する際に、手間が掛かるという点について解決し
た文書画像の領域分割方法を提供するものである。The present invention provides a method for dividing document images into regions that solves the problem of the prior art, which is that it takes time and effort to divide regions.

（課題を解決するための手段）本発明によれば、前記課題を解決するために、入力され
た画ｆ象データの文字部を含む情報領域を切り出し、そ
の情報領域の属性判別処理を行う文書画像の領域分割方
法において、前記画像データに対して最適２敏化されな
２値画渫データを外接矩形検出手段に入力し、その外接
矩形検出手段により、該２値画像データ中の前記情報領
域を外接矩形化して外接矩形の特徴とその外接矩形内の
２値画像の特徴とを検出し、属性判別手段により、前記
外接矩形の特徴及びその外接矩形内の２値画像の特徴か
ら前記情報領域の属性を判別し、　前記属性判別手段に
よって前記属性の内、前記文字部と判別された外接矩形
を、文学部統合手段を用いて文字部ブロックに統合する
ものである。　さらに、前記文学部統合手段は、前記２
値画像データから前記文字部のみを抽出して得られた文
字部画像に対して周辺ヒストグラムを作成し、その周辺
ヒストグラムを用いて前記文字部の外接矩形を前記文字
部ブロックに統合するものである。前記文学部統合手段
は、前記周辺ヒストグラムの作成結果から、横方向に対
して前記黒画素が全く存在しない白ラインが連続する白
ライン領域及び前記黒画素が少なくとも１つ存在する黒
ラインが連続する黒画素ライン領域と、縦方向に対して
前記黒画素が全く存在しない白カラムが連続する白カラ
ム領域及び前記黒画素が少なくとも１つ存在する黒カラ
ムが連続する黒画素カラム領域とを、それぞれ抽出し、
その抽出結果から横書き・縦書きの判定を行い、その判
定結果に基づき横方向・縦方向の統合閾値を求め、前記
文字部の外接矩形を統合閾値に基づいて統合処理を行い
、その結果を文字部領域として抽出するものである。(Means for Solving the Problems) According to the present invention, in order to solve the above problems, an information region including a character part of input image data is cut out, and an attribute determination process of the information region is performed on a document. In the method for dividing an image into regions, binary stroke data that has been optimally scaled to the image data is input to a circumscribing rectangle detecting means, and the circumscribing rectangle detecting means detects the information area in the binary image data. is converted into a circumscribing rectangle, the features of the circumscribing rectangle and the features of the binary image within the circumscribing rectangle are detected, and the attribute determining means determines the information area from the features of the circumscribing rectangle and the features of the binary image within the circumscribing rectangle. The attributes of the character portion are determined by the attribute determining means, and the circumscribed rectangles determined to be the character portion among the attributes by the attribute determining means are integrated into the character portion block using the literature uniting means. Furthermore, the literature department integration means
A surrounding histogram is created for a text image obtained by extracting only the text from value image data, and the circumscribing rectangle of the text is integrated into the text block using the surrounding histogram. . The literature integration means determines from the creation result of the peripheral histogram that there is a white line area in which there are continuous white lines in which the black pixel does not exist in the horizontal direction, and a black line area in which there are continuous black lines in which at least one black pixel exists in the horizontal direction. A pixel line region, a white column region in which there are consecutive white columns in which no black pixels exist in the vertical direction, and a black pixel column region in which there are consecutive black columns in which at least one black pixel exists are extracted, respectively. ,
From the extraction results, horizontal writing/vertical writing is determined, horizontal/vertical integration thresholds are determined based on the determination results, the circumscribed rectangle of the character part is integrated based on the integration threshold, and the result is This is extracted as a partial area.

また、前記縦書き・横書きの判定は、前記周辺ヒストグ
ラムの作成結果から横方向に前記黒画素が全く存在しな
い白ラインが連続する白ライン領域と、縦方向に前記黒
画素が全く存在しない自刃ラムが連続する白カラム領域
との数の大小に基づき行うものである。Further, the determination of vertical writing/horizontal writing is based on the creation result of the peripheral histogram. This is done based on the number of consecutive white column areas.

そして、前記縦方向・横方向の統合閾値は、横書きと判
定された場合は前記白ライン領域幅と前記黒ライン領域
幅とのヒストグラムから５、また樅書きと判定された場
合は前記白カラム領域幅と前記環カラム領域幅とのヒス
トグラムからそれぞれ決定するものである。Then, the integrated threshold in the vertical and horizontal directions is set to 5 from the histogram of the white line area width and the black line area width when horizontal writing is determined, and when it is determined as fir writing, the white column area Each width is determined from a histogram of the width and the width of the ring column region.

そして、前記縦方向・横方向の統合閾値は、前記白ライ
ンまたは白カラム領域幅のヒス１〜グラムに基づき最も
個数の多い前記白ラインまたは自刃ラム領域から幅の大
きくなる方向に走査して１ｖ４数が０となる前記白ライ
ンまたは白カラム領域幅を検出した後、その直前の前記
白ラインまたは自刃ラム領域を値ＴＷとして抽出し、さ
らに前記黒ラインまたは環カラム領域幅のヒストグラム
に対して値ＴＢを抽出し、その値１゛Ｗと値ＴＢとに基
づき、横書きの場合、横方向の総合間値ＴＨＸ＝ＴＷ＋
ＴＢ、［方向の総合間値ＴＨＹ＝ＴＷ＋Ｅｒ（但し、Ｅ
ｒ；傾き誤差分のライン数）に設定し、縦書きの場合、
横方向の総合間１ｔ！ＴＨＸ＝ＴＷ＋Ｅｒ縦方向の総合
間値Ｔ）（Ｙ＝ＴＷ＋ＴＢに設定するものである。The integrated threshold in the vertical and horizontal directions is calculated by scanning from the white line or self-cutting ram area with the largest number in the direction of increasing width based on the histogram of the width of the white line or white column area. After detecting the width of the white line or white column area where the number is 0, extract the white line or self-cutting ram area immediately before that as a value TW, and then calculate the value for the histogram of the width of the black line or ring column area. TB is extracted, and based on the value 1゛W and the value TB, in the case of horizontal writing, the total horizontal distance value THX = TW +
TB, [total value between directions THY=TW+Er (however, E
r: the number of lines for the tilt error), and in the case of vertical writing,
Total horizontal distance 1t! THX=TW+Er (comprehensive longitudinal value T) (Y=TW+TB).

さらに、前記統合処理として、２つ注目する前記外接矩
形の領域座標（ＸＳＩ、ＹＳＩ）（Ｘｅ１、Ｙｅｔ）、
（ＸＳ２．ＹＳ２＞（Ｘｅ２．Ｙｅ２）　（但し、ＸＳ
１．、ＸＳ２．Ｙ］、ＹＳ２　；横方向、縦方向の領域
の始点座標、Ｘｅ１、Ｘ←２、Ｙｅ　１．Ｙｅ２　；横
方向、縦方向の領域の終点座標）について、次の統合条
件式（１）〜（４）に基づき、前記文字部と判別された
外接矩形を順次、１つにまとめ、極限まで統合を繰り返
した後、その結果を前記文字部プロ・・／りとして抽出
するものである。Furthermore, as the integration process, two area coordinates (XSI, YSI) (Xe1, Yet) of the circumscribed rectangle of interest,
(XS2.YS2>(Xe2.Ye2) (However, XS
1. , XS2. Y], YS2; Starting point coordinates of the area in the horizontal and vertical directions, Xe1, X←2, Ye 1. Ye2; End point coordinates of the horizontal and vertical regions), the circumscribed rectangles determined to be the character portion are successively combined into one, based on the following integration conditional expressions (1) to (4), and integrated to the limit. After repeating this, the result is extracted as the character part pro.../.

統合条件式％式％（但し、ＴＨＸ、横方向の統合閾値）（２）ＹＳ２−Ｙｅ　ｌ≦Ｔ　Ｉ（’ｌ’（但し、ＴＨ
Ｘ；横方向の統合閾値）（３）ＸＳＬ−Ｘｅ２≦ＴＨＸ（２）ＹＳ２≦ＴＨＹ（但し、ＴＨＹ；縦向の統合閾値）（３）ＸＳ　ｌ　、
　ＹＳ２−Ｙｅ　ｌＸ５ｌ−Ｘｅ２．ＹＳｌ−Ｙｅｌの
値がマイナスとなった時は条件を満たす）（ｆ′１三用
）本発明は、以上のように文書画像の領域分割方法を構成
したので、外接矩形検出手段は、文書画像データに対し
て、その文書画像データの性雷等を失なわいように最適
２値化された２ｆｔ！画像データを入力する。その外接
矩形検出手段により、該２象画像データ中のフィールド
セパレータ、写真、図表、および文字部等の情報領域を
外接矩形化して外接矩形の特徴とその外接矩形内の２象
画１象の特徴とを検出する。さらに、属性判別手段によ
り、外接矩形の特徴と、その外接矩形内の２値画像の特
徴とから前記情報領域のフィールドセパレータ、写真、
図表、および文字部等の属性を判別する。Integrated conditional expression % expression % (however, THX, horizontal integrated threshold) (2) YS2-Ye l≦TI
X; horizontal integration threshold) (3) XSL-Xe2≦THX (2) YS2≦THY (however, THY: vertical integration threshold) (3) XS l ,
YS2-Ye lX5l-Xe2. When the value of YSl-Yel is negative, the condition is satisfied) (for f'13) Since the present invention has configured the document image area division method as described above, the circumscribing rectangle detection means can The data is optimally binarized to 2ft to avoid losing the characteristics of the document image data! Enter image data. The circumscribing rectangle detecting means converts information areas such as field separators, photographs, charts, and character parts in the two-image image data into a circumscribing rectangle, and converts the characteristics of the circumscribed rectangle and the characteristics of the two quadrants and one image within the circumscribed rectangle. and detect. Furthermore, the attribute determining means determines the field separator of the information area, the photo,
Determine the attributes of figures, tables, text, etc.

そして、その属性判別手段によって前記属性の内、文字
部のみを抽出して得られた画像から領域な分割する際の
パラメータに代わる閾値を個々の文書画像データに応じ
て自動的に設定し、前記文字部の外接矩形を、文学部統
合手段を用いて文字部ブロックに統合する。Then, the attribute determining means automatically sets a threshold value in place of a parameter for dividing the image into regions from the image obtained by extracting only the character part from among the attributes, according to each document image data, and The circumscribed rectangle of the text section is integrated into the text section block using the literature section integration means.

したがって、前記課圧を解決て゛きるのである。Therefore, the above-mentioned imposed pressure can be solved.

（実施例）第１図は、本発明の実施例を示す文書画１象の領域分割
方法の機能ブロック図゛ζある。(Embodiment) FIG. 1 is a functional block diagram of a method for dividing a document image into regions according to an embodiment of the present invention.

この文書画像の領域分割方法は、多値ＷＪ１憤を最Ｊ２
値１ヒして２値画像を得る最適２値化処理手段２０と、
前記２値画像を入力して領域の抽出、属性の判別を行う
領域抽出・判別１匹工１手段３０とを備え、ディジタル
・プロセッサ７ｒで゛構成さｉｔている。最適２値化処
理手段２０は、ＣＣＤ（ＣＩ］ａｒｇｅ　　Ｃｏｕｐｌ
ｅｔＪ　　Ｄｅｖｉｃｅ）’＄のイメージセンサを有し
、そのイメージセンサにより情報媒１本を走査して多値
側１噴を１））る読取り部２１を備えている。この読取
り部２１の出力１則にはコ売み収った多値画像を格納す
るＲ　Ａ　Ｍ等の多１直画像メモリ２２と、その多１直
画１象メモリ、２２内の多値画像を最適２値化してその
特徴、性質を失うことなく、ある程度の画質を持った２
値画像に変換する最適２値化手段２３とが、接続されて
いる。さらに最適２値化手段２３の出力側には、その最
適２値化手段２３で変換された２値画像を格納するＲＡ
Ｍ等の２値画像メモリ２４が接続されている。This method of dividing the area of a document image is to
Optimal binarization processing means 20 for obtaining a binary image by increasing the value by 1;
It is comprised of a digital processor 7r, and includes an area extraction/discrimination unit 30 for inputting the binary image and extracting an area and determining attributes. The optimal binarization processing means 20 is a CCD (CI) large couple.
The apparatus is equipped with a reading unit 21 which has an image sensor of $ 1), and scans one information medium with the image sensor to detect one injection on the multi-value side. The first output of this reading unit 21 includes a multi-level image memory 22 such as RAM that stores the multi-valued images that have been sold, and a multi-level image memory 22 that stores the multi-valued images. Optimally binarizes the image to create two images with a certain level of image quality without losing their characteristics and properties.
Optimal binarization means 23 for converting into a value image is connected. Furthermore, on the output side of the optimal binarization means 23, there is an RA that stores the binary image converted by the optimal binarization means 23.
A binary image memory 24 such as M is connected.

２値画像メモリ２４に格納された２値画像を走査する領
域抽出・判別処理手段３０は、最適２値化して得られた
２値画像を入力し、その２値画像の情報領域（例えば、
黒画素領域）を外接矩形化して抽出する外接矩形検出手
段３１を備えている。The area extraction/discrimination processing means 30 that scans the binary image stored in the binary image memory 24 inputs the binary image obtained by optimal binarization, and extracts the information area of the binary image (for example,
A circumscribing rectangle detecting means 31 is provided for converting a black pixel area into a circumscribing rectangle and extracting the circumscribed rectangle.

外接矩形検出手段３１の出力側には、その外接矩形検出
手段３１で検出された外接矩形の位置を保存するＲＡＭ
等の外接矩形位置メモリ３２と、前記２値画像の特徴と
その矩形内の特徴から、外接矩形を図表、写真、フィー
ルドセパレータ、その他（文字）に分類する属性判別手
段３３とが接続されている。The output side of the circumscribed rectangle detection means 31 includes a RAM for storing the position of the circumscribed rectangle detected by the circumscribed rectangle detection means 31.
A circumscribed rectangle position memory 32 such as the above is connected to an attribute discrimination means 33 for classifying the circumscribed rectangle into diagrams, photographs, field separators, and others (characters) based on the features of the binary image and the features within the rectangle. .

属性判別手段３３の出力側には、図表、写真、フィール
ドセパレータ、および文字の属性判定別に外接矩形の位
置をそれぞれ保存する図表位置メモリ３４、写真位置メ
モリ３５、フィールドセパレータ３６および文学部統合
手段３７が順次接続されている。On the output side of the attribute determination means 33, there are a diagram position memory 34, a photograph position memory 35, a field separator 36, and a literature department integration means 37 that store the positions of circumscribed rectangles for each attribute determination of diagrams, photographs, field separators, and characters. connected sequentially.

文学部統合手段３７は、属性１゛！ｊ別手段３３で文字
部と判別された外接矩形を文字部ブロックに統合を行い
、その結果を文字部領域として抽出する処理を行う手段
であり、その文学部統合手段３７の出力側には文字部位
ヱ：メモリ３８が接続さｈている。Faculty of Literature integration means 37 has attribute 1゛! This means integrates the circumscribed rectangle determined to be a character part by the separate means 33 into a character part block, and extracts the result as a character part area. E: The memory 38 is connected.

以上のように構成される之書画像の領域処理方法を、第
３図〜第２０図を参照しつつ説明する。The area processing method for the book image configured as described above will be explained with reference to FIGS. 3 to 20.

（Ｉ）最適２１直化処理手段２０の動作読取り部２１は
、情報媒体を走査して多値画イ憤を出力側、それを多値
側１象メモリ２２に格納する。(I) The operation reading unit 21 of the optimum 21-value conversion processing means 20 scans the information medium and outputs a multi-value image image, and stores it in the multi-value side one-image memory 22.

この読取り部２１での読取り解１象度は、細がい程よく
、８本／ｍｍ以上の解像度が適当である。最適２値化手
段２３は、多値側１象メモリ２２内の多値画像データを
読み込み、その多値画像データにおいて局所領域での濃
度差分値の絶対値の大きさがら、中間調画像部分と非中
間調画像部分とに分類する。この最適２値化手段２３で
は、非中間調画像部分と判別した場合は、単純に固定閾
値で２値化（単純２値化）を行い、一方、中間調画像部
分と判別した場合は、デイザ法で２値化を行い、２値画
像を生成する。これらの最適２値化処理は、例えば、前
記文献■に記載された方法等で実行すればよい。The reading resolution of the reading unit 21 is moderately thin, and a resolution of 8 lines/mm or more is appropriate. The optimal binarization means 23 reads the multi-value image data in the multi-value side one-image memory 22, and determines the magnitude of the absolute value of the density difference value in the local area in the multi-value image data, and distinguishes between the halftone image part and the non-halftone image part. It is classified into halftone image parts. In this optimal binarization means 23, if it is determined that it is a non-halftone image part, it is simply binarized using a fixed threshold value (simple binarization), whereas if it is determined that it is a halftone image part, it is diluted. Binarization is performed using the method to generate a binary image. These optimal binarization processes may be performed, for example, by the method described in the above-mentioned document (2).

以上の最適２値ｊヒ処理により、文字、図表といっな非
中間調画像部分は、白と黒の堺のエツジ部分のくっきり
とした画像となり、一方、写真等の中間調画像部分は、
新聞等で見られるデイザ法となり、これらが合成されで
ある程度の画質の２値画渫が得られる。この２値画像は
、２値画像メモリ２４に格納される。Through the above optimal binary processing, non-halftone image parts such as characters and charts become images with sharp edges of black and white, while halftone image parts such as photographs, etc.
This is the dither method seen in newspapers, etc., and by combining these methods, a binary drawing with a certain level of image quality can be obtained. This binary image is stored in the binary image memory 24.

（ＩＩ）外接矩形検出手段３１の動作最適２鎮化手段２３によって得られた２値画像メモリ２
４中の２値画像データの主査方向をＸ軸方向とし、副走
査方向をＹ軸方向とする。(II) Optimal operation of circumscribed rectangle detection means 31 2 Binary image memory 2 obtained by suppression means 23
The main scanning direction of the binary image data in 4 is the X-axis direction, and the sub-scanning direction is the Y-axis direction.

以下、第３図および第４図を参考し、外接矩形検出手段
３１の動作をｌ（也明する。The operation of the circumscribed rectangle detection means 31 will be described below with reference to FIGS. 3 and 4.

第３図は第１図中の外接矩形検出手段３１の動作例を示
すフローチャート、および第４図は、外接矩形検出手段
３１を説明するし１である。なお第３図中の３５０〜Ｓ
６７は処理ステップを示し、第３図および第４図中のｉ
は階層、ｍはＸ方向に切り出された領域番号、ｎはＹＪ
Ｊ向に切り出された領域ばんごうである。また、第４図
中の４０は背景パターン、４１は情報領域（黒画素領域
）である。FIG. 3 is a flowchart showing an example of the operation of the circumscribed rectangle detection means 31 in FIG. 1, and FIG. 4 is a flowchart illustrating the circumscribed rectangle detection means 31. In addition, 350~S in Figure 3
67 indicates a processing step, i in FIGS. 3 and 4.
is the hierarchy, m is the area number cut out in the X direction, and n is YJ
This is an area cut out in the J direction. Further, 40 in FIG. 4 is a background pattern, and 41 is an information area (black pixel area).

先ず、第３図の３５０〜Ｓ５２において、ｉ。First, at 350 to S52 in FIG. 3, i.

ｎ、ｍをそれぞれ初期値１に初期化する。そしζ、領域
の始点座標（ＸＳ　（ｉ−１，ｍ）　、　ＹＳ　（ｉ−
１，ｎ）１．終点座標（Ｘｅ　（ｉ−Ｌ　ｍ）。Initialize n and m to the initial value 1, respectively. Then ζ, the starting point coordinates of the area (XS (i-1, m), YS (i-
1,n)1. End point coordinates (Xe (i-L m).

Ｙｅ　（ｉ−１，ｎ）ｌに対して、Ｙ軸方向の領域をｎ
個の領域に切り出す処ｎ１を行う（３５３）。For Ye (i-1, n)l, the area in the Y-axis direction is n
The process n1 of cutting out the area into several areas is performed (353).

この切り出し処理について、第５図のＹ方向の切り出し
処理を説明する図を用いて説明する。始点座標（ＸＳ　
（ｉ−１，ｒｔｔ）　、　ＹＳ　（ｉ−１，ｎ）　）、
終点座標（Ｘｅ　（ｉ−１，ｍ）、Ｙｅ　（ｉ−１゜ｎ
＞）で示される背景パターン４０上の情報領域４１に対
して、Ｘ方向に黒画素が１つでも存在する黒ラインを検
出する。その黒ラインのＹ座標値を始点Ｙ座標として抽
出し、次いでＸ方向に黒画素が１つも存在しない白ライ
ンが連続して閾値ＴＨを越える白ラインを検出する。そ
の日ラインの（ＴＨ＋ｌ）前の黒ラインのＹ座標値を終
点Ｙ座標値として抽出する。ここで、中間調濃度情報が
、例えば１画素当たり４ビツト（１６階調）に量子化さ
れている場合、解像度の点から、閾値ＴＨの値は１〜４
画像の範囲で設定することが適当である。This cutting process will be explained using a diagram illustrating the cutting process in the Y direction in FIG. 5. Starting point coordinates (XS
(i-1, rtt), YS (i-1, n)),
End point coordinates (Xe (i-1, m), Ye (i-1゜n
A black line in which at least one black pixel exists in the X direction is detected for the information area 41 on the background pattern 40 indicated by >). The Y coordinate value of the black line is extracted as the starting point Y coordinate, and then a white line in which there are no black pixels in the X direction consecutively exceeds the threshold value TH is detected. The Y-coordinate value of the black line (TH+l) before that day's line is extracted as the end point Y-coordinate value. Here, if the halftone density information is quantized to, for example, 4 bits per pixel (16 gradations), the value of the threshold TH is 1 to 4 from the viewpoint of resolution.
It is appropriate to set it within the range of the image.

以上の操作を領域（ＸＳ　（ｉ−１，ｍ）、ＹＳ（ｉ−
１，ｎ））、（Ｘｅ　（ｉ−１，ｍ）、Ｙｅ（ｉ−１，
ｎ））の全てのＹＦｉ、標について行い、ｎ閏の領域（
ＸＳ　（ｉ−１，ｍ＞、ＹＳ　（ｉ−１゜ｎ））、（Ｘ
ｅ　（ｉ−１，ｍ＞　、　Ｙｅ　（ｉ−１ｎ））（ＩＢ
し、ｎ；正の整数）を切り出す。このとき、領域（ＸＳ
　（０，ｍ）、ＹＳ　（０，ｎ））、（Ｘｅ　（０，ｍ
＞、Ｙｅ　（０，ｎ＞　）は、初期値として予めセット
シ゛ζおくが、その値は２値画像データの全領域を示す
。The above operations are performed in the area (XS (i-1, m), YS (i-1, m),
1, n)), (Xe (i-1, m), Ye(i-1,
n)) for all YFi, marks, and calculate the area of n leap (
XS (i-1, m>, YS (i-1゜n)), (X
e (i-1, m> , Ye (i-1n)) (IB
Then, cut out n (positive integer). At this time, the area (XS
(0, m), YS (0, n)), (Xe (0, m
>, Ye (0, n>) are set in advance as initial values, and the values indicate the entire area of the binary image data.

検出された連続する白ラインの数が閾値ＴＨを越えない
間に黒ラインを検出した場合は、上下の領域は同領域と
判断し、処理の継続を連続する白ラインが閾値を越える
まで繰り返し行う。また、黒ラインのＹ座標値が）’ｅ
　（ｉ−１，ｎ）と等しくなる場合は、Ｙｅ（ｉ　　１
．ｒ＋）を終点）′座標値として抽出し、Ｙ座標値がＹ
ｅ（ｉ−１，ｎ）となったら処理を終了する。If a black line is detected before the number of consecutive white lines detected exceeds the threshold TH, the upper and lower areas are determined to be the same area, and processing is repeated until the number of consecutive white lines exceeds the threshold. . Also, the Y coordinate value of the black line is )'e
(i-1,n), then Ye(i 1
．． r+) as the end point)' coordinate value, and the Y coordinate value is Y
When e(i-1, n) is reached, the process ends.

このようにし°ζ、領域（ＸＳ　（ｉ−１，ｍ）。In this way, °ζ, the area (XS (i-1, m).

ＹＳ　（ｉ−１、ｎ＞　）、（Ｘｅ　（ｉ　　１、　ｎ
＋）　。YS (i-1, n> ), (Xe (i 1, n
+).

Ｙｅ　（ｉ−Ｌ、　ｎ＞　）に対して、Ｙ方向に切り出
された上記のｎ個の領域について、そのｎ個の始点、終
点Ｙ座標を外接矩形位置メモリ３２に保存する（８５４
）。第４図に示すｉ＝１の時のｎの値が１．２．３で示
される）′方向に切り出された領域がその処理結果例を
示す。With respect to Ye (iL, n>), for the above n regions cut out in the Y direction, the Y coordinates of the n starting points and end points are stored in the circumscribed rectangle position memory 32 (854
). The area cut out in the )' direction, where the value of n when i=1 is 1.2.3 shown in FIG. 4, shows an example of the processing result.

次に、第３図の３５３でＹ方向に切り出されたｎ個の領
域について以下の処理を行う。先ず、初期値でｎの値を
１として初期化する（Ｓ５５）。Next, at 353 in FIG. 3, the following processing is performed on the n regions cut out in the Y direction. First, the value of n is initialized to 1 (S55).

そして領域始点座標（ＸＳ　（ｉ−１，ｍ＞、ＹＳ（ｉ
、ｎ））、終点座標（Ｘｅ　（ｉ−Ｌ、ｍ）。Then, the area starting point coordinates (XS (i-1, m>, YS (i
, n)), end point coordinates (Xe (i-L, m).

Ｙｅ　（ｉ、ｎ＞）に対して、Ｘ軸方向について領域を
複数個（ｍ個）の領域に切り出す処理を行う＜８５６）
。この切り出し処置について、第６図のＸ軸方向の切り
出し処置を説明する図を用いて説明する。For Ye (i, n>), perform processing to cut out the region into multiple (m) regions in the X-axis direction <856)
. This cutting-out procedure will be explained using a diagram illustrating the cutting-out procedure in the X-axis direction in FIG. 6.

始点座標及び終点座標で示される背景パターン４０上の
情報領域４１に対して、Ｙ方向に黒画素が１つでも存在
する黒カラムを検出し、その黒カラムのＸ座標値を始点
Ｘ座標として抽出する。次いで、Ｙ方向に黒画素が１つ
でも存在しない白カラムが連続して閾値ＴＨを越える白
カラムを検出し、その白カラムの（ＴＨ＋１”）前の黒
カラムのＸ座標値を終点Ｘ座標値として抽出する。ここ
で、閾１直ＴＨの値は１〜４画素の範囲で設定する。For the information area 41 on the background pattern 40 indicated by the start point coordinates and end point coordinates, detect a black column in which at least one black pixel exists in the Y direction, and extract the X coordinate value of the black column as the start point X coordinate. do. Next, a white column in which there is no black pixel in the Y direction exceeds the threshold value TH is detected, and the X coordinate value of the black column (TH+1") before that white column is set as the end point X coordinate value. Here, the value of the first threshold TH is set in the range of 1 to 4 pixels.

以上の操作を領域（ＸＳ　（ｉ−１，ｍ）、ＹＳ（ｉ、
ｎ））、（Ｘｅ　（ｉ−１，ｍ）、Ｙｅ　（ｉ。The above operations are performed in the area (XS (i-1, m), YS (i,
n)), (Xe (i-1, m), Ye (i.

ｎ））の全てのＸ座標について行い、ｍ個の領域（ＸＳ
　（ｔ、ｍ）、ＹＳ　（ｉ、ｒ＋））、（Ｘｅ（ｉ、ｍ
＞、Ｙｅ　（ｉ、ｎ＞）（但し、ｍ：正の整数）を切り
出す（８５６）。この時、検出された連続する白カラム
の数が１−１（直ＴＨを越えない間に黒カラムを検出し
た場合は、左右の領域は同領域と判断し、処理の継続を
連続する白カラムが閾値ＴＨを越えるまで繰り返し行う
。また、黒カラムのＸ座標値がＸｅ（ｉ−］、ｎ＞と等
しくなる場合は、Ｘｅ（ｉ−１，ｎ＞を終点Ｘ座標値と
して抽出し、Ｘ座標値がＹｅ　（ｉ−１，ｎ）となった
ら処理を終了する。n)) for all X coordinates of m regions (XS
(t, m), YS (i, r+)), (Xe(i, m
>, Ye (i, n>) (where m: a positive integer) is cut out (856). At this time, the number of consecutive white columns detected is 1-1 (if a black column is detected before the direct TH This is repeated until TH exceeds the threshold TH.Also, if the X coordinate value of the black column is equal to Xe(i-], n>, extract Xe(i-1, n> as the end point X coordinate value, When the coordinate value becomes Ye (i-1, n), the process ends.

このようにして、領域（ＸＳ（ｉ〜１．ｍ＞。In this way, the area (XS(i~1.m>).

ＹＳ　（ｉ、ｎ））、（Ｘｅ　（ｉ−１，ｍ＞、Ｙｅ（
ｉ、ｎ）ｌに対して、Ｘ方向に切り出された上記のｍ個
の領域について、そのｍｒｌｉの始点、終点Ｘ座標を外
接矩形位置メモリ３２に保存する（Ｓ３２）。第４図に
示すｉ＝１．ｒｔ＝１の時のｍの値が１．２で示される
Ｘ方向に切り出された領域がその処理結果例を示す。YS (i, n)), (Xe (i-1, m>, Ye(
With respect to i, n)l, the X coordinates of the starting point and ending point of mrli are stored in the circumscribed rectangle position memory 32 for the m regions cut out in the X direction (S32). i=1 shown in FIG. An example of the processing result is shown in an area cut out in the X direction where the value of m is 1.2 when rt=1.

以上の手順で処理を行い、階層ｉにおけるｍの値とｎの
値がそれぞれ１．１のみであった場合（３５８）　、領
域始点座標（ＸＳ　（ｉ、ｍ）、ＹＳ（ｉ、ｎ））、終
点座標（Ｘｅ　（ｔ、ｍ）、Ｙｅ　（ｉ、ｎ））を外接
矩形領域として抽出しく５５９）、その始点、終点座標
を外接矩形位置メモリ３２に保存する（Ｓ６０）。次い
で、この階層の値を減分しく５６１）、階層ｉ番目のす
べてのｎについて行ったか否かを判定する（Ｓ６２）。If the above procedure is performed and the values of m and n in layer i are each only 1.1 (358), area start point coordinates (XS (i, m), YS (i, n)) , the end point coordinates (Xe (t, m), Ye (i, n)) are extracted as a circumscribed rectangular area (559), and the start point and end point coordinates are stored in the circumscribed rectangle position memory 32 (S60). Next, the value of this layer is decremented (561), and it is determined whether this has been done for all n in the i-th layer (S62).

もし全てのｎについて行っていれば、次に階層ｉ番目の
全てのｍに′）いて行ったか否かを判定しく３６３）　
、もし全てのｍについて行っていれば、この第３図の全
ての処理を終了する（ｓ６４）。If it has been done for all n, then it is determined whether it has been done for all m in the i-th hierarchy (')363)
, if the process has been performed for all m, all the processes in FIG. 3 are completed (s64).

一方、３５８でノーの場合、階Ｎｉの値を増分しく５６
７）　、続いてＳ５２の処理に戻る。また、Ｓ６２でノ
ーの場合、ｎの値を増分しく５６５）、Ｓ５６の処理に
戻り、同様に３６３でノーの場合、ｍの値を減分しく５
６６）　、３５３に戻る。On the other hand, if the answer is 358, the value of the floor Ni is increased by 56.
7) Then, the process returns to S52. If the answer is NO in S62, the value of n is incremented by 565), and the process returns to S56. Similarly, if the answer is NO in 363, the value of m is incremented by 5.
66), return to 353.

以上が外接矩形検出手段の一連の処理であり、第４図に
示すような階層的手段を用いて処理を行う。要するに、
閾値ＴＨ以上の白ライン、白カラムに囲まれる領域を、
これ以りの切り出しはないという段階まで処理を繰り返
！−を行い、最終的に切り出された領域（２値画代デー
タの全領域）を外接矩形として検出し、その全ての外接
矩形の位置を示す始点座標、終点座標を外接矩形位置メ
モリ３２に保管するものである。The above is a series of processing by the circumscribed rectangle detection means, and the processing is performed using hierarchical means as shown in FIG. in short,
The area surrounded by the white line and white column above the threshold TH is
Repeat the process until there are no more cuts left! -, the finally cut out area (the entire area of the binary image margin data) is detected as a circumscribed rectangle, and the start point coordinates and end point coordinates indicating the positions of all the circumscribed rectangles are stored in the circumscribed rectangle position memory 32. It is something to do.

（Ｉ［Ｉ）属性判別手段３３の動乍属性判別手段３３は、外接矩形位置メモリ３２の内容を
読み出し、検出された外接矩形の全てについて、その外
接矩形特徴とその矩形内の特徴とからフィールドセパレ
ータ、写真、図表、及びその池（文字）の４つの領域に
分類する処理を行う。(I [I) The movement attribute discriminator 33 of the attribute discriminator 33 reads out the contents of the circumscribed rectangle position memory 32, and for all of the detected circumscribed rectangles, fields are determined from the circumscribed rectangle features and the features within the rectangle. A process is performed to classify the data into four areas: separators, photographs, diagrams, and letters.

この処理の一例を第７図に示す。第７図は、第１図の属
性判別手段３３の動乍フローチャートである。なお、全
ての外接矩形を（ＸＳｊ、ＹＳｊ）、（Ｘｅｊ、Ｙｅｊ
）（（［！Ｌ、ｊ；正の整数）で表す。An example of this process is shown in FIG. FIG. 7 is a flowchart of the operation of the attribute determining means 33 of FIG. 1. Note that all circumscribed rectangles are (XSj, YSj), (Xej, Yej
) (([!L, j; positive integer).

まず、ｊの値をそれぞれの初期値を１として初期化する
（Ｓ７０）。次いで、外接矩形（ＸＳｊ。First, the values of j are initialized by setting each initial value to 1 (S70). Next, the circumscribed rectangle (XSj.

ＹＳｊ）、（Ｘｅｊ、Ｙｅｊ）について、フィードセパ
レータを抽出する処理を行う（Ｓ７１）。YSj) and (Xej, Yej), a process of extracting the feed separator is performed (S71).

その外接矩形がフィードセパレータとし°ζ判断されな
い場合は、写真及び図表候補の抽出を行う（Ｓ７２）。If the circumscribed rectangle is not determined to be the feed separator, photograph and chart candidates are extracted (S72).

そこで、その外接矩形が写真及び図表候補と判定された
場合は、写真及び図表の判定を行う処理に入る（８７３
）。その結果、入力した外接矩形が写真であると判別さ
れた場合、その外接矩形の始点・終点座標を写真位置メ
モリ３５に保存する（Ｓ７４）。一方、図表であると判
別された外接矩形については、図表位置メモリ３４に保
存する（Ｓ７５）。さらに、すべての外接矩形（ＸＳｊ
、ＹＳｊ）、（Ｘｅｊ、Ｙｅｊ）について行ったか否か
を判定しく８７６）、全ての外接矩形について行ったと
判定された時、この処理は終了する（Ｓ７７）。Ｓ７６
でノーの場合は、ｊの値を増分しく５７８）　、次にＳ
’ｌの処理に戻る。Ｓ７１の処理で、フィールドセパレ
ータとして判定されたら、その外接矩形の始点、終点座
標をフィールドセパレータ位置メモリ３６に保存しく５
７９）　、Ｓ７６の処理を行う。また、ステップ７２の
処理で写真、図表候補と判定されない場合、その外接矩
形の始点、終点座標を文字部位置メモリ３７に保存しく
５８０）、次に３７６の処理を行う。Therefore, if the circumscribed rectangle is determined to be a photograph and diagram candidate, processing for determining the photograph and diagram is started (873
). As a result, if it is determined that the input circumscribed rectangle is a photograph, the start and end point coordinates of the circumscribed rectangle are stored in the photograph position memory 35 (S74). On the other hand, the circumscribed rectangle determined to be a diagram is stored in the diagram position memory 34 (S75). Furthermore, all the circumscribed rectangles (XSj
, YSj), (Xej, Yej) (876), and when it is determined that the process has been performed for all circumscribed rectangles, this process ends (S77). S76
If no, increment the value of j by 578), then S
Return to 'l processing. If it is determined as a field separator in the process of S71, the start point and end point coordinates of the circumscribed rectangle are stored in the field separator position memory 36.
79), perform the process of S76. If it is not determined to be a photo or chart candidate in the process of step 72, the coordinates of the starting point and end point of the circumscribed rectangle are stored in the character position memory 37 (580), and then the process of 376 is performed.

以上が属性判別手段３３の一連の動作であるが、第８図
〜第１５図を参照しつつ、８７１、Ｓ７２、Ｓ７３につ
いて詳細に説明する。The above is a series of operations of the attribute determination means 33, and 871, S72, and S73 will be explained in detail with reference to FIGS. 8 to 15.

第８図は、Ｓ７１のフィールドセパレータの抽出の動作
の一例を示すフローチャート、第９図は、外接期形のＸ
軸長さに対するＸ軸長さの関係図である。FIG. 8 is a flowchart showing an example of the field separator extraction operation in S71, and FIG.
It is a relationship diagram of the X-axis length with respect to the axial length.

第９図に示すように、フィールドセパレータはＸ軸の長
さがγ以上で、Ｙ−αχ十β（但し、αβ；ある固定値
、Ｘ；Ｘ軸の長さ〉で表される境界線よりＹ軸の長さが
小さいという条件で抽出が可能であることが分かる。As shown in Figure 9, the field separator has an It can be seen that extraction is possible under the condition that the length of the Y axis is small.

以上の条件に従い、第８図に示される手ＪＩＩＱでフィ
ールドセパレータの抽出を行う。According to the above conditions, field separators are extracted using the hand JIIQ shown in FIG.

まず、外接矩形（ＸＳｊ、ＹＳｊ）、（Ｘｅｊ。First, the circumscribed rectangle (XSj, YSj), (Xej.

Ｙｅｊ）を入力して（Ｓ９０）　、Ｘ軸の長さＸ１ｅｎ
、Ｙ軸の長さＹｌｅｎを求める（Ｓ９１＞。(S90), and the length of the X axis is X1en.
, find the length Ylen of the Y axis (S91>.

Ｘ軸の長さＸ１ｅｎは、Ｘｅｊ−ＸＳｊ＋１により、Ｙ
軸の長さＹｌｅｎはＹｅｊ−ＹＳｊ＋１により求める。The length X1en of the X axis is determined by Y
The length Ylen of the axis is determined by Yej-YSj+1.

そして、得られたＹｌｅｎとＸ１ｅｎとを比較して（Ｓ
９２＞　、Ｘ１ｅｎの方が大きい場合、Ｘ１ｅｎが閾値
γより大きいが判定する（８９Ｂ）。大きいと判定され
た場合、次にＸ１ｅｎＸα＋βの値を求め、Ｙｌｅｎの
長さがその求めた値よりも小さいと判定された場合（Ｓ
９４）、この外接矩形をフィールドセパレータと判定し
く５９５）、処理を終了する（８９６＞、Ｓ９２のノー
の場合、５９３と同様にＹｌｅｎが閾値γより大きいか
判定する（Ｓ９７）。Ｓ９７のイエス場合、次に８９４
と同様にＹ　ｌ　ｅ　ｒｔ　Ｘａ十βの値を求め、Ｘ１
ｅｎの長さがその求めた値よりも小さいと判定された場
合（８９８）　、次にステップＳ９５の処理を行う。Then, the obtained Ylen and X1en are compared (S
92>, if X1en is larger, it is determined that X1en is larger than the threshold γ (89B). If it is determined that the length of Ylen is smaller than the calculated value, then calculate the value of X1en
94), determines that this circumscribed rectangle is a field separator 595), and ends the process (896>, if no in S92, determines whether Ylen is larger than the threshold γ, as in 593 (S97); if yes in S97 , then 894
Similarly, find the value of Y l er t Xa + β, and
If it is determined that the length of en is smaller than the calculated value (898), then the process of step S95 is performed.

Ｓ９３、Ｓ９４、Ｓ９７、Ｓ９８が、それぞれノーの場
合、処理を終了する（Ｓ９６＞。If S93, S94, S97, and S98 are each negative, the process ends (S96>).

第１０図は、第７図の３７２の動作の一例を示す写真・
図表候補の抽出のフローチャートて′あり一第１１図は
、解像度８本／”ｍｍのときの外接矩形面積に対するそ
の矩形内の縦方向、横方向の白から黒への反転回数を示
す関係図である。FIG. 10 is a photograph showing an example of the operation of 372 in FIG.
Figure 11 is a diagram showing the number of times of reversal from white to black in the vertical and horizontal directions within the circumscribed rectangle with respect to the area of the circumscribed rectangle when the resolution is 8 lines/mm. be.

第１１図の横軸が示すように、文字等と図表・写真とを
外接矩形の面積の大小で判別可能である。As shown by the horizontal axis in FIG. 11, characters, etc., and diagrams/photos can be distinguished by the size of the area of the circumscribed rectangle.

そこで、この特徴を基に第１．０図に示される手段で図
表・写真候補の抽出を行う。Therefore, based on this feature, diagram/photograph candidates are extracted using the means shown in Figure 1.0.

第７図の８７１でフィールドセパレータとして抽出され
なかった外接矩形（ＸＳｊ、ＹＳｊ）、（Ｘｅｊ、Ｙｅ
ｊ）を入力して（３１００）、続いて第８図のＳ９］で
求めたＸ軸の長さＸ１ｅｎとＹ軸の長さＹｌｅｎとの積
により外接矩形の面積Ｓを求める（Ｓｌｏｔ）。得られ
た面積Ｓが閾値Ｃよりも小さい場合（Ｓ１０２＞　、こ
の外接矩形を文字等と判定しく８１０Ｂ）、処理を終了
する（３１０４）、５１０２で面積Ｓが閾値Ｃよりも大
きい場合、この外接矩形は図表・写真候補と判定され（
８１０５）、処理を終了する（ＳＩＯ４）。The circumscribed rectangles (XSj, YSj), (Xej, Ye
j) is input (3100), and then the area S of the circumscribed rectangle is determined by the product of the X-axis length X1en and the Y-axis length Ylen determined in S9 of FIG. 8 (Slot). If the obtained area S is smaller than the threshold C (S102>, this circumscribed rectangle is determined to be a character etc. 810B), the process ends (3104). If the area S is larger than the threshold C in 5102, this circumscribed rectangle is The rectangle is determined to be a diagram/photo candidate (
8105), and the process ends (SIO4).

第１２図は、第７図の８７３の写真・図表の判定の動作
の一例を示すフローチャートである。FIG. 12 is a flowchart illustrating an example of the photo/diagram determination operation 873 in FIG.

Ｓ７２の処理で図表・写真候補と判定され、かつ抽出さ
れた外接矩形について、これを写真あるいは図表のいず
れでるかを判定する処理を行う。For the extracted circumscribed rectangle that is determined to be a diagram/photo candidate in the process of S72, a process is performed to determine whether it is a photograph or a diagram.

第１１図に示すように、面積Ｓが閾値Ｃより大きい外接
矩形に関して（面積Ｓが閾値Ｃより小さい外接矩形に関
してはＳ７２により除かれている）、Ｙ＝ａＸ＋ｂ（但
し、ａ、ｂ；ある固定値、Ｘ：面積Ｓ）で表される境界
線より白から黒の反転回数が大きいか、あるいは小さい
かにより、写真あるいは図表の判定ができる。As shown in FIG. 11, for a circumscribed rectangle whose area S is larger than the threshold value C (circumscribed rectangles whose area S is smaller than the threshold value C are excluded by S72), Y=aX+b (however, a, b; some fixed A photograph or a diagram can be determined depending on whether the number of inversions from white to black is larger or smaller than the boundary line represented by the value (X: area S).

第１３図は、外接矩形面積に対するその矩形内の黒画素
数の関係図である。FIG. 13 is a diagram showing the relationship between the area of a circumscribed rectangle and the number of black pixels within the rectangle.

従来の文字、図表、写真の属性の判別手段として、面積
とその矩形内の黒画素数に基に、黒画素率を求めその大
小で属性判別を行っている方法が多かった。Conventional methods for determining the attributes of characters, diagrams, and photographs often involve determining the black pixel rate based on the area and the number of black pixels within the rectangle, and determining the attributes based on the size of the black pixel rate.

しかし、本実施例で用いた最適２値化された画像に対し
て処理した場合、第１３図に示すような結果が得られ、
写真と図表とを判別する判別基準の設定が困難にである
。したがって、第１１図の特徴を利用して第１２図に示
す動作手順によって写真と図表との判別を行う。However, when processing the optimally binarized image used in this example, the results shown in FIG. 13 are obtained.
This makes it difficult to set discrimination criteria for distinguishing between photographs and charts. Therefore, by utilizing the features shown in FIG. 11, photographs and charts are distinguished by the operating procedure shown in FIG. 12.

第７図の３７２で写真・図表候補と判定された外接矩形
（ＸＳｊ、ＹＳｊ＞、（Ｘｅｊ、Ｙｅｊ）を入力して（
ＳＬｉｏ）、先ず、カウントを０にしてカウントを初期
化する（Ｓ１１１）。また、２値画像データ中のＹＳｊ
がらＹｅｊに走査しつつ順次ＸＳｊからＸｅ、ｊに走査
する過程において、白から黒へ反転したらカウントを増
分する（Ｓ１１２）。これは、第１４図の説明図から明
らかなように、各ＹについてＸ方向に矢印の方向に走査
し、白から黒（パターンＡがらｌ＼パターンＦ−ｉ）へ
反転したらカウントを増分するものである。さらに、同
２値画像データ中のＸＳｊがらＸｅ、ｉに走査しつつ、
順次ＹＳｊからＹｅｊに走査する過程において、白から
黒へ反転したらカウントを増分する（３１１３）。これ
は、第１５図の説明図から明らかなように、各Ｘについ
て）′方向に矢印の方向に走査し、白から黒（パターン
ＡからヘパターンＢ）へ反転したらカウントを増分する
ものである。Input the circumscribed rectangle (XSj, YSj>, (Xej, Yej) determined to be a photo/diagram candidate at 372 in FIG.
SLio), first, the count is initialized by setting it to 0 (S111). Also, YSj in the binary image data
In the process of sequentially scanning from XSj to Xe, j while scanning from XSj to Yej, the count is incremented when white is reversed to black (S112). As is clear from the explanatory diagram in Fig. 14, this scans each Y in the X direction in the direction of the arrow, and increments the count when it reverses from white to black (pattern A to pattern Fi). It is. Furthermore, while scanning XSj in the same binary image data to Xe,i,
In the process of sequentially scanning from YSj to Yej, when white is reversed to black, the count is incremented (3113). As is clear from the explanatory diagram of FIG. 15, this scans each X' in the direction of the arrow, and increments the count when it is reversed from white to black (from pattern A to pattern B).

続いて、第１０図の８１０１で求めた外接矩形の面積Ｓ
から、ＳＸａ十すの値を求め、８１１２゜５１１３で得
られたカウントの１直がその求めた値よりも大きいと判
定された場合（Ｓ１１４）、この外接矩形は写真である
と判定され（Ｓ１１５）、処理を終了する（Ｓ１１６）
。５１１４でカウント値が小さいと判定されたら、この
外接矩形は図表であると判定され（８１１７）、処理を
終了する（Ｓ１１６）。Next, the area S of the circumscribed rectangle obtained at 8101 in FIG.
If the value of SXa is calculated from 8112°5113 and it is determined that the count 1 is larger than the calculated value (S114), it is determined that this circumscribed rectangle is a photograph (S115). ), the process ends (S116).
. If it is determined in 5114 that the count value is small, it is determined that this circumscribed rectangle is a chart (8117), and the process ends (S116).

第１６図は、第１図の文学部統合手段のフローチャート
である。FIG. 16 is a flowchart of the literature department integration means of FIG.

まず、２１直画像データから属性判別手段３３において
、文字部以外（写真、図表、フイ、−ルドセパレータ）
と判断された外接矩形領域の画像を削除して得られた画
像に対して、各Ｙに対するＸ方向の総黒画素数を、さら
に各Ｘに対するＹ方向の総黒画素数を表す周辺ヒストグ
ラムを作成する（Ｓ１２０＞。その周辺ヒスＩ・クラム
から、Ｘ方向に対して黒画素が全く存在しない白ライン
が連続する白ライン領域と、黒画素が少なくとも１つ存
在する黒ラインが連続する黒画素ライン領域とを、また
、Ｙ方向に対して黒画素が全く存在しない白カラムが連
続する白カラム領域と、黒画素が少なくとも１つ存在す
る黒カラムが連続する黒画素カラム領域とをそれぞれ抽
出する（８１２１）。First, from the 21 direct image data, the attribute determination means 33 determines whether the text portions other than text portions (photographs, diagrams, files, and field separators)
For the image obtained by deleting the image of the circumscribed rectangular area determined as (S120>. From the surrounding hiss I/crumb, there is a white line area where there are continuous white lines with no black pixels at all in the X direction, and a black pixel line where there are continuous black lines where there is at least one black pixel). In addition, in the Y direction, a white column area in which there are consecutive white columns without any black pixels, and a black pixel column area in which there are consecutive black columns in which at least one black pixel exists are extracted ( 8121).

次に、その抽出されたい白ライン領域の数と白カラム領
域の数との大小で、縦書きであるのか、横書きであるの
かを判定しく５１２２＞、その判定に基づいて、横書き
と判定された場合は白ライン領域と黒ライン領域との情
報を、一方、縦書きと判定された場合は白カラム領域と
黒カラム領域との情報を用いて、Ｘ（横）方向、Ｙ　（
縦）方向の統合条件の閾値を決定する処置を行う（Ｓ１
２３）。属性判別手段３３で文字部と判別された外接矩
形をその決定した統合閾値の条件に基づき、文字部ブロ
ックに統合し、その結果を文字領域として抽出する（８
１２４＞。Next, it is determined whether it is vertical writing or horizontal writing based on the size of the number of white line areas to be extracted and the number of white column areas.5122> Based on that determination, horizontal writing is determined. If it is determined that the text is written vertically, the information of the white line area and black line area is used. On the other hand, if vertical writing is determined, the information of the white column area and black column area is used.
Steps are taken to determine the threshold of the integration condition in the vertical direction (S1
23). The circumscribed rectangle determined to be a character part by the attribute determination means 33 is integrated into a character part block based on the conditions of the determined integration threshold, and the result is extracted as a character area (8
124>.

以上が、文学部統合手段３７の一連の処理であるが、以
下、各処理について詳細に説明する。The above is a series of processes of the literature department integrating means 37, and each process will be explained in detail below.

５１２０．５１２１について、第１７図を用いて説明す
る。5120.5121 will be explained using FIG. 17.

枠内の画像は、２値画１象データから文字部以外と判別
された外接矩形領域の画像を削除して得られた画像であ
り、枠外右側Ａの画像がその得られた画像に対し、各Ｙ
に対するＸ方向の総黒画素数を示す周辺分布ヒストグラ
ムの、また、枠外下側Ｂの画像が同様に各Ｘに対するＹ
方向の総黒画素数をそれぞれ示す。The image inside the frame is an image obtained by deleting the image of the circumscribed rectangular area that is determined to be other than the text part from the binary image 1-image data. Each Y
The peripheral distribution histogram showing the total number of black pixels in the X direction for each
The total number of black pixels in each direction is shown.

この第１７図は、上述の周辺分布しストダラムのｆ乍成
結果例を示す図である。FIG. 17 is a diagram illustrating an example of the result of the above-mentioned marginal distribution and Stodarham f.

そして、周辺ヒストグラムから、同図に示すようにＸ方
向に黒画素が１つも検出されない白ラインが連続する白
ライン領域と、黒画素が１つでも検出される黒ラインが
連続する黒ライン領域を、またＹ方向に対して同様に白
カラム領域、黒カラム領域とを抽出する。第１７図の場
合は、Ｙ方向に対して黒画素が１つも存在しない白カラ
ムが検出されなかったので、白カラム領域は存在ｊ−な
い。Then, from the peripheral histogram, as shown in the figure, a white line area where there are consecutive white lines in which no black pixels are detected in the X direction, and a black line area where there are consecutive black lines where at least one black pixel is detected are determined. , and similarly extract white column areas and black column areas in the Y direction. In the case of FIG. 17, no white column in which there is no black pixel in the Y direction was detected, so there is no white column area.

次に、５１２２を説明すると、５１２１°ご抽出された
白ライン領域の数と、白カラム領域の数とから、（白ラ
イン領域の数ン白カラム領域の数）と判定されたら、こ
の文書は横書きと、その逆である、と判定されたら、縦
書きとそれぞれ判定する。Next, to explain 5122, if it is determined from the number of extracted white line areas and the number of white column areas that 5121° is (number of white line areas - number of white column areas), then this document is If it is determined that the writing is horizontal or vice versa, it is determined that the writing is vertical.

８１２３について、第１８図を用いて説明する。8123 will be explained using FIG.

この第１８図は、５１２２で横書きと判定されたときの
処理例を示し、縦方向、横方向の統合閾値を説明する図
である２白ライン領域の幅（連続する白ライン数）のヒストグラ
ムを示したもので、Ｘ方向に白ライン領域の幅（ライン
数）を、Ｙ方向に各ライン領域幅に対する個数をそれぞ
れ表している。This Figure 18 shows an example of processing when horizontal writing is determined in 5122, and is a diagram explaining the integrated thresholds in the vertical and horizontal directions. The figure shows the width (number of lines) of the white line area in the X direction, and the number for each line area width in the Y direction.

そこで、最も個数の多い白ライン領域幅から（白ライン
領域幅が１６の箇所）幅の大きくなる方向に走査して、
個数が０となる白ライン領域幅が検出されたら、その幅
１つ前の白ライン領域幅を値ＴＷとして抽出する。この
図の場合は１９となる。Therefore, by scanning in the direction of increasing width from the white line area width that has the largest number (the area where the white line area width is 16),
When a white line area width whose number is 0 is detected, the white line area width immediately before that width is extracted as a value TW. In this figure, it is 19.

また、黒ライン領域幅のヒストグラムから同様の処理を
行い、値ＴＢを抽出する。Further, similar processing is performed on the histogram of the black line area width to extract the value TB.

その結果、得られた値ＴＷと値ＴＢとから横方向、縦方
向の統合閾値を決定する。As a result, integrated thresholds in the horizontal and vertical directions are determined from the obtained values TW and TB.

その関係を、横方向の総合間値ＴＨＸ＝ＴＷ＋ＴＢ縦方向の総合間値ＴＨＹ＝ＴＷ＋Ｅｒ　（但し、Ｅｒ：
傾き誤差分のライン数）に設定する。The relationship is as follows: Total horizontal value THX=TW+TB Vertical total value THY=TW+Er (However, Er:
(number of lines equal to the slope error).

以上の横方向の統合閾値の決定手段の説明では、横書き
と判定された場合について行ったが、縦書きと判定され
た場合は、上述の白ラインに領域、黒ライン領域に対し
て白カラム、黒カラム領域の幅のヒストガラムから同様
にＴＷ、ＴＢを抽出し、縦書きの場合、横方向の総合間値ＴＨＸ＝ＴＷ＋Ｅｒ縦方向の総合間値ＴＨＹ＝ＴＷ＋ＴＢに設定する。The above description of the means for determining the integrated threshold in the horizontal direction was based on the case where horizontal writing was determined. However, when it was determined that vertical writing TW and TB are extracted in the same way from the histogram of the width of the black column area, and in the case of vertical writing, the total distance in the horizontal direction is set to THX=TW+Er, and the total distance in the vertical direction is set to THY=TW+TB.

５１２４について、第１９図を用いて説明する。5124 will be explained using FIG. 19.

この第１９図は、文字部の外接矩形の統合を説明する図
である。FIG. 19 is a diagram illustrating the integration of circumscribed rectangles of character parts.

原点（０，０＞を２値画（象データの鼓左上に、そして
同図に示すパターンＡの様な隣り合う外接矩形やパター
ンＢの様ζ・重なり合う様な外接矩形について、５１２
３で求めた縦方向、横方向の統合閾値を基に次の条件に
より統合を行う。The origin (0, 0> is set as a binary image (at the top left of the elephant data), and for adjacent circumscribed rectangles like pattern A and overlapping circumscribed rectangles like pattern B shown in the same figure, 512
Integration is performed under the following conditions based on the vertical and horizontal integration thresholds found in step 3.

（２）ＴＨＹ；縦向の統合閾値）（３）ＸＳ２≦ＴｌＸ
（但し、ＴＨＸ、横が向の統合間１偵）（２）ＹＳ２−
Ｙｅ　１≦ＴＨＹ（但し、ＴＨＸ；横方向の統合閾値）（３）ＸＳ１−Ｘｅ２≦Ｔｌ（Ｘ（４）ＹＳｌ−Ｙｅ２≦ＴＨ）’ 上述の条件により２つの外接矩形が統合され、その員小
Ｘ、Ｙ座標をそれぞれ始点座標、終点座標として新たな
矩形領域として保存する。(2) THY; Vertical integration threshold) (3) XS2≦TlX
(However, THX is the 1st detective on the horizontal side) (2) YS2-
Ye 1≦THY (however, THX: horizontal integration threshold) (3) The X and Y coordinates are saved as a new rectangular area as the starting point coordinates and ending point coordinates, respectively.

この時、上述の統合条件の各左辺部の値かマイナスとな
った場合に関しては条件を満たしているものとする。At this time, it is assumed that the conditions are satisfied if the values on the left side of the above-mentioned integration conditions are negative.

以上の様な方法で、外接矩形（新たに保存した矩形領域
を含む）を次々と統合して保存するという処置を、もう
これ以上の上述の条件で統合をするものはないという状
態まで繰り返し行う。Using the method described above, repeat the process of merging and saving the circumscribed rectangles (including the newly saved rectangle area) one after another until there is nothing more to be integrated under the above conditions. .

また、保存方法として２つの外接矩形の情報が保存され
ているメモリ位置のうち、一方の外接矩形の情報を保存
し、他方の外接矩形の情報が保存されているメモリ位置
の内容を削除しながら処理を繰り返す方法がある。In addition, as a storage method, among the memory locations where the information of two circumscribed rectangles is saved, the information of one circumscribed rectangle is saved, and the contents of the memory location where the information of the other circumscribed rectangle is saved are deleted. There is a way to repeat the process.

第２０図は、ある論文誌の原稿を、８本／ｍｍの解像度
でよみ取った画像に対して処理を行った領域分割の結果
を示す図である。５０はフィールドセパレータ、５１は
写真、５２は文字部、および５３は図表である。FIG. 20 is a diagram showing the results of region segmentation performed on an image read from a manuscript of a certain journal at a resolution of 8 lines/mm. 50 is a field separator, 51 is a photograph, 52 is a text portion, and 53 is a diagram.

本実施例は次のような利点がある。This embodiment has the following advantages.

（イ）　第１１図に示す境界線Ｙ　＝　ａ　Ｘ　十すは
、ｂ＝ｏのとき、面積Ｓに対する反転回数の比が、ある
閾値よりも大きいか、あるいは小さいかにより図表と写
真との判別する。面積Ｓと反転回数との関係は第１１図
に示す通りになり、ａ、ｂにある固定数を持たせた境界
線Ｙ＝ａＸ＋ｂを境界線（閾値）とすることにより、−
７１、判別精度を高めることができる。(b) When the boundary line Y = a do. The relationship between the area S and the number of inversions is as shown in Fig. 11, and by setting the boundary line Y = a
71, discrimination accuracy can be improved.

（ロ）　従来、フィールドセパレータを判別するには、
Ｘ軸の長さとＹ軸の長さとの比がある関係よりも大きい
か、それとも小さいかで判別することが行われていた。(b) Conventionally, to determine the field separator,
Discrimination has been made based on whether the ratio between the length of the X axis and the length of the Y axis is larger or smaller than a certain relationship.

これは、Ｙ＝αＸ＋βに示すβが０で示される場合と同
様となる。ところが、外接矩形のＸ軸の長さとＹ軸の長
さとの関係は、第９図の示す通りになり、α、βにある
固定値を持たせたＹ＝αχ＋βを境界線（閾値）とする
ことにより、さらに判別制度を高めることができる。This is similar to the case where β shown in Y=αX+β is shown as 0. However, the relationship between the length of the X axis and the length of the Y axis of the circumscribed rectangle is as shown in Figure 9, and the boundary line (threshold) is Y = αχ + β, where α and β have fixed values. By doing so, the discrimination system can be further improved.

（ハ）　最適２値化され得られた２値画像に対して、外
接矩形検出手段で、適当な閾値ＴＨ以上（１〜４が適当
）の横方向（Ｘ方向）に黒画素が全く存在しない白ライ
ンと、閾値ＴＨ以上の横方向（Ｙ方向）に黒画素が全く
存在しない白カラムで囲まれる情報を、これ以上の切り
出しは行えないという状態まで、切り出し処理を繰り返
し、属性判別手段で検出された外接矩形特徴とその矩形
内の特徴からフィールドセパレータ、写真、図表、文字
部に分類し、文字統合手段において上述の属性判別手段
で、文字部と判別された外接矩形は、順次、文字部ブロ
ックに統合するようにしたので、■最適２値化された画
像に対して適用できる。■属性判別精度の向上が図れる
、■文字部ブロックが高精度に抽出できる、■種々の対
象とする画（象に柔軟に対応できる、等の利点がある。(c) With respect to the binary image obtained by optimal binarization, a circumscribing rectangle detection means is used to determine that there are no black pixels in the horizontal direction (X direction) above an appropriate threshold value TH (appropriately 1 to 4). The extraction process is repeated until the information surrounded by a white line and a white column in which there are no black pixels in the horizontal direction (Y direction) equal to or higher than the threshold TH can no longer be extracted, and the attribute discrimination means detects the information. The circumscribed rectangles are classified into field separators, photographs, diagrams, and text parts based on the circumscribed rectangle features and the features within the rectangle, and the enclosing rectangles that are determined to be text parts by the above-mentioned attribute discrimination means in the character integration means are sequentially classified into text parts. Since it is integrated into blocks, ■ it can be applied to optimally binarized images. It has the following advantages: 1. The accuracy of attribute discrimination can be improved; 2. Character blocks can be extracted with high precision. 2. It can flexibly respond to various target images.

なお、本発明は、図示の実施例に限定されず、種々の変
形が可能である。その変形例としては、例えば、次のよ
うなものがある。Note that the present invention is not limited to the illustrated embodiment, and various modifications are possible. Examples of such modifications include the following.

（ａ）　本実施例では、横長のフィールドセパレータに
ついてのみについて記述しているが、縦長のフィールド
セパレータの場合も考えられる。(a) In this embodiment, only a horizontally long field separator is described, but a vertically long field separator may also be considered.

（ｂ）　　本実施例では、第１０図に示す５１０３を設
けたが、省略してもよい。なぜなら外接矩形の情報は外
接矩形位置メモリ３２に保存されており、その中からフ
ィールドセパレータ、写真、及び図表と判別される外接
矩形が抽出されれば、残りの情報が文字部の外接矩形と
判断できるからである。(b) In this embodiment, 5103 shown in FIG. 10 is provided, but it may be omitted. This is because the information on the circumscribed rectangle is stored in the circumscribed rectangle position memory 32, and if the circumscribed rectangles that are identified as field separators, photographs, and charts are extracted from it, the remaining information is determined to be the circumscribed rectangle of the character part. Because you can.

（ｃ＞　　本実施例では、第１２図の５１１２，５１１
３において、白画素から黒画素へ反転したらカラン１へ
を増分すると１−だが、黒画素から白画素、または白画
素から黒画素及び黒画素から白Ｎ＋ｉ素へ反転したら増
分するとし７てもよい。(c> In this embodiment, 5112, 511 in FIG.
In 3, when inverting from a white pixel to a black pixel, the increment to Callan 1 is 1-, but when inverting from a black pixel to a white pixel, or from a white pixel to a black pixel, and from a black pixel to a white pixel, it may be incremented by 7. .

（ｄ）　　第３図の外接矩形検出処理に代えて、情報領
域４１と情報領域４１との間隔が閾値Ｔ’Ｈ以内は同領
域として、連結領域〈ラベリング処Ｆｌ）による矩形領
域を抽出づ−る方法や、境界追跡による矩形領域を抽出
する方法を用いて処理を行っても、外接矩形抽出結果は
同様な結果が得られ、ヒ記実施例と同様な効果が得られ
る。(d) Instead of the circumscribed rectangle detection process in FIG. 3, if the distance between the information areas 41 is within the threshold value T'H, it is assumed that they are the same area, and a rectangular area is extracted by the connected area (labeling process Fl). Even if processing is performed using a method of extracting a rectangular area by boundary tracking or a method of extracting a rectangular area by boundary tracking, the same circumscribed rectangle extraction results are obtained, and the same effects as in the embodiment described above can be obtained.

（発明の効果）以上詳細に説明したように、本発明によればｆ＆適適値
値化れ得らｉｔた２値両を督こ対して、外Ｒ矩形検出手
段で、適当な閾値ＴＨ以上（１〜・４が適当）の横方向
（Ｘ方向）に黒画素が全く存在しない白ラインと、閾値
”Ｉ’ＨＬ゛月二の横方向（Ｘ方向）に黒画素が全く存
在しない白カラムで囲まれる情報を、これ以上の切り出
しは行えないという状態まで、切り出し処理を繰り返し
、属性判別ず段で検出された外接矩形特徴とその矩形内
の特徴からフィールドセパレータ、写真、図表、文字部
に分類し、文字統合手段において上述の属性判別手段で
、文字部と判別された外接矩形は、順次、文字部ブロッ
クに統合するようにしたので、■最適２値化された画像
に対して適用できる。■属性判別精度の向上が図れる、
■文字部ブロックが高精度に抽出できる、■種々の対象
とする画像に柔軟に対応できる、等の効果が期待できる
。(Effects of the Invention) As described in detail above, according to the present invention, the binary values f and it that can be converted into an appropriate value are collected, and the outer R rectangle detection means is used to detect a value that is equal to or higher than the appropriate threshold value TH. A white line with no black pixels in the horizontal direction (X direction) (1 to 4 is appropriate) and a white column with no black pixels in the horizontal direction (X direction) with a threshold of "I'HL" The extraction process is repeated until it is no longer possible to extract the information surrounded by The circumscribed rectangles that are classified and determined to be character parts by the above-mentioned attribute determination means in the character integration means are sequentially integrated into the character part blocks, so that they can be applied to images that have been optimally binarized. ■It is possible to improve attribute discrimination accuracy.
The following effects can be expected: ■Character blocks can be extracted with high precision; ■It can be flexibly adapted to various target images.

[Brief explanation of the drawing]

第１図は本発明の実施例を示す画像処理装置の機能ブロ
ック図、第２図は従来の画像処理装置の機能ブロック図
、第３図は第１図の外接矩形検出手段のフローチャート
、第１１図は第１図の外接矩形検出手段を説明する図、
第５図は第３図のＹ方向切り出し処理を説明する図、第
６図は第３図のＸ方向切り出し処理を説明する図、第７
図は第１図の属性判別手段のフローチャート、第８図は
第７図のフィールドセパレークの抽出のフローチャート
、第９図は外接矩形のＸ軸長さに対するＹ軸長さの関係
図、第１０図は第７図の写真・図表候補の抽出のフロー
チャー１〜、第１１図は外接矩形面積に対する反転回数
の関係図、第１２図は第７図の写真・図表の判定のフロ
ーチャー１・、第１３図は外接矩形面積に対する黒画素
数の関係図、第１４図は第１２図の８１１２を説明する
図、第１５図は第１２図の８１１３を説明する図、第１
６図は第１図の文学部統合手段のフローチャート、第１
７図は周辺ヒストグラムの作成結果例を示す図、第１８
図は縦方向、横方向の統合閾値を説明する図、第１９図
は文字部の外接矩形の統合を説明する図、第２０図は領
域分割結果を示す図である。３１・・・・・・外接矩形検出手段、３３・・・・・・
属性判別手段、３４・・・・・・図表位置メモリ、３５
・・・・・・写真位置メモリ、３６・・・・・・フィー
ルドセパレータ位置メモリ、３７・・・・・・文学部統
合手段、３８・・・・・・文字部位置メモリ。FIG. 1 is a functional block diagram of an image processing device showing an embodiment of the present invention, FIG. 2 is a functional block diagram of a conventional image processing device, FIG. 3 is a flowchart of the circumscribed rectangle detection means in FIG. The figure is a diagram illustrating the circumscribed rectangle detection means in FIG. 1,
5 is a diagram for explaining the Y-direction extraction process in FIG. 3, FIG. 6 is a diagram for explaining the X-direction extraction process in FIG. 3, and FIG.
Figure 8 is a flowchart of the attribute determination means in Figure 1, Figure 8 is a flowchart of field separation extraction in Figure 7, Figure 9 is a diagram of the relationship between the X-axis length and the Y-axis length of the circumscribed rectangle, and Figure 10 is a flowchart of the attribute determination means in Figure 1. The diagrams are flowcharts 1 to 1 for extracting photo/diagram candidates in Fig. 7, Fig. 11 is a diagram showing the relationship between the number of inversions and the area of the circumscribed rectangle, and Fig. 12 is the flowchart 1 to judgment for photos/diagrams in Fig. 7. , FIG. 13 is a diagram of the relationship between the number of black pixels and the area of the circumscribed rectangle, FIG. 14 is a diagram explaining 8112 in FIG. 12, FIG. 15 is a diagram explaining 8113 in FIG.
Figure 6 is a flowchart of the literature department integration method in Figure 1.
Figure 7 is a diagram showing an example of the creation result of a peripheral histogram.
19 is a diagram illustrating integration thresholds in the vertical and horizontal directions, FIG. 19 is a diagram illustrating integration of circumscribed rectangles of character parts, and FIG. 20 is a diagram illustrating region division results. 31... Circumscribed rectangle detection means, 33...
Attribute determination means, 34...Chart position memory, 35
. . . Photo position memory, 36 . . . Field separator position memory, 37 . . . Literature department integration means, 38 . . . Character section position memory.

Claims

[Claims] 1. A document image region dividing method that cuts out an information region including a character portion of input image data and performs attribute determination processing on the information region, comprising: optimally binarizing the image data; The binary image data thus obtained is input to a circumscribed rectangle detection means, and the information area in the binary image data is converted into a circumscribed rectangle by the circumscribed rectangle detection means, and the characteristics of the circumscribed rectangle and the binary image within the circumscribed rectangle are determined. and the attribute determining means determines the attributes of the information area from the characteristics of the circumscribed rectangle and the features of the binary image within the circumscribed rectangle, and the attribute determining means determines the character among the attributes. 1. A method for dividing a document surface image into regions, the method comprising: integrating a circumscribed rectangle determined to be a text section into a text section block using a text section integration means. 2. The method for dividing both images of a document into regions according to claim 1, wherein the character part integrating means creates a peripheral histogram for a character part image obtained by extracting only the character part from the binary image data. and integrating a circumscribed rectangle of the character part into the character part block using the surrounding histogram. 3. The document image region dividing method according to claim 2, wherein the character part integrating means determines, from the creation result of the peripheral histogram, a white line region in which white lines in which the black pixels do not exist at all in the horizontal direction are continuous. and a black pixel line area in which a black line in which at least one black pixel exists is continuous, and a white column in which white columns in which no black pixel does not exist in the vertical direction are continuous;
A region and a black pixel column region in which black columns in which at least one black pixel exists are extracted, respectively, horizontal writing or vertical writing is determined based on the extraction results, and horizontal or vertical writing is performed based on the determination result. Find an integration threshold for the character part, perform integration processing on the circumscribed rectangle of the character part based on the integration threshold,
A document image region division method that extracts the result as a character region. 4. In the document image region dividing method according to claim 3, the determination of vertical writing/horizontal writing is based on the creation result of the peripheral histogram, and a white line area in which white lines in which the black pixels do not exist at all in the horizontal direction is continuous. . A method for dividing a document image into regions based on the number of consecutive white column regions in which there are no black pixels in the vertical direction. 5. In the document image region dividing method according to claim 3, the integrated threshold in the vertical and horizontal directions is calculated from a histogram of the white line region width and the black line region width when horizontal writing is determined; If it is determined that the document image is written vertically, the area division method of the document image is determined based on the histogram of the white column area width and the black column area width. 6. The document image area dividing method according to claim 3, wherein the integrated threshold in the vertical and horizontal directions is determined based on a histogram of the white line or white column area widths, and the white line, white column, or area having the largest number of white lines or white columns. After detecting the white line, white column, or region width whose number becomes 0 by scanning in the direction of increasing width, the white line or white column region immediately before that is extracted as a value TW, and then the black line is detected. Or, extract the value TB from the histogram of the black column area width, and extract the value TW and the value TB
Based on this, in the case of horizontal writing, set the horizontal total threshold value THX = TW + TB, the vertical total threshold value THY = TW + Er (however, Er: the number of lines corresponding to the tilt error), and in the case of vertical writing, set the total horizontal value as THX = TW + TB. A document image area division method that sets a threshold value THX=TW+Er and a total vertical threshold value THY=TW+TB. 7. In the document image region dividing method according to claim 3, in the integration process, two region coordinates (XS1, YS1) of the circumscribed rectangle of interest are determined.
) (Xe1, Ye1), (XS2, YS2) (Xe2,
Ye2) (However, XS1, XS2, YS1, YS2; starting point coordinates of horizontal and vertical areas, Xe1, Xe2, Ye
1, Ye2; end point coordinates of the horizontal and vertical regions), based on the following integration conditional expressions (1) to (4), the circumscribed rectangles that have been determined to be the character portion are successively combined into one, and the limit A method for dividing a document image into regions, in which the integration is repeated up to the point where the integration results are extracted as the text block. Integration conditional expression (1) XS2-Xe1≦THX (where THX: horizontal integrated threshold) (2) YS2-Ye1≦THY (however, THY: vertical integrated threshold) (3) XS1-Xe2≦THX ( 4) YS1-Ye2≦THY (However, XS2-Xe1, YS2-Ye1, XS1-X
When the value of e2, YS1-Ye1 is negative, the condition is met)