JPH0773271A

JPH0773271A - Area dividing method

Info

Publication number: JPH0773271A
Application number: JP5331252A
Authority: JP
Inventors: Toshifumi Yamaai; 敏文山合
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1993-06-29
Filing date: 1993-12-27
Publication date: 1995-03-17
Anticipated expiration: 2017-05-27
Also published as: JP3285686B2

Abstract

PURPOSE:To automatically set the threshold value for integration of a character rectangle and also to separate the text area from the footers, the headers, etc., when a document area is divided. CONSTITUTION:A circumscribed rectangle detecting part 8 detects a circumscribed rectangle of the black picture element coupling component of an image obtained by applying the OR compression to an input document image through an image compressing part 7. A rectangle sorting part 9 sorts the rectangles into the character rectangles and other rectangles. A character area integrating part 10 automatically sets the threshold value for integration of characters based on the vertical/horizontal ratio of the character rectangle or a space distribution and integrates the lines of the character rectangle. However the integration of the text lines and the footers or headers is inhibited between the character rectangles. Furthermore the threshold value is increased by punctuation marks so that the failure of integration of punctuation marks can be prevented. Then the character rectangles of footers and headers are integrated with the character area different from the text area.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、文書等の画像データよ
り文字領域や図表領域等の属性の異なる情報領域を分割
する領域分割技術に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an area dividing technique for dividing an information area having different attributes such as a character area and a figure area from image data such as a document.

【０００２】[0002]

【従来の技術】文字領域や図表領域等が混在した文書等
の文字認識をする場合、あるいは、そのような文書等の
画像を光ディスク等の記憶メディア上にファイリングし
たりファクシミリで送信したりする場合等に、属性の異
なる情報領域を分割し、各領域毎に最適な処理方法に切
り替える必要がある。2. Description of the Related Art When recognizing characters in a document in which character areas and chart areas are mixed, or when an image of such a document is filed on a storage medium such as an optical disk or transmitted by facsimile. For example, it is necessary to divide the information areas having different attributes and switch to the optimum processing method for each area.

【０００３】このような情報領域分割に関しては、周辺
分布を用いて文字領域とその他領域に切り分ける方法
（特開昭６２−７１３７９号）や、周辺分布を利用して
黒画素群の外接矩形を抽出し、その矩形の統合によって
文字領域を構成して他の領域と分割する方法（特開平３
−１２６１８１号）が知られている。Regarding such information area division, a method of dividing into a character area and other areas by using a peripheral distribution (Japanese Patent Laid-Open No. 62-71379) and a circumscribed rectangle of a black pixel group by utilizing the peripheral distribution are extracted. Then, a method of forming a character area by integrating the rectangles and dividing the character area from other areas (Japanese Patent Laid-Open No. Hei 3 (1999) -1999)
-126181) is known.

【０００４】しかし、前者の方法は、画像のスキューへ
の対応が難しく、さらに、対象とする文書毎にパラメー
タを設定しなければならないという問題点がある。後者
の方法は、矩形抽出に周辺分布を利用する関係から画像
のスキューの程度によってはスキュー補正処理が必要と
なるという問題点があり、またさらに、文字領域におい
て矩形の重なりがある場合や、囲み記事の領域が含まれ
るような構造の複雑な文書への対応が困難であるという
問題がある。However, the former method has a problem that it is difficult to deal with the skew of the image, and the parameter must be set for each target document. The latter method has a problem that skew correction processing is required depending on the degree of image skew due to the use of peripheral distribution for rectangle extraction. There is a problem that it is difficult to deal with a complicated document having a structure including an article area.

【０００５】なお、本出願人による平成４年第２６７３
１３号特許願に添付の明細書及び図面に、文字領域の抽
出に矩形統合を利用する改良された領域分割方法が開示
されている。この方法によれば、スキュー補正を不要に
でき、また、構造が複雑な文書や種々の文字サイズの文
書に対応することができる。しかし、文字矩形の行方向
への統合パラメータ（矩形間距離の閾値）の設定が不適
当であると、統合すべきでない隣合った領域が統合され
てしまったり、ある行方向の文字列から構成される本文
領域と、それに近接した異方向の文字列（本文に対する
タイトル等）が誤って統合される危険があった。The applicant's 1992 2673
The specification and drawings attached to the No. 13 patent application disclose an improved region segmentation method that utilizes rectangle integration to extract character regions. According to this method, skew correction can be dispensed with, and a document having a complicated structure and documents of various character sizes can be dealt with. However, if the setting of the integration parameter (threshold distance between rectangles) in the line direction of the character rectangle is inappropriate, adjacent areas that should not be combined will be combined, or a character string in a certain line direction will be created. There is a risk that the text area to be displayed and a character string in a different direction close to the text area (such as a title for the text) are mistakenly integrated.

【０００６】[0006]

【発明が解決しようとする課題】よって本発明の目的
は、領域分割に関する上述の如き問題点あるいは課題を
改善し、構造の複雑な文書類や文字サイズが様々な文書
類に対する領域分割の精度及び安定性の向上を図ること
にある。SUMMARY OF THE INVENTION Therefore, an object of the present invention is to improve the above-mentioned problems or problems relating to area division, and to improve the accuracy of area division for documents having complicated structures or documents having various character sizes. It is to improve stability.

【０００７】[0007]

【課題を解決するための手段】本発明によれば、文書等
の画像を所定倍率で圧縮した画像より黒画素連結成分に
外接した矩形を抽出し、抽出した矩形を文字の矩形とそ
れ以外の矩形に分類し、文字の矩形を行方向へ統合する
領域分割方法の改良方法が提供される。According to the present invention, a rectangle circumscribing a black pixel connected component is extracted from an image obtained by compressing an image of a document or the like at a predetermined magnification, and the extracted rectangle is defined as a rectangle of characters and other rectangles. Provided is an improved method of dividing an area, which classifies a rectangle of characters and integrates a rectangle of characters in the row direction.

【０００８】すなわち、請求項１記載の発明にあって
は、文字の矩形の高さと幅の比をとることにより画像の
圧縮による文字間の融合度合を推定し、この融合度合に
応じて、文字の矩形の該行方向への統合のための矩形間
距離の閾値を設定する。That is, according to the first aspect of the invention, the degree of fusion between characters due to image compression is estimated by taking the ratio of the height and width of the rectangle of the character, and the degree of fusion of the characters is estimated according to this degree of fusion. The threshold value of the inter-rectangle distance for integrating the rectangles in the row direction is set.

【０００９】請求項２記載の発明にあっては、該行方向
に近接した文字の矩形間の距離の分布を測定し、該分布
に基づいて、文字の矩形の該行方向への統合のための矩
形間距離の閾値を設定する。According to the second aspect of the present invention, the distribution of the distances between the rectangles of the characters adjacent to each other in the line direction is measured, and based on the distribution, the rectangles of the characters are integrated in the line direction. Set the threshold of the distance between rectangles.

【００１０】請求項３記載の発明にあっては、文字の矩
形の高さと幅の比をとることにより画像の圧縮による文
字間の融合度合を推定し、文字の矩形の該行方向への統
合のための矩形間距離の閾値の設定方法として、該融合
度合により設定する方法、または、該行方向に近接した
文字の矩形間の距離の分布により設定する方法を、該融
合度合の大小に応じて選択する。According to the third aspect of the present invention, the degree of fusion between characters due to image compression is estimated by taking the ratio of the height and width of the character rectangle, and the character rectangle is integrated in the line direction. As a method of setting the threshold value of the inter-rectangle distance for the above, a method of setting by the fusion degree or a method of setting by the distribution of the distance between the rectangles of the characters that are close to each other in the row direction is used depending on the degree of fusion degree. To select.

【００１１】請求項４記載の発明にあっては、文字の矩
形の該行方向への統合処理中に、注目した文字の矩形が
句読点またはそれに類する特定の記号の矩形である否か
の判定をし、句読点またはそれに類する特定の記号であ
ると判定したときに、当該注目した矩形と次の文字の矩
形との統合に関する矩形間距離の閾値を、句読点または
それに類する特定記号以外の文字の矩形の統合の場合よ
り増大させる。According to the fourth aspect of the present invention, during the process of integrating the character rectangles in the line direction, it is determined whether or not the focused character rectangle is a punctuation mark or a similar symbol rectangle. However, when it is determined that the rectangle is a punctuation mark or a specific symbol similar to it, the threshold value of the distance between rectangles regarding the integration of the rectangle of interest and the rectangle of the next character is set to a rectangle of a character other than the punctuation mark or a similar specific symbol. Increase more than the case of integration.

【００１２】また、請求項５記載の発明にあっては、文
字の矩形の該行方向への統合処理中に、統合しようとす
る二つの矩形の一方の矩形と他方の矩形がそれぞれ異方
向の行の文字の矩形であるか否かの判定をして、異方向
の行の文字の矩形であると判定したときには当該二つの
矩形の統合を禁止する。According to the fifth aspect of the invention, during the process of integrating the character rectangles in the line direction, one of the two rectangles to be integrated and the other rectangle are in different directions. When it is determined whether or not it is a rectangle of characters in a line, and when it is determined to be a rectangle of characters in a different direction, the integration of the two rectangles is prohibited.

【００１３】さらに、請求項６記載の発明にあっては、
請求項５記載の方法において、文字の矩形の該行方向へ
の統合によって得られた行の中に、該行方向の長さが短
い行が存在する場合に、該短い行より該行方向に対し垂
直な方向へ短い行を探索し統合する処理を行なう。Further, according to the invention of claim 6,
The method according to claim 5, wherein, when a line having a short length in the line direction is present in the lines obtained by integrating the rectangles of characters in the line direction, the line is moved from the short line to the line direction. On the other hand, the process of searching for short lines in the vertical direction and integrating them is performed.

【００１４】請求項８記載の発明は、文書等の画像を所
定倍率で圧縮した画像より黒画素連結成分に外接した矩
形を抽出し、抽出した矩形を文字の矩形とそれ以外の矩
形に分類し、文字の矩形を本文文字行方向へ統合し、こ
の矩形の統合によって得られた行を統合して文字部領域
を切り出すが、文字の矩形の本文文字行方向への統合に
よって得られた行を本文文字行と本文文字行以外の行と
に分類し、本文文字行以外の行に本文文字行とは別の固
有の属性を持たせることにより、行の文字部領域への統
合の際に本文行以外の行と本文文字行との統合を避け
る。According to an eighth aspect of the present invention, a rectangle circumscribing a black pixel connected component is extracted from an image obtained by compressing an image of a document or the like with a predetermined magnification, and the extracted rectangle is classified into a character rectangle and other rectangles. , The character rectangle is integrated in the body character line direction, and the line obtained by combining the rectangles is combined to cut out the character part area, but the line obtained by combining the character rectangle in the body character line direction is extracted. By classifying into body text lines and lines other than body text lines, and giving lines other than body text lines a unique attribute different from body text lines, the text is integrated when the lines are integrated into the character area. Avoid merging non-line lines with body text lines.

【００１５】請求項９記載の発明は、請求項８記載の発
明の方法において、矩形の分類によって文字、罫線以外
の矩形に分類された矩形の内部または周辺に存在する本
文文字行以外の行の中で、その周辺に本文文字行方向に
対して垂直の方向に統合可能な行がみつかるものと、見
つからないものとに分類し、それぞれに固有の属性を持
たせることを特徴とするものである。According to a ninth aspect of the invention, in the method of the eighth aspect of the invention, lines other than the text line of the text existing inside or around the rectangle classified into the rectangles other than the characters and the ruled lines by the classification of the rectangles. Among them, it is characterized by classifying it into a line in which a line that can be integrated in the direction perpendicular to the text line direction is found and a line in which it cannot be found, and giving each line a unique attribute. .

【００１６】請求項１０記載の発明の特徴は、請求項９
記載の発明の方法において、本文文字行以外の行につい
て、同じ属性を持つ行同士を統合することである。The features of the invention according to claim 10 are as follows.
In the method of the invention described above, regarding lines other than the text character line, lines having the same attribute are integrated.

【００１７】請求項１１記載の発明は、請求項９記載の
発明の方法において、本文文字行以外の属性を持つ行を
消去することを特徴とするものである。The eleventh aspect of the invention is characterized in that, in the method of the ninth aspect, lines having an attribute other than the text line are deleted.

【００１８】請求項１２記載の発明は、請求項８記載の
発明の方法において、矩形の分類によって文字、罫線以
外の矩形に分類された矩形を、図の矩形と写真の矩形と
に細分類し、写真の矩形の内部にある本文文字行以外の
行を消去し、または写真の属性を持たせることを特徴と
するものである。According to the twelfth aspect of the present invention, in the method of the eighth aspect, the rectangles classified into rectangles other than characters and ruled lines by the rectangle classification are subdivided into a figure rectangle and a photograph rectangle. , The lines other than the text line in the photo rectangle are erased, or the photo attribute is given.

【００１９】[0019]

【作用】文書等の文字間隔が広いか狭いかによって、文
書等の画像の圧縮画像上の文字間の融合度合に違いが生
じる。請求項１または３記載の発明は、この文字間の融
合度合を、文字矩形の縦横比より推定することによっ
て、処理対象文書等の文字間隔に適応させて、文字矩形
の行方向への統合のための矩形間距離の閾値を最適化す
ることができる。また、請求項２または３記載の発明
は、文字間の融合度合によっては閾値の精度が悪化しや
すい文字間隔が広い文書等でも、文字矩形間の距離の分
布に基づいて閾値を高精度に最適化することができる。
したがって、請求項１乃至３記載の発明によれば、文字
間隔の異なる多様な文書等に対応可能である。The difference in the degree of fusion between the characters on the compressed image of the image of the document or the like depends on whether the character interval of the document or the like is wide or narrow. The invention according to claim 1 or 3 estimates the fusion degree between the characters from the aspect ratio of the character rectangle, thereby adapting to the character interval of the document to be processed or the like, and integrating the character rectangle in the line direction. It is possible to optimize the threshold value of the distance between the rectangles. Further, the invention according to claim 2 or 3 optimizes the threshold value with high accuracy based on the distribution of the distance between the character rectangles even in the case of a character or the like in which the accuracy of the threshold value is easily deteriorated depending on the fusion degree between the characters. Can be converted.
Therefore, according to the inventions of claims 1 to 3, it is possible to deal with various documents having different character intervals.

【００２０】また、請求項１記載の発明によれば閾値の
決定処理の高速化を期待できることがあり、他方、請求
項２記載の発明によれば、より高精度の閾値の最適化を
期待できる。請求項３記載の発明によれば、そのような
二つの閾値決定方法を、処理文書の文字間隔の広狭に応
じて使い分けることができる。Further, according to the invention described in claim 1, it may be possible to expect a high-speed determination process of the threshold value, while according to the invention described in claim 2, it is possible to expect a more accurate optimization of the threshold value. . According to the third aspect of the invention, such two threshold value determining methods can be selectively used according to the width of the character spacing of the processed document.

【００２１】ワープロ文書等では、句読点や特定の記号
と、後続文字との間隔が、通常の文字の間隔より相当に
広いことがあるため、例えば請求項１乃至３記載の発明
の方法によって適応的に決定した、あるいは手動で設定
した文字矩形統合のための閾値が多少ずれている場合
に、通常の文字と同一行に統合されるべき句読点や特定
記号が統合されずに分離してしまう虞がある。請求項４
記載の発明によれば、句読点等と次の文字との統合のた
めの閾値を増大させることによって、句読点等の分離を
防止することができる。In a word processing document or the like, an interval between a punctuation mark or a specific symbol and a succeeding character may be considerably wider than an interval between ordinary characters. Therefore, for example, the method according to the present invention according to claims 1 to 3 is adaptive. If the thresholds for character rectangle integration that have been set to or that are set manually are slightly deviated, punctuation marks and specific symbols that should be integrated on the same line as normal characters may be separated without being integrated. is there. Claim 4
According to the described invention, the punctuation mark or the like can be prevented from being separated by increasing the threshold value for integrating the punctuation mark or the like and the next character.

【００２２】縦書き文書では、本文の文字行の端に、横
書きのヘッダやフッタの文字列が接近している場合があ
る。請求項５記載の発明によれば、このようなヘッダや
フッタの文字と本文行の文字との誤統合を防止すること
ができる。また、請求項６記載の発明によれば、ヘッダ
やフッタの文字列を本文文字領域とは別の文字領域
（行）として統合することができる。In a vertically written document, a horizontally written header or footer character string may be close to the end of a character line. According to the invention described in claim 5, it is possible to prevent such erroneous integration of the characters of the header or footer and the characters of the body line. Further, according to the invention of claim 6, the character strings of the header and the footer can be integrated as a character area (line) different from the body character area.

【００２３】さて、写真等のハーフトーン原稿や細かな
線図形で構成される一塊の図等を矩形分類で完全に文字
以外として分類できるほど、矩形分類処理の精度が高く
ない場合が多い。というのも、文字サイズ程度の大きさ
の矩形の中には、文字認識をしないと、文字であるかそ
うでないか分からないものが多く含まれているからであ
る。矩形分類で失敗する割合が多いと、ノイズやハーフ
トーン等で不要な小領域が文字領域として発生する。し
かし、文字認識の結果に基づいて矩形を厳密に分類する
のは、処理速度の点で得策でない。In many cases, the accuracy of the rectangle classification process is not high enough to classify a halftone original such as a photograph or a block of figures composed of fine line figures into completely non-characters by the rectangle classification. This is because a rectangle of about the character size contains many characters that cannot be recognized as a character unless character recognition is performed. If there is a high rate of failure in rectangle classification, unnecessary small areas will occur as character areas due to noise, halftone, and the like. However, strictly classifying the rectangles based on the result of character recognition is not good in terms of processing speed.

【００２４】請求項８乃至１２記載の発明は、矩形の大
きさからは文字の矩形か図その他の矩形か判別し難い場
合にも、周囲にある領域の情報から判断を行なうことが
できることに着目し、文字矩形の統合された行について
分類を行なうことにより、本文文字領域に他の領域が統
合されることを防止し、またハーフトーンの写真領域等
で不要な小領域の発生を防止する。According to the eighth to twelfth aspects of the present invention, even if it is difficult to determine whether it is a character rectangle or a figure or other rectangle from the size of the rectangle, it is possible to make a determination from the information of the surrounding area. However, by classifying the lines in which the character rectangles are integrated, it is possible to prevent other areas from being integrated into the text area of the body, and to prevent generation of unnecessary small areas such as a halftone photo area.

【００２５】請求項８の発明では、文字矩形を行に統合
した後で、行を本文文字行とそれ以外の行とに分類す
る。図９の発明では、文字、罫線以外の矩形との位置関
係に着目して、本文文字行以外の行をさらに分類する。
このような分類は、行の大きさや、次の行との行間距
離、他の文字矩形との距離等の情報から判定するので、
その処理自体は高速に行なうことが可能であるととも
に、矩形単独では判定が難しい文字矩形を判別できる割
合が増し、また行方向の異なる文字矩形も判別できる割
合が高くなるため、より的確な本文文字領域の抽出が可
能になる。そして、請求項１０または請求項１１の発明
によれば、本文文字行以外の行に分類された行を、同じ
属性のもの毎に統合しまたは消去するので、余分な小領
域の発生を防止できる。さらに、請求項１２の発明によ
れば、写真領域の内部に文字列があっても、その文字列
によって写真領域がばらばらにならない。According to the invention of claim 8, after the character rectangle is integrated into a line, the line is classified into a body character line and other lines. In the invention of FIG. 9, the lines other than the text line are further classified by paying attention to the positional relationship with the rectangles other than the characters and the ruled lines.
Such a classification is determined from information such as the size of a line, the distance between lines, the distance from other character rectangles, etc.
The process itself can be performed at high speed, and the rate at which character rectangles that are difficult to determine by a rectangle alone can be identified increases, and the rate at which character rectangles in different line directions can also be identified increases. It becomes possible to extract a region. Further, according to the invention of claim 10 or claim 11, the lines classified into lines other than the text character line are integrated or erased for each of the same attributes, so that an extra small area can be prevented from occurring. . According to the twelfth aspect of the invention, even if a character string exists inside the photo area, the character area does not cause the photo area to fall apart.

【００２６】[0026]

【実施例】図１に本発明を実施したシステム例の概略構
成を示す。図１において、スキャナ１によって読み取ら
れた文書画像のデータは、画像入力部２によって画像メ
モリ３に格納される。画像メモリ３内の文書画像データ
は、領域識別部６の画像圧縮部７に読み出され、ここで
所定の圧縮率のＯＲ圧縮を施されてから画像メモリ３に
戻される。画像メモリ３内の文書画像データは、画像表
示部４によってディスプレー５に表示させることができ
る。なお、画像入力部７より文書画像データを画像圧縮
部７へ直接入力し、ここで圧縮した画像データを画像メ
モリ３に格納することも可能である。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS FIG. 1 shows a schematic configuration of an example system in which the present invention is implemented. In FIG. 1, the data of the document image read by the scanner 1 is stored in the image memory 3 by the image input unit 2. The document image data in the image memory 3 is read by the image compression section 7 of the area identification section 6, where OR compression is performed at a predetermined compression rate, and then the document image data is returned to the image memory 3. The document image data in the image memory 3 can be displayed on the display 5 by the image display unit 4. It is also possible to directly input the document image data from the image input unit 7 to the image compression unit 7 and store the image data compressed here in the image memory 3.

【００２７】領域識別部６は、前記の画像圧縮部７と、
外接矩形検出部８、矩形分類部９、文字領域統合部１０
からなる。The area identification unit 6 includes the image compression unit 7 and
Circumscribing rectangle detection unit 8, rectangle classification unit 9, character region integration unit 10
Consists of.

【００２８】外接矩形検出部８は、画像メモリ３内の圧
縮された文書画像データより、黒画素の繋りを調べて黒
画素連結成分に外接する矩形を抽出する部分である。こ
こで抽出された矩形には、文字（１文字または２文字以
上の文字列）の矩形のほかに罫線、ノイズ、表、グラ
フ、図その他、様々な属性の矩形が含まれる。矩形分類
部９は、このような様々な矩形を、矩形の形状情報及び
内部データから分類する部分である。この矩形分類の処
理方法は様々なものが考えられるが、その一例について
後に詳述する。The circumscribing rectangle detecting section 8 is a section for examining the connection of black pixels from the compressed document image data in the image memory 3 and extracting a rectangle circumscribing the black pixel connected component. The rectangles extracted here include rectangles of various attributes such as ruled lines, noises, tables, graphs, figures, in addition to rectangles of characters (a character string of one or more characters). The rectangle classification unit 9 is a unit that classifies such various rectangles from the shape information of the rectangles and the internal data. There are various possible methods for processing this rectangular classification, and an example thereof will be described in detail later.

【００２９】文字領域統合部１０は、矩形分類部９で文
字矩形に分類された矩形に対して統合処理を施して文字
領域に統合するもので、これが本発明の要旨に直接関係
する部分である。The character area integration unit 10 integrates the rectangles classified by the rectangle classification unit 9 into the character rectangles and integrates them into the character area. This is a part directly related to the gist of the present invention. .

【００３０】本実施例においては、領域識別部６による
領域識別の結果は、文字認識部１１による文字認識処理
に利用される。つまり、領域識別部６は光学的文字認識
の前処理部として利用される。しかし、これは一例に過
ぎない。In the present embodiment, the result of the area identification by the area identification unit 6 is used for the character recognition processing by the character recognition unit 11. That is, the area identification unit 6 is used as a preprocessing unit for optical character recognition. But this is just one example.

【００３１】図２に文字領域統合部１０の内部構成の一
例を示す。図２において、パラメータ設定部１０Ａは文
字矩形間統合のための閾値となるパラメータを設定する
部分であり、その処理内容については後に図３乃至図５
を参照して説明する。文字間統合部１０Ｂは、文字矩形
を行方向に統合する処理を行なう部分であり、その処理
内容については後に図６乃至図８を参照して説明する。
領域統合部１０Ｃは、文字間統合部１０Ｂの処理によっ
て統合された行を、行と垂直な方向へ統合する部分であ
り、その処理内容については図９に関連し後述する。FIG. 2 shows an example of the internal structure of the character area integrating unit 10. In FIG. 2, the parameter setting unit 10A is a unit that sets a parameter serving as a threshold for integration between character rectangles, and the processing content thereof will be described later with reference to FIGS.
Will be described with reference to. The inter-character integration unit 10B is a part that performs a process of integrating character rectangles in the row direction, and the content of the process will be described later with reference to FIGS. 6 to 8.
The region integration unit 10C is a unit that integrates the lines integrated by the process of the inter-character integration unit 10B in the direction perpendicular to the lines, and the processing content thereof will be described later with reference to FIG.

【００３２】パラメータ設定部１０Ａのパラメータ設定
処理の例１一般的な文書は行方向に文字が詰まっているので、ＯＲ
圧縮後の文書画像上では隣合った文字が融合されて行方
向に長い矩形となる。これに対して、ワープロ文書では
文字間が離れていることが多く、ＯＲ圧縮後においても
文字間の融合は起きにくい。Example 1 of Parameter Setting Process of Parameter Setting Unit 10A Since a general document has characters clogged in the line direction, OR
On the compressed document image, adjacent characters are fused to form a long rectangle in the line direction. On the other hand, in word processing documents, the characters are often separated from each other, and the characters are not easily fused even after the OR compression.

【００３３】このように文字間の融合度合は、原文書画
像上の文字間距離に依存する関係がある、つまり、ＯＲ
圧縮による文字の行方向の融合度合から文字間距離を推
定可能である。したがって、予め実験によって、文字間
融合度合に対応した最適な文字間統合パラメータ（閾
値）を求めておき、これをテーブル形式あるいは一般式
として設定しておくことにより、測定した文字間融合度
合に基づき最適なパラメータを自動的に設定することが
できる。これが本例における処理原理である。As described above, the degree of fusion between characters depends on the distance between characters on the original document image, that is, OR.
The inter-character distance can be estimated from the degree of fusion of characters in the line direction due to compression. Therefore, the optimum intercharacter integration parameter (threshold value) corresponding to the interfusion degree of characters is obtained in advance by experiment, and this is set as a table format or a general formula, and based on the measured interfusion degree of characters. Optimal parameters can be set automatically. This is the processing principle in this example.

【００３４】図３に示した概略フローチャートに沿っ
て、本例での処理内容を具体的に説明する。ステップ３
０Ａでは、横書き文書の場合として、圧縮画像より抽出
された矩形中の文字矩形に分類された矩形に関して、矩
形高さ（行方向に対して垂直な方向の矩形サイズ）のヒ
ストグラムをとり、最も頻度が大きい矩形高さを標準文
字サイズとする。次に、矩形高さが標準文字サイズ以下
の文字矩形について矩形幅（行方向の矩形サイズ）を調
べる。そして、標準文字サイズ以下の矩形高さを持つ文
字矩形の総数と、矩形幅が標準文字サイズ以下の矩形
（矩形高さは標準文字サイズ以下）の総数との比（文字
矩形の縦横比）を求める。The processing contents of this example will be described in detail with reference to the schematic flow chart shown in FIG. Step 3
In 0A, in the case of a horizontal writing document, a rectangle height (rectangle size in the direction perpendicular to the line direction) histogram is taken for the rectangles classified into the character rectangles in the rectangles extracted from the compressed image, and the frequency is the highest. The size of the rectangle with a large is the standard character size. Next, the rectangular width (rectangular size in the row direction) of the character rectangle whose height is equal to or smaller than the standard character size is checked. Then, calculate the ratio (aspect ratio of the character rectangle) of the total number of character rectangles with a rectangle height less than the standard character size and the total number of rectangles with a rectangle width less than the standard character size (rectangle height less than the standard character size). Ask.

【００３５】この比率は、おおよその文字間融合度合を
示している。前述のように新聞等の文字間が詰まった文
書の場合には、この比率はかなり小さな値となるが、ワ
ープロ文書のように文字間が空いている文書では、この
比率は１に近い値となる。This ratio shows the approximate degree of fusion between characters. As described above, in the case of a document in which characters are closely packed, such as a newspaper, this ratio has a fairly small value. However, in a document such as a word processing document in which characters are not sufficiently spaced, this ratio is close to 1. Become.

【００３６】次のステップ３０Ｂにおいて、前ステップ
で求められた比率を、前述の最適パラメータ（文字間統
合閾値）のテーブルまたは一般式に当てはめることによ
り、最適パラメータを決定する。In the next step 30B, the optimum parameter is determined by applying the ratio obtained in the previous step to the above-mentioned table of optimum parameters (character-to-character integrated threshold) or a general formula.

【００３７】文字間統合パラメータとして固定値を用い
る方法でも、処理対象文書を特化する場合には不都合は
ない。しかし、様々な文書を処理しようとする場合に
は、文書にあわせて最適パラメータを手動で設定し直す
必要があった。本例及び後記各例によれば、そのような
煩わしい手動設定を排除し、様々な文書に対して最適な
パラメータを設定し、適切な文字間統合を行なうことが
可能になる。Even if the fixed value is used as the intercharacter integration parameter, there is no inconvenience when the document to be processed is specialized. However, when processing various documents, it was necessary to manually set the optimum parameters according to the documents. According to this example and each example described later, it is possible to eliminate such a troublesome manual setting, set optimum parameters for various documents, and perform appropriate character-to-character integration.

【００３８】パラメータ設定部１０Ａのパラメータ設定
処理の例２図４に示した概略フローチャートに沿って、本例での処
理内容を説明する。ここでは、これまでの説明と同様
に、文書画像上の行方向が水平であるとする。Example 2 of Parameter Setting Process of Parameter Setting Section 10A The process contents of this example will be described with reference to the schematic flow chart shown in FIG. Here, it is assumed that the line direction on the document image is horizontal, as in the above description.

【００３９】ステップ４０Ａにおいて、圧縮画像より抽
出された矩形中で文字に分類された矩形に関し、行方向
（水平方向）に隣合った二つの文字矩形について、水平
距離（文字間隔）条件以外に統合するに不都合がないか
調べる。例えば、一方の文字矩形と他方の文字矩形の矩
形高さの差が所定の限度を越えている場合（例えば、縦
書き文書において、一方の文字矩形がある本文行上の文
字で、他方が本文に対するタイトル文字であるような場
合）は、両文字矩形を統合すべきでないと判断し、その
限度を越えない場合には、統合し得ると判断する。これ
を全ての隣合う文字矩形について行なう。In step 40A, regarding the rectangles classified into characters in the rectangles extracted from the compressed image, two character rectangles adjacent to each other in the row direction (horizontal direction) are integrated except for the horizontal distance (character spacing) condition. Check if there is any inconvenience. For example, when the difference between the rectangle heights of one character rectangle and the other character rectangle exceeds a predetermined limit (for example, in a vertical writing document, one character rectangle is a character on the body line and the other is the body text). (If it is a title character for), it is determined that both character rectangles should not be integrated, and if they do not exceed the limit, it is determined that they can be integrated. Do this for all adjacent character rectangles.

【００４０】ステップ４０Ｂにおいて、前ステップ４０
Ａで統合し得ると判断された隣合う文字矩形間の水平距
離を測定し、その水平距離のヒストグラムを求める。In step 40B, the previous step 40
The horizontal distance between adjacent character rectangles determined to be integrated in A is measured, and a histogram of the horizontal distance is obtained.

【００４１】ステップ４０Ｃにおいて、予め実験的に用
意された文字間隔（水平距離）と最適な文字間統合パラ
メータとの関係を表わすテーブルまたは一般式に、前ス
テップ４０Ｂで得られた水平距離ヒストグラムで頻度が
最高の水平距離値をあてはめることによって、最適な文
字間統合パラメータを決定する。In step 40C, the horizontal distance histogram obtained in the previous step 40B is used as a frequency in the table or the general formula representing the relation between the character spacing (horizontal distance) and the optimum intercharacter integrated parameter prepared in advance. Determines the optimal intercharacter integration parameter by fitting the highest horizontal distance value.

【００４２】本例の方法は、前記パラメータ設定処理例
１では最適化が難しい、文字間隔が広いような特殊な場
合に対するパラメータ最適化能力に優れている。The method of this example is excellent in the parameter optimization capability for a special case where the character setting process example 1 is difficult to optimize and the character spacing is wide.

【００４３】パラメータ設定部１０Ａのパラメータ設定
処理の例３図５に示した概略フローチャートに沿って、本例での処
理内容を説明する。なお、ここでは横書き文書を想定す
る。Example 3 of Parameter Setting Process of Parameter Setting Unit 10A The process contents of this example will be described with reference to the schematic flow chart shown in FIG. Note that a horizontal document is assumed here.

【００４４】ステップ５０Ａは、図３のステップ３０Ａ
と同じ内容の処理を行なうステップである。ステップ５
０Ｂは、前ステップ５０Ａで求められた比率（文字間融
合度合）によって次の処理方法を選択する。例えば、比
率が１に近い場合（文字間隔が広く、図３に示した処理
方法よりも図４に示した処理方法が適する場合）に、次
の処理としてステップ５０Ｄを選び、そうでない場合に
ステップ５０Ｃを選択する。ステップ５０Ｃは図３のス
テップ３０Ｂと同内容の処理ステップであり、ステップ
５０Ｄ〜５０Ｆは図４のステップ４０Ａ〜４０Ｃと同内
容の処理ステップである。Step 50A corresponds to step 30A in FIG.
This is a step for performing the same processing as. Step 5
0B selects the next processing method according to the ratio (degree of inter-character fusion) obtained in the previous step 50A. For example, when the ratio is close to 1 (when the character spacing is wide and the processing method shown in FIG. 4 is more suitable than the processing method shown in FIG. 3), step 50D is selected as the next processing, and if not, step 50D is selected. Select 50C. Step 50C is the same processing step as step 30B in FIG. 3, and steps 50D to 50F are the same processing steps as steps 40A to 40C in FIG.

【００４５】文字領域統合または領域種類判別のための
処理に標準文字サイズを利用する場合（例えば前記特願
平４−２６７３１３号特許出願に係る装置の場合）に
は、図３に関連して説明した処理方法のほうが処理時間
を短縮できることがある。すなわち、本例は、図３の処
理方法で精度面に問題がない場合にはステップ５０Ｃの
処理を選択して処理時間短縮の利益を期待できるように
し、文字間隔が広く精度の面で図４の処理方法が好まし
い場合にはステップ５０Ｄ以下の処理に切り替えてい
る。When the standard character size is used for the processing for character area integration or area type discrimination (for example, in the case of the device according to the above-mentioned Japanese Patent Application No. 4-267313), description will be given with reference to FIG. The processing method may reduce the processing time. That is, in this example, if there is no problem in terms of accuracy in the processing method of FIG. 3, the processing of step 50C is selected so that the benefit of shortening the processing time can be expected, and the character spacing is wide and the accuracy of FIG. If the above processing method is preferable, the processing is switched to the processing after step 50D.

【００４６】文字間統合部１０Ｂによる文字間統合処理
の例１ある文字矩形と他の文字矩形とを同じ行として統合する
か否かの判定には、行方向の文字矩形間距離、及び、行
（仮想行）に対し垂直方向の文字矩形位置を考慮するの
が一般的である。ここで、行方向の文字矩形間距離に関
する閾値（統合パラメータ）が固定値で最適化されてい
ない場合は勿論のこと、それが前述のように自動的に最
適化される場合でも多少のくるいがあったときに、ワー
プロ文書等では、本来一行に統合されるべき句読点や、
それに類する特定の記号の統合を失敗する虞がある。こ
れは、ワープロ文書等では、句読点あるいは、それに類
する特定の記号（句読点等と総称する）と、その次の文
字との間隔が、普通の文字の間隔に比べ広過ぎることが
あるからである。Example 1 of character integration processing by the character integration unit 10B: To determine whether or not to integrate a character rectangle and another character rectangle into the same line, the distance between the character rectangles in the line direction and the line It is general to consider the character rectangle position in the vertical direction with respect to (virtual line). Here, not only when the threshold value (integrated parameter) related to the distance between the character rectangles in the line direction is not optimized with a fixed value, but also when it is automatically optimized as described above, some wraparound occurs. When there is a punctuation mark that should be integrated into a single line in a word processing document,
There is a risk of failing to integrate certain similar symbols. This is because, in a word processing document or the like, a space between a punctuation mark or a specific symbol (generally referred to as a punctuation mark) similar to the punctuation mark and the next character may be too wide as compared with an ordinary character space.

【００４７】本例によれば、かかる句読点等の統合の失
敗を回避することができる。以下、図６のフローチャー
トに沿って、本例の処理内容について説明する。According to this example, it is possible to avoid the failure of integration of such punctuation marks. The processing contents of this example will be described below with reference to the flowchart of FIG.

【００４８】ステップ６０Ａにおいて、統合すべきか注
目している文字矩形が句読点等であるか判定する。この
判定には、注目した文字矩形の形状、大きさ、仮想行上
の位置関係等の情報を用いることができる。例えば、注
目した文字矩形が、他の文字矩形に比べ極端に小さい、
あるいは細長いときに、句読点等と判定する。In step 60A, it is determined whether the character rectangle to be integrated or focused is a punctuation mark or the like. Information such as the shape, size, and positional relationship on the virtual line of the focused character rectangle can be used for this determination. For example, the focused character rectangle is extremely smaller than other character rectangles,
Alternatively, when it is elongated, it is determined to be a punctuation mark or the like.

【００４９】ステップ６０Ａで、注目した文字矩形が句
読点等でないと判定した場合には、パラメータ設定部１
０Ａで設定されたパラメータをそのままステップ６０Ｄ
において行方向の文字矩形間距離の閾値として用いる。
しかし、句読点等であると判定した場合には、ステップ
６０Ｃにおいて、パラメータ設定部１０Ａで設定された
パラメータを、例えば文字サイズ程度だけ増加させ、こ
れをステップ６０Ｄで当該注目矩形に関する行方向の文
字矩形間距離の閾値として用いる。When it is determined in step 60A that the focused character rectangle is not a punctuation mark or the like, the parameter setting unit 1
The parameters set in 0A are used as they are in step 60D.
Is used as a threshold for the distance between the character rectangles in the line direction.
However, if it is determined to be a punctuation mark or the like, in step 60C, the parameter set in the parameter setting unit 10A is increased by, for example, the character size, and this is increased in step 60D to determine the character rectangle in the line direction related to the target rectangle. It is used as a threshold value for the distance.

【００５０】ステップ６０Ｄにおいては、注目している
文字矩形と、その右隣の（行の左端から右端へ向かって
処理するとして）文字矩形との統合条件を調べる。統合
条件が成立しない場合には、注目した文字矩形と、隣接
の文字矩形との統合をしない（ステップ６０Ｆ）。統合
条件が成立する場合には、注目した文字矩形と、隣接の
次文字矩形とを統合する（ステップ６０Ｅ）。In step 60D, the integration condition of the character rectangle of interest and the character rectangle immediately adjacent to it (assuming processing from the left end to the right end of the line) is checked. If the integration condition is not satisfied, the focused character rectangle and the adjacent character rectangle are not integrated (step 60F). When the integration condition is satisfied, the focused character rectangle and the adjacent next character rectangle are integrated (step 60E).

【００５１】このように、注目した文字矩形が句読点等
である場合には、行方向の文字矩形間距離の閾値を増加
させるので、ワープロ文書等で句読点等と次文字との距
離が通常の文字の間隔より広過ぎる場合にも、同一行と
して統合することができる。また、パラメータ設定部１
０Ａにより設定されるパラメータの多少のくるいを許容
できるという利益も期待できる。As described above, when the focused character rectangle is a punctuation mark or the like, the threshold value of the distance between the character rectangles in the line direction is increased. Therefore, the distance between the punctuation mark and the next character is a normal character in a word processing document or the like. If the interval is too wide, it can be merged in the same line. In addition, the parameter setting unit 1
The benefit of being able to tolerate some of the parameters set by 0A can also be expected.

【００５２】文字間統合部１０Ｂによる文字間統合処理
の例２例えば縦書き文書の場合に、本文の行（本文行）の端
と、横書きのフッタ（ページ数等）やヘッダ（タイトル
等）とが接近していることが少なくない。本例によれ
ば、このような接近した本文行文字とフッタやヘッダの
文字との誤統合を防止できる。以下、図７に示すフロー
チャートに沿って処理内容を説明する。Example 2 of character integration processing by the character integration unit 10B For example, in the case of a vertically written document, the end of the line of the text (text line), the footer (page number etc.) of horizontal writing, and the header (title etc.) Are often close to each other. According to the present example, it is possible to prevent such an erroneous integration of the body line characters that are close to each other and the characters of the footer and the header. Hereinafter, the processing content will be described with reference to the flowchart shown in FIG. 7.

【００５３】ステップ７０Ａにおいて、注目した二つの
文字矩形の行方向（本文行方向）の距離に関する統合条
件を満たすか（距離がパラメータ設定部１０Ａにより設
定された閾値以下であるか）調べる。文字間距離の統合
条件を満たさない場合にはステップ７０Ｆに分岐し、当
該文字矩形を統合せず次の文字矩形の処理に進むことに
なる。In step 70A, it is checked whether the integration condition regarding the distance in the line direction (text line direction) of the two focused character rectangles is satisfied (whether the distance is less than or equal to the threshold value set by the parameter setting unit 10A). If the integration condition of the inter-character distance is not satisfied, the process branches to step 70F to proceed to the process of the next character rectangle without integrating the character rectangle.

【００５４】統合条件が成立した場合には、ステップ７
０Ｂにおいて当該文字矩形のサイズ（例えば矩形高さ）
の差が所定値より小さいか調べる。サイズ差が大きい場
合にはステップ７０Ｆに分岐し統合せずに次の文字矩形
の処理に進む。したがって、例えば図８（ａ）に示すよ
うに、縦書き文書上の本文行の文字矩形Ｂに接近して横
書きのページ数の文字列の統合された高さの大きな文字
矩形ａが存在した場合に、その誤統合を防止できる。If the integration condition is satisfied, step 7
The size of the character rectangle in 0B (for example, the height of the rectangle)
Check whether the difference between is less than a predetermined value. If the size difference is large, the process branches to step 70F and the process proceeds to the next character rectangle processing without integration. Therefore, for example, as shown in FIG. 8A, when there is a large character rectangle a in which a character string having the number of horizontally written pages is integrated and is close to the character rectangle B of the text line in the vertically written document. In addition, the wrong integration can be prevented.

【００５５】ステップ７０Ｂで矩形サイズ差が小さいと
判定された場合は、ステップ７０Ｃにおいて、注目して
いる二つの文字矩形中の一方に、行（本文行）と垂直な
方向に近接した文字矩形が存在するか調べる。存在する
場合にはステップ７０Ｆに分岐する。このようなチェッ
クによって、例えば図８（ｂ）に示すように、本文行文
字矩形Ｂに接近して、フッタ（ページ数等）の文字矩形
ｂがあり、この文字矩形ｂの上下にフッタの文字矩形
ａ，ｃが存在するような場合に、文字矩形Ｂと文字矩形
ｂとの誤統合を防止できる。If it is determined in step 70B that the rectangle size difference is small, in step 70C, one of the two character rectangles of interest has a character rectangle adjacent to the line (text line) in the direction perpendicular to the line. Check if it exists. If it exists, the process branches to step 70F. As a result of such a check, as shown in FIG. 8B, for example, there is a character rectangle b of a footer (the number of pages, etc.) near the text line character rectangle B, and the footer characters are above and below this character rectangle b. When rectangles a and c exist, it is possible to prevent erroneous integration of the character rectangle B and the character rectangle b.

【００５６】ステップ７０Ｃで垂直方向に近接した文字
矩形がないと判定した場合には、ステップ７０Ｄにおい
て、一方の文字矩形の縦横比と通常の文字矩形の縦横比
との差が小さいか調べる。縦横比が普通であると判定さ
れた場合には、ステップ７０Ｅにおいて当該文字矩形の
統合が行なわれ、次の文字矩形の処理に進むことにな
る。If it is determined in step 70C that there is no vertically adjacent character rectangle, it is checked in step 70D whether the difference between the aspect ratio of one character rectangle and the normal character rectangle is small. When it is determined that the aspect ratio is normal, the character rectangles are integrated in step 70E, and the process proceeds to the next character rectangle.

【００５７】しかし、縦横比の差が大きい場合には、ス
テップ７０Ｆに分岐する。したがって、例えば図８
（ｃ）に示すように、フッタの小さな文字列が統合され
た細長い文字矩形ａが本文行文字矩形Ｂに接近して存在
する場合、文字矩形ａと本文行文字矩形Ｂとの誤統合を
防止できる。However, if the difference in the aspect ratio is large, the process branches to step 70F. Therefore, for example, in FIG.
As shown in (c), when a narrow character rectangle a in which a character string with a small footer is integrated exists close to the body line character rectangle B, erroneous integration of the character rectangle a and the body line character rectangle B is prevented. it can.

【００５８】領域統合部１０Ｃの処理大部分の文書では、本文行と行方向の異なるフッタやヘ
ッダの文字列は、本文行の文字との距離が大きいため
に、本文行に統合されることはない。また、本文行の文
字とフッタ等の文字との距離が小さい場合にも、図７に
示した文字間統合処理方法によれば、その統合が防止さ
れる。したがって、本文行と行方向の異なるフッタやヘ
ッダ等の文字列は、本文行に統合されずに、短い行とし
て残ることなる。Processing of the area integrating unit 10C In most documents, a footer or a character string of a header having a line direction different from that of the text line is integrated into the text line because the distance between the text line and the characters of the text line is large. Absent. Further, even when the distance between the characters in the body line and the characters such as the footer is small, the integration is prevented by the inter-character integration processing method shown in FIG. Therefore, a character string such as a footer or a header having a line direction different from that of the body line remains as a short line without being integrated into the body line.

【００５９】例えば図９において、ａ〜ｋは文字間統合
によって得られた行であり、その中でａ〜ｉは本文行、
ｊとｋはフッタ等の短い行である。For example, in FIG. 9, a to k are lines obtained by character integration, in which a to i are text lines,
j and k are short lines such as footers.

【００６０】領域統合部１０は、従来と同様の方法によ
って、図９のａ〜ｉのような本文行を文字領域（図９の
破線枠９０）に統合するが、この際に行の形状を調べ、
短い行が発生している場合には、その行より本文行と垂
直の方向に短い行を探索することによって、図９のｊ，
ｋのような短い行を本文領域とは行方向の異なる別の文
字領域（図９の破線枠９１）に統合する。The area integration unit 10 integrates text lines such as a to i in FIG. 9 into a character area (broken line frame 90 in FIG. 9) by a method similar to the conventional method. Look up,
If a short line has occurred, a line shorter than that line in the direction perpendicular to the main text line is searched for, so that j,
A short line such as k is integrated into another character region (broken line frame 91 in FIG. 9) whose line direction is different from the body region.

【００６１】矩形分類部９の処理矩形分類の一例を説明する。まず、矩形の大きさ及び形
状によって、矩形の種類を大雑把に分類する。例えば図
１０の（ａ）に示すように、矩形の高さ（ｈｅｉｇｈ
ｔ）を、標準文字サイズ（ｓｉｚｅ）との大小関係によ
ってクラス１からクラス３に分ける。また、矩形の幅
（ｗｉｄｔｈ）を、標準文字サイズとの大小関係によっ
てクラス１からクラス３に分ける。標準文字サイズは、
予め分かっている場合には、その値を用いればよいし、
予め分かっていない場合には文書画像より抽出された矩
形の高さの分布から適応的に決定するような方法を採用
すればよい。Processing of Rectangle Classification Unit 9 An example of rectangle classification will be described. First, the types of rectangles are roughly classified according to the size and shape of the rectangles. For example, as shown in FIG. 10A, the height of the rectangle (height
t) is classified into class 1 to class 3 according to the size relation with the standard character size (size). Also, the width of the rectangle is divided into class 1 to class 3 according to the size relationship with the standard character size. Standard font size is
If you know in advance, you can use that value,
If it is not known in advance, a method of adaptively determining from the distribution of the heights of the rectangles extracted from the document image may be adopted.

【００６２】このようにして得られた矩形高さクラス及
び矩形幅クラスを、例えば図１１の（ｂ）に示す分類テ
ーブルにあてはめて、矩形分類を決定する。矩形幅がク
ラス１またはクラス２で、矩形高さがクラス１またはク
ラス２ならば、その矩形は文字候補または大文字候補に
分類される。矩形幅がクラス１で、矩形高さがクラス３
ならば垂直罫線候補に分類される。The rectangle height class and rectangle width class thus obtained are applied to, for example, the classification table shown in FIG. 11B to determine the rectangle classification. If the rectangle width is class 1 or class 2 and the rectangle height is class 1 or class 2, the rectangle is classified as a character candidate or a capital letter candidate. Rectangle width is class 1 and rectangle height is class 3
If so, it is classified as a vertical ruled line candidate.

【００６３】表領域候補に分類された矩形については、
矩形領域内をスキャンして水平方向の所定値より長い黒
ランだけを抽出し、その接続したものを囲む外接矩形を
水平方向の架空罫線として抽出し、同様に垂直方向に長
い黒ランだけを抽出し、その接続したものを囲む外接矩
形を垂直方向の架空罫線として抽出する。そして、架空
罫線の数、位置、長さに着目し、当該矩形を＜表＞、＜
囲み枠＞または＜図、グラフその他＞に分類する。For the rectangles classified as the table area candidates,
Scan a rectangular area to extract only black runs longer than a specified horizontal value, and extract the circumscribed rectangle surrounding the connected ones as horizontal fictitious ruled lines. Similarly, extract only black runs that are long in the vertical direction. Then, the circumscribing rectangle surrounding the connected one is extracted as an imaginary ruled line in the vertical direction. Then, paying attention to the number, position and length of the imaginary ruled lines,
Enclosed frame> or <Figure, graph, etc.>.

【００６４】また、水平罫線候補とされた矩形について
は、当該矩形領域をスキャンして水平方向に長い黒ラン
だけを抽出し、抽出した長い黒ランの接続したものを囲
む外接矩形を水平方向の架空罫線として抽出し、架空罫
線の数、位置、長さに着目して、当該矩形を＜文字矩形
＞または文字領域を上下に区切る＜水平セパレータ＞に
分類する。As for the rectangles that are candidates for horizontal ruled lines, the rectangular area is scanned to extract only black runs that are long in the horizontal direction, and a circumscribed rectangle that surrounds the connected long black runs is extracted in the horizontal direction. It is extracted as an imaginary ruled line, and the rectangle is classified into a <character rectangle> or a <horizontal separator> that vertically divides a character region, focusing on the number, position, and length of the imaginary ruled lines.

【００６５】垂直罫線候補に分類された矩形について
は、当該矩形領域をスキャンして垂直方向に長い黒ラン
だけを抽出し、抽出した長い黒ランの接続したものを囲
む外接矩形を垂直方向の架空罫線として抽出し、架空罫
線の数、位置、長さに着目し、当該矩形を＜文字矩形
＞、文字領域を左右に区切る＜垂直セパレータ＞、また
は＜図、グラフその他＞に分類する。For rectangles classified as vertical ruled line candidates, the rectangular area is scanned to extract only black runs that are long in the vertical direction, and a circumscribing rectangle that encloses the connected long black runs is fictitious in the vertical direction. Extracted as ruled lines, focusing on the number, position, and length of the fictitious ruled lines, the rectangles are classified into <character rectangles>, <vertical separators> that divide the character region into left and right, or <figure, graph, etc.>.

【００６６】次に、本発明の他の実施例について説明す
る。本実施例のためのシステム構成は図１に示した前記
実施例のシステム構成と同様でよい。処理内容に関して
も、文字領域統合部１０の処理以外は前記実施例と同様
でよい。以下、本実施例における文字領域統合部１０の
処理内容について説明する。その処理の概略フローを図
１１に示す。Next, another embodiment of the present invention will be described. The system configuration for this embodiment may be the same as the system configuration of the above embodiment shown in FIG. The processing content may be the same as that of the above embodiment except the processing of the character area integrating unit 10. Hereinafter, the processing contents of the character area integration unit 10 in this embodiment will be described. A schematic flow of the processing is shown in FIG.

【００６７】矩形分類部９によって文字に分類された矩
形について、本文行の方向へ距離の小さいものを統合し
て行を抽出する（ステップ１０１）。統合する行の方向
は予め指定してもよいし、自動的に判別させてもよい。
また、この統合の際の矩形間距離の閾値は、予め指定し
てもよいし、前記実施例に述べた方法によって適応的に
設定してもよい。ここで行として抽出される矩形は、矩
形間距離が小さいものだけであって、特殊な処理を施す
必要はない。むしろ、距離が大きな矩形まで行に統合さ
れると悪影響が生じる心配がある。With respect to the rectangles classified into characters by the rectangle classification unit 9, those having a small distance in the direction of the text line are integrated to extract the line (step 101). The direction of the lines to be integrated may be designated in advance or may be automatically determined.
Further, the threshold value of the inter-rectangle distance at the time of this integration may be designated in advance, or may be adaptively set by the method described in the above embodiment. The rectangles extracted as rows here are only those with a small distance between rectangles, and no special processing is required. Rather, there is a risk that adverse effects will occur if rows with a large distance are integrated into a row.

【００６８】この文字矩形統合により抽出された行の中
に、幅の小さいものが生じることがある。このような行
は、単独の文字であったり、行方向の異なる文字（例え
ば注目している行方向と垂直方向に近接している文字矩
形）であったり、あるいは文字以外の図、写真の一部
（例えば、キャプション、グラフの座標値等の文字等）
である可能性が高い。The lines extracted by this character rectangle integration may have a small width. Such a line may be a single character, a character in a different line direction (for example, a character rectangle that is vertically adjacent to the line direction of interest), or a non-character figure or photograph. Part (for example, captions, characters such as graph coordinate values, etc.)
Is likely to be.

【００６９】そこで、本実施例においては、文字矩形の
統合により得られた行の中で幅の小さい行を、”本文文
字行以外の行”に分類し、他の幅の大きな行つまり”本
文文字行”とは別の属性を持たせる（ステップ１０
２）。この分類をより一般的に述べるならば、文字矩形
の本文行方向への統合により得られた行を、その大き
さ、次の行との行間距離、文字以外に分類された矩形と
の距離等の情報から、”本文文字行”と、”本文文字行
以外の行”とに分類するということである。Therefore, in the present embodiment, lines having a small width among the lines obtained by the integration of the character rectangles are classified into "lines other than the body character line", and other large lines, that is, "body lines". It has an attribute different from "character line" (step 10
2). To describe this classification in more general terms, the size of the line obtained by integrating the character rectangles in the text line direction, the line distance to the next line, the distance to the rectangle classified as other than the character, etc. It means that the information is classified into "body text line" and "line other than text line".

【００７０】矩形分類部９により文字でも罫線でもない
矩形に分類された矩形（”図その他矩形”と呼ぶ）の周
辺または内部に（つまり重なって）、前ステップで分類
された”本文文字行以外の行”が存在するか調べる（ス
テップ１０３）。Except for the "text line" classified in the previous step, around or inside (that is, overlapped with) a rectangle (referred to as "figure or other rectangle") classified into a rectangle that is neither a character nor a ruled line by the rectangle classification unit 9. Is present (step 103).

【００７１】”図その他矩形”の周辺または内部に存在
する”本文文字行以外の行”について、その周辺に、本
文文字行方向に対し垂直の方向に統合し得る行を探索
し、そのような統合できる行が見つかれば、当該”本文
文字行以外の行”を”垂直行”に最終分類し（ステップ
１０４）、探索した行と統合し、さらに距離の近い”垂
直行”同士を統合する（ステップ１０５）。Regarding the "lines other than the text line" existing around or inside the "figure or other rectangle", a line that can be integrated in the direction perpendicular to the text line direction is searched for around the line, and such a line is searched. If a line that can be integrated is found, the "line other than the text line" is finally classified into "vertical line" (step 104), integrated with the searched line, and "vertical lines" having a closer distance are integrated ( Step 105).

【００７２】”図その他矩形”の周辺または内部にあ
る”本文文字行以外の行”であっても、垂直方向に統合
できる行がみつからないものは、”図の行”に最終分類
し（ステップ１０４）、距離の近い”図の行”と統合す
るか、あるいは消去する（ステップ１０５）。Even if there are "lines other than the text line" located around or inside the "figure or other rectangle" and no line that can be integrated in the vertical direction is found, they are finally classified into "figure lines" (step 104), or merge with a close "row of the figure" or delete (step 105).

【００７３】ステップ１０２で”本文文字行以外の行”
に分類された行の中で、”図その他矩形”の周辺または
内部に存在しないものは、”本文文字行以外の行”に最
終分類され（ステップ１０４）、接近した同じ属性の行
同士で統合する（ステップ１０５）。In step 102, "lines other than text line"
Of the lines classified into, those that do not exist around or inside the "figure or other rectangle" are finally classified into "lines other than the text line" (step 104), and are merged between lines that have the same attribute and are close to each other. (Step 105).

【００７４】次に、”本文文字行”について、接近した
もの同士を統合して、本文領域を生成する（ステップ１
０６）。Next, with respect to the "text line", those close to each other are integrated to generate a text area (step 1).
06).

【００７５】なお、本実施例の一変形例によれば、矩形
分類部９において”図その他矩形”をさらに”図の矩
形”と”写真の矩形”とに細分類する。そして、ステッ
プ１０５において、”写真の矩形”の内部にある”本文
文字行以外の行”を消去し、あるいは写真の属性に変更
する。ハーフトーンの写真の領域は、線図形に比べて小
矩形の塊に分類されることが多いので、この変形例のよ
うにすれば、ハーフトーンの写真領域がばらばらになる
ことを防ぐうえで有効である。また、線図形領域内の文
字列を消去せず、文字列として抽出できる。According to a modification of the present embodiment, the rectangle classification unit 9 further subdivides the "drawing rectangle" into the "drawing rectangle" and the "photographic rectangle". Then, in step 105, the "line other than the text line" in the "photograph rectangle" is erased or changed to the photo attribute. Since halftone photo areas are often classified into small rectangular blocks compared to line figures, this modification is effective in preventing halftone photo areas from falling apart. Is. Further, the character string in the line figure area can be extracted as a character string without being erased.

【００７６】[0076]

【発明の効果】以上の説明から明らかなように、請求項
１乃至７の発明によれば、人手を煩わすことなく、文字
間隔の異なる様々な文書等の領域分割を行なうことがで
き、また、ワープロ文書等の句読点やそれに類する特定
記号の行統合の失敗や、縦書き文書のヘッダやフッタの
ような文字列と本文行の文字列との誤統合を防止でき、
さらに、ヘッダやフッタのような文字列を本文領域と別
の文字領域として統合することができる等、多様な文書
等に対する高精度の領域分割が可能になる。また、請求
項８乃至１２の発明によれば、本文文字行方向へ文字矩
形を統合した行について分類を行ない、本文文字行とそ
れ以外の行に分類し、本文文字行以外の行をさらに分類
することによって、より正確な本文文字領域の抽出が可
能になり、また、矩形単位の矩形分類の失敗の割合が多
い場合にも、文字認識の結果に基づいて矩形を厳密に分
類し直すような非効率的な処理を行なわずに実質的に矩
形の分類精度を向上させ、不要な文字領域の発生を防止
し、さらに写真領域の無用な分割を防止できる。As is apparent from the above description, according to the inventions of claims 1 to 7, it is possible to perform area division of various documents having different character intervals without the need for manpower. It is possible to prevent line integration of punctuation marks and similar symbols in word processing documents, and erroneous integration of character strings such as headers and footers of vertical writing documents with character strings of text lines.
Further, a character string such as a header or a footer can be integrated as a character area different from the body area, which enables highly accurate area division for various documents. Further, according to the inventions of claims 8 to 12, the lines in which the character rectangles are integrated in the body character line direction are classified, and the body character lines and other lines are classified, and the lines other than the body character lines are further classified. By doing so, it is possible to extract the body text area more accurately, and even if there is a high rate of failure in rectangle classification for each rectangle, the rectangles can be reclassified strictly based on the result of character recognition. It is possible to substantially improve the classification accuracy of rectangles without performing inefficient processing, prevent the generation of unnecessary character areas, and prevent unnecessary division of the photo area.

[Brief description of drawings]

【図１】本発明の一実施例に係るシステムの概略構成を
示す。FIG. 1 shows a schematic configuration of a system according to an embodiment of the present invention.

【図２】文字領域統合部の構成例を示す。FIG. 2 shows a configuration example of a character area integration unit.

【図３】パラメータ設定処理の例１のフローを示す。FIG. 3 shows a flow of a first example of parameter setting processing.

【図４】パラメータ設定処理の例２のフローを示す。FIG. 4 shows a flow of a second example of parameter setting processing.

【図５】パラメータ設定処理の例３のフローを示す。FIG. 5 shows a flow of a third example of parameter setting processing.

【図６】文字間統合処理の例１のフローを示す。FIG. 6 shows a flow of an example 1 of inter-character integration processing.

【図７】文字間統合処理の例２のフローを示す。FIG. 7 shows a flow of an example 2 of inter-character integration processing.

【図８】本文行の文字矩形との統合対象から排除される
文字矩形の例を示す。FIG. 8 shows an example of a character rectangle excluded from the integration target with the character rectangle of the text line.

【図９】領域統合処理の説明図である。FIG. 9 is an explanatory diagram of region integration processing.

【図１０】（ａ）矩形の高さ及び幅のクラス分けの説明
図である。（ｂ）矩形の高さ及び幅による属性分類のテーブルの説
明図である。FIG. 10A is an explanatory diagram of classification of height and width of a rectangle. (B) It is explanatory drawing of the table of the attribute classification by the height and width of a rectangle.

【図１１】本発明の他の実施例における文字領域統合部
の処理のフローを示す。FIG. 11 shows a processing flow of a character area unifying unit in another embodiment of the present invention.

[Explanation of symbols]

１スキャナ２画像入力部３画像メモリ４画像表示部５ディスプレー６領域識別部７画像圧縮部８外接矩形検出部９矩形分類部１０文字領域統合部１０Ａパラメータ設定部１０Ｂ文字間統合部１０Ｃ領域統合部１１文字認識部 1 Scanner 2 Image Input Section 3 Image Memory 4 Image Display Section 5 Display 6 Area Identification Section 7 Image Compression Section 8 Enclosing Rectangle Detection Section 9 Rectangular Classification Section 10 Character Area Integration Section 10A Parameter Setting Section 10B Character Integration Section 10C Area Integration Section 11 Character recognition part

Claims

[Claims]

1. A rectangle circumscribing a black pixel connected component is extracted from an image obtained by compressing an image of a document or the like with a predetermined magnification, the extracted rectangle is classified into a character rectangle and a rectangle other than the character rectangle, and the character rectangle is displayed. In the area division method that integrates in the direction, the degree of fusion between characters due to image compression is estimated by taking the ratio of the height and width of the rectangle of the character, and according to this degree of fusion, in the direction of the line of the rectangle of the character. An area dividing method characterized by setting a threshold value of a distance between rectangles for the integration of the.

2. A rectangle circumscribing a black pixel connected component is extracted from an image obtained by compressing an image of a document or the like at a predetermined magnification, the extracted rectangle is classified into a rectangle of a character and a rectangle other than that, and a rectangle of the character is displayed. In the area dividing method for integrating in the direction, a distribution of distances between rectangles of characters adjacent to each other in the row direction is measured, and based on the distribution, a distance between rectangles for integrating rectangles of the characters in the row direction is calculated. A region dividing method characterized by setting a threshold value.

3. A rectangle circumscribing a black pixel connected component is extracted from an image obtained by compressing an image of a document or the like with a predetermined magnification, the extracted rectangle is classified into a character rectangle and other rectangles, and a character rectangle is displayed. In the area segmentation method that integrates in the direction, the degree of fusion between characters due to image compression is estimated by taking the ratio of the height and width of the character rectangle, and the space between rectangles for integrating the character rectangle in the row direction is estimated. As a method of setting the distance threshold, a method of setting by the fusion degree or a method of setting by the distribution of the distance between the rectangles of the characters that are close to each other in the row direction is selected according to the magnitude of the fusion degree. Characteristic area division method.

4. A rectangle circumscribing a black pixel connected component is extracted from an image obtained by compressing an image of a document or the like with a predetermined magnification, the extracted rectangle is classified into a character rectangle and other rectangles, and a character rectangle is displayed. In the area dividing method of integrating in the direction, during the process of integrating the character rectangle in the line direction, it is determined whether the rectangle of the character of interest is a punctuation mark or a rectangle of a specific symbol similar thereto, and When it is determined that the rectangle is a similar specific symbol, the threshold value of the inter-rectangular distance related to the integration of the focused rectangle and the rectangle of the next character is increased as compared with the case of the integration of the rectangles of the characters other than the punctuation mark or the similar specific symbol. An area dividing method characterized by:

5. A rectangle circumscribing a black pixel connected component is extracted from an image obtained by compressing an image of a document or the like with a predetermined magnification, the extracted rectangle is classified into a rectangle of characters and a rectangle other than that, and a rectangle of characters is displayed. In the area dividing method for integrating in one direction, during integration processing of a character rectangle in the line direction, one of the two rectangles to be integrated and the other rectangle are rectangles of characters in different directions. A region dividing method, characterized in that, when it is determined whether or not it is a rectangle of characters in lines in different directions, integration of the two rectangles is prohibited.

6. If a line having a shorter length in the row direction is present in the lines obtained by integrating the rectangles of the characters in the line direction, the line is more perpendicular to the line direction than the short line. 6. The area dividing method according to claim 5, further comprising the step of searching for short rows in a direction and integrating them.

7. The area dividing method according to claim 5 or 6, wherein a threshold value of an inter-rectangular distance for integrating character rectangles in the row direction is set in the area dividing method according to claim 1, 2 or 3. An area dividing method characterized by determining a threshold value of a distance between rectangles.

8. A rectangle circumscribing a black pixel connected component is extracted from an image obtained by compressing an image of a document or the like with a predetermined magnification, and the extracted rectangle is classified into a character rectangle and other rectangles, and the character rectangle is the text. In the area division method that integrates in the character line direction and integrates the lines obtained by combining the rectangles to cut out the character part area, the line obtained by combining the character rectangles in the body character line direction is the body character line. And a line other than the text line, and by giving a line other than the text line a unique attribute different from the text line, the line other than the text line can be merged when the line is integrated into the character area. A region segmentation method characterized by avoiding the integration of lines and body text lines.

9. The area dividing method according to claim 8,
Lines other than text lines existing inside or around the rectangle classified into rectangles other than characters and ruled lines by the classification of rectangles, but lines that can be integrated around the text line in the direction perpendicular to the text line direction. A region segmentation method characterized by classifying it into those that can be found and those that cannot be found, and having unique attributes for each.

10. The region dividing method according to claim 9, wherein lines having the same attribute are integrated with respect to lines other than the text character line.

11. The area dividing method according to claim 9, wherein a line having an attribute other than a text character line is erased.

12. The area dividing method according to claim 8, wherein a rectangle classified into a rectangle other than a character or a ruled line by the rectangle classification is classified into a figure rectangle and a photograph rectangle, and the inside of the photograph rectangle is divided. A region segmentation method characterized by erasing lines other than a certain text line or adding a photo attribute.