JPH11232386A

JPH11232386A - Picture processing method and recording medium

Info

Publication number: JPH11232386A
Application number: JP10036090A
Authority: JP
Inventors: Hiroko Sugiura; 裕子杉浦
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1998-02-18
Filing date: 1998-02-18
Publication date: 1999-08-27
Anticipated expiration: 2018-02-18
Also published as: JP3734614B2

Abstract

PROBLEM TO BE SOLVED: To improve recognition precision of a ruled line and a character by deciding a thinned picture that is one of the features of an input picture and extracting a thinned area as a pre-processing of recognition processing of the ruled line and the character. SOLUTION: A dotted line element selection part 6 selects a rectangle of a dotted line element size out of rectangles extracted from a binary picture. A combination processing part 8 integrates dotted line elements and holds them in a ruled line memory 9. Dotted line elements that cannot be combined are counted (11) and, when there are more than a specified number of such dotted line elements, it is judged to be a thinned document, marked (12) and is stored in a document kind memory 13.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、罫線の識別精度を
向上させた画像処理方法および記録媒体に関する。[0001] 1. Field of the Invention [0002] The present invention relates to an image processing method and a recording medium having improved ruled line identification accuracy.

【０００２】[0002]

【従来の技術】表を構成する点線罫線を認識する方法と
して、例えば特開平７−２３０５２５号公報に記載され
た罫線認識方法がある。この方法は、所定のしきい値以
下の矩形を点線要素として抽出し、該要素間の間隔が所
定のいしき値以内の要素同士を統合した矩形を点線罫線
として抽出する。また、矩形の大きさおよび矩形間の間
隔のそれぞれの分散値を基に罫線としての妥当性も判断
している。2. Description of the Related Art As a method of recognizing dotted ruled lines constituting a table, there is, for example, a ruled line recognizing method described in JP-A-7-230525. According to this method, a rectangle having a predetermined threshold value or less is extracted as a dotted line element, and a rectangle obtained by integrating elements having an interval between the elements within a predetermined threshold value is extracted as a dotted ruled line. In addition, the validity of the ruled line is determined based on the respective variances of the size of the rectangle and the interval between the rectangles.

【０００３】ところで、罫線や文字を認識処理するとき
に、処理対象となる画像の特性が例えば濃い状態あるい
はかすれた状態にある場合には、認識処理に使用されて
いるアルゴリズムやしきい値では対応できずに、認識精
度が低下することがある。また、画像の特性は、スキャ
ナ等の入力装置によって入力するときの２値化しきし値
を変更することによって、入力画像の濃度などを補正
し、認識アルゴリズムに適応した画像特性を得ることが
可能である。[0003] By the way, when the characteristics of the image to be processed are, for example, in a dark state or a faint state when recognizing a ruled line or a character, the algorithm or threshold value used in the recognition processing is not applicable. Otherwise, recognition accuracy may decrease. In addition, the image characteristics can be corrected by changing the binarization threshold when inputting with an input device such as a scanner, thereby correcting the density of the input image and obtaining image characteristics adapted to a recognition algorithm. It is.

【０００４】[0004]

【発明が解決しようとする課題】上記した公報に記載さ
れた方法では、矩形の抽出処理を所定のしきい値で行う
と、例えば、文字を構成している矩形を、破線の構成要
素として誤って抽出する可能性がある。また、一度入力
された２値データがアルゴリズムに合わない特性である
という理由で、再度画像を入力し直すことは大変に手間
がかかる。In the method described in the above-mentioned publication, when a rectangle extraction process is performed at a predetermined threshold, for example, a rectangle constituting a character is erroneously identified as a broken line component. May be extracted. Also, it is very troublesome to re-input an image because the binary data once input has characteristics that do not match the algorithm.

【０００５】そこで、罫線や文字の認識処理の前処理と
して、処理画像の特性を把握し、その特性情報を用いる
ことができれば、その情報に対応したアルゴリズムやし
きい値などに変更することが可能となり、認識精度を向
上させることが可能となる。[0005] Therefore, if the characteristics of the processed image can be grasped and the characteristic information can be used as preprocessing of the ruled line and character recognition processing, it is possible to change to an algorithm or a threshold value corresponding to the information. And the recognition accuracy can be improved.

【０００６】本発明は上記した考察を基になされたもの
で、本発明の目的は、罫線や文字の認識処理の前処理と
して、入力画像の特性の一つであるかすれ画像を判定
し、またかすれ領域を抽出することにより、罫線や文字
の識別精度を向上させた画像処理方法および記録媒体を
提供することにある。SUMMARY OF THE INVENTION The present invention has been made based on the above considerations, and an object of the present invention is to determine a faint image, which is one of the characteristics of an input image, as preprocessing of a ruled line or character recognition process. It is an object of the present invention to provide an image processing method and a recording medium in which the blurred area is extracted to improve the accuracy of identifying ruled lines and characters.

【０００７】[0007]

【課題を解決するための手段】前記目的を達成するため
に、請求項１記載の発明では、２値化された画像データ
から黒画素連続成分の矩形を抽出し、該抽出された矩形
から点線要素に相当する矩形を抽出し、該抽出された点
線要素について、所定の距離内にある点線要素を結合処
理し、該処理の結果、結合しなかった点線要素を計数
し、該計数値が所定値以上であるとき、前記画像はかす
れ画像であると判定することを特徴としている。In order to achieve the above object, according to the present invention, a rectangle of a black pixel continuous component is extracted from binarized image data, and a dotted line is drawn from the extracted rectangle. A rectangle corresponding to the element is extracted, and for the extracted dotted line element, the dotted line elements within a predetermined distance are combined, and as a result of the processing, the unconnected dotted line elements are counted. When the value is equal to or greater than the value, the image is determined to be a blurred image.

【０００８】請求項２記載の発明では、２値化された画
像データから黒画素連続成分の矩形を抽出し、該抽出さ
れた矩形から点線要素に相当する矩形を抽出し、該抽出
された点線要素について、所定の距離内にある点線要素
を結合処理し、該処理の結果、結合しなかった点線要素
を統合することにより、前記画像から、かすれ領域を抽
出することを特徴としている。According to the second aspect of the present invention, a rectangle of a black pixel continuous component is extracted from the binarized image data, a rectangle corresponding to a dotted line element is extracted from the extracted rectangle, and the extracted dotted line is extracted. The method is characterized in that blurred areas are extracted from the image by combining dotted elements within a predetermined distance with respect to the elements and integrating unconnected dotted elements as a result of the processing.

【０００９】請求項３記載の発明では、２値化された画
像データから黒画素連続成分の矩形を抽出し、該抽出さ
れた矩形から点線要素に相当する矩形を抽出し、該抽出
された点線要素について、所定の距離内にある点線要素
を結合処理し、該処理の結果、結合しなかった点線要素
を計数し、該計数値が所定値以上であるとき、前記画像
はかすれ画像であると判定し、該判定されたかすれ画像
について、前記結合しなかった点線要素を統合すること
により、かすれ領域を抽出することを特徴としている。According to the third aspect of the present invention, a rectangle of a continuous component of black pixels is extracted from the binarized image data, a rectangle corresponding to a dotted line element is extracted from the extracted rectangle, and the extracted dotted line is extracted. For the element, the dotted line elements within a predetermined distance are combined, and as a result of the processing, the uncombined dotted line elements are counted.When the counted value is equal to or more than a predetermined value, the image is a blurred image. It is characterized in that a blur region is extracted by integrating the uncombined dotted line elements with respect to the determined blurred image.

【００１０】請求項４記載の発明では、前記かすれ領域
内に存在している点線を疑似点線として除去することを
特徴としている。The invention according to claim 4 is characterized in that the dotted line existing in the blurred area is removed as a pseudo dotted line.

【００１１】請求項５記載の発明では、前記かすれ画像
のサイズが所定値以上であるとき、前記画像がかすれ画
像であることを警告することを特徴としている。According to a fifth aspect of the present invention, when the size of the blurred image is equal to or larger than a predetermined value, a warning that the image is a blurred image is issued.

【００１２】請求項６記載の発明では、２値化された画
像データから黒画素連続成分の矩形を抽出する機能と、
該抽出された矩形から点線要素に相当する矩形を抽出す
る機能と、該抽出された点線要素について、所定の距離
内にある点線要素を結合処理する機能と、該処理の結
果、結合しなかった点線要素を計数する機能と、該計数
値が所定値以上であるとき、前記画像はかすれ画像であ
ると判定する機能と、前記結合しなかった点線要素を統
合することにより、かすれ領域を抽出する機能と、該か
すれ領域内に存在している点線を疑似点線として除去す
る機能と、前記かすれ画像のサイズが所定値以上である
とき、前記画像がかすれ画像であることを警告する機能
をコンピュータに実現させるためのプログラムを記録し
たコンピュータ読み取り可能な記録媒体であることを特
徴としている。According to the present invention, a function of extracting a rectangle of a continuous component of black pixels from the binarized image data;
A function of extracting a rectangle corresponding to a dotted line element from the extracted rectangle, a function of combining dotted line elements within a predetermined distance with respect to the extracted dotted line element, A function of counting the dotted line elements, a function of determining that the image is a blurred image when the counted value is equal to or more than a predetermined value, and extracting a blurred area by integrating the unconnected dotted line elements. A function, a function of removing a dotted line existing in the blurred area as a pseudo dotted line, and a function of warning that the image is a blurred image when the size of the blurred image is a predetermined value or more to the computer. It is a computer-readable recording medium on which a program for realizing the program is recorded.

【００１３】[0013]

【発明の実施の形態】以下、本発明の一実施例を図面を
用いて具体的に説明する。本発明の各実施例を説明する
前に、まず本発明で使用する用語を定義する。矩形：画像中に、連続している画像、または所定のしき
い値以上連続している画像部分（例えば、２値画像であ
れば連続黒画素部、もしくは連続白画素部）を一塊とし
て、それらが接触包含されるように外接四角形で囲んだ
範囲を矩形と定義する。DESCRIPTION OF THE PREFERRED EMBODIMENTS One embodiment of the present invention will be specifically described below with reference to the drawings. Before describing each embodiment of the present invention, first, terms used in the present invention will be defined. Rectangle: A continuous image or a continuous image portion (for example, a continuous black pixel portion or a continuous white pixel portion in the case of a binary image) which is continuous or more than a predetermined threshold in the image, and Is defined as a rectangle that is enclosed by a circumscribed rectangle so that is included in contact.

【００１４】矩形抽出：矩形の位置座標を抽出すること
を矩形抽出と定義する。単一要素点線：点線要素が１つである点線のことで、つ
まり、結合できなかった点線要素と同義である。点線：点線要素を結合処理したものをいう。Rectangle extraction: Extracting the position coordinates of a rectangle is defined as rectangle extraction. Single element dotted line: A dotted line with one dotted element, that is, the same as a dotted element that could not be combined. Dotted line: This is obtained by combining dotted elements.

【００１５】〈実施例１〉２値入力データを基にかすれ
画像か否かを判断するとき、かすれ画像の特徴として、
罫線や文字がかすれているために不連続となり、このた
め矩形の数が多いことが挙げられる。ところで、点線や
網点を含む画像であっても矩形の数が多くなり、矩形の
数を単純にカウントするだけでは、かすれ画像であるの
か、点線を多く含む、鮮明な画像であるのか判断ができ
ない。そこで、点線要素矩形とかすれによる矩形とを区
別できれば、かすれによる矩形の数がしきい値よりも多
い場合にかすれ画像であると判断できる。<Embodiment 1> When determining whether or not a blurred image is based on the binary input data, the features of the blurred image include:
The ruled lines and characters are blurred, resulting in discontinuity, and thus a large number of rectangles. By the way, even if the image includes dotted lines and halftone dots, the number of rectangles increases, and simply counting the number of rectangles makes it possible to determine whether the image is a blurred image or a clear image including many dotted lines. Can not. Therefore, if the dotted line element rectangle and the blurred rectangle can be distinguished from each other, it can be determined that the image is a blurred image when the number of blurred rectangles is larger than a threshold value.

【００１６】そこで、本実施例１では、かすれ画像にお
ける矩形数が多いという特徴を用いて、かすれによって
不連続になっている矩形を点線要素や網点要素矩形と区
別し、かすれによって発生した矩形数をカウントして、
その計数値を基にかすれ画像であるか否かを判断してい
る。Therefore, in the first embodiment, a rectangle that is discontinuous due to blurring is distinguished from a dotted line element or a halftone dot rectangle by using the feature that the number of rectangles in the blurred image is large. Count the number,
It is determined whether the image is a blurred image based on the count value.

【００１７】図１は、本発明の実施例１の構成を示す。
また、図２は、本発明の実施例１の処理フローチャート
を示す。スキャナなどの２値画像入力部１によって文書
などの画像を入力し、２値イメージメモリ２に格納する
（ステップ１０１）。矩形抽出部３では、２値イメージ
メモリ２から黒画素連続成分の矩形を抽出し、これを矩
形メモリ４に格納する（ステップ１０２）。FIG. 1 shows the configuration of the first embodiment of the present invention.
FIG. 2 shows a processing flowchart of the first embodiment of the present invention. An image such as a document is input by a binary image input unit 1 such as a scanner and stored in a binary image memory 2 (step 101). The rectangle extraction unit 3 extracts a rectangle of a black pixel continuous component from the binary image memory 2 and stores it in the rectangle memory 4 (step 102).

【００１８】点線抽出部５は、点線要素選択部６と点線
要素の結合処理部８からなり、点線要素選択部６は、矩
形メモリ４に保持された矩形の中から点線要素相当サイ
ズの矩形を選択し、点線要素メモリ７に保持する（ステ
ップ１０３）。The dotted line extracting section 5 comprises a dotted line element selecting section 6 and a dotted line element combining section 8. The dotted line element selecting section 6 selects a rectangle having a size corresponding to the dotted line element from the rectangles stored in the rectangular memory 4. The selected item is stored in the dotted line element memory 7 (step 103).

【００１９】続いて、点線要素の結合処理部８では、処
理方向において一定値内に位置している点線要素どうし
を結合処理していく。つまり、横方向に並んでいる点線
要素を結合する場合には、横方向（処理方向）において
一定値内に存在している点線要素どうしを結合してい
く。結合されて抽出された点線は、罫線メモリ９に保持
される（ステップ１０４、１０９）。また、点線要素の
結合処理部８で２つ以上の点線要素と結合しなかった点
線要素も、単一要素点線として罫線メモリ９に保持され
る（ステップ１０４、１０５）。Subsequently, the dotted line element combination processing unit 8 combines the dotted line elements located within a certain value in the processing direction. That is, when combining the dotted line elements arranged in the horizontal direction, the dotted line elements existing within a certain value in the horizontal direction (processing direction) are combined. The combined and extracted dotted lines are stored in the ruled line memory 9 (steps 104 and 109). Also, the dotted line elements that have not been combined with two or more dotted line elements by the dotted line element combination processing unit 8 are stored in the ruled line memory 9 as single element dotted lines (steps 104 and 105).

【００２０】単一要素点線とは、点線要素の結合処理で
要素結合できなかった点線であり、つまり、点線要素選
択部６で点線要素サイズとして選択された矩形である
が、点線要素矩形でないと判断して、かすれによる矩形
であるとみなす。ここで、単一要素点線の数は、かすれ
により発生した矩形の数と仮定することができる。かす
れ原稿判定部１０では以下のようにしてかすれ原稿を判
断する。かすれ原稿判定部１０は、単一要素点線をカウ
ントするカウンタ１１と、かすれ原稿にマーキングを付
与するマーキング部１２と、原稿種類メモリ１３から構
成されている。The single element dotted line is a dotted line which could not be combined by the combining process of the dotted element, that is, a rectangle selected as a dotted element size by the dotted element selecting section 6, but not a dotted element rectangle. Judgment is made, and it is regarded as a blurred rectangle. Here, the number of single element dotted lines can be assumed to be the number of rectangles generated by blurring. The blurred document determination unit 10 determines a blurred document as follows. The blurred document determination unit 10 includes a counter 11 that counts a single element dotted line, a marking unit 12 that gives a marking to a blurred document, and a document type memory 13.

【００２１】カウンタ１１が単一要素点線をカウントし
た結果（ステップ１０６）、単一要素点線が所定数以上
であれば、処理中の画像をかすれ画像と判定して（ステ
ップ１０７、１０８）、マーキング部１２では、入力画
像がかすれ画像であることを示す情報を付けて、原稿種
類メモリ１３に格納する。As a result of counting the single element dotted lines by the counter 11 (step 106), if the number of single element dotted lines is equal to or more than a predetermined number, the image being processed is determined to be a blurred image (steps 107 and 108) and marking is performed. The unit 12 stores the input image in the document type memory 13 with information indicating that the input image is a blurred image.

【００２２】〈実施例２〉上記した実施例１では、かす
れ画像と判定された画像において、画像のどの部分がか
すれているのかの情報は得られない。画像におけるかす
れ領域が分かれば、その後の処理、例えば抽出されたか
すれ領域に対してかすれ補正処理を実行させるなどの処
理に対して有効な情報を提供することができる。<Embodiment 2> In the above-described embodiment 1, information on which part of the image is blurred in the image determined to be a blurred image cannot be obtained. If the blurred area in the image is known, it is possible to provide effective information for subsequent processing, for example, processing for executing blur correction processing on the extracted blurred area.

【００２３】そこで、本実施例では、画像におけるかす
れ領域を抽出することによって、他の処理に有効な情報
を与える実施例である。Therefore, the present embodiment is an embodiment in which a faint region in an image is extracted to give information effective for other processing.

【００２４】図３は、本発明の実施例２の構成を示し、
また、図４は、本発明の実施例２の処理フローチャート
を示す。点線要素の結合処理部２８、罫線メモリ２９ま
での構成は、実施例１と同様である。本実施例では、単
一要素点線の結合部３１と、かすれ領域統合部３２から
なるかすれ領域抽出処理部３０と、かすれ領域メモリ３
３を設けて構成されている。FIG. 3 shows a configuration of a second embodiment of the present invention.
FIG. 4 shows a processing flowchart of the second embodiment of the present invention. The configurations up to the dotted line element combination processing unit 28 and the ruled line memory 29 are the same as in the first embodiment. In the present embodiment, a blurred area extraction processing unit 30 including a blurred area integrating unit 32 and a blurred area memory 3
3 is provided.

【００２５】単一要素点線の結合部３１は、罫線メモリ
２９に保持されている単一要素点線に対して、所定の範
囲内に位置している単一要素点線どうしを結合すること
により、小かすれ領域を形成し（ステップ２０５、２０
６）、さらに、かすれ領域統合部３２で小かすれ領域ど
うしを同様に結合成長させることにより、かすれ領域を
抽出する（ステップ２０７）。The single element dotted line combining unit 31 combines the single element dotted lines held in the ruled line memory 29 with the single element dotted lines located within a predetermined range to thereby reduce the size of the single element dotted line. A blur area is formed (steps 205 and 20).
6) Furthermore, the blurred area is extracted by causing the blurred area integrating unit 32 to grow the small blurred areas in the same manner (step 207).

【００２６】図５、６、７は、かすれ領域生成を説明す
る図である。図５、６、７において、Ａ、Ｂ、Ｃは単一
要素点線であり、２つ以上の点線要素と結合できなかっ
た点線要素である。まず、図５のＡを処理の基準とする
と、Ａを中心に縦横任意の値（幅）で広げた範囲をアと
し、範囲アがＡと結合する処理範囲となる。また、図５
ではアの処理範囲内にＢが存在しているので、ＡとＢを
結合して、図６に示すように小かすれ領域αを形成す
る。FIGS. 5, 6, and 7 are diagrams for explaining generation of a blurred area. In FIGS. 5, 6, and 7, A, B, and C are single element dotted lines, and are dotted elements that could not be combined with two or more dotted elements. First, assuming that A in FIG. 5 is a processing reference, a range extended by an arbitrary value (width) in the vertical and horizontal directions around A is defined as A, and the range A is a processing range to be combined with A. FIG.
Since B exists in the processing range of A, A and B are combined to form a small fading area α as shown in FIG.

【００２７】小かすれ領域が抽出されたら、次に、小か
すれ領域αを基準として、図６に示すように結合範囲イ
を設定する。この設定された結合範囲にＣが存在してい
るので、前述したと同様にＣと小かすれ領域を結合させ
ると、図７に示すように小かすれ領域αが形成される。
このような処理を、結合範囲内に単一要素点線がなくな
るまで繰返す。結合範囲内に単一要素点線がなくなった
時点で、Ａを基準とした小かすれ領域抽出処理が終了す
る。After the small blurred area is extracted, a connection range A is set based on the small blurred area α as shown in FIG. Since C exists in this set coupling range, when C is combined with a small blur area in the same manner as described above, a small blur area α is formed as shown in FIG.
Such processing is repeated until there is no single element dotted line in the connection range. When there is no single element dotted line in the combined range, the small blurred area extraction processing based on A is completed.

【００２８】単一要素点線の結合部３１は、以上の処理
を全ての単一要素点線に対して実行する。抽出された小
かすれ領域はかすれ領域メモリ３３に保持される。次い
で、かすれ領域統合部３２は、単一要素点線の結合部３
１で生成された小かすれ領域に対して統合処理を実行す
る。かすれ領域メモリ３３に保持されている小かすれ領
域データを基に結合部３１の処理と同様に小かすれ領域
どうしを結合する。つまり、結合部３１における単一要
素点線の結合処理を小かすれ領域に置き換えた処理を行
う。ここで統合されたかすれ領域は、再びかすれ領域メ
モリ３３に保持される。The single element dotted line combining section 31 executes the above processing for all the single element dotted lines. The extracted small blurred area is held in the blurred area memory 33. Next, the blurred area integration section 32 connects the single element dotted line connection section 3.
The integration process is performed on the small blurred area generated in step 1. Based on the small blurred area data held in the blurred area memory 33, the small blurred areas are combined with each other in the same manner as in the processing of the combining unit 31. That is, a process is performed in which the combining process of the single element dotted line in the combining unit 31 is replaced with a small fading region. Here, the integrated blurred area is held in the blurred area memory 33 again.

【００２９】〈実施例３〉全画像に対して実施例２のか
すれ領域処理を実行させると処理時間がかかる。そこで
本実施例では、実施例１でかすれ画像と判断された画像
に対してのみ、実施例２のかすれ領域抽出処理を実行さ
せ、処理時間を短縮した実施例である。<Embodiment 3> When the blurred area processing of the embodiment 2 is executed for all images, it takes a long processing time. Therefore, the present embodiment is an embodiment in which the blurred area extraction processing of the second embodiment is executed only on the image determined to be the blurred image in the first embodiment, and the processing time is reduced.

【００３０】図８は、実施例３の構成を示す。また、図
９は、実施例３の処理フローチャートを示す。実施例３
の構成は、実施例１と実施例２の構成を組み合わせたも
のである。つまり、実施例１の処理によって原稿種類メ
モリ４８には、処理中の画像の種類が保持されているの
で、メモリ４８を参照したときに、画像がかすれている
という情報を得た場合に（ステップ３０８）、実施例２
のかすれ領域抽出処理４９を実行する（ステップ３０
９）。FIG. 8 shows the configuration of the third embodiment. FIG. 9 shows a processing flowchart of the third embodiment. Example 3
Is a combination of the configurations of the first and second embodiments. That is, since the type of the image being processed is held in the document type memory 48 by the processing of the first embodiment, when the information that the image is blurred is obtained when the memory 48 is referred to (step 308), Example 2
A blurred area extraction process 49 is executed (step 30).
9).

【００３１】〈実施例４〉点線抽出処理は、点線要素サ
イズの矩形を抽出し、しきい値内に位置している点線要
素矩形どうしを結合することによって点線を形成させて
いる。しかし、画像内のかすれている領域では不連続に
なった矩形を誤って結合することにより、疑似点線が発
生してしまう。Embodiment 4 In the dotted line extraction processing, a dotted line element size rectangle is extracted, and a dotted line is formed by combining the dotted line element rectangles located within the threshold value. However, a false dotted line is generated in a faint region in an image by erroneously combining discontinuous rectangles.

【００３２】そこで、本実施例では、画像中のかすれ領
域を簡単な方法で抽出することによって、かすれ領域で
抽出された点線を疑似点線をみなして点線情報から除去
し、点線認識の精度を向上させた実施例である。Therefore, in the present embodiment, the faint area in the image is extracted by a simple method, and the dotted line extracted in the faint area is removed from the dotted line information as a pseudo dotted line, thereby improving the accuracy of the dotted line recognition. This is an example of the embodiment.

【００３３】図１０は、実施例４の構成を示し、実施例
３の構成に疑似点線除去部５１をさらに付加したもの
で、他の構成要素は実施例３と同様である。また、図１
１は、実施例４の処理フローチャートを示す。実施例２
で抽出されたかすれ領域はかすれ領域メモリ５０に保持
されていて、点線抽出部４５で抽出された点線が罫線メ
モリ４６に保持されている。FIG. 10 shows the configuration of the fourth embodiment, in which a pseudo-dotted line removing section 51 is further added to the configuration of the third embodiment, and the other components are the same as those of the third embodiment. FIG.
1 shows a processing flowchart of the fourth embodiment. Example 2
Is stored in the blurred area memory 50, and the dotted line extracted by the dotted line extracting unit 45 is stored in the ruled line memory 46.

【００３４】疑似点線除去部５１は、罫線メモリ４６内
の点線の位置座標とかすれ領域メモリ５０のかすれ領域
とを比較して、かすれ領域内に存在している点線（疑似
点線）あるいはかすれ領域に接している点線（疑似点
線）を、罫線メモリ４６から除去する（ステップ４１
０）。The pseudo-dotted line removing section 51 compares the position coordinates of the dotted line in the ruled line memory 46 with the blurred region in the blurred region memory 50, and determines whether a dotted line (pseudo dotted line) or a blurred region existing in the blurred region exists. The touching dotted line (pseudo dotted line) is removed from the ruled line memory 46 (step 41).
0).

【００３５】〈実施例５〉かすれの程度が著しい画像中
の罫線や文字の認識精度を向上させるためには、認識処
理のアルゴリズムを修正する方法が考えられるが、アル
ゴリズムの開発までには所要の時間が必要となる。<Embodiment 5> In order to improve the accuracy of recognizing ruled lines and characters in an image in which the degree of blur is remarkable, a method of modifying the algorithm of the recognition process can be considered. It takes time.

【００３６】そこで、以下の実施例５、６では、かすれ
画像の不具合に対処するために、実施例２で抽出された
かすれ領域の数、または面積が所定値よりも大きい画像
においては、ユーザーに対してかすれ画像であることを
警告する処理を行う。警告がでた画像に対してユーザー
側が再度画像を入力するか、あるいは画像の修正を行な
うものである。Therefore, in the following embodiments 5 and 6, in order to cope with the problem of the blurred image, in the case of the image in which the number or the area of the blurred region extracted in the embodiment 2 is larger than the predetermined value, the user is required. On the other hand, a process of warning that the image is a blurred image is performed. The user inputs the image again for the image for which the warning has been issued, or corrects the image.

【００３７】図１２は、実施例５の構成である。原稿の
種類メモリ４８を参照したとき、かすれ原稿であるとい
う情報を得た場合に、かすれ画像警告部５２は、かすれ
画像であるという警告を行う。例えば、表示画面上に警
告を表示させたり、音声で報知するなどの方法を用いて
ユーザに知らせる。FIG. 12 shows the structure of the fifth embodiment. When referring to the document type memory 48 and obtaining information that the document is a blurred document, the blurred image warning unit 52 issues a warning that the image is a blurred image. For example, a warning is displayed on the display screen, or the user is notified by using a method of notifying by voice.

【００３８】〈実施例６〉図１３は、実施例６の構成を
示す。実施例２で抽出されたかすれ領域はかすれ領域メ
モリ５０に保持されているので、かすれ領域面積計算部
５３は、かすれ領域メモリ５０からかすれ領域の面積の
総和、またはかすれ領域数の総和等の計算を行い、所定
のしきし値を超えた場合には、実施例５と同様に、かす
れ画像警告部５２でかすれ画像であることを警告する。<Embodiment 6> FIG. 13 shows the structure of Embodiment 6. Since the blurred area extracted in the second embodiment is held in the blurred area memory 50, the blurred area area calculator 53 calculates the sum of the areas of the blurred areas from the blurred area memory 50 or the sum of the number of blurred areas. When the threshold value is exceeded, the blurred image warning unit 52 warns that the image is a blurred image, as in the fifth embodiment.

【００３９】〈実施例７〉図１４は、実施例７の構成を
示し、ソフトウェアによって実現する実施例である。本
発明をソフトウェアによって実現する場合には、図１４
に示すように、ＣＰＵ、メモリ、表示装置、ハードディ
スク、キーボード、ＣＤ‐ＲＯＭドライブ、マウスなど
からなるコンピュータシステムを用意する。ＣＤ−ＲＯ
Ｍなどのコンピュータ読み取り可能な記録媒体には、本
発明の画像処理機能や処理手順を実現するプログラムな
どが記録されている。また、処理対象の文書などの画像
は例えばハードディスクなどに格納されている。そし
て、ＣＰＵは、記録媒体から上記した処理機能、処理手
順を実現するプログラムを読み出し、ハードディスクな
どから読み込まれた画像がかすれ画像であるか否かを判
定し、その判定結果をディスプレイなどに表示出力す
る。<Embodiment 7> FIG. 14 shows the configuration of Embodiment 7 and is an embodiment realized by software. When the present invention is realized by software, FIG.
1, a computer system including a CPU, a memory, a display device, a hard disk, a keyboard, a CD-ROM drive, a mouse, and the like is prepared. CD-RO
On a computer-readable recording medium such as M, a program for realizing the image processing function and the processing procedure of the present invention is recorded. Images such as documents to be processed are stored in, for example, a hard disk. Then, the CPU reads a program for realizing the above-described processing functions and processing procedures from the recording medium, determines whether the image read from the hard disk or the like is a blurred image, and outputs the determination result on a display or the like. I do.

【００４０】[0040]

【発明の効果】以上、説明したように、請求項１、６記
載の発明によれば、罫線や文字認識の前処理として、入
力画像がかすれているか否かを判定しているので、その
後の処理である罫線や文字の認識精度の向上のために有
効な情報を与えることができる。As described above, according to the first and sixth aspects of the present invention, whether or not an input image is blurred is determined as preprocessing for ruled lines and character recognition. Effective information can be given to improve the accuracy of recognition of ruled lines and characters as processing.

【００４１】請求項２、６記載の発明によれば、入力画
像中のかすれている領域を抽出しているので、該領域情
報を用いることにより、罫線や文字の認識精度をより向
上させることが可能となる。According to the second and sixth aspects of the present invention, the blurred area in the input image is extracted, so that the recognition accuracy of ruled lines and characters can be further improved by using the area information. It becomes possible.

【００４２】請求項３、６記載の発明によれば、かすれ
画像と判定された画像に対してのみかすれ領域の抽出処
理を実行しているので、処理時間が短縮される。According to the third and sixth aspects of the present invention, the processing time is shortened because the blurred area extraction processing is performed only on the image determined to be a blurred image.

【００４３】請求項４記載の発明によれば、かすれ領域
に存在する点線は疑似点線である確率が高いため、かす
れ領域内で抽出された点線を除外することにより、画像
中の疑似点線の除去率が向上し、従って点線罫線の抽出
精度が向上する。According to the fourth aspect of the present invention, since the dotted line existing in the blurred area has a high probability of being a pseudo dotted line, the dotted line extracted in the blurred area is removed to remove the pseudo dotted line from the image. Thus, the accuracy of extracting dotted ruled lines is improved.

【００４４】請求項５記載の発明によれば、入力画像が
かすれ画像であることをユーザに喚起することができ、
ユーザはかすれ画像に対して的確な処置を実行すること
が可能となる。According to the fifth aspect of the present invention, it is possible to alert the user that the input image is a blurred image.
The user can execute an appropriate treatment on the blurred image.

[Brief description of the drawings]

【図１】本発明の実施例１の構成を示す。FIG. 1 shows a configuration of a first exemplary embodiment of the present invention.

【図２】本発明の実施例１の処理フローチャートを示
す。FIG. 2 shows a processing flowchart of Embodiment 1 of the present invention.

【図３】本発明の実施例２の構成を示す。FIG. 3 shows a configuration of a second exemplary embodiment of the present invention.

【図４】本発明の実施例２の処理フローチャートを示
す。FIG. 4 shows a processing flowchart according to a second embodiment of the present invention.

【図５】かすれ領域生成を説明するための第１の図であ
る。FIG. 5 is a first diagram for describing generation of a blurred area.

【図６】かすれ領域生成を説明するための第２の図であ
る。FIG. 6 is a second diagram illustrating the generation of a blurred area.

【図７】かすれ領域生成を説明するための第３の図であ
る。FIG. 7 is a third diagram for describing generation of a blurred area.

【図８】本発明の実施例３の構成を示す。FIG. 8 shows a configuration of a third embodiment of the present invention.

【図９】本発明の実施例３の処理フローチャートを示
す。FIG. 9 shows a processing flowchart of Embodiment 3 of the present invention.

【図１０】本発明の実施例４の構成を示す。FIG. 10 shows a configuration of a fourth embodiment of the present invention.

【図１１】本発明の実施例４の処理フローチャートを示
す。FIG. 11 shows a processing flowchart of a fourth embodiment of the present invention.

【図１２】本発明の実施例５の構成を示す。FIG. 12 shows a configuration of Embodiment 5 of the present invention.

【図１３】本発明の実施例６の構成を示す。FIG. 13 shows a configuration of Embodiment 6 of the present invention.

【図１４】本発明の実施例７の構成を示す。FIG. 14 shows a configuration of Embodiment 7 of the present invention.

[Explanation of symbols]

１画像入力部２２値イメージメモリ３矩形抽出部４矩形メモリ５点線抽出部６点線要素選択部７点線要素メモリ８結合処理部９罫線メモリ１０かすれ原稿判定部１１カウンタ１２マーキング部１３原稿種類メモリ Reference Signs List 1 Image input unit 2 Binary image memory 3 Rectangle extraction unit 4 Rectangle memory 5 Dotted line extraction unit 6 Dotted line element selection unit 7 Dotted line element memory 8 Combination processing unit 9 Ruled line memory 10 Blurred document determination unit 11 Counter 12 Marking unit 13 Document type memory

Claims

[Claims]

1. A rectangle of a black pixel continuous component is extracted from binarized image data, a rectangle corresponding to a dotted line element is extracted from the extracted rectangle, and a predetermined distance is extracted from the extracted dotted line element. The combining process is performed on the dotted line elements that are within, as a result of the process, the uncombined dotted line elements are counted, and when the counted value is equal to or greater than a predetermined value, the image is determined to be a blurred image. Image processing method.

2. A rectangle of a continuous component of black pixels is extracted from the binarized image data, a rectangle corresponding to a dotted line element is extracted from the extracted rectangle, and a predetermined distance is extracted from the extracted dotted line element. An image processing method for extracting a blurred area from the image by combining the dotted line elements within the image and integrating the unconnected dotted line elements as a result of the processing.

3. A rectangle of a continuous component of black pixels is extracted from the binarized image data, a rectangle corresponding to a dotted line element is extracted from the extracted rectangle, and a predetermined distance is determined for the extracted dotted line element. The dotted line elements within are combined, and as a result of the processing, the uncombined dotted line elements are counted.When the counted value is equal to or more than a predetermined value, the image is determined to be a blurred image, and the determination is performed. An image processing method for extracting a blurred area by integrating the unconnected dotted line elements in the blurred image.

4. The image processing method according to claim 2, wherein a dotted line existing in the blurred area is removed as a pseudo dotted line.

5. The image processing method according to claim 1, wherein when the size of the blurred image is equal to or larger than a predetermined value, a warning is issued that the image is a blurred image.

6. A function of extracting a rectangle of a continuous component of black pixels from binarized image data, a function of extracting a rectangle corresponding to a dotted line element from the extracted rectangle, and a function of extracting a rectangle corresponding to a dotted line element. A function of combining dotted line elements within a predetermined distance, a function of counting dotted line elements that have not been combined as a result of the processing, and when the count value is equal to or greater than a predetermined value, the image is a blurred image. A function to determine that there is, a function to extract a blurred area by integrating the unconnected dotted line elements, a function to remove a dotted line existing in the blurred area as a pseudo dotted line, A computer-readable recording medium storing a program for causing a computer to realize a function of warning that the image is a blurred image when the size of the image is equal to or more than a predetermined value.