JP2012128635A

JP2012128635A - Image processing apparatus, image processing method and program

Info

Publication number: JP2012128635A
Application number: JP2010279161A
Authority: JP
Inventors: Masaru Sekiguchi; 優関口; Toshifumi Yamaai; 敏文山合
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2010-12-15
Filing date: 2010-12-15
Publication date: 2012-07-05

Abstract

PROBLEM TO BE SOLVED: To determine whether an input image is a captured image or a pure electronic image.SOLUTION: The image processing apparatus includes: image data input means 201 for inputting image data; character rectangle extraction processing means 202 for extracting a character rectangle from the image data; character clustering processing means 203 for clustering a character element from the character rectangle obtained by the character rectangle extraction processing means 202; difference calculation means 204 for superimposing the character element which has been clustered by the character clustering processing means 203 and thereby calculating a difference; and image type determination means 205 for determining whether or not the input image is a captured image indicating the image data which has been optically read depending on whether or not a differential value is a predetermined threshold value or more based on the calculated differential value.

Description

本発明は入力画像が撮像画像か純電子画像かを判別する画像処理装置、画像処理方法及びプログラムに関する。 The present invention relates to an image processing apparatus, an image processing method, and a program for determining whether an input image is a captured image or a pure electronic image.

画像データは、従来から、例えば、ＰＤＦ（Portable Document Format）に格納されている画像を、再度ＯＣＲ（Optical Character Reader）などで画像処理を行ったり、文字認識したデータや、領域識別したデータを別の用途に再利用するなど、多様な用途に用いられている。
この画像データは、一般にスキャナ、ＣＣＤカメラなどで撮像した撮像画像と、文書アプリケーションや画像作成アプリケーションから直接ＰＤＦや画像に落とした画像データ（以下「純電子画像データ」と云う）の２種類に分類され、その処理は画像データの種類に応じて適不適がある。 Conventionally, for example, image data stored in PDF (Portable Document Format) is processed again with OCR (Optical Character Reader), etc., and character recognition data or area identification data is separated. It is used for various purposes, such as being reused for various purposes.
This image data is generally classified into two types: a captured image captured by a scanner, a CCD camera, etc., and image data (hereinafter referred to as “pure electronic image data”) dropped directly into a PDF or image from a document application or image creation application. The processing is not appropriate depending on the type of image data.

即ち、画像が撮像画像であれば、通常の画像と同じ処理を行うことができるが、純電子画像の場合は、アナログ変換を通らないことで、ノイズレス、色のぶれなどもない高画質な画像となるはずのところ、ノイズなどが入っていることが前提の従来の認識系のパラメータ処理を行うと、画質が低下したり、処理時間が長くなるという問題がある。
ここで、認識系のパラメータ処理としては、例えばスキュー補正、ノイズ除去、天地識別処理などで画像データを補正する処理が挙げられる。しかし、純電子画像データではこれらの処理は不要であるだけではなく、それを行うと画質の低下や、無駄な処理時間を要することになる。
そのため、撮像画像と純電子画像データを区別して制御することが必要であり、そのためには、処理対象画像が撮像画像か否かを知る必要がある。 In other words, if the image is a captured image, the same processing as a normal image can be performed. However, in the case of a pure electronic image, it does not pass through analog conversion, so that it is noiseless and has no image blur. However, when the conventional recognition system parameter processing is performed on the assumption that noise or the like is included, there is a problem that the image quality is deteriorated or the processing time is increased.
Here, as the parameter processing of the recognition system, for example, processing for correcting image data by skew correction, noise removal, top-and-bottom identification processing, and the like can be cited. However, these processes are not necessary for pure electronic image data, and if they are performed, image quality is deteriorated and useless processing time is required.
Therefore, it is necessary to distinguish and control the captured image and the pure electronic image data. To that end, it is necessary to know whether the processing target image is a captured image.

この点については、既に処理対象画像が撮像画像か否かを知るための装置及び方法が提案されている（特許文献１参照）。
即ち、特許文献１には、入力された画像データが撮像画像か純電子画像かを自動的に判定する画像処理装置が記載されており、この画像処理装置では、入力された画像データについて、処理対象となる領域としてその背景領域を特定し、特定した背景領域の画素値に着目して、例えば背景色のＲＧＢが一様であるか、ばらつきがあるかについて画素値の分散値を取得し、分散値が閾値以上か未満かによって、閾値以上であれば撮像画像と判断し、閾値未満であれば純電子画像と判断している。 Regarding this point, an apparatus and a method for knowing whether or not the processing target image is a captured image have been proposed (see Patent Document 1).
That is, Patent Document 1 describes an image processing device that automatically determines whether input image data is a captured image or a pure electronic image. In this image processing device, processing is performed on input image data. Identify the background area as the target area, paying attention to the pixel value of the specified background area, for example, obtain the variance value of the pixel value as to whether the background color RGB is uniform, or there is variation, Depending on whether the variance value is greater than or less than the threshold value, it is determined as a captured image if the variance value is greater than or equal to the threshold value, and is determined to be a pure electronic image if less than the threshold value.

しかしながら、特許文献１に記載された画像処理装置では、処理対象画像が撮像画像か否かの判断にあたり、その前提として文字矩形の抽出を行っていないために、文字認識のための処理に時間がかかるだけではなく、メモリも大きなものが必要となっている。 However, in the image processing apparatus described in Patent Document 1, character rectangles are not extracted as a precondition for determining whether a processing target image is a captured image. In addition to this, a large memory is required.

本発明は、前記従来の問題に鑑みてなされてものであって、その目的は画像データからそれが撮像画像か純電子画像データかの判別を、従来よりも短時間で行うことができ、かつメモリ量も低減できる画像処理装置等を提供することである。 The present invention has been made in view of the above-mentioned conventional problems, and its purpose is to determine whether it is a captured image or pure electronic image data from image data in a shorter time than in the past, and An object of the present invention is to provide an image processing apparatus that can reduce the amount of memory.

請求項１の発明は、画像データの入力処理を行う画像データ入力手段と、前記画像データから、文字矩形の抽出処理をする文字矩形抽出処理手段と、前記文字矩形抽出処理手段によって得られた少なくとも１つ以上の文字矩形から文字要素をクラスタリングする文字クラスタリング処理手段と、前記文字クラスタリング処理手段によってクラスタリングされた文字要素に対して、重ね合わせをして差分を算出する差分算出手段と、前記差分算出手段によって算出された差分値に基づいて、前記差分値が所定の閾値以上か否かにより、光学的に読み込んだ画像データを示す撮像画像であるか、純電子画像であるかを判断する画像種類判断手段と、を備えた画像処理装置である。
請求項２の発明は、請求項１に記載された画像処理装置において、さらに、前記画像種類判断手段の判断に基づいて、前記画像データに対する画像処理を、撮像画像用又は純電子画像用の画像処理に切り換える手段を備えた画像処理装置である。
請求項３の発明は、画像処理装置における、画像データの入力処理を行う画像データ入力ステップと、前記画像データから、文字矩形抽出処理をする文字矩形抽出処理ステップと、前記文字矩形抽出処理ステップで得られた少なくとも１つ以上の文字矩形から文字要素をクラスタリングする文字クラスタリング処理ステップと、前記文字クラスタリング処理ステップによってクラスタリングされた文字要素に対して、重ね合わせをして差分を算出する差分算出ステップと、前記差分算出ステップによって算出された差分値に基づいて、前記差分値が所定の閾値以上か否かにより、光学的に読み込んだ画像データを示す撮像画像であるか純電子画像であるかを判断する画像種類判断ステップと、を有することを特徴とする画像処理方法である。
請求項４の発明は、請求項３に記載された画像処理方法において、さらに、前記画像種類判断ステップにおける判断に基づいて、前記画像データに対する画像処理を、撮像画像用又は純電子画像用の画像処理に切り換えるステップを有することを特徴とする画像処理方法である。
請求項５の発明は、コンピュータを、画像データの入力処理を行う画像データ入力手段、前記画像データから文字矩形抽出処理をする文字矩形抽出処理手段、前記文字矩形抽出処理手段によって得られた少なくとも１つ以上の文字矩形から文字要素をクラスタリングする文字クラスタリング処理手段、前記文字クラスタリング処理手段によってクラスタリングされた文字要素に対して、重ね合わせをして差分を算出する差分算出手段、前記差分算出手段によって算出された差分値に基づいて、所定の閾値以上か否かにより、光学的に読み込んだ画像データを示す撮像画像であるか純電子画像であるかを判断する画像種類判断手段、として機能させる画像処理プログラムである。
請求項６の発明は、請求項５に記載された画像処理プログラムにおいて、コンピュータを、さらに、前記画像種類判断手段の判断に基づいて、前記画像データに対する画像処理を、撮像画像又は純電子画像用の画像処理に切り換える手段として機能させる画像処理プログラムである。 According to the first aspect of the present invention, there is provided image data input means for performing image data input processing, character rectangle extraction processing means for extracting character rectangles from the image data, and at least obtained by the character rectangle extraction processing means. Character clustering means for clustering character elements from one or more character rectangles, difference calculating means for calculating a difference by superimposing the character elements clustered by the character clustering means, and the difference calculation Based on the difference value calculated by the means, an image type for determining whether the difference value is equal to or greater than a predetermined threshold value is a captured image indicating image data read optically or a pure electronic image And an image processing apparatus.
According to a second aspect of the present invention, in the image processing apparatus according to the first aspect, the image processing for the image data is further performed based on the determination by the image type determination unit. An image processing apparatus having means for switching to processing.
According to a third aspect of the present invention, an image data input step for performing image data input processing, a character rectangle extraction processing step for performing character rectangle extraction processing from the image data, and the character rectangle extraction processing step in the image processing apparatus. A character clustering step of clustering character elements from the obtained at least one or more character rectangles; a difference calculating step of calculating a difference by superimposing the character elements clustered by the character clustering step; Based on the difference value calculated by the difference calculation step, whether the difference value is equal to or greater than a predetermined threshold value determines whether the image is a captured image indicating the optically read image data or a pure electronic image And an image type determination step.
According to a fourth aspect of the present invention, in the image processing method according to the third aspect, the image processing for the image data is further performed for an image for a captured image or a pure electronic image based on the determination in the image type determination step. An image processing method comprising a step of switching to processing.
According to a fifth aspect of the present invention, at least one obtained by an image data input means for performing image data input processing, a character rectangle extraction processing means for performing character rectangle extraction processing from the image data, and the character rectangle extraction processing means. Character clustering processing means for clustering character elements from two or more character rectangles, difference calculation means for calculating a difference by superimposing character elements clustered by the character clustering processing means, and calculation by the difference calculation means Image processing that functions as an image type determination unit that determines whether the image is a captured image or a pure electronic image indicating optically read image data based on whether or not the difference value is equal to or greater than a predetermined threshold value It is a program.
According to a sixth aspect of the present invention, in the image processing program according to the fifth aspect, the computer further performs image processing on the image data based on the determination by the image type determination means for a captured image or a pure electronic image. This is an image processing program that functions as means for switching to image processing.

本発明によれば、入力画像データから撮像画像と純電子画像を判別する際に、入力画像の文字矩形抽出を行い、抽出した文字矩形を用いて判別処理を行うため、従来の判別処理に比して処理時間の高速化、省メモリ化ができる。
また、入力画像の解像度に関係なく判別処理が可能であると共に、判別処理後においても、文字認識処理など文字矩形を利用して別の処理ができるため、その点からも無駄な処理やメモリを低減することができる。 According to the present invention, when the captured image and the pure electronic image are discriminated from the input image data, the character rectangle is extracted from the input image, and the discrimination processing is performed using the extracted character rectangle. As a result, processing time can be increased and memory can be saved.
In addition, discrimination processing is possible regardless of the resolution of the input image, and even after the discrimination processing, another processing such as character recognition processing can be performed using a character rectangle. Can be reduced.

本発明の実施形態に係る画像処理装置の構成図である。1 is a configuration diagram of an image processing apparatus according to an embodiment of the present invention. 図１の画像処理部の内部構成を示す図である。It is a figure which shows the internal structure of the image process part of FIG. 前記画像処理部の処理手順を示すフロー図である。It is a flowchart which shows the process sequence of the said image process part. 図３のフローにおけるサブルーチンを示すフロー図である。It is a flowchart which shows the subroutine in the flow of FIG. 差分算出手段と画像種類判定手段での処理の具体例を説明する図である。It is a figure explaining the specific example of the process in a difference calculation means and an image kind determination means.

本発明の実施形態について図面を参照して説明する。
図１は、本発明の実施形態に係る画像処理装置の構成図である。
本画像処理装置は、全体の制御を行うＣＰＵ（Central Processing Unit）１０１と、原稿などを読み取るスキャナ部１０２と、ハードディスクやＣＤ／ＤＶＤ／メモリカードなどの外部記憶部１０３と、外部記憶部１０３から原稿ファイルを読み取った入力画像を記憶する入力画像メモリ部１０４と、入力画像をプレビュー表示する表示部１０５と、ユーザが領域を指定するための操作を行うパネル部１０６と、パネル部１０６において指定された領域情報を格納する領域情報格納部１０７と、文字認識をはじめとする画像処理を行う画像処理部１０８と、ネットワーク網１１０から入力画像を取得したり、外部記憶部１０３に記憶された処理結果の情報を配達する通信制御部１０９と、から成っている。 Embodiments of the present invention will be described with reference to the drawings.
FIG. 1 is a configuration diagram of an image processing apparatus according to an embodiment of the present invention.
The image processing apparatus includes a central processing unit (CPU) 101 that performs overall control, a scanner unit 102 that reads a document, an external storage unit 103 such as a hard disk or a CD / DVD / memory card, and an external storage unit 103. An input image memory unit 104 that stores an input image obtained by reading an original file, a display unit 105 that displays a preview of the input image, a panel unit 106 that allows a user to specify an area, and a panel unit 106 that are designated. A region information storage unit 107 that stores the region information; an image processing unit 108 that performs image processing such as character recognition; and a processing result obtained by acquiring an input image from the network 110 or stored in the external storage unit 103 And a communication control unit 109 for delivering the information.

図２は、図１の画像処理部１０８の内部構成を示す図である。
この画像処理部１０８は、図示のように、画像データ入力手段２０１と、文字矩形抽出処理手段２０２と、文字クラスタリング処理手段２０３と、差分算出手段２０４と、画像種類判断手段２０５と、画像処理手段２０６とから成っている。
画像処理部１０８の前記各手段はいずれも、図示しない内部メモリ（ＲＯＭ（Read
Only Memory））に格納されたプログラムによって実現される機能実現手段である。そのプログラムはコンピュータ読取可能な周知の記録媒体に記録される。また、コンピュータはＣＰＵ１０１と図示しないＲＡＭ（Random Access Memory）及びＲＯＭで構成してもよいし、或いは画像処理部１０８専用のコンピュータであってもよい。 FIG. 2 is a diagram showing an internal configuration of the image processing unit 108 of FIG.
As shown, the image processing unit 108 includes an image data input unit 201, a character rectangle extraction processing unit 202, a character clustering processing unit 203, a difference calculation unit 204, an image type determination unit 205, and an image processing unit. It consists of 206.
Each of the means of the image processing unit 108 is not shown in an internal memory (ROM (Read
It is a function realization means realized by a program stored in Only Memory)). The program is recorded on a known computer-readable recording medium. Further, the computer may be constituted by a CPU 101 and a RAM (Random Access Memory) and ROM (not shown), or may be a computer dedicated to the image processing unit 108.

画像データ入力手段２０１は、スキャナ、デジタルカメラ、ネットワークなどにより、画像を取得する処理を行う。
文字矩形抽出処理手段２０２は、画像データ入力手段２０１で取得した画像データから文字矩形を抽出する処理を行う。この文字矩形の抽出は任意の方法でよいが、例えば、生成された文書画像の黒ランに基づいて、黒ランの外接矩形を求め、求めた黒ランの外接矩形から文字と見なせる矩形を抽出する処理を行うことで文字矩形の抽出を行う。文字要素と思われる矩形の判定は、例えばＯＣＲ処理可能な文字のサイズ制限に基づいて、矩形サイズを制限することにより実行可能である。 The image data input unit 201 performs processing for acquiring an image using a scanner, a digital camera, a network, or the like.
The character rectangle extraction processing unit 202 performs processing for extracting a character rectangle from the image data acquired by the image data input unit 201. The character rectangle can be extracted by any method. For example, a circumscribed rectangle of the black run is obtained based on the black run of the generated document image, and a rectangle that can be regarded as a character is extracted from the circumscribed rectangle of the obtained black run. Character rectangles are extracted by performing processing. The determination of a rectangle that seems to be a character element can be executed by limiting the rectangle size based on, for example, the size limit of characters that can be subjected to OCR processing.

文字クラスタリング処理手段２０３は、文字矩形抽出処理手段２０２で抽出した文字矩形データから、文字要素である同一字形の文字データにクラスタリングする（切り分ける）処理を行う。即ち、文字には「撥ね」、「偏」、「旁」などの特徴があるので、ここでは、文字字形の「撥ね」、「偏」、「旁」などの特徴をクラスタリングする。
ここで、文字矩形をクラスタリングする具体的方法については、例えば、特開平１０−２０８０４０号公報に記載された「処理対象画像より、参照画像中のパターンと形状および色の両方が同一のパターンを抽出する」方法を用いる。 The character clustering processing unit 203 performs processing for clustering (separating) the character rectangle data extracted by the character rectangle extraction processing unit 202 into character data of the same character shape as a character element. That is, since the characters have characteristics such as “pluck”, “bias”, and “旁”, the features such as “pluck”, “bias”, and “旁” of the character shape are clustered here.
Here, for a specific method for clustering character rectangles, for example, “a pattern having the same shape and color as a pattern in a reference image is extracted from an image to be processed” described in JP-A-10-208040. Use "method.

この方法は、例えば、パターンの形状と色の同一性を別々に判断する。まず、比較すべき文字矩形の二つのパターンの色相違度Ｄｃを、ＲＧＢ毎の入力画像と参照画像の差（の自乗）を矩形内の全画素にわたって加算して算出し、算出された色相違度が所定の閾値以下であるとき同一と判断し、次に、公知のパターンマッチング方法（例えば米国特許第５３０３３１３号明細書）で文字矩形のパターンの形状の同一性判断を行うものである。例えば、各パターンの各画素のＲＧＢ各濃度値から明度値を計算し、各パターンの明度のデータを用いてパターンマッチング処理を行い、このパターンマッチング処理で同一と判断された場合に、当該二つのパターンを同一のパターンと判断する。 In this method, for example, the identity of the pattern shape and color is determined separately. First, the color difference degree Dc of the two patterns of the character rectangles to be compared is calculated by adding the difference (square) between the input image and the reference image for each RGB over all the pixels in the rectangle, and the calculated color difference When the degree is equal to or less than a predetermined threshold value, it is determined that they are the same, and next, the identity of the shape of the rectangular character pattern is determined by a known pattern matching method (for example, US Pat. No. 5,303,313). For example, the brightness value is calculated from the RGB density values of each pixel of each pattern, the pattern matching process is performed using the brightness data of each pattern, and when it is determined that the pattern matching process is the same, The pattern is determined to be the same pattern.

また、パターンの形状と色の同一性を同時に判断する公知のパターンマッチング処理（例えば、米国特許第５３０３３１３号明細書に記載されたパターンマッチング処理）をＲＧＢの各色毎に行い、全てのパターンマッチングで同一と判断されたとき、当該パターンを形状及び色が同一と判断する方法でもよい。
なお、文字画像のクラスタリングは、次に示す既に公知の方法によってもよい。
特開２００７−０５８８８２「パターン認識装置」、
特開２００１−３３８２６４「文字認識パターン辞書作成装置、文字認識パターン辞書作成方法および記録媒体」、
特開平５−１７４１９３「文字認識方法」、
特開平４−１２３２８３「認識辞書作成方法」。 Also, a known pattern matching process (for example, a pattern matching process described in US Pat. No. 5,303,313) for simultaneously determining the identity of the pattern shape and color is performed for each color of RGB, and all pattern matching is performed. When it is determined that the patterns are the same, a method may be used in which the patterns are determined to have the same shape and color.
Note that the clustering of character images may be performed by the following known method.
JP 2007-058882 “Pattern recognition device”,
JP 2001-338264 “Character recognition pattern dictionary creation device, character recognition pattern dictionary creation method and recording medium”,
JP-A-5-174193 "Character recognition method",
JP-A-4-123283 “Recognition Dictionary Creation Method”.

差分算出手段２０４は、文字クラスタリング処理手段２０３でクラスタリングされたデータを重ね合わせて、差分を算出する処理を行う。
その上で、クラスタリングした文字データ同士を重ねて差分を算出する。この場合、クラスタリングされた文字データに対して、クラスタリングした文字データ全てについて、もしくはその一部だけについて、差分を算出する。
具体的には、文字を示す画素で構成される文字領域単位で、図５に示すように、「あ」（図中左の「あ」（要素１）は黄色、中間の「あ」（要素２）は赤、右の「あ」（要素３）は緑に着色されているものとする）という文字画像の撥ね、偏、旁などの特徴をクラスタリングして、別の「あ」という文字画像をクラスタリングした特徴についてのデータベースを参照して、その特徴の全て、もしくは一部の特徴に対しての差分を算出する。 The difference calculation unit 204 performs processing for calculating the difference by superimposing the data clustered by the character clustering processing unit 203.
Then, the difference is calculated by superimposing the clustered character data. In this case, with respect to the clustered character data, the difference is calculated for all or only part of the clustered character data.
More specifically, as shown in FIG. 5, “A” (“A” (element 1) on the left in the figure is yellow, middle “A” (element 2) is red, and the right “A” (element 3) is colored green). Is referred to a database of features clustered, and differences for all or some of the features are calculated.

画像種類判断手段２０５は、差分算出手段２０４で算出した差分値と予め定めた閾値を比較し、その差分が閾値未満であれば純電子画像、閾値以上であれば撮像画像と判断する。つまり、例えば、クラスタリングした数値が一定の基準値内に入っているときは、差分が少ないとみなし、純電子画像と判断する。
図５の例では、データベースの要素１の別の「あ」（黄色）とクラスタリングした要素２の「あ」（赤）との差分値での判断結果は「一致度が低い」であり、前記要素１とクラスタリングした要素３の「あ」（緑）との差分値での判断結果は「一致度が高い」である。 The image type determination unit 205 compares the difference value calculated by the difference calculation unit 204 with a predetermined threshold value, and determines that the difference is less than the threshold value, a pure electronic image, and if the difference is greater than or equal to the threshold value, the image type determination image. That is, for example, when the clustered numerical values are within a certain reference value, it is considered that the difference is small and it is determined as a pure electronic image.
In the example of FIG. 5, the determination result of the difference value between another “A” (yellow) of the element 1 of the database and “A” (red) of the clustered element 2 is “low coincidence”, The determination result of the difference value between “A” (green) of element 1 and clustered element 3 is “high degree of coincidence”.

画像処理手段２０６は、画像種類判断手段２０５で判断した判断結果にしたがって画像処理を切り換える手段を備え、それによって、判断された画像に適した処理を行うことができる。画像処理手段２０６の画像処理には、撮像画像に対するスキュー補正、ノイズ除去、天地識別などの画像データを補正処理や、純電子画像ではこれらの処理を省略する処理など画像に適した処理以外に、文字矩形を利用した処理も含まれる。 The image processing unit 206 includes a unit that switches the image processing according to the determination result determined by the image type determination unit 205, thereby performing processing suitable for the determined image. In the image processing of the image processing means 206, in addition to processing suitable for the image such as correction processing of image data such as skew correction, noise removal, and top / bottom identification for the captured image, and processing that omits these processing in a pure electronic image, Processing using character rectangles is also included.

図３は前記画像処理部１０８の処理手順を示すフロー図である。
図３において、本画像処理部１０８に画像データが入力されると（Ｓ１０１）、続いて画像判定を行い（Ｓ１０２）、画像判定後、その判定結果に基づきそれに応じた画像処理、或いは文字矩形データを利用した文字認識などの処理を行う（Ｓ１０３）。 FIG. 3 is a flowchart showing a processing procedure of the image processing unit 108.
In FIG. 3, when image data is input to the main image processing unit 108 (S101), image determination is subsequently performed (S102). After image determination, image processing corresponding to the determination result or character rectangle data is performed based on the determination result. Processing such as character recognition using S is performed (S103).

図４は図３のステップＳ１０２、Ｓ１０３の画像判定処理のサブルーチンである。
図４において、画像データ入力手段２０１がスキャナ、デジタルカメラ、ネットワークなどにより、画像を取得すると（Ｓ２０１）、文字矩形抽出処理手段２０２は、画像データ入力手段２０１で取得した画像データから文字矩形を抽出する（Ｓ２０２）。
続いて、文字クラスタリング処理手段２０３は、文字矩形抽出処理手段２０２で抽出した文字矩形データから、同一字形の文字データにクラスタリングを行い（Ｓ２０３）、差分算出手段２０４は、文字クラスタリング処理手段２０３でクラスタリングされたデータを重ね合わせて、その差分を算出する（Ｓ２０４）。 FIG. 4 is a subroutine of the image determination process in steps S102 and S103 in FIG.
In FIG. 4, when the image data input unit 201 acquires an image by a scanner, digital camera, network, or the like (S201), the character rectangle extraction processing unit 202 extracts a character rectangle from the image data acquired by the image data input unit 201. (S202).
Subsequently, the character clustering processing unit 203 performs clustering on the character data of the same character shape from the character rectangle data extracted by the character rectangle extraction processing unit 202 (S203), and the difference calculation unit 204 performs clustering by the character clustering processing unit 203. The obtained data is superimposed and the difference is calculated (S204).

画像種類判断手段２０５は、差分算出手段２０４で算出した差分値から前記重ね合わせた文字矩形データの一致度を判断し（Ｓ２０５）、一致度が低い、つまり差分値が予め設定した閾値未満であれば、撮像画像と判定し、撮像画像を想定した制御を行い、或いは撮像画像を特定するパラメータを設定し（Ｓ２０６）、差分値が設定した閾値以上であれば純電子画像と判定し、純電子画像を想定した制御を行うか、或いは純電子画像を特定するパラメータを設定して（Ｓ２０７）、ルーチンを終了する。 The image type determining unit 205 determines the matching degree of the superimposed character rectangle data from the difference value calculated by the difference calculating unit 204 (S205), and if the matching degree is low, that is, the difference value is less than a preset threshold value. For example, it is determined that the image is a captured image, and control is performed assuming the captured image, or a parameter for specifying the captured image is set (S206). Control is performed assuming an image, or a parameter for specifying a pure electronic image is set (S207), and the routine is terminated.

以上のように、本実施形態では、入力画像から文字矩形を抽出し、その文字矩形をクラスタリングする。クラスタリングされた矩形に対して、その文字矩形の中のビットマップの一致度を見る。一致度を見るための方法には、クラスタリングされた文字の字形を重ね合わせて差分を見て、差分が少ない、即ち一致度が高い要素が多ければ、純電子画像として判別する（図５はこの処理のイメージを示したものである）。これらの処理は入力画像の解像度に関係なく行うことができる。 As described above, in this embodiment, character rectangles are extracted from the input image, and the character rectangles are clustered. For a clustered rectangle, check the degree of matching of the bitmap in the character rectangle. As a method for viewing the degree of coincidence, a clustered character shape is overlapped to look at the difference. Image of processing). These processes can be performed regardless of the resolution of the input image.

以上説明したように、本実施形態によれば、画像入力と同時に、その画像を入力した手段（画像読取によるものか、文書アプリケーションによるものか）をパラメータとして取得することで、その画像に応じた処理を行う（例えば、純電子画像であれば、文字コード情報は画像情報からＯＣＲなどの手法を用いて取得するよりも、もともとの電子情報から抽出が可能であればその方が精度の良いデータが得られる。これに対し撮像画像であれば、ＯＣＲを利用することで文字コードデータを得ることができる）。 As described above, according to the present embodiment, at the same time as inputting an image, the means (whether by image reading or by a document application) that inputs the image is acquired as a parameter, so that the image is matched. (For example, in the case of a pure electronic image, the character code information is more accurate if it can be extracted from the original electronic information rather than acquiring it from the image information using a technique such as OCR. On the other hand, in the case of a captured image, character code data can be obtained by using OCR).

また、この処理によって得られた文字矩形は、次の画像処理手段２０６にて用いることもできる。次の画像処理手段２０６には、文字や写真、表という領域を識別する処理や、文字認識処理などをはじめとする画像処理を行うことができる。
また、本画像処理方法は、上記画像データのカテゴリごとに文字認識する技術と組み合わせて、ＯＣＲを高速化することもできるなどの応用例を追加する（効果を挙げる）ことができる。 The character rectangle obtained by this processing can also be used in the next image processing means 206. The next image processing means 206 can perform image processing including processing for identifying areas such as characters, photographs, and tables, character recognition processing, and the like.
In addition, this image processing method can be combined with a technique for recognizing characters for each category of the image data, and can add an application example (such as an effect) that can speed up OCR.

つまり、既に述べたように、撮像画像であれば、通常の画像と同じ処理で扱ってよいが、純電子画像であれば、アナログ変換が通らないため、ノイズなどが入っていることが前提となる従来の認識系のパラメータ処理を行うと、画質が低下したり、処理時間が長くなるという問題がある。
これに対し、本実施形態では、撮像画像と純電子画像データを区別して制御する、つまり文字矩形の中にある画素の一致度を見て、判定を行い、その結果から撮像画像か否かを判定して、その画像に適した処理を行うため、例えば純電子画像データはスキュー補正、ノイズ除去、天地識別などの画像データを補正する処理を省略することができ、処理時間の高速化、省メモリ化の効果がある。 In other words, as already mentioned, if it is a captured image, it may be handled by the same processing as a normal image, but if it is a pure electronic image, analog conversion does not pass, so it is assumed that noise etc. are included When the conventional recognition system parameter processing is performed, there is a problem that the image quality deteriorates and the processing time becomes long.
On the other hand, in the present embodiment, the captured image and the pure electronic image data are controlled separately, that is, the determination is made by looking at the degree of coincidence of the pixels in the character rectangle, and whether or not the captured image is the result is determined. Therefore, for example, pure electronic image data can be processed by correcting image data such as skew correction, noise removal, and top-and-bottom identification. There is an effect of memory.

１０１・・・ＣＰＵ、１０２・・スキャナ部、１０３・・・外部記憶部、１０４・・・入力画像メモリ部、１０５・・・表示部、１０６・・・パネル部、１０７・・・領域情報格納部、１０８・・・画像処理部、１０９・・・通信制御部、１１０・・・ネットワーク網、２０１・・・画像データ入力手段、２０２・・・文字矩形抽出処理手段、２０３・・・文字クラスタリング処理手段、２０４・・・差分算出手段、２０５・・・画像種類判断手段、２０６・・・画像処理手段。 DESCRIPTION OF SYMBOLS 101 ... CPU, 102..Scanner unit, 103 ... External storage unit, 104 ... Input image memory unit, 105 ... Display unit, 106 ... Panel unit, 107 ... Storage of area information 108, image processing unit, 109, communication control unit, 110, network, 201, image data input means, 202, character rectangle extraction processing means, 203, character clustering. Processing means 204... Difference calculation means 205... Image type determination means 206.

特開２００８−２５２８６２号公報JP 2008-252862 A

Claims

Image data input means for performing image data input processing, character rectangle extraction processing means for extracting character rectangles from the image data, and at least one character rectangle obtained by the character rectangle extraction processing means Character clustering processing means for clustering character elements, difference calculation means for calculating a difference by superimposing the character elements clustered by the character clustering processing means, and a difference value calculated by the difference calculation means An image type determination unit that determines whether the difference value is equal to or greater than a predetermined threshold based on whether the image is a captured image indicating the optically read image data or a pure electronic image. Processing equipment.

The image processing apparatus according to claim 1,
Further, an image processing apparatus comprising means for switching the image processing for the image data to image processing for a captured image or pure electronic image based on the determination of the image type determination means.

An image data input step for performing image data input processing in the image processing apparatus, a character rectangle extraction processing step for performing character rectangle extraction processing from the image data, and at least one or more obtained by the character rectangle extraction processing step A character clustering step of clustering character elements from the character rectangle, a difference calculating step of calculating a difference by superimposing the character elements clustered by the character clustering step, and calculating by the difference calculating step An image type determination step for determining whether the difference value is equal to or greater than a predetermined threshold based on the determined difference value, whether the image is a captured image or a pure electronic image indicating the optically read image data; An image processing method comprising:

The image processing method according to claim 3,
The image processing method further comprises a step of switching the image processing for the image data to image processing for a captured image or a pure electronic image based on the determination in the image type determination step.

An image data input means for performing image data input processing, a character rectangle extraction processing means for performing character rectangle extraction processing from the image data, and a character from at least one character rectangle obtained by the character rectangle extraction processing means. Character clustering processing means for clustering elements, difference calculation means for calculating a difference by superimposing the character elements clustered by the character clustering processing means, based on the difference value calculated by the difference calculation means An image processing program that functions as an image type determination unit that determines whether the image is a captured image or a pure electronic image indicating optically read image data depending on whether or not the threshold value is greater than or equal to a predetermined threshold.

In the image processing program according to claim 5,
An image processing program for causing a computer to further function as means for switching image processing for the image data to image processing for a captured image or a pure electronic image based on the determination of the image type determination unit.