JP5589825B2

JP5589825B2 - Image processing apparatus, image processing method, and image processing program

Info

Publication number: JP5589825B2
Application number: JP2010285183A
Authority: JP
Inventors: 美佐子諏訪
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2010-12-21
Filing date: 2010-12-21
Publication date: 2014-09-17
Anticipated expiration: 2030-12-21
Also published as: JP2012133595A

Description

本発明は、画像処理装置等に関する。 The present invention relates to an image processing apparatus and the like.

近年、ＯＣＲ（Optical Character Reader）を利用して紙文書を電子化することが行われている。紙文書を電子化することで、利用者は書類の保管場所を節約できる。また、紙文書を電子化することで、文書の検索が容易になったり、文書の内容を再利用したりすることができる。 In recent years, paper documents have been digitized using an OCR (Optical Character Reader). By digitizing paper documents, users can save the storage space for documents. Further, by digitizing a paper document, it is possible to easily search for the document or reuse the contents of the document.

ここで、ＯＣＲが紙文書を電子化し、紙文書のレイアウトに対応した電子文書を作成するためには、文書画像から、文字、単語、行、文字ブロック等の切り出しを行う必要がある。 Here, in order for the OCR to digitize a paper document and create an electronic document corresponding to the layout of the paper document, it is necessary to cut out characters, words, lines, character blocks, and the like from the document image.

従来技術には、文書画像の黒画素を投影したヒストグラムを計算し、ヒストグラムと各閾値とを比較して文書画像から行を切り出すものがある。図１２〜図１４は、従来技術を説明するための図である。図１２の文書画像１ａは、紙文書を電子化した画像である。文書画像１ａの黒画素を投影してヒストグラムを計算すると、図１３のヒストグラム１ｂとなる。 In the prior art, there is a method in which a histogram obtained by projecting black pixels of a document image is calculated, and a line is cut out from the document image by comparing the histogram with each threshold value. 12-14 is a figure for demonstrating a prior art. A document image 1a in FIG. 12 is an image obtained by digitizing a paper document. When the histogram is calculated by projecting the black pixels of the document image 1a, the histogram 1b of FIG. 13 is obtained.

従来技術ではヒストグラム１ｂを計算した後に、幅Ｔ１以下の凸部分と幅Ｔ２以下の凹部分を削除することで平滑化を行う。ヒストグラム１ｂに対して平滑化を行うと、図１４のヒストグラム１ｃとなる。そして、従来技術では、隣接するヒストグラムの値の差分の絶対値と閾値Ｔ３とを比較して、絶対値が閾値Ｔ３を超えている位置を行として判定する。図１４に示す例では、位置ａと位置ｂとの間を行として判定する。 In the prior art, after calculating the histogram 1b, smoothing is performed by deleting a convex portion having a width T1 or less and a concave portion having a width T2 or less. When smoothing is performed on the histogram 1b, the histogram 1c of FIG. 14 is obtained. In the conventional technique, the absolute value of the difference between adjacent histogram values is compared with the threshold T3, and a position where the absolute value exceeds the threshold T3 is determined as a row. In the example illustrated in FIG. 14, the line between the position a and the position b is determined as a row.

上記従来技術では、文書画像から単語、行、文書ブロックを切り出す場合に、多くのパラメータを利用することになる。図１２〜図１４に示した従来技術では、行を特定するだけでＴ１〜Ｔ３のパラメータが必要となる。単語や文字ブロックを切り出す場合には、更に多くのパラメータを利用者が用意することになる。 In the above conventional technique, many parameters are used when a word, line, or document block is cut out from a document image. In the prior art shown in FIGS. 12 to 14, parameters T1 to T3 are required only by specifying a row. When a word or character block is cut out, the user prepares more parameters.

秋山照雄、増田功、「書式指定情報によらない紙面構成要素抽出法」、電子情報通信学会誌Ｊ６６−Ｄ、Ｎｏ．１Akiyama Teruo, Masuda Isao, “A Method for Extracting Constituent Elements of Paper Not Using Format Designation Information”, IEICE Journal J66-D, No. 1

ところで、画像は一般的に尺度に応じて観測される構造が異なる。以下では、この尺度をスケールと表記する。スケールは、画像の暈した場合の暈しの度合いに対応し、ガウス関数を使ったガウスフィルタで暈した場合は、ガウス関数の分散に等しい。更に、あるスケール値で暈した画像を、そのスケールでのスケール画像と表記する。また、ある構造を観測するために、最も適したスケールをその構造の固有スケールと表記する。文書画像では、この固有スケールが大局的に求まる場合がある。図１５は、スケールに応じて観測される構造が異なる文書画像の一例を示す図である。図１５に示すように、スケールを２ａとすると、観測される構造は単一文字の「Ｆ」となり、スケールを２ｂとすると、観測される構造は文字の「Ｅ」の集合となり、スケールを２ｃとすると、観測される構造は四角の集合となる。 By the way, an image generally has a different observed structure depending on a scale. Hereinafter, this scale is expressed as a scale. The scale corresponds to the degree of distortion when the image is blurred, and when scaled by a Gaussian filter using a Gaussian function, it is equal to the variance of the Gaussian function. Furthermore, an image that has been blurred at a certain scale value is referred to as a scale image at that scale. Moreover, the most suitable scale for observing a certain structure is described as the inherent scale of the structure. In a document image, this inherent scale may be obtained globally. FIG. 15 is a diagram illustrating an example of document images having different structures that are observed according to the scale. As shown in FIG. 15, when the scale is 2a, the observed structure is a single letter “F”, and when the scale is 2b, the observed structure is a set of letters “E”, and the scale is 2c. Then, the observed structure becomes a set of squares.

図１５に示したような文書画像から、文字やテクスチャ要素等の切り出しを行う場合には、利用者が文書画像を参照してスケールの値を適宜調整する必要があり、利用者にかかる負担が大きかった。なお、上記ヒストグラムを利用して文字や行等を判定する従来技術は、文字や行等を切り出す場合のパラメータを利用者自身が設定するものである。このため、従来技術では、図１５に示した文書画像から文字やテクスチャ要素を切り出す場合には、所望するスケールで文字やテクスチャ要素が認識できるように、各パラメータを利用者が設定する必要があり、利用者にかかる負担を軽減することはできない。 When cutting out characters, texture elements, and the like from a document image as shown in FIG. 15, it is necessary for the user to appropriately adjust the scale value with reference to the document image, and this imposes a burden on the user. It was big. Note that the conventional technique for determining characters, lines, and the like using the histogram described above sets parameters for cutting out characters, lines, etc. by the user himself. For this reason, in the prior art, when characters and texture elements are cut out from the document image shown in FIG. 15, the user needs to set each parameter so that the characters and texture elements can be recognized at a desired scale. The burden on users cannot be reduced.

開示の技術は、上記に鑑みてなされたものであって、大局的な固有スケールを自動的に導くことができ、かつその大局的な固有スケールにおいて観測される構造に対応する画像要素の領域を自動的に求めることができる画像処理装置、画像処理方法および画像処理プログラムを提供することを目的とする。 The disclosed technique has been made in view of the above, and is capable of automatically deriving a global eigenscale and determining a region of an image element corresponding to a structure observed in the global eigenscale. It is an object to provide an image processing apparatus, an image processing method, and an image processing program that can be automatically obtained.

開示の画像処理装置は、画像から暈かし具合の異なる複数のスケール画像を生成するスケール画像生成部を有する。また、開示の画像処理装置は、各スケール画像それぞれにおいて、スケール画像内で同一の濃度の持った画素の集合をブロブ画像として生成するブロブ画像生成部を有する。また、開示の画像処理装置は、画像の暈かし具合を示す値の変化に対するブロブ画像の面積または数の変化の割合が所定の閾値に収まる範囲の暈かし具合の値を計算する固有スケール計算部を有する。また、開示の画像処理装置は、固有スケール計算部が計算した暈かし具合の値に対応するスケール画像から抽出されるブロブ画像の領域を、文字認識を行う領域として出力する出力部を有することを要件とする。 The disclosed image processing apparatus includes a scale image generation unit that generates a plurality of scale images having different degrees of blur from the image. In addition, the disclosed image processing apparatus includes a blob image generation unit that generates, as a blob image, a set of pixels having the same density in the scale image in each scale image. Further, the disclosed image processing apparatus has a unique scale for calculating a degree of blurring in a range in which a ratio of a change in the area or number of blob images to a change in a value indicating the degree of blurring of an image falls within a predetermined threshold. It has a calculation part. The disclosed image processing apparatus further includes an output unit that outputs a blob image area extracted from the scale image corresponding to the degree of blur calculated by the inherent scale calculation unit as an area for character recognition. Is a requirement.

開示の画像処理装置によれば、大局的な固有スケールを自動的に導くことができ、かつその大局的な固有スケールにおいて観測される構造に対応する画像要素の領域を自動的に求めることができる。 According to the disclosed image processing apparatus, a global eigenscale can be automatically derived, and a region of an image element corresponding to a structure observed in the global eigenscale can be automatically obtained. .

図１は、画像処理装置の構成を示す図である。FIG. 1 is a diagram illustrating a configuration of an image processing apparatus. 図２ａは、文書画像を示す図である。FIG. 2a shows a document image. 図２ｂは、文書画像を固有スケール値ｔ_０〜ｔ_３での各スケール画像と、スケール画像から得られた各ブロブ画像とを示す図である。FIG. 2 b is a diagram illustrating each scale image of the document image with the inherent scale values t _{0 to} t ₃ and each blob image obtained from the scale image. 図３は、図２ａの画像のブロブ数とスケール値との関係を示す図である。FIG. 3 is a diagram showing the relationship between the number of blobs and the scale value of the image of FIG. 2a. 図４は、図３のグラフの１次微分の計算結果を示す図である。FIG. 4 is a diagram showing a calculation result of the first derivative of the graph of FIG. 図５は、各スケール画像から抽出される画像要素領域の一例を示す図（１）である。FIG. 5 is a diagram (1) illustrating an example of an image element region extracted from each scale image. 図６は、各スケール画像から抽出される画像要素領域の一例を示す図（２）である。FIG. 6 is a diagram (2) illustrating an example of an image element region extracted from each scale image. 図７は、各スケール画像から抽出される画像要素領域の一例を示す図（３）である。FIG. 7 is a diagram (3) illustrating an example of an image element region extracted from each scale image. 図８は、各スケール画像から抽出される画像要素領域の一例を示す図（４）である。FIG. 8 is a diagram (4) illustrating an example of an image element region extracted from each scale image. 図９は、固有スケール値を算出する処理の処理手順を示すフローチャートである。FIG. 9 is a flowchart illustrating a processing procedure of processing for calculating the unique scale value. 図１０は、画像要素領域の画像を切り出す処理の処理手順を示すフローチャートである。FIG. 10 is a flowchart showing a processing procedure of processing for cutting out an image of the image element area. 図１１は、実施例にかかる画像処理装置を構成するコンピュータのハードウェア構成を示す図である。FIG. 11 is a diagram illustrating a hardware configuration of a computer constituting the image processing apparatus according to the embodiment. 図１２は、従来技術を説明するための図（１）である。FIG. 12 is a diagram (1) for explaining the prior art. 図１３は、従来技術を説明するための図（２）である。FIG. 13 is a diagram (2) for explaining the prior art. 図１４は、従来技術を説明するための図（３）である。FIG. 14 is a diagram (3) for explaining the prior art. 図１５は、スケールに応じて観測される構造の異なる階層構造を持つ画像の一例を示す図である。FIG. 15 is a diagram illustrating an example of an image having a hierarchical structure with different structures observed according to the scale. 図１６は、画像要素領域判定部の処理を説明するための図である。FIG. 16 is a diagram for explaining processing of the image element region determination unit.

以下に、本願の開示する画像処理装置、画像処理方法および画像処理プログラムの実施例を図面に基づいて詳細に説明する。なお、この実施例によりこの発明が限定されるものではない。 Embodiments of an image processing apparatus, an image processing method, and an image processing program disclosed in the present application will be described below in detail with reference to the drawings. Note that the present invention is not limited to the embodiments.

本実施例にかかる画像処理装置の構成について説明する。図１は、画像処理装置の構成を示す図である。図１に示すように、この画像処理装置１００は、入力部１１０、画像読み取り部１２０、表示部１３０、スケール計算処理部１４０、画像要素切り出し部１５０を有する。 The configuration of the image processing apparatus according to the present embodiment will be described. FIG. 1 is a diagram illustrating a configuration of an image processing apparatus. As illustrated in FIG. 1, the image processing apparatus 100 includes an input unit 110, an image reading unit 120, a display unit 130, a scale calculation processing unit 140, and an image element cutout unit 150.

入力部１１０は、キーボードやマウス等の入力装置に対応する。画像読み取り部１２０は、紙媒体などから画像を読み取り、読み取った画像データをスケール計算処理部１４０、画像要素切り出し部１５０に出力する。例えば、画像読み取り部１２０は、スキャナに対応する。本実施例では、取り扱う画像をグレー画像または白黒の２値画像とする。なお、画像処理装置１００は、カラー画像が入力された場合には、明度変換を行い、カラー画像をグレー画像に変換してもよい。表示部１３０は、ディスプレイなどの表示装置に対応する。 The input unit 110 corresponds to an input device such as a keyboard or a mouse. The image reading unit 120 reads an image from a paper medium or the like, and outputs the read image data to the scale calculation processing unit 140 and the image element clipping unit 150. For example, the image reading unit 120 corresponds to a scanner. In this embodiment, the handled image is a gray image or a monochrome binary image. Note that when a color image is input, the image processing apparatus 100 may perform brightness conversion and convert the color image into a gray image. The display unit 130 corresponds to a display device such as a display.

スケール計算処理部１４０は、階層構造を有する画像データ固有の大局的なスケールを計算する処理部である。スケール計算処理部１４０は、スケール画像生成部１４１、ブロブ画像生成部１４２、固有スケール計算部１４３を有する。 The scale calculation processing unit 140 is a processing unit that calculates a global scale unique to image data having a hierarchical structure. The scale calculation processing unit 140 includes a scale image generation unit 141, a blob image generation unit 142, and a unique scale calculation unit 143.

スケール画像生成部１４１は、スケール値を多段階に変えたガウスフィルタを画像データに適用することで、複数のスケール画像を生成する処理部である。スケール画像生成部１４１は、複数のスケール画像を該当するスケール値と対応付けてブロブ画像生成部１４２に出力する。 The scale image generation unit 141 is a processing unit that generates a plurality of scale images by applying a Gaussian filter with scale values changed in multiple stages to image data. The scale image generation unit 141 outputs a plurality of scale images to the blob image generation unit 142 in association with the corresponding scale values.

ここで、スケール画像生成部１４１が利用するガウスフィルタは、ガウス関数を使って定義されるフィルタである。このガウスフィルタは、画像を暈かす場合に利用する。式（１）は、２次元のガウス関数ｇを示すものである。式（１）において、ｔはスケール値を示す。このスケール値は、暈かす度合いに対応するものである。また、ｘ、ｙは、画像上の座標を示すものである。
Here, the Gaussian filter used by the scale image generation unit 141 is a filter defined using a Gaussian function. This Gaussian filter is used when blurring an image. Equation (1) represents a two-dimensional Gaussian function g. In formula (1), t represents a scale value. This scale value corresponds to the degree of blur. X and y indicate coordinates on the image.

原画像をＩで表すと、この原画像Ｉとガウス関数で暈かした後の画像Ｂとの関係は、下記の式（２）、（３）で表される。式（２）の（ｘ、ｙ）は、着目画素の座標である。
When the original image is represented by I, the relationship between the original image I and the image B after being blurred by a Gaussian function is expressed by the following equations (2) and (3). In equation (2), (x, y) is the coordinates of the pixel of interest.

ここで、式（２）が成り立つのは、画像が連続でサイズが無限大の理想的な場合である。実際の画像は、離散的であり、サイズは有限である。このため、原画像Ｉと画像Ｂとの関係は、下記の式（４）、（５）で表される。式（４）のように、原画像Ｉと画像Ｂとの関係は、積分の代わりに和で表される。
Here, the equation (2) holds in an ideal case where the images are continuous and the size is infinite. Actual images are discrete and have a finite size. For this reason, the relationship between the original image I and the image B is represented by the following formulas (4) and (5). As in Expression (4), the relationship between the original image I and the image B is represented by a sum instead of integration.

式（４）、（５）では、ガウスフィルタとして、ガウス関数の代わりに有限サイズの行列が用いられる。また、ガウスフィルタの高さ、幅のサイズを２ｓ＋１とする。ｓは整数である。式（４）の（ｉ、ｊ）は、着目画素の座標である。ガウスフィルタのサイズ２ｓ＋１のｓは、一般的にｔの定数倍として設計される。例えば、ｓ＝ｔ、ｔ＝１とし、ガウスフィルタの中心を（ｍ、ｎ）＝（０、０）とした場合のガウスフィルタは、式（６）によって表される。
In equations (4) and (5), a finite size matrix is used instead of a Gaussian function as a Gaussian filter. The height and width of the Gaussian filter are 2s + 1. s is an integer. In equation (4), (i, j) is the coordinates of the pixel of interest. The Gaussian filter size 2s + 1 s is generally designed as a constant multiple of t. For example, the Gaussian filter when s = t, t = 1 and the center of the Gaussian filter is (m, n) = (0, 0) is expressed by Expression (6).

ブロブ画像生成部１４２は、各スケール画像からブロブ画像を生成する処理部である。ブロブ画像生成部１４２は、スケール画像を２値化し連結成分解析を行うことで、一つの連結成分を一つのブロブとする。このブロブは、スケール画像内で、背景より暗い、または明るい画素の集合といえる。ブロブ画像生成部１４２は、抽出した全てのブロブに対し、例えば背景領域が１ならブロブ領域は０というように、背景とは異なる値を付与した画像をブロブ画像とする。 The blob image generation unit 142 is a processing unit that generates a blob image from each scale image. The blob image generation unit 142 binarizes the scale image and performs connected component analysis, thereby setting one connected component as one blob. This blob can be said to be a collection of pixels darker or brighter than the background in the scale image. For each extracted blob, the blob image generation unit 142 sets an image to which a value different from the background is assigned, for example, if the background area is 1 and the blob area is 0, as the blob image.

例えば、ブロブ画像生成部１４２は、スケール値の各画素値と、閾値とを比較し、比較結果に基づいて、スケール画像を２値化することでブロブ画像を生成する。例えば、ブロブ画像生成部１４２は、スケール画像の各画素において、閾値以上となる画素値を１とし、閾値よりも小さい画素値を０に設定することでスケール画像を２値化する。ブロブ画像生成部１４２は、スケール画像毎に、ブロブ画像を生成する。 For example, the blob image generation unit 142 compares each pixel value of the scale value with a threshold value, and generates a blob image by binarizing the scale image based on the comparison result. For example, the blob image generation unit 142 binarizes the scale image by setting a pixel value equal to or greater than the threshold value to 1 and setting a pixel value smaller than the threshold value to 0 for each pixel of the scale image. The blob image generation unit 142 generates a blob image for each scale image.

図２ａは、文書画像を示す図である。図２ｂは、文書画像からスケール値ｔ_０〜ｔ_３で得られた各スケール画像と、スケール画像から得られた各ブロブ画像とを示す図である。スケール値の大小関係は、ｔ_０＜ｔ_１＜ｔ_２＜ｔ_３とする。図２ｂに示すように、ブロブ画像生成部１４２は、スケール値ｔ_０によって暈かされたスケール画像１０ｂを２値化し、ブロブ画像１０ｃを生成する。スケール値ｔ_０を０とする場合には、ブロブ画像生成部１４２は、図２ａの文書画像をそのまま用いてもよい。 FIG. 2a shows a document image. FIG. 2B is a diagram illustrating each scale image obtained from the document image with the scale values t _{0 to} t ₃ and each blob image obtained from the scale image. The magnitude relationship between the scale values is t ₀ <t ₁ <t ₂ <t ₃ . As shown in FIG. 2b, blob image generation unit 142, a halo or has been scaled image 10b by the scale value _{t 0} binarized to generate a blob image 10c. When the scale value t ₀ and 0, blob image generating unit 142, may be used as the document image in Figure 2a.

図２ｂの上から２段目に示すように、ブロブ画像生成部１４２は、スケール値ｔ_１によって暈かされたスケール画像１０ｄを２値化し、ブロブ画像１０ｅを生成する。図２ｂの上から３段目に示すように、ブロブ画像生成部１４２は、スケール値ｔ_２によって暈かされたスケール画像１０ｆを２値化し、ブロブ画像１０ｇを生成する。図２ｂの上から４段目に示すように、ブロブ画像生成部１４２は、スケール値ｔ_３によって暈かされたスケール画像１０ｈを２値化し、ブロブ画像１０ｉを生成する。図２ｂに示すように、スケール値が大きくなるにしたがい、ブロブの数が減少し、ブロブの面積の平均値が大きくなる。 As shown in the second row from the top in FIG. 2b, blob image generation unit 142, a halo or has been scaled image 10d by the scale factor t ₁ binarized to generate a blob image 10e. As shown in the third row from the top in FIG. 2b, blob image generation unit 142, a halo or has been scaled image 10f by the scale value t ₂ binarized to generate a blob image 10g. As shown in the fourth stage from the top in FIG. 2b, blob image generation unit 142, a halo or has been scaled image 10h by the scale value t ₃ is binarized to generate a blob image 10i. As shown in FIG. 2b, as the scale value increases, the number of blobs decreases and the average value of the blob area increases.

ブロブ画像生成部１４２は、スケール画像毎にブロブ画像を生成した後に、スケール画像毎に全体ブロブ情報を生成し、生成した全体ブロブ情報を固有スケール計算部１４３に出力する。 The blob image generation unit 142 generates a blob image for each scale image, then generates overall blob information for each scale image, and outputs the generated overall blob information to the eigenscale calculation unit 143.

ここで、全体ブロブ情報は、スケール値と、スケール画像と、ブロブ画像と、ブロブ数と、個別ブロブ情報を有する。この個別ブロブ情報は、同一のブロブ画像に含まれるブロブ毎に存在する。個別ブロブ情報は、ブロブの面積、ブロブの外接矩形の頂点の座標、ブロブの重心座標、ブロブ画素座標の配列を有する。 Here, the entire blob information includes a scale value, a scale image, a blob image, the number of blobs, and individual blob information. This individual blob information exists for each blob included in the same blob image. The individual blob information includes an array of blob areas, coordinates of vertices of circumscribed rectangles of the blob, barycentric coordinates of the blob, and blob pixel coordinates.

例えば、図２ａの上から４段目に対応する全体ブロブ情報について説明すると、この全体ブロブ情報は、スケール値「ｔ_３」、スケール画像１０ｈ、ブロブ画像１０ｉ、ブロブ数「３」、ブロブ１ａ、１ｂ、１ｃにそれぞれ対応する複数の個別ブロブ情報を含む。 For example, the overall blob information corresponding to the fourth row from the top in FIG. 2a will be described. The overall blob information includes the scale value “t ₃ ”, the scale image 10 h, the blob image 10 i, the blob number “3”, the blob 1 a, A plurality of individual blob information respectively corresponding to 1b and 1c is included.

図１の説明に戻る。固有スケール計算部１４３は、全体ブロブ情報に基づいて、複数種類の固有スケール値を算出する処理部である。各固有スケール値は、画像データから画像要素を切り出す場合に利用される値である。例えば、固有スケール値は、文書画像に含まれる、文字、単語、行、文字ブロックの領域を切り出す場合に利用される。固有スケール計算部１４３は、各固有スケール値と、固有スケール値に対応するスケール画像と、固有スケール値に対応する全体ブログ情報と、元の文書画像とを画像要素切り出し部１５０に出力する。 Returning to the description of FIG. The unique scale calculation unit 143 is a processing unit that calculates a plurality of types of unique scale values based on the entire blob information. Each unique scale value is a value used when an image element is cut out from image data. For example, the unique scale value is used when a region of characters, words, lines, and character blocks included in a document image is cut out. The unique scale calculation unit 143 outputs each unique scale value, the scale image corresponding to the unique scale value, the entire blog information corresponding to the unique scale value, and the original document image to the image element cutout unit 150.

ここで、ブロブ情報に含まれるブロブ数とスケール値との関係について説明する。図３は、図２ａの画像のブロブ数とスケール値との関係を示す図である。図３の横軸は、スケール値ｔに対応する軸であり、縦軸は、ブロブ数をスケール値の関数とみなしたとき、ブロブ数の対数ｌｎｆ（ｔ）に対応する軸である。例えば、文書画像のように、階層構造を持つ画像からブロブ数とスケール値との関係を求めると、スケール値を変化させてもブロブ数が変化しない定常状態が発生する。図３に示す例では、部分２０ａ、２０ｂ、２０ｃ、２０ｄにおいて定常状態となっている。この定常状態は、プラトーと呼ばれる。なお、異なる定常状態の間の状態は、急激にブロブ数が変化しており、この状態を状態遷移とする。 Here, the relationship between the number of blobs included in the blob information and the scale value will be described. FIG. 3 is a diagram showing the relationship between the number of blobs and the scale value of the image of FIG. 2a. The horizontal axis in FIG. 3 is an axis corresponding to the scale value t, and the vertical axis is an axis corresponding to the logarithm ln f (t) of the blob number when the blob number is regarded as a function of the scale value. For example, when the relationship between the number of blobs and the scale value is obtained from an image having a hierarchical structure such as a document image, a steady state occurs in which the number of blobs does not change even if the scale value is changed. In the example shown in FIG. 3, the portions 20a, 20b, 20c, and 20d are in a steady state. This steady state is called a plateau. It should be noted that the number of blobs rapidly changes between different steady states, and this state is referred to as state transition.

固有スケール計算部１４３は、定常状態の開始点に対応するスケール値を固定スケール値として算出する。図３に示す例では、定常状態２０ａの開始点に対応するスケール値がｔ_０となる。また、定常状態２ｂの開始点に対応するスケール値がｔ_１となる。また、定常状態２ｃの開始点に対応するスケール値がｔ_２となる。また、定常状態２０ｄの開始点に対応するスケール値がｔ_３となる。この場合には、固有スケール計算部１４３は、スケール値ｔ_０、ｔ_１、ｔ_２、ｔ_３を固有スケールとして算出する。 The inherent scale calculation unit 143 calculates a scale value corresponding to the starting point of the steady state as a fixed scale value. In the example shown in FIG. 3, the scale values corresponding to the start point of the steady-state 20a becomes t _0. Furthermore, the scale value corresponding to the start point of the steady-state 2b is t _1. Furthermore, the scale value corresponding to the start point of the steady-state 2c is t _2. Furthermore, the scale value corresponding to the start point of the steady-state 20d is t _3. In this case, the inherent scale calculation unit 143 calculates the scale values t ₀ , t ₁ , t ₂ , and t ₃ as inherent scales.

固有スケール計算部１４３の処理を具体的に説明する。まず、固有スケール計算部１４３は、ブロブ数の対数とスケール値との関係を示すグラフに対して、１次微分ｄｆ（ｔ）／ｄｔと２次微分ｄ^２ｆ（ｔ）／ｄｔを数値的に計算する。図４は、図３のグラフの１次微分の計算結果を示す図である。 The processing of the inherent scale calculation unit 143 will be specifically described. First, the eigenscale calculation unit 143 numerically calculates the first derivative df (t) / dt and the second derivative d ² f (t) / dt with respect to a graph indicating the relationship between the logarithm of the blob number and the scale value. To calculate. FIG. 4 is a diagram showing a calculation result of the first derivative of the graph of FIG.

固有スケール計算部１４３は、１次微分と２次微分とを計算した後に、１次微分が極小となるスケール値ｔ_ｍを全て検出する。１次微分が極小となる点は、２次微分の値が０となることに等しい。図４に示す例では、１次微分が極小となるスケール値は、ｔ_Ｍ０、ｔ_Ｍ１、ｔ_Ｍ２となる。 After calculating the first derivative and the second derivative, the inherent scale calculator 143 detects all the scale values t _m at which the first derivative is minimized. The point at which the first derivative is minimized is equivalent to the value of the second derivative being zero. In the example shown in FIG. 4, the scale values at which the first derivative is minimized are t _M0 , t _M1 , and t _M2 .

固有スケール計算部１４３は、１次微分が極小となるスケール値を求めた後に、このスケール値を徐々に増加させていき、１次微分の絶対値が初めて所定の閾値より小さくなるスケール値を算出する。このスケール値が、定常状態の開始点に対応するスケール値となる。図４に示す例では、１次微分の絶対値が初めて所定の閾値より小さくなるスケール値は、ｔ_１、ｔ_２、ｔ_３となる。 The eigenscale calculation unit 143 obtains a scale value at which the first derivative is minimized, and then gradually increases the scale value to calculate a scale value at which the absolute value of the first derivative becomes smaller than a predetermined threshold value for the first time. To do. This scale value becomes the scale value corresponding to the starting point of the steady state. In the example shown in FIG. 4, the scale values at which the absolute value of the first derivative becomes smaller than the predetermined threshold for the first time are t ₁ , t ₂ , and t ₃ .

また、固有スケール計算部１４３は、スケール値の初期値ｔ_０を検出する。このスケール値ｔ_０は、図３の定常状態２０ａの開始点に対応する。ｔ_０＝０として、暈かさない文書画像の明度画像としてもよい。固有スケール計算部１４３は、スケール値ｔ_０、ｔ_１、ｔ_２、ｔ_３を固有スケール値として、画像要素切り出し部１５０に出力する。１次微分の絶対値が初めて所定の閾値より小さくなるスケール値の数がＭ個の場合には、スケール値の初期値を合わせたＭ＋１個のスケール値が固有スケール値として検出されることになる。 Also, unique scale calculator 143 detects the initial value t ₀ of the scale values. The scale value t ₀ corresponds to the beginning of the steady state 20a in FIG. As t 0 _{= 0,} may be the brightness image of the document image not umbrella halo. The inherent scale calculation unit 143 outputs the scale values t ₀ , t ₁ , t ₂ , and t ₃ as inherent scale values to the image element cutout unit 150. When the number of scale values whose absolute value of the first derivative becomes smaller than a predetermined threshold value for the first time is M, M + 1 scale values including the initial values of the scale values are detected as unique scale values. .

図１の説明に戻る。画像要素切り出し部１５０は、固有スケール値を利用して、画像データから画像要素領域の画像を切り出す処理部である。例えば、画像要素領域の画像は、文字、単語、行、文字ブロックに対応する。この画像要素切り出し部１５０は、画像２値化部１５１、画像要素領域判定部１５２、切り出し部１５３を有する。画像要素切り出し部１５０は、出力部の一例である。 Returning to the description of FIG. The image element cutout unit 150 is a processing unit that cuts out an image of the image element region from the image data using the unique scale value. For example, the image in the image element area corresponds to a character, a word, a line, and a character block. The image element cutout unit 150 includes an image binarization unit 151, an image element region determination unit 152, and a cutout unit 153. The image element cutout unit 150 is an example of an output unit.

画像２値化部１５１は、画像読み取り部１２０から画像データを取得した場合に、画像データに含まれる画素を白画素と黒画素に２値化する処理部である。画像２値化部１５１は、文書画像を２値化した画像データを画像要素領域判定部１５２に出力する。 The image binarization unit 151 is a processing unit that binarizes pixels included in the image data into white pixels and black pixels when image data is acquired from the image reading unit 120. The image binarization unit 151 outputs image data obtained by binarizing the document image to the image element region determination unit 152.

画像要素領域判定部１５２は、画像データと固有スケール値と全体ブロブ情報に基づいて、画像データに含まれる画像要素の領域を判定する処理部である。例えば、文書画像データに含まれる画像要素は、文字、単語、行、文字ブロックに対応する。 The image element region determination unit 152 is a processing unit that determines the region of the image element included in the image data based on the image data, the unique scale value, and the entire blob information. For example, the image elements included in the document image data correspond to characters, words, lines, and character blocks.

具体的に、画像要素領域判定部１５２の処理を説明する。画像要素領域判定部１５２は、２値化した文書画像に対してラベリング処理をおこない、各連結成分の外接矩形を抽出する。画像要素領域判定部１５２は、全体ブロブ情報を基に、２値化した文書画像と、各ブロブの外接矩形とを比較して、ブロブの外接矩形に含まれる連結成分を判定する。そして、画像要素領域判定部１５２は、判定した連結成分全体の外接矩形を、該当する固有スケールでの画像要素領域として抽出する。画像要素領域判定部１５２は、抽出した外接矩形の座標を、画像要素領域の座標として切り出し部１５３に出力する。 Specifically, the processing of the image element region determination unit 152 will be described. The image element region determination unit 152 performs a labeling process on the binarized document image and extracts a circumscribed rectangle of each connected component. The image element region determination unit 152 compares the binarized document image with the circumscribed rectangle of each blob based on the entire blob information, and determines a connected component included in the circumscribed rectangle of the blob. Then, the image element region determination unit 152 extracts a circumscribed rectangle of the determined entire connected component as an image element region on the corresponding inherent scale. The image element area determination unit 152 outputs the extracted coordinates of the circumscribed rectangle to the clipping unit 153 as the coordinates of the image element area.

図１６は、画像要素領域判定部の処理を説明するための図である。図１６の２ａは、ある固有スケールのブロブに対応する。画像要素領域判定部１５２は、２ｂに示すように、２値化した文書画像とブロブの外接矩形とを比較すると、ブロブの外接矩形に含まれる連結成分は「Ｂｏｏｋ」となる。画像要素領域判定部１５２は、判定した連結成分全体の外接矩形２ｃを、該当する固有スケールでの画像要素領域として抽出する。 FIG. 16 is a diagram for explaining processing of the image element region determination unit. FIG. 16 2a corresponds to a blob of a certain inherent scale. As shown in 2b, the image element region determination unit 152 compares the binarized document image with the circumscribed rectangle of the blob, and the connected component included in the circumscribed rectangle of the blob is “Book”. The image element region determination unit 152 extracts the circumscribed rectangle 2c of the entire determined connected component as an image element region on the corresponding inherent scale.

なお、画像要素領域判定部１５２は、重心がブロブの外接矩形に含まれていない文字を除外する。例えば、図１６の２ｂにおいて、「Ｂｏｏｋ」の「Ｂ」の重心がブロブの外接矩形に含まれておらず、「ｏ」、「ｏ」、「ｋ」の重心がブロブの外接矩形に含まれている場合には、連結成分の外接矩形は、「ｏｏｋ」の外接矩形となる。なお、画像要素領域判定部１５２は、文字の外接矩形から文字の中心を求め、文字の中心を重心の代わりに用いてもよい。 Note that the image element region determination unit 152 excludes characters whose center of gravity is not included in the circumscribed rectangle of the blob. For example, in 2b of FIG. 16, the center of gravity of “B” of “Book” is not included in the circumscribed rectangle of the blob, and the centers of gravity of “o”, “o”, and “k” are included in the circumscribed rectangle of the blob. The circumscribed rectangle of the connected component is the circumscribed rectangle of “ok”. Note that the image element region determination unit 152 may obtain the center of the character from the circumscribed rectangle of the character and use the center of the character instead of the center of gravity.

図５〜図８は、各ブロブ画像から決定される画像要素領域の一例を示す図である。図５は、固有スケール値ｔ_１に対応するブロブ画像から決定した、文書画像上の画像要素領域を示す。図５に示す画像要素領域は、単語の領域に対応する。図６は、固有スケール値ｔ_２に対応するブロブ画像から決定した、文書像上の画像要素領域を示す。図６に示す画像領域要素領域は、行の領域に対応する。 5 to 8 are diagrams illustrating an example of image element regions determined from each blob image. FIG. 5 shows an image element area on the document image determined from the blob image corresponding to the unique scale value t ₁ . The image element area shown in FIG. 5 corresponds to a word area. 6 was determined from the blob image corresponding to the specific scale value t _2, shows an image element regions on the document image. The image area element area shown in FIG. 6 corresponds to a line area.

図７は、固有スケール値ｔ_３に対応するブロブ画像から決定した、文書画像上の画像要素領域を示す。図７に示す画像領域要素領域は、文字ブロックの領域に対応する。図８は、固有スケール値ｔ_４に対応するブロブ画像から決定した、文書画像上の画像要素領域を示す。 7 was determined from the blob image corresponding to the specific scale value t _3, shows an image element regions on the document image. The image region element region shown in FIG. 7 corresponds to a character block region. Figure 8 were determined from blob image corresponding to the specific scale value t _4, shows an image element regions on the document image.

切り出し部１５３は、画像要素領域の座標を基にして、元の文書画像、または、文書画像の２値画像から、画像要素領域の画像を切り出す処理部である。画像要素領域の画像は、例えば、文字、単語、行、文字ブロックに対応する。切り出し部１５３は、切り出した画像要素領域の画像を、表示部１３０に出力する。切り出し部１５３が切り出す画像要素領域は、各固有スケール値によって生成されるブロブ画像の領域に重心が含まれるもとの文書画像の２値画像の連結成分全体の外接矩形に対応する。なお、切り出し部１５３は、文字認識などを行う装置に、画像要素領域の画像データを出力してもよい。 The cutout unit 153 is a processing unit that cuts out the image element region image from the original document image or the binary image of the document image based on the coordinates of the image element region. The image of the image element area corresponds to, for example, a character, a word, a line, and a character block. The cutout unit 153 outputs the cut image element region image to the display unit 130. The image element region cut out by the cutout unit 153 corresponds to the circumscribed rectangle of the entire connected component of the binary image of the original document image in which the center of gravity is included in the region of the blob image generated by each unique scale value. Note that the cutout unit 153 may output the image element region image data to a device that performs character recognition and the like.

ところで、画像要素領域の切り出し対象となる画像が、図１５に示したような階層構造を持つ画像の場合には、画像要素切り出し部１５０は、画像要素領域２ａ、複数の画像要素領域２ｂ等が切り出されることとなる。画像要素切り出し部１５０が切り出した画像要素領域の画像は、文字認識など目的に応じて切り出される領域となる。 Incidentally, when the image to be cut out of the image element area is an image having a hierarchical structure as shown in FIG. 15, the image element cutout unit 150 includes the image element area 2a, the plurality of image element areas 2b, and the like. It will be cut out. The image of the image element area cut out by the image element cutout unit 150 is an area cut out according to the purpose such as character recognition.

次に、画像処理装置１００が、固有スケール値を算出する処理の処理手順について説明する。図９は、固有スケール値を算出する処理の処理手順を示すフローチャートである。図９に示す処理は、例えば、画像読み取り部１２０が画像データを読み取ったことを契機として実行される。図９に示すように、画像処理装置１００は、スケール値ｔの異なるガウスフィルタを画像データに適用することで、複数のスケール画像を生成する（ステップＳ１０１）。 Next, a processing procedure of processing in which the image processing apparatus 100 calculates the unique scale value will be described. FIG. 9 is a flowchart illustrating a processing procedure of processing for calculating the unique scale value. The process illustrated in FIG. 9 is executed, for example, when the image reading unit 120 reads image data. As shown in FIG. 9, the image processing apparatus 100 generates a plurality of scale images by applying Gaussian filters having different scale values t to the image data (step S101).

画像処理装置１００は、各スケール画像から全体ブロブ画像情報を抽出する（ステップＳ１０２）。画像処理装置１００は、スケールの変化に対するブロブ画像の状態変化を基にして、固有スケール値を算出する（ステップＳ１０３）。そして、画像処理装置１００は、固有スケール値を出力する（ステップＳ１０４）。 The image processing apparatus 100 extracts the entire blob image information from each scale image (step S102). The image processing apparatus 100 calculates a unique scale value based on the blob image state change with respect to the scale change (step S103). Then, the image processing apparatus 100 outputs a unique scale value (step S104).

次に、画像処理装置１００が、画像要素領域の画像を切り出す処理の処理手順について説明する。図１０は、画像要素領域の画像を切り出す処理の処理手順を示すフローチャートである。図１０に示す処理は、例えば、画像要素切り出し部１５０が、スケール計算処理部１４０から固有スケール値を取得したことを契機に実行される。 Next, a processing procedure of processing in which the image processing apparatus 100 cuts out an image of the image element area will be described. FIG. 10 is a flowchart showing a processing procedure of processing for cutting out an image of the image element area. The process illustrated in FIG. 10 is executed, for example, when the image element cutout unit 150 acquires the unique scale value from the scale calculation processing unit 140.

図１０に示すように、画像処理装置１００は、固有スケール値を取得し（ステップＳ２０１）、未選択の固有スケール値を選択する（ステップＳ２０２）。画像処理装置１００は、選択した固有スケール値において、画像要素領域を算出する（ステップＳ２０３）。 As shown in FIG. 10, the image processing apparatus 100 acquires a unique scale value (step S201), and selects an unselected unique scale value (step S202). The image processing apparatus 100 calculates an image element region at the selected unique scale value (step S203).

画像処理装置１００は、元の文書画像、または元の文書画像の２値画像から画像要素領域の画像を切り出し（ステップＳ２０４）、固有スケール値を全て選択したか否かを判定する（ステップＳ２０５）。画像処理装置１００は、固有スケール値を全て選択していない場合には（ステップＳ２０５，Ｎｏ）、ステップＳ２０２に移行する。一方、画像処理装置１００は、固有スケール値を全て選択した場合には（ステップＳ２０５，Ｙｅｓ）、切り出した画像を出力する（ステップＳ２０６）。 The image processing apparatus 100 cuts out the image element region image from the original document image or the binary image of the original document image (step S204), and determines whether all the unique scale values have been selected (step S205). . If all the unique scale values have not been selected (step S205, No), the image processing apparatus 100 proceeds to step S202. On the other hand, when all the unique scale values are selected (step S205, Yes), the image processing apparatus 100 outputs the cut image (step S206).

次に、本実施例にかかる画像処理装置１００の効果について説明する。画像処理装置１００は、画像データからスケール値の異なる複数のスケール画像を生成し、各スケール画像からブロブ画像を生成する。そして、画像処理装置１００は、ブロブ画像の数とスケール値との関係に基づいて、ブロブ画像の数の変化の割合が所定の閾値に収まるスケール値を固有スケール値として計算する。画像処理装置１００は、固有スケール値によって生成されるブロブ画像のブロブに重心が含まれる文書画像の２値画像の連結成分全体の外接矩形を、前記固有スケールにおける画像要素領域とし、対応する文書画像、または文書画像の２値画像の領域を、前記固有スケールでの画像要素として出力する。このため、画像が階層構造を有する画像であったとしても、各階層構造での画像要素領域の切り出しをすべき固有スケール値を自動的に求めることができる。 Next, effects of the image processing apparatus 100 according to the present embodiment will be described. The image processing apparatus 100 generates a plurality of scale images having different scale values from the image data, and generates a blob image from each scale image. Then, based on the relationship between the number of blob images and the scale value, the image processing apparatus 100 calculates a scale value at which the rate of change in the number of blob images falls within a predetermined threshold value as a unique scale value. The image processing apparatus 100 sets a circumscribed rectangle of the entire connected component of the binary image of the document image in which the centroid is included in the blob of the blob image generated based on the unique scale value as an image element region in the unique scale, and the corresponding document image. Or a binary image area of the document image is output as an image element on the inherent scale. For this reason, even if the image is an image having a hierarchical structure, it is possible to automatically obtain a unique scale value to which an image element region should be cut out in each hierarchical structure.

例えば、本実施例にかかる画像処理装置１００を利用すれば、図１５に示すような階層構造を有する画像であっても、文字認識を行う領域を自動的に出力することができる。画像処理装置１００は、図１５に示す画像から、文字認識を行う領域２ａ、複数の領域２ｂを自動的に求めることができる。 For example, if the image processing apparatus 100 according to the present embodiment is used, an area for character recognition can be automatically output even for an image having a hierarchical structure as shown in FIG. The image processing apparatus 100 can automatically obtain the area 2a for character recognition and the plurality of areas 2b from the image shown in FIG.

また、本実施例にかかる画像処理装置１００を利用すれば、文書画像のような階層構造を有する画像から、文字、単語、行、文字ブロックの領域を自動的に出力することができる。このうち、単語、行、文字ブロック等の領域の情報は、画像のレイアウトを電子データに復元する場合に利用することができる。また、単語の領域の情報は、レイアウトの復元のみだけではなく、該領域に対する文字認識結果に対し、単語のスペルチェックを行う場合にも利用することができる。 Further, if the image processing apparatus 100 according to the present embodiment is used, it is possible to automatically output areas of characters, words, lines, and character blocks from an image having a hierarchical structure such as a document image. Among these, information on areas such as words, lines, and character blocks can be used when the image layout is restored to electronic data. The information on the word area can be used not only for restoring the layout, but also when checking the spelling of the word against the character recognition result for the area.

ところで、固有スケール計算部１４３は、ブロブ数とスケール値との関係に基づいて、固有スケール値を計算していたが、これに限定されるものではない。例えば、固有スケール計算部１４３は、ブロブの平均面積とスケール値との関係に基づいて、固有スケール値を計算してもよい。すなわち、固有スケール計算部１４３は、スケール値の変化に対して、ブロブの平均面積の比が変化しない定常状態を検出し、この定常状態に対応するスケール値を固有スケール値として算出してもよい。 Incidentally, the eigenscale calculation unit 143 calculates the eigenscale value based on the relationship between the number of blobs and the scale value, but is not limited to this. For example, the unique scale calculation unit 143 may calculate the unique scale value based on the relationship between the average area of the blob and the scale value. That is, the eigenscale calculation unit 143 may detect a steady state where the ratio of the average area of the blob does not change with respect to the change in the scale value, and calculate the scale value corresponding to the steady state as the eigenscale value. .

また、固有スケール計算部１４３は、スケール画像の極大点の数と、スケール値との関係に基づいて、固有スケール値を計算してもよい。例えば、固有スケール計算部１４３は、スケール値の変化に対して、ブロブ画像の極大点の数が変化しない定常状態を検出し、この定常状態に対応するスケール値を固有スケール値として算出してもよい。 In addition, the unique scale calculation unit 143 may calculate the unique scale value based on the relationship between the number of maximum points of the scale image and the scale value. For example, the eigenscale calculation unit 143 detects a steady state in which the number of local maximum points of the blob image does not change with respect to the change in the scale value, and calculates the scale value corresponding to the steady state as the eigenscale value. Good.

なお、図１に示したスケール計算処理部１４０、画像要素切り出し部１５０は、例えば、ＡＳＩＣ（Application Specific Integrated Circuit）や、ＦＰＧＡ（Field Programmable Gate Array）などの集積装置に対応する。または、上記処理部１４０，１５０は、ＣＰＵ（Central Processing Unit）やＭＰＵ（Micro Processing Unit）等の電子回路に対応する。 Note that the scale calculation processing unit 140 and the image element cutout unit 150 illustrated in FIG. 1 correspond to an integrated device such as an ASIC (Application Specific Integrated Circuit) or an FPGA (Field Programmable Gate Array). Alternatively, the processing units 140 and 150 correspond to electronic circuits such as a CPU (Central Processing Unit) and an MPU (Micro Processing Unit).

画像処理装置１００は、既知のパーソナルコンピュータ、ワークステーション、携帯電話、ＰＨＳ端末、移動体通信端末またはＰＤＡなどの情報処理装置に、画像処理装置１００の各機能を搭載することによって実現することもできる。 The image processing apparatus 100 can also be realized by mounting each function of the image processing apparatus 100 on an information processing apparatus such as a known personal computer, workstation, mobile phone, PHS terminal, mobile communication terminal, or PDA. .

図１１は、実施例にかかる画像処理装置を構成するコンピュータのハードウェア構成を示す図である。図１１に示すように、このコンピュータ３０は、各種演算処理を実行するＣＰＵ３１と、ユーザからのデータの入力を受け付ける入力装置３２と、モニタ３３を有する。また、コンピュータ３０は、記憶媒体からプログラム等を読取る媒体読み取り装置３４と、ネットワークを介して他のコンピュータとの間でデータの授受を行うネットワークインターフェース装置３５を有する。また、コンピュータ３０は、画像を読み取るスキャナ３６と、各種情報を一時記憶するＲＡＭ３７と、ハードディスク装置３８を有する。各装置３１〜３８は、バス３９に接続される。 FIG. 11 is a diagram illustrating a hardware configuration of a computer constituting the image processing apparatus according to the embodiment. As shown in FIG. 11, the computer 30 includes a CPU 31 that executes various arithmetic processes, an input device 32 that receives input of data from a user, and a monitor 33. The computer 30 includes a medium reading device 34 that reads a program and the like from a storage medium, and a network interface device 35 that exchanges data with other computers via a network. The computer 30 also includes a scanner 36 that reads an image, a RAM 37 that temporarily stores various information, and a hard disk device 38. Each device 31 to 38 is connected to a bus 39.

そして、ハードディスク装置３８は、固有スケール計算プログラム３８ａ、画像要素切り出しプログラム３８ｂを記憶する。固有スケール計算プログラム３８ａは、スケール計算処理部１４０と同様の機能を有する。画像要素切り出し部３８ｂは、画像要素切り出し部１５０と同様の機能を有する。 The hard disk device 38 stores a unique scale calculation program 38a and an image element cutout program 38b. The inherent scale calculation program 38 a has the same function as the scale calculation processing unit 140. The image element cutout unit 38b has the same function as the image element cutout unit 150.

ＣＰＵ３１は、固有スケール計算プログラム３８ａ、画像要素切り出しプログラム３８ｂをハードディスク装置３８から読み出してＲＡＭ３７に展開する。固有スケール計算プログラム３８ａは、固有スケール計算プロセス３７ａとして機能する。画像要素切り出しプログラム３８ｂは、画像要素切り出しプロセス３７ｂとして機能する。 The CPU 31 reads the inherent scale calculation program 38 a and the image element cutout program 38 b from the hard disk device 38 and expands them in the RAM 37. The inherent scale calculation program 38a functions as an inherent scale calculation process 37a. The image element cutout program 38b functions as an image element cutout process 37b.

固有スケール計算プロセス３７ａは、スキャナ３４から画像データを取得し、この画像データを基にして、固有スケール値を算出する。画像要素切り出しプロセス３７ｂは、固有スケール値を利用して、画像データから画像要素領域の画像を切り出す。 The unique scale calculation process 37a acquires image data from the scanner 34, and calculates a unique scale value based on the image data. The image element cutout process 37b cuts out an image of the image element area from the image data using the unique scale value.

なお、上記の固有スケール計算プログラム３８ａ、画像要素切り出しプログラム３８ｂは、必ずしもハードディスク装置３８に格納されている必要はない。例えば、ＣＤ−ＲＯＭ等の記憶媒体に記憶されたプログラムを、コンピュータ３０が読み出して実行するようにしてもよい。また、公衆回線、インターネット、ＬＡＮ（Local Area Network）、ＷＡＮ（Wide Area Network）等にこのプログラムを記憶させておき、コンピュータ３０がこれらからプログラムを読み出して実行するようにしてもよい。 The inherent scale calculation program 38a and the image element cutout program 38b are not necessarily stored in the hard disk device 38. For example, the computer 30 may read and execute a program stored in a storage medium such as a CD-ROM. Alternatively, the program may be stored in a public line, the Internet, a LAN (Local Area Network), a WAN (Wide Area Network), or the like, and the computer 30 may read and execute the program.

以上の各実施例を含む実施形態に関し、さらに以下の付記を開示する。 The following supplementary notes are further disclosed with respect to the embodiments including the above examples.

（付記１）画像から暈かし具合の異なる複数のスケール画像を生成するスケール画像生成部と、
各スケール画像それぞれにおいて、スケール画像内の各画素値と所定の閾値とを比較してスケール画像を２値化することで複数のブロブを含むブロブ画像を生成するブロブ画像生成部と
画像の暈かし具合を示す値と、各ブロブの面積または数との関係に基づいて、前記画像の暈かし具合を示す値の変化に対するブロブの平均面積または数の変化の割合が所定の閾値未満となる範囲の暈かし具合の値を計算する固有スケール計算部と、
を備えたことを特徴とする画像処理装置。 (Supplementary Note 1) a scale image generation unit that generates a plurality of scale images having different degrees of blurring from an image;
In each scale image, a blob image generation unit that generates a blob image including a plurality of blobs by comparing each pixel value in the scale image with a predetermined threshold value and binarizing the scale image; Based on the relationship between the value indicating the degree of blur and the area or number of each blob, the ratio of the change in the average area or number of blobs to the change in the value indicating the degree of blurring of the image is less than a predetermined threshold. An eigenscale calculator that calculates the value of the extent of the range;
An image processing apparatus comprising:

（付記２）前記画像は文書画像であり、前記文書画像と前記ブロブ画像に含まれるブロブの領域とを比較し、前記ブロブの領域に含まれる文字または文字列の外接矩形の領域を、固有スケールでの画像要素領域として出力する出力部を更に有することを特徴とする付記１に記載の画像処理装置。 (Supplementary Note 2) The image is a document image, the document image is compared with a blob area included in the blob image, and a circumscribed rectangular area of a character or a character string included in the blob area is converted into a unique scale. The image processing apparatus according to appendix 1, further comprising an output unit that outputs an image element region in

（付記３）前記出力部は、前記ブロブの領域に含まれる文字の重心または文字列の各文字の重心と、該ブロブの領域とを比較し、ブロブの領域に重心が含まれる文字または文字列の外接矩形の領域を、固有スケールでの画像要素領域として出力することを特徴とする付記２に記載の画像処理装置。 (Supplementary Note 3) The output unit compares the centroid of the character included in the blob area or the centroid of each character of the character string with the blob area, and the character or character string including the centroid in the blob area The image processing apparatus according to appendix 2, wherein the circumscribed rectangular area is output as an image element area on a specific scale.

（付記４）前記出力部は、暈かし具合の異なるスケール画像から生成された各ブロブ画像のブロブの領域と、前記文書画像とを比較することで、文字の領域、単語の領域、行の領域、文字ブロックの領域を出力することを特徴とする付記２または３に記載の画像処理装置。 (Supplementary Note 4) The output unit compares the blob area of each blob image generated from the scale images having different scrambles with the document image, so that the character area, word area, line 4. The image processing apparatus according to appendix 2 or 3, wherein an area and a character block area are output.

（付記５）コンピュータが実行する画像処理方法であって、
画像から暈かし具合の異なる複数のスケール画像を生成し、
各スケール画像それぞれにおいて、スケール画像内の各画素値と所定の閾値とを比較してスケール画像を２値化することで複数のブロブを含むブロブ画像を生成し、
画像の暈かし具合を示す値と、各ブロブの面積または数との関係に基づいて、前記画像の暈かし具合を示す値の変化に対するブロブの平均面積または数の変化の割合が所定の閾値未満となる範囲の暈かし具合の値を計算することを特徴とする画像処理方法。 (Supplementary Note 5) An image processing method executed by a computer,
Generate multiple scale images with different degrees of blur from the image,
In each scale image, each pixel value in the scale image is compared with a predetermined threshold value, and the scale image is binarized to generate a blob image including a plurality of blobs.
Based on the relationship between the value indicating the degree of blurring of the image and the area or number of each blob, the ratio of the change in the average area or number of blobs to the change in the value indicating the degree of blurring of the image is a predetermined value. An image processing method comprising calculating a degree of blurring in a range that is less than a threshold value.

（付記６）前記画像は文書画像であり、前記文書画像と前記ブロブ画像に含まれるブロブの領域とを比較し、前記ブロブの領域に含まれる文字または文字列の外接矩形の領域を、固有スケールでの画像要素領域として出力することを特徴とする付記５に記載の画像処理方法。 (Supplementary Note 6) The image is a document image, the document image is compared with a blob area included in the blob image, and a circumscribed rectangular area of a character or a character string included in the blob area is converted into a unique scale. 6. The image processing method according to appendix 5, wherein the image element area is output as an image element area.

（付記７）前記ブロブの領域に含まれる文字の重心または文字列の各文字の重心と、該ブロブの領域とを比較し、ブロブの領域に重心が含まれる文字または文字列の外接矩形の領域を、固有スケールでの画像要素領域として出力することを特徴とする付記６に記載の画像処理方法。 (Supplementary note 7) The center of the character included in the blob region or the center of each character of the character string is compared with the region of the blob, and the circumscribed rectangle region of the character or character string including the center of gravity in the blob region Is output as an image element area on a specific scale.

（付記８）前記暈かし具合の異なるスケール画像から生成された各ブロブ画像のブロブの領域と、前記文書画像とを比較することで、文字の領域、単語の領域、行の領域、文字ブロックの領域を出力することを特徴とする付記６または７に記載の画像処理方法。 (Supplementary note 8) By comparing the blob area of each blob image generated from the scale images having different frauds with the document image, a character area, a word area, a line area, a character block The image processing method according to appendix 6 or 7, characterized in that:

（付記９）コンピュータに、
画像から暈かし具合の異なる複数のスケール画像を生成し、
各スケール画像それぞれにおいて、スケール画像内の各画素値と所定の閾値とを比較してスケール画像を２値化することで複数のブロブを含むブロブ画像を生成し、
画像の暈かし具合を示す値と、各ブロブの面積または数との関係に基づいて、前記画像の暈かし具合を示す値の変化に対するブロブの平均面積または数の変化の割合が所定の閾値未満となる範囲の暈かし具合の値を計算する処理を実行させる画像処理プログラム。 (Appendix 9)
Generate multiple scale images with different degrees of blur from the image,
In each scale image, each pixel value in the scale image is compared with a predetermined threshold value, and the scale image is binarized to generate a blob image including a plurality of blobs.
Based on the relationship between the value indicating the degree of blurring of the image and the area or number of each blob, the ratio of the change in the average area or number of blobs to the change in the value indicating the degree of blurring of the image is a predetermined value. An image processing program for executing a process of calculating a degree of blur in a range that is less than a threshold.

（付記１０）前記画像は文書画像であり、前記文書画像と前記ブロブ画像に含まれるブロブの領域とを比較し、前記ブロブの領域に含まれる文字または文字列の外接矩形の領域を、固有スケールでの画像要素領域として出力することを特徴とする付記９に記載の画像処理プログラム。 (Supplementary note 10) The image is a document image, the document image is compared with a blob area included in the blob image, and a circumscribed rectangular area of a character or a character string included in the blob area is converted into a unique scale. The image processing program according to appendix 9, wherein the image processing area is output as an image element area.

（付記１１）前記ブロブの領域に含まれる文字の重心または文字列の各文字の重心と、該ブロブの領域とを比較し、ブロブの領域に重心が含まれる文字または文字列の外接矩形の領域を、固有スケールでの画像要素領域として出力することを特徴とする付記１０に記載の画像処理プログラム。 (Supplementary Note 11) The circumscribed rectangle region of the character or character string whose center of gravity is included in the blob region by comparing the center of gravity of the character included in the region of the blob or the center of gravity of each character of the character string with the region of the blob Is output as an image element area on a specific scale.

（付記１２）前記暈かし具合の異なるスケール画像から生成された各ブロブ画像のブロブの領域と、前記文書画像とを比較することで、文字の領域、単語の領域、行の領域、文字ブロックの領域を出力することを特徴とする付記１０または１１に記載の画像処理プログラム。 (Additional remark 12) By comparing the blob area of each blob image generated from the scale image having a different degree of blurring with the document image, a character area, a word area, a line area, a character block The image processing program according to appendix 10 or 11, wherein the image processing program is output.

１００画像処理装置
１１０入力部
１２０画像読み取り部
１３０出力部
１４０スケール計算処理部
１５０画像要素切り出し部 DESCRIPTION OF SYMBOLS 100 Image processing apparatus 110 Input part 120 Image reading part 130 Output part 140 Scale calculation process part 150 Image element cutout part

Claims

A scale image generation unit for generating a plurality of scale images having different degrees of blur from the image;
In each scale image, a blob image generation unit that generates a blob image including a plurality of blobs by comparing each pixel value in the scale image with a predetermined threshold value and binarizing the scale image; Based on the relationship between the value indicating the degree of blur and the area or number of each blob, the ratio of the change in the average area or number of blobs to the change in the value indicating the degree of blurring of the image is less than a predetermined threshold. An image processing apparatus comprising: an eigenscale calculation unit that calculates a value of a range blur.

The image is a document image, the document image is compared with a blob area included in the blob image, and a circumscribed rectangular area of a character or a character string included in the blob area is converted into an image element on a specific scale. The image processing apparatus according to claim 1, further comprising an output unit that outputs the area.

The output unit compares the centroid of the character included in the blob area or the centroid of each character of the character string with the blob area, and the circumscribed rectangle of the character or character string including the centroid in the blob area The image processing apparatus according to claim 2, wherein the area is output as an image element area on a specific scale.

The output unit compares a blob area of each blob image generated from a scale image having a different degree of blurring with the document image, thereby obtaining a character area, a word area, a line area, and a character block. The image processing apparatus according to claim 2, wherein the region is output.

An image processing method executed by a computer,
Generate multiple scale images with different degrees of blur from the image,
In each scale image, each pixel value in the scale image is compared with a predetermined threshold value, and the scale image is binarized to generate a blob image including a plurality of blobs.
Based on the relationship between the value indicating the degree of blurring of the image and the area or number of each blob, the ratio of the change in the average area or number of blobs to the change in the value indicating the degree of blurring of the image is a predetermined value. An image processing method comprising calculating a degree of blurring in a range that is less than a threshold value.

On the computer,
Generate multiple scale images with different degrees of blur from the image,
In each scale image, each pixel value in the scale image is compared with a predetermined threshold value, and the scale image is binarized to generate a blob image including a plurality of blobs.
Based on the relationship between the value indicating the degree of blurring of the image and the area or number of each blob, the ratio of the change in the average area or number of blobs to the change in the value indicating the degree of blurring of the image is a predetermined value. An image processing program for executing a process of calculating a degree of blur in a range that is less than a threshold.