JP4394399B2

JP4394399B2 - Image analysis apparatus, image analysis program, storage medium, and image analysis method

Info

Publication number: JP4394399B2
Application number: JP2003304543A
Authority: JP
Inventors: 聖司石原; 利夫宮澤; 優関口; 学平劉; 高志齋藤; 史裕長谷川
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2003-08-28
Filing date: 2003-08-28
Publication date: 2010-01-06
Anticipated expiration: 2023-08-28
Also published as: JP2005078149A

Description

本発明は、画像解析装置、画像解析プログラム、記憶媒体および画像解析方法に関する。 The present invention relates to an image analysis device, an image analysis program, a storage medium, and an image analysis method.

テクスチャ画像を対象とする領域分割については、従来から広く研究がなされている。その多くは、共起行列、マルコフ確率場（Markov Random Field：ＭＲＦ）やフーリエ変換等に基づく統計的アプローチによるものである。この種のアプローチに基づく領域分割の主要な目的の一つは、テクスチャ画像を一様性の評価の下に自由度の高い領域形状で分割することである。 A wide range of research has been conducted on area segmentation for texture images. Many of them are based on a statistical approach based on a co-occurrence matrix, Markov Random Field (MRF), Fourier transform, and the like. One of the main purposes of region segmentation based on this kind of approach is to segment a texture image into regions with a high degree of freedom under the evaluation of uniformity.

このような一様性の評価に際しては、テクスチャ解析を行う画素の範囲の扱いが一つの大きな焦点となる。より詳細には、画像処理の目的によっては、局所的なテクスチャ特徴に基づいて画像をいくつもの微小な領域に分割するよりも、大域的な特徴量を使っていくつかの大きな領域にまとめた方が望ましい場合が考えられる。例えば柄物のシャツなど、何らかのテクスチャによって構成されるテクスチャ特徴優位なオブジェクトを精度良く切出すような場合である。このような場合に、オブジェクトに相当する領域の内部を細かく分割して解析精度を向上させると、領域境界にひずみが生じてしまう。つまり、テクスチャ解析を行う画素の範囲を広げることによる解析精度の向上と領域境界のひずみとのトレードオフが問題となる。 When evaluating such uniformity, the handling of the range of pixels on which texture analysis is performed becomes one major focus. More specifically, depending on the purpose of the image processing, the image may be grouped into several large regions using global features rather than dividing the image into several smaller regions based on local texture features. May be desirable. For example, this is a case where an object having a superior texture feature, such as a patterned shirt, is cut out with high accuracy. In such a case, if the analysis accuracy is improved by finely dividing the inside of the area corresponding to the object, the boundary of the area is distorted. That is, there is a problem of a trade-off between improvement in analysis accuracy by expanding the range of pixels on which texture analysis is performed and distortion at the region boundary.

この種の問題への対処法として、非特許文献１には、一様範囲の推定によって解析の範囲を可変とする方法が提案されている。 As a method for dealing with this type of problem, Non-Patent Document 1 proposes a method of making the analysis range variable by estimating the uniform range.

内山俊郎,武川直樹,中村太一,金子博,“一様範囲の推定を用いたテクスチャ画像の領域分割,”信学論（Ｄ−II）,vol.Ｊ83−Ｄ−II,pp.1446−1459,2000.Toshio Uchiyama, Naoki Takegawa, Taichi Nakamura, and Hiroshi Kaneko, “Area Segmentation of Texture Images Using Uniform Range Estimation,” IEICE Theory (D-II), vol. J83-D-II, pp. 1446-1459 2000.

しかしながら、非特許文献１に提案されている方法は、濃度値に対する２次のＭＲＦに基づいて特徴をとらえる方法のため、処理の対象は比較的短い周期のテクスチャに限られる。 However, since the method proposed in Non-Patent Document 1 is a method of capturing features based on the second-order MRF with respect to the density value, the processing target is limited to a texture having a relatively short period.

また、近年においては、濃度値の変動特徴を空間−周波数領域において多様なスケールで効率的にとらえることのできる手段として、ウェーブレットによる多重解像度解析が注目されている。特に、テクスチャ特徴に対するシフト不変な解析手段としてウェーブレットフレームが盛んに取り上げられている。ただし，多様なスケールによって特徴付けられる大域的なテクスチャ特徴をとらえるには、それに相当する範囲の領域を解析する必要がある。 In recent years, multi-resolution analysis using wavelets has attracted attention as a means for efficiently capturing fluctuation characteristics of density values in various scales in the space-frequency domain. In particular, wavelet frames have been actively taken up as shift-invariant analysis means for texture features. However, in order to capture global texture features that are characterized by various scales, it is necessary to analyze a region corresponding to that range.

本発明は、解析精度が高く境界のひずみも少ないオブジェクト切出しを行うことができる画像解析装置、画像解析プログラム、記憶媒体および画像解析方法を提供することを目的とする。 An object of the present invention is to provide an image analysis apparatus, an image analysis program, a storage medium, and an image analysis method capable of performing object extraction with high analysis accuracy and little boundary distortion.

本発明の画像解析装置は、画像を所定の領域に分割する領域分割手段と、この領域分割手段により分割された前記領域ごとにウェーブレットフレームによる多重解像度解析を所定のレベルまで繰り返し実行する多重解像度解析手段と、この多重解像度解析手段によって得られるウェーブレット係数から所定の特徴量を抽出する特徴抽出手段と、この特徴抽出手段により抽出された前記特徴量に基づいて前記領域ごとにラベルを設定する領域ラベル設定手段と、所定の領域における所定の特徴量をクラスタ中心として前記各領域をクラスタリングするクラスタリング手段と、前記所定の領域および前記クラスタリングによって得られる所定のクラスタから成る領域とその周辺の領域との境界形状を所定の基準に従って最適化する領域境界最適化手段と、を具備する。
An image analysis apparatus according to the present invention includes an area dividing unit that divides an image into predetermined regions, and a multi-resolution analysis that repeatedly executes a multi-resolution analysis using wavelet frames for each of the regions divided by the region dividing unit. Means, a feature extraction means for extracting a predetermined feature quantity from the wavelet coefficients obtained by the multi-resolution analysis means, and a region label for setting a label for each area based on the feature quantity extracted by the feature extraction means A setting unit; a clustering unit that clusters each region with a predetermined feature amount in a predetermined region as a cluster center; and a boundary between the predetermined region and a region that includes the predetermined cluster obtained by the clustering and a surrounding region Region boundary optimization that optimizes shape according to predetermined criteria It includes a stage, a.

また本発明は、前記画像解析装置において、前記領域分割手段、前記多重解像度解析手段、前記特徴抽出手段並びに前記領域ラベル設定手段を、所定の終了条件が満たされるまで再帰的に実行する。
According to the present invention, in the image analysis apparatus, the region dividing unit, the multi-resolution analyzing unit, the feature extracting unit, and the region label setting unit are recursively executed until a predetermined end condition is satisfied.

また本発明は、前記画像解析装置において、前記領域分割手段、前記多重解像度解析手段、前記特徴抽出手段並びに前記クラスタリング手段を、所定の終了条件が満たされるまで再帰的に実行する。
According to the present invention, in the image analysis apparatus, the region dividing unit, the multi-resolution analyzing unit, the feature extracting unit, and the clustering unit are recursively executed until a predetermined end condition is satisfied.

また本発明は、前記何れか一記載の画像解析装置において、前記領域分割手段は、対象とする画像を所定の大きさで所定の数の矩形領域に分割する。
According to the present invention, in the image analysis device according to any one of the above, the area dividing unit divides the target image into a predetermined number of rectangular areas having a predetermined size.

また本発明は、前記何れか一記載の画像解析装置において、前記領域ラベル設定手段は、近傍領域間の特徴量に関する所定の類似度に応じて所定のラベルを付与する。
Further, according to the present invention, in the image analysis device according to any one of the above, the region label setting unit assigns a predetermined label according to a predetermined similarity regarding a feature amount between neighboring regions.

また本発明は、前記何れか一記載の画像解析装置において、前記領域境界最適化手段は、前記領域とその周辺の領域とにマルコフ確率場モデルをそれぞれ導入し、当該モデルに基づく当該領域の尤度が最大となる境界を探索する。
Further, the present invention is the image analysis device according to any one of the above, wherein the region boundary optimization unit introduces a Markov random field model to the region and the surrounding region, and the likelihood of the region based on the model. Search for the boundary with the highest degree.

また本発明は、前記何れか一記載の画像解析装置において、前記特徴抽出手段は、所定の各分解レベルから所定の成分毎に求めた平均エネルギーの各々を要素とするベクトルを特徴量とする。
According to the present invention, in the image analysis device according to any one of the above, the feature extraction unit uses a vector having each element of average energy obtained for each predetermined component from each predetermined decomposition level as a feature amount.

また本発明の画像解析プログラムは、コンピュータにインストールされ、画像を所定の領域に分割する領域分割機能と、この領域分割機能により分割された前記領域ごとにウェーブレットフレームによる多重解像度解析を所定のレベルまで繰り返し実行する多重解像度解析機能と、この多重解像度解析機能によって得られるウェーブレット係数から所定の特徴量を抽出する特徴抽出機能と、この特徴抽出機能により抽出された前記特徴量に基づいて前記領域ごとにラベルを設定する領域ラベル設定機能と、所定の領域における所定の特徴量をクラスタ中心として前記各領域をクラスタリングするクラスタリング機能と、前記所定の領域および前記クラスタリングによって得られる所定のクラスタから成る領域とその周辺の領域との境界形状を所定の基準に従って最適化する領域境界最適化機能と、をコンピュータに実行させる。
The image analysis program of the present invention is installed in a computer and has an area dividing function for dividing an image into predetermined areas, and multi-resolution analysis using wavelet frames for each of the areas divided by the area dividing function. A multi-resolution analysis function that is repeatedly executed, a feature extraction function that extracts a predetermined feature amount from wavelet coefficients obtained by the multi-resolution analysis function, and each region based on the feature amount extracted by the feature extraction function A region label setting function for setting a label, a clustering function for clustering each region with a predetermined feature amount in a predetermined region as a cluster center, a region composed of the predetermined region and a predetermined cluster obtained by the clustering, and The boundary shape with the surrounding area Executing a region boundary optimization to optimize according to the criteria of the constant, to the computer.

また本発明は、前記画像解析プログラムにおいて、前記領域分割機能、前記多重解像度解析機能、前記特徴抽出機能並びに前記領域ラベル設定機能を、所定の終了条件が満たされるまで再帰的にコンピュータに実行させる。
In the image analysis program, the present invention causes the computer to recursively execute the region division function, the multi-resolution analysis function, the feature extraction function, and the region label setting function until a predetermined end condition is satisfied.

また本発明は、前記画像解析プログラムにおいて、前記領域分割機能、前記多重解像度解析機能、前記特徴抽出機能並びに前記クラスタリング機能を、所定の終了条件が満たされるまで再帰的にコンピュータに実行させる。
In the image analysis program, the present invention causes the computer to recursively execute the region division function, the multi-resolution analysis function, the feature extraction function, and the clustering function until a predetermined end condition is satisfied.

また本発明の記憶媒体は、前記いずれか一記載の画像解析プログラムを記憶する。
A storage medium of the present invention stores any one of the image analysis programs described above.

また本発明の画像解析方法は、画像を所定の領域に分割する領域分割ステップと、この領域分割ステップにより分割された前記領域ごとにウェーブレットフレームによる多重解像度解析を所定のレベルまで繰り返し実行する多重解像度解析ステップと、この多重解像度解析ステップによって得られるウェーブレット係数から所定の特徴量を抽出する特徴抽出ステップと、この特徴抽出ステップにより抽出された前記特徴量に基づいて前記領域ごとにラベルを設定する領域ラベル設定ステップと、所定の領域における所定の特徴量をクラスタ中心として前記各領域をクラスタリングするクラスタリングステップと、前記所定の領域および前記クラスタリングによって得られる所定のクラスタから成る領域とその周辺の領域との境界形状を所定の基準に従って最適化する領域境界最適化ステップと、を具備する。
In addition, the image analysis method of the present invention includes an area dividing step for dividing an image into predetermined areas, and a multi-resolution that repeatedly executes multi-resolution analysis using wavelet frames for each of the areas divided by the area dividing step to a predetermined level. An analysis step, a feature extraction step for extracting a predetermined feature amount from the wavelet coefficients obtained by the multi-resolution analysis step, and a region in which a label is set for each region based on the feature amount extracted by the feature extraction step A label setting step, a clustering step of clustering each region with a predetermined feature amount in a predetermined region as a cluster center, a region composed of the predetermined region and a predetermined cluster obtained by the clustering, and a surrounding region Boundary shape is given It includes a region boundary optimization step of optimizing accordance quasi, the.

また本発明は、前記画像解析方法において、前記領域分割ステップ、前記多重解像度解析ステップ、前記特徴抽出ステップ並びに前記領域ラベル設定ステップを、所定の終了条件が満たされるまで再帰的に実行する。
According to the present invention, in the image analysis method, the region dividing step, the multi-resolution analyzing step, the feature extracting step, and the region label setting step are recursively executed until a predetermined end condition is satisfied.

また本発明は、前記画像解析方法において、前記領域分割ステップ、前記多重解像度解析ステップ、前記特徴抽出ステップ並びに前記クラスタリングステップを、所定の終了条件が満たされるまで再帰的に実行する。
According to the present invention, in the image analysis method, the region dividing step, the multi-resolution analysis step, the feature extraction step, and the clustering step are recursively executed until a predetermined end condition is satisfied.

また本発明は、前記何れか一記載の画像解析方法において、前記領域分割ステップは、対象とする画像を所定の大きさで所定の数の矩形領域に分割する。
According to the present invention, in the image analysis method according to any one of the above, the region dividing step divides the target image into a predetermined number of rectangular regions having a predetermined size.

また本発明は、前記何れか一記載の画像解析方法において、前記領域ラベル設定ステップは、近傍領域間の特徴量に関する所定の類似度に応じて所定のラベルを付与する。
According to the present invention, in the image analysis method according to any one of the above, the region label setting step assigns a predetermined label in accordance with a predetermined degree of similarity related to a feature amount between neighboring regions.

また本発明は、前記何れか一記載の画像解析方法において、前記領域境界最適化ステップは、前記領域とその周辺の領域とにマルコフ確率場モデルをそれぞれ導入し、当該モデルに基づく当該領域の尤度が最大となる境界を探索する。
Further, the present invention is the image analysis method according to any one of the above, wherein the region boundary optimization step introduces a Markov random field model to each of the region and the surrounding region, and the likelihood of the region based on the model. Search for the boundary with the highest degree.

また本発明は、前記何れか一記載の画像解析方法において、前記特徴抽出ステップは、所定の各分解レベルから所定の成分毎に求めた平均エネルギーの各々を要素とするベクトルを特徴量とする。 According to the present invention, in the image analysis method according to any one of the above, the feature extraction step uses, as a feature amount, a vector having each element of average energy obtained for each predetermined component from each predetermined decomposition level.

本発明によれば、ウェーブレットフレームを用いた多様なスケールでの空間−周波数解析に基づく粗い領域分割の後、領域境界の最適化を行うことによって、解析精度が高く境界のひずみも少ないオブジェクト切出しを行うことができる。 According to the present invention, object segmentation with high analysis accuracy and low boundary distortion is performed by performing region boundary optimization after rough region division based on space-frequency analysis at various scales using wavelet frames. It can be carried out.

本発明の実施の一形態を図１ないし図６に基づいて説明する。 An embodiment of the present invention will be described with reference to FIGS.

図１は、本発明が適用される画像解析装置１のハードウェア構成を概略的に示すブロック図である。図１に示すように、画像解析装置１は、例えばパーソナルコンピュータやワークステーションであり、コンピュータの主要部であって各部を集中的に制御するＣＰＵ（Central Processing Unit）２を備えている。このＣＰＵ２には、ＢＩＯＳなどを記憶した読出し専用メモリであるＲＯＭ（Read Only Memory）３と、各種データを書換え可能に記憶するＲＡＭ（Random Access Memory）４とがバス５で接続されている。ＲＡＭ４は、各種データを書換え可能に記憶する性質を有していることから、ＣＰＵ２の作業エリアとして機能し、例えば入力バッファ等の役割を果たす。 FIG. 1 is a block diagram schematically showing a hardware configuration of an image analysis apparatus 1 to which the present invention is applied. As shown in FIG. 1, the image analysis apparatus 1 is, for example, a personal computer or a workstation, and includes a CPU (Central Processing Unit) 2 that is a main part of the computer and controls each part centrally. The CPU 2 is connected by a bus 5 to a ROM (Read Only Memory) 3 which is a read-only memory storing BIOS and a RAM (Random Access Memory) 4 which stores various data in a rewritable manner. Since the RAM 4 has a property of storing various data in a rewritable manner, the RAM 4 functions as a work area for the CPU 2 and functions as an input buffer, for example.

さらにバス５には、外部記憶装置であるＨＤＤ（Hard Disk Drive）６と、配布されたプログラムであるコンピュータプログラムを読み取るための機構としてＣＤ（Compact Disc）−ＲＯＭ７を読み取るＣＤ−ＲＯＭドライブ８と、画像解析装置１とネットワーク９との通信を司る通信制御装置１０と、入力手段として機能するキーボードやマウスなどの入力装置１１と、ＣＲＴ（Cathode Ray Tube）、ＬＣＤ（Liquid Crystal Display）などの表示装置１２とが、図示しないＩ／Ｏを介して接続されている。ＨＤＤ６には、各種の情報が、コンピュータ上のファイルシステム構造におけるファイルやフォルダなどの形式で記憶されている。 Further, the bus 5 includes an HDD (Hard Disk Drive) 6 that is an external storage device, a CD-ROM drive 8 that reads a CD (Compact Disc) -ROM 7 as a mechanism for reading a computer program that is a distributed program, A communication control device 10 that controls communication between the image analysis device 1 and the network 9, an input device 11 such as a keyboard or a mouse that functions as input means, and a display device such as a CRT (Cathode Ray Tube) or LCD (Liquid Crystal Display). 12 are connected via an I / O (not shown). Various types of information are stored in the HDD 6 in the form of files and folders in the file system structure on the computer.

図１に示すＣＤ−ＲＯＭ７は、この発明の記憶媒体を実施するものであり、Windows（登録商標）などのＯＳ（Operating System）や各種コンピュータプログラムが記憶されている。ＣＰＵ２は、ＣＤ−ＲＯＭ７に記憶されているコンピュータプログラムをＣＤ−ＲＯＭドライブ８で読み取り、ＨＤＤ６にインストールする。 A CD-ROM 7 shown in FIG. 1 implements the storage medium of the present invention, and stores an OS (Operating System) such as Windows (registered trademark) and various computer programs. The CPU 2 reads the computer program stored in the CD-ROM 7 with the CD-ROM drive 8 and installs it in the HDD 6.

なお、記憶媒体としては、ＣＤ−ＲＯＭ７のみならず、ＤＶＤなどの各種の光ディスク、各種光磁気ディスク、フレキシブル・ディスクなどの各種磁気ディスク等、半導体メモリ等の各種方式のメディアを用いることができる。また、通信制御装置１０を介してインターネットなどのネットワーク９からコンピュータプログラムをダウンロードし、ＨＤＤ６にインストールするようにしてもよい。この場合に、送信側のサーバでコンピュータプログラムを記憶している記憶装置も、この発明の記憶媒体である。なお、コンピュータプログラムは、所定のＯＳ（Operating System）上で動作するものであってもよいし、その場合に後述の各種処理の一部の実行をＯＳに肩代わりさせるものであってもよいし、所定のアプリケーションソフトやＯＳなどを構成する一群のプログラムファイルの一部として含まれているものであってもよい。 As the storage medium, not only the CD-ROM 7 but also various types of media such as semiconductor memory such as various optical disks such as DVD, various magnetic disks such as various magneto-optical disks and flexible disks, and the like can be used. Further, a computer program may be downloaded from the network 9 such as the Internet via the communication control device 10 and installed in the HDD 6. In this case, the storage device that stores the computer program in the transmitting server is also a storage medium of the present invention. Note that the computer program may operate on a predetermined OS (Operating System), and in that case, the OS may take over the execution of some of the various processes described below, It may be included as part of a group of program files that constitute predetermined application software, OS, or the like.

このシステム全体の動作を制御するＣＰＵ２は、このシステムの主記憶として使用されるＨＤＤ６上にロードされたコンピュータプログラムに基づいて各種処理を実行する。 The CPU 2 that controls the operation of the entire system executes various processes based on a computer program loaded on the HDD 6 used as the main storage of the system.

次に、画像解析装置１のＣＰＵ２がコンピュータプログラムに基づいて実行する演算処理の内容について説明する。ここでは、画像解析装置１のＣＰＵ２が実行する各種の演算処理のうち、本実施の形態の特長的な処理について以下に説明する。 Next, the contents of arithmetic processing executed by the CPU 2 of the image analysis apparatus 1 based on the computer program will be described. Here, the characteristic processing of the present embodiment among the various types of arithmetic processing executed by the CPU 2 of the image analysis device 1 will be described below.

図２は、画像解析装置１が実行する処理を概略的に示すフローチャートである。図２に示すように、画像解析装置１のＣＰＵ２は、まず、カウンタｎを０に初期化する（ステップＳ１）。 FIG. 2 is a flowchart schematically showing processing executed by the image analysis apparatus 1. As shown in FIG. 2, the CPU 2 of the image analysis apparatus 1 first initializes a counter n to 0 (step S1).

次に、解析対象とする画像（原画像）を所定の大きさで所定の矩形領域（以降、タイルと呼ぶ）に分割する（ステップＳ２）。例えば、原画像を（２^ｎ＋１）^２個のタイルに等分割する方法などが考えられる。 Next, an image to be analyzed (original image) is divided into a predetermined rectangular area (hereinafter referred to as a tile) with a predetermined size (step S2). For example, a method of equally dividing the original image into (2 ^{n + 1} ) ² tiles can be considered.

次に、タイル毎に、所定の分解レベルまでウェーブレットフレーム分解を実行する（ステップＳ３）。ウェーブレットフレーム分解の最も簡単な実施法は、ウェーブレット分解におけるダウンサンプリングを省略することである。また、分解レベルの限度は、対象とするタイルの大きさを考慮して決めれば良い。 Next, wavelet frame decomposition is executed for each tile up to a predetermined decomposition level (step S3). The simplest implementation of wavelet frame decomposition is to omit downsampling in wavelet decomposition. Further, the limit of the decomposition level may be determined in consideration of the size of the target tile.

続いて、タイル毎に特徴ベクトルの抽出を行う（ステップＳ４）。ステップＳ３のウェーブレットフレーム分解により、分解レベル毎に、ＬＬ成分、ＨＬ成分、ＬＨ成分そしてＨＨ成分の４種類にそれぞれ対応する係数値がタイルの画素数と同数ずつ得られる。ここで、Ｌは低域、Ｈは高域を表すアルファベットである。また、これらが二つ並んだ場合には、前に位置するアルファベットが水平方向、後に位置するアルファベットが垂直方向に対応するものとする。特徴ベクトルの要素には、各成分の平均エネルギーなどを採用すれば良い。なお、必ずしもすべての分解レベルやすべての成分を特徴ベクトルの要素に採用する必要はない。例えば、微細な周期的変動による影響を抑制したい場合には、周期の短いフレームを用いる分解レベルの成分を使用しない方法もある。また、低域成分のエネルギーが高くなりすぎてしまう場合には、各分解レベルのＬＬ成分を使用しないことも考えられる。 Subsequently, feature vectors are extracted for each tile (step S4). By the wavelet frame decomposition in step S3, coefficient values corresponding to the four types of LL component, HL component, LH component, and HH component are obtained for each decomposition level by the same number as the number of pixels of the tile. Here, L is an alphabet representing a low frequency and H is a high frequency. When two of these are arranged, the alphabet positioned in front corresponds to the horizontal direction, and the alphabet positioned behind corresponds to the vertical direction. The average energy of each component may be adopted as the feature vector element. Note that it is not always necessary to employ all decomposition levels and all components as elements of feature vectors. For example, when it is desired to suppress the influence of fine periodic fluctuations, there is a method that does not use a decomposition level component using a frame with a short period. In addition, when the energy of the low frequency component becomes too high, it may be considered not to use the LL component at each decomposition level.

次に、カウンタｎと繰り返し回数の上限Ｍを比較する（ステップＳ５）。カウンタｎが繰り返し回数の上限Ｍに達していないときは（ステップＳ５のＹ）、ステップＳ４にて抽出した特徴ベクトルに基づいて各タイルのラベルを設定する領域ラベル設定処理を行う（ステップＳ６）。 Next, the counter n is compared with the upper limit M of the number of repetitions (step S5). When the counter n has not reached the upper limit M of the number of repetitions (Y in step S5), region label setting processing for setting the label of each tile based on the feature vector extracted in step S4 is performed (step S6).

ここで、ステップＳ６における領域ラベル設定処理について説明する。図３は、領域ラベル設定処理を概略的に示すフローチャートである。図３に示すように、まず、画像内におけるタイルの位置を表す変数ｋおよび変数ｌをそれぞれ０に初期化すると共に、すべてのタイルにラベルの初期値として０を付与する（ステップＳ２１）。そして、カウンタｊに０を代入する（ステップＳ２２）。 Here, the region label setting process in step S6 will be described. FIG. 3 is a flowchart schematically showing the region label setting process. As shown in FIG. 3, first, a variable k and a variable l representing the position of the tile in the image are each initialized to 0, and 0 is assigned to all the tiles as initial labels (step S21). Then, 0 is assigned to the counter j (step S22).

次に、タイル（ｋ，ｌ）とその近傍に位置するタイル（ｋ＋τ_ｊ，ｌ＋τ_ｊ）（ｊ＝０，１，・・・，Ｊ−１）との間の誤差ｄを算出する（ステップＳ２３）。ここで、Ｊは近傍タイルの数を表す。誤差ｄの値としては、各タイルの特徴ベクトル同士の二乗誤差等を用いれば良い。また、近傍タイルの範囲は、設計者が任意で設定する。 Next, an error d between the tile (k, l) and the tile (k + τ _j , l + τ _j ) (j = 0, 1,..., J−1) located in the vicinity thereof is calculated (step S23). ). Here, J represents the number of neighboring tiles. As the value of the error d, a square error between feature vectors of each tile may be used. Also, the range of neighboring tiles is arbitrarily set by the designer.

その後、誤差ｄと所定の閾値Ｄを比較する（ステップＳ２４）。誤差ｄが閾値Ｄに達しているときは（ステップＳ２４のＮ）、ステップＳ２６に進む。一方、誤差ｄが閾値Ｄに達していないときは（ステップＳ２４のＹ）、タイル（ｋ，ｌ）と近傍タイル（ｋ＋τ_ｊ，ｌ＋τ_ｊ）とに同一のラベルを新たに付与する（ステップＳ２５）。具体的には、例えば、以下に示すような条件（１）〜（４）にしたがってラベルの値を対応させれば良い。ここで、タイル（ｋ，ｌ）におけるラベルの値を、ｓ（ｋ，ｌ）と表すことにする。 Thereafter, the error d is compared with a predetermined threshold value D (step S24). When the error d has reached the threshold value D (N in step S24), the process proceeds to step S26. On the other hand, when the error d has not reached the threshold value D (Y in step S24), the same label is newly assigned to the tile (k, l) and the neighboring tile (k + τ _j , l + τ _j ) (step S25). . Specifically, for example, the label values may be made to correspond in accordance with the following conditions (1) to (4). Here, the value of the label in the tile (k, l) is represented as s (k, l).

（１）ｓ（ｋ，ｌ）＝０、ｓ（ｋ＋τ_ｊ，ｌ＋τ_ｊ）＝０であるとき、双方に新規のラベルを与える。 (1) When s (k, l) = 0 and s (k + τ _j , l + τ _j ) = 0, both are given new labels.

（２）ｓ（ｋ，ｌ）＝０、ｓ（ｋ＋τ_ｊ，ｌ＋τ_ｊ）≠０であるとき、ｓ（ｋ，ｌ）←ｓ（ｋ＋τ_ｊ，ｌ＋τ_ｊ）とする。 (2) When s (k, l) = 0 and s (k + τ _j , l + τ _j ) ≠ 0, s (k, l) ← s (k + τ _j , l + τ _j ) is assumed.

（３）ｓ（ｋ，ｌ）≠０、ｓ（ｋ＋τ_ｊ，ｌ＋τ_ｊ）＝０であるとき、ｓ（ｋ＋τ_ｊ，ｌ＋τ_ｊ）←ｓ（ｋ，ｌ）とする。 (3) When s (k, l) ≠ 0 and s (k + τ _j , l + τ _j ) = 0, s (k + τ _j , l + τ _j ) ← s (k, l).

（４）ｓ（ｋ，ｌ）≠０、ｓ（ｋ＋τ_ｊ，ｌ＋τ_ｊ）≠０であるとき、ｓ（ｋ＋τ_ｊ，ｌ＋τ_ｊ）との間の誤差が他のいずれの近傍タイルとの間の誤差よりも低い場合、ｓ（ｋ，ｌ）←ｓ（ｋ＋τ_ｊ，ｌ＋τ_ｊ）とする。 (4) When s (k, l) ≠ 0 and s (k + τ _j , l + τ _j ) ≠ 0, the error between s (k + τ _j , l + τ _j ) is not between any other neighboring tiles. When it is lower than the error, s (k, l) ← s (k + τ _j , l + τ _j ) is set.

なお、前記（４）の場合には、ラベル変更前のｓ（ｋ，ｌ）と同じ値をもつ他のタイルについても、ｓ（ｋ，ｌ）の変更に伴ってラベルの値を変えることが考えられる。具体的には、そのようなタイルのうち、ｓ（ｋ＋τ_ｊ，ｌ＋τ_ｊ）と同じ値をもつタイルが近傍に存在し、そのタイルとの誤差が他のいずれの近傍タイルとの誤差より低いもののラベルを再帰的に変更していく方法などが挙げられる。 In the case of (4), the label value can be changed in accordance with the change of s (k, l) for other tiles having the same value as s (k, l) before the label change. Conceivable. Specifically, among such tiles, a tile having the same value as s (k + τ _j , l + τ _j ) exists in the vicinity, and an error with the tile is lower than an error with any other neighboring tiles. For example, a method of recursively changing labels.

ステップＳ２６では、カウンタｊと近傍タイルの数Ｊを比較する。カウンタｊが近傍タイルの数Ｊに達していないときは（ステップＳ２６のＹ）、カウンタｊを１インクリメントして（ステップＳ２７）、ステップＳ２３以降の処理を繰り返す。一方、カウンタｊが近傍タイルの数Ｊに達しているときは（ステップＳ２６のＮ）、ステップＳ２８に進む。 In step S26, the counter j is compared with the number J of neighboring tiles. If the counter j has not reached the number J of neighboring tiles (Y in step S26), the counter j is incremented by 1 (step S27), and the processes in and after step S23 are repeated. On the other hand, when the counter j has reached the number J of neighboring tiles (N in step S26), the process proceeds to step S28.

ステップＳ２８では、変数ｋと水平方向のタイル数Ｋを比較する。変数ｋが水平方向のタイル数Ｋに達していないときは（ステップＳ２８のＹ）、変数ｋを１インクリメントして（ステップＳ２９）、ステップＳ２２以降の処理を繰り返す。一方、変数ｋが水平方向のタイル数Ｋに達しているときは（ステップＳ２８のＮ）、ステップＳ３０に進む。 In step S28, the variable k is compared with the number of tiles K in the horizontal direction. When the variable k has not reached the number of tiles K in the horizontal direction (Y in step S28), the variable k is incremented by 1 (step S29), and the processes after step S22 are repeated. On the other hand, when the variable k has reached the number of tiles K in the horizontal direction (N in step S28), the process proceeds to step S30.

ステップＳ３０では、変数ｌと垂直方向のタイル数Ｌを比較する。変数ｌが垂直方向のタイル数Ｌに達していないときは（ステップＳ３０のＹ）、変数ｋに０を代入すると共に変数ｌを１インクリメントして（ステップＳ３１）、ステップＳ２２以降の処理を繰り返す。一方、変数ｌが垂直方向のタイル数Ｌに達しているときは（ステップＳ３０のＮ）、領域ラベル設定処理を終了する。 In step S30, the variable l is compared with the number of tiles L in the vertical direction. When the variable l has not reached the number of tiles L in the vertical direction (Y in step S30), 0 is substituted for the variable k and the variable l is incremented by 1 (step S31), and the processes in and after step S22 are repeated. On the other hand, when the variable l has reached the number of tiles L in the vertical direction (N in step S30), the region label setting process ends.

このようにして領域ラベル設定処理（ステップＳ６）を終了すると、カウンタｎを１インクリメントして（ステップＳ７）、ステップＳ２以降の処理を繰り返す。 When the area label setting process (step S6) is thus completed, the counter n is incremented by 1 (step S7), and the processes after step S2 are repeated.

一方、カウンタｎが繰り返し回数の上限Ｍに達したときは（ステップＳ５のＮ）、ステップＳ８に進む。 On the other hand, when the counter n reaches the upper limit M of the number of repetitions (N in step S5), the process proceeds to step S8.

ステップＳ８では、カウンタｎと繰り返し回数の上限Ｍの異同を判定する。カウンタｎと繰り返し回数の上限Ｍが等しいときは（ステップＳ８のＹ）、所定の領域（以降、選択領域と呼ぶ）を選択し（ステップＳ９）、ステップＳ１１以降のクラスタリングを行う処理に進む。選択領域については、オブジェクト等の切り出したい領域に合致した領域ラベルを選択すれば良い。例えば、あらかじめ画像の重心付近に存在することがわかっているテクスチャ特徴優位なオブジェクトの切り出しを目的とする場合には、画像の重心に相当する座標に近い所定の領域のラベルを選択する方法が考えられる。 In step S8, the difference between the counter n and the upper limit M of the number of repetitions is determined. When the counter n is equal to the upper limit M of the number of repetitions (Y in step S8), a predetermined area (hereinafter referred to as a selection area) is selected (step S9), and the process proceeds to clustering after step S11. For the selected area, an area label that matches the area to be cut out, such as an object, may be selected. For example, for the purpose of extracting an object with a texture feature that is known to exist in the vicinity of the center of gravity of the image in advance, a method of selecting a label in a predetermined area close to the coordinates corresponding to the center of gravity of the image is considered. It is done.

一方、カウンタｎと繰り返し回数の上限Ｍが異なるときは（ステップＳ８のＮ）、ステップＳ１０に進む。ステップＳ１０では、カウンタｎと繰り返し回数の上限Ｎを比較する。カウンタｎが繰り返し回数の上限Ｎに達していないときは（ステップＳ１０のＹ）、選択領域内の特徴ベクトルからクラスタ中心を設定する（ステップＳ１１）。なお、クラスタ中心の設定は、k-meansアルゴリズムやＬＢＧアルゴリズムなどに基づいて行えば良い。また、クラスタ中心の数は任意である。 On the other hand, when the upper limit M of the number of repetitions is different from the counter n (N in step S8), the process proceeds to step S10. In step S10, the counter n is compared with the upper limit N of the number of repetitions. When the counter n has not reached the upper limit N of the number of repetitions (Y in step S10), the cluster center is set from the feature vector in the selected area (step S11). The cluster center may be set based on the k-means algorithm, the LBG algorithm, or the like. Further, the number of cluster centers is arbitrary.

続いて、全タイルを対象にクラスタリング処理を行い（ステップＳ１２）、カウンタｎを１インクリメントして（ステップＳ７）、ステップＳ２以降の処理を繰り返す。 Subsequently, a clustering process is performed on all tiles (step S12), the counter n is incremented by 1 (step S7), and the processes after step S2 are repeated.

ここで、ステップＳ１２におけるクラスタリング処理について説明する。図４は、部分空間法を用いたクラスタリング処理を概略的に示すフローチャートである。なお、当該クラスタリングの処理については、ＬＢＧアルゴリズムなど他の方法を用いても良い。図４に示すように、まず、選択領域内のクラスタを表す変数ｃ、タイルの配置を表す変数ｋおよび変数ｌをそれぞれ０に初期化する（ステップＳ４１）。 Here, the clustering process in step S12 will be described. FIG. 4 is a flowchart schematically showing clustering processing using the subspace method. For the clustering process, another method such as an LBG algorithm may be used. As shown in FIG. 4, first, a variable c representing a cluster in the selected region, a variable k representing a tile arrangement, and a variable l are each initialized to 0 (step S41).

次に、選択領域のクラスタｃについて、その固有値と固有ベクトルを求め、所定の上位固有値に対応する固有ベクトルによって部分空間を定める（ステップＳ４２）。 Next, the eigenvalue and eigenvector are obtained for the cluster c of the selected region, and a subspace is defined by the eigenvector corresponding to the predetermined higher eigenvalue (step S42).

その後、当該部分空間に基づいて、タイル（ｋ，ｌ）の特徴ベクトルとクラスタｃとの適合度ｒを測る（ステップＳ４３）。具体的には、当該部分空間を構成する固有ベクトルと当該特徴ベクトルとの内積を求めれば良い。なお、タイル（ｋ，ｌ）が選択領域に含まれる場合、ステップＳ４３の処理をスキップしても良い。 Thereafter, based on the partial space, the degree of matching r between the feature vector of the tile (k, l) and the cluster c is measured (step S43). Specifically, the inner product of the eigenvector and the feature vector constituting the partial space may be obtained. Note that when the tile (k, l) is included in the selection region, the process of step S43 may be skipped.

適合度ｒを算出後、その値を所定の閾値Ｒと比較する（ステップＳ４４）。適合度ｒが閾値Ｒに達しているときは（ステップＳ４４のＹ）、タイル（ｋ，ｌ）を、選択領域との類似性が高いタイル（以降、類似領域候補タイルと呼ぶ）であると判定し（ステップＳ４５）、ステップＳ４６に進む。一方、適合度ｒが閾値Ｒに達していないときは（ステップＳ４４のＮ）、そのままステップＳ４６に進む。 After calculating the fitness r, the value is compared with a predetermined threshold R (step S44). When the fitness r has reached the threshold value R (Y in step S44), it is determined that the tile (k, l) is a tile having high similarity to the selected region (hereinafter referred to as a similar region candidate tile). (Step S45), the process proceeds to Step S46. On the other hand, when the fitness r does not reach the threshold value R (N in step S44), the process directly proceeds to step S46.

ステップＳ４６では、変数ｋと水平方向のタイル数Ｋを比較する。変数ｋと水平方向のタイル数Ｋとの比較の結果、変数ｋが水平方向のタイル数Ｋに達していないときは（ステップＳ４６のＹ）、変数ｋを１インクリメントして（ステップＳ４７）、ステップＳ４２以降の処理を繰り返す。一方、変数ｋが水平方向のタイル数Ｋに達しているときは（ステップＳ４６のＮ）、ステップＳ４８に進む。 In step S46, the variable k is compared with the number of tiles K in the horizontal direction. As a result of the comparison between the variable k and the horizontal tile number K, if the variable k has not reached the horizontal tile number K (Y in step S46), the variable k is incremented by 1 (step S47). The processes after S42 are repeated. On the other hand, when the variable k has reached the number of tiles K in the horizontal direction (N in step S46), the process proceeds to step S48.

ステップＳ４８では、変数ｌと垂直方向のタイル数Ｌを比較する。変数ｌが垂直方向のタイル数Ｌに達していないときは（ステップＳ４８のＹ）、変数ｋに０を代入すると共に変数ｌを１インクリメントして（ステップＳ４９）、ステップＳ４２以降の処理を繰り返す。変数ｌが垂直方向のタイル数Ｌに達しているときは（ステップＳ４８のＮ）、ステップＳ５０に進む。 In step S48, the variable l is compared with the number of tiles L in the vertical direction. If the variable l has not reached the number of tiles L in the vertical direction (Y in step S48), 0 is substituted for the variable k and the variable l is incremented by 1 (step S49), and the processes in and after step S42 are repeated. When the variable l has reached the number of tiles L in the vertical direction (N in step S48), the process proceeds to step S50.

ステップＳ５０では、変数ｃとクラスタ数の上限Ｃを比較する。クラスタ数の上限Ｃの値は、設計者があらかじめ定める。変数ｃがクラスタ数の上限Ｃに達していないときは（ステップＳ５０のＹ）、変数ｃを１インクリメントして（ステップＳ５１）、ステップＳ４２以降の処理を繰り返す。 In step S50, the variable c is compared with the upper limit C of the number of clusters. The upper limit C of the number of clusters is predetermined by the designer. When the variable c has not reached the upper limit C of the number of clusters (Y in step S50), the variable c is incremented by 1 (step S51), and the processes after step S42 are repeated.

一方、変数ｃがクラスタ数の上限Ｃに達しているときは（ステップＳ５０のＮ）、抽出された類似領域候補タイルのうち、選択領域に直接もしくは間接的に隣接するものの集合を類似領域として設定する（ステップＳ５２）。ここで、間接的に隣接するとは、類似領域となるタイルを伝って選択領域に至ることができることを指す。ステップＳ５２の処理は、領域の空間的連続性を保存するために画像空間上で離れた位置にあるタイルを類似領域から除外するためのものであり、選択領域のタイルから隣接する類似領域候補タイルを再帰的に辿っていくことで容易に実行できる。 On the other hand, when the variable c has reached the upper limit C of the number of clusters (N in step S50), a set of extracted similar region candidate tiles that are directly or indirectly adjacent to the selected region is set as the similar region. (Step S52). Here, being indirectly adjacent means that the selected area can be reached through a tile that is a similar area. The process of step S52 is for excluding tiles located at positions distant from the image space from the similar region in order to preserve the spatial continuity of the region, and the similar region candidate tile adjacent to the selected region tile. Can be easily executed by recursively following.

このようにクラスタリング処理においては、ステップＳ１１で設定したクラスタ中心に基づいて選択領域内で部分空間を構成し、全タイルについて、各部分空間との適合度ｒを測り、選択領域外のタイルのうち、適合度ｒの値が閾値を上回るものを類似領域とする。ただし、選択領域に隣接しないタイルは除く。なお、類似領域となるタイルを伝って選択領域に至ることができるタイルは、選択領域に隣接するとみなす。 As described above, in the clustering process, a partial space is configured in the selected area based on the cluster center set in step S11, and the degree of matching r with each partial space is measured for all tiles. , Those having a fitness r value exceeding a threshold value are defined as similar regions. However, tiles that are not adjacent to the selected area are excluded. Note that a tile that can reach a selection area through a tile that is a similar area is considered to be adjacent to the selection area.

以上により、ステップＳ１２におけるクラスタリング処理が終了する。 Thus, the clustering process in step S12 ends.

一方、カウンタｎが繰り返し回数の上限Ｎに達しているときは（ステップＳ１０のＮ）、ステップＳ１３に進む。ステップＳ１３では、選択領域およびステップＳ１２のクラスタリング処理の結果から得られる所定のクラスタから成る領域（以降、類似領域と呼ぶ、また、選択領域と類似領域を合わせて中心領域と呼ぶ）と、その周辺の所定の領域（以降、周辺領域と呼ぶ）との境界を、必要に応じて変形し最適化する。 On the other hand, when the counter n has reached the upper limit N of the number of repetitions (N in step S10), the process proceeds to step S13. In step S13, a region composed of the selected region and a predetermined cluster obtained from the result of the clustering process in step S12 (hereinafter referred to as a similar region, and the selected region and the similar region are collectively referred to as a central region) and its surroundings The boundary with a predetermined area (hereinafter referred to as a peripheral area) is deformed and optimized as necessary.

ここで、ステップＳ１３における尤度関数を用いた領域境界最適化の処理について説明する。図５は、尤度関数を用いた領域境界最適化の処理を概略的に示すフローチャートである。なお、尤度関数の計算には、ガウスマルコフ確率場モデル（ガウシアンＭＲＦ（ＧaussianＭＲＦ：ＧＭＲＦ）モデル）を使用する。図５に示すように、まず、中心領域の画素（ｕ_ｉ，ｖ_ｉ）（（ｕ_ｉ，ｖ_ｉ）∈中心領域、ｉ＝０，１，・・・，Ｍｃ−１）に対応する変数ｅ（ｕ_ｉ，ｖ_ｉ）を１に、所定の周辺領域の画素（ｓ_ｊ，ｔ_ｊ）（（ｓ_ｊ，ｔ_ｊ）∈中心領域、ｊ＝０，１，・・・，Ｍａ−１）に対応する変数ｅ（ｓ_ｊ，ｔ_ｊ）を−１に初期化する（ステップＳ６１）。なお、周辺領域は、中心領域に直接もしくは間接的に隣接する任意のタイルによって構成すれば良い。 Here, the region boundary optimization process using the likelihood function in step S13 will be described. FIG. 5 is a flowchart schematically showing region boundary optimization processing using a likelihood function. The likelihood function is calculated using a Gaussian Markov random field model (Gaussian MRF (GMRF) model). As shown in FIG. 5, first, variables corresponding to pixels (u _i , v _i ) ((u _i , v _i ) ∈ center region, i = 0, 1,..., Mc−1) in the center region. e (u _i , v _i ) is 1, and pixel (s _j , t _j ) ((s _j , t _j ) ∈ central region, j = 0, 1,..., Ma−1 in a predetermined peripheral region The variable e (s _j , t _j ) corresponding to) is initialized to −1 (step S61). Note that the peripheral area may be constituted by an arbitrary tile that is directly or indirectly adjacent to the central area.

次に、変数ｅ'（ｘ_ｈ，ｙ_ｈ）にｅ（ｘ_ｈ，ｙ_ｈ）（ｅ（ｘ_ｈ，ｙ_ｈ）＝１またはｅ（ｘ_ｈ，ｙ_ｈ）＝−１、ｈ＝０，１，・・・，Ｍｃ＋Ｍａ−１）の値を代入し、ｅ（ｘ_ｈ，ｙ_ｈ）＝１となる領域の尤度をｐ１に、ｅ（ｘ_ｈ，ｙ_ｈ）＝−１となる領域の尤度をｐ−１にそれぞれ代入する。さらに、変数ｍ、変数ｗにそれぞれ０を代入する。（ステップＳ６２）。 Next, the variable e ′ (x _h , y _h ) is set to e (x _h , y _h ) (e (x _h , y _h ) = 1 or e (x _h , y _h ) = − 1, h = 0, 1,..., Mc + Ma−1), and the likelihood of the region where e (x _h , y _h ) = 1 is p1, and the region where e (x _h , y _h ) = − 1 Is assigned to p−1. Further, 0 is assigned to each of the variable m and the variable w. (Step S62).

その後、画素（ｘ_ｍ，ｙ_ｍ）が中心領域と周辺領域の境界に該当するか否かをｅ（ｘ_ｍ，ｙ_ｍ）の値に基づいて調べる（ステップＳ６３）。具体的には、ｅ（ｘ_ｍ，ｙ_ｍ）が近傍のｅ（ｘ_ｍ＋δ_ｉ，ｙ_ｍ＋δ_ｉ）（ｉ＝０，１，・・・，Ｉ）と異なるとき、画素（ｘ_ｍ，ｙ_ｍ）を境界と判定すれば良い。ここで、Iは近傍画素の数を表す。なお、近傍画素については、４近傍や８近傍を考えれば良い。 Thereafter, whether or not the pixel (x _m , y _m ) corresponds to the boundary between the central region and the peripheral region is examined based on the value of e (x _m , y _m ) (step S63). Specifically, when e (x _m , y _m ) is different from neighboring e (x _m + δ _i , y _m + δ _i ) (i = 0, 1,..., I), the pixel (x _m , y _m ) may be determined as the boundary. Here, I represents the number of neighboring pixels. As for neighboring pixels, it is only necessary to consider 4 neighborhoods or 8 neighborhoods.

画素（ｘ_ｍ，ｙ_ｍ）を境界と判定したときは（ステップＳ６３のＹ）、ｅ'（ｘ_ｍ，ｙ_ｍ）に−ｅ'（ｘ_ｍ，ｙ_ｍ）を代入し（ステップＳ６４）、ステップＳ６５に進む。一方、画素（ｘ_ｍ，ｙ_ｍ）を境界でないと判定したときは（ステップＳ６３のＮ）、ステップＳ６９に進む。 When it is determined that the pixel _(x m, _{y m)} and a boundary (Y in step S63), e substituting _(x m, _{y m)} _'(x m, _{y m)} -e to' (step S64), Proceed to step S65. On the other hand, when it is determined that the pixel (x _m , y _m ) is not a boundary (N in step S63), the process proceeds to step S69.

ステップＳ６５では、ｅ'（ｘ_ｈ，ｙ_ｈ）＝１となる領域の尤度をｐ'_１に、ｅ'（ｘ_ｈ，ｙ_ｈ）＝−１となる領域の尤度をｐ'_−１にそれぞれ代入する。 In step S65, the likelihood of the region where e ′ (x _h , y _h ) = 1 is set to p ′ ₁ , and the likelihood of the region where e ′ (x _h , y _h ) = − 1 is set to p ′ _−1. Respectively.

次に、ｐ'_１ｐ'_−１の値を、ｐ_１ｐ_−１の値と比較する（ステップＳ６６）。ｐ'_１ｐ'_−１がｐ_１ｐ_−１より大きいときは（ステップＳ６６のＹ）、ｅ（ｘ_ｍ，ｙ_ｍ）に−ｅ（ｘ_ｍ，ｙ_ｍ）を代入すると共にｗに１を代入し（ステップＳ６７）、ステップＳ６８に進む。一方、ｐ'_１ｐ'_−１がｐ_１ｐ_−１以下であるときは（ステップＳ６６のＮ）、そのままステップＳ６８に進む。 Next, the value of p ′ ₁ p ′ ₋₁ is compared with the value of p ₁ p ₋₁ (step S66). p When _'1 p' _-1 is greater than _p 1 _{p -1} is 1 to w with substitutes (Y in step S66), _{e (x} m, _{y m)} -e to _(x m, _{y m)} Substitute (step S67) and proceed to step S68. On the other hand, when p ′ ₁ p ′ ₋₁ is equal to or less than p ₁ p ₋₁ (N in step S66), the process proceeds to step S68 as it is.

ステップＳ６８では、ｅ'（ｘ_ｍ，ｙ_ｍ）に−ｅ'（ｘ_ｍ，ｙ_ｍ）を代入する。 In step S68, substitutes _{_{(x m, y m) '}} -e to _{_{(x m, y m)'}} e.

ステップＳ６９では、変数ｍを、中心領域の画素数Ｍｃと周辺領域の画素数Ｍａの合計と比較する。変数ｍがＭｃ＋Ｍａに達していないときは（ステップＳ６９のＹ）、変数ｍを１インクリメントして（ステップＳ７０）、ステップＳ６３以降の処理を繰り返す。一方、変数ｍがＭｃ＋Ｍａに達しているときは（ステップＳ６９のＮ）、ステップＳ７１に進む。 In step S69, the variable m is compared with the sum of the number of pixels Mc in the central region and the number of pixels Ma in the peripheral region. When the variable m does not reach Mc + Ma (Y in step S69), the variable m is incremented by 1 (step S70), and the processes after step S63 are repeated. On the other hand, when the variable m has reached Mc + Ma (N in step S69), the process proceeds to step S71.

ステップＳ７１では、変数ｗと変数ｌの異同を判定する。変数ｗが変数ｌに等しいときは（ステップＳ７１のＹ）、ステップＳ６２以降の処理を繰り返す。一方、変数ｗが変数ｌでないときは（ステップｓ７１のＮ）、処理を終了する。 In step S71, the difference between the variable w and the variable l is determined. When the variable w is equal to the variable l (Y in step S71), the processes after step S62 are repeated. On the other hand, when the variable w is not the variable l (N in step s71), the process is terminated.

このように空間−周波数領域において、中心領域の各クラスタおよび周辺領域の各クラスタそれぞれにガウスマルコフ確率場モデル（ガウシアンＭＲＦ（ＧaussianＭＲＦ：ＧＭＲＦ）モデル）をあてはめ、中心領域の境界画素に対するラベルを最適化する。つまり、画像のウェーブレット係数を確率過程とみなした上で各境界画素について、各モデルパラメータに基づく確率を求め、それが最大となるモデルに対応する領域ラベルを付与する処理を繰り返し行うことで、領域境界の最適化を行う。 In this way, in the space-frequency domain, a Gaussian Markov random field model (Gaussian MRF (GMRF) model) is applied to each cluster in the central region and each cluster in the peripheral region, and labels for boundary pixels in the central region are optimized. To do. In other words, after considering the wavelet coefficient of the image as a stochastic process, the probability based on each model parameter is obtained for each boundary pixel, and the region label corresponding to the model with the maximum is repeatedly performed, thereby performing region processing. Perform boundary optimization.

ここで、図６はウェーブレットフレームを用いた注目テクスチャ切出しの一例を示す説明図である。ここでは、図６（ａ）のサンプル画像に対して、シャツのオブジェクトを切出すことを想定し、上述した処理を実行した。選択領域については、画像の重心に相当する座標に最も近い領域のラベルを選んだ。特徴ベクトルの要素には、空間−周波数領域における各成分の平均エネルギーを採用した。クラスタリングの際のクラスタ中心については、その数を３とし、k-means法により決定した。なお、領域境界の最適化を行う際には、中心領域全体および周辺領域全体をそれぞれ一つのクラスタとして扱った。図６（ｂ）は切出された領域である。図６（ｂ）に示された領域は、注目オブジェクトを全体的にカバーしているだけでなく、境界のひずみも少ない。ただし、一部ではあるが、シャツに切込みが入ったような境界が見られる。これについては、境界の形状に対する制約やモデル等を新たに設けることによって改善が期待できる。 Here, FIG. 6 is an explanatory diagram showing an example of extracting a texture of interest using a wavelet frame. Here, it is assumed that a shirt object is cut out from the sample image in FIG. For the selected area, the label of the area closest to the coordinates corresponding to the center of gravity of the image was selected. The average energy of each component in the space-frequency domain was adopted as the feature vector element. The number of cluster centers at the time of clustering was set to 3 and determined by the k-means method. When the region boundary was optimized, the entire central region and the entire peripheral region were each treated as one cluster. FIG. 6B shows a cut out region. The area shown in FIG. 6B not only covers the object of interest as a whole, but also has less boundary distortion. However, although it is a part, there is a boundary where the shirt is cut. In this regard, improvement can be expected by newly providing constraints on the shape of the boundary, a model, and the like.

このように本実施の形態においては、ウェーブレットフレームを用いた多様なスケールでの空間−周波数解析に基づく粗い領域分割の後、領域境界の最適化を行う。これにより、解析精度が高く境界のひずみも少ないオブジェクト切出しを行うことができる。 As described above, in the present embodiment, the region boundary is optimized after rough region division based on space-frequency analysis at various scales using wavelet frames. This makes it possible to perform object extraction with high analysis accuracy and little boundary distortion.

本発明の実施の一形態の画像解析装置のハードウェア構成を概略的に示すブロック図である。It is a block diagram which shows roughly the hardware constitutions of the image analysis apparatus of one Embodiment of this invention. 画像解析装置が実行する処理を概略的に示すフローチャートである。It is a flowchart which shows roughly the process which an image analysis apparatus performs. 領域ラベル設定処理を概略的に示すフローチャートである。It is a flowchart which shows an area | region label setting process roughly. 部分空間法を用いたクラスタリング処理を概略的に示すフローチャートである。It is a flowchart which shows roughly the clustering process using a subspace method. 尤度関数を用いた領域境界最適化の処理を概略的に示すフローチャートである。It is a flowchart which shows roughly the process of area | region boundary optimization using a likelihood function. ウェーブレットフレームを用いた注目テクスチャ切出しの一例を示す説明図である。It is explanatory drawing which shows an example of the attention texture extraction using a wavelet frame.

Explanation of symbols

１画像解析装置
７記憶媒体
1 Image analysis device 7 Storage medium

Claims

Area dividing means for dividing the image into predetermined areas;
Multi-resolution analysis means for repeatedly executing multi-resolution analysis by a wavelet frame to a predetermined level for each of the areas divided by the area dividing means;
Feature extraction means for extracting a predetermined feature amount from the wavelet coefficients obtained by the multi-resolution analysis means;
Based on the feature amount extracted by the feature extraction unit, an error of the feature amount between neighboring regions is calculated, and the region and a region located in the vicinity of the region are determined according to the similarity of the calculated error By assigning the same label to a region label setting means for setting a label for each region,
A region with a label that matches a predetermined region is set as a selection region, a predetermined feature amount in the selection region is set as a cluster center, a partial space is defined in the selection region based on the cluster center, and the partial space and each Clustering means for measuring the degree of fitness with a region, and clustering into similar regions similar to the selected region according to the similarity of the fitness ,
A Markov random field model is introduced into each of the center region including the similar region composed of the selected region and the predetermined cluster obtained by the clustering, and the peripheral region that is a peripheral region thereof, and the likelihood of the region based on the model is determined. Region boundary optimization means for optimizing a boundary shape between the central region and the peripheral region by searching for a boundary having a maximum degree ;
An image analysis apparatus comprising:

Recursively executing the region dividing unit, the multi-resolution analyzing unit, the feature extracting unit, and the region label setting unit until a predetermined end condition is satisfied,
The image analysis apparatus according to claim 1.

Recursively executing the region dividing unit, the multi-resolution analyzing unit, the feature extracting unit, and the clustering unit until a predetermined end condition is satisfied,
The image analysis apparatus according to claim 1.

The area dividing means divides a target image into a predetermined number of rectangular areas with a predetermined size,
The image analysis apparatus according to claim 1.

The feature extraction means uses a vector having each of the average energy obtained for each predetermined component from each predetermined decomposition level as a feature amount.
The image analysis apparatus according to claim 1.

Installed on the computer,
An area dividing function for dividing an image into predetermined areas;
A multi-resolution analysis function that repeatedly executes a multi-resolution analysis by a wavelet frame to a predetermined level for each of the areas divided by the area division function;
A feature extraction function for extracting a predetermined feature amount from a wavelet coefficient obtained by the multi-resolution analysis function;
Based on the feature amount extracted by the feature extraction function, an error of the feature amount between neighboring regions is calculated, and the region is located in the vicinity of the region according to the similarity of the calculated error A region label setting function for setting a label for each region by assigning the same label to
A region with a label that matches a predetermined region is set as a selection region, a predetermined feature amount in the selection region is set as a cluster center, a partial space is defined in the selection region based on the cluster center, and the partial space and each A clustering function that measures the degree of fitness with a region, and clusters the similar regions similar to the selected region according to the similarity of the fitness ,
A Markov random field model is introduced into each of the center region including the similar region composed of the selected region and the predetermined cluster obtained by the clustering, and the peripheral region that is a peripheral region thereof, and the likelihood of the region based on the model is determined. A region boundary optimization function that optimizes a boundary shape between the central region and the peripheral region by searching for a boundary having the maximum degree ;
Image analysis program that causes a computer to execute.

Causing the computer to recursively execute the region division function, the multi-resolution analysis function, the feature extraction function, and the region label setting function until a predetermined end condition is satisfied,
The image analysis program according to claim 6 .

Causing the computer to recursively execute the region division function, the multi-resolution analysis function, the feature extraction function, and the clustering function until a predetermined termination condition is satisfied,
The image analysis program according to claim 6 .

Storage medium for storing the image analysis program according to any one of claims 6-8.

An area dividing step for dividing the image into predetermined areas;
A multi-resolution analysis step for repeatedly executing a multi-resolution analysis by a wavelet frame to a predetermined level for each of the regions divided by the region division step;
A feature extraction step for extracting a predetermined feature amount from the wavelet coefficients obtained by the multi-resolution analysis step;
Based on the feature amount extracted by the feature extraction step, the error of the feature amount between neighboring regions is calculated, and the region is located in the vicinity of the region according to the similarity of the calculated error An area label setting step for setting a label for each area by giving the same label to
A region with a label that matches a predetermined region is set as a selection region, a predetermined feature amount in the selection region is set as a cluster center, a partial space is defined in the selection region based on the cluster center, and the partial space and each A clustering step of measuring the degree of fitness with a region and clustering into similar regions similar to the selected region according to the similarity of the fitness ,
A Markov random field model is introduced into each of the center region including the similar region composed of the selected region and the predetermined cluster obtained by the clustering, and the peripheral region that is a peripheral region thereof, and the likelihood of the region based on the model is determined. A region boundary optimization step for optimizing a boundary shape between the central region and the peripheral region by searching for a boundary having a maximum degree ;
An image analysis method comprising:

Recursively executing the region dividing step, the multi-resolution analysis step, the feature extraction step, and the region label setting step until a predetermined end condition is satisfied,
The image analysis method according to claim 10 .

Recursively executing the region dividing step, the multi-resolution analysis step, the feature extraction step, and the clustering step until a predetermined end condition is satisfied,
The image analysis method according to claim 10 .

The region dividing step divides a target image into a predetermined number of rectangular regions with a predetermined size,
Image analysis method of any one of claims 10 to 12.

The feature extraction step uses a vector having each of the average energy obtained for each predetermined component from each predetermined decomposition level as a feature amount.
Image analysis method of any one of claims 10 to 12.