JPH0822544A

JPH0822544A - Picture recognition method

Info

Publication number: JPH0822544A
Application number: JP6158394A
Authority: JP
Inventors: Satoshi Nakagawa; 聰中川; Yuji Kuno; 裕次久野; Takahiro Watanabe; 孝弘渡辺; Yoshinori Shimosakota; 義則下迫田
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1994-07-11
Filing date: 1994-07-11
Publication date: 1996-01-23

Abstract

PURPOSE:To obtain the accurate position of a structure model. CONSTITUTION:The operation is started from a given initial position in a step S301, and a position of energy minimizing is obtained by search of the vicinity of one point in the energy minimizing processing for search of the vicinity of one point in a step S302. The model is moved so as to reduce the energy in a step S303 if it is simultaneously moved to vicinities of two points. In the convergence discrimination of a step S304, the operation is returned to the step S302 to try to minimize the energy if the combination to reduce the energy is found in the step S303; but convergence is discriminated and the position of the model at this time is defined as the position of energy minimizing in a step S305 if it is not found in the step S303.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、入力された画像中の対
象物の輪郭等の構造を認識する画像認識方法に関するも
のである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image recognition method for recognizing a structure such as a contour of an object in an input image.

【０００２】[0002]

【従来の技術】従来、このような分野の技術としては、
例えば、次のような文献に記載されるものがあった。文献；ファーストインターナショナルコンファレン
スオンコンピュータビジョン（First Internat
ional Conference on Computer Vision)（１９８７）Ｉ
ＥＥＥ（米）Michael Kass,Andrew Witkin,andDemetri
Terzopoulos 著"Snakes:Active contour models"Ｐ．２
５９−２６８従来、画像認識方法として例えば輪郭抽出方法には、前
記文献に記載されるように、動的輪郭モデルによって入
力画像の輪郭を抽出する方法がある。動的輪郭モデル
は、線やエッジのような画像特徴に引き寄せられる拘束
条件と形状に関しての拘束条件を満たすような閉曲線
を、エネルギー最小化によって求める輪郭抽出方法であ
る。以下、従来の動的輪郭モデルによる輪郭抽出方法に
ついて説明する。2. Description of the Related Art Conventionally, techniques in such a field include:
For example, some documents were described in the following documents. Literature; First International Conference on Computer Vision (First Internat
ional Conference on Computer Vision) (1987) I
EEE (US) Michael Kass, Andrew Witkin, and Demetri
Terzopoulos "Snakes: Active contour models" P. Two
59-268 Conventionally, as an image recognition method, for example, a contour extraction method includes a method of extracting a contour of an input image by an active contour model as described in the above-mentioned document. The dynamic contour model is a contour extraction method that obtains a closed curve satisfying a constraint condition attracted to image features such as lines and edges and a constraint condition regarding a shape by energy minimization. Hereinafter, a conventional contour extraction method using an active contour model will be described.

【０００３】図２は、従来の動的輪郭モデルによる輪郭
抽出方法の処理内容を示す図である。図３は、図２にお
いてエネルギーが最小値に収束するのに伴い、閉曲線が
画像特徴に引き付けられて行く様子を示す動的輪郭モデ
ルの概念図である。なお、以下の説明では、符号の前に
「ｖ」を付してベクトル記号を表すことにする。図中で
は太字で表す。図３に示すように、図２中の原画像デー
タ１０より、抽出したい輪郭線のモデルである閉曲線
（以下、snake と記す）の位置を、画像上の座標系ｘ，
ｙにおいて、ｎ個の節点ｖｖ_ｉ＝（ｘ_ｉ，ｙ_ｉ），ｉ＝
１，…，ｎで表す。但し、閉曲線であるのでｖｖ_i+n＝
ｖｖ_ｉとする。この輪郭線のモデルsnake に対して次式
（１）のようなエネルギー関数Ｅ^* _sn _akeの定義４０を
与える。FIG. 2 is a diagram showing the processing contents of a conventional contour extraction method using an active contour model. FIG. 3 is a conceptual diagram of an active contour model showing how a closed curve is attracted to image features as energy converges to a minimum value in FIG. In the following description, "v" is added before the symbol to represent the vector symbol. It is shown in bold type in the figure. As shown in FIG. 3, from the original image data 10 in FIG. 2, the position of the closed curve (hereinafter, referred to as snake) that is a model of the contour line to be extracted is determined by the coordinate system x,
In y, n nodes vv _i = (x _i , y _i ), i =
It is represented by 1, ..., N. However, since it is a closed curve, vv _{i + n} =
vv _i . Energizing function E ^* _sn _ake definitions 40, such as: (1) the model snake of the contour.

【数１】（１）式のＥ_intはsnake の形状に関する拘束条件を表
す内部エネルギーであり、次式（２）のように表せる。[Equation 1] E _{int in the} equation (1) is an internal energy that represents a constraint condition regarding the shape of the snake, and can be expressed as the following equation (2).

【数２】（２）式の各項は１階及び２階の連続性に関する項であ
り、第１項はsnake が短くなるほど、第２項はsnake が
滑らかになるほど小さくなるエネルギーである（但し、
α_i＞０，β_i＞０の場合）。（１）式のＥ_extは、sn
ake が画像特徴に引き寄せられる拘束条件を表す外部エ
ネルギーであり、例えば、次式（３）のように定義す
る。Ｅ_ext(i) ＝ｗ_edge（−｜▽Ｉ（ｘ_i，ｙ_i）｜²）・・・（３）但し、ｗ_edge；正の係数Ｉ（ｘ_i，ｙ_i）；画像の濃度外部エネルギーＥ_extを（３）式のように定義すれば、
snake が画像のエッジに近付くほど、Ｅ_extが小さくな
る。[Equation 2] Each term in the equation (2) is a term related to the continuity of the first and second orders, and the first term is energy that becomes smaller as the snake becomes shorter and the second term becomes smaller as the snake becomes smoother (however,
If α _i > 0, β _i > 0). E _ext of the equation (1) is sn
ake is an external energy that represents a constraint condition that is attracted to the image feature, and is defined by the following equation (3), for example. E _ext (i) = w _edge (− | ▽ I (x _i , y _i ) | ² ) (3) where w _edge ; positive coefficient I (x _i , y _i ); image density external If the energy E _ext is defined as in equation (3),
The closer the snake is to the edge of the image, the smaller E _ext .

【０００４】以上のように、形状に関する拘束条件と画
像特徴に関する拘束条件を満たす場合にエネルギーが最
小になるようなエネルギー関数を定義し、このエネルギ
ー関数を最小にするようなsnake の位置を求めることに
よって輪郭抽出を行うのが動的輪郭モデルである。例え
ば、（２）式で表されるようなsnake の形状が「滑らか
である」という形状の拘束条件と、（３）式で表される
ようなsnake が「画像のエッジ上にある」という画像特
徴の拘束条件を与えた場合、画像中の対象物の輪郭に一
致する滑らかな輪郭線が抽出される。（１）式のエネル
ギー関数Ｅ^* _snakeを最小化するようなsnake の位置
は、図２のエネルギー最小化処理２０において、変分法
により、次式（４）の方程式を解くことによって得られ
る。As described above, the energy function that minimizes the energy when the constraint condition regarding the shape and the constraint condition regarding the image feature are satisfied is defined, and the position of the snake that minimizes this energy function is determined. The active contour model is used to extract the contours. For example, the constraint condition that the shape of the snake is “smooth” as expressed by the expression (2) and the image that the snake as expressed by the expression (3) is “on the edge of the image”. When the feature constraint condition is given, a smooth contour line that matches the contour of the object in the image is extracted. The position of the _snake that minimizes the energy function E ^* _snake of the equation (1) is obtained by solving the equation of the following equation (4) by the variation method in the energy minimization process 20 of FIG.

【数３】（４）式は、行列形式で次式（５）、（６）のように表
せる。なお、以下の説明では、符号の前に「Ｖ」を付し
て行列記号を表すことにする。図中では太字で表す。(Equation 3) The equation (4) can be expressed in matrix form as the following equations (5) and (6). In the following description, “V” is added before the code to represent the matrix symbol. It is shown in bold type in the figure.

【０００５】ＶＡ・ＶＸ＋ＶＦ_x（ＶＸ，ＶＹ）＝０・・・（５）ＶＡ・ＶＹ＋ＶＦ_y（ＶＸ，ＶＹ）＝０・・・（６）但し、ＶＸ＝（ｘ₁・・・ｘ_n）^T ＶＹ＝（ｙ₁・・・ｙ_n）^T ＶＦ_x＝（ｆ_x(1) ・・・ｆ_x(n))^T ＶＦ_y＝（ｆ_y(1) ・・・ｆ_y(n))^T ＶＡ；α_i，β_iのみから決まるｎ×ｎの係数行列この方程式（５），（６）は、図２の初期位置データ５
０として与えられる初期位置ＶＸ₀，ＶＹ₀から、次式
（７），（８）の反復計算によって解くことができ、そ
れを輪郭抽出結果３０として出力する。ＶＸ_t＝（ＶＡ＋γ・ＶＩ）^-1 ・（γ・ＶＸ_t-1−ＶＦ_x（ＶＸ_t-1，ＶＹ_t-1））・・・（７）ＶＹ_t＝（ＶＡ＋γ・ＶＩ）^-1 ・（γ・ＶＹ_t-1−ＶＦ_y（ＶＸ_t-1，ＶＹ_t-1））・・・（８）但し、ＶＩ；ｎ×ｎの単位行列ｔ；繰り返し数 γ；更新ステップのステップサイズを決める係数以上のような従来の動的輪郭モデルによる輪郭抽出方法
における図２のエネルギー最小化処理２０のフローチャ
ートを図４に示す。このエネルギー最小化処理２０で
は、まず、初期位置処理２１で、図２の初期位置データ
５０よりsnake の初期値ＶＸ₀，ＶＹ₀を与え、外部エ
ネルギー勾配の計算処理２２へ進む。外部エネルギー勾
配の計算処理２２では、snake 上の全節点での外部エネ
ルギーの勾配ＶＦ_x，ＶＦ_yを、図２中のエネルギー関
数の定義４０をもとに、入力された原画像１０より計算
し、座標値の更新処理２３へ進む。座標値の更新処理２
３では、（７），（８）式を用いて、更新されたsnake
の位置ＶＸ_t，ＶＹ_tを求め、収束判定処理２４へ進
む。収束判定処理２４では、各節点の更新された量｜Ｖ
Ｘ_t−ＶＸ_t-1｜，｜ＶＹ_t−ＶＹ_t-1｜が十分０に近
付いていれば、計算が収束したと判定し、そのときのＶ
Ｘ_t，ＶＹ_tを処理２５でエネルギー最小化値とし、図
２中の輪郭抽出結果３０として出力する。これに対して
収束判定処理２４で、収束したと判定されなければ、外
部エネルギー勾配の計算処理２２へ戻り、前記の処理を
繰り返す。以上の処理により、（１）式のエネルギー関
数Ｅ^* _snakeを最小化するsnakeの位置が得られる。こ
のときのsnake は、「滑らかで画像のエッジ上にある」
といった形状の拘束条件と画像特徴の拘束条件を満たす
輪郭線となるため、図３に示すように画像中の対象物の
輪郭に一致する。VA · VX + VF _x (VX, VY) = 0 (5) VA · VY + VF _y (VX, VY) = 0 (6) where VX = (x ₁ ... x _n ). ^T VY = (y ₁ ... Y _n ) ^T VF _x = (f _x (1) ・・・ f _x (n)) ^T VF _y = (f _y (1) ・・・_fy (n)) ^T VA; n × n coefficient matrix determined only by α _i and β _i The equations (5) and (6) are the initial position data 5 of FIG.
The initial positions VX ₀ and VY ₀ given as ₀ can be solved by iterative calculation of the following equations (7) and (8), which are output as the contour extraction result 30. VX _t = (VA + γ · VI) ⁻¹ · (γ · VX _t−1 −VF _x (VX _t−1 , VY _t−1 )) (7) VY _t = (VA + γ · VI) ⁻¹ · (Γ · VY _t-1 −VF _y (VX _t-1 , VY _t-1 )) (8) where VI: n × n identity matrix t; number of iterations γ; update step size FIG. 4 shows a flowchart of the energy minimization processing 20 of FIG. 2 in the contour extraction method using the conventional active contour model as described above. In the energy minimization process 20, first, in the initial position process 21, the initial values VX ₀ and VY ₀ of the snake are given from the initial position data 50 of FIG. 2, and the process proceeds to the external energy gradient calculation process 22. In the external energy gradient calculation process 22, the external energy gradients VF _x and VF _y at all nodes on the snake are calculated from the input original image 10 based on the definition 40 of the energy function in FIG. , And proceeds to coordinate value update processing 23. Coordinate value update process 2
3 uses the formulas (7) and (8) to update the updated snake.
The positions VX _t and VY _t are calculated and the process proceeds to the convergence determination process 24. In the convergence determination processing 24, the updated amount | V of each node
_{_{X t -VX t-1 |,}} | VY t -VY t-1 | if the long close enough 0, it is determined that the calculation has converged, V at that time
X _t, and energy minimization value processing 25 VY _t, and outputs the contour extraction result 30 in FIG. On the other hand, if the convergence determination processing 24 does not determine that the convergence has occurred, the process returns to the external energy gradient calculation processing 22 and the above processing is repeated. By the above processing, the position of the snake that minimizes the energy function E ^* _snake of the equation (1) is obtained. The snake at this time is "smooth and is on the edge of the image."
As described above, the contour line satisfies the constraint condition of the shape and the constraint condition of the image feature, and thus the contour line matches the contour of the object in the image as shown in FIG.

【０００６】[0006]

【発明が解決しようとする課題】しかしながら、従来の
輪郭抽出方法においては、次のような課題があり、それ
を解決することが困難であった。（ａ）従来の輪郭抽出方法では、輪郭線のみを認識対
象としており、認識対象を表現するモデルは閉曲線に限
定していた。このため、閉曲線だけでは表現できないよ
うな、より複雑な構造を持つ対象の認識への応用ができ
なかった。（ｂ）複眼視画像や動画像のように、複数枚の画像か
らの認識に応用する場合は、個々の画像での従来の輪郭
抽出方法による結果を統合するための別の機構が必要で
あった。（ｃ）前記（ａ）、（ｂ）のような問題を解決しよう
とすれば、さらに複雑な処理が必要となるが、これを効
率的に処理することは困難であり、十分な処理速度が得
られない。前記（ａ）〜（ｃ）のような課題を解決すべく本出願人
等は、先に特願平５−２４６０４明細書（以下、先の提
案と呼ぶ）において、閉曲線に限らない、より複雑な構
造を持つような入力画像を認識対象とし、複数枚の画像
からの認識も統一的に処理し、さらに、このような拡張
された認識方法を効率的に処理可能な画像認識方法を提
案した。すなわち、エネルギー最小化によって入力画像
の画像認識を行う画像認識方法において、認識対象の構
造のモデルを、任意枚数の画像上の任意個の点で表され
る構造モデルとして定義し、構造モデルが画像特徴に引
き寄せられる拘束条件と、該構造モデルの構造に関する
拘束条件を満たす場合にエネルギーが最小となるような
エネルギー関数を定義し、このエネルギー関数を最小化
する構造モデルの位置を求めることにより入力画像から
任意の構造を抽出するようにしている。しかしながら、
先の提案では、エネルギー関数を最小化する構造モデル
の位置を１点の近傍点へ移動した場合に、エネルギーが
減少するような１点の近傍の探索処理しか行わず、その
ため正確な構造モデルの位置が求められなかったという
問題があった。However, the conventional contour extraction method has the following problems and it is difficult to solve them. (A) In the conventional contour extraction method, only the contour line is the recognition target, and the model expressing the recognition target is limited to the closed curve. Therefore, it cannot be applied to the recognition of objects with more complicated structures that cannot be represented by closed curves alone. (B) When applied to recognition from a plurality of images such as a compound-eye image or a moving image, another mechanism for integrating the results of the conventional contour extraction method for each image is necessary. It was (C) In order to solve the above problems (a) and (b), more complicated processing is required, but it is difficult to process this efficiently, and sufficient processing speed is required. I can't get it. In order to solve the above problems (a) to (c), the present applicants have previously described in Japanese Patent Application No. 5-24604 (hereinafter, referred to as the above-mentioned proposal) not only a closed curve but also a more complicated one. We proposed an image recognition method that can process input images with various structures, recognizes multiple images uniformly, and can efficiently process such an extended recognition method. . That is, in the image recognition method for recognizing the input image by energy minimization, the model of the structure to be recognized is defined as the structure model represented by the arbitrary number of points on the arbitrary number of images, and the structural model is the image. The input image is obtained by defining a constraint condition attracted to the feature and an energy function that minimizes energy when the constraint condition regarding the structure of the structural model is satisfied, and determining the position of the structural model that minimizes this energy function. Any structure is extracted from. However,
In the previous proposal, when the position of the structural model that minimizes the energy function is moved to a point close to one point, only the search processing of the vicinity of one point that reduces the energy is performed, and therefore the accurate structural model There was a problem that the position was not requested.

【０００７】[0007]

【課題を解決するための手段】第１の発明は、前記課題
を解決するために、入力画像中の認識対象の構造を表現
するモデルを画像上の任意個の点で表されるモデルとし
て定義し、前記モデルが画像特徴に引き寄せられる拘束
条件と、該モデルの構造に関する拘束条件を満たす場合
にエネルギーが最小となるようなエネルギー関数を定義
し、前記エネルギー関数を最小化する前記モデルの位置
を求めることにより、前記入力画像から任意の構造を抽
出する画像認識方法において、以下のように構成する。
すなわち、前記エネルギー関数を最小化する前記モデル
の位置を、前記モデル上の１点の近傍点へ移動した場合
にエネルギーが減少するような移動をする１点の近傍の
探索処理及び前記モデル上の複数の点をそれぞれの近傍
点へ同時に移動した場合にエネルギーが減少するような
移動をする複数の点の近傍点への同時探索処理を行うこ
とにより求める。第２の発明は、第１の発明において、
まず前記１点の近傍の探索処理によってエネルギー最小
化処理を行い、その後前記モデル上の２点の近傍点へ同
時に移動した場合にエネルギーが減少するような移動を
行い、該移動があれば前記１点の近傍の探索処理に戻り
処理を繰り返すことによってエネルギー最小化を行うよ
うにしている。第３の発明は、第１の発明の複数の点の
近傍点への同時探索処理において、エネルギーの変化量
を、探索する複数の点の近傍への移動量の評価関数とし
て表し、前記評価関数に基づいてエネルギーが減少する
ような移動をするようにしている。第４の発明は、第３
の発明の複数の点の近傍点への同時探索処理において、
探索する複数の点の近傍への移動量の関数として表され
たエネルギーの変化量のうち、それぞれの点の１点の移
動量のみに依存した項を探索を開始する前にあらかじめ
計算して記憶しておき、前記記憶した１点の移動量のみ
に依存した項を前記複数の点の近傍点への同時探索処理
において用いるようにしている。In order to solve the above-mentioned problems, the first invention defines a model expressing the structure of a recognition target in an input image as a model represented by arbitrary points on the image. Then, the constraint condition that the model is attracted to the image feature and the energy function that minimizes the energy when the constraint condition regarding the structure of the model are satisfied are defined, and the position of the model that minimizes the energy function is defined. An image recognition method for extracting an arbitrary structure from the input image by obtaining the image is configured as follows.
That is, when the position of the model that minimizes the energy function is moved to a point near one point on the model, the search process for the neighborhood of one point that moves such that the energy decreases and the position on the model It is obtained by performing simultaneous search processing to the neighboring points of a plurality of points that move so that the energy decreases when the plurality of points move to their neighboring points at the same time. The second invention is the same as the first invention,
First, energy minimization processing is performed by search processing in the vicinity of the one point, and then movement is performed so that energy is reduced when moving to two neighboring points on the model at the same time. The energy is minimized by returning to the search processing in the vicinity of the point and repeating the processing. A third aspect of the present invention represents the amount of change in energy as an evaluation function of the amount of movement to the vicinity of a plurality of points to be searched in the simultaneous search processing of the plurality of points into the vicinity of the point, the evaluation function Based on the above, the movement is such that the energy decreases. The fourth invention is the third invention.
In the simultaneous search process of a plurality of points to neighboring points of the invention of
Of the amount of change in energy expressed as a function of the amount of movement to the vicinity of multiple points to be searched, terms that depend only on the amount of movement of one point at each point are calculated and stored in advance before starting the search. In addition, the term that depends only on the stored amount of movement of one point is used in the simultaneous search process for the neighboring points of the plurality of points.

【０００８】[0008]

【作用】第１の発明によれば、以上のように画像認識方
法を構成したので、エネルギー関数を最小化するモデル
の位置を、１点の近傍点へ移動した場合にエネルギーが
減少するような１点の近傍の探索処理により求める。こ
の１点の近傍の探索処理により求められたモデルの位置
では、複数の点をそれぞれの近傍点へ同時に移動した場
合にエネルギーが減少する場合が探索されていないの
で、正確なモデルの位置が求められていない。そこで、
複数の点をそれぞれの近傍点へ同時に移動した場合にエ
ネルギーが減少するような移動をも行い正確なモデルの
位置を求める。第４の発明によれば、複数の点の近傍点
への同時探索処理において、エネルギーの変化量を探索
する複数の点の近傍への移動量の関数として表し、それ
ぞれの点の１点の移動量のみに依存した項を探索を開始
する前にあらかじめ計算しておき、同時探索処理におい
てそれを用いる。例えば、探索する近傍点の個数がｍ個
で、２点の近傍点への同時探索処理を行う場合では、ｍ
×ｍ回の探索処理が必要となるが、それぞれの点の１点
の移動量のみに依存した項のｍ通りの値を探索を開始す
る前にあらかじめ計算しておき、同時探索処理において
それを使用するので、処理効率が向上する。従って、前
記課題を解決できるのである。According to the first aspect of the invention, since the image recognition method is configured as described above, the energy is reduced when the position of the model that minimizes the energy function is moved to one neighboring point. It is obtained by a search process in the vicinity of one point. At the position of the model obtained by the search processing of the vicinity of this one point, the case where the energy decreases when a plurality of points are simultaneously moved to their respective neighboring points has not been searched, so the accurate position of the model is obtained. Has not been done. Therefore,
An accurate model position is also obtained by moving so that the energy decreases when multiple points are moved to their neighboring points at the same time. According to the fourth invention, in the simultaneous search process of a plurality of points in the vicinity, the amount of change in energy is expressed as a function of the amount of movement of the plurality of points to be searched in the vicinity, and the movement of one point of each point. The term that depends only on the quantity is calculated in advance before starting the search, and is used in the simultaneous search processing. For example, when the number of neighboring points to be searched is m and the simultaneous search processing is performed for two neighboring points, m
It requires xm times of search processing, but m values of terms that depend only on the amount of movement of one point of each point are calculated in advance before starting the search, and are calculated in the simultaneous search processing. Since it is used, the processing efficiency is improved. Therefore, the above problem can be solved.

【０００９】[0009]

【実施例】本実施例では、まず、先の提案の画像認識方
法の実施例の概略（Ｉ）と、その問題点（II）とを説明
した後、その問題点を解決するための本発明の画像認識
方法の実施例(III）を説明する。（Ｉ）先の提案の画像認識方法の実施例先の提案では、従来の動的輪郭モデルによる輪郭抽出方
法のように、認識する対象を表現するモデルを閉曲線に
限定せず、より複雑な対象の構造を表現できる動的な構
造のモデル（以下、動的モデルと呼ぶ）を考える。動的
モデルは、入力画像中の認識対象の輪郭等の構造を画像
上の任意個の点のモデルで表現し、それぞれの点が入力
画像の画像特徴に引き寄せられる拘束条件と各点の位置
関係に関する拘束条件とを満たすようなモデルの位置を
エネルギー最小化によって求める画像認識方法である。
動的モデルでは、認識対象の輪郭等の構造を、画像上の
ｎ個の点で表されているようなモデル（以下、model と
呼ぶ）で表現する。このｎ個の点を画像座標系ｘ，ｙ
で、ｖｖ_i＝（ｖ_1i，ｖ_2i）＝（ｘ_i，ｙ_i），ｉ＝
１，…，ｎと表す。このmodel が画像特徴に引き寄せら
れる拘束条件と、model の構造に関する拘束条件を満た
す場合にエネルギーを最小化するmodel の位置を求める
ことにより構造の抽出を行う。EXAMPLE In this example, first, an outline (I) of the example of the previously proposed image recognition method and its problem (II) will be described, and then the present invention for solving the problem will be described. An example (III) of the image recognition method will be described. (I) Example of Image Recognition Method of Previous Proposal In the previous proposal, a model expressing a recognition target is not limited to a closed curve as in the conventional contour extraction method using an active contour model, and a more complex target is obtained. Consider a dynamic structure model that can express the structure of (hereinafter referred to as a dynamic model). The dynamic model expresses the structure such as the outline of the recognition target in the input image by a model of any number of points on the image, and the constraint condition that each point is attracted to the image feature of the input image and the positional relationship of each point. This is an image recognition method that finds the position of a model that satisfies the constraint condition regarding by energy minimization.
In the dynamic model, the structure such as the outline of the recognition target is represented by a model represented by n points on the image (hereinafter referred to as model). These n points are converted into the image coordinate system x, y.
Where vv _i = (v _1i , v _2i ) = (x _i , y _i ), i =
1, ..., n. The structure is extracted by finding the constraint condition that this model is attracted to the image feature and the position of the model that minimizes the energy when the constraint condition concerning the structure of the model is satisfied.

【００１０】図５（ａ）〜（ｃ）は先の提案の動的モデ
ルの例を示す図である。例えば、図５（ｃ）に示すよう
に、任意個の点があり、これらが画像特徴に引き寄られ
る拘束条件と、これらの点の間に図に示すような構造に
関する拘束条件が与えられているモデルを考える。ここ
で、図５（ａ）のような入力画像が与えられると、同図
（ｂ）に示すような入力画像の特徴に一致するような構
造が抽出できる。このmodel に対するエネルギー関数Ｅ
_modelを次式（９）のように定義する。Ｅ_model＝Ｅ_int＋Ｅ_ext ＝Ｅ_int＋Ｅ_image＋Ｅ_con ・・・（９）（９）式のＥ_intは、model の構造に関する拘束条件を
表現する内部エネルギー関数であり、次式（１０）のよ
うに定義する。FIGS. 5A to 5C are diagrams showing examples of the dynamic model proposed above. For example, as shown in FIG. 5C, there are arbitrary points, and a constraint condition that these points are drawn to the image feature and a constraint condition regarding the structure as shown in the figure are given between these points. Consider a model that has Here, when an input image as shown in FIG. 5A is given, a structure that matches the features of the input image as shown in FIG. 5B can be extracted. Energy function E for this model
_{The model} is defined as the following expression (9). E _model = E _int + E _ext = E _int + E _image + E _con (9) E _int in the equation (9) is an internal energy function expressing the constraint condition regarding the structure of the model, and is represented by the following equation (10). To be defined as

【数４】但し、Ａ_{λi μj}，Ｂ_λi，Ｃは係数であり、Ａ
_{λi μj}＝Ａ_{μj λi}とする。また、Σの添字ｉ，ｊは
１，…，ｎを、λ、μは１，２の値をとる。同様に、以
下の説明でもΣの添字の範囲は省略して記述するが、
ｉ，ｊのような英小文字の添字は１，…，ｎ、ギリシャ
小文字は１，２の値をとるものとする。例えば、[Equation 4] However, A _{λi μj} , B _λi , and C are coefficients, and
_{Let λi μj} = A _{μj λi} . The subscripts i and j of Σ are 1, ..., N, and λ and μ are 1 and 2. Similarly, in the following description, the range of the subscript of Σ is omitted,
The subscripts of lowercase letters such as i and j take the values 1, ..., N, and the Greek lowercase letters take the values of 1 and 2. For example,

【数５】のような意味で用いる。(Equation 5) Is used in the sense of.

【００１１】（１０）式のような内部エネルギー関数Ｅ
_intを用いると、以下のようなエネルギー項を表現でき
る。The internal energy function E as shown in equation (10)
_{Using int,} we can express the following energy terms.

【数６】これは、図６に示すように、点ｖｖ_iを固定点ｖｓ_iに
引き寄せる項として働く。(Equation 6) This acts as a term that draws the point vv _i to the fixed point vs _i , as shown in FIG.

【数７】これは、図７に示すように、点ｖｖ_iから点ｖｖ_jに向
かうベクトルｖｖ_j−ｖｖ_iを、定ベクトルｖｔ_ijに近
付ける項として働く。ｖｔ_ij＝０の場合は、従来の
（２）式の右辺第１項α_i・｜ｖｖ_i−ｖｖ_i-1｜²／
２と同じように、ｖｖ_iとｖｖ_jを引き付け合う。但
し、従来の輪郭モデルでは、隣り合う２節点間のみにこ
のようなエネルギー項の設定が可能である。(Equation 7) As shown in FIG. 7, this acts as a term that brings the vector vv _j −vv _i from the point vv _i toward the point vv _j closer to the constant vector vt _ij . When vt _ij = 0, the first term on the right-hand side of the conventional equation (2) α _i · | vv _i −vv _i−1 | ² /
As in 2, attract vv _i and vv _j . However, in the conventional contour model, such an energy term can be set only between two adjacent nodes.

【数８】これは、図８に示すように、点ｖｖ_iから点ｖｖ_jへ向
かうベクトルと、点ｖｖ_jから点ｖｖ_kへ向かうベクト
ルの変化量、即ち折れ線ｖｖ_i−ｖｖ_j−ｖｖ_kの曲り
具合ｖｖ_i−２ｖｖ_j＋ｖｖ_kを定ベクトルｖｕ_ijkに
近付ける項として働く。(Equation 8) This is, as shown in FIG. 8, the amount of change between the vector from the point vv _i to the point vv _j and the vector from the point vv _j to the point vv _k , that is, the bending degree vv of the polygonal line vv _i −vv _j −vv _k. _It works as a term that brings _i −2vv _j + vv _k close to the constant vector vu _ijk .

【００１２】[0012]

【数９】これは、図９に示すように、３点ｖｖ_i，ｖｖ_j，ｖｖ
_kが囲む三角形の面積を大きくする項として働く。この
項を組み合わせることによって、図１０に示すように、
model 上のｎ個の点のうちの任意のｍ個の点よりなるｍ
角形が囲む面積に関する拘束条件を表現できる。以上の
ようなエネルギー項の他にも、２ｎ個の変数Ｖ_λi（ｉ
＝１，…，ｎ、λ＝１，２）による２次以下の多項式で
表されるような任意のエネルギー項を（１０）式で表現
できる。（９）式のＥ_ext＝Ｅ_image＋Ｅ_conは、mode
l に対する外部エネルギー関数である。Ｅ_conは、外部
からの制御のためのエネルギー関数である。Ｅ
_imageは、model が画像特徴に引き寄せられる拘束条件
を表現する関数であり、次式（１１）のように定義す
る。[Equation 9] As shown in FIG. 9, this is the three points vv _i , vv _j , vv.
_It works as a term that increases the area of the triangle surrounded by _k . By combining these terms, as shown in FIG.
m consisting of arbitrary m points out of n points on model
It is possible to express the constraint condition regarding the area enclosed by the polygon. In addition to the above energy terms, 2n variables V _λi (i
= 1, ..., N, λ = 1, 2), an arbitrary energy term represented by a polynomial of second order or less can be expressed by the equation (10). E _ext = E _image + E _con in the equation (9) is mode
is an external energy function for l. E _con is an energy function for external control. E
_image is a function that represents a constraint condition in which model is attracted to the image feature, and is defined by the following equation (11).

【数１０】例えば、Ｅ_i（ｖ_1i，ｖ_2i）を次式（１２）のように定
義すれば、点ｖｖ_iは画像エッジに引き寄せられる。Ｅ_i（ｖ_1i，ｖ_2i）＝ｗ_edge（−｜▽Ｉ（ｖ_1i，ｖ_2i）｜²）・・・（１２）但し、ｗ_edgeは正の係数、Ｉ（ｖ_1i，ｖ_2i）は画像の濃
度である。以上のように定義された（９）式のエネルギ
ー関数Ｅ_modelが最小となるようmodel の位置を求める
ことにより、画像特徴に関する拘束条件とmodel の構造
に関する拘束条件を満たす認識対象の構造を抽出でき
る。[Equation 10] For example, if E _i (v _1i , v _2i ) is defined by the following expression (12), the point vv _i is attracted to the image edge. E _i (v _1i , v _2i ) = w _edge (− | ▽ I (v _1i , v _2i ) | ² ) (12) where w _edge is a positive coefficient, I (v _1i , v _2i ). Is the density of the image. By obtaining the position of the model so that the energy function E _{model of the} equation (9) defined as above is minimized, it is possible to extract the structure of the recognition target that satisfies the constraint condition regarding the image feature and the constraint condition regarding the structure of the model. .

【００１３】図１１は、先の出願のエネルギー関数の最
小化処理のフローチャートである。図１１に示す先の出
願のエネルギー最小化処理では、model 上の各点ｖｖ_i
の近傍での（９）式のＥ_modelの変化を評価し、Ｅ
_modelが減少する近傍点への移動を繰り返すことにより
エネルギー関数の最小化を行う。すなわち、スイップＳ
１４１で与えられた初期位置からはじめて、ステップＳ
１４６でmodel が収束するまで、ステップＳ１４２から
Ｓ１４５までの処理を繰り返す。ステップＳ１４２，Ｓ
１４４，及びＳ１４５で、カウンタｉを１からmodel の
点数ｎまで進め、ステップＳ１４３でそれぞれのｉにつ
いて、以下で述べる点ｖｖ_iの近傍探索処理を行う。す
なわち、model 上のすべての点について近傍探索処理を
行う。ステップＳ１４６の収束判定では、すべての点の
近傍にエネルギー関数を減少させる近傍点が存在しなく
なったところで収束したと判定して、このときのmodel
の位置をステップ１４７でエネルギー最小化位置とす
る。以下、図１１中のステップＳ１４３の近傍探索処理
における、ｖｖ_iの近傍での（９）式のＥ_modelの変化
の評価方法について説明する。FIG. 11 is a flowchart of the energy function minimization process of the previous application. In the energy minimization process of the previous application shown in FIG. 11, each point vv _i on the model
Evaluating the change of E _{model in} the equation (9) in the vicinity of
_The energy function is minimized by repeating the movement to the neighboring points where the _model decreases. That is, the sweep S
Starting from the initial position given in 141, step S
The processing from steps S142 to S145 is repeated until the model converges in 146. Steps S142, S
In 144, and S145, the counter i is incremented from 1 to the number n of the model, and in step S143, the neighborhood search process of the point vv _i described below is performed for each i. That is, the neighborhood search process is performed for all points on the model. In the convergence determination in step S146, it is determined that convergence has occurred when there are no neighboring points that reduce the energy function in the vicinity of all points, and the model at this time is determined.
Is set as the energy minimization position in step 147. Hereinafter, a method of evaluating a change in E _model of the expression (9) in the vicinity of vv _i in the neighborhood search processing of step S143 in FIG. 11 will be described.

【００１４】（９）式において、model 上の点ｖｖ_iを
δｖｖ_i＝（δｖ_1i，δｖ_2i）だけ変化させた、すなわ
ち、ｖｖ_iをｖｖ_i＋δｖｖ_iへ移動した時の（９）式
のＥ_modelの変化量δ_iＥ_modelは、次式（１３）のよ
うになる。In the equation (9), the point vv _i on the model is changed by δvv _i = (δv _1i , δv _2i ), that is, when vv _i is moved to vv _i + δvv _i variation [delta] _i E _model of E _model is expressed by the following equation (13).

【数１１】この変化量δ_iＥ_modelが負となる、すなわち、δ_iＥ
_model＜０となるような移動δｖｖ_iは、（９）式のよ
うなエネルギー評価関数Ｅ_modelを減少させる。このよ
うな点ｖｖ_iの近傍における（９）式のエネルギー関数
Ｅ_modelの変化の様子を評価するために、次式（１４）
のような近傍エネルギー評価関数ｅ_i（δｖｖ_i）を定
義する。[Equation 11] This change amount δ _i E _model becomes negative, that is, δ _i E _model
_The movement δvv _i such that _model <0 reduces the energy evaluation function E _model as expressed by the equation (9). In order to evaluate such a change in the energy function E _model of the equation (9) in the vicinity of the point vv _i , the following equation (14)
The neighborhood energy evaluation function e _i (δvv _i ) such as

【数１２】但し、Ｄ_λiは、次式（１５）であり、これは、δｖｖ
_iに依存しないので、近傍の探索を開始する前にあらか
じめ計算しておくことができる。(Equation 12) However, D _λi is the following equation (15), which is δvv
Since it does not depend on _i , it can be calculated in advance before starting the search for the neighborhood.

【００１５】[0015]

【数１３】また、Ｅ_i（ｖｖ_i）は、入力画像と点ｖｖ_iの座標値
から計算されるが、入力画像が定まれば、これはｖｖ_i
の座標値に関する２変数（ｖ_1i，ｖ_2i）の関数となる。
したがって、入力画像の全画素に対してＥ_i（ｘ，ｙ）
を計算し、これをエネルギー関数画像（画素の値がＥ_i
（ｘ，ｙ）であるような画像）としてあらかじめ得てお
くこともできる。ただし、上記の手法を用いるとＥ
_i（ｖｖ_i）が、各ｖｖ_i（ｉ＝１，…，ｎ）ごとの個
別の関数であるので、ｎ枚ものエネルギー関数画像をあ
らかじめ得ておく必要があり、大量の記憶容量を必要と
する。そこで、以下のようにすることでよってエネルギ
ー関数画像の枚数を少なくすることもできる。すなわ
ち、すべてのＥ_i（ｘ，ｙ），ｉ＝１，…，ｎがｎ_f種
類の関数ｆ_j（ｘ，ｙ），ｊ＝１，…，ｎ_f（但し、ｎ
_f＜ｎ）により、次式（１６）のように表されていれ
ば、ｎ_f枚のエネルギー関数画像をあらかじめ計算して
おくことによって、エネルギー最小化処理時のＥ_i（ｖ
ｖ_i）の計算を単純な積和計算で処理できる。(Equation 13) Further, E _i (vv _i ) is calculated from the input image and the coordinate value of the point vv _i , but if the input image is determined, this is vv _i.
It is a function of two variables (v _1i , v _2i ) related to the coordinate value of.
Therefore, for all pixels of the input image, E _i (x, y)
Of the energy function image (the pixel value is E _i
It can also be obtained in advance as an image such as (x, y). However, using the above method, E
_{Since i} (vv _i ) is an individual function for each vv _i (i = 1, ..., N), it is necessary to obtain n energy function images in advance, which requires a large storage capacity. To do. Therefore, the number of energy function images can be reduced by doing the following. That is, all E _i (x, y), i = 1, ..., N are n _f type functions f _j (x, y), j = 1, ..., n _f (however, n
_{If f} <n) is represented by the following equation (16), by calculating n _f energy function images in advance, E _i (v
The calculation of v _i ) can be processed by a simple product-sum calculation.

【００１６】[0016]

【数１４】（１３）式のδ_iＥ_modelは、式（１４）の近傍エネル
ギー評価関数ｅ_i（δｖｖ_i）を用いて次式（１７）の
ように表される。 δ_iＥ_model＝ｅ_i（δｖｖ_i）−Ｅ_i（ｖｖ_i）−Ｅ_con ＝ｅ_i（δｖｖ_i）−ｅ_i（０）・・・（１７）したがって、ｅ_i（δｖｖ_i）＜ｅ_i（０）となるよう
な近傍点への移動δｖｖ_iが存在すれば、ｖｖ_iをｖｖ
_i＋δｖｖ_iへ移動することによって（９）式のＥ
_modelは減少する。このような移動をmodel 上の全ての
点について繰り返すことによって、（９）式のエネルギ
ー関数Ｅ_modelを最小化することができる。[Equation 14] The δ _i E _model of the equation (13) is represented by the following equation (17) using the neighborhood energy evaluation function e _i (δvv _i ) of the equation (14). δ _i E _model = e _i (δvv _i ) −E _i (vv _i ) −E _con = e _i (δvv _i ) −e _i (0) (17) Therefore, e _i (δvv _i ) <e If there is a movement δvv _i to a neighboring point such that _i (0), then vv _i becomes vv _i
_By moving to _i + δvv _i , E in Eq. (9)
_model decreases. By repeating such movement for all points on the model, the energy function E _{model of the} equation (9) can be minimized.

【００１７】図１２は、図１１中のステップＳ１４３の
近傍探索処理のフローチャートである。図１３は、近傍
探索の様子を示す概念図である。図１２に示すように、
まず、（１４）式の近傍エネルギー評価関数ｅ_i（δｖ
ｖ_i）の計算を効率的に行うために、ステップＳ２０１
で、（１５）式のＤ_λiをあらかじめ計算しておく。ス
テップＳ２０２で、近傍エネルギー評価関数ｅ_i（δｖ
ｖ_i）の最小値Ｅ_mi _nと、最小値Ｅ_minを与える近傍点
への移動δｖｖ_iを現在の位置で初期化する。ステップ
Ｓ２０３、Ｓ２０７，Ｓ２０８でカウンタｊを１から近
傍点数ｍまで進めて、それぞれのｊについて、点ｖｖ_i
のｊ番目の近傍点ｖｖ_i＋δｖｖ_ijに関する処理Ｓ２０
４，Ｓ２０５，Ｓ２０６を行う。例えば、近傍点数がｍ
＝８の場合に、ｊ＝５番目の近傍点を探索している様子
を図１３に示す。ステップＳ２０４で、ｊ番目の近傍点
への移動δｖｖ_ijによる近傍エネルギー評価値を、ステ
ップＳ２０１で求めたＤ_λiを使って、（１４）式の近
傍エネルギー評価関数ｅ_i（δｖｖ_ij）で評価する。ス
テップＳ２０５，Ｓ２０６で、これら評価値の最小値Ｅ
_minと、最小値Ｅ_mi _nを与える近傍点への移動δｖｖ_i
を求める。ステップＳ２０９，Ｓ２１０で、近傍エネル
ギー評価関数の最小値を与える点が現在位置でなけれ
ば、点ｖｖ_iをｖｖ_i＋δｖｖ_iへ移動する。FIG. 12 is a flowchart of the neighborhood search processing of step S143 in FIG. FIG. 13 is a conceptual diagram showing a state of neighborhood search. As shown in FIG.
First, the neighborhood energy evaluation function e _i (δv of Eq. (14)
In order to efficiently calculate v _i ), step S201
Then, D _λi in the equation (15) is calculated in advance. In step S202, the neighborhood energy evaluation function e _i (δv
v _i and the minimum value E _mi _n of) initializes the movement Derutavv _i to neighborhood points at the current position that gives the minimum value E _min. In steps S203, S207, S208, the counter j is incremented from 1 to the neighborhood score m, and the point vv _{i is obtained} for each j.
S20 regarding the j-th neighbor point vv _i + δvv _ij of
4, S205 and S206 are performed. For example, the number of neighbors is m
FIG. 13 shows how the j = 5th neighbor point is searched when = 8. In step S204, the neighborhood energy evaluation value by the movement δvv _ij to the j-th neighbor point is evaluated by the neighborhood energy evaluation function e _i (δvv _ij ) of the equation (14) using D _λi obtained in step S201. . In steps S205 and S206, the minimum value E of these evaluation values
movement Derutavv _i of the _min, the neighboring point which gives the minimum value E _mi _n
Ask for. In steps S209 and S210, if the point giving the minimum value of the neighborhood energy evaluation function is not the current position, the point vv _i is moved to vv _i + δvv _i .

【００１８】以上のように、図１２に示したような近傍
探索処理により、点ｖｖ_iを、その近傍点のうちで
（９）式のエネルギー関数Ｅ_modelを減少させる点へ移
動することができる。この近傍探索処理を、図１１に示
すように、model 上のすべての点について繰り返し、す
べての点が移動しなくなったところで処理を終了する。
この時のmodel は、（９）式のエネルギー関数Ｅ_model
が最小となるため、エネルギー関数として与えられた拘
束条件を満たすような認識対象の構造が抽出できる。図
１４は、以上のような先の提案の動的モデルによる画像
認識方法の構成図である。図１４に示すように、１枚あ
るいは複数枚の入力画像データ１１０より、エネルギー
関数画像計算処理１２０で、モデルの定義１６０をもと
に、エネルギー関数画像１３０を計算する。エネルギー
最小化処理１４０では、入力画像データ１１０より、モ
デルの定義１６０をもとに、初期位置データ１７０か
ら、図１１、図１２の処理によって、構造の抽出結果１
５０を得る。As described above, by the neighborhood search processing as shown in FIG. 12, the point vv _i can be moved to a point among the neighboring points where the energy function E _{model of the} equation (9) is reduced. . As shown in FIG. 11, this neighborhood search process is repeated for all points on the model, and the process ends when all points have stopped moving.
The model at this time is the energy function E _{model of} equation (9).
Since is the minimum, the structure of the recognition target that satisfies the constraint condition given as the energy function can be extracted. FIG. 14 is a block diagram of an image recognition method based on the dynamic model proposed above. As shown in FIG. 14, an energy function image 130 is calculated from one or a plurality of input image data 110 by an energy function image calculation process 120 based on a model definition 160. In the energy minimization process 140, based on the model definition 160 from the input image data 110, the structure extraction result 1 from the initial position data 170 is processed by the processes of FIGS.
Get 50.

【００１９】（II）先の提案の問題点図１５は先の提案の動的モデルにおける問題点を示す図
である。図１５（ａ）の状態のmodel が、節点ｖｖ_iの
近傍探索時に図１５（ｂ）のような移動をしても、節点
ｖｖ_jの近傍探索時に図１５（ｃ）のような移動をして
もエネルギーが増加してしまうような場合でも、図１５
（ｄ）に示すように、節点ｖｖ_iとｖｖ_jを同時に動か
した状態の方が、図１５（ａ）の状態よりもエネルギー
が小さい場合がある。すなわち、先の提案の方法では、
図１５（ａ）の状態から移動できないため、この状態を
エネルギーが最小であるとして、エネルギー最小化処理
を終了してしまうが、実際には、図１５（ｄ）のような
さらにエネルギーの小さい状態が存在する場合があり、
エネルギー最小化処理が正確に行なえていなかった。(II) Problems of Previous Proposal FIG. 15 is a diagram showing problems in the dynamic model of the previous proposal. Even if the model in the state of FIG. 15A moves as shown in FIG. 15B during the neighborhood search of the node vv _i , it moves as shown in FIG. 15C during the neighborhood search of the node vv _j . However, even if the energy increases, FIG.
As shown in (d), the state where the nodes vv _i and vv _j are moved at the same time may have lower energy than the state shown in FIG. That is, in the method proposed above,
Since the state cannot be moved from the state of FIG. 15 (a), the energy minimization process is terminated assuming that this state has the lowest energy, but in reality, the state of smaller energy as shown in FIG. 15 (d). May exist,
The energy minimization process was not performed correctly.

【００２０】(III）本発明の実施例図１は、本発明の実施例を示す画像認識方法のエネルギ
ー最小化処理のフローチャートである。本実施例のエネ
ルギー最小化処理は、先の出願と同様な１点の近傍探索
によるエネルギー最小化処理を行った後、２点のそれぞ
れの近傍点へ同時に移動した場合にエネルギーが最小と
なるような同時近傍探索によるmodel の移動処理を行う
ようにしている。図１６は、図１中の２点の近傍探索に
よるmodel の移動処理ステップＳ３０３を示すフローチ
ャートである。これら図を参照しつつ、本実施例の画像
認識方法のエネルギー最小化処理の方法について説明す
る。図１に示すように、まず、ステップＳ３０１で与え
られた初期位置からはじめて、ステップＳ３０２の一点
の近傍探索によるエネルギー最小化処理で、先の出願と
同様に図１１及び図１２に示すようなフローチャートに
したがって１点の近傍探索によるエネルギー最小化位置
を求める。(III) Embodiment of the Present Invention FIG. 1 is a flow chart of energy minimization processing of an image recognition method showing an embodiment of the present invention. In the energy minimization process of this embodiment, the energy is minimized when the energy minimization process is performed by the one point neighborhood search as in the previous application and then the two points are simultaneously moved. The model is moved by such simultaneous neighborhood search. FIG. 16 is a flowchart showing step S303 of moving a model by a neighborhood search of two points in FIG. The method of energy minimization processing of the image recognition method of the present embodiment will be described with reference to these drawings. As shown in FIG. 1, first, from the initial position given in step S301, in the energy minimization process by the neighborhood search of one point in step S302, the flowcharts as shown in FIGS. Then, the energy minimization position is obtained by the neighborhood search of 1 point.

【００２１】次に、ステップＳ３０３で図１６に示すよ
うな２点の近傍探索によるmodel の移動を行う。これ
は、全ての２点の組み合わせに対して２点を同時に移動
した場合にエネルギーの減少する組み合わせがあれば、
その移動を行う処理である。図１６に示す２点の近傍探
索によるmodel の移動処理は、ステップＳ３２１、Ｓ３
２６、及びＳ３２７で、カウンタｉを１からｎ−１まで
進めながらそれぞれのｉについて、ステップＳ３２２か
らＳ３２５までの処理を行う。ステップＳ３２２、Ｓ３
２４、及びＳ３２５でカウンタｊをｉ＋１からｎまで進
めながらそれぞれのｊについて、ステップＳ３２３の処
理を行う。すなわち、全ての点ｖｖ_i，ｖｖ_jの組み合
わせについて、以下で述べる点ｖｖ_i，ｖｖ_jの近傍探
索処理を行い、２点を同時に移動した場合にエネルギー
の減少する組み合わせの移動を行う。図１中のステップ
Ｓ３０４の収束判定では、ステップＳ３０３でエネルギ
ーの減少する組み合わせが見つかった場合は、ステップ
Ｓ３０２に戻りさらにエネルギー最小化を試みる。ステ
ップＳ３０３でエネルギーの減少する組み合わせが見つ
からなければ収束したと判定して、このときのmodel の
位置をステップＳ３０５でエネルギー最小化位置とす
る。以下、図１６中のステップＳ３２３のｖｖ_i，ｖｖ
_jの近傍探索処理におけるｖｖ_i、ｖｖ_jの近傍での
（９）式のＥ_modelの変化の評価方法について説明す
る。Next, in step S303, the model is moved by a two-point neighborhood search as shown in FIG. This is because if there are combinations that reduce energy when moving two points at the same time for all combinations of two points,
This is the process of moving. The process of moving the model by the two-point neighborhood search shown in FIG. 16 includes steps S321 and S3.
26 and S327, the process from step S322 to S325 is performed for each i while advancing the counter i from 1 to n-1. Steps S322 and S3
24, and the process of step S323 is performed for each j while advancing the counter j from i + 1 to n in S325. That is, all points vv _i, for the combination of vv _j, performs local search processing vv _i, vv _j points described below, to move the combination of decreasing energy when moving two points simultaneously. In the convergence determination of step S304 in FIG. 1, if a combination of decreasing energy is found in step S303, the process returns to step S302 to try to further minimize energy. If no combination of decreasing energy is found in step S303, it is determined that convergence has occurred, and the position of model at this time is set as the energy minimization position in step S305. Hereinafter, vv _i and vv in step S323 in FIG.
_A method of evaluating the change in E _{model in} the expression (9) in the neighborhood of vv _i and vv _j in the neighborhood search processing of _j will be described.

【００２２】（９）式において、model 上の点ｖｖ_i及
びｖｖ_jをδｖｖ_i＝（δｖ_1i，δｖ_2i）及びδｖｖ_j
＝（δｖ_1j，δｖ_2j）だけ変化させた、すなわち、ｖｖ
_i，ｖｖ_jをｖｖ_i＋δｖｖ_i，ｖｖ_j＋δｖｖ_jへ移
動した時の（９）式のＥ_mode _lの変化量δ_ijＥ
_modelは、次式（１８）のようになる。In equation (9), points vv _i and vv _j on model are expressed as δvv _i = (δv _1i , δv _2i ) and δvv _j
= (Δv _1j , δv _2j ), that is, vv
The change amount δ _ij E of E _mode _{l in} the equation (9) when _i , vv _j are moved to vv _i + δvv _i , vv _j + δvv _j
_{The model} is as in the following expression (18).

【数１５】この変化量δ_ijＥ_modelが負となる、すなわち、δ_ijＥ
_model＜０となるような移動δｖｖ_i，δｖｖ_jは、式
（９）のＥ_modelを減少させる。(Equation 15) This change amount δ _ij E _model becomes negative, that is, δ _ij E _model
_The movements δvv _i and δvv _j such that _model <0 reduces E _model in equation (9).

【００２３】このような、点ｖｖ_i，ｖｖ_jにおける
（９）式のエネルギー関数Ｅ_modelの変化の様子を評価
するために、次式（１９）、（２０）のような近傍エネ
ルギー評価関数ｄ_i（δｖｖ_i），ｇ_ij（δｖｖ_i，δ
ｖｖ_j）を定義する。In order to evaluate such changes in the energy function E _model of the equation (9) at the points vv _i and vv _j, the neighborhood energy evaluation function d as in the following equations (19) and (20) _i (δvv _i ), g _ij (δvv _i , δ
vv _j ) is defined.

【数１６】（１８）式の変化量δ_ijＥ_modelは、（１９）、（２
０）式の近傍エネルギー評価関数ｄ_i（δｖｖ_i），ｇ
_ij（δｖ_i，δｖ_j）を用いて次式（２１）のように表
される。 δ_ijＥ_model＝ｄ_i（δｖｖ_i）＋ｄ_j（δｖｖ_j）＋ｇ_ij（δｖｖ_i，δｖｖ_j） −ｄ_i(0) −ｄ_j(0) −ｇ_ij(0,0) ・・・（２１）よって、ｄ_i（δｖｖ_i）＋ｄ_j（δｖｖ_j）＋ｇ
_ij（δｖｖ_i，δｖｖ_j）＜ｄ_i (0)＋ｄ_j(0) ＋ｇ_ij
(0,0) となるような近傍への移動δｖｖ_i，δｖｖ_jが
存在すればｖｖ_i，ｖｖ_jをｖｖ_i＋δｖｖ_i，ｖｖ_j
＋δｖｖ_jへ同時に移動することによって（９）式のＥ
_modelは減少する。すなわち、現在の状態（δｖｖ_i＝
δｖｖ_j＝０）を含めた近傍点の組み合せのうち、近傍
エネルギー評価関数ｄ_i（δｖｖ_i）＋ｄ_j（δｖ
ｖ_j）＋ｇ_ij（δｖｖ_i，δｖｖ_j）が最小となるよう
な移動をすればよい。[Equation 16] The amount of change δ _ij E _{model in the} equation (18) is (19), (2
0) neighborhood energy evaluation function d _i (δvv _i ), g
_It is expressed by the following equation (21) using _ij (δv _i , δv _j ). δ _ij E _model = d _i (δvv _i ) + d _j (δvv _j ) + g _ij (δvv _i , δvv _j ) −d _i (0) −d _j (0) −g _ij (0,0) ・・・ ( 21) Therefore, d _i (δvv _i ) + d _j (δvv _j ) + g
_ij (δvv _i , δvv _j ) <d _i (0) + d _j (0) + g _ij
If there are movements δvv _i and δvv _j to be (0,0), then vv _i and vv _j are vv _i + δvv _i and vv _j
By moving to + δvv _j simultaneously, E in Eq. (9)
_model decreases. That is, the current state (δvv _i =
Among combinations of neighboring points including δvv _j = 0), the neighborhood energy evaluation function d _i (δvv _i ) + d _j (δv
The movement may be such that v _j ) + g _ij (δvv _i , δvv _j ) is minimized.

【００２４】図１７は、図１６中のステップＳ３２３の
ｖｖ_i，ｖｖ_jの近傍探索処理のフローチャートであ
る。図１８は、２点の近傍探索の様子を示す概念図であ
る。以下、図１７を参照しつつ図１６中のステップＳ３
２３の処理について説明する。まず、（２１）式の近傍
エネルギー評価関数ｄ_i（δｖｖ_i）＋ｄ_j（δｖ
ｖ_j）＋ｇ_ij（δｖｖi ，δｖｖ_j）の計算を効率的に
行うために、ｄ_i（δｖｖ_i），ｄ_j（δｖｖ_j）は、
それぞれの点の移動量δｖｖ_i，δｖｖ_jのみに依存し
た項であるので、それぞれのｍ個の近傍点での値をステ
ップＳ３３１で式（１９）を用いてあらかじめ計算して
おく。この際、Ｄ_λi，Ｄ_λjをあらかじめ計算してか
らこの処理を行えばさらに効率が良い。ステップＳ３３
２で、エネルギー評価値の最小値Ｅ_minと最小値Ｅ_min
を与える近傍点への移動δｖｖ_i，δｖｖ_jを現在の位
置で初期化する。ステップ３３３，Ｓ３３７でカウンタ
ｈ及びｋをそれぞれ１から近傍点数ｍまで進めて、＜
ｈ，ｋ＞のｍ×ｍ個の全ての組み合わせについて、点ｖ
ｖ_i及びｖｖ_jをそれぞれｈ番目の近傍点ｖｖ_i＋δｖ
ｖ_h及びｋ番目の近傍点ｖｖ_j＋δｖｖ_kへ同時に移動
したときのエネルギーの評価処理Ｓ３３４，Ｓ３３５，
Ｓ３３６を行う。FIG. 17 is a flowchart of the vv _i , vv _j neighborhood search process of step S323 in FIG. FIG. 18 is a conceptual diagram showing a situation of a two-point neighborhood search. Hereinafter, with reference to FIG. 17, step S3 in FIG.
The processing of 23 will be described. First, the neighborhood energy evaluation function d _i (δvv _i ) + d _j (δv) of the equation (21)
In order to efficiently calculate v _j ) + g _ij (δvvi, δvv _j ), d _i (δvv _i ), d _j (δvv _j ) is
Since the term depends only on the movement amounts δvv _i and δvv _{j of} each point, the values at each of the m neighboring points are calculated in advance in step S331 using the equation (19). At this time, if D _λi and D _λj are calculated in advance and this process is performed, the efficiency is further improved. Step S33
In 2, the minimum value E _min and the minimum value E _min of the energy evaluation value
Initialize the movements δvv _i , δvv _j to the neighboring points that give In steps 333 and S337, the counters h and k are respectively incremented from 1 to the neighborhood number m, and <
For all m × m combinations of h, k>, point v
v _i and vv _j are respectively the h-th neighbor point vv _i + δv
v evaluation process of energy when moving _h and k th neighboring point of vv _j + δvv _k simultaneously to S334, S335,
S336 is performed.

【００２５】たとえば、近傍点数がｍ＝８の場合に、＜
ｈ，ｋ＞＝＜７，５＞番目の近傍点を探索している様子
を図１８に示す。ステップ３３４で、＜ｈ，ｋ＞番目の
近傍点への移動δｖｖ_h，δｖｖ_kによる近傍エネルギ
ーの評価値ｄ_i（δｖｖ_h）＋ｄ_j（δｖｖ_k）＋ｇ_ij
（δｖｖ_h，δｖｖ_k）をステップ３３１で求めておい
たｄ_i（δｖｖ_i），ｄ_j（δｖｖ_j）を用いて計算す
る。ステップ３３４は、ｍ×ｍ回実行されるが、ｄ
_i（δｖｖ_i），ｄ_j（δｖｖ_j）は、それぞれｍ通り
の値しかとらないため、ステップ３３４でｍ×ｍ回も計
算するのは無駄であり、ステップ３３１であらかじめそ
れぞれのｍ通りの値を計算しておいて記憶しておくこと
で、ステップ３３４の計算が効率的に行える。ステップ
Ｓ３３５，Ｓ３３６で、これらの評価値の最小値Ｅ_min
と、最小値Ｅ_minを与える近傍点への移動δｖｖ_i，δ
ｖｖ_jを求める。ステップＳ３３８，Ｓ３３９で近傍エ
ネルギー評価関数の最小値を与える点が現在位置でなけ
れば、点ｖｖ_i及びｖｖ_jをｖｖ_i＋δｖｖ_i及びｖｖ
_j＋δｖｖ_jへ移動する。以上のように、図１７に示し
たような処理で点ｖｖ_iおよびｖｖ_jをそれぞれの近傍
点のうちで（９）式のエネルギー関数Ｅ_modelを減少さ
せる点へ移動することができる。図１に示すように、mo
del 上の全ての２点の組み合わせに対してこの処理を行
うことにより、１点の近傍探索ではエネルギーが減少し
なくなくなったmodel を、２点の近傍探索によってさら
にエネギーが減少する位置へ移動することができる。For example, if the number of neighboring points is m = 8, <
FIG. 18 shows how the h, k> = <7,5> th neighbor point is searched. In step 334, the evaluation value d _i (δvv _h ) + d _j (δvv _k ) + g _ij of the neighborhood energy by the movements δvv _h and δvv _k to the <h, k> -th neighborhood point.
(Δvv _h , δvv _k ) is calculated using d _i (δvv _i ) and d _j (δvv _j ) obtained in step 331. Step 334 is performed m × m times, but d
_{Since i} (δvv _i ) and d _j (δvv _j ) each take only m values, it is useless to calculate m × m times in step 334, and in step 331 each m value is calculated in advance. Is calculated and stored, the calculation in step 334 can be performed efficiently. In steps S335 and S336, the minimum value E _min of these evaluation values
And moving to a neighboring point that gives the minimum value E _min δvv _i , δ
Find vv _j . If the point giving the minimum value of the neighborhood energy evaluation function is not the current position in steps S338 and S339, the points vv _i and vv _j are set to vv _i + δvv _i and vv.
Move to _j + δvv _j . As described above, the points vv _i and vv _j can be moved to the point of decreasing the energy function E _{model of the} equation (9) among the neighboring points by the processing shown in FIG. As shown in Figure 1, mo
By performing this process for all combinations of two points on del, the model whose energy does not decrease in the one-point neighborhood search is moved to the position where the energy decreases further by the two-point neighborhood search. be able to.

【００２６】図１に示すように、２点の近傍探索によっ
てmodel が移動すれば１点の近傍探索へ戻って処理を繰
り返し、２点の近傍探索によっても点が移動しなくなっ
たところで処理を終了する。このときのmodel は、
（９）式のエネルギー関数Ｅ_mode _lが最小となるため、
エネルギー関数として与えられた拘束条件を満たすよう
な認識対象の構造が抽出できる。以上のように、本実施
例では、先の提案の動的モデルにおいて、１点の近傍の
みを探索していたためにエネルギーが最小となっていな
いにもかかわらず処理を終了してしまい、認識対象の輪
郭等の構造の抽出に失敗する場合があるという問題点を
解決し、２点を同時に移動して探索を繰り返すことによ
って、正確にエネルギー関数を最小化する認識対象の構
造を抽出できるという利点がある。なお、本発明は、上
記実施例に限定されず種々の変形が可能である。その変
形例としては、例えば次のようなものがある。（１）本実施例では、２点を同時に移動した場合の探
索を行う方法について説明したが、さらに３点以上の点
を同時に移動した場合の探索も可能である。すなわち、
３点以上の複数の点を同時に移動した時のエネルギー関
数の変化量を計算しエネルギーの減少する移動を繰り返
すことによってエネルギー最小化処理を行う。（２）本実施例では、１点の近傍探索の後に２点の近
傍探索を行う方法について説明したが、１点の近傍探索
処理と２点の近傍探索処理とを本実施例と異なる組み合
わせで処理するなど、種々の変形が可能である。As shown in FIG. 1, if the model moves by the two-point neighborhood search, the process returns to the one-point neighborhood search and the process is repeated, and the process ends when the points do not move even after the two-point neighborhood search. To do. Model at this time is
Since the energy function E _mode _{l of the} equation (9) is the minimum,
The structure of the recognition target that satisfies the constraint condition given as the energy function can be extracted. As described above, in the present embodiment, the processing is ended even though the energy is not minimum because the dynamic model proposed above searches only the vicinity of one point, and the recognition target It is possible to extract the structure of the recognition target that minimizes the energy function accurately by solving the problem that the extraction of the structure such as the contour of the object may fail and moving the two points at the same time and repeating the search. There is. The present invention is not limited to the above embodiment, and various modifications can be made. The following are examples of such modifications. (1) In this embodiment, the method of performing a search when two points are moved at the same time has been described, but a search when three or more points are moved at the same time is also possible. That is,
Energy minimization processing is performed by calculating the amount of change in the energy function when moving a plurality of points of three or more points at the same time and repeating the movement of decreasing the energy. (2) In the present embodiment, the method of performing the two-point neighborhood search after the one-point neighborhood search has been described. However, the one-point neighborhood search process and the two-point neighborhood search process may be combined differently from the present embodiment. Various modifications such as processing are possible.

【００２７】[0027]

【発明の効果】以上詳細に説明したように、第１〜第６
の発明によれば、画像認識方法において、エネルギー関
数を最小化するモデルの位置を、モデル上の１点の近傍
点へ移動した場合にエネルギーが減少するような移動を
する１点の近傍の探索処理及び前記モデル上の複数の点
をそれぞれの近傍点へ同時に移動した場合にエネルギー
が減少するような移動をする複数の点の近傍点への同時
探索処理を行うことにより求めるので、正確にエネルギ
ーを最小化する認識対象の構造を抽出できる。As described in detail above, the first to sixth aspects
According to the invention, in the image recognition method, a search is made for a neighborhood of one point that moves so that energy is reduced when the position of the model that minimizes the energy function is moved to a neighborhood point of the one point on the model. Since it is determined by processing and simultaneous search processing of a plurality of points that move so that energy decreases when a plurality of points on the model are simultaneously moved to their respective neighboring points, the energy is accurately calculated. The structure of the recognition target that minimizes can be extracted.

[Brief description of drawings]

【図１】本発明の実施例を示す画像認識方法のエネルギ
ー最小化処理のフローチャートである。FIG. 1 is a flowchart of energy minimization processing of an image recognition method according to an embodiment of the present invention.

【図２】従来の動的輪郭モデルによる輪郭抽出方法を示
す図である。FIG. 2 is a diagram showing a conventional contour extraction method using an active contour model.

【図３】図２の動的輪郭モデルを示す図である。FIG. 3 is a diagram showing the active contour model of FIG. 2;

【図４】図２中のエネルギー最小化処理を示すフローチ
ャートである。FIG. 4 is a flowchart showing energy minimization processing in FIG.

【図５】先の提案の動的モデルを示す図である。FIG. 5 is a diagram showing the dynamic model proposed above.

【図６】固定点に引き寄せるエネルギー項を示す図であ
る。FIG. 6 is a diagram showing an energy term attracted to a fixed point.

【図７】２点を結ぶベクトルに関する項を示す図であ
る。FIG. 7 is a diagram showing terms related to a vector connecting two points.

【図８】３点を結ぶ折れ線に関する項を示す図である。FIG. 8 is a diagram showing terms related to a polygonal line connecting three points.

【図９】３点を頂点とする三角形の面積に関する項を示
す図である。FIG. 9 is a diagram showing terms relating to the area of a triangle having three points as vertices.

【図１０】ｍ点を頂点とするｍ角形の面積に関する項を
示す図である。FIG. 10 is a diagram showing terms relating to the area of an m-sided polygon having an apex at the point m.

【図１１】先の提案のエネルギー最小化処理を示すフロ
ーチャートである。FIG. 11 is a flowchart showing an energy minimization process of the above proposal.

【図１２】図１１中の近傍探索処理を示すフローチャー
トである。FIG. 12 is a flowchart showing a neighborhood search process in FIG.

【図１３】先の提案の近傍探索を示す図である。FIG. 13 is a diagram showing the neighborhood search proposed above.

【図１４】先の提案の動的モデルの構成を示す図であ
る。FIG. 14 is a diagram showing the structure of the previously proposed dynamic model.

【図１５】先の提案の動的モデルにおける問題点を示す
図である。FIG. 15 is a diagram showing a problem in the previously proposed dynamic model.

【図１６】図１中の近傍探索によるmodel の移動を示す
図である。FIG. 16 is a diagram showing movement of model by neighborhood search in FIG. 1.

【図１７】図１６中の２点の近傍探索によるmodel の移
動を示す図である。FIG. 17 is a diagram showing movement of a model by a neighborhood search of two points in FIG.

【図１８】２点の近傍探索処理を示す図である。FIG. 18 is a diagram showing a two-point neighborhood search process.

[Explanation of symbols]

１１０原画像データ１２０エネルギー画像計算処理１３０エネルギー関数画像１４０エネルギー最小化処理１６０構造モデルの定義 110 Original Image Data 120 Energy Image Calculation Processing 130 Energy Function Image 140 Energy Minimization Processing 160 Definition of Structural Model

───────────────────────────────────────────────────── フロントページの続き (72)発明者下迫田義則東京都港区虎ノ門１丁目７番12号沖電気工業株式会社内 ─────────────────────────────────────────────────── ─── Continuation of the front page (72) Inventor Yoshinori Shimosakoda 1-7-12 Toranomon, Minato-ku, Tokyo Oki Electric Industry Co., Ltd.

Claims

[Claims]

1. A model representing a structure of a recognition target in an input image is defined as a model represented by an arbitrary number of points on the image, and a constraint condition for attracting the model to image features and a structure of the model. In an image recognition method for extracting an arbitrary structure from the input image by defining an energy function that minimizes energy when a constraint condition regarding is satisfied, and determining a position of the model that minimizes the energy function. , The position of the model that minimizes the energy function,
Search processing for the neighborhood of one point that moves so that the energy decreases when the point moves to one of the neighboring points on the model, and energy when moving a plurality of points on the model to the respective neighboring points at the same time. An image recognition method characterized in that it is obtained by performing simultaneous search processing of a plurality of points that move such that the number of points decreases in the vicinity of the points.

2. First, energy minimization processing is performed by a search processing in the vicinity of the one point, and then 2
The energy is minimized by performing a movement such that the energy is reduced when the points simultaneously move to a neighboring point, and if there is a movement, returning to the search processing in the neighborhood of the one point and repeating the processing. The image recognition method according to item 1.

3. In the simultaneous search process for the neighboring points of the plurality of points, the amount of change in energy is expressed as an evaluation function of the amount of movement of the plurality of points to be searched in the vicinity, and the energy is calculated based on the evaluation function. The image recognition method according to claim 1, wherein the movement is performed so as to decrease.

4. In the simultaneous search process for neighboring points of the plurality of points, one point of each point is selected from among the amounts of change in energy expressed as a function of the amount of movement of the plurality of points to be searched to the neighboring points. A term that depends only on the movement amount is calculated in advance before the search is started, and the term that depends only on the calculated movement amount of one point is used in the simultaneous search process for the neighboring points of the plurality of points. The image recognition method according to claim 3, characterized in that