JP2961272B1

JP2961272B1 - Object recognition apparatus and method using feature vector

Info

Publication number: JP2961272B1
Application number: JP25566398A
Authority: JP
Inventors: 茂木村; 勝之中野; 博義山口; 哲也新保; 英二川村; 正人緒方
Original assignee: BOEICHO GIJUTSU KENKYU HONBUCHO; SAIUAASU KK; Komatsu Ltd; Mitsubishi Precision Co Ltd
Current assignee: BOEICHO GIJUTSU KENKYU HONBUCHO; SAIUAASU KK; Komatsu Ltd; Mitsubishi Precision Co Ltd
Priority date: 1998-09-09
Filing date: 1998-09-09
Publication date: 1999-10-12
Anticipated expiration: 2018-09-09
Also published as: JP2000088523A

Abstract

【要約】【課題】オクルージョン（物体の隠蔽）の発生などを考
慮して類似度を補正することにより、精度のよい距離推
定を行うようにする。【解決手段】撮像手段で撮像された撮像画像の各選択画
素に対応する各撮像手段の対応候補点近傍の明度情報等
の局所情報を、仮定した距離毎に抽出する対応候補点情
報抽出手段６０６〜６０８と、ここで抽出した各撮像手
段の対応候補点近傍の局所情報を入力として、局所的な
特徴を的確に特徴付ける特徴ベクトルを生成する特徴ベ
クトルの生成手段６０９と、その生成された特徴ベクト
ルのデータを要素毎に位置的または時間的に集積するこ
とによる情報の安定化の処理を行う特徴ベクトルの安定
化手段６１０と、それらの安定化された特徴ベクトルの
要素間の関係を利用して物体までの距離を推定する距離
推定手段６１１を具えることにより、３次元空間の情報
を推定する。An object of the present invention is to perform distance estimation with high accuracy by correcting similarity in consideration of occurrence of occlusion (concealment of an object) and the like. Corresponding candidate point information extracting means for extracting local information such as brightness information near a corresponding candidate point of each image capturing means corresponding to each selected pixel of an image captured by an image capturing means for each assumed distance. To 608, and local information near the corresponding candidate point of each imaging unit extracted here as input, a feature vector generating unit 609 for generating a feature vector for accurately characterizing a local feature, and a generated feature vector Using a feature vector stabilizing unit 610 that performs information stabilization processing by accumulating the data of each of the elements in a position or time manner, and utilizing the relationship between the elements of the stabilized feature vector. By providing the distance estimating means 611 for estimating the distance to the object, information in a three-dimensional space is estimated.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、物体の認識装置お
よびその方法に関し、異なる位置に配置された複数の撮
像手段による画像情報から三角測量の原理を利用して認
識対象物体までの距離情報など、認識対象物体の３次元
情報を算出して、この３次元情報を用いて物体を認識す
るような場合に適用して好適な装置および方法に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an apparatus and a method for recognizing an object, and more particularly to distance information to an object to be recognized by using triangulation principles from image information obtained by a plurality of image pickup means arranged at different positions. More specifically, the present invention relates to an apparatus and a method which are preferably applied to a case where three-dimensional information of a recognition target object is calculated and an object is recognized using the three-dimensional information.

【０００２】[0002]

【従来の技術および発明が解決しようとする課題】従来
より、撮像手段たる画像センサの撮像結果に基づき認識
対象物体までの距離を計測する方法として、ステレオビ
ジョン（ステレオ視）による計測方法が広く知られてい
る。2. Description of the Related Art Conventionally, as a method of measuring a distance to an object to be recognized based on an image pickup result of an image sensor as an image pickup means, a measurement method using stereo vision (stereo vision) has been widely known. Have been.

【０００３】この計測は、２次元画像から、距離、深
度、奥行きといった３次元情報を得るために有用な方法
である。[0003] This measurement is a useful method for obtaining three-dimensional information such as distance, depth and depth from a two-dimensional image.

【０００４】すなわち、２台の画像センサを例えば左右
に配置し、これら２台の画像センサで同一の認識対象物
を撮像したときに生じる視差から、三角測量の原理で、
対象物までの距離を測定するという方法である。このと
きの左右の画像センサの対は、ステレオ対と呼ばれてお
り、２台で計測を行うことから２眼ステレオ視と呼ばれ
ている。[0004] That is, two image sensors are arranged, for example, on the left and right, and the parallax generated when the same image of the object to be recognized is picked up by these two image sensors is calculated based on the principle of triangulation.
This is a method of measuring the distance to an object. The pair of left and right image sensors at this time is called a stereo pair, and is called a twin-lens stereo view because measurement is performed by two units.

【０００５】図１２は、こうした２眼ステレオ視の原理
を示したものである。同図に示すように、２眼ステレオ
視では、左右の画像センサ１、２の画像＃１（撮像面１
ａ上で得られる）、画像＃２（撮像面２ａ上で得られ
る）中の、対応する点Ｐ１、Ｐ２同士の位置の差である
視差（ディスパリティ）ｄを計測する必要がある。一般
に視差ｄは、３次元空間中の点５０ａ（認識対象物体５
０上の点）までの距離ｚとの間に、次式で示す関係が成
立する。FIG. 12 shows the principle of such two-lens stereo vision. As shown in the figure, in binocular stereo vision, images # 1 of the left and right image sensors 1 and 2 (imaging plane 1
a) and an image # 2 (obtained on the imaging surface 2a), it is necessary to measure a parallax (disparity) d which is a difference between the positions of the corresponding points P1 and P2. Generally, the parallax d is a point 50a (a recognition target object 5) in a three-dimensional space.
The relationship expressed by the following equation is established between the distance z to the point on 0).

【０００６】ｚ＝Ｆ・Ｂ/ ｄ …（１）ここに、Ｂは左右の画像センサ１、２間の距離（基線
長）であり、Ｆは画像センサ１のレンズ３１、画像セン
サ２のレンズ３２の焦点距離である。通常、基線長Ｂと
焦点距離Ｆは既知であるので、視差ｄが分かれば、距離
ｚは一義的に求められることになる。Z = F · B / d (1) where B is the distance (base line length) between the left and right image sensors 1 and 2, and F is the lens 31 of the image sensor 1 and the lens of the image sensor 2. 32 focal length. Usually, since the base line length B and the focal length F are known, if the parallax d is known, the distance z can be uniquely obtained.

【０００７】この視差ｄは、両画像＃１、＃２間で、ど
の点がどの点に対応するかを逐一探索することにより算
出することができる。このときの一方の画像＃１上の点
Ｐ₁に対応する他方の画像＃２上の点Ｐ₂のことを「対応
点」と以下呼ぶこととし、対応点を探索することを、以
下「対応点探索」と呼ぶことにする。物体５０までの距
離を仮定したとき、一方の画像＃１上の点Ｐ₁に対応す
る他方の画像＃２上の点のことを以下「対応候補点」と
呼ぶことにする。The parallax d can be calculated by searching each point between the images # 1 and # 2 to determine which point corresponds to which point. That of the other image # point on 2 P ₂ corresponding to one of the image # point P ₁ on 1 at this time and be referred to hereinafter as "corresponding points", to explore the corresponding point, hereinafter "corresponds Point search ". When assuming a distance to the object 50, it will be referred to as "candidate corresponding points" below that of one of the image # 1 on the point a point on the other image # 2 corresponding to P _1.

【０００８】２眼ステレオ視による計測を行う場合、上
記対応点探索を行った結果、真の距離ｚに対応する真の
対応点Ｐ₂を検出することができれば、真の視差ｄを算
出することができたことになり、このとき対象物５０上
の点５０ａまでの真の距離ｚが計測できたといえる。[0008] In the case of performing measurement by binocular stereo vision, if the true corresponding point P ₂ corresponding to the true distance z can be detected as a result of the corresponding point search, the true parallax d is calculated. Is obtained, and it can be said that the true distance z to the point 50a on the object 50 can be measured at this time.

【０００９】こうした処理を、一方の画像＃１の全画素
について実行することにより、画像＃１の全選択画素に
距離情報を付与した画像（距離画像）が生成されること
になる。By executing such processing for all pixels of one image # 1, an image (distance image) in which distance information is added to all selected pixels of image # 1 is generated.

【００１０】上記対応点を探索して真の距離を求める処
理を、図１３、図１４、図１７を用いて詳述する。The processing for finding the corresponding distance by searching for the corresponding point will be described in detail with reference to FIGS. 13, 14 and 17. FIG.

【００１１】図１７は、従来の２眼ステレオ視による距
離計測装置（物体認識装置）の構成を示すブロック図で
ある。FIG. 17 is a block diagram showing the configuration of a conventional distance measuring device (object recognizing device) based on binocular stereo vision.

【００１２】基準画像入力部１０１には、視差ｄ（距
離ｚ）を算出する際に基準となる画像センサ１で撮像さ
れた基準画像＃１が取り込まれる。一方、画像入力部１
０２には、基準画像＃１上の点に対応する対応点が存在
する画像である画像センサ２の撮像画像＃２が取り込ま
れる。The reference image input unit 101 receives a reference image # 1 taken by the image sensor 1 serving as a reference when calculating the parallax d (distance z). On the other hand, the image input unit 1
In 02, a captured image # 2 of the image sensor 2 which is an image having a corresponding point corresponding to a point on the reference image # 1 is captured.

【００１３】つぎに、対応候補点座標発生部１０３、対
応候補点情報抽出部１０４、類似度算出部１０５、ウイ
ンドウ内加算部１０６、距離推定部１０７における処理
を図１３を用いて説明すると、まず、対応候補点座標発
生部１０３では、基準画像＃１の各画素に対して、仮定
した距離ｚ_n毎に、画像センサ２の画像＃２の対応候補
点の位置座標が記憶、格納されており、これを読み出す
ことにより対応候補点の位置座標を発生する。Next, the processing in the correspondence candidate point coordinate generation unit 103, the correspondence candidate point information extraction unit 104, the similarity calculation unit 105, the in-window addition unit 106, and the distance estimation unit 107 will be described with reference to FIG. The corresponding candidate point coordinate generation unit 103 stores and stores the position coordinates of the corresponding candidate points of the image # 2 of the image sensor 2 for each assumed distance z _n for each pixel of the reference image # 1. By reading this, the position coordinates of the corresponding candidate point are generated.

【００１４】すなわち、基準画像センサ１の基準画像＃
１の中から位置（ｉ、ｊ）で特定される画素Ｐ₁が選択
されるとともに、認識対象物体５０までの距離ｚ_nが仮
定される。そして、この仮定距離ｚ_nに対応する他方の
画像センサ２の画像＃２内の対応候補点Ｐ₂の位置座標
が読み出される。That is, reference image # of reference image sensor 1
1, a pixel P ₁ specified by the position (i, j) is selected, and a distance z _n to the recognition target object 50 is assumed. Then, the position coordinates of the corresponding candidate point P ₂ in the image # 2 of the other image sensor 2 corresponding to this assumption distance z _n is read.

【００１５】つぎに、対応候補点情報抽出部１０４で
は、このようにして対応候補点座標発生部１０３によっ
て発生された画像センサ２の対応候補点Ｐ₂の位置座標
に基づき対応候補点Ｐ₂の局所情報を抽出する処理が実
行される。ここで、局所情報とは、対応候補点の近傍の
画素を考慮して得られる対応候補点の画像情報のことで
ある。画像情報として代表的なものは、画素の明度であ
り、画像に対して前処理を行った後の画像情報であって
もよい。Next, the corresponding candidate point information extracting unit 104 determines the corresponding candidate point P ₂ based on the position coordinates of the corresponding candidate point P ₂ of the image sensor 2 generated by the corresponding candidate point coordinate generating unit 103 in this manner. A process for extracting local information is executed. Here, the local information is image information of a corresponding candidate point obtained in consideration of a pixel near the corresponding candidate point. A representative example of the image information is the brightness of a pixel, and may be image information obtained by performing preprocessing on an image.

【００１６】さらに、類似度算出部１０５では、上記対
応候補点情報抽出部１０４で得られた対応候補点Ｐ₂の
画像情報（局所情報）と基準画像の選択画素Ｐ₁の画像
情報との類似度が算出される。類似度の算出にあたって
は、一般的に位置的に近傍にあるデータを集積すること
により情報の安定化が図られる。例としては、エリアベ
ースのウインドウを用いたマッチングがある。Furthermore, the similarity calculating unit 105, similar to the above image information of the corresponding candidate information extracting section corresponding candidate point P ₂ obtained in 104 (local information) and the reference image selected pixel image information P ₁ The degree is calculated. In calculating the similarity, information is generally stabilized by accumulating data that is close in position. An example is matching using area-based windows.

【００１７】そこで、ウインドウ内加算部１０６では、
基準画像＃１の選択された画素の周囲の領域と、画像セ
ンサ２の画像＃２の対応候補点の周囲の領域とのパター
ンマッチングにより、両画像の領域同士が比較されて、
類似度という情報を安定化させた安定化類似度が算出さ
れる。Therefore, in-window adding section 106
By pattern matching between a region around the selected pixel of the reference image # 1 and a region around the corresponding candidate point of the image # 2 of the image sensor 2, the regions of both images are compared with each other,
A stabilized similarity obtained by stabilizing the information of the similarity is calculated.

【００１８】すなわち、図１３に示すように、基準画像
＃１の選択画素Ｐ₁の位置座標を中心とするウインドウ
ＷＤ₁が切り出されるとともに、画像センサ２の画像＃
２の対応候補点Ｐ₂の位置座標を中心とするウインドウ
ＷＤ₂が切り出され、これらウインドウＷＤ₁、ＷＤ₂同
士についてパターンマッチングを行うことにより、これ
らの安定化類似度が算出される。このパターンマッチン
グは各仮定距離ｚ_n毎に行われる。そして同様のパター
ンマッチングが、基準画像＃１の各選択画素毎に全画素
について行われる。[0018] That is, as shown in FIG. 13, with the window WD ₁ around the position coordinates of the selected pixel P ₁ reference picture # 1 is cut, the image of the image sensor 2 #
Window WD ₂ around the second position coordinates of the corresponding candidate point P ₂ is cut out, by performing pattern matching for these windows WD _1, WD ₂ together, these stabilizing similarity is calculated. This pattern matching is performed for each assumed distance z _n . Then, the same pattern matching is performed for all pixels for each selected pixel of the reference image # 1.

【００１９】なお、安定化には、上述した空間的な領域
（ウインドウ）内での空間的な安定化の他に、時間的に
安定化を図る方法もある。In addition to the stabilization in the spatial area (window) described above, there is a method for stabilization in time as well as the stabilization.

【００２０】図１４は、仮定距離ｚ_nと（安定化）類似
度の逆数Ｑssadとの対応関係を示すグラフである。FIG. 14 is a graph showing the correspondence between the assumed distance z _n and the reciprocal Qssad of the (stabilized) similarity.

【００２１】図１３のウインドウＷＤ₁と、仮定距離が
ｚ’_nのときの対応候補点の位置を中心とするウインド
ウＷＤ’₂とのマッチングを行った結果は、図１４に示
すように類似度の逆数Ｑssadとして大きな値が得られて
いる（類似度は小さくなっている）が、図１３のウイン
ドウＷＤ₁と、仮定距離がｚ_nxのときの対応候補点の位
置を中心とするウインドウＷＤ₂とのマッチングを行っ
た結果は、図１４に示すように類似度の逆数Ｑssadは小
さくなっている（類似度は大きくなっている）のが分か
る。[0021] the window WD ₁ in FIG. 13, as a result of assumptions distance was matching with the ₂ 'window WD and the center position of the corresponding candidate points when the _n' where z is similarity as shown in FIG. 14 and a large value is obtained as the reciprocal Qssad (is smaller similarity) is the window WD ₁ in FIG. 13, the window WD ₂ assumptions distance around the position of the corresponding candidate points when the z _nx It can be seen from the result of the matching that the inverse number Qssad of the similarity is small (the similarity is large) as shown in FIG.

【００２２】このようにして仮定距離ｚ_nと類似度の逆
数Ｑssadとの対応関係から、最も類似度が高くなる点
（類似度の逆数Ｑssadが最小値となる点）を判別し、こ
の最も類似度が高くなっている点に対応する仮定距離ｚ
_nxを最終的に、認識対象物体５０上の点５０ａまでの真
の距離（最も確からしい距離）と推定する。つまり、図
１３における仮定距離ｚ_nxに対応する対応候補点Ｐ₂が
選択画素Ｐ₁に対する対応点であるとされる。このよう
に、距離推定部１０７では、基準画像＃１の選択画素に
ついて仮定距離ｚ_nを順次変化させて得られた各類似度
の中から、最も類似度の高くなるものが判別され、最も
類似度が高くなる仮定距離ｚ_nxが真の距離と推定され出
力される。In this way, from the correspondence between the hypothetical distance z _n and the reciprocal of the similarity Qssad, the point with the highest similarity (the point at which the reciprocal of the similarity Qssad has the minimum value) is determined. Assumed distance z corresponding to the point with higher degree
Finally, _nx is estimated as the true distance (the most probable distance) to the point 50a on the recognition target object 50. That is a candidate corresponding point P ₂ corresponding to the assumed distance z _nx in FIG. 13 is a corresponding point on the selected pixel P _1. As described above, the distance estimating unit 107 determines the one having the highest similarity among the similarities obtained by sequentially changing the assumed distance z _n for the selected pixel of the reference image # 1, and determines the most similar one. The hypothetical distance z _{nx at} which the degree becomes higher is estimated and output as the true distance.

【００２３】以上、２眼ステレオ視による場合を説明し
たが、３台以上の画像センサを用いてもよい。３台以上
の画像センサを用いて距離計測（物体認識）を行うこと
を、多眼ステレオ視による距離計測（物体認識）と称す
ることにする。Although the above description has been made of the case of the binocular stereo vision, three or more image sensors may be used. Performing distance measurement (object recognition) using three or more image sensors is referred to as distance measurement (object recognition) by multi-view stereo vision.

【００２４】この多眼ステレオ視による距離計測装置
（物体認識装置）では、複数の画像センサを、２台の画
像センサからなるステレオ対に分割し、それぞれのステ
レオ対に対し、前述した２眼ステレオ視の原理を繰り返
し適用する方式をとっている。In the distance measuring apparatus (object recognizing apparatus) based on multi-view stereo vision, a plurality of image sensors are divided into a stereo pair consisting of two image sensors, and each of the stereo pairs is subjected to the above-described twin-lens stereo. It employs a system that applies the principle of vision repeatedly.

【００２５】すなわち、複数ある画像センサの中から基
準となる画像センサを選択し、この基準画像センサと他
の画像センサとの間で、ステレオ対を構成する。そし
て、各ステレオ対に対して２眼ステレオ視の場合の処理
を適用していく。この結果、基準画像センサから基準画
像センサの視野内に存在する認識対象物までの距離が計
測されることになる。That is, a reference image sensor is selected from a plurality of image sensors, and a stereo pair is formed between the reference image sensor and another image sensor. Then, the processing in the case of binocular stereo vision is applied to each stereo pair. As a result, the distance from the reference image sensor to the recognition target present in the field of view of the reference image sensor is measured.

【００２６】従来の多眼ステレオにおけるステレオ対の
関係を図１５を参照して説明すると、図１３に示す２眼
ステレオでは、基準画像＃１と対をなす対応画像は＃２
の１つであったが、多眼ステレオでは、基準画像＃１と
画像センサ２の画像＃２の対、基準画像＃１と画像セン
サ３の画像＃３の対、…、基準画像＃１と画像センサＮ
の画像＃Ｎの対という具合に複数のステレオ対が存在す
る。こうした対応画像と基準画像の各ステレオ対に基づ
く処理を行う前には、画像センサたるカメラの取付け歪
みなどを考慮する必要があり、通常はキャリブレーショ
ンによる補正処理を前もって行うようにしている。The relationship between stereo pairs in a conventional multi-view stereo will be described with reference to FIG. 15. In the twin-lens stereo shown in FIG. 13, the corresponding image paired with the reference image # 1 is # 2.
However, in the multi-view stereo, the pair of the reference image # 1 and the image # 2 of the image sensor 2, the pair of the reference image # 1 and the image # 3 of the image sensor 3, ..., the reference image # 1 Image sensor N
There are a plurality of stereo pairs such as the pair of image #N. Before performing the processing based on each stereo pair of the corresponding image and the reference image, it is necessary to consider the mounting distortion of the camera, which is an image sensor, and the correction processing by calibration is usually performed in advance.

【００２７】多眼ステレオによって対応点を探索して真
の距離を求める処理を、前述した２眼ステレオの図１
３、図１７に対応する図１５、図１８を用いて詳述す
る。The process of searching for a corresponding point by a multi-view stereo to obtain a true distance is the same as that of FIG.
This will be described in detail with reference to FIGS. 15 and 18 corresponding to FIGS.

【００２８】図１８は、従来の多眼ステレオ視による距
離計測装置（物体認識装置）の構成を説明する図であ
る。FIG. 18 is a diagram for explaining the configuration of a conventional distance measuring device (object recognition device) based on multi-view stereo vision.

【００２９】なお、各画像センサ１、２、３、…、Ｎ
は、水平、垂直あるいは斜め方向に所定の間隔で配置さ
れているものとする（説明の便宜上、図１５では一定間
隔で左右に配置されている場合を示している）。Each of the image sensors 1, 2, 3,..., N
Are arranged at predetermined intervals in the horizontal, vertical or diagonal direction (for convenience of explanation, FIG. 15 shows a case where they are arranged on the left and right at regular intervals).

【００３０】基準画像入力部２０１には、視差ｄ（距離
ｚ）を算出する際に基準となる画像センサ１で撮像され
た基準画像＃１が取り込まれる。一方、画像入力部２０
２には、基準画像＃１上の点に対応する対応点が存在す
る画像である画像センサ２の撮像画像＃２が取り込まれ
る。他の画像入力部２０３、２０４においても、基準画
像＃１に対応する画像センサ３の画像＃３が、基準画像
＃１に対応する画像センサＮの画像＃Ｎがそれぞれ取り
込まれる。The reference image input unit 201 receives a reference image # 1 captured by the image sensor 1 serving as a reference when calculating the parallax d (distance z). On the other hand, the image input unit 20
2, a captured image # 2 of the image sensor 2 which is an image having a corresponding point corresponding to a point on the reference image # 1 is captured. Also in the other image input units 203 and 204, the image # 3 of the image sensor 3 corresponding to the reference image # 1 and the image #N of the image sensor N corresponding to the reference image # 1 are captured.

【００３１】対応候補点座標発生部２０５では、基準画
像＃１の各画素に対して、仮定した距離ｚ_n毎に、画像
センサ２の画像＃２の対応候補点の位置座標、画像セン
サ３の画像＃３の対応候補点の位置座標、画像センサＮ
の画像＃Ｎの対応候補点の位置座標がそれぞれ記憶、格
納されており、これらを読み出すことにより各対応候補
点の位置座標を発生する。[0031] In the corresponding candidate point coordinate generating unit 205, for each pixel of the reference image # 1, for each assumed distance z _n, of the image # 2 image sensor 2 position coordinates of the corresponding candidate points of the image sensor 3 Position coordinates of the corresponding candidate point of image # 3, image sensor N
The position coordinates of the corresponding candidate points of the image #N are stored and stored, and by reading these, the position coordinates of each corresponding candidate point are generated.

【００３２】すなわち、基準画像センサ１の基準画像＃
１の中から所定位置Ｐ₁に存在する（ｉ、ｊ）で特定さ
れる画素が選択されるとともに、認識対象物体５０まで
の距離ｚ_nが仮定される。そして、この仮定距離ｚ_nに対
応する画像センサ２の画像＃２内の対応候補点Ｐ₂の位
置座標が読み出される。同様にして、基準画像＃１の選
択画素Ｐ₁（ｉ、ｊ）、仮定距離ｚ_nに対応する画像セン
サ３の画像＃３の対応候補点Ｐ₃の位置座標が読み出さ
れ、基準画像＃１の選択画素Ｐ₁（ｉ、ｊ）、仮定距離
ｚ_nに対応する画像センサＮの画像＃Ｎの対応候補点Ｐ_N
の位置座標が読み出される。そして、仮定距離ｚ_nを順
次変化させて同様の読み出しが行われる。また、選択画
素を順次変化させることによって同様の読み出しが行わ
れる。こうして対応候補点Ｐ₂の位置座標（Ｘ₂、
Ｙ₂）、対応候補点Ｐ₃の位置座標（Ｘ₃、Ｙ₃）、・・
・、対応候補点Ｐ_Nの位置座標（Ｘ_N、Ｙ_N）が対応候補
点座標発生部２０５から出力される。That is, the reference image # of the reference image sensor 1
1, a pixel specified by (i, j) existing at the predetermined position P ₁ is selected, and a distance z _n to the recognition target object 50 is assumed. Then, the position coordinates of the corresponding candidate point P ₂ in the image # 2 image sensor 2 corresponding to this assumption distance z _n is read. Similarly, the position coordinates of the corresponding candidate point P ₃ of the image # 3 of the image sensor 3 corresponding to the selected pixel P ₁ (i, j) and the assumed distance z _n of the reference image # 1 are read, and 1 selected pixel P ₁ (i, j), corresponding candidate point P _N of image #N of image sensor N corresponding to assumed distance z _n
Is read out. Then, similar reading is performed by sequentially changing the assumed distance z _n . Similar reading is performed by sequentially changing the selected pixels. Position coordinates (X ₂ corresponding candidate point P ₂ Thus,
Y ₂ ), position coordinates (X ₃ , Y ₃ ) of corresponding candidate point P ₃ ,...
The position coordinates (X _N , Y _N ) of the corresponding candidate point P _N are output from the corresponding candidate point coordinate generator 205.

【００３３】つぎに、対応候補点情報抽出部２０６で
は、このようにして対応候補点座標発生部２０５によっ
て発生された画像センサ２の対応候補点Ｐ₂の位置座標
に基づき対応候補点Ｐ₂の局所情報を抽出する処理が実
行される。同様にして、対応候補点情報抽出部２０７で
は、対応候補点座標発生部２０５で発生された画像セン
サ３の画像＃３の対応候補点Ｐ₃の位置座標に基づい
て、対応候補点Ｐ₃の局所情報が、対応候補点情報抽出
部２０８では、対応候補点座標発生部２０５で発生され
た画像センサＮの画像＃Ｎの対応候補点Ｐ_Nの位置座標
に基づいて、対応候補点Ｐ_Nの局所情報がそれぞれ求め
られる。Next, the corresponding candidate point information extracting unit 206 determines the corresponding candidate point P ₂ based on the position coordinates of the corresponding candidate point P ₂ of the image sensor 2 generated by the corresponding candidate point coordinate generating unit 205 in this manner. A process for extracting local information is executed. Similarly, in correspondence candidate information extracting unit 207, based on the position coordinates of the corresponding candidate point P ₃ of the image # 3 of the image sensor 3 generated by the corresponding candidate point coordinate generating unit 205, the candidate corresponding point P ₃ local information, the candidate corresponding point information extraction unit 208, based on the position coordinates of the corresponding candidate point P _N of the image #N image sensors N which is generated in the corresponding candidate point coordinate generating unit 205, the candidate corresponding points P _N Local information is obtained respectively.

【００３４】さらに、類似度算出部２０９では、上記対
応候補点情報抽出部２０６で得られた対応候補点Ｐ₂の
局所情報と基準画像＃１の選択画素Ｐ₁の画像情報との
類似度が算出される。そして、ウインドウ内加算部２１
２では、基準画像＃１の選択された画素の周囲の領域
と、画像センサ２の画像＃２の対応候補点の周囲の領域
とのパターンマッチングにより、両画像の領域同士が比
較されて、安定化類似度が算出される。Further, the similarity calculating section 209 calculates the similarity between the local information of the corresponding candidate point P ₂ obtained by the corresponding candidate point information extracting section 206 and the image information of the selected pixel P ₁ of the reference image # 1. Is calculated. Then, the in-window addition unit 21
In No. 2, the regions of the two images are compared with each other by pattern matching between the region around the selected pixel of the reference image # 1 and the region around the corresponding candidate point of the image # 2 of the image sensor 2, and stable. The similarity is calculated.

【００３５】すなわち、図１５に示すように、基準画像
＃１の選択画素Ｐ₁の位置座標を中心とするウインドウ
ＷＤ₁が切り出されるとともに、画像センサ２の画像＃
２の対応候補点Ｐ₂の位置座標を中心とするウインドウ
ＷＤ₂が切り出され、これらウインドウＷＤ₁、ＷＤ₂同
士についてパターンマッチングを行うことにより、これ
らの類似度が算出される。このパターンマッチングは各
仮定距離ｚ_n毎に行われる。図１６（１）は、仮定距離
ｚ_nとステレオ対（基準画像センサ１と画像センサ２）
の安定化類似度の逆数Ｑs₁との対応関係を示すグラフで
ある。[0035] That is, as shown in FIG. 15, with the window WD ₁ around the position coordinates of the selected pixel P ₁ reference picture # 1 is cut, the image of the image sensor 2 #
Window WD ₂ around the second position coordinates of the corresponding candidate point P ₂ is cut out, by performing pattern matching for these windows WD _1, WD ₂ together, these similarities are calculated. This pattern matching is performed for each assumed distance z _n . FIG. 16A shows a hypothetical distance z _n and a stereo pair (reference image sensor 1 and image sensor 2).
It is a graph showing a relationship between a reciprocal Qs ₁ stabilizing similarity.

【００３６】図１５のウインドウＷＤ₁と、仮定距離が
ｚ’_nのときの対応候補点の位置座標を中心とするウイ
ンドウＷＤ’₂とのマッチングを行った結果は、図１６
（１）に示すように類似度の逆数Ｑs₁として大きな値が
得られている（類似度は小さくなっている）が、図１５
のウインドウＷＤ₁と、仮定距離がｚ_nxのときの対応候
補点の位置座標を中心とするウインドウＷＤ₂とのマッ
チングを行った結果は、図１６（１）に示すように類似
度の逆数Ｑs₁は小さくなっている（類似度は大きくなっ
ている）のが分かる。[0036] the window WD ₁ in FIG. 15, as a result of assumptions distance was matching with the ₂ 'window WD around the position coordinates of the corresponding candidate points when the _n' where z is 16
As shown in (1), a large value is obtained as the reciprocal Qs ₁ of the similarity (similarity is small), but FIG.
The window WD _1, matching the results of the window WD ₂ around the position coordinates of the corresponding candidate points when the assumptions distance z _nx is the reciprocal of the similarity, as shown in FIG. 16 (1) Qs _It can be seen that ₁ is smaller (similarity is larger).

【００３７】同様にして類似度算出部２１０では、上記
対応候補点情報抽出部２０７で得られた対応候補点Ｐ₃
の局所情報と基準画像＃１の選択画素Ｐ₁の画像情報と
の類似度が算出される。そして、ウインドウ内加算部２
１３では、基準画像＃１の選択画素Ｐ₁の位置座標を中
心とするウインドウＷＤ₁と、画像センサ３の画像＃３
の対応候補点Ｐ₃の位置座標を中心とするウインドウＷ
Ｄ₃とのパターンマッチングが実行され、これらの安定
化類似度が算出される。そして、パターンマッチングが
各仮定距離ｚ_n毎に行われることによって、このステレ
オ対（基準画像センサ１と画像センサ３）についても図
１６（２）に示すような仮定距離ｚ_nと安定化類似度の
逆数Ｑs₂との対応関係を示すグラフが求められる。同様
にして類似度算出部２１１では、上記対応候補点情報抽
出部２０８で得られた対応候補点Ｐ_Nの局所情報と基準
画像＃１の選択画素Ｐ₁の画像情報との類似度が算出さ
れる。そして、ウインドウ内加算部２１４では、基準画
像＃１の選択画素Ｐ₁の位置座標を中心とするウインド
ウＷＤ₁と、画像センサＮの画像＃Ｎの対応候補点Ｐ_N
の位置座標を中心とするウインドウＷＤ_Nとのパターン
マッチングが実行され、これらの安定化類似度が算出さ
れる。そして、パターンマッチングが各仮定距離ｚ_n毎
に行われることによって、このステレオ対（基準画像セ
ンサ１と画像センサＮ）についても図１６（Ｎ）に示す
仮定距離ｚ_nと安定化類似度の逆数Ｑs_Nとの対応関係が
求められる。Similarly, in the similarity calculating section 210, the corresponding candidate point P ₃ obtained by the corresponding candidate point information extracting section 207 is obtained.
Similarity between the local information and the reference image # 1 of the image information of the selected pixel P ₁ of is calculated. Then, the in-window addition unit 2
In 13, the window WD ₁ around the position coordinates of the selected pixel P ₁ reference picture # 1, the image # 3 image sensors 3
Window W centered on the position coordinates of the corresponding candidate point P ₃ of
Pattern matching between D ₃ is executed, these stabilizing similarity is calculated. Then, by performing pattern matching for each assumed distance z _n , the stereo pair (reference image sensor 1 and image sensor 3) is also assumed to have the assumed distance z _n and the stabilized similarity as shown in FIG. graph showing a relationship between a reciprocal Qs ₂ of sought. In the similarity calculation unit 211 in the same manner, the similarity between the local information and the reference image # 1 of the image information of the selected pixel P ₁ of the corresponding candidate points P _N obtained above corresponding candidate point information extraction unit 208 is calculated You. Then, the intra-window addition unit 214 calculates a corresponding candidate point P _N between the window WD ₁ centered on the position coordinates of the selected pixel P ₁ of the reference image # ₁ and the image #N of the image sensor _N.
Pattern matching between the window WD _N around the position coordinates of the runs, these stabilizing similarity is calculated. Then, since pattern matching is performed for each assumed distance z _n , the stereo pair (reference image sensor 1 and image sensor N) also has the assumed distance z _n and the reciprocal of the stabilized similarity shown in FIG. correspondence between the Qs _N is required.

【００３８】最終的な類似度算出部２１５では、各ステ
レオ対毎に得られた仮定距離ｚ_nと安定化類似度の逆数
との対応関係が仮定距離毎に加算され、仮定距離と安定
化類似度の逆数の加算値（全体的な安定化類似度の逆
数）との対応関係が求められる（図１６参照）。In the final similarity calculating section 215, the correspondence between the assumed distance z _n obtained for each stereo pair and the reciprocal of the stabilized similarity is added for each assumed distance, and the assumed distance and the stabilized similarity are added. The correspondence relationship with the reciprocal of the degree (the reciprocal of the overall stabilized similarity) is obtained (see FIG. 16).

【００３９】さらに図１６（融合結果）に示すように、
仮定距離ｚ_nと安定化類似度の逆数の加算値との対応関
係から、最も類似度が高くなる点（類似度の逆数の加算
値が最小値となる点）を判別し、この最も類似度が高く
なっている点に対応する仮定距離ｚ_nxを最終的に、認識
対象物体５０上の点５０ａまでの真の距離（最も確から
しい距離）と推定する。これは距離推定部２１６で実行
される。かかる処理は、基準画像＃１の各選択画素毎に
全画素について行われる。Further, as shown in FIG.
From the correspondence between the hypothetical distance z _n and the added value of the reciprocal of the stabilized similarity, the point having the highest similarity (the point at which the added value of the reciprocal of the similarity becomes the minimum value) is determined. The hypothetical distance z _nx corresponding to the point where is higher is finally estimated as the true distance (the most likely distance) to the point 50 a on the recognition target object 50. This is executed by the distance estimation unit 216. This process is performed for all pixels for each selected pixel of the reference image # 1.

【００４０】以上のようにして、距離推定部２１６で
は、仮定距離ｚ_nを順次変化させて得られた安定化類似
度の加算値の中から、最も類似度の加算値が高くなるも
のが判別され、最も類似度の加算値が高くなる仮定距離
ｚ_nxが真の距離と推定され、出力される。そして、かか
る距離推定が基準画像＃１の全画素について行われるこ
とから、基準画像＃１の全選択画素に距離情報を付与し
た画像（距離画像）が生成されることになる。As described above, the distance estimating unit 216 determines the one having the highest similarity addition value from among the addition values of the stabilized similarity obtained by sequentially changing the assumed distance z _n. Then, the assumed distance z _{nx at} which the added value of the similarity becomes the highest is estimated as the true distance and is output. Then, since such distance estimation is performed for all pixels of the reference image # 1, an image (distance image) in which distance information is added to all selected pixels of the reference image # 1 is generated.

【００４１】ここに、上述した図１８の各ウインドウ内
加算部２１２、２１３、…２１４および最終的な類似度
算出部２１５で行われる処理に関する論文として、論文
１「複数の基線長を利用したステレオマッチング」（電
子情報通信学会論文誌、D-2VO．J75-D-2 No．８１９
９２−８、奥富正敏、金出武雄）が知られている。この
論文では、各ステレオ画像対に対して、まずウインドウ
内の明度値の差の２乗和（以下、SSD:Sum of Suqared D
ifferences）を計算し、次いで、全てのステレオ画像対
からのＳＳＤを単純に加算して、全体の類似度の逆数を
算出している。この方法を用いると、計測対象の物体上
に似たような模様が繰り返し存在するような場合でも、
２眼ステレオと比較して誤対応が少ない良好な計測結果
が得られる。As a paper on the processing performed by the in-window addition units 212, 213,... 214 and the final similarity calculation unit 215 in FIG. Matching ”(Transactions of the Institute of Electronics, Information and Communication Engineers, D-2VO. J75-D-2 No. 819)
92-8, Masatoshi Okutomi, Takeo Kanade) are known. In this paper, for each stereo image pair, first, the sum of squares of the differences in the brightness values within the window (hereinafter, SSD: Sum of Suqared D
ifferences), and then simply add the SSDs from all stereo image pairs to calculate the reciprocal of the overall similarity. By using this method, even if a similar pattern repeatedly exists on the object to be measured,
Good measurement results with less erroneous correspondence are obtained as compared with the twin-lens stereo.

【００４２】すなわち、上記論文１には、基準画像の選
択画素の位置を中心とするウインドウを切り出すととも
に、対応画像の対応候補点の位置を中心とするウインド
ウを切り出し、これらウインドウ同士についてパターン
マッチングを行うことにより、基準画像の選択画素につ
いてウインドウ内加算された類似度を演算する技術が記
載されている。具体的には、一方のウインドウ内の画素
の画像情報と、この画素に対応する他方のウインドウ内
の画素の画像情報との差の２乗を、ウインドウ内の各画
素毎に求め、この画像情報の差の２乗値を、ウインドウ
内の全画素について加算したものを、安定化類似度の逆
数としている。そして、各ステレオ対毎に求められた安
定化類似度の逆数が加算されることによって、全体的な
安定化類似度の逆数が算出される（図１６参照）。That is, in the above article 1, a window centered on the position of the selected pixel of the reference image is cut out, a window centered on the position of the corresponding candidate point of the corresponding image is cut out, and pattern matching is performed for these windows. A technique is described in which the similarity calculated in a window for a selected pixel of a reference image is calculated. Specifically, the square of the difference between the image information of the pixel in one window and the image information of the pixel in the other window corresponding to this pixel is obtained for each pixel in the window. The sum of the squared values of the differences for all the pixels in the window is defined as the reciprocal of the stabilized similarity. Then, the reciprocal of the stabilized similarity obtained for each stereo pair is added to calculate the reciprocal of the overall stabilized similarity (see FIG. 16).

【００４３】以上のように、多眼ステレオでは、計測対
象の物体上のある点が全ての画像センサで共通のものと
して観測される場合であれば、対応点の検出精度が損な
われることはない。As described above, in the multi-view stereo, if a certain point on the object to be measured is observed as a common point by all image sensors, the detection accuracy of the corresponding point is not impaired. .

【００４４】しかし、実際には、図１９に示すように、
独立した２個の物体５０Ｆ、５０Ｒが前後に並んで配置
されたもの（５０Ｆを前側の物体、５０Ｒを後側の物体
とする）を左右の画像センサ１、２で観測するような場
合には、図２０（ａ）、（ｂ）に示すように、一方の基
準画像センサ１で、これら２個の物体５０Ｆ、５０Ｒの
全体の外観を観測することができたとしても、別の画像
センサ２では、後側の物体５０Ｒを一部しか観測するこ
とができない場合がある。このように、物体の一部が、
手前の物体の陰に隠れて見えなくなるような状態（これ
を、オクルージョン(隠蔽)という）が発生する場合に
は、対応点の検出精度は著しく損なわれることになる。However, actually, as shown in FIG.
In a case where two independent objects 50F and 50R are arranged side by side in front and rear (50F is a front object and 50R is a rear object), the left and right image sensors 1 and 2 observe the object. As shown in FIGS. 20A and 20B, even if one reference image sensor 1 can observe the entire appearance of these two objects 50F and 50R, another image sensor 2 In some cases, only a part of the rear object 50R can be observed. Thus, part of the object
If a state occurs in which the object is hidden behind a foreground object and is invisible (this is called occlusion (concealment)), the detection accuracy of the corresponding point is significantly impaired.

【００４５】すなわち、オクルージョンが発生した画像
センサ２の撮像画像＃２には、正しい対応位置の情報で
はない間違った情報が入り込んでくるため、真の距離を
正確に求めることができなくなる。That is, since the erroneous information which is not the information of the correct corresponding position enters the captured image # 2 of the image sensor 2 in which the occlusion has occurred, the true distance cannot be obtained accurately.

【００４６】もちろん、２眼ステレオに比べれば多眼ス
テレオの方が画像センサが多いため、あるステレオ対で
オクルージョンが発生したとしても、別のステレオ対で
は正常に観測される確率が高くなるために、全体として
オクルージョンの影響は多少緩和されることになる。Of course, multi-view stereo has more image sensors than binocular stereo, so even if occlusion occurs in one stereo pair, the probability of normal observation in another stereo pair increases. However, the effect of occlusion is somewhat alleviated as a whole.

【００４７】とはいえ、論文１に示されるように、各ス
テレオ対毎に、安定化類似度の逆数を演算し、これら各
ステレオ対毎に求められた安定化類似度の逆数を加算す
ることによって、全体的な安定化類似度の逆数を演算す
るという単純な多眼ステレオの方法では、オクルージョ
ンの影響を排除することはできない。However, as shown in the article 1, the reciprocal of the stabilized similarity is calculated for each stereo pair, and the reciprocal of the stabilized similarity obtained for each stereo pair is added. Therefore, the effect of occlusion cannot be eliminated by a simple multi-view stereo method of calculating the reciprocal of the overall stabilized similarity.

【００４８】このオクルージョンの問題に対処すること
ができる代表的な方法が記載された論文として、論文２
「カメラマトリックスを用いた高精細ステレオＳＥＡに
おける隠れ検出法の検討」（電子情報通信学会信学技
報、1996-03、IE95-158，PRU95-245、筑波大：佐藤清
秀、大田友一）がある。As a paper describing a typical method that can deal with the problem of occlusion, see Paper 2
"A Study on Hidden Detection Method in High Definition Stereo SEA Using Camera Matrix" (IEICE Technical Report, 1996-03, IE95-158, PRU95-245, University of Tsukuba: Kiyohide Sato, Yuichi Ohta) is there.

【００４９】この論文には、マトリックス状に配置した
画像センサに対して、オクルージョンが発生するパター
ンを予め複数用意しておき、オクルージョンが発生しな
いと仮定した場合の類似度とオクルージョンが発生した
場合の各パターン毎の類似度を求め、これらに基づき最
終的にオクルージョンの発生の有無とオクルージョンが
有った場合にはオクルージョンのパターンを決定し、既
に演算した各類似度の中から、この決定したパターンに
対応する類似度を選択する技術が記載されている。ここ
で、オクルージョンの発生パターンは、オクルージョン
が発生していない有効な画像センサの台数および種類を
考慮して生成される。In this paper, a plurality of patterns in which occlusion occurs are prepared in advance for image sensors arranged in a matrix, and the similarity when occlusion does not occur and the similarity when occlusion occurs are described. The similarity of each pattern is obtained, and based on these, the presence / absence of occlusion and the presence of occlusion are determined, and the occlusion pattern is determined. A technique for selecting a degree of similarity corresponding to. Here, the occlusion occurrence pattern is generated in consideration of the number and types of effective image sensors in which occlusion has not occurred.

【００５０】上記論文２に対応する多眼ステレオ装置の
構成を図２１に示す。この装置では、オクルージョンの
発生パターン（オクルージョンが発生しないパターンも
含む）が、ｍ個ある場合を想定している。FIG. 21 shows the configuration of a multi-view stereo apparatus corresponding to the above article 2. In this apparatus, it is assumed that there are m occurrence patterns (including patterns in which occlusion does not occur).

【００５１】基準画像入力部３０１および画像入力部３
０２、３０３、３０４は、図１８に示す基準画像入力部
２０１および画像入力部２０２、２０３、２０４と同一
であり、対応候補点座標発生部３０５は、図１８に示す
対応候補点座標発生部２０５と同一であり、対応候補点
情報抽出部３０６、３０７、３０８は、図１８に示す対
応候補点情報抽出部２０６、２０７、２０８と同一であ
り、類似度算出部３０９、３１０、３１１は、図１８に
示す類似度算出部２０９、２１０、２１１と同一であ
り、ウインドウ内加算部３１２、３１３、３１４は、図
１８に示すウインドウ内加算部２１２、２１３、２１４
と同一であり、既にした説明と重複するので、これらの
説明は省略する。Reference image input unit 301 and image input unit 3
02, 303, and 304 are the same as the reference image input unit 201 and the image input units 202, 203, and 204 shown in FIG. 18, and the corresponding candidate point coordinate generating unit 305 includes the corresponding candidate point coordinate generating unit 205 shown in FIG. The corresponding candidate point information extracting units 306, 307, and 308 are the same as the corresponding candidate point information extracting units 206, 207, and 208 shown in FIG. The intra-window addition units 312, 313, and 314 are the same as the similarity calculation units 209, 210, and 211 shown in FIG. 18, and the intra-window addition units 212, 213, and 214 shown in FIG.
Since these are the same as those described above and are the same as those already described, their description will be omitted.

【００５２】各オクルージョンパターン１、２、３…ｍ
毎に類似度算出部３１５、３１６、３１７…３１８が設
けられており、各類似度算出部では、ウインドウ内加算
部３１２〜３１４で演算された安定化類似度に基づき、
対応するパターンのオクルージョンが発生していると仮
定したときの全体的な安定化類似度が算出される。Each occlusion pattern 1, 2, 3... M
.. 318 are provided for each of them. In each similarity calculation section, based on the stabilized similarity calculated by the in-window addition sections 312 to 314, the similarity calculation sections 315, 316, 317.
The overall stabilization similarity is calculated assuming that occlusion of the corresponding pattern has occurred.

【００５３】類似度選択部３１９では、各オクルージョ
ンパターン毎に算出された全体的な安定化類似度を比較
して、これらの中から、最も確からしいオクルージョン
パターンが決定されるとともに、この決定したオクルー
ジョンパターンに対応する全体的な安定化類似度が、各
類似度算出部３１５、３１６、３１７…３１８で算出さ
れた各全体的な安定化類似度の中から選択される。The similarity selection section 319 compares the overall stabilization similarity calculated for each occlusion pattern, determines the most probable occlusion pattern from these, and determines the determined occlusion pattern. The overall stabilized similarity corresponding to the pattern is selected from the overall stabilized similarities calculated by the similarity calculating units 315, 316, 317,.

【００５４】距離推定部３２０では、仮定距離毎に、選
択された全体的な安定化類似度の中から最も類似度の高
いものが判別され、真の距離が推定される。The distance estimating unit 320 determines the highest similarity among the selected overall stabilization similarities for each assumed distance, and estimates the true distance.

【００５５】このような処理を行うことで、確かに、オ
クルージョンが発生したとしても、精度よく距離を推定
することができる。By performing such processing, it is possible to accurately estimate the distance even if occlusion occurs.

【００５６】ただし、想定されるオクルージョンの全て
のパターン１〜ｍ毎に、全体的な安定化類似度の計算を
実行する必要があり、演算処理量が大きくなるととも
に、演算に要する時間が大きくなる。オクルージョンの
パターン数が多ければ、多いほど、これら演算処理の負
担が増大する。さらには、ハードウェアの構成が複雑な
なものとなり、ハードウェアの実現が困難になる。However, it is necessary to calculate the overall stabilization similarity for each of the assumed patterns 1 to m of the occlusion, which increases the amount of calculation processing and the time required for calculation. . The greater the number of occlusion patterns, the greater the load of these arithmetic processes. Further, the hardware configuration becomes complicated, and it is difficult to realize the hardware.

【００５７】以上のように、従来技術にあっては、つぎ
のような問題点があった。As described above, the prior art has the following problems.

【００５８】（１）論文１に示されるような構成の多眼
ステレオ装置（図１８）では、オクルージョンが発生し
た場合を考慮して精度よく距離推定を行うことができな
い。(1) In the multi-view stereo apparatus (FIG. 18) configured as shown in the article 1, distance estimation cannot be performed accurately in consideration of the case where occlusion has occurred.

【００５９】（２）論文２に示されるような構成の多眼
ステレオ装置（図２１）では、オクルージョンが発生し
た場合を考慮して精度よく距離推定を行うことができる
ものの、扱うデータが多くなるとともに、ハードウェア
の実現が困難となる。(2) In the multi-view stereo apparatus (FIG. 21) configured as shown in the article 2, although the distance can be accurately estimated in consideration of the case where the occlusion occurs, the data to be handled is increased. At the same time, it is difficult to realize hardware.

【００６０】本発明の第１発明は、従来の多眼ステレオ
装置の処理の手順を見直し最適化することで、扱うデー
タを少なくでき、オクルージョンの発生を考慮するよう
なハードウェアの実現を容易にすることを第１の解決課
題とするものである。すなわち、従来の多眼ステレオ装
置は、２眼ステレオ装置の単なる拡張であり、処理の効
率が悪い。これは、前述したように、各ステレオ対に対
してまずウインドウ内の明度値の差を求め（各ステレオ
対毎に安定化類似度の逆数を求め）、最後に全ステレオ
対分を単純に加算して全体的な安定化類似度の逆数を求
めている（各ステレオ対毎に求めた安定化類似度を加算
して全体的な安定化類似度を求めている）からである。
こうした各ステレオ対毎に求めた安定化類似度を加算し
て全体的な安定化類似度を求める従来の方法では、各ス
テレオ対分のハードウェア資源が必要である。さらに、
オクルージョンの影響を除去するためには、各オクルー
ジョンパターン毎に、上述した全体的な安定化類似度を
求める必要があり、扱うデータ量や処理量が膨大となっ
てしまうとともに、ハードウェア資源がさらに膨大にな
ってしまう。本発明の第１発明は、こうした実状に鑑み
てなされたものであり、オクルージョンの発生を考慮す
るような複雑な問題においても、画像データを早い段階
でステレオ処理に必要かつ十分な情報（可能性のあるオ
クルージョンの方向や強度等の情報を要素として含む特
徴ベクトル）へと圧縮し、この特徴ベクトルに対し適切
な安定化処理を行い、さらに特徴ベクトルの情報とベク
トル間の関係などを利用して３次元空間の情報を得るこ
とで、保持する情報量を少なくしかつ処理を簡略化して
ハードウェア資源を減らすことを第１の解決課題とする
ものである。According to the first invention of the present invention, by reviewing and optimizing the processing procedure of the conventional multi-view stereo apparatus, it is possible to reduce the data to be handled and to easily realize hardware which takes into account the occurrence of occlusion. Is a first solution. That is, the conventional multi-view stereo apparatus is merely an extension of the binocular stereo apparatus, and the processing efficiency is low. As described above, for each stereo pair, the difference in the brightness value within the window is first determined (the reciprocal of the stabilized similarity is determined for each stereo pair), and finally, all the stereo pairs are simply added. This is because the reciprocal of the overall stabilized similarity is obtained (the overall stabilized similarity is obtained by adding the stabilized similarities obtained for each stereo pair).
In such a conventional method of adding the stabilized similarities obtained for each stereo pair to obtain the overall stabilized similarity, hardware resources for each stereo pair are required. further,
In order to eliminate the effects of occlusion, it is necessary to obtain the overall stabilization similarity described above for each occlusion pattern, and the amount of data and processing to be handled becomes enormous, and hardware resources are further increased. It will be huge. The first invention of the present invention has been made in view of such a situation. Even in a complicated problem in which the occurrence of occlusion is taken into consideration, the image data needs to have sufficient and sufficient information (possibility of stereo processing) at an early stage. Into a feature vector that includes information such as the direction and intensity of a certain occlusion as an element, perform appropriate stabilization processing on this feature vector, and further utilize the feature vector information and the relationship between the vectors. A first object of the present invention is to obtain information in a three-dimensional space to reduce the amount of information to be held and simplify processing to reduce hardware resources.

【００６１】ところで、類似度を安定化させる際に使用
されるウインドウのサイズ、形状は一義的である。しか
し、図９（ａ）に示すように、画像＃１中のエッジ５１
が顕著な場合には、その近辺に物体５０の縁が存在する
可能性があり、この場合に一義的な形状である正方形の
ウインドウＷＤを使用すると、背景の影響を大きく受け
て、類似度の安定化処理の精度、ひいては距離の計測の
精度が損なわれることがある。本発明の第２発明は、こ
うした実状に鑑みてなされたものであり、画像中の特徴
に応じてウインドウの形状など、安定化のための領域を
変化させることで、計測の精度を向上させることを第２
の解決課題とするものである。By the way, the size and shape of the window used for stabilizing the similarity are unique. However, as shown in FIG.
Is remarkable, the edge of the object 50 may be present in the vicinity thereof. In this case, if a square window WD having a unique shape is used, the similarity of the similarity is greatly affected by the background. The accuracy of the stabilization process, and thus the accuracy of the distance measurement, may be impaired. The second invention of the present invention has been made in view of such a situation, and improves measurement accuracy by changing a region for stabilization such as a window shape according to a feature in an image. The second
Is a problem to be solved.

【００６２】また、前述したように従来のステレオ装置
では、類似度を安定化させる際に使用されるウインドウ
のサイズは、計測対象である物体の特徴とは無関係に固
定であり、パターンマッチングを実施する領域の特徴が
少ないような場合には、安定化が不十分となり、計測精
度が悪くなるという問題があった。そこで、このような
場合は、ウインドウのサイズを大きくして、画像＃１内
の特徴をより多く取り入れるようにして、安定化を十分
に行うようにするものである。安定化を十分に行うこと
で、距離の計測の精度を向上させることができる。本発
明の第３発明は、こうした実状に鑑みてなされたもので
あり、画像中の特徴に応じてウインドウのサイズなど、
安定化のための領域を変化させることで、計測の精度を
向上させることを第３の解決課題とするものである。As described above, in the conventional stereo apparatus, the size of the window used for stabilizing the similarity is fixed regardless of the characteristics of the object to be measured, and pattern matching is performed. When the characteristic of the region to be measured is small, there is a problem that the stabilization becomes insufficient and the measurement accuracy deteriorates. Therefore, in such a case, the size of the window is increased so that more features in the image # 1 are taken in, so that sufficient stabilization is performed. Sufficient stabilization can improve the accuracy of distance measurement. The third invention of the present invention has been made in view of such a situation, and has a window size and the like according to features in an image.
A third object of the present invention is to improve measurement accuracy by changing a region for stabilization.

【００６３】また、従来のステレオ装置では、図１４あ
るいは図１６に示すように、類似度の逆数Ｑssadが最も
小さくなったときの仮定距離ｚ_nxをもって一義的に、真
の距離であると推定している。Further, in the conventional stereo apparatus, as shown in FIG. 14 or FIG. 16, the assumed distance z _nx when the reciprocal Qssad of the similarity becomes the smallest is uniquely estimated as the true distance. ing.

【００６４】しかし、図１１に示すように、仮定距離と
類似度の逆数Ｑssadの対応関係によっては、類似度の逆
数Ｑssadが最も小さくなったときの仮定距離ｚ_x2ではな
くて、値としては最も小さくはないが、その近辺の変化
率が大きい極小点における仮定距離ｚ_x1が真の距離であ
る場合がある。つまり、類似度の逆数Ｑssadが最も小さ
くなったときの仮定距離ｚ_x2をもって一義的に、真の距
離であると推定すると、推定の精度が損なわれることが
ある。However, as shown in FIG. 11, depending on the correspondence between the assumed distance and the reciprocal Qssad of the similarity, the value is not the assumed distance z _x2 when the reciprocal Qssad of the similarity is minimized, but the value Although not small, the assumed distance z _{x1 at the} local minimum point where the rate of change is large in the vicinity may be the true distance. That is, if the assumed distance z _x2 when the reciprocal Qssad of the similarity becomes the smallest is uniquely estimated as the true distance, the accuracy of the estimation may be impaired.

【００６５】本発明の第４発明は、こうした実状に鑑み
てなされたものであり、類似度の大きさ以外に、類似度
の変化率を考慮して、距離を推定することで、距離推定
の精度を向上させることを第４の解決課題とするもので
ある。The fourth invention of the present invention has been made in view of such a situation, and estimates the distance in consideration of not only the magnitude of the similarity but also the rate of change of the similarity. It is a fourth solution to improve the accuracy.

【００６６】[0066]

【課題を解決するための手段および効果】そこで、本発
明の第１発明は、上記第１の課題解決のために、複数の
撮像手段を所定間隔をもって配置し、これら複数の撮像
手段のうちの一の撮像手段で対象物体を撮像したときの
当該一の撮像手段の撮像画像中の選択画素に対応する前
記物体上の点までの距離を算出し、この各画素毎に求め
られた距離に基づき前記物体を認識するようにした物体
の認識装置において、前記選択画素に対応する各撮像手
段の対応候補点近傍の明度情報等の局所情報を、仮定し
た距離毎に抽出する対応候補点情報抽出手段と、ここで
抽出した各撮像手段の対応候補点近傍の局所情報を入力
として、局所的な特徴を的確に特徴付ける特徴ベクトル
を生成する特徴ベクトルの生成手段と、その生成された
特徴ベクトルのデータを要素毎に位置的または時間的に
集積することによる情報の安定化の処理を行う特徴ベク
トルの安定化手段と、それらの安定化された特徴ベクト
ルの要素間の関係を利用して物体までの距離を推定する
距離推定手段を具えることにより、3次元空間の情報を
推定するようにしている。Therefore, in order to solve the above-mentioned first problem, a first invention of the present invention arranges a plurality of image pickup means at predetermined intervals, and comprises a plurality of image pickup means. Calculate the distance to a point on the object corresponding to the selected pixel in the image captured by the one imaging unit when the target object is imaged by the one imaging unit, based on the distance obtained for each pixel In the object recognition device configured to recognize the object, corresponding candidate point information extracting means for extracting local information such as brightness information near a corresponding candidate point of each imaging means corresponding to the selected pixel for each assumed distance And a feature vector generating means for generating a feature vector for accurately characterizing a local feature by using local information in the vicinity of a corresponding candidate point of each imaging means extracted here as input, and a data of the generated feature vector. Feature vector stabilization means that performs information stabilization processing by accumulating position or time data for each element, and using the relationship between the elements of these stabilized feature vectors to the object. By providing a distance estimating means for estimating the distance of the three-dimensional space, information of a three-dimensional space is estimated.

【００６７】すなわち、かかる構成によれば、図１に示
すように、画像データを早い段階でステレオ処理に必要
かつ十分な情報へと圧縮し、この特徴ベクトルに対し適
切な安定化処理を行い、さらに特徴ベクトルの情報とベ
クトル間の関係などを利用して３次元空間の情報を得る
ことができる。よって、多眼ステレオ装置においてオク
ルージョンの発生を考慮するような複雑な問題において
も、扱うデータ量や処理量を少なくすることができると
ともに、ハードウェア資源を減らすことができる。That is, according to this configuration, as shown in FIG. 1, image data is compressed into necessary and sufficient information for stereo processing at an early stage, and an appropriate stabilizing process is performed on this feature vector. Further, information of a three-dimensional space can be obtained by utilizing information of the feature vector and a relationship between the vectors. Therefore, even in the case of a complicated problem in which the occurrence of occlusion is considered in the multi-view stereo apparatus, the amount of data to be handled and the amount of processing can be reduced, and the hardware resources can be reduced.

【００６８】また、本発明の第２発明では、上記第２の
課題解決のために、同様な物体の認識装置において、前
記撮像手段で撮像された撮像画像の各選択画素毎に特徴
を抽出する特徴抽出手段と、前記特徴抽出手段で抽出さ
れた特徴に応じて、前記安定化手段で安定化するための
空間領域および時間領域を変化させる手段とを具えるよ
うにしている。According to a second aspect of the present invention, in order to solve the second problem, in a similar object recognizing apparatus, a feature is extracted for each selected pixel of an image picked up by the image pickup means. The image processing apparatus includes a feature extracting unit and a unit that changes a spatial domain and a time domain for stabilization by the stabilizing unit in accordance with the feature extracted by the feature extracting unit.

【００６９】すなわち、かかる構成によれば、図９
（ｂ）に示すように、抽出された特徴（エッジ５１の方
向）に応じて、安定化するための空間領域および時間領
域を変化させるようにしている。つまり、ウインドウＷ
ＤＸの形状を、エッジ５１の方向に沿った形状に変化さ
せている。これに対して従来は、類似度を安定化させる
際に使用されるウインドウのサイズ、形状は一義的であ
った。このため、図９（ａ）に示すように、画像＃１中
のエッジ５１が顕著な場合には、その近辺に物体５０の
縁が存在する可能性があり、この場合に一義的な形状の
正方形のウインドウＷＤを使用すると、背景の影響を大
きく受けて、類似度の安定化処理の精度、ひいては距離
の計測の精度が損なわれることがあった。これに対し
て、本発明の第２発明では、ウインドウＷＤＸの形状
を、エッジ５１の方向に沿った形状に変化させるように
しているので、背景の影響を大きく受けることがなくな
り、計測の精度を向上させることができる。That is, according to this configuration, FIG.
As shown in (b), the space region and the time region for stabilization are changed according to the extracted features (the direction of the edge 51). That is, the window W
The shape of DX is changed to a shape along the direction of the edge 51. On the other hand, conventionally, the size and shape of the window used for stabilizing the similarity have been univocal. For this reason, as shown in FIG. 9A, when the edge 51 in the image # 1 is remarkable, there is a possibility that the edge of the object 50 exists near the edge 51, and in this case, a unique shape When the square window WD is used, the accuracy of the similarity stabilization process and the accuracy of the distance measurement may be impaired due to the influence of the background. On the other hand, in the second invention of the present invention, the shape of the window WDX is changed to a shape along the direction of the edge 51, so that it is not greatly affected by the background, and the measurement accuracy is improved. Can be improved.

【００７０】また、本発明の第３発明では、上記第３の
課題解決のために、同様な物体の認識装置において、前
記撮像手段で撮像された撮像画像の各選択画素毎に特徴
を抽出する特徴抽出手段と、前記特徴抽出手段で抽出さ
れた特徴に応じて、前記安定化手段で安定化するための
空間領域のサイズを変化させる手段とを具えるようにし
ている。According to a third aspect of the present invention, in order to solve the third problem, in a similar object recognizing apparatus, a feature is extracted for each selected pixel of an image picked up by the image pickup means. The image processing apparatus further includes a feature extracting unit, and a unit that changes a size of a space area to be stabilized by the stabilizing unit according to the feature extracted by the feature extracting unit.

【００７１】すなわち、かかる構成によれば、図１０
（ａ）に示すように、基準画像の明るさの変化率の絶対
値を通常のサイズのウインドウＷＤ内で加算した値が一
定のしきい値よりも小さい場合には、特徴ベクトルＶＱ
を安定化させるために用いられるウインドウＷＤのサイ
ズを、図１０（ｂ）に示すように、通常よりも大きいサ
イズのウインドウＷＤＸに変化させている。これに対し
て従来は、類似度を安定化させる際に使用されるウイン
ドウのサイズは固定であった。このため、計測対象物体
の特徴が少ない場合には、安定化が不十分となり計測精
度が損なわれることがあった。これに対して、本発明の
第３発明では、特徴量が少ないと判断した場合には、ウ
インドウのサイズを大きくして、画像＃１内の特徴をよ
り多く取り入れるようにして、安定化を十分に行うよう
にしているので、距離の計測の精度を向上させることが
できる。That is, according to this configuration, FIG.
As shown in (a), when the value obtained by adding the absolute value of the change rate of the brightness of the reference image within the window WD of a normal size is smaller than a certain threshold value, the feature vector VQ
Is changed to a window WDX having a size larger than usual, as shown in FIG. 10B. On the other hand, conventionally, the size of the window used when stabilizing the similarity is fixed. For this reason, when the feature of the object to be measured is small, the stabilization is insufficient and the measurement accuracy may be impaired. On the other hand, according to the third aspect of the present invention, when it is determined that the feature amount is small, the window size is increased to take in more features in the image # 1, and sufficient stabilization is achieved. The accuracy of the distance measurement can be improved.

【００７２】また、本発明の第４発明では、上記第４の
課題解決のために、同様な物体の認識装置において、前
記撮像手段で撮像された撮像画像の各選択画素毎に特徴
を抽出する特徴抽出手段と、前記特徴抽出手段で抽出さ
れた特徴に応じて、前記推定距離における前記全体的な
安定化類似度の変化率を予測し、この予測した変化率で
前記全体的な安定化類似度が変化している仮定距離を、
前記推定距離の候補として出力する手段とを具えるよう
にしている。According to a fourth aspect of the present invention, in order to solve the fourth problem, in a similar object recognizing apparatus, a feature is extracted for each selected pixel of an image picked up by the image pickup means. A feature extraction unit for predicting a change rate of the overall stabilization similarity at the estimated distance in accordance with the feature extracted by the feature extraction unit; The assumed distance at which the degree is changing,
Means for outputting the estimated distance as a candidate.

【００７３】すなわち、かかる構成によれば、図１１に
示すように、抽出された特徴（明度の変化率）に応じ
て、推定距離における全体的な安定化類似度の逆数Ｑ’
ssadの変化率が予測される。つまり、明度の変化率が大
きい場合には、真の距離における類似度の変化率は大き
いと予測される。そこで、明度の変化率が大きい場合に
は、推定距離ｚ_x1における全体的な安定化類似度の逆数
Ｑ’ssad の変化率は大きくなっているものと予測し、
この大きい変化率で全体的な安定化類似度の逆数Ｑ’ss
ad が変化している仮定距離ｚ_x1が、推定距離の候補と
される。That is, according to this configuration, as shown in FIG. 11, the reciprocal Q ′ of the overall stabilization similarity at the estimated distance according to the extracted features (rate of change in brightness).
The rate of change of ssad is predicted. That is, when the rate of change in brightness is large, the rate of change in similarity at a true distance is predicted to be large. Therefore, when the change rate of the brightness is large, it is predicted that the change rate of the reciprocal Q'ssad of the overall stabilized similarity at the estimated distance z _x1 is large,
With this large rate of change, the inverse Q'ss of the overall stabilization similarity
The hypothetical distance z _x1 in which ad changes is set as a candidate for the estimated distance.

【００７４】この結果、全体的な安定化類似度の逆数
Ｑ’ssad が最小になっている仮定距離ｚ_x2ではなく
て、これよりも全体的な安定化類似度の逆数Ｑ’ssadの
変化率が大きくなっている仮定距離ｚ_x1が、推定距離と
決定される。As a result, not the assumed distance z _{x2 in} which the reciprocal Q'ssad of the overall stabilized similarity is minimized, but the change rate of the reciprocal Q'ssad of the overall stabilized similarity more than this is assumed distance z _x1 which is larger, is determined as the estimated distance.

【００７５】ここに、従来のステレオ装置の距離推定部
３２０（１０７、２１６）では、図１４あるいは図１６
に示すように、類似度の逆数Ｑssadが最も小さくなった
ときの仮定距離ｚ_nxをもって一義的に、真の距離である
と推定していた。Here, the distance estimating section 320 (107, 216) of the conventional stereo apparatus has the configuration shown in FIG.
As shown in (1), the assumed distance z _nx when the reciprocal Qssad of the similarity becomes the smallest is uniquely estimated as the true distance.

【００７６】しかし、図１１に示すように、仮定距離ｚ
_nと類似度の逆数Ｑssadの対応関係によっては、類似度
の逆数Ｑssadが最も小さくなったときの仮定距離ｚ_x2で
はなくて、値としては最も小さくはないが、その近辺の
変化率が大きい極小となったときの仮定距離ｚ_x1が真の
距離である場合がある。つまり、従来のものでは、類似
度の逆数Ｑssadが最も小さくなったときの仮定距離ｚ_x2
をもって一義的に、真の距離であると推定しているため
に、変化率が大きい極小となったときの仮定距離ｚ_x1が
真の距離であることを見逃してしまうことがあった。However, as shown in FIG.
Depending on the correspondence between _n and the reciprocal Qssad of the similarity, it is not the hypothetical distance z _x2 when the reciprocal Qssad of the similarity is the smallest, but the value is not the smallest but the local minimum where the rate of change in the vicinity is large. _May be the true distance. That is, in the related art, the assumed distance z _x2 when the reciprocal Qssad of the similarity becomes the smallest is obtained.
Because it is uniquely estimated that the distance is a true distance, the assumption distance z _{x1 at the} time when the rate of change becomes a minimum is sometimes overlooked as a true distance.

【００７７】この点、本第４発明によれば、類似度の大
きさ以外に、類似度の変化率を考慮して、距離を推定す
るようにしているので、真の距離を見逃してしまうこと
なく、距離推定の精度を向上させることができる。In this respect, according to the fourth aspect of the present invention, since the distance is estimated in consideration of the rate of change of the similarity in addition to the magnitude of the similarity, the true distance may be overlooked. In addition, the accuracy of distance estimation can be improved.

【００７８】[0078]

【発明の実施の形態】以下、図面を参照して本発明の実
施形態について説明する。ここでは、１例として類似度
を使用した場合を説明する。Embodiments of the present invention will be described below with reference to the drawings. Here, a case where the similarity is used as an example will be described.

【００７９】図２２は、本実施形態の多眼ステレオ視に
よる物体認識装置の構成を示すブロック図である。FIG. 22 is a block diagram showing the configuration of an object recognition apparatus for multi-view stereoscopic vision according to this embodiment.

【００８０】同図に示す実施形態の装置では、９台の画
像センサ１、２、３、…、Ｎ（Ｎ＝９）が、基準画像セ
ンサ１を中心にして、「田の字」状に配置されているも
のとする。なお、画像センサの配置、画像センサの台数
が、本実施形態と異なっている場合であっても、以下に
述べるのと同様な処理を適用することで同等の効果を得
ることができる。基準画像入力部４０１には、視差ｄ
（距離ｚ）を算出する際に基準となる画像センサ１で撮
像された基準画像＃１が取り込まれる。一方、画像入力
部４０２には、基準画像＃１上の点に対応する対応点が
存在する画像である画像センサ２の撮像画像＃２が取り
込まれる。他の画像入力部４０３、…、４０４において
も、基準画像＃１に対応する画像センサ３の画像＃３
が、…、基準画像＃１に対応する画像センサＮ（Ｎ＝
９）の画像＃Ｎがそれぞれ取り込まれる。In the apparatus of the embodiment shown in FIG. 9, nine image sensors 1, 2, 3,..., N (N = 9) are arranged in a “field” around the reference image sensor 1. It is assumed that they are arranged. Note that, even when the arrangement of the image sensors and the number of image sensors are different from those in the present embodiment, the same effects can be obtained by applying the same processing as described below. The reference image input unit 401 has a parallax d
A reference image # 1 captured by the image sensor 1 serving as a reference when calculating (distance z) is captured. On the other hand, the image input unit 402 captures a captured image # 2 of the image sensor 2 which is an image having a corresponding point corresponding to a point on the reference image # 1. Also in other image input units 403,..., 404, image # 3 of image sensor 3 corresponding to reference image # 1.
,..., The image sensor N (N =
The image #N of 9) is captured, respectively.

【００８１】対応候補点座標発生部４０５では、基準画
像＃１の各画素に対して、仮定した距離ｚ_n毎に、画像
センサ２の画像＃２の対応候補点の位置座標、画像セン
サ３の画像＃３の対応候補点の位置座標、…、画像セン
サＮの画像＃Ｎの対応候補点の位置座標がそれぞれ記
憶、格納されており、これらを読み出すことにより各対
応候補点の位置座標を発生する。The corresponding candidate point coordinate generator 405 calculates the position coordinates of the corresponding candidate point of the image # 2 of the image sensor 2 and the position coordinates of the corresponding image of the image sensor 3 for each pixel of the reference image # 1 at every assumed distance z _n . The position coordinates of the corresponding candidate point of the image # 3,..., The position coordinates of the corresponding candidate point of the image #N of the image sensor N are stored and stored, and by reading these, the position coordinates of each corresponding candidate point are generated. I do.

【００８２】すなわち、基準画像センサ１の基準画像＃
１の中から所定位置Ｐ₁に存在する（ｉ、ｊ）で特定さ
れる画素が選択されるとともに、認識対象物体５０まで
の距離ｚ_nが仮定される。そして、この仮定距離ｚ_nに対
応する画像センサ２の画像＃２内の対応候補点Ｐ₂の位
置座標が読み出される。同様にして、基準画像＃１の選
択画素Ｐ₁（ｉ、ｊ）、仮定距離ｚ_nに対応する画像セン
サ３の画像＃３の対応候補点Ｐ₃の位置座標が読み出さ
れ、…、基準画像＃１の選択画素Ｐ₁（ｉ、ｊ）、仮定
距離ｚ_nに対応する画像センサＮの画像＃Ｎの対応候補
点Ｐ_Nの位置座標が読み出される。そして、仮定距離ｚ_n
を順次変化させて同様の読み出しが行われる。また、選
択画素を順次変化させることによって同様の読み出しが
行われる。こうして対応候補点Ｐ₂の位置座標（Ｘ₂、Ｙ
₂）、対応候補点Ｐ₃の位置座標（Ｘ₃、Ｙ₃）、・・・、
対応候補点Ｐ_Nの位置座標（Ｘ_N、Ｙ_N）が対応候補点座
標発生部４０５から出力される。That is, the reference image # of the reference image sensor 1
1, a pixel specified by (i, j) existing at the predetermined position P ₁ is selected, and a distance z _n to the recognition target object 50 is assumed. Then, the position coordinates of the corresponding candidate point P ₂ in the image # 2 image sensor 2 corresponding to this assumption distance z _n is read. Similarly, the position coordinates of the selected pixel P ₁ (i, j) of the reference image # 1 and the corresponding candidate point P ₃ of the image # 3 of the image sensor 3 corresponding to the assumed distance z _n are read out. image # 1 of the selected pixel P ₁ (i, j), the position coordinates of the corresponding candidate point P _n of the image #N image sensors n which corresponds to the assumed distance z _n is read. And the assumed distance z _n
Are sequentially changed, and similar reading is performed. Similar reading is performed by sequentially changing the selected pixels. Thus the corresponding candidate point P ₂ position coordinates (X _2, Y
₂ ), position coordinates (X ₃ , Y ₃ ) of the corresponding candidate point P ₃ ,.
The position coordinates (X _N , Y _N ) of the corresponding candidate point P _N are output from the corresponding candidate point coordinate generator 405.

【００８３】つぎに、局所情報抽出部４０６では、この
ようにして対応候補点座標発生部４０５によって発生さ
れた画像センサ２の対応候補点Ｐ₂の位置座標（Ｘ₂、Ｙ
₂）に基づき対応候補点Ｐ₂の局所情報を抽出する処理が
実行される。同様にして、局所情報抽出部４０７では、
対応候補点座標発生部４０５で発生された画像センサ３
の画像＃３の対応候補点Ｐ₃の位置座標（Ｘ₃、Ｙ₃）に
基づいて、対応候補点Ｐ₃の局所情報が、…、局所情報
抽出部４０８では、対応候補点座標発生部４０５で発生
された画像センサＮの画像＃Ｎの対応候補点Ｐ_Nの位置
座標（Ｘ_N、Ｙ_N）に基づいて、対応候補点Ｐ_Nの局所情
報がそれぞれ求められる。Next, in the local information extracting section 406, the position coordinates (X ₂ , Y 2) of the corresponding candidate point P ₂ of the image sensor 2 generated by the corresponding candidate point coordinate generating section 405 in this way.
Process of extracting the local information of the corresponding candidate points P ₂ based on ₂₎ is performed. Similarly, in the local information extraction unit 407,
Image sensor 3 generated by corresponding candidate point coordinate generation unit 405
Image # 3 position coordinates of the corresponding candidate point P ₃ (X _3, Y ₃₎ based on the local information of the corresponding candidate point P ₃ is, ..., the local information extracting unit 408, the corresponding candidate point coordinate generating unit 405 The local information of the corresponding candidate point P _N is obtained based on the position coordinates (X _N , Y _N ) of the corresponding candidate point P _N of the image #N of the image sensor N generated in the above.

【００８４】さらに、類似度算出部４０９では、上記局
所情報抽出部４０６で得られた対応候補点Ｐ₂の局所情
報と基準画像＃１の選択画素Ｐ₁の画像情報との類似度
が算出される。[0084] Further, the similarity calculation unit 409, the degree of similarity between the local information extracting unit of the corresponding candidate point P ₂ obtained in 406 local information and the reference image # 1 of the image information of the selected pixel P ₁ is calculated You.

【００８５】同様にして類似度算出部４１０では、上記
局所情報抽出部４０７で得られた対応候補点Ｐ₃の局所
情報と基準画像＃１の選択画素Ｐ₁の画像情報との類似
度が算出される。同様にして類似度算出部４１１では、
上記局所情報抽出部４０８で得られた対応候補点Ｐ_Nの
局所情報と基準画像＃１の選択画素Ｐ₁の画像情報との
類似度が算出される。Similarly, the similarity calculating section 410 calculates the similarity between the local information of the corresponding candidate point P ₃ obtained by the local information extracting section 407 and the image information of the selected pixel P ₁ of the reference image # 1. Is done. Similarly, in the similarity calculation unit 411,
Similarity between the local information and the reference image # 1 of the image information of the selected pixel P ₁ of the corresponding candidate points P _N obtained above the local information extracting unit 408 is calculated.

【００８６】以上は、図２１に示す従来装置と共通して
いる構成である。The configuration described above is common to the conventional apparatus shown in FIG.

【００８７】その後段の特徴ベクトルの計算部４１２で
は、上記類似度算出部４０９〜４１１で、各ステレオ対
毎に演算された類似度に基づき、後述する局所オクルー
ジョンベクトル（オクルージョンベクトルのことを、適
宜、特徴ベクトルと称する）ＶＱが、基準画像＃１の各
選択画素Ｐ₁毎に計算される。The subsequent-stage feature vector calculation unit 412 determines a local occlusion vector (which will be described later) on the basis of the similarity calculated for each stereo pair by the similarity calculation units 409 to 411. , referred to as a feature vector) VQ is calculated based on the image # each selected pixel P ₁ of 1.

【００８８】特徴ベクトルの安定化部４１３では、後述
するように、所定ウインドウ内の各局所オクルージョン
ベクトルＶＱを加算することで、安定化した大局オクル
ージョンベクトルＳＶＱが算出される。The feature vector stabilizing section 413 calculates the stabilized global occlusion vector SVQ by adding each local occlusion vector VQ within a predetermined window, as described later.

【００８９】補正類似度算出部４１４では、安定化した
大局オクルージョンベクトルＳＶＱに基づき、オクルー
ジョンの発生の有無が判断されるとともに、オクルージ
ョンが発生している場合には、オクルージョンの影響を
除去するように、安定化全体類似度の逆数Ｑssad ( i,
j,ｚ_n) が補正され、補正された安定化全体類似度の逆
数Ｑ’ssad ( i,j,ｚ_n) が算出される。The correction similarity calculating section 414 determines whether or not occlusion has occurred based on the stabilized global occlusion vector SVQ, and removes the influence of occlusion if occlusion has occurred. , The inverse of the stabilized overall similarity Qssad (i,
j, z _n ) is corrected, and the reciprocal Q'ssad (i, j, z _n ) of the corrected stabilized overall similarity is calculated.

【００９０】距離推定部４１５では、仮定距離ｚ_nと、
上記補正された安定化全体類似度の逆数Ｑssad ( i,j,
ｚ_n) との対応関係が求められ、この対応関係から、最
も類似度が高くなる点（類似度の逆数が最小値となる
点）が判別され、この類似度の逆数Ｑssad ( i,j,ｚ_n)
が最も低くなっている点に対応する仮定距離ｚ_nxが最終
的に、認識対象物体５０上の点５０ａまでの真の距離
（最も確からしい距離）と推定される。かかる処理は、
基準画像＃１の各選択画素毎に全画素について行われ
る。In the distance estimating section 415, the assumed distance z _n ,
The inverse Qssad (i, j,
z _n ), a point having the highest similarity (a point at which the reciprocal of the similarity becomes the minimum value) is determined from the correspondence, and the reciprocal Qssad (i, j, z _n )
The hypothetical distance z _nx corresponding to the point where is lowest is finally estimated as the true distance (the most likely distance) to the point 50a on the recognition target object 50. Such processing is
This is performed for all pixels for each selected pixel of the reference image # 1.

【００９１】以上のようにして、距離推定部４１５で
は、仮定距離ｚ_nを順次変化させて得られた類似度の中
から、最も類似度が高くなるものが判別され、最も類似
度が高くなる仮定距離ｚ_nxが真の距離と推定され、出力
される。そして、かかる距離推定が基準画像＃１の全画
素について行われることから、基準画像＃１の全選択画
素に距離情報を付与した画像（距離画像）が生成される
ことになる。As described above, the distance estimating section 415 determines the one having the highest similarity from the similarities obtained by sequentially changing the assumed distance z _n , and has the highest similarity. The hypothetical distance z _nx is estimated as the true distance and is output. Then, since such distance estimation is performed for all pixels of the reference image # 1, an image (distance image) in which distance information is added to all selected pixels of the reference image # 1 is generated.

【００９２】つぎに、図２〜図７を併せ参照して、上記
特徴ベクトルの計算部４１２、特徴ベクトルの安定化部
４１３、補正類似度算出部４１４、距離推定部４１５で
行われる処理について詳述する。Next, with reference to FIGS. 2 to 7, the processing performed by the feature vector calculation section 412, feature vector stabilization section 413, correction similarity calculation section 414, and distance estimation section 415 will be described in detail. Will be described.

【００９３】・オクルージョンベクトルの生成方法さて、オクルージョンベクトルを生成する方法として
は、前述した論文２に示されるように、オクルージョン
が発生するパターンを予め用意しておき、この発生パタ
ーンに応じて、オクルージョンベクトルを生成していく
方法と、オクルージョンの発生パターンを用意すること
なく、オクルージョンベクトルを生成する方法とがあ
る。A method of generating an occlusion vector As a method of generating an occlusion vector, a pattern in which occlusion occurs is prepared in advance, as shown in the above-mentioned paper 2, and an occlusion vector is generated in accordance with the generated pattern. There are a method of generating a vector and a method of generating an occlusion vector without preparing an occlusion occurrence pattern.

【００９４】以下、それぞれの場合に分けて説明する
が、これらを組み合わせて実施してもよい。Hereinafter, each case will be described separately, but they may be implemented in combination.

【００９５】・（方法１）オクルージョン発生パターン
を予め用意する場合の実施形態この方法は、９台の画像センサ１〜９で物体５０を観測
したときに起こり得ると想定されるオクルージョンの各
発生パターンを予め用意しておき、これを記憶テーブル
（図３参照）に記憶しておき、類似度算出部４０９〜４
１１で、各ステレオ対毎に求められた類似度の逆数Ｑad
( i,j,k,ｚ_n)に基づき、これらの分布（図２（ａ）参
照）を求め、この分布を、２値化した分布のパターン
（図２（ｂ）参照）に変換し、この２値化した分布のパ
ターンに一致する記憶テーブルの記憶パターンに応じ
て、局所的なオクルージョンベクトルＶＱを求めるもの
である。(Method 1) Embodiment in which Occlusion Occurrence Patterns are Preliminarily Prepared This method uses each of the occlusion occurrence patterns assumed to be possible when the object 50 is observed by the nine image sensors 1 to 9. Are prepared in advance, and stored in a storage table (see FIG. 3).
At 11, the reciprocal Qad of the similarity obtained for each stereo pair
Based on (i, j, k, z _n ), these distributions (see FIG. 2 (a)) are obtained, and this distribution is converted into a binarized distribution pattern (see FIG. 2 (b)). The local occlusion vector VQ is obtained according to the storage pattern of the storage table that matches the binarized distribution pattern.

【００９６】特徴ベクトルの計算部４１２では、以下の
ようにして局所オクルージョンベクトルＶＱが計算され
る。なお、以下、ステレオ対ｋ（ｋ＝１、２…８）は、
基準画像センサ１と他の画像センサ２…９の対のことを
いうものとする。つまり、ステレオ対１は、基準画像セ
ンサ１と画像センサ２の対のことで、ステレオ対２は、
基準画像センサ１と画像センサ３の対のことで、…ステ
レオ対８とは、基準画像センサ１と画像センサ９の対の
ことをいうものとする。The feature vector calculator 412 calculates the local occlusion vector VQ as follows. Hereinafter, the stereo pair k (k = 1, 2,... 8)
A pair of the reference image sensor 1 and the other image sensors 2... That is, the stereo pair 1 is a pair of the reference image sensor 1 and the image sensor 2, and the stereo pair 2 is
A pair of the reference image sensor 1 and the image sensor 3... A stereo pair 8 means a pair of the reference image sensor 1 and the image sensor 9.

【００９７】まず、図２（ａ）に示すように、所定の２
次元座標系Ｘ−Ｙが設定され、この座標系Ｘ−Ｙの原点
に、基準画像センサ１が位置決めされる。他の画像セン
サ２〜９については、それぞれ、この座標系Ｘ−Ｙ上の
原点回りの各所定の座標位置に位置決めされる。First, as shown in FIG.
A dimensional coordinate system XY is set, and the reference image sensor 1 is positioned at the origin of the coordinate system XY. The other image sensors 2 to 9 are respectively positioned at predetermined coordinate positions around the origin on the coordinate system XY.

【００９８】ここで、Ｘ−Ｙ座標系の各所定の座標位置
に、各画像センサ２〜９（各ステレオ対ｋ＝１〜８）毎
に演算された類似度の逆数Ｑad( i,j,k,ｚ_n) を対応づ
けたとすると、図２（ａ）に示すような類似度の逆数Ｑ
ad( i,j,k,ｚ_n) の分布が得られる。つまり、この図２
（ａ）で、山が高くなっている座標位置に対応する画像
センサ（ステレオ対）では、類似度の逆数が大きくなっ
ており、そこのステレオ対では、オクルージョンが発生
している可能性が高いことを意味している。Here, the reciprocal Qad (i, j,) of the similarity calculated for each of the image sensors 2 to 9 (each stereo pair k = 1 to 8) is provided at each predetermined coordinate position in the XY coordinate system. k, z _n ), the reciprocal Q of the similarity as shown in FIG.
ad (i, j, k, z n) distribution is obtained. That is, FIG.
In (a), the reciprocal of the similarity is large in the image sensor (stereo pair) corresponding to the coordinate position where the mountain is high, and there is a high possibility that occlusion has occurred in the stereo pair there. Means that.

【００９９】つぎに、類似度算出部４０９〜４１１で、
各ステレオ対ｋ毎に求められた類似度の逆数Ｑad( i,j,
k,ｚ_n) に基づき、これらの平均値μ( i,j,ｚ_n )と標準
偏差σ(i,j,ｚ_n ) が求められる。Next, the similarity calculating sections 409 to 411
The reciprocal of the similarity Qad (i, j,
k, based on the z _n), these average values _{μ (i, j, z n} ) and the standard deviation _{σ (i, j, z n} ) is determined.

【０１００】そして、この求められた平均値μ( i,j,ｚ
_n)＋標準偏差σ( i,j,ｚ_n )をスレッシュホールド（し
きい値）として、それぞれの類似度の逆数Ｑad( i,j,k,
ｚ_n)の大きさが２値的に判定される。Then, the obtained average value μ (i, j, z
_n ) + standard deviation σ (i, j, z _n ) as a threshold (threshold), and the reciprocal Qad (i, j, k,
The magnitude of z _n ) is determined in a binary manner.

【０１０１】すなわち、Ｑad( i,j,k,ｚ_n) ＞ μ( i,j,ｚ_n) ＋ σ( i,j,ｚ_n) …（２）の条件を満たしたときに、０と判定し、上記（２）式の
条件を満たしていないときに、１と判定する。That is, when the condition of Qad (i, j, k, z _n )> μ (i, j, z _n ) + σ (i, j, z _n ) (2) is satisfied, 0 and When it is determined that the condition of the above equation (2) is not satisfied, it is determined as 1.

【０１０２】ここで、類似度の逆数Ｑad( i,j,k,ｚ_n)
の大きさが０と判定されたときは、所定の基準レベルよ
りも類似度の逆数Ｑad( i,j,k,ｚ_n) が大きくなってお
り、このステレオ対ｋでは、オクルージョンが発生して
いる可能性が高いことを意味する。逆に、１と判定され
たときは、そのステレオ対ｋでのオクルージョンの発生
可能性は低いことになる。Here, the reciprocal Qad (i, j, k, z _n ) of the similarity is
Is determined to be 0, the reciprocal Qad (i, j, k, z _n ) of the similarity is greater than a predetermined reference level, and in this stereo pair k, occlusion occurs. Means that it is likely to be Conversely, when it is determined to be 1, the possibility of occurrence of occlusion in the stereo pair k is low.

【０１０３】上記（２）式に示す２値的な判定は、一例
であり、つぎのような実施も可能である。The binary decision shown in the above equation (2) is an example, and the following implementation is also possible.

【０１０４】すなわち、上記（２）式の代わりに、Ｑad( i,j,k,ｚ_n) ＞ μ( i,j,ｚ_n) …（２）’ を使用してもよく、また、各類似度の逆数Ｑad( i,j,k,
ｚ_n) の中から、値の大きい上位ｈ個（ｈは予め設定し
ておく）を選択し、これらを０とし、それ以下のものを
１と判定するような実施も可能である。That is, instead of the above equation (2), Qad (i, j, k, z _n )> μ (i, j, z _n ) (2) ′ may be used. The reciprocal Qad (i, j, k,
z _n ), it is possible to select the top h (h is set in advance) with a large value, set these to 0, and judge those below it to be 1.

【０１０５】こうして、図２（ｂ）に示すように、２値
化された類似度の逆数の分布パターンが生成される。つ
まり、この図２（ｂ）で、「０」になっている座標位置
に対応する画像センサ（ステレオ対）では、類似度の逆
数が大きくなっており、そこのステレオ対では、オクル
ージョンが発生している可能性が高いことを意味してい
る。この２値化された類似度の逆数の分布パターンのこ
とを、オクルージョンパターン（オクルージョンが発生
する可能性の大小の分布を示すパターン）というものと
する。In this way, as shown in FIG. 2B, a distribution pattern of the reciprocal of the binarized similarity is generated. In other words, in the image sensor (stereo pair) corresponding to the coordinate position “0” in FIG. 2B, the reciprocal of the degree of similarity is large, and occlusion occurs in the stereo pair. Means that it is likely to be. The distribution pattern of the reciprocal of the binarized similarity is referred to as an occlusion pattern (a pattern indicating the distribution of the possibility of occurrence of occlusion).

【０１０６】つぎに、こうして生成されたオクルージョ
ンパターンに一致するパターンが、記憶テーブルに記憶
された各オクルージョンパターンの中から選択される。Next, a pattern that matches the occlusion pattern thus generated is selected from the occlusion patterns stored in the storage table.

【０１０７】図３は、９台の画像センサ１〜９で物体５
０を観測したときに起こり得ると想定されるオクルージ
ョンの各発生パターンであり、これらが予め用意され、
記憶テーブルに記憶される。FIG. 3 shows an object 5 with nine image sensors 1 to 9.
Occlusion patterns assumed to occur when 0 is observed. These patterns are prepared in advance,
Stored in a storage table.

【０１０８】同図３において「×」は、類似度の逆数Ｑ
ad( i,j,k,ｚ_n)の大きさ「０」を示す。また、各記憶
オクルージョンパターンには、正規化したオクルージョ
ンベクトルＶ（ox1(ｍ)、oy1(ｍ)）のＸ成分ox1(ｍ)、
Ｙ成分oy1(ｍ)が対応づけられている。In FIG. 3, “×” indicates the reciprocal Q of the similarity.
Indicates the size “0” of ad (i, j, k, z _n ). Each of the stored occlusion patterns includes an X component ox1 (m) of the normalized occlusion vector V (ox1 (m), oy1 (m)),
The Y component oy1 (m) is associated.

【０１０９】ここで、正規化したオクルージョンベクト
ルＶとは、原点から、類似度の逆数Ｑad(i,j,k,ｚ_n)が
「０」の方向に向かい、スカラ量が１のベクトルのこと
である。このベクトルの方向が、オクルージョンが発生
している可能性の高いステレオ対ｋを指し示している。Here, the normalized occlusion vector V is a vector in which the reciprocal Qad (i, j, k, z _n ) of the similarity goes from the origin to the direction of “0” and the scalar quantity is 1. It is. The direction of this vector indicates the stereo pair k in which occlusion is likely to occur.

【０１１０】よって、図２（ｂ）のように生成されたオ
クルージョンパターンに対応するものとして、図３の記
憶テーブルの中から、矢印に示すオクルージョンパター
ンが選択され、これに対応づけられた正規化オクルージ
ョンベクトルＶのＸ成分、Ｙ成分（０、１）より、図４
（ａ）に示すように正規化オクルージョンベクトルＶが
参照されることになる。Therefore, an occlusion pattern indicated by an arrow is selected from the storage table of FIG. 3 as the one corresponding to the occlusion pattern generated as shown in FIG. 2B, and the normalized occlusion pattern corresponding to this is selected. From the X component and the Y component (0, 1) of the occlusion vector V, FIG.
The normalized occlusion vector V is referred to as shown in FIG.

【０１１１】なお、図３に示す記憶テーブルに該当する
パターンがないとき、つまり２値化分布パターンの内容
がすべて１である場合には、「オクルージョンが発生し
ていない」と判断される。When there is no corresponding pattern in the storage table shown in FIG. 3, that is, when the contents of the binarized distribution pattern are all 1, it is determined that "occlusion has not occurred".

【０１１２】つぎに、上記正規化オクルージョンベクト
ルＶ（ox1(ｍ)、oy1(ｍ)）に対して類似度の逆数の大き
さ（類似度の低さ）を加味した局所オクルージョンベク
トルＶＱ(ＱOX ( i,j,ｚ_n) , ＱOY (i,j,ｚ_n) )が、
次式によって求められる。Next, the local occlusion vector VQ (QOX (QOX ( i, j, z _n ), QOY (i, j, z _n ))
It is obtained by the following equation.

【０１１３】ＱOX ( i,j,ｚ_n) ＝ Z × ox1(ｍ) …（３−１）ＱOY ( i,j,ｚ_n) ＝ Z × oy1(ｍ) …（３−２）ここで、スカラ量Ｚはである。ただし、（３−３）式において、ｋは、２値化
された類似度の逆数が０になっているステレオ対であ
る。[0113] _{QOX (i, j, z n} ) = Z × ox1 (m) ... (3-1) QOY (i, j, z n) = Z × oy1 (m) ... (3-2) where, The scalar quantity Z is It is. However, in Expression (3-3), k is a stereo pair in which the reciprocal of the binarized similarity is 0.

【０１１４】以上のようにして、原点から、類似度の逆
数Ｑad( i,j,k,ｚ_n) が所定のレベル（μ( i,j,ｚ_n) ＋
σ( i,j,ｚ_n)）よりも大きくなっているステレオ対ｋ
の座標位置に向かい、かつ類似度の逆数Ｑad( i,j,k,ｚ
_n) の大きさに応じた大きさ（類似度が大きいほど小さ
くなる）のスカラ量Zの局所オクルージョンベクトルＶ
Ｑ(ＱOX ( i,j,ｚ_n) , ＱOY (i,j,ｚ_n) )が求められ
る。As described above, from the origin, the reciprocal Qad (i, j, k, z _n ) of the similarity is at a predetermined level (μ (i, j, z _n ) +
stereo pair k larger than σ (i, j, z _n ))
And the reciprocal of the similarity Qad (i, j, k, z
_n ) is a local occlusion vector V of a scalar quantity Z of a size (smaller as the degree of similarity increases).
Q (QOX (i, j, z n), QOY (i, j, z n)) is obtained.

【０１１５】すなわち、図４（ｂ）に示すように、この
ベクトルＶＱの方向が、オクルージョンが発生している
可能性の高いステレオ対ｋを指し示しており、このベク
トルＶＱの大きさが、オクルージョンが発生している可
能性の高さ（発生度合い）を示している。That is, as shown in FIG. 4 (b), the direction of this vector VQ points to the stereo pair k which has a high possibility that occlusion has occurred, and the magnitude of this vector VQ indicates that the occlusion is The degree of occurrence (the degree of occurrence) is shown.

【０１１６】一方において、各ステレオ対ｋ毎に演算さ
れた類似度の逆数Ｑad( i,j,k,ｚ_n)に基づき、基準画像
センサ１の選択画素Ｐ₁の画像情報と、他の全ての画像
センサ２〜９の各対応候補点Ｐ₂〜Ｐ₉の各画像情報とを
突き合わせた全体的な類似度Ｑsad ( i,j,ｚ_n)が、下記
（４）式によって求められる。On the other hand, based on the reciprocal Qad (i, j, k, z _n ) of the similarity calculated for each stereo pair k, the image information of the selected pixel P ₁ of the reference image sensor 1 and all other overall similarity Qsad abutted with each image information of each candidate corresponding points P ₂ to P ₉ of the image sensor 2 to 9 (i, j, z _n) is determined by the following equation (4).

【０１１７】ここで、Ｑsad ( i,j,ｚ_n)は、オクルージョンが発生し
ていないと仮定した場合の全体的な類似度の逆数を表し
ている。[0117] Here, Qsad (i, j, z _n ) represents the reciprocal of the overall similarity on the assumption that no occlusion has occurred.

【０１１８】・（方法２）オクルージョン発生パターン
を予め用意しない場合の実施形態この方法では、正規化オクルージョンベクトルから局所
オクルージョンベクトルを求めるのではなく、各ステレ
オ対毎に求めたベクトルを合成することで、局所オクル
ージョンベクトルＶＱを求めるようにしている。(Method 2) Embodiment in which Occlusion Occurrence Patterns are Not Prepared In this method, a local occlusion vector is not obtained from a normalized occlusion vector, but a vector obtained for each stereo pair is synthesized. , The local occlusion vector VQ is determined.

【０１１９】すなわち、図５に示すように、各ステレオ
対ｋ毎に、原点からの方向を示す方向余弦ベクトルＶ
_k( ox2 (ｋ)、oy2 (ｋ)）が求められる。たとえば、原
点からステレオ対１（画像センサ２）の座標位置に向か
う方向余弦ベクトルＶ₁は、Ｖ₁（-1／√2，1／√2）と
求められる。That is, as shown in FIG. 5, the direction cosine vector V indicating the direction from the origin is provided for each stereo pair k.
_k (ox2 (k), oy2 (k)) is obtained. For example, the direction cosine vector V ₁ directed from the origin to the coordinate position of the stereo pair 1 (image sensor 2) is determined as _{V 1 (-1 / √2,1 / √2} ).

【０１２０】こうして、各ステレオ対ｋ毎に、方向余弦
ベクトルＶ_kが演算されると、これら方向余弦ベクトル
Ｖ_kに対して、対応する類似度の逆数Ｑad( i,j,ｋ,ｚ_n)
が乗算されることで、類似度の逆数の大きさが加味され
たベクトルＶ_kＱ（Ｑad(i,j,ｋ,ｚ_n)×ox2（ｋ) ，Ｑad
( i,j,ｋ,ｚ_n)×oy2 (ｋ)）が、各ステレオ対ｋ毎に、
求められる。When the direction cosine vector V _k is calculated for each stereo pair k, the reciprocal Qad (i, j, k, z _n ) of the corresponding similarity is calculated for these direction cosine vectors V _k .
By There is multiplied, the similarity of the inverse of the magnitude is taken into account the vector _{V k Q (Qad (i,} j, k, z n) × ox2 (k), Qad
(i, j, k, z _n ) × oy2 (k)) for each stereo pair k,
Desired.

【０１２１】このようにして求められたベクトルＶ_kＱ
（ｋ＝１〜８）を図６（ａ）に示す。なお、同図６
（ａ）では、ベクトルＶ₄ＱとベクトルＶ₈Ｑが大きさが
零である場合を示している。The vector V _k Q obtained in this way
(K = 1 to 8) is shown in FIG. FIG.
(A) shows a case where the magnitudes of the vectors V ₄ Q and V ₈ Q are zero.

【０１２２】つぎに、各ステレオ対ｋ毎に求められた、
類似度の逆数の大きさが加味されたベクトルＶ₁Ｑ、Ｖ₂
Ｑ、…が、下記（５−１）、（５−２）式に示すよう
に、加算されることで、局所オクルージョンベクトルＶ
Ｑ(ＱOX ( i,j,ｚ_n) , ＱOY(i,j,ｚ_n) )が求められ
る。Next, for each stereo pair k,
Vectors V ₁ Q and V ₂ to which the magnitude of the reciprocal of similarity is added
.. Are added as shown in the following equations (5-1) and (5-2), whereby the local occlusion vector V
Q (QOX (i, j, z n), QOY (i, j, z n)) is obtained.

【０１２３】すなわち、図６（ｂ）に示すように、このベクトルＶＱ
の方向が、オクルージョンが発生している可能性の高い
ステレオ対ｋを指し示しており、このベクトルＶＱの大
きさが、オクルージョンが発生している可能性の高さ
（発生度合い）を示している。[0123] That is, as shown in FIG.
Indicates the stereo pair k in which the possibility of occurrence of occlusion is high, and the magnitude of the vector VQ indicates the height (degree of occurrence) of the possibility of occurrence of occlusion.

【０１２４】一方において、各ステレオ対ｋ毎に演算さ
れた類似度の逆数Ｑad( i,j,k,ｚ_n)に基づき、基準画像
センサ１の選択画素Ｐ₁の画像情報と、他の全ての画像
センサ２〜９の各対応候補点Ｐ₂〜Ｐ₉の各画像情報とを
突き合わせた全体的な類似度の逆数Ｑsad ( i,j,ｚ_n)
が、下記（４）式によって求められる。On the other hand, based on the reciprocal Qad (i, j, k, z _n ) of the similarity calculated for each stereo pair k, the image information of the selected pixel P ₁ of the reference image sensor 1 and all other The reciprocal Qsad (i, j, z _n ) of the overall similarity that is matched with the image information of each of the corresponding candidate points P _{2 to} P ₉ of the image sensors 2 to 9
Is obtained by the following equation (4).

【０１２５】ここで、Ｑsad ( i,j,ｚ_n)は、オクルージョンが発生し
ていないと仮定した場合の全体的な類似度の逆数を表し
ている。[0125] Here, Qsad (i, j, z _n ) represents the reciprocal of the overall similarity on the assumption that no occlusion has occurred.

【０１２６】・オクルージョンベクトルの安定化つぎに、特徴ベクトルの安定化部４１３では、上述する
ようにして求められた局所オクルージョンベクトルＶＱ
(ＱOX ( i,j,ｚ_n) , ＱOY (i,j,ｚ_n ) )を時間的にあ
るいは空間的に安定化させた大局オクルージョンベクト
ルＳＶＱ(ＱSOX(i,j,ｚ_n) , ＱSOY ( i,j,ｚ_n) )が求
められる。Next, the feature vector stabilizing section 413 determines whether the local occlusion vector VQ obtained as described above is obtained.
(QOX (i, j, z n), QOY (i, j, z n)) temporally or spatially global occlusion vector SVQ which stabilized (QSOX (i, j, z n), QSOY ( i, j, z _n )).

【０１２７】本実施形態では、空間的に安定化させる場
合を想定している。In the present embodiment, it is assumed that the space is stabilized.

【０１２８】すなわち、下記式（６−２）、（６−３）
に示す演算を行うことで、大局オクルージョンベクトル
ＳＶＱ( ＱSOX ( i,j,ｚ_n), ＱSOY ( i,j,ｚ_n) )が求め
られる。この演算は、所定のウインドウ領域Ｗ_ijを設定
して、このウインドウ領域内の局所オクルージョンベク
トルＶＱを加算することで、オクルージョンの情報を安
定化させた大局オクルージョンベクトルＳＶＱを求める
というものである。That is, the following formulas (6-2) and (6-3)
By performing the calculation shown in, global occlusion vector SVQ (QSOX (i, j, z n), QSOY (i, j, z n)) is obtained. In this calculation, a predetermined window area W _ij is set, and a local occlusion vector VQ in the window area is added to obtain a global occlusion vector SVQ in which the occlusion information is stabilized.

【０１２９】同時に、下記式（６−１）に示す演算を行
うことで、上記全体類似度の逆数Ｑsad(i,j,ｚ_n)を空間
的に安定化させた安定化全体類似度の逆数Ｑssad( i,j,
ｚ_n)が求められる。At the same time, the reciprocal of the stabilized overall similarity Qsad (i, j, z _n ) obtained by spatially stabilizing the reciprocal Qsad (i, j, z _n ) of the overall similarity by performing the operation shown in the following equation (6-1). Qssad (i, j,
z _n ) is determined.

【０１３０】Ｑssad( i,j,ｚ_n) ＝Σp，q W_i,j Ｑsad (p,q,ｚ_n) …（６−１）ＱSOX ( i,j,ｚ_n) ＝Σp，q W_i,j ＱOX (p,q,ｚ_n) …（６−２）ＱSOY ( i,j,ｚ_n) ＝Σp，q W_i,j ＱOY (p,q,ｚ_n) …（６−３）ここで、W_i,j ：パターンマッチングを行う基準画像セ
ンサ１の画素i,jの周りのウインドウ領域 p,q ：パターンマッチングを行うウインドウ領域Ｗ_i,j
内の各画素の位置なお、本実施形態では、空間的に安定化する場合を想定
しているが、時間的に安定化させてもよく、距離空間的
に安定化させてもよい。[0130] _{Qssad (i, j, z n} ) = Σp, q W i, j Qsad (p, q, z n) ... (6-1) QSOX (i, j, z n) = Σp, q W i _{, j QOX (p, q,} z n) ... (6-2) QSOY (i, j, z n) = Σp, q W i, j QOY (p, q, z n) ... (6-3) where W _{i, j} : window area around pixel i, j of reference image sensor 1 for performing pattern matching p, q: window area W _{i, j} for performing pattern matching
In this embodiment, it is assumed that spatial stabilization is performed. However, temporal stabilization may be performed, and distance stabilization may be performed.

【０１３１】・補正値類似度の算出つぎに、補正類似度算出部４１４で行われる処理につい
て説明する。Calculation of Correction Value Similarity Next, processing performed by the correction similarity calculation unit 414 will be described.

【０１３２】この補正類似度算出部４１４では、上述す
るようにして求められた安定化全体類似度の逆数Ｑssad
( i,j,ｚ_n) が、大局オクルージョンベクトルＳＶＱ
（ＱSOX(i,j,ｚ_n) ，ＱSOY ( i,j,ｚ_n) ）を用いて補正
され、補正された安定化全体類似度の逆数Ｑ’ssad (i,
j,ｚ_n) が求められる。In the corrected similarity calculating section 414, the reciprocal Qssad of the stabilized overall similarity obtained as described above is obtained.
(i, j, z _n ) is the global occlusion vector SVQ
(QSOX (i, j, z n), QSOY (i, j, z n)) is corrected using a corrected stabilized whole similarity reciprocal Q'ssad (i,
j, z _n ).

【０１３３】次式（７）は、この補正演算の一例を示す
式である。The following equation (7) is an equation showing an example of this correction operation.

【０１３４】Ｑ’ssad( i,j,ｚ_n)＝Ｑssad( i,j,ｚ_n)−ｋ₁×｛ＱSOX ( i,j,ｚ_n)² ＋ＱSOY ( i,j,ｚ_n)²｝^1/2 …（７）なお、ｋ₁は正の定数であるとする。[0134] _{Q'ssad (i, j, z n} ) = Qssad (i, j, z n) -k 1 × {QSOX (i, j, z n) 2 + QSOY (i, j, z n) 2} ^1/2 (7) where the k ₁ is a positive constant.

【０１３５】この（７）式に示されるように、大局オク
ルージョンベクトルＳＶＱのスカラ量が大きいほど、補
正量は大きくなる。As shown in the equation (7), the larger the scalar amount of the global occlusion vector SVQ, the larger the correction amount.

【０１３６】また、安定化全体類似度の逆数Ｑssad( i,
j,ｚ_n) の大きさと、大局オクルージョンベクトルＳＶ
Ｑ（ＱSOX ( i,j,ｚ_n) ，ＱSOY ( i,j,ｚ_n)）の大きさ
を比較して、オクルージョン発生の有無を判断し、この
結果、オクルージョンが発生していると判断された場合
のみに、補正演算を行う方法が考えられる。Also, the inverse Qssad (i,
j, z _n ) and the global occlusion vector SV
Q (QSOX (i, j, z n), QSOY (i, j, z n)) by comparing the magnitude of, and determine the presence or absence of occlusion occurs, as a result, it is determined that occlusion has occurred Only when this occurs, a method of performing a correction operation is conceivable.

【０１３７】すなわち、下記（８−１）式にしたがい、
まず、オクルージョン発生の有無が判断される。That is, according to the following equation (8-1),
First, it is determined whether or not occlusion has occurred.

【０１３８】Ｑssad( i,j,ｚ_n)−ｋ₂×｛ＱSOX ( i,j,ｚ_n)² ＋ＱSOY ( i,j,ｚ_n)²｝^1/2 ＜ε …（８−１）ただし、ｋ₂、εは、正の定数である。[0138] _{Qssad (i, j, z n} ) -k 2 × {QSOX (i, j, z n) 2 + QSOY (i, j, z n) 2} 1/2 <ε ... (8-1) provided that , K ₂ and ε are positive constants.

【０１３９】上記（８−１）式が満たされたとき、つま
り安定化全体類似度の逆数Ｑssad(i,j,ｚ_n) の大きさに
対して、大局オクルージョンベクトルＳＶＱのスカラ量
が相対的に大きくなっているときに、オクルージョンが
発生したと判断するものである。When the above equation (8-1) is satisfied, that is, the scalar amount of the global occlusion vector SVQ is relative to the magnitude of the inverse Qssad (i, j, z _n ) of the stabilized overall similarity. Is determined to be occlusion has occurred.

【０１４０】そして、オクルージョンが発生したと判断
された場合には、上記（７）と同様にして、下記（８−
２）式による補正演算が実行される。If it is determined that occlusion has occurred, the following (8-
The correction calculation based on the expression 2) is performed.

【０１４１】Ｑ’ssad( i,j,ｚ_n)＝Ｑssad( i,j,ｚ_n)−ｋ₂×｛ＱSOX ( i,j,ｚ_n)² ＋ＱSOY ( i,j,ｚ_n)²｝^1/2 …（８−２）一方、上記（８−１）式が満たされないとき、つまり安
定化全体類似度の逆数Ｑssad( i,j,ｚ_n) の大きさに対
して、大局オクルージョンベクトルＳＶＱのスカラ量が
相対的に小さくなっているときには、オクルージョンは
発生していないと判断される。[0141] _{Q'ssad (i, j, z n} ) = Qssad (i, j, z n) -k 2 × {QSOX (i, j, z n) 2 + QSOY (i, j, z n) 2} ^1/2 (8-2) On the other hand, the (8-1) when the equation is not satisfied, that is stabilized whole similarity reciprocal _{Qssad (i, j, z n} ) with respect to the size of global occlusion vector When the SVQ scalar amount is relatively small, it is determined that occlusion has not occurred.

【０１４２】そして、オクルージョンが発生していない
と判断された場合には、下記（８−３）式に示すよう
に、安定化全体類似度の逆数Ｑssad( i,j,ｚ_n) をその
まま最終的な安定化全体類似度の逆数Ｑssad( i,j,ｚ_n)
として出力する。If it is determined that occlusion has not occurred, the reciprocal Qssad (i, j, z _n ) of the stabilized overall similarity is finally used as shown in the following equation (8-3). Inverse Qssad (i, j, z _n )
Output as

【０１４３】Ｑ’ssad (i,j,ｚ_n)＝Ｑssad( i,j,ｚ_n) …（８−３）・距離推定最後に、距離推定部４１５では、図７に示すように、仮
定距離ｚ_nと、補正された安定化全体類似度の逆数Ｑ’
ssad ( i,j,ｚ_n)との対応関係（破線にて示す）が求め
られ、この対応関係から、最も類似度が高くなる点（類
似度の逆数が最小値となる点）が判別され、この最も類
似度が高くなっている点に対応する仮定距離ｚ_nxが最終
的に、認識対象物体５０上の点５０ａまでの真の距離
（最も確からしい距離）と推定される。かかる処理は、
基準画像＃１の各選択画素毎に全画素について行われ
る。Q′ssad (i, j, z _n ) = Qssad (i, j, z _n ) (8-3) Distance Estimation Finally, the distance estimating unit 415 sets the assumption as shown in FIG. The distance z _n and the inverse Q ′ of the corrected stabilized overall similarity
A correspondence relationship (shown by a broken line) with ssad (i, j, z _n ) is obtained, and from this correspondence relationship, a point having the highest similarity (a point at which the reciprocal of the similarity has the minimum value) is determined. The hypothetical distance z _nx corresponding to the point with the highest similarity is finally estimated as the true distance (the most likely distance) to the point 50 a on the recognition target object 50. Such processing is
This is performed for all pixels for each selected pixel of the reference image # 1.

【０１４４】図７において、実線で示す対応関係は、従
来のオクルージョンを考慮した補正を行わなかった場合
に得られる、仮定距離ｚ_nと、安定化全体類似度の逆数
Ｑssad(i,j,ｚ_n)との対応関係である。同図７で破線と
実線を比較すると、従来のオクルージョンを考慮した補
正を行わなかった場合には、真の距離ｚ_nxで類似度の逆
数は最小値になっておらず、真の距離ｚ_nxを正確に捕ら
えることができない虞があるが、本実施形態のように、
オクルージョンを考慮した補正を行った場合には、真の
距離ｚ_nxで類似度の逆数は最小値になっており、真の距
離ｚ_nxを正確に捕らえることができるのが分かる。この
ように、本実施形態によれば、オクルージョンが発生し
た場合を考慮した補正を行うことで、真の距離ｚ_nxを正
確に捕らえることができ、精度のよい距離推定が行える
ようになる。In FIG. 7, the correspondence indicated by the solid line is a hypothetical distance z _n obtained when no correction is performed in consideration of the conventional occlusion, and a reciprocal Qssad (i, j, z) of the stabilized overall similarity. _n ). Comparing the dashed line with the solid line in FIG. 7, when the conventional occlusion is not corrected, the reciprocal of the similarity at the true distance z _nx is not the minimum value, and the true distance z _nx May not be accurately captured, but as in this embodiment,
When performing correction in consideration of the occlusion, the reciprocal of the similarity true distance z _nx is at the minimum value, it can be seen that it is possible to capture accurately the true distance z _nx. As described above, according to the present embodiment, by performing correction in consideration of the case where occlusion has occurred, the true distance z _nx can be accurately captured, and accurate distance estimation can be performed.

【０１４５】また、本実施形態によれば、オクルージョ
ンが発生した場合を考慮した精度のよい距離推定が行え
るようになるのはもちろんのこと、従来の図２１に示す
構成のように、オクルージョンの影響を除去するため
に、各オクルージョンパターン毎に、全体的な安定化類
似度を求める処理は必要なく、前もって、安定化全体類
似度の逆数Ｑssad( i,j,ｚ_n)を１つだけ求めておくだけ
で済み、計算量が大幅に低減される。Further, according to the present embodiment, it is possible not only to perform accurate distance estimation in consideration of the case where occlusion has occurred, but also to consider the effect of occlusion as in the conventional configuration shown in FIG. It is not necessary to perform the process of calculating the overall stabilized similarity for each occlusion pattern in order to eliminate the following. Only one reciprocal Qssad (i, j, z _n ) of the stabilized overall similarity is calculated in advance. It is only necessary to keep it, and the amount of calculation is greatly reduced.

【０１４６】しかも、この安定化全体類似度の逆数Ｑss
ad( i,j,ｚ_n)を求めるには、従来のように、各ステレオ
対毎に、ウインド内加算を行うことによって安定化類似
度を求め、最後に、各ステレオ対毎に求めた安定化類似
度を加算して全体的な安定化類似度を求めるという繁雑
な演算をしなくてもよい。In addition, the reciprocal Qss of this stabilized overall similarity
To obtain ad (i, j, z _n ), as in the conventional method, the in-window addition is performed for each stereo pair to obtain the stabilized similarity, and finally, the stability obtained for each stereo pair is obtained. It is not necessary to perform the complicated calculation of adding the generalized similarity to obtain the overall stabilized similarity.

【０１４７】つまり、ステレオ対毎にウインド内加算を
することなく、全体的な類似度の逆数Ｑsad( i,j,ｚ_n)
を演算しておき（上記（４）式参照）、これだけを安定
化させた安定化全体類似度Ｑssad( i,j,ｚ_n)を求めるだ
けの簡易な演算で済む（上記（６−１）式参照）。That is, the reciprocal Qsad (i, j, z _n ) of the overall similarity is calculated without performing in-window addition for each stereo pair.
(See the above equation (4)), and only a simple operation of obtaining the stabilized overall similarity Qssad (i, j, z _n ) by stabilizing this (see the above (6-1)) See formula).

【０１４８】よって、多眼ステレオ装置において、扱う
データ量や処理量を少なくすることができるとともに、
ハードウェア資源を減らすことができる。このことが
可能であるのは、従来の多眼ステレオ装置は、２眼ステ
レオの方式を複数組み合わせた構成になっているのに対
して、本実施形態の基本的な処理の流れが、１）画像データを早い段階でステレオ処理に必要かつ十
分な情報（特徴ベクトル）へと圧縮する、２）特徴ベクトルに対し、適切な安定化処理を実行す
る、３）特徴ベクトルの情報とベクトル間の関係などを利用
して、最終的に目的とする３次元空間の情報を得る、の３段階になっているからで
ある。Therefore, in the multi-view stereo apparatus, the amount of data to be handled and the amount of processing can be reduced.
Hardware resources can be reduced. This is possible because the conventional multi-view stereo apparatus has a configuration in which a plurality of binocular stereo systems are combined, whereas the basic processing flow of the present embodiment is as follows: Compress image data into information (feature vectors) necessary and sufficient for stereo processing at an early stage. 2) Perform appropriate stabilization processing on feature vectors. 3) Relationship between feature vector information and vectors. This is because there are three stages of finally obtaining the information of the target three-dimensional space by using such a method.

【０１４９】画像処理とは、２次元的に広がった膨大な
データの中から、必要とする情報（圧縮された少ない量
の情報）を抜き出すことである。たとえば、リンゴを写
した画像から、リンゴがあることを認識して、リンゴが
存在するという情報を得たとする。この場合、結果とし
て得られた情報は、元の画像データに比べ、極めて圧縮
された情報になっている。この場合、画像処理の初期の
段階で、リンゴが存在するかどうかを判断する為に必要
十分な情報が得られれば、後の処理はこの情報のみを利
用すればよいことになり、処理内容を簡略化でき、保持
する情報量も少なくなる。Image processing refers to extracting necessary information (a small amount of compressed information) from a huge amount of data that has spread two-dimensionally. For example, suppose that an apple is recognized from an image of an apple, and information that an apple is present is obtained. In this case, the resulting information is extremely compressed information compared to the original image data. In this case, if sufficient information necessary to determine whether an apple is present can be obtained at the initial stage of the image processing, the subsequent processing only needs to use this information. It can be simplified and the amount of information to be held is reduced.

【０１５０】これに対して、従来装置のように、最終的
な判断を行う直前まで、画像情報と同程度の量の情報を
持ち回る様な画像処理の場合には、処理や保持する情報
量は大規模になる。On the other hand, in the case of image processing in which the same amount of information as the image information is carried out until immediately before the final determination as in the conventional apparatus, the amount of information to be processed and held is Will be large.

【０１５１】画像処理を行う場合、元の画像から如何に
早く、的確に必要不可欠な情報に圧縮していくかが重要
になる。When performing image processing, it is important how to quickly and accurately compress the original image into essential information.

【０１５２】元の画像から早く、的確に必要不可欠な情
報に圧縮することが可能になれば、処理内容の重複を排
除でき、効率のよい処理が実現される。また、早い段階
で情報の冗長性を排除すれば、不要な情報を保持する必
要がない。If it becomes possible to quickly and accurately compress necessary information from an original image, duplication of processing contents can be eliminated, and efficient processing can be realized. In addition, if information redundancy is eliminated at an early stage, it is not necessary to hold unnecessary information.

【０１５３】このことは、多眼ステレオ処理についても
同様である。The same applies to multi-view stereo processing.

【０１５４】各ステレオ対に対して、同様のステレオ処
理を実施してから、再度その結果を使って距離を推定す
るという従来の方法には、処理内容に冗長性がある。The conventional method of performing the same stereo processing for each stereo pair and then estimating the distance using the result again has redundancy in the processing contents.

【０１５５】これに対して処理全体を一つのステレオ処
理と考えて、初期の段階で３次元空間の情報を得るため
に必要となる特徴ベクトルに圧縮しておけば、全体とし
て冗長性が少なく、かつ各画像情報を有効に利用したス
テレオ処理を実行することができる。On the other hand, if the entire processing is considered as one stereo processing and is compressed into a feature vector necessary for obtaining information of the three-dimensional space at an initial stage, the redundancy as a whole is small, In addition, stereo processing that effectively uses each image information can be executed.

【０１５６】従来の多眼ステレオ装置では、画像センサ
の数が増えると、それに比例して処理内容と中間処理情
報が増加する。これに対して、本実施形態では、特徴ベ
クトル生成のための入力情報が増加するだけで、圧縮結
果としての特徴ベクトルの情報量は常に同じになる。従
って処理全体が簡潔になり、画像センサ数の増加に柔軟
かつ、的確に対応することができる。このことは、複数
の画像センサから得られる情報を余すところなく利用
し、目的とする３次元空間情報を得ることができること
を意味する。In the conventional multi-view stereo apparatus, as the number of image sensors increases, the processing content and intermediate processing information increase in proportion thereto. On the other hand, in the present embodiment, the information amount of the feature vector as the compression result is always the same, only by increasing the input information for generating the feature vector. Therefore, the entire processing is simplified, and it is possible to flexibly and accurately cope with an increase in the number of image sensors. This means that the desired three-dimensional spatial information can be obtained by making full use of information obtained from a plurality of image sensors.

【０１５７】なお、本実施形態では、オクルージョンの
除去を目的として特徴ベクトル（オクルージョンベクト
ル）を生成しているが、これ以外の目的の下に特徴ベク
トルを生成して、これに基づき類似度を補正してもよ
い。たとえば、画像センサの画面から物体がはみ出して
撮影されるという「はみだし」を考慮した補正を行うた
めに、特徴ベクトルを生成し、これに基づき類似度を補
正してもよい。In the present embodiment, a feature vector (occlusion vector) is generated for the purpose of removing occlusion. However, a feature vector is generated for other purposes, and the similarity is corrected based on the generated feature vector. May be. For example, a feature vector may be generated, and the similarity may be corrected based on the feature vector in order to perform correction in consideration of “protrusion” in which an object protrudes from the image sensor screen and is shot.

【０１５８】すなわち、特徴ベクトルが持つべき情報
は、目的とする処理内容によって異なる。従来の多眼ス
テレオ装置では、各ステレオ対の類似度を求めている
が、本実施形態では、単なる個別の類似度を、特徴ベク
トルの要素としているのではなく、目的に即した形でベ
クトル情報を持っている点が異なる。That is, the information that the feature vector should have differs depending on the target processing content. In the conventional multi-view stereo apparatus, the similarity of each stereo pair is obtained. However, in the present embodiment, the individual individual similarities are not used as the elements of the feature vector, but are vector information in a form suitable for the purpose. Is different.

【０１５９】本実施形態のように、オクルージョンを考
慮したステレオ画像処理を行う場合には、オクルージョ
ンの強度と方向が、特徴ベクトルの要素となる。When stereo image processing is performed in consideration of occlusion as in the present embodiment, the intensity and direction of occlusion are elements of a feature vector.

【０１６０】一方、画像センサの画面から物体がはみ出
して撮影されるという「はみだし」を考慮した補正を行
うためには、「はみだし」の強度と方向が、特徴ベクト
ルの要素となる。On the other hand, in order to perform the correction in consideration of the "protrusion" that an object protrudes from the screen of the image sensor and is taken, the strength and direction of the "protrusion" are elements of the feature vector.

【０１６１】また、本実施形態では、（６−２）、（６
−３）式により安定化処理を行うようにしているが、こ
れに限定されるわけではない。In this embodiment, (6-2), (6)
Although the stabilization processing is performed according to the equation -3), the present invention is not limited to this.

【０１６２】安定化処理は、特徴ベクトルを、空間的、
時間的に平滑化する処理である。In the stabilizing process, the feature vector is spatially and
This is a process for smoothing temporally.

【０１６３】特徴ベクトルは、各画素毎に計算される情
報であるため、局所的なノイズの影響を受けることがあ
る。また、全体の傾向にそって、個別の特徴ベクトルを
補正したり、一定領域における傾向を表現した特徴ベク
トルを求める必要がある。このような場合に、一定の近
傍領域での部分和などを求めることによって平滑化を行
い、安定化した特徴ベクトルを得ようとするものであ
る。Since the feature vector is information calculated for each pixel, it may be affected by local noise. In addition, it is necessary to correct individual feature vectors according to the overall tendency or to find a feature vector expressing the tendency in a certain area. In such a case, smoothing is performed by obtaining a partial sum or the like in a certain neighborhood area to obtain a stabilized feature vector.

【０１６４】安定化を行うことにより、各画素毎の特徴
情報を全体の傾向や、処理内容の特性にあわせて調整す
ることができる。これにより、特徴ベクトルを使用する
最終的な処理をより円滑に、かつ効果的に行うことがで
きる。By performing the stabilization, the characteristic information of each pixel can be adjusted according to the overall tendency and the characteristics of the processing contents. As a result, final processing using the feature vector can be performed more smoothly and effectively.

【０１６５】よって、安定化の方法としては、ウインド
ウ内加算、空間平滑化の他に、時間軸で特徴ベクトルを
加算していく方法や、連続性や、特徴ベクトルの特性か
ら得られる拘束条件を利用して安定化する方法などを使
用してもよい。これら安定化処理は、周りの状況に合わ
せて、特徴ベクトル自身の情報内容を変更する処理であ
る。Therefore, as a method of stabilization, in addition to in-window addition and spatial smoothing, a method of adding feature vectors on the time axis, continuity, and a constraint condition obtained from the characteristics of feature vectors are used. For example, a method of stabilizing by utilizing may be used. These stabilization processes are processes for changing the information content of the feature vector itself according to the surrounding situation.

【０１６６】また、安定化処理は、単に平滑化を行うの
ではなく、最終的な処理の目的に応じて安定化処理の内
容を調整するところが特徴である。したがって、特徴ベ
クトルそれ自身の情報や、その他の情報を利用して安定
化の強さや、安定化処理の処理内容を調整することも可
能である。Further, the stabilization processing is characterized in that the content of the stabilization processing is adjusted according to the purpose of the final processing, rather than simply performing smoothing. Therefore, it is also possible to adjust the strength of stabilization and the processing content of the stabilization processing by using the information of the feature vector itself and other information.

【０１６７】さて、本実施形態によれば、補正値ＱOX(
i,j,ｚ_n)、ＱOY( i,j,ｚ_n)を求め、これを安定化させた
安定化補正値ＱSOX( i,j,ｚ_n)、ＱSOY( i,j,ｚ_n)によっ
て、全体的な安定化類似度の逆数Ｑssad( i,j,ｚ_n)を補
正するようにしているので、全体的な安定化類似度の逆
数Ｑssad( i,j,ｚ_n)を求めることができさえすれば、オ
クルージョンが発生した場合を考慮した精度のよい距離
推定が行われることになる。According to the present embodiment, the correction value QOX (
i, j, z _n ) and QOY (i, j, z _n ) and stabilizing them, and using the stabilized correction values QSOX (i, j, z _n ) and QSOY (i, j, z _n ) , the reciprocal of the overall stability of similarity _{Qssad (i, j, z n} ) because so as to correct the reciprocal of the overall stabilization similarity _{Qssad (i, j, z n} ) be obtained If possible, accurate distance estimation will be performed in consideration of the case where occlusion has occurred.

【０１６８】つまり、図２２に示す構成のように、ステ
レオ対毎にウインド内加算をすることなく、単に類似度
の逆数Ｑad( i,j,ｚ_n)を加算するだけで全体的な類似度
Ｑsad(i,j,ｚ_n)を求めておき（上記（４）式参照）、こ
れを安定化させた安定化全体類似度の逆数Ｑssad( i,j,
ｚ_n)を求めるだけの簡易な演算で済ませるようにした
（上記（６−１）式参照）本実施形態装置の構成に限定
されるわけではない。That is, as in the configuration shown in FIG. 22, the overall similarity is calculated by simply adding the reciprocal Qad (i, j, z _n ) of the similarity without performing in-window addition for each stereo pair. Qsad (i, j, z _n ) is obtained (see the above equation (4)), and the reciprocal Qssad (i, j,
z _n ) is simply calculated (see equation (6-1)), and is not limited to the configuration of the apparatus of the present embodiment.

【０１６９】従来の図１８に示す多眼ステレオ装置のよ
うに、各ステレオ対毎に、ウインド内加算を行うことに
よって安定化類似度を求め、最後に、各ステレオ対毎に
求めた安定化類似度を加算して全体的な安定化類似度Ｑ
ssad( i,j,ｚ_n)を求めるという繁雑な演算を行う必要が
ある構成のものに対しても適用可能である。As in the conventional multi-view stereo apparatus shown in FIG. 18, a stabilized similarity is obtained by performing in-window addition for each stereo pair, and finally, a stabilized similarity obtained for each stereo pair. Add the degree and the overall stabilized similarity Q
The present invention can also be applied to a configuration that requires a complicated operation of obtaining ssad (i, j, z _n ).

【０１７０】図１８に示す多眼ステレオ装置の最終的な
類似度算出部２１５では、オクルージョンを何ら考慮し
ない全体的な安定化類似度の逆数Ｑssad( i,j,ｚ_n)が求
められる。ここで、大局オクルージョンベクトルＳＶＱ
に基づき補正値ＱOX(i,j,ｚ_n)、ＱOY( i,j,ｚ_n)を別途
求めておけば、この補正値ＱOX(i,j,ｚ_n)、ＱOY( i,j,
ｚ_n)によって、最終的な類似度算出部２１５で算出され
た全体的な安定化類似度の逆数Ｑssad( i,j,ｚ_n)を、た
とえば上記（７）式のようにして補正することが可能で
ある。この結果、オクルージョンが発生した場合を考慮
した精度のよい距離推定が行えるようになる。The final similarity calculating section 215 of the multi-view stereo apparatus shown in FIG. 18 obtains the reciprocal Qssad (i, j, z _n ) of the overall stabilized similarity without considering any occlusion. Here, the global occlusion vector SVQ
Based on the correction value _{QOX (i, j, z n} ), QOY (i, j, z n) if separately sought, this correction value _{QOX (i, j, z n} ), QOY (i, j,
z _n ), the reciprocal Qssad (i, j, z _n ) of the overall stabilized similarity calculated by the final similarity calculation unit 215 is corrected, for example, as in the above equation (7). Is possible. As a result, accurate distance estimation can be performed in consideration of the case where occlusion has occurred.

【０１７１】つぎに、基準画像＃１の特徴に基づき、よ
り精度のよい距離推定を行うことができる実施形態につ
いて説明する。Next, an embodiment in which more accurate distance estimation can be performed based on the features of the reference image # 1 will be described.

【０１７２】この実施例では、基準画像センサ１から得
られた基準画像＃１の各画素毎に、明度の変化率、エッ
ジの方向などの２次元的な画像の特徴を抽出し、この抽
出した特徴に応じて、図２２に示す特徴ベクトルの安定
化部４１３、補正類似度算出部４１４、距離推定部４１
５の演算処理を制御することで、より精度のよい距離推
定を行うものである。In this embodiment, for each pixel of the reference image # 1 obtained from the reference image sensor 1, features of a two-dimensional image such as a rate of change in brightness and a direction of an edge are extracted and extracted. According to the features, a feature vector stabilizing unit 413, a correction similarity calculating unit 414, and a distance estimating unit 41 shown in FIG.
By controlling the arithmetic processing of No. 5, more accurate distance estimation is performed.

【０１７３】図８に、この実施形態の多眼ステレオ装置
の構成を示す。FIG. 8 shows the configuration of the multi-view stereo apparatus of this embodiment.

【０１７４】同図に示すように、基準画像入力部５０１
および画像入力部５０２、５０３、５０４は、図２２に
示す基準画像入力部４０１および画像入力部４０２、４
０３、４０４と同一であり、対応候補点座標発生部５０
５は、図１に示す対応候補点座標発生部４０５と同一で
あり、局所情報抽出部５０６、５０７、５０８は、図２
２に示す局所情報抽出部４０６、４０７、４０８と同一
であり、類似度算出部５０９、５１０、５１１は、図２
２に示す類似度算出部４０９、４１０、４１１と同一で
あり、特徴ベクトルの生成部５１２は、図２２に示す特
徴ベクトルの生成部４１２と同一であり、特徴ベクトル
の安定化部５１３は、図２２に示す特徴ベクトルの安定
化部４１３と同一であり、補正類似度算出部５１４は、
図２２に示す補正類似度算出部４１４と同一であり、距
離推定部５１５は、図２２に示す距離推定部４１５と同
一であり、既にした説明と重複するので、これらの説明
は省略する。As shown in the figure, the reference image input unit 501
The image input units 502, 503, and 504 include a reference image input unit 401 and image input units 402, 4 shown in FIG.
03, 404, and the corresponding candidate point coordinate generation unit 50
5 is the same as the corresponding candidate point coordinate generation unit 405 shown in FIG. 1, and the local information extraction units 506, 507, 508
2 is the same as the local information extraction units 406, 407, and 408 shown in FIG.
2 are the same as the similarity calculation units 409, 410, and 411 shown in FIG. 2, the feature vector generation unit 512 is the same as the feature vector generation unit 412 shown in FIG. 22 is the same as the feature vector stabilizing unit 413 shown in FIG.
The same as the correction similarity calculating section 414 shown in FIG. 22, and the distance estimating section 515 is the same as the distance estimating section 415 shown in FIG. 22, and the description already described is omitted.

【０１７５】基準画像の２次元的特徴抽出部５１６で
は、基準画像センサ１で得られた基準画像＃１の各選択
画素毎に２次元的特徴が抽出される。なお、この基準画
像＃１の特徴は、対象とする画素単独の情報を用いても
よく、また、対象とする画素の周辺の情報に対して、空
間的な安定化を施したものを用いてもよい。The reference image two-dimensional feature extraction unit 516 extracts a two-dimensional feature for each selected pixel of the reference image # 1 obtained by the reference image sensor 1. Note that the feature of the reference image # 1 may use information of the target pixel alone, or may use information obtained by performing spatial stabilization on information around the target pixel. Is also good.

【０１７６】特徴ベクトル生成の制御部５１７は、特徴
抽出部５１６で抽出された基準画像＃１の２次元的特徴
に応じて、特徴ベクトルの生成部５１２で行われる特徴
ベクトル生成の処理を制御するものである。The feature vector generation control unit 517 controls the feature vector generation processing performed by the feature vector generation unit 512 according to the two-dimensional features of the reference image # 1 extracted by the feature extraction unit 516. Things.

【０１７７】特徴ベクトル安定化の制御部５１８は、特
徴抽出部５１６で抽出された基準画像＃１の２次元的特
徴に応じて、特徴ベクトルの安定化部５１３で行われる
安定化の処理を制御するものである。すなわち、特徴ベ
クトルＶＱの安定化処理（（６−２）、（６−３）式）
あるいは全体類似度の逆数Ｑsad ( i,j,ｚ_n)の安定化処
理（（６−１）式）に用いられるウインドウ領域Ｗ_i,j
（以下、ウインドウＷＤという）のサイズ、形状を変化
させる処理が行われる。The feature vector stabilization control unit 518 controls the stabilization processing performed by the feature vector stabilization unit 513 according to the two-dimensional feature of the reference image # 1 extracted by the feature extraction unit 516. Is what you do. That is, the stabilization processing of the feature vector VQ (Equations (6-2) and (6-3))
Alternatively, the window area W _{i, j} used for the stabilization process (Equation (6-1)) of the reciprocal Qsad (i, j, z _n ) of the overall similarity
Processing for changing the size and shape of the window (hereinafter referred to as a window WD) is performed.

【０１７８】補正類似度算出の制御部５１９は、特徴抽
出部５１６で抽出された基準画像＃１の２次元的特徴に
応じて、補正類似度の算出部５１４で行われる補正類似
度の算出処理を制御するものである。The correction similarity calculation control unit 519 performs a correction similarity calculation process performed by the correction similarity calculation unit 514 according to the two-dimensional feature of the reference image # 1 extracted by the feature extraction unit 516. Is controlled.

【０１７９】距離推定の制御部５２０は、特徴抽出部５
１６で抽出された基準画像＃１の２次元的特徴に応じ
て、距離推定部５１５で行われる距離推定の処理を制御
するものである。The control unit 520 for distance estimation includes the feature extraction unit 5
The processing of the distance estimation performed by the distance estimating unit 515 is controlled in accordance with the two-dimensional characteristics of the reference image # 1 extracted in step S16.

【０１８０】つぎに、特徴ベクトル安定化の制御部５１
８で行われる内容について図９、図１０を併せ参照して
説明する。Next, the controller 51 for stabilizing the feature vector
8 will be described with reference to FIGS. 9 and 10. FIG.

【０１８１】・基準画像＃１の２次元的特徴としてエッ
ジの方向を抽出する場合特徴抽出部５１６で、基準画像＃１の２次元的特徴とし
て、エッジ５１の方向が抽出された場合を想定する。When the direction of the edge is extracted as the two-dimensional feature of the reference image # 1 It is assumed that the feature extraction unit 516 extracts the direction of the edge 51 as the two-dimensional feature of the reference image # 1. .

【０１８２】いま、基準画像＃１の選択画素Ｐ１の情報
から「エッジの方向」が抽出されたものとすると、特徴
ベクトルＶＱあるいは全体類似度の逆数Ｑsad ( i,j,ｚ
_n)を安定化させるために用いられるウインドウＷＤの形
状が、通常の正方形の形状（図９（ａ）参照）から、図
９（ｂ）に示すように、抽出されたエッジ５１の方向に
沿った形状のウインドウＷＤＸに変更される。Now, assuming that the “edge direction” is extracted from the information of the selected pixel P1 of the reference image # 1, the feature vector VQ or the reciprocal Qsad (i, j, z) of the overall similarity is assumed.
The shape of the window WD used to stabilize _n ) changes from a normal square shape (see FIG. 9A) to the direction of the extracted edge 51 as shown in FIG. 9B. The window WDX is changed to the window WDX having the changed shape.

【０１８３】ここに、特徴ベクトルＶＱあるいは全体類
似度の逆数Ｑsad ( i,j,ｚ_n)を安定化させるために用い
られるウインドウＷＤの形状を、正方形などに一義的に
設定した場合には、つぎのような問題が生ずる。Here, when the shape of the window WD used to stabilize the feature vector VQ or the reciprocal Qsad (i, j, z _n ) of the overall similarity is uniquely set to a square or the like, The following problems arise.

【０１８４】すなわち、図９（ａ）に示すように、基準
画像＃１中のエッジ５１が顕著な場合には、その近辺に
物体５０の縁が存在する可能性がある。この場合に一義
的な正方形のウインドウＷＤを使用すると、背景の影響
を大きく受けて、特徴ベクトルＶＱあるいは全体類似度
の逆数Ｑsad ( i,j,ｚ_n)の安定化処理の精度、ひいては
距離の計測の精度が損なわれる虞があった。That is, as shown in FIG. 9A, when the edge 51 in the reference image # 1 is remarkable, there is a possibility that the edge of the object 50 exists near the edge 51. With the window WD of the unique square in this case, greatly affected by background feature vector VQ or overall similarity reciprocal _{Qsad (i, j, z n} ) accuracy of stabilization treatment, hence the distance There is a possibility that the accuracy of the measurement is impaired.

【０１８５】この点、本実施形態では、図９（ｂ）に示
すように、ウインドウＷＤＸの形状が、エッジ５１の方
向に沿った形状に変化されるので、背景の影響を大きく
受けることがなくなり、安定化処理の精度、ひいては距
離の計測の精度を向上させることができる。In this respect, in the present embodiment, as shown in FIG. 9B, the shape of the window WDX is changed to a shape along the direction of the edge 51, so that it is not greatly affected by the background. In addition, it is possible to improve the accuracy of the stabilization process, and thus the accuracy of the distance measurement.

【０１８６】・基準画像＃１の２次元的特徴として明度
の変化率を抽出する場合特徴抽出部５１６で、基準画像＃１の２次元的特徴とし
て、明度の変化率が抽出された場合を想定する。In the case of extracting the change rate of lightness as a two-dimensional feature of the reference image # 1 It is assumed that the feature extractor 516 extracts the change rate of lightness as the two-dimensional feature of the reference image # 1. I do.

【０１８７】いま、基準画像＃１の選択画素Ｐ₁の情報
から「明度の変化率」が抽出されたものとすると、図１
０（ａ）に示すように、この抽出された明度の変化率の
絶対値が、通常のサイズのウインドウＷＤ内で加算され
る。Now, assuming that the “brightness change rate” is extracted from the information of the selected pixel P ₁ of the reference image # 1, FIG.
As shown in FIG. 0 (a), the absolute value of the extracted change rate of the brightness is added within the window WD of the normal size.

【０１８８】そこで、この加算された値ａが一定のしき
い値よりも小さい場合には、特徴ベクトルＶＱあるいは
全体類似度の逆数Ｑsad ( i,j,ｚ_n)を安定化させるため
に用いられるウインドウＷＤのサイズが、通常のサイズ
（図１０（ａ）参照）から、図１０（ｂ）に示すよう
に、通常よりも大きいサイズのウインドウＷＤＸに変更
される。[0188] Therefore, when the added value a is smaller than a certain threshold value is used to stabilize the feature vector VQ or overall similarity reciprocal _{Qsad (i, j, z n} ) of The size of the window WD is changed from a normal size (see FIG. 10A) to a window WDX having a size larger than normal, as shown in FIG. 10B.

【０１８９】これは、明度の変化率の絶対値をウインド
ウ内で加算した値ａが、ある一定のしきい値以上でない
とすると、画像＃１内の特徴が不足しており、安定化が
不十分であると予測されるからである。そこで、このよ
うな場合は、ウインドウのサイズを大きくして、画像＃
１内の特徴をより多く取り入れるようにして、安定化を
十分に行うようにするものである。安定化を十分に行う
ことで、距離の計測の精度を向上させることができる。This is because if the value a obtained by adding the absolute value of the change rate of the brightness within the window is not equal to or greater than a certain threshold value, the features in the image # 1 are insufficient, and the stabilization is not possible. This is because it is expected to be sufficient. Therefore, in such a case, the size of the window is increased and the image #
In order to stabilize sufficiently, the feature in 1 is taken in more. Sufficient stabilization can improve the accuracy of distance measurement.

【０１９０】つぎに、距離推定の制御部５２０で行われ
る内容について図１１を併せ参照して説明する。Next, the contents performed by the control unit 520 for distance estimation will be described with reference to FIG.

【０１９１】特徴抽出部５１６で、基準画像＃１の２次
元的特徴として、明度の変化率が抽出された場合を想定
する。It is assumed that the feature extraction unit 516 extracts a change rate of brightness as a two-dimensional feature of the reference image # 1.

【０１９２】いま、基準画像＃１の選択画素Ｐ₁の情報
から「明度の変化率」が抽出されたものとすると、この
抽出された明度の変化率の絶対値が、ウインドウＷＤ内
で加算される。[0192] Now, when the reference image # 1 information of the selected pixel P ₁ and an "brightness change rate" is extracted, the absolute value of the rate of change of the extracted brightness, are added in the window WD You.

【０１９３】そこで、この加算された値ａが一定のしき
い値以上である場合には、真の距離における全体的な安
定化類似度の逆数Ｑ’ssadの変化率は、大きいと予測さ
れる。逆に、この加算された値ａが一定のしきい値より
も小さい場合には、真の距離における全体的な安定化類
似度の逆数Ｑ’ssadの変化率は、小さいと予測される。Therefore, when the added value a is equal to or larger than a certain threshold value, the rate of change of the reciprocal Q'ssad of the overall stabilization similarity at the true distance is predicted to be large. . Conversely, if this added value a is smaller than a certain threshold value, the rate of change of the reciprocal Q'ssad of the overall stabilized similarity at the true distance is predicted to be small.

【０１９４】つまり、加算値ａが大きい（明度の変化率
が大きい）画素Ｐ₁では、画像＃１の特徴は大きく、真
の距離近傍における全体的な安定化類似度の逆数Ｑ’ss
adの変化率は、大きいと予測される。That is, in the pixel P ₁ having the large addition value a (the change rate of the lightness is large), the feature of the image # 1 is large, and the reciprocal Q′ss of the overall stabilized similarity near the true distance is obtained.
The rate of change of ad is expected to be large.

【０１９５】そこで、いま、加算値ａが一定のしきい値
以上（明度の変化率が大きい）であると判断された場合
には、図１１に示すように、大きい変化率で全体的な安
定化類似度の逆数Ｑ’ssadが変化している極小点が探索
され、これに対応する仮定距離ｚ_x1が、推定距離（真の
距離）の候補とされる。Therefore, if it is determined that the added value a is equal to or larger than a certain threshold value (the change rate of brightness is large), as shown in FIG. A minimum point at which the reciprocal Q'ssad of the generalized similarity is changed is searched for, and a hypothetical distance z _x1 corresponding to this is set as a candidate for an estimated distance (true distance).

【０１９６】一方、全体的な安定化類似度の逆数Ｑ’ss
adが最小になっている仮定距離ｚ_x2が探索されるととも
に、この仮定距離ｚ_x2近傍における全体的な安定化類似
度の逆数Ｑ’ssadの変化率が求められる。ここで、この
仮定距離ｚ_x2近傍における変化率と、上述するように既
に求められた仮定距離ｚ_x1近傍における変化率とが比較
され、大きい変化率になっている仮定距離ｚ_x1が最終的
に真の距離であると推定される。On the other hand, the inverse Q'ss of the overall stabilization similarity
The hypothetical distance z _x2 in which ad is minimized is searched, and the rate of change of the reciprocal Q′ssad of the overall stabilized similarity near the hypothetical distance z _x2 is obtained. Here, the rate of change in the assumption distance z _x2 vicinity, and the rate of change in the assumed distance z _x1 vicinity already asked to described above are compared, assuming has become larger rate of change distance z _x1 finally Estimated to be true distance.

【０１９７】なお、逆に、加算値ａが一定のしきい値よ
りも小さくなっている（明度の変化率が小さい）と判断
された場合には、全体的な安定化類似度の逆数Ｑ’ssad
の変化率が小さくなっている極小点を探索し、これに対
応する仮定距離を、推定距離（真の距離）の候補とすれ
ばよい。Conversely, if it is determined that the addition value a is smaller than a certain threshold value (the change rate of brightness is small), the reciprocal Q ′ of the overall stabilization similarity is determined. ssad
It is sufficient to search for a minimum point where the rate of change of is small, and to set a hypothetical distance corresponding to this as a candidate for the estimated distance (true distance).

【０１９８】ここに、従来のステレオ装置の距離推定部
３２０（あるいは距離推定部１０７、距離推定部２１
６）では、図１４あるいは図１６に示すように、類似度
の逆数Ｑssadが最も小さくなったときの仮定距離ｚ_nxを
もって一義的に、真の距離であると推定していた。Here, distance estimating section 320 (or distance estimating section 107, distance estimating section 21) of the conventional stereo apparatus is used.
In 6), as shown in FIG. 14 or FIG. 16, the assumed distance z _nx when the reciprocal Qssad of the similarity becomes the smallest is uniquely estimated as the true distance.

【０１９９】しかし、図１１に示すように、仮定距離ｚ
と類似度の逆数Ｑssadの対応関係によっては、類似度の
逆数Ｑssadが最も小さくなったときの仮定距離ｚ_x2では
なくて、値としては最も小さくはないが、その近辺の変
化率が大きい極小となったときの仮定距離ｚ_x1が真の距
離である場合がある。つまり、従来のものでは、類似度
の逆数Ｑssadが最も小さくなったときの仮定距離ｚ_x2を
もって一義的に、真の距離であると推定しているため
に、変化率が大きい極小となったときの仮定距離ｚ_x1が
真の距離であることを見逃してしまうことがあった。However, as shown in FIG.
Depending on the correspondence relationship between the reciprocal Qssad of the similarity and the reciprocal Qssad of the similarity, instead of the assumed distance z _x2 when the reciprocal Qssad of the similarity becomes the smallest, the value is not the smallest, but the local minimum where the rate of change in the vicinity is large In some cases, the assumed distance z _x1 may be a true distance. In other words, in the related art, when the reciprocal Qssad of the similarity is minimized, the assumed distance z _x2 when the reciprocal Qssad becomes the minimum is uniquely estimated as the true distance. _May be overlooked that the assumed distance z _x1 is a true distance.

【０２００】この点、本実施形態によれば、類似度の大
きさ以外に、類似度の変化率を考慮して、距離を推定す
るようにしているので、真の距離を見逃してしまうこと
なく、距離推定の精度を向上させることができる。In this regard, according to the present embodiment, the distance is estimated in consideration of the rate of change of the similarity other than the magnitude of the similarity, so that the true distance is not missed. , The accuracy of distance estimation can be improved.

【０２０１】なお、本実施形態では、図９、図１０に示
すウインドウの形状、サイズを変化させる処理が、図８
の特徴ベクトルの安定化部における特徴ベクトルＶＱの
安定化のための処理に適用されるものとして説明した
が、本発明としては、これに限定されるわけではない。
図９、図１０に示すウインドウの形状、サイズを変化さ
せる処理を、図１７、図１８、図２１のウインドウ内加
算部における類似度の安定化処理に適用してもよい。In this embodiment, the processing for changing the shape and size of the window shown in FIGS.
Has been described as being applied to the process for stabilizing the feature vector VQ in the feature vector stabilizing unit, but the present invention is not limited to this.
The processing for changing the shape and size of the window shown in FIGS. 9 and 10 may be applied to the similarity stabilization processing in the intra-window addition unit in FIGS. 17, 18, and 21.

【０２０２】また、実施形態では、図９、図１０に示す
ようにウインドウを空間的に変化させているが、時間的
にウインドウを変化させてもよい。Further, in the embodiment, the windows are changed spatially as shown in FIGS. 9 and 10, but the windows may be changed temporally.

【０２０３】また、本実施形態では、図１１に示す真の
距離の候補を求める処理が、図８の距離推定部における
真の距離の推定処理に適用されるものとして説明した
が、本発明としては、これに限定されるわけではない。
図１１に示す真の距離の候補を求める処理を、図１７、
図１８、図２１の距離推定部における真の距離の推定処
理に適用してもよい。Further, in the present embodiment, the processing for obtaining the true distance candidates shown in FIG. 11 has been described as being applied to the true distance estimation processing in the distance estimating unit in FIG. Is not limited to this.
The processing for obtaining the true distance candidates shown in FIG.
The present invention may be applied to the processing of estimating a true distance in the distance estimating unit in FIGS.

【０２０４】また、実施形態では、画像の特徴を、基準
画像センサ１で取得された基準画像＃１から抽出してい
るが、これ以外の画像センサ２〜Ｎで取得された画像＃
２〜＃Ｎから抽出してもよい。In the embodiment, the features of the image are extracted from the reference image # 1 acquired by the reference image sensor 1, but the image # 2 acquired by the other image sensors 2 to N is extracted.
2 to #N.

[Brief description of the drawings]

【図１】図１は本発明の装置の構成を示すブロック図で
ある。FIG. 1 is a block diagram showing a configuration of an apparatus of the present invention.

【図２】図２（ａ）、（ｂ）は、オクルージョンパター
ンを生成する様子を示す図である。FIGS. 2A and 2B are diagrams showing how an occlusion pattern is generated.

【図３】図３は実施例の記憶テーブルの内容を示す図で
ある。FIG. 3 is a diagram illustrating contents of a storage table according to the embodiment;

【図４】図４（ａ）、（ｂ）は正規化オクルージョンベ
クトルから局所オクルージョンベクトルを生成する処理
を説明する図である。FIGS. 4A and 4B are diagrams illustrating a process of generating a local occlusion vector from a normalized occlusion vector.

【図５】図５はステレオ対毎に求めた正規化オクルージ
ョンベクトルを示す図である。FIG. 5 is a diagram showing normalized occlusion vectors obtained for each stereo pair.

【図６】図６（ａ）、（ｂ）はステレオ対毎に求めた正
規化オクルージョンベクトルから局所オクルージョンベ
クトルを合成する処理を説明する図である。FIGS. 6A and 6B are diagrams for explaining a process of synthesizing a local occlusion vector from a normalized occlusion vector obtained for each stereo pair.

【図７】図７は仮定距離と類似度の逆数および補正類似
度の逆数の対応関係を示す図である。FIG. 7 is a diagram illustrating a correspondence relationship between an assumed distance, a reciprocal of a similarity, and a reciprocal of a corrected similarity;

【図８】図８は本発明の基準画像の２次元的特徴を利用
する実施形態の装置の構成を示すブロック図である。FIG. 8 is a block diagram showing a configuration of an apparatus according to an embodiment of the present invention that uses two-dimensional features of a reference image.

【図９】図９（ａ）、（ｂ）は安定化のためのウインド
ウの形状を変化させる処理を説明する図である。FIGS. 9A and 9B are diagrams illustrating a process of changing the shape of a window for stabilization.

【図１０】図１０（ａ）、（ｂ）は安定化のためのウイ
ンドウの大きさを変化させる処理を説明する図である。FIGS. 10A and 10B are diagrams illustrating a process of changing the size of a window for stabilization.

【図１１】図１１は真の距離を、２つの候補点から推定
する様子を説明する図である。FIG. 11 is a diagram for explaining how a true distance is estimated from two candidate points.

【図１２】図１２は従来の２眼ステレオの画像処理の原
理を示した図である。FIG. 12 is a diagram illustrating the principle of image processing of a conventional twin-lens stereo.

【図１３】図１３は従来の２眼ステレオの距離計測の処
理を説明するための図である。FIG. 13 is a diagram for explaining a conventional distance measurement process of a twin-lens stereo.

【図１４】図１４は２眼ステレオの場合の仮定距離と類
似度の逆数との対応関係を示すグラフである。FIG. 14 is a graph showing a correspondence relationship between an assumed distance and a reciprocal of similarity in the case of binocular stereo.

【図１５】図１５は従来の多眼ステレオの距離計測の処
理内容を説明するための図である。FIG. 15 is a diagram for explaining the processing content of a conventional multi-lens stereo distance measurement.

【図１６】図１６は多眼ステレオの場合の仮定距離と類
似度の逆数との対応関係を示すグラフである。FIG. 16 is a graph showing the correspondence between the assumed distance and the reciprocal of the similarity in the case of multi-view stereo.

【図１７】図１７は従来の２眼ステレオ装置の構成を示
したブロック図である。FIG. 17 is a block diagram showing a configuration of a conventional twin-lens stereo apparatus.

【図１８】図１８は従来の多眼ステレオ装置の構成を示
したブロック図である。FIG. 18 is a block diagram showing a configuration of a conventional multi-view stereo apparatus.

【図１９】図１９は独立した２つの物体を撮像する様子
を示す斜視図である。FIG. 19 is a perspective view showing a state in which two independent objects are imaged.

【図２０】図２０（ａ）、（ｂ）はオクルージョンを説
明するために用いた図である。FIGS. 20A and 20B are diagrams used to explain occlusion. FIGS.

【図２１】図２１は従来の多眼ステレオ装置の構成を示
したブロック図である。FIG. 21 is a block diagram showing a configuration of a conventional multi-view stereo apparatus.

【図２２】図２２は本発明の実施形態の装置の構成を示
すブロック図である。FIG. 22 is a block diagram illustrating a configuration of an apparatus according to an embodiment of the present invention.

[Explanation of symbols]

１〜Ｎ画像センサ４１２、５１２特徴ベクトルの生成部４１３、５１３特徴ベクトルの安定化部４１４、５１４補正類似度算出部４１５、５１５距離推定部５１６基準画像の２次元的特徴抽出部５１７特徴ベクトル生成の制御部５１８特徴ベクトル安定化の制御部５１９補正類似度算出の制御部５２０距離推定の制御部 1 to N Image sensor 412, 512 Feature vector generation unit 413, 513 Feature vector stabilization unit 414, 514 Corrected similarity calculation unit 415, 515 Distance estimation unit 516 Reference image two-dimensional feature extraction unit 517 Feature vector generation Control unit 518 control unit for feature vector stabilization 519 control unit for correction similarity calculation 520 control unit for distance estimation

フロントページの続き (72)発明者木村茂神奈川県川崎市宮前区菅生ヶ丘９−１− 403 (72)発明者中野勝之東京都目黒区中目黒２−２−30 (72)発明者山口博義神奈川県平塚市四之宮2597 株式会社小松製作所特機事業本部研究部内 (72)発明者新保哲也神奈川県平塚市四之宮2597 株式会社小松製作所特機事業本部研究部内 (72)発明者川村英二神奈川県川崎市宮前区有馬２丁目８番24 号株式会社サイヴァース内 (72)発明者緒方正人神奈川県鎌倉市上町屋345番地三菱プレシジョン株式会社内 (56)参考文献特開平10−283474（ＪＰ，Ａ) 特開平９−231358（ＪＰ，Ａ) 特開平９−196633（ＪＰ，Ａ) 特開平９−27969（ＪＰ，Ａ) 特開平９−9294（ＪＰ，Ａ) 特開平８−313212（ＪＰ，Ａ) 特開平６−265322（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁶，ＤＢ名) G01B 11/00 G01C 3/06 G06T 7/00 Continued on the front page (72) Inventor Shigeru Kimura 9-1-403, Suugagaoka, Miyamae-ku, Kawasaki-shi, Kanagawa (72) Inventor Katsuyuki Nakano 2-2-30 Nakameguro, Meguro-ku, Tokyo (72) Inventor Hiroyoshi Yamaguchi 2597 Shinomiya, Hiratsuka-shi, Kanagawa Prefecture, Komatsu Ltd.Specialty Equipment Division, Research Department (72) Inventor Tetsuya Shinbo 2597 Shinomiya, Hiratsuka-shi, Kanagawa Prefecture, Komatsu Ltd. 2-8-24 Arima, Miyamae-ku Inside Cyvers Corporation (72) Inventor Masato Ogata 345 Kamimachiya, Kamakura City, Kanagawa Prefecture Inside Mitsubishi Precision Corporation (56) References JP-A-10-283474 (JP, A JP-A-9-231358 (JP, A) JP-A-9-196633 (JP, A) JP-A-9-27969 (JP, A) JP-A-9-9294 (JP, A) JP-A 8- 313212 (JP, A) JP-A-6-265322 (JP, A) (58) Fields investigated (Int. Cl. ⁶ , DB name) G01B 11/00 G01C 3/06 G06T 7/00

Claims

(57) [Claims]

A plurality of image pickup means are arranged at a predetermined interval, and correspond to a selected pixel in a picked-up image of the one image pickup means when one of the plurality of image pickup means picks up a target object. An object recognition device that calculates a distance to a point on the object and recognizes the object based on the distance obtained for each pixel; a correspondence candidate of each imaging unit corresponding to the selected pixel; The corresponding candidate point information extracting means for extracting local information such as brightness information near the point for each assumed distance, and the local information near the corresponding candidate point of each imaging means extracted here as input, and A feature vector generating means for generating a feature vector that accurately characterizes the feature, and a feature for performing information stabilization processing by integrating the generated feature vector data positionally or temporally for each element. Estimating information in three-dimensional space by providing a vector stabilizing means and a distance estimating means for estimating the distance to an object using the relationship between the elements of the stabilized feature vectors. Characteristic space recognition device.

2. A feature vector generating means for generating a feature vector including information such as a possible occlusion direction and intensity as an element by using local information of a corresponding candidate point of each imaging means as an input; Using the feature vector stabilization means that performs information stabilization processing by accumulating the data of feature vectors that have been positioned or temporally for each element, and using the relationship between those stabilized feature vector elements 2. The space recognition apparatus according to claim 1, further comprising a distance estimating means for estimating a distance to the object, thereby estimating information in a three-dimensional space from which influence of occlusion of the object has been removed.

3. The method of claim 1, wherein when a target object is imaged by one of the plurality of imaging means, an image obtained by a captured image input means of the one imaging means is used for each selected pixel. A feature extraction unit that extracts a feature of a two-dimensional image such as a change rate or an edge direction of the image, and a feature vector generation unit that changes a feature vector generation method according to the feature extracted by the feature extraction unit. 2. A space recognition apparatus according to claim 1, further comprising means for causing the space to be recognized.

4. A method for controlling the brightness of each selected pixel by using an image obtained by a captured image input unit of one of the plurality of imaging units when a target object is imaged. A feature extraction unit for extracting a feature of a two-dimensional image such as a rate of change or an edge direction, and a space for stabilization by the feature vector stabilization unit according to the feature extracted by the feature extraction unit. 2. The space recognition apparatus according to claim 1, further comprising means for changing a region or a time region.

5. The method according to claim 1, wherein when one of the plurality of imaging units captures an image of the target object, the image obtained by the captured image input unit of the one imaging unit is used to adjust the brightness of each selected pixel. A feature extraction unit for extracting a feature of a two-dimensional image such as a rate of change or an edge direction, and a unit for changing a method of estimating a distance by the distance estimation unit according to the feature extracted by the feature extraction unit. The space recognition apparatus according to claim 1, further comprising:

6. A plurality of imaging means are arranged at a predetermined interval, and correspond to a selected pixel in a captured image of the one imaging means when one of the plurality of imaging means images a target object. A distance to a point on the object to be calculated, and an object recognition device configured to recognize the object based on the distance obtained for each pixel; a correspondence candidate of each imaging unit corresponding to the selected pixel; A corresponding candidate point information extraction process for extracting local information such as brightness information near a point for each assumed distance, and a local feature based on the local information near the corresponding candidate point of each imaging unit extracted here as input. A feature vector generation process for generating a feature vector that accurately characterizes, and a feature for performing information stabilization processing by integrating the generated feature vector data positionally or temporally for each element. Estimating information in 3D space by providing a vector stabilization process and a distance estimation process that estimates the distance to an object using the relationship between elements of these stabilized feature vectors A featured space recognition method.

7. A feature vector generation process for generating a feature vector including information such as a possible occlusion direction and intensity as an element by using local information of a corresponding candidate point of each imaging unit as an input, and generating the feature vector. Using the feature vector stabilization process that performs information stabilization processing by accumulating the feature vector data for each element positionally or temporally, and using the relationship between those stabilized feature vector elements 6. The space recognition method according to claim 5, further comprising: estimating information in a three-dimensional space from which influence of occlusion of the object has been removed by providing a distance estimation step of estimating a distance to the object.

8. A brightness for each selected pixel using an image obtained by a captured image input unit of one of the plurality of imaging units when the target object is imaged by the one of the plurality of imaging units. A feature extraction step of extracting a feature of a two-dimensional image such as a rate of change or an edge direction, and a feature vector generation method is changed in the feature vector generation step according to the feature extracted in the feature extraction step. 6. The space recognition method according to claim 5, further comprising the steps of:

9. A method of controlling the brightness of each selected pixel using an image obtained by a captured image input unit of one of the plurality of imaging units when the target object is imaged. A feature extraction step for extracting a feature of a two-dimensional image such as a rate of change or an edge direction, and a space for stabilizing the feature vector in a stabilization step of the feature vector according to the feature extracted in the feature extraction step. 6. The space recognition method according to claim 5, further comprising a step of changing a region or a time region.

10. A brightness for each selected pixel using an image obtained by a captured image input unit of one of the plurality of imaging units when the target object is imaged by the one of the plurality of imaging units. A feature extraction step of extracting a two-dimensional image feature such as a change rate or an edge direction of the image; and a step of changing a distance estimation method in the distance estimation step according to the feature extracted in the feature extraction step. 6. The space recognition method according to claim 5, further comprising: