JP5267330B2

JP5267330B2 - Image processing apparatus and method

Info

Publication number: JP5267330B2
Application number: JP2009128028A
Authority: JP
Inventors: 博則墨友
Original assignee: Konica Minolta Inc
Current assignee: Konica Minolta Inc
Priority date: 2009-05-27
Filing date: 2009-05-27
Publication date: 2013-08-21
Anticipated expiration: 2029-05-27
Also published as: JP2010277262A

Abstract

PROBLEM TO BE SOLVED: To enable a user to easily understand the movement of a moving body in an image processing apparatus that causes a display device to display images captured by a drive recorder.

SOLUTION: A three-dimensional position information calculation part 11 calculates three-dimensional position information from time-series stereo images captured by a drive recorder; a moving body extraction part 12a extracts the same moving body across the images; a face setting part 13 sets the projection face desired by the user for display, that is, the line-of-sight direction; a three-dimensional position information integration part 15 integrates the time-series images on the set projection face; a three-dimensional position information calculation part 16 calculates each position of the moving body on the integrated screen; and a three-dimensional position information projection part 14 causes a display device 3 to display the result. The movement of the moving body analyzed from the time-series three-dimensional captured images can therefore be converted for display into an image seen from the line of sight of a driver or of an eyewitness to an accident, so that the user can easily understand the movement of the moving body.

COPYRIGHT: (C)2011, JPO&INPIT

Description

The present invention relates to an image processing apparatus and method that analyze the motion of a moving object of interest from time-series images captured by an in-vehicle camera or the like and display the result on a display device.

In recent years, the automobile industry has been studying a variety of systems aimed at improving safety. These systems use sensors or cameras to acquire information about the vehicle's surroundings, in particular distance information, in order to judge the risk of collision and help avoid accidents. In camera-based hazard avoidance systems in particular, obstacles around the vehicle are identified and their motion is analyzed from the camera's captured images so that the obstacles can be avoided.

Systems have also been developed that, when an accident occurs because the danger could not be avoided, analyze images from before and after the accident to extract information useful for investigating its cause. For example, Patent Document 1 discloses a system that acquires images before and after an accident with a camera installed at an intersection or the like and analyzes them to assess the circumstances of the accident, such as the speed of the vehicles involved. In this system, a camera is installed in advance at an intersection or other location that may become an accident site (where accidents occur frequently), and plan-view data of the stationary objects at that intersection, such as the road surface and pedestrian crossings, is prepared. The images before and after the accident are projected onto this plan-view data (the vehicle trajectory is drawn), which makes it possible to analyze the circumstances of the accident.

Patent Document 2 describes obtaining, from feature points of a monocular moving image, a camera vector value (CV value) indicating the three-dimensional position and attitude of a camera mounted on a moving body, and superimposing the camera position on the image based on the obtained CV value.

Patent Document 1: JP 2004-102426 A
Patent Document 2: JP 2008-5478 A

However, although the prior art disclosed in Patent Document 1 can analyze the situation of a vehicle in detail, it requires the plan-view data to be prepared in advance, and so it can only handle the analysis of accidents at predetermined locations.

The prior art disclosed in Patent Document 2, on the other hand, has the problem that an accurate CV value is difficult to calculate for an image dominated by moving-object regions. Furthermore, although it can obtain the motion of the camera, that is, of the moving body carrying it (the host vehicle), it gives no consideration to obtaining the positions of other moving objects in the image.

An object of the present invention is to provide an image processing apparatus and method that display the motion of a moving object, analyzed from time-series three-dimensional captured images, in a form that the user can easily understand.

The image processing apparatus of the present invention receives as input image data from which three-dimensional position information of subjects in time-series captured images can be obtained, processes the input image data, and causes a display device to display the result. The apparatus is characterized by including: a moving object extraction unit that extracts the same moving object and a stationary object from the time-series captured images; a plane setting unit that sets a projection plane for the display; and a three-dimensional position information projection unit that creates a display image in which the three-dimensional position information of the moving object extracted by the moving object extraction unit is projected, with the stationary object as a reference, onto the projection plane set by the plane setting unit.

The image processing method of the present invention receives as input image data from which three-dimensional position information of subjects in time-series captured images can be obtained, and processes the input image data to create a display image. The method is characterized by including: a step of extracting the same moving object and a stationary object from the time-series captured images; a step of setting a projection plane for the display; and a step of creating a display image in which the three-dimensional position information of the extracted moving object is projected, with the stationary object as a reference, onto the set projection plane.

With the above configuration, the input image data may be, for example, time-series images captured by a stereo camera, in which case the three-dimensional position information of subjects in the captured images is obtained by corresponding-point search between the left and right stereo images. Alternatively, it may be time-series images captured by a monocular camera together with distance information from a radar or the like. Either way, the three-dimensional position information of subjects in the captured images is acquired along with the time-series captured images. An image processing apparatus that processes such input image data for display on a display device is provided with a moving object extraction unit, an input unit, and a three-dimensional position information projection unit. The moving object extraction unit extracts the same moving object and a stationary object from the time-series captured images, while through the plane setting unit the user sets the projection plane desired for the display, that is, the line-of-sight direction. The three-dimensional position information projection unit then creates a display image in which the three-dimensional position information of the extracted moving object at each point in the time series is projected, with the stationary object as a reference, onto the projection plane set by the plane setting unit, for example as trajectory lines or points or as superimposed live-action images of the moving object itself, and causes the display device to display it.
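As a rough illustration of projecting a moving object's time-series 3D positions onto a user-selected line of sight, the following is a minimal sketch under stated assumptions, not the method of the embodiments. It assumes a simple pinhole model, a right-handed world frame with the y axis pointing up, and all positions already expressed in one common frame. The function name `project_trajectory` and its parameters (`eye`, `target`, `focal`) are hypothetical.

```python
import math

def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def _normalize(a):
    n = math.sqrt(_dot(a, a))
    return (a[0] / n, a[1] / n, a[2] / n)

def project_trajectory(points, eye, target, up=(0.0, 1.0, 0.0), focal=1.0):
    """Project 3D points onto the image plane of a virtual viewer.

    eye    : viewer position (hypothetical stand-in for the chosen viewpoint)
    target : point the viewer looks at (defines the line-of-sight direction)
    Points at or behind the viewer are skipped.
    """
    forward = _normalize(_sub(target, eye))
    right = _normalize(_cross(forward, up))
    true_up = _cross(right, forward)
    projected = []
    for p in points:
        rel = _sub(p, eye)
        depth = _dot(rel, forward)
        if depth <= 0:
            continue  # not visible from this viewpoint
        projected.append((focal * _dot(rel, right) / depth,
                          focal * _dot(rel, true_up) / depth))
    return projected

# Trajectory of one moving object, viewed from the origin looking along +z.
track = [(0.0, 1.0, 2.0), (1.0, 0.0, 2.0), (0.0, 0.0, -1.0)]
view = project_trajectory(track, eye=(0.0, 0.0, 0.0), target=(0.0, 0.0, 1.0))
```

Changing `eye` and `target` corresponds to choosing a different line of sight, such as a driver's or a bystander's; the same trajectory is simply re-projected.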

Therefore, the motion of a moving object analyzed from time-series three-dimensional captured images can be converted for display into an image seen from, for example, the driver's line of sight or that of an eyewitness to an accident, and the user can easily understand the motion of the moving object.

Furthermore, the image processing apparatus of the present invention is characterized by further including: a three-dimensional information integration unit that, from the output of the moving object extraction unit, integrates the three-dimensional positions of the moving object across frames with the stationary object as a reference; and a three-dimensional position information calculation unit that, for the frames integrated by the three-dimensional information integration unit, takes the three-dimensional position of the moving object in one frame as a reference, calculates the three-dimensional positions of the moving object in the remaining frames, and outputs them to the three-dimensional position information projection unit.

With the above configuration, projection is performed after the position information of the moving object in each frame has been integrated with the stationary object as a reference. Therefore, even when the camera that captures the time-series images is itself moving, that is, when the camera is an in-vehicle camera such as a drive recorder, a display image can be created not only from the driver's line of sight but from any line of sight (projection plane), such as that of an eyewitness standing on the road.

In the image processing apparatus of the present invention, the three-dimensional position information projection unit projects, for each frame, the three-dimensional position information of the moving object and the stationary object extracted by the moving object extraction unit onto the projection plane, and the apparatus is characterized by further including a projection image integration unit that integrates the projection planes by aligning the stationary object across the projected frame images.

With the above configuration, the per-frame projection results are integrated with the stationary object as a reference. Therefore, even when the camera that captures the time-series images is itself moving, that is, when the camera is an in-vehicle camera such as a drive recorder, a display image can be created not only from the driver's line of sight but from any line of sight (projection plane), such as that of an eyewitness standing on the road.

Furthermore, in the image processing apparatus of the present invention, the output of the moving object extraction unit is input to the plane setting unit, which sets a virtual projection plane based on the extraction result in one frame, and the three-dimensional position information projection unit integrates, from the output of the moving object extraction unit, the three-dimensional positions of the moving object in each frame with the stationary object as a reference and projects them onto the virtual projection plane set by the plane setting unit.

With the above configuration, projection onto the virtual projection plane is performed after the position information of the moving object in each frame has been integrated with the stationary object as a reference. Therefore, even when the camera that captures the time-series images is itself moving, that is, when the camera is an in-vehicle camera such as a drive recorder, a display image can be created not only from the driver's line of sight but from any line of sight (projection plane), such as that of an eyewitness standing on the road.

In the image processing apparatus of the present invention, the three-dimensional position information projection unit creates the display image by combining the three-dimensional position information with information on the motion of each moving object extracted by the moving object extraction unit.

With the above configuration, information on the motion of each moving object, such as an arrow or a numerical value indicating its speed, is displayed together with the three-dimensional position information, so the user can understand the motion of the moving object more easily.

Furthermore, in the image processing apparatus of the present invention, the projection plane and the three-dimensional position information projected onto it by the three-dimensional position information projection unit are a live-action image obtained by converting the input image data into an image seen from the angle set by the plane setting unit.

With the above configuration, the moving-object image is created by angle-converting the live-action image, which gives the display a realistic appearance.

In the image processing apparatus of the present invention, in the three-dimensional position information projection unit, the projection plane is a schematic road surface seen from the angle set by the plane setting unit, and the three-dimensional position information consists of live-action landmarks extracted from the input image data.

With the above configuration, the display image is created by pasting landmarks taken from the live-action image onto a road surface resembling computer graphics, so the display is both easy to recognize and realistic.

Furthermore, in the image processing apparatus of the present invention, the projection plane and the three-dimensional position information projected onto it consist of a schematic pictorial drawing seen from the angle set by the plane setting unit.

With the above configuration, the moving-object positions are projected onto a schematic pictorial drawing, for example the schematic road surface combined with schematic road-traffic markings such as lane divisions, pedestrian crossings, and traffic signals, and the user's line-of-sight direction can be set arbitrarily for display. The user can therefore understand the motion of the moving object more easily from a computer-graphics-like image that shows only the minimum necessary information.

As described above, in the image processing apparatus and method of the present invention, image data from which the three-dimensional position information of subjects in time-series captured images can be obtained together with those images is taken as input, and the input image data is processed and displayed on a display device. The moving object extraction unit extracts the same moving object from the time-series captured images, while through the plane setting unit the user sets the projection plane desired for the display, that is, the line-of-sight direction. In response, the three-dimensional position information projection unit creates a display image in which the three-dimensional position information of the extracted moving object at each point in the time series is projected onto the set projection plane, for example as trajectory lines or points or as superimposed images of the moving object itself, and causes the display device to display it.

Therefore, the motion of a moving object analyzed from time-series three-dimensional captured images can be converted for display into an image seen from, for example, the driver's line of sight or that of an eyewitness to an accident, and the user can easily understand the motion of the moving object.

A block diagram showing the electrical configuration of an accident verification system including the image processing apparatus according to the first embodiment of the present invention, using a fixed-point camera.
A block diagram showing the electrical configuration of an accident verification system including the image processing apparatus according to the first embodiment of the present invention, using a drive recorder.
A diagram for explaining how the user specifies a moving-object image portion.
A diagram showing an example of integrated display of the time-series positions of a moving object according to an embodiment of the present invention, showing the trajectory of a bicycle traveling on a road surface.
A schematic diagram for explaining a method of switching the line-of-sight direction in the integrated display according to an embodiment of the present invention.
A diagram for explaining a method of extracting stationary objects.
A diagram for explaining the integration processing of three-dimensional information.
A diagram for explaining the association of moving objects across time-series images.
A diagram showing, from two line-of-sight directions, the result of integrated display of the time-series positions of the moving object of FIG. 3.
A diagram showing the time-series positions of a moving object in integrated display, schematized as a pictorial drawing.
A diagram in which motion information is additionally projected onto the schematic image shown in FIG. 5.
A diagram for explaining the pop-up display of other motion information.
A diagram showing another example of the schematic image shown in FIG. 5.
A block diagram showing the electrical configuration of an accident verification system including the image processing apparatus according to the second embodiment of the present invention.
A diagram for explaining the alignment of projection planes using pictorial drawings according to the second embodiment of the present invention.
A block diagram showing the electrical configuration of an accident verification system including the image processing apparatus according to the third embodiment of the present invention.
A diagram for explaining the method of projection onto a virtual projection plane according to the third embodiment of the present invention.
A diagram for explaining the method of three-dimensional calculation (distance calculation) on output images from a stereo camera.
A diagram for explaining the method of searching the reference image for points corresponding to the base image when obtaining the left-right parallax of the stereo camera used in the three-dimensional calculation.
A diagram for explaining a multi-resolution strategy effective for the corresponding-point search.
A graph showing correlation values obtained by the phase-only correlation (POC) method.
A diagram showing an example of corresponding-point search in the phase-only correlation method.
A diagram for explaining the corresponding-point search range in the phase-only correlation method.
A diagram for explaining the ICP algorithm.

(Embodiment 1)
FIG. 1 and FIG. 2 are block diagrams showing the electrical configuration of accident verification systems that include the image processing apparatuses 1 and 1a, respectively, according to the first embodiment of the present invention. The accident verification system of FIG. 1 comprises a fixed-point camera 2 installed at an accident-prone intersection or the like, in the manner of an intersection monitoring camera, together with the image processing apparatus 1 and a display device 3 installed in a traffic monitoring room or the like. The accident verification system of FIG. 2 comprises a drive recorder 4 mounted on a vehicle such as a taxi, together with the image processing apparatus 1a and the display device 3 installed at a business office or the like.

The fixed-point camera 2 creates image data from which the three-dimensional position information of subjects in time-series captured images can be obtained together with those images. As shown in FIG. 1, it can therefore be realized by a configuration in which images captured by a stereo camera 21 are recorded continuously in an image recording unit 22. In this case, the image processing apparatus 1 is provided with a three-dimensional position information calculation unit 11 that takes the time-series captured images from the image recording unit 22 as input and obtains the three-dimensional position information of subjects in the captured images by corresponding-point search between the left and right images of the stereo camera 21. Several examples of three-dimensional measurement from the images captured by the stereo camera 21 are described in detail later. On the other hand, if the fixed-point camera 2 records time-series images captured by a monocular camera together with distance information from a radar or the like, the image processing apparatus 1 need not include the three-dimensional position information calculation unit 11. In other words, it suffices that the image processing apparatus 1 receives, as input image data, data from which the three-dimensional position information of subjects in the time-series captured images can be obtained together with those images.
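To make the step from a left-right correspondence to a three-dimensional position concrete, here is a minimal sketch that assumes an idealized, rectified parallel stereo pair, a standard textbook model that may differ from the measurement examples detailed later in the document. The function name `triangulate` and the parameters `focal_px`, `baseline_m`, `cx`, and `cy` are hypothetical.

```python
def triangulate(u_left, v_left, u_right, focal_px, baseline_m, cx, cy):
    """Recover a 3D point (camera frame, metres) from one stereo match.

    Assumes rectified images, so the matched pixel lies on the same row
    in both images and the disparity is purely horizontal.
    u_left, v_left : pixel in the left (base) image
    u_right        : column of the matched pixel in the right image
    focal_px       : focal length in pixels (assumed known from calibration)
    baseline_m     : distance between the two camera centres in metres
    cx, cy         : principal point in pixels
    """
    disparity = u_left - u_right
    if disparity <= 0:
        raise ValueError("disparity must be positive for a point in front of the cameras")
    z = focal_px * baseline_m / disparity   # depth along the optical axis
    x = (u_left - cx) * z / focal_px        # lateral offset
    y = (v_left - cy) * z / focal_px        # vertical offset (image rows grow downward)
    return (x, y, z)

# A match with 20 px disparity seen by an 800 px focal length, 0.2 m baseline rig.
point = triangulate(u_left=420.0, v_left=240.0, u_right=400.0,
                    focal_px=800.0, baseline_m=0.2, cx=320.0, cy=240.0)
```

Because depth is inversely proportional to disparity, a one-pixel matching error matters far more for distant subjects than for near ones, which is one reason sub-pixel corresponding-point search is attractive.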

The drive recorder 4 comprises the stereo camera 21 and the image recording unit 22 together with a temporary storage unit 41 and a trigger generation unit 42. It can be realized by a configuration in which images captured by the stereo camera 21 are stored in the temporary storage unit 41, such as a ring buffer, and captured image data is transferred from the temporary storage unit 41 to the image recording unit 22 and recorded for a predetermined duration starting a predetermined time before the trigger timing generated by the trigger generation unit 42, such as an acceleration sensor (a dangerous moment, for example when a collision is possible). The image recording unit 22 is configured to overwrite old image data when the image data to be recorded exceeds its recording capacity. If the image recording unit 22 has sufficient recording capacity, it may instead record the images from the stereo camera 21 continuously, and the image data from the moment the trigger generation unit 42 generates a trigger, or from a predetermined time before that moment, may be marked so that it can be extracted easily later.
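The pre-trigger and post-trigger recording described above can be sketched as follows. This is an illustrative sketch only: the class name `EventRecorder`, the frame counts, and the use of a list as a stand-in for the image recording unit are all assumptions, and durations are expressed in frames rather than seconds.

```python
from collections import deque

class EventRecorder:
    """Keeps the most recent frames in a ring buffer and, on a trigger,
    saves those frames plus the frames that follow."""

    def __init__(self, pre_frames, post_frames):
        self.ring = deque(maxlen=pre_frames)  # temporary storage (ring buffer)
        self.post_frames = post_frames        # frames saved from the trigger onward
        self.remaining = 0                    # frames still to save for the current event
        self.recorded = []                    # stand-in for the image recording unit

    def push(self, frame, triggered=False):
        # A trigger that arrives while an event is being saved is ignored.
        if triggered and self.remaining == 0:
            self.recorded.extend(self.ring)   # frames captured before the trigger
            self.ring.clear()
            self.remaining = self.post_frames
        if self.remaining > 0:
            self.recorded.append(frame)       # the trigger frame counts as the first post frame
            self.remaining -= 1
        else:
            self.ring.append(frame)           # oldest frame is dropped automatically

recorder = EventRecorder(pre_frames=3, post_frames=2)
for i in range(10):
    recorder.push(frame=i, triggered=(i == 5))
saved = recorder.recorded
```

With these settings, the three frames before the trigger and two frames from the trigger onward are kept, while everything else is discarded as the ring buffer wraps.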

It should be noted that the image processing apparatus 1, in acquiring the three-dimensional position information of subjects in the time-series captured images together with those images and displaying the processed image on the display device 3, comprises: a moving object extraction unit 12 that extracts the same moving object from the time-series captured images; a plane setting unit 13 that sets a projection plane for the display; and a three-dimensional position information projection unit 14 that creates a display image in which the three-dimensional position information of the moving object extracted by the moving object extraction unit 12 is projected onto the projection plane set by the plane setting unit 13. It should also be noted that the image processing apparatus 1a is provided with a three-dimensional information integration unit 15 and a three-dimensional position information calculation unit 16 in addition to the three-dimensional position information calculation unit 11, the moving object extraction unit 12, the plane setting unit 13, and the three-dimensional position information projection unit 14.

The three-dimensional position information calculation unit 11 calculates the three-dimensional positions of the same moving object at a plurality of different times. When the stereo camera 21 is fixed, as in the fixed-point camera 2 of FIG. 1, it suffices to calculate the three-dimensional position of the moving object obtained from each image. In contrast, when the stereo camera 21 is mounted on a vehicle, as in the drive recorder 4 of FIG. 2, the three-dimensional position of the moving object obtained from each image is a position relative to the camera at the moment of capture. If the vehicle carrying the stereo camera 21 and the moving object being photographed travel in the same direction at the same speed, the three-dimensional position of the moving object will always have the same value. The coordinate system of the stereo camera 21 therefore has to be aligned with that of a chosen reference frame.
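The need to align camera coordinates to a reference frame can be illustrated with a small planar sketch. It assumes the camera pose for each frame (position and yaw in the reference frame) is already known; how that pose is estimated, for example from stationary objects, is outside the sketch. The function name `to_reference_frame` and the pose layout `(tx, tz, yaw)` are hypothetical.

```python
import math

def to_reference_frame(point_cam, cam_pose):
    """Map a ground-plane point from camera coordinates to the reference frame.

    point_cam : (x, z) position relative to the camera (x lateral, z forward)
    cam_pose  : (tx, tz, yaw) camera position and heading in the reference frame;
                yaw = 0 means the camera axes coincide with the reference axes
    """
    x, z = point_cam
    tx, tz, yaw = cam_pose
    c, s = math.cos(yaw), math.sin(yaw)
    # Rotate by the camera heading, then translate by the camera position.
    return (c * x + s * z + tx, -s * x + c * z + tz)

# The host vehicle advances 10 m per frame; another vehicle keeps pace with it,
# so its camera-relative position never changes.
camera_poses = [(0.0, 0.0, 0.0), (0.0, 10.0, 0.0), (0.0, 20.0, 0.0)]
observed = [(2.0, 15.0), (2.0, 15.0), (2.0, 15.0)]
track = [to_reference_frame(p, pose) for p, pose in zip(observed, camera_poses)]
```

In camera coordinates the other vehicle appears motionless, whereas in the reference frame it advances 10 m per frame, which is the motion a viewer expects to see in the integrated display.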

The moving object extraction unit 12 extracts the same moving object from the time-series captured images. Here, a moving object means an object that is actually moving relative to the ground, such as an automobile, a motorcycle or other vehicle, a bicycle, or a pedestrian. The following description covers moving object extraction in the case where the stereo camera 21 captures images while mounted on a moving body such as a vehicle, that is, the case of the drive recorder 4. In this case the host vehicle itself is moving, so an object that moves relatively within the images captured by the stereo camera 21 is not necessarily a moving object. Methods for identifying the moving-object image portion in the time-series images generated by the stereo camera 21 are therefore described below. In identifying the moving-object image portion, the moving object extraction unit 12 uses three-dimensional position information obtained by the three-dimensional position information calculation unit 11, such as three-dimensional coordinates, two-dimensional motion vectors, and three-dimensional motion vectors. Identifying a moving-object image portion in an image means, specifically, identifying where among the objects shown in the image a moving object appears and acquiring its three-dimensional image information. A moving-object image portion is the part of an image that corresponds to a moving object shown in it.

First, there is a method that identifies the moving-object-corresponding image portion using the vanishing point of motion. Here, the vanishing point of motion is the point at which the straight lines obtained by extending the motion vector at each point of the image along its direction intersect. This vanishing point is determined by the direction in which the object moves in the image. That is, while the camera moves in a given direction, all points of one object move in the same direction, so a vanishing point exists for that object. Moreover, when the objects in the image are stationary, one and the same vanishing point exists for all of the stationary objects (see "Examination of moving object recognition method using principal component analysis", IPSJ SIG Technical Report, Computer Vision and Image Media, Vol. 1996, No. 31, 1995-CVIM-099, document number IPSJ-CVIM95099008). Most of the image of the subject captured by the stereo camera 21 is considered to be occupied by stationary-object-corresponding image portions, which correspond to stationary objects such as traffic lights, road surfaces, pedestrian crossings and walls. Here, a stationary-object-corresponding image portion is a location in the image that corresponds to a displayed stationary object. Under that assumption, the vanishing point shared by the largest number of motion vectors is presumed to be the vanishing point of the stationary objects corresponding to the stationary-object-corresponding image portions. Therefore, of the vanishing points present in the image, each vanishing point that remains after removing the one supported by the most motion vectors can be estimated to be the vanishing point of a moving object corresponding to a moving-object-corresponding image portion.

Accordingly, the moving object extraction unit 12 extends the motion vectors obtained for the time-series images by the three-dimensional position information calculation unit 11 along their directions and finds, in the image, the vanishing points at which they intersect. Of these vanishing points, every vanishing point other than the one supported by the most motion vectors is estimated to be a vanishing point corresponding to a moving-object-corresponding image portion. On the basis of the vanishing points estimated in this way, the unit identifies the moving-object-corresponding image portion in the image and acquires its three-dimensional image information. The moving-object-corresponding image portion in each time-series image can thus be identified. Because the motion vectors have already been calculated by the three-dimensional position information calculation unit 11, no new motion vectors need to be calculated to find the vanishing points, and the vanishing points can be calculated easily.
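The vanishing-point procedure above can be sketched as a simple voting scheme. This is an illustrative sketch only, not the method of the cited report: the function names, the pixel tolerance `tol`, and the exhaustive pairwise search are all assumptions made here for clarity. Each pair of motion-vector lines proposes a candidate vanishing point, the candidate supported by the most lines is taken as the stationary-object vanishing point, and the remaining vectors are flagged as moving-object candidates.

```python
import math
from itertools import combinations


def line_intersection(p1, d1, p2, d2, eps=1e-9):
    """Intersect two 2D lines given as (point, direction); None if parallel."""
    det = d1[0] * d2[1] - d1[1] * d2[0]
    if abs(det) < eps:
        return None
    dx, dy = p2[0] - p1[0], p2[1] - p1[1]
    t = (dx * d2[1] - dy * d2[0]) / det
    return (p1[0] + t * d1[0], p1[1] + t * d1[1])


def distance_to_line(q, p, d):
    """Perpendicular distance from point q to the line through p along d."""
    cross = (q[0] - p[0]) * d[1] - (q[1] - p[1]) * d[0]
    return abs(cross) / math.hypot(d[0], d[1])


def split_by_dominant_vanishing_point(vectors, tol=2.0):
    """vectors: list of (point, direction) motion vectors in pixels.

    Returns (vanishing_point, stationary_indices, moving_indices).
    The tolerance `tol` (pixels) is a hypothetical tuning parameter.
    """
    best_vp, best_support = None, []
    for i, j in combinations(range(len(vectors)), 2):
        vp = line_intersection(*vectors[i], *vectors[j])
        if vp is None:
            continue
        # Count every motion vector whose extended line passes near vp.
        support = [k for k, (p, d) in enumerate(vectors)
                   if distance_to_line(vp, p, d) <= tol]
        if len(support) > len(best_support):
            best_vp, best_support = vp, support
    moving = [k for k in range(len(vectors)) if k not in best_support]
    return best_vp, best_support, moving


# Synthetic example: four vectors radiate from (100, 50), two from (300, 50).
vectors = [
    ((150, 50), (1, 0)), ((100, 100), (0, 1)),
    ((130, 90), (3, 4)), ((60, 20), (-4, -3)),
    ((320, 80), (2, 3)), ((280, 90), (-1, 2)),
]
vp, stationary, moving = split_by_dominant_vanishing_point(vectors)
```

With this synthetic input the first four vectors support the dominant vanishing point and the last two are returned as moving-object candidates. A practical implementation would need to cope with noisy vectors, for example by clustering candidate points rather than testing exact pairwise intersections.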

Next, a method of identifying the moving-object-corresponding image portion by pattern recognition, template matching or the like is described. For moving objects that are expected to appear as subjects, such as automobiles, motorcycles, bicycles and other vehicles, or pedestrians, the moving-object-corresponding image portion in the image may be identified using pattern recognition or template matching. In pattern recognition, the moving object extraction unit 12 stores data for recognizing these moving objects in advance, performs pattern recognition on the captured image using the stored data to identify the moving-object-corresponding image portion, and acquires its three-dimensional position information. Furthermore, in pattern recognition the moving-object-corresponding image portion can be identified more efficiently by learning the pattern data with a technique such as an SVM (support vector machine) or AdaBoost. In template matching, the moving object extraction unit 12 stores templates of these moving objects in advance and, as in the corresponding point search described above, searches the image for locations that have a high correlation value with a template, thereby identifying the moving-object-corresponding image portion in the image and acquiring its three-dimensional position information.
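The template-matching step can be illustrated with a small exhaustive search. The text only requires a "high correlation value"; the zero-mean normalized cross-correlation used below is one common choice and is an assumption of this sketch, as are the function names and the representation of images as nested lists of grey levels.

```python
import math


def ncc(patch, template):
    """Zero-mean normalized cross-correlation of two equal-sized 2D lists."""
    a = [v for row in patch for v in row]
    b = [v for row in template for v in row]
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    num = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    den = math.sqrt(sum((x - mean_a) ** 2 for x in a)
                    * sum((y - mean_b) ** 2 for y in b))
    # A flat patch has no variance; treat it as uncorrelated.
    return num / den if den > 0 else 0.0


def match_template(image, template):
    """Return ((row, col), score) of the best-matching top-left position."""
    th, tw = len(template), len(template[0])
    best_score, best_pos = -2.0, None
    for r in range(len(image) - th + 1):
        for c in range(len(image[0]) - tw + 1):
            patch = [row[c:c + tw] for row in image[r:r + th]]
            score = ncc(patch, template)
            if score > best_score:
                best_score, best_pos = score, (r, c)
    return best_pos, best_score


# Synthetic example: a 2x2 template embedded in an otherwise flat image.
template = [[1, 2], [3, 4]]
image = [[0] * 6 for _ in range(5)]
image[2][3], image[2][4] = 1, 2
image[3][3], image[3][4] = 3, 4
position, score = match_template(image, template)
```

The exhaustive scan is quadratic in image size and is shown only to make the idea concrete; real systems typically restrict the search window or use an optimized library routine.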

As with the pattern recognition and template matching above, another method that identifies the moving-object-corresponding image portion from moving object candidates is to identify vehicles in the image from the edge distribution in the image, left-right symmetry and the like (see, for example, JP-A-7-334800). The moving object extraction unit 12 may use this method to identify the moving-object-corresponding image portion of a vehicle or the like in the image and acquire its three-dimensional position information.

There is also a method that distinguishes stationary-object-corresponding image portions from moving-object-corresponding image portions in the image by correcting the three-dimensional motion vectors obtained from the stereo time-series images with the moving speed (vehicle speed) of the stereo camera 21 that generated those images (see, for example, JP-A-2006-134035). When this method is used, the moving object extraction unit 12 receives speed information of the vehicle on which the stereo camera 21 is mounted and, using the three-dimensional motion vectors calculated by the three-dimensional position information calculation unit 11, can identify the moving-object-corresponding image portion in the image and acquire its three-dimensional position information.
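One way to picture the speed-based correction is shown below. This is a heavily simplified sketch and not the procedure of the cited publication: it assumes the camera travels straight along its own +Z axis without rotating, that motion vectors are expressed in metres per frame interval in the camera coordinate system, and that a fixed threshold separates the two classes. All names and the threshold value are hypothetical.

```python
import math


def classify_by_ego_motion(motion_vectors, ego_speed, dt, threshold=0.2):
    """Label each 3D motion vector as 'stationary' or 'moving'.

    motion_vectors: list of (dx, dy, dz) displacements observed in the
        camera frame over one frame interval, in metres.
    ego_speed: vehicle speed in m/s (assumed to be along camera +Z).
    dt: frame interval in seconds.
    threshold: residual displacement (metres) above which a point is
        treated as moving; a hypothetical tuning parameter.
    """
    ego_shift = ego_speed * dt
    labels = []
    for dx, dy, dz in motion_vectors:
        # A stationary point appears to move by -ego_shift along Z,
        # so adding ego_shift back leaves its motion over the ground.
        residual = math.sqrt(dx * dx + dy * dy + (dz + ego_shift) ** 2)
        labels.append("moving" if residual > threshold else "stationary")
    return labels


labels = classify_by_ego_motion(
    [(0.0, 0.0, -1.0),   # recedes exactly as fast as the vehicle advances
     (0.0, 0.0, 0.0),    # keeps pace with the vehicle
     (0.5, 0.0, -1.0)],  # also drifts sideways
    ego_speed=10.0, dt=0.1)
```

The second vector illustrates the case mentioned earlier in the description: an object travelling at the same speed and in the same direction as the host vehicle shows no apparent motion in the camera frame, yet is moving over the ground once the vehicle speed is taken into account.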

The moving-object-corresponding image portion may also be identified by having the user, while viewing an image generated by the stereo camera 21, select the moving-object-corresponding image portion from that image. FIG. 3 is a diagram for explaining the case where the user identifies the moving-object-corresponding image portion. When the user instructs the display device 3 and the image recording unit 22 through an input unit (not shown) or the like, the display device 3 displays the live-action image recorded in the image recording unit 22 as is, as shown in FIG. 3(a). By operating a mouse or the like of the input unit, the user can then select part of the image displayed on the display device 3, as indicated by the frames in FIG. 3(b). The selected location is identified as the moving-object-corresponding image portion.

Specifically, an image for which the three-dimensional position information calculation unit 11 has calculated three-dimensional position information is read from the image recording unit 22 and displayed on the display device 3. In addition to the image, the display device 3 shows, for example, a cursor whose position on the screen can be moved with a mouse. When a specific portion of the screen is selected with the cursor, the three-dimensional position information of the selected portion is input to the moving object extraction unit 12 and the moving-object-corresponding image portion is identified. For example, as shown in FIG. 3(a), when the user uses the input unit to select, from the image displayed on the display device 3, the moving-object-corresponding image portions M1 and M2 containing the automobiles m1 and m2 and the moving-object-corresponding image portion M3 containing the pedestrian m3, the moving object extraction unit 12 identifies these moving-object-corresponding image portions in the image and acquires their three-dimensional position information.

Such selection of the moving-object-corresponding image portion may be performed for every image, but because that is laborious, the moving object extraction unit 12 may instead track the portion automatically and make the selection. For example, FIG. 3(c) is an image captured Δt seconds after FIGS. 3(a) and 3(b); as indicated by the frames in this image as well, the automobiles and the pedestrian are selected as the moving-object-corresponding image portions M1, M2 and M3. Such automatic tracking of the moving-object-corresponding image portion can be done not only by the corresponding point search described above but also, for example, by a method that uses a motion vector computation such as the Lucas-Kanade method described later. The Lucas-Kanade method obtains motion vectors between images; because obtaining motion vectors also establishes correspondences between the images, it can be used to track the moving-object-corresponding image portion.

The moving object extraction unit 12 may identify the moving-object-corresponding image portion by a single one of the methods described above, or may use any of them selectively. For example, the moving-object-corresponding image portion may first be identified by pattern recognition or template matching, and if it cannot be identified by these methods, the user may identify it using the input unit.

Meanwhile, through the plane setting unit 13 the user sets the projection plane desired for the display, that is, the line-of-sight direction. In response, the three-dimensional position information projection unit 14 creates a display image in which the three-dimensional position information at each point of the time series of the moving object extracted by the moving object extraction unit 12 is projected onto the projection plane set by the plane setting unit 13, and causes the display device 3 to display it. FIG. 4 shows an example of the projection result, in which the three-dimensional position information obtained from the images captured by the fixed-point camera 2 is projected as is. FIG. 4 shows the trajectory of a bicycle 52 travelling on a road surface 51; the solid line 53 indicates the trajectory of the bicycle 52 projected onto the road surface 51. In FIG. 4, in addition to the live-action road surface 51 and the computer-generated lines and points of the trajectory of the bicycle 52, the live-action image of the bicycle 52 itself is also superimposed. Which of the trajectory lines, the trajectory points and the moving object image itself are projected may be chosen as appropriate so that the display becomes easier to understand without becoming cluttered (hard to read).
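The projection of a trajectory onto a user-selected plane can be sketched as follows. The description does not state which projection model the three-dimensional position information projection unit 14 uses; the orthographic projection below is an assumption chosen for brevity, and the function names and the `up` hint vector are hypothetical.

```python
import math


def normalize(v):
    length = math.sqrt(sum(c * c for c in v))
    return tuple(c / length for c in v)


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def project_trajectory(points, view_dir, up):
    """Orthographically project 3D points onto the plane normal to view_dir.

    view_dir: line-of-sight direction chosen by the user.
    up: rough 'up' hint for the projected image; it must not be
        parallel to view_dir.
    Returns a list of 2D (u, v) coordinates on the projection plane.
    """
    n = normalize(view_dir)
    right = normalize(cross(up, n))   # horizontal axis of the plane
    plane_up = cross(n, right)        # vertical axis of the plane
    return [(dot(p, right), dot(p, plane_up)) for p in points]


# Trajectory of one moving object at successive times (x, height, depth).
trajectory = [(0, 1, 0), (1, 1, 2), (2, 1, 4)]
overhead = project_trajectory(trajectory, view_dir=(0, -1, 0), up=(0, 0, 1))
side = project_trajectory(trajectory, view_dir=(1, 0, 0), up=(0, 1, 0))
```

Changing `view_dir` is the counterpart of switching the line of sight: the same three-dimensional trajectory yields an overhead view in one call and a side view in the other. A perspective projection from a specific eye position would need an additional division by depth.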

FIG. 5, by contrast, schematically shows switching of the line-of-sight direction. FIG. 5(a) is an image seen from the installation position of the fixed-point camera 2 and corresponds to FIG. 4, whereas FIG. 5(b) is an image seen from the line-of-sight direction of, for example, a witness to the accident, indicated by reference numeral 54 in FIG. 5(a).

In the image processing apparatus 1 for the fixed-point camera 2 shown in FIG. 1, the three-dimensional position information that accompanies the motion of the moving object extracted by the moving object extraction unit 12 can simply be projected in order. In the image processing apparatus 1a for the drive recorder 4 shown in FIG. 2, however, the host vehicle is also moving, so the frames have to be aligned with one another. For this reason, in the image processing apparatus 1a the moving object extraction unit 12a also extracts stationary objects. Notably, the image processing apparatus 1a further includes a three-dimensional information integration unit 15, which integrates the three-dimensional positions of the moving object across frames from the output of the moving object extraction unit 12a with the stationary objects as the reference, and a three-dimensional position information calculation unit 16, which, for the frames integrated by the three-dimensional information integration unit 15, calculates the three-dimensional position of the moving object in the remaining frames relative to its three-dimensional position in a certain frame and outputs the result to the three-dimensional position information projection unit 14.

FIG. 6 is a diagram for explaining the method of extracting stationary objects. The moving object extraction unit 12 also identifies the stationary-object-corresponding image portions S1 to S4 in each image on the basis of the three-dimensional coordinates, two-dimensional motion vectors, three-dimensional motion vectors and the like calculated by the three-dimensional position information calculation unit 11. Here, a stationary object is a landmark fixed to the ground, such as a traffic light, a road surface, a pedestrian crossing, a signboard or a wall. In FIG. 6, the following are selected: the stationary-object-corresponding image portion S1, which contains the vicinity of the boundary between the road and the sidewalk, a wall surface and the like; S2, which contains a traffic light and road surface such as a pedestrian crossing; S3, which contains sidewalk, road surface, wall surface and the like; and S4, which contains road surface and the lanes marked on it. The moving object extraction unit 12 also identifies these stationary-object-corresponding image portions S1 to S4 in the image and acquires their three-dimensional image information.

Because the stereo camera 21 is mounted on a vehicle, the stereo camera 21 itself moves, and the stationary-object-corresponding image portions S1 to S4 move across the time-series images. The following methods can be used to identify from the images the stationary-object-corresponding image portions S1 to S4 of stationary objects that are not fixed in the image but do not actually move. The moving object extraction unit 12 uses these methods to identify the stationary-object-corresponding image portions S1 to S4 from the captured images. Alternatively, everything in the captured image other than the moving-object-corresponding image portions M1 to M3 identified by the moving object extraction unit 12 may be identified as stationary.

First, a method of identifying the stationary-object-corresponding image portion using the vanishing point of motion is described. The moving object extraction unit 12 finds the vanishing points in the time-series images from the motion vectors calculated by the three-dimensional position information calculation unit 11 and estimates that, of these vanishing points, the one supported by the most motion vectors is the vanishing point of the stationary objects corresponding to the stationary-object-corresponding image portion. On the basis of the stationary-object vanishing point estimated in this way, the unit identifies the stationary-object-corresponding image portion in the image and acquires its three-dimensional image information. The stationary-object-corresponding image portion in each time-series image can thus be identified. Because the motion vectors have already been calculated by the three-dimensional position information calculation unit 11, no new motion vectors need to be calculated to find the vanishing point, and the vanishing point can be calculated easily.

The moving object extraction unit 12 may also identify the stationary-object-corresponding image portion by using pattern recognition or template matching to detect stationary objects that are expected to be present, that is, landmarks such as traffic lights, signs and signboards. The pattern data and templates used for this purpose may be stored in the moving object extraction unit 12 in advance. In this way, the moving object extraction unit 12 identifies the stationary-object-corresponding image portion in the image and acquires its three-dimensional image information. As with moving object extraction, the stationary-object-corresponding image portion can be identified more efficiently by learning the pattern data. Alternatively, the stationary-object-corresponding image portion may be identified by having the user, while viewing an image generated by the stereo camera 21, select it from that image.

FIG. 7 is a diagram for explaining the processing of the three-dimensional information integration unit 15. Suppose that the captured image shown in FIG. 7(a) is obtained at time T and the captured image shown in FIG. 7(b) is obtained at time T+Δt. When the stationary-object regions are extracted as shown in the aforementioned FIG. 5, the images shown in FIG. 7(c) and FIG. 7(d) are obtained, respectively. In FIG. 7(c) and FIG. 7(d) the moving-object regions are filled in black. First, as shown in FIG. 8(a) and FIG. 8(b), correspondences are established between the two images. For this purpose, the corresponding point search method described later may be used, or the aforementioned Lucas-Kanade method may be used.

FIG. 8(a) and FIG. 8(b) correspond to FIG. 7(c) and FIG. 7(d), respectively. Once correspondences between the two images have been established in this way, the first integration method selects three points that are not collinear from the associated points 61 to 65 and calculates the rotation component (which aligns the normal vectors of the planes) and the translation component (which aligns any one point, or the centroid of the three points) that make the plane formed by the three points at time T coincide with the plane formed by the three points at time T+Δt. The second method selects an arbitrary number of the associated points 61 to 65 and, using the selected points as initial values, calculates the rotation and translation components with ICP (Iterative Closest Point); the ICP algorithm is described later.

Next, the three-dimensional information at time T+Δt is transformed using the calculated rotation and translation components. When the transformed three-dimensional information at time T+Δt is superimposed on the three-dimensional information at time T, the stationary-object regions coincide but the moving-object regions do not, so the same subject appears twice. By superimposing the three-dimensional positions of the same subject at different times, the trajectory of the moving object can be readily seen, as shown in the aforementioned FIG. 4. FIG. 9(a) shows an integrated image of the schematic driver's-view images of FIGS. 3(b) and 3(c) described above, and FIG. 9(b) shows an overhead view of FIG. 9(a). FIG. 9 shows that the vehicle m2 on the right is travelling faster than the vehicle m1 on the left.

The method of integrating two sets of three-dimensional information from different times has been described above. A larger number of time-series images can be integrated in the same way, yielding a live-action image such as that shown in the aforementioned FIG. 4. Specifically, in the first method the three-dimensional information at time T+Δt, time T+2Δt, and so on is aligned with the image at time T as the reference. In the second method, the three-dimensional information is aligned successively, each time with the image at the immediately preceding time as the reference.
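The difference between the two multi-frame strategies can be made concrete with rigid transforms written as a rotation matrix and a translation vector. In the sketch below, which is illustrative only, each pairwise transform maps coordinates of frame k+1 into frame k, as in the second (successive) method; composing them gives the transform of every frame into the reference frame at time T, which the first method would estimate directly. The function names and the (rotation, translation) tuple layout are assumptions.

```python
def mat_mul(a, b):
    """Product of two 3x3 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def mat_vec(m, v):
    """Product of a 3x3 matrix and a 3-vector."""
    return tuple(sum(m[i][k] * v[k] for k in range(3)) for i in range(3))


def compose(outer, inner):
    """Transform equivalent to applying `inner` first, then `outer`."""
    r_o, t_o = outer
    r_i, t_i = inner
    rotation = mat_mul(r_o, r_i)
    translation = tuple(a + b for a, b in zip(mat_vec(r_o, t_i), t_o))
    return rotation, translation


def chain_to_reference(pairwise):
    """pairwise[k] maps frame k+1 coordinates into frame k coordinates.

    Returns a list whose entry k maps frame k coordinates into frame 0.
    """
    identity = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
    to_reference = [(identity, (0, 0, 0))]
    for step in pairwise:
        to_reference.append(compose(to_reference[-1], step))
    return to_reference


def apply(transform, point):
    rotation, translation = transform
    return tuple(a + b for a, b in zip(mat_vec(rotation, point), translation))


# Two identical steps: rotate 90 degrees about Z, then shift by (1, 0, 0).
step = ([[0, -1, 0], [1, 0, 0], [0, 0, 1]], (1, 0, 0))
chain = chain_to_reference([step, step])
point_in_reference = apply(chain[2], (1, 0, 0))
```

Because every entry of the chain is built from all preceding pairwise estimates, any estimation error in one step propagates to later frames, which is the trade-off of successive alignment compared with aligning each frame directly to the reference.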

The points selected for integration may differ from one time to the next. For example, in the first method the points selected for the pair of times T and T+Δt do not necessarily exist for the pair T and T+2Δt, so it is preferable to update the selected points as time advances. Also, when the selected points are close to one another, the three-dimensional match is computed over a local part of the scene; the match within that local region can be accurate, but the result tends to be unstable over the image as a whole. It is therefore preferable to select the three points as far apart as possible, which yields a stable result. Furthermore, although the first method selects three points, multiple sets of three points may be selected and the solution obtained from these sets in a least-squares sense, which allows the solution to be found stably. In addition, in the case of the drive recorder 4, using the image at the time of the trigger as the reference allows the three-dimensional information at the trigger to be output with high accuracy; even if errors accumulate as the three-dimensional information is integrated, the error in the three-dimensional information just before and after the trigger remains small, which is useful for accident analysis and the like.
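The advice to pick three well-separated, non-collinear points can be sketched as a search over candidate triplets. The description does not define how "as far apart as possible" is measured; using the area of the triangle spanned by the three points is an assumption of this sketch, chosen because a near-zero area also signals collinear points. The function names and the `min_area` threshold are hypothetical.

```python
import math
from itertools import combinations


def triangle_area(a, b, c):
    """Area of the 3D triangle with vertices a, b, c."""
    u = tuple(x - y for x, y in zip(b, a))
    v = tuple(x - y for x, y in zip(c, a))
    cx = u[1] * v[2] - u[2] * v[1]
    cy = u[2] * v[0] - u[0] * v[2]
    cz = u[0] * v[1] - u[1] * v[0]
    return 0.5 * math.sqrt(cx * cx + cy * cy + cz * cz)


def pick_spread_triplet(points, min_area=1e-6):
    """Return indices of the three points spanning the largest triangle.

    Raises ValueError when every triplet is (nearly) collinear, since no
    plane can then be fitted for the alignment.
    """
    best = max(combinations(range(len(points)), 3),
               key=lambda idx: triangle_area(*(points[i] for i in idx)))
    if triangle_area(*(points[i] for i in best)) < min_area:
        raise ValueError("all candidate triplets are collinear")
    return best


# Candidate stationary points (3D coordinates in metres).
candidates = [(0, 0, 0), (1, 0, 0), (2, 0, 0), (0, 10, 0), (10, 0, 5)]
chosen = pick_spread_triplet(candidates)
```

For this input the triplet with indices (0, 3, 4) is chosen, while three points on one line, such as the first three candidates alone, are rejected. The exhaustive search grows cubically with the number of candidates, so a larger candidate set would call for sampling.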

With this configuration, the three-dimensional motion of a moving object analyzed from time-series three-dimensional captured images can be converted into, and displayed as, a two-dimensional image seen from any line-of-sight direction, for example an overhead image as shown in FIG. 4 or FIG. 5(a), an image from the driver's viewpoint as shown in FIG. 3, or an image from the viewpoint of a witness to the accident as shown in FIG. 5(b). The user can therefore easily understand the motion of the moving object.

In addition, the image processing apparatus 1a performs the projection after integrating the position information of the moving object across frames with the stationary objects as the reference. Therefore, even when the stereo camera 21 that obtains the time-series captured images is itself moving, that is, when the stereo camera 21 is the on-board camera of the drive recorder 4, a display image can be created not only from the driver's viewpoint but from any viewpoint (projection plane) direction, such as that of a witness standing on the road.

Preferably, the three-dimensional position information projection unit 14 may render the projection plane and the three-dimensional position information projected onto it as a schematic drawing seen from the angle set by the plane setting unit 13. FIG. 10 shows an example. FIG. 10(a) is a live-action image similar to the aforementioned FIG. 4, and FIG. 10(b) shows the same scene rendered as the schematic drawing. Reference numeral 54 denotes the travel trajectory of the host vehicle over the same period as that of the bicycle 52.

In this way, the moving object positions are projected onto a schematic drawing, for example one in which identification symbols for road traffic, such as schematic traffic divisions 55, pedestrian crossings 56 and signals, are composited onto the schematic road surface 51, and the result can be displayed with the user's line-of-sight direction set arbitrarily. The user can thus understand the motion of the moving object more easily from a computer-graphics-like image that shows only the minimum necessary information. The identification symbols for road traffic may be extracted using pattern recognition or the like. The live-action image may also be partially composited into such a computer graphics image of the road surface 51. Specifically, the road surface 51 and the moving object positions are created with computer graphics, while the live-action image is used for landmarks such as traffic lights, signs and signboards. This gives a sense of realism to a computer graphics image that is easy to read.

Furthermore, for the projection, the moving object extraction unit 12 of the image processing apparatus 1 and the three-dimensional position information calculation unit 16 of the image processing apparatus 1a output, for the moving-object-corresponding image portions extracted by the moving object extraction units 12 and 12a, three-dimensional coordinates in which the corresponding image portions of the other frames are integrated with the time-series image of an arbitrary frame as the reference. The three-dimensional coordinates used to obtain this integrated image are hereinafter called normalized three-dimensional coordinates. The method by which the three-dimensional position information calculation unit 16 of the image processing apparatus 1a, which itself moves, calculates these normalized three-dimensional coordinates is described in detail below. First, the three-dimensional information integration unit 15 selects any three points contained in the stationary-object-corresponding image portion of an arbitrary reference image set by the plane setting unit 13. Because the three-dimensional coordinates of the points have already been calculated for each image, the three-dimensional information integration unit 15 can easily select three points that are not on the same straight line. Similarly, the three-dimensional information integration unit 15 obtains, in an image of a frame other than the reference image for which the normalized three-dimensional coordinates are calculated, the three points that correspond to the three points selected in the reference image. For these three corresponding points, the data calculated by the moving object extraction unit 12a may be used, or the three-dimensional information integration unit 15 may obtain them by corresponding point search, the Lucas-Kanade method described later, or the like.

Thus, the three-dimensional information integration unit 15 selects three non-collinear points from the stationary-object-corresponding image portion of the image at time T and finds the corresponding points on the image at time T + Δt. It then calculates the rotation and translation components of the coordinate transformation that must be applied to the three-dimensional coordinates of the three points at time T + Δt so that the plane they define coincides with the plane defined by the three points at time T. In other words, the three-dimensional information integration unit 15 calculates rotation and translation components for a coordinate transformation that aligns the normal vector of the plane through the three points at time T + Δt with the normal vector of the plane through the three points at time T, and that either brings one of the three points at time T + Δt onto one of the three points at time T or brings the centroid of the three points at time T + Δt onto the centroid of the three points at time T. By transforming the three-dimensional coordinates of the identified moving-object-corresponding image portion in the image at time T + Δt with the calculated rotation and translation components, the three-dimensional information integration unit 15 can calculate normalized three-dimensional coordinates referenced to the image at time T.

Here, the three points selected in the integrated image are preferably far apart from one another in three-dimensional coordinates. The stationary-object-corresponding image portions then coincide over a wide area rather than only locally, so the match is more reliable. When several such sets of points are available, the three-dimensional information integration unit 15 may calculate the rotation and translation components from them in a least-squares sense. This yields a more stable solution (rotation and translation components) and improves the accuracy of the three-dimensional coordinate transformation.
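The least-squares estimation of the rotation and translation components from corresponding stationary points can be sketched as follows. This is only an illustrative sketch: the description does not name a solver, so the SVD-based (Kabsch) solution is an assumption, as are NumPy and the hypothetical function names `estimate_rigid_transform` and `to_reference_frame`.

```python
import numpy as np

def estimate_rigid_transform(src, dst):
    """Least-squares R, t such that dst ~= R @ src + t (Kabsch / SVD solution).

    src: (N, 3) stationary points measured at time T + dt (assumed layout).
    dst: (N, 3) the same points measured in the reference frame at time T.
    N >= 3 and the points must not be collinear.
    """
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)
    src_c = src.mean(axis=0)           # centroid at time T + dt
    dst_c = dst.mean(axis=0)           # centroid at time T
    H = (src - src_c).T @ (dst - dst_c)
    U, _, Vt = np.linalg.svd(H)
    # Guard against a reflection so that R is a proper rotation.
    d = np.sign(np.linalg.det(Vt.T @ U.T))
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = dst_c - R @ src_c              # matches the two centroids
    return R, t

def to_reference_frame(points, R, t):
    """Apply the transform to moving-object points measured at time T + dt."""
    return np.asarray(points, dtype=float) @ R.T + t
```

With exactly three non-collinear points this reduces to aligning the two planes and their centroids; with more points it gives the least-squares fit mentioned in the text.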

Another method of transforming the three-dimensional coordinates of the identified moving-object-corresponding image portion with respect to the integrated image is described next: a method using the ICP algorithm mentioned above. In this method, the three-dimensional information integration unit 15 takes as initial values the three-dimensional coordinates of an arbitrary plurality of points in the stationary-object-corresponding image portion extracted by the moving object extraction unit 12a, and acquires the points on the other time-series images that correspond to them. Using the ICP algorithm, the three-dimensional information integration unit 15 can then calculate the rotation and translation components of the coordinate transformation that makes the plurality of points in the stationary-object-corresponding image portion of the image at time T + Δt coincide, in three-dimensional coordinates, with the corresponding points in the stationary-object-corresponding image portion of the reference image captured at time T. By transforming the three-dimensional coordinates of the identified moving-object-corresponding image portion in the image at time T + Δt with the calculated rotation and translation components, the three-dimensional information integration unit 15 can calculate the normalized three-dimensional coordinates of that image portion referenced to the image at time T. Using the ICP algorithm in this way, the three-dimensional information integration unit 15 can perform a robust coordinate transformation over the plurality of corresponding points that is not easily affected by noise.
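A point-to-point ICP loop of the kind referred to above might look like the sketch below. It is an assumption-laden illustration, not the apparatus's implementation: correspondences are found by brute-force nearest neighbour, each step uses an SVD-based rigid fit, and the names `best_fit_transform` and `icp` are hypothetical.

```python
import numpy as np

def best_fit_transform(src, dst):
    """Rigid R, t minimising sum ||R @ src_i + t - dst_i||^2 (SVD solution)."""
    src_c, dst_c = src.mean(axis=0), dst.mean(axis=0)
    H = (src - src_c).T @ (dst - dst_c)
    U, _, Vt = np.linalg.svd(H)
    d = np.sign(np.linalg.det(Vt.T @ U.T))
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    return R, dst_c - R @ src_c

def icp(src, dst, iterations=20, tol=1e-9):
    """Align stationary points at time T + dt (src) to those at time T (dst)."""
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)
    R_total, t_total = np.eye(3), np.zeros(3)
    current = src.copy()
    prev_err = np.inf
    for _ in range(iterations):
        # Pair every current point with its nearest neighbour in dst.
        d2 = ((current[:, None, :] - dst[None, :, :]) ** 2).sum(axis=2)
        idx = d2.argmin(axis=1)
        err = np.sqrt(d2[np.arange(len(current)), idx]).mean()
        if abs(prev_err - err) < tol:
            break                      # mean residual stopped improving
        prev_err = err
        R, t = best_fit_transform(current, dst[idx])
        current = current @ R.T + t
        # Accumulate the overall transform from src to dst.
        R_total = R @ R_total
        t_total = R @ t_total + t
    return R_total, t_total
```

The returned pair can then be applied to the moving-object points of the frame at time T + Δt to express them in the frame at time T. ICP needs a reasonable initial alignment; here the identity is assumed to be close enough, which holds for small inter-frame motion.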

The description above covered the transformation of the three-dimensional coordinates of the identified moving-object-corresponding image portion in the image at time T + Δt with respect to the reference image at time T. For the other time-series images, the three-dimensional information integration unit 15 may likewise calculate rotation and translation components and transform the three-dimensional coordinates of the identified moving-object-corresponding image portion. If the moving body carrying the stereo camera 21 travels straight ahead, the stationary-object-corresponding image portion of a stationary object far ahead appears in several time-series images. If the moving body turns, for example left or right, the stationary-object-corresponding image portions present in the subsequent time-series images change. Depending on the time-series image, a point first selected in the reference image may therefore no longer have a corresponding point. Even in that case, the three-dimensional information integration unit 15 may simply replace (update) the selected points with new ones, and the normalized three-dimensional coordinates can still be calculated by applying the coordinate transformation several times. In this way, the three-dimensional information integration unit 15 can calculate normalized three-dimensional coordinates from the three-dimensional image information of stationary objects without being constrained by the motion of the moving body.

Because the three-dimensional information integration unit 15 calculates normalized three-dimensional coordinates for the moving-object-corresponding image portion identified in this way, information on the motion of the corresponding moving object can be calculated from the output of the three-dimensional information integration unit 15 or, for the fixed-point camera 2, from the output of the moving object extraction unit 12, for which normalized three-dimensional coordinates are obtained in advance. Such information includes, for example, the speed, acceleration, velocity vector, and acceleration vector of the moving object. The motion vectors calculated by the three-dimensional information integration unit 15 are also one such piece of information. A method of calculating the speed, acceleration, and vectors of the moving object from the normalized three-dimensional coordinates is described below. First, let (x1, y1, z1), (x2, y2, z2), and (x3, y3, z3) be the normalized three-dimensional coordinates calculated for the moving-object-corresponding image portion of the same moving object in three consecutive frames captured at a frame interval of t seconds. The speed v1 of the moving object between the frames that gave (x1, y1, z1) and (x2, y2, z2) is then given by the following equation.

\[ v_1 = \sqrt{V_{x1}^2 + V_{y1}^2 + V_{z1}^2} \]
where
\[ (V_{x1}, V_{y1}, V_{z1}) = \left( \frac{x_2 - x_1}{t}, \frac{y_2 - y_1}{t}, \frac{z_2 - z_1}{t} \right). \]

Similarly, the speed v2 of the moving object between the frames that gave (x2, y2, z2) and (x3, y3, z3) is given by the following equation.

\[ v_2 = \sqrt{V_{x2}^2 + V_{y2}^2 + V_{z2}^2} \]
where
\[ (V_{x2}, V_{y2}, V_{z2}) = \left( \frac{x_3 - x_2}{t}, \frac{y_3 - y_2}{t}, \frac{z_3 - z_2}{t} \right). \]

Therefore, the acceleration a of the moving object obtained from the corresponding points of the three images is given by the following equation.

\[ a = \sqrt{A_x^2 + A_y^2 + A_z^2} \]
where
\[ (A_x, A_y, A_z) = \left( \frac{V_{x2} - V_{x1}}{t}, \frac{V_{y2} - V_{y1}}{t}, \frac{V_{z2} - V_{z1}}{t} \right). \]

The three-dimensional motion vectors (Ux1, Uy1, Uz1) and (Ux2, Uy2, Uz2) are
\[ (U_{x1}, U_{y1}, U_{z1}) = (x_2 - x_1,\; y_2 - y_1,\; z_2 - z_1) \]
\[ (U_{x2}, U_{y2}, U_{z2}) = (x_3 - x_2,\; y_3 - y_2,\; z_3 - z_2). \]
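The speed, acceleration, and motion-vector formulas above can be collected into one small routine. The following is a minimal sketch in plain Python; the function name `motion_from_three_frames` and the dictionary layout are illustrative assumptions, not part of the described apparatus.

```python
import math

def motion_from_three_frames(p1, p2, p3, t):
    """Motion of one moving object from three consecutive normalized 3D positions.

    p1, p2, p3: (x, y, z) in consecutive frames; t: frame interval in seconds.
    """
    # Motion vectors between consecutive frames: (Ux, Uy, Uz).
    u1 = tuple(b - a for a, b in zip(p1, p2))
    u2 = tuple(b - a for a, b in zip(p2, p3))
    # Velocity vectors: motion vector divided by the frame interval.
    vel1 = tuple(c / t for c in u1)
    vel2 = tuple(c / t for c in u2)
    # Acceleration vector: change of velocity over one frame interval.
    acc = tuple((b - a) / t for a, b in zip(vel1, vel2))

    def norm(v):
        return math.sqrt(sum(c * c for c in v))

    return {
        "motion_vectors": (u1, u2),
        "velocity_vectors": (vel1, vel2),
        "speeds": (norm(vel1), norm(vel2)),   # v1, v2
        "acceleration_vector": acc,
        "acceleration": norm(acc),            # a
    }
```

If the coordinates are in metres, the speeds come out in m/s; multiplying by 3.6 gives km/h for a display such as the one in FIG. 11(c).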

Preferably, the three-dimensional position information projection unit 14 creates the display image so that it also includes the motion information obtained in this way. FIG. 11 shows the schematic image of FIG. 5(a) with the motion information projected onto it. Specifically, FIG. 11(a) displays the motion between frames as motion vectors, drawn as arrows that connect the same position on the moving object in successive frames. FIG. 11(b) displays the motion between frames as velocity vectors, drawn as arrows whose length varies with speed. FIG. 11(c) superimposes the speed (km/h) itself as the motion information between frames. Displaying the motion information of each moving object in this way lets the user understand the movement of the moving object more easily.

FIG. 12 shows the motion-vector display of FIG. 11(a) in which clicking on a vector with a mouse or the like (or merely hovering over it) pops up other motion information (the speed of FIG. 11(c)). Instead of superimposing the moving-object positions and motion information of all temporally different frames as in FIG. 11 and FIG. 12, the moving-object position in each frame may be displayed one at a time in chronological order (for example, as an animation), as shown in FIGS. 13(a) to 13(c).

(Embodiment 2)
FIG. 14 is a block diagram showing the electrical configuration of an accident verification system that includes an image processing apparatus 1b according to a second embodiment of the present invention. This system is similar to the accident verification systems shown in FIG. 1 and FIG. 2; corresponding parts are given the same reference numerals and their description is omitted. The notable point of this embodiment is that the moving object extraction unit 12a also extracts stationary objects. The three-dimensional position information projection unit 14 first projects, frame by frame, the three-dimensional position information of the moving and stationary objects extracted by the moving object extraction unit 12a onto the projection plane set by the surface setting unit 13, and the projected image integration unit 17 then integrates the projection planes by aligning the stationary objects in the projected frame images.

FIG. 15 illustrates the alignment of projection planes using the schematic drawing. The same alignment is possible when each projection surface is curved, but planes are assumed here to simplify the description. FIG. 15(a) shows the moving object projected onto a plane in each of four time-series images. In FIG. 15(b), these temporally different images are aligned in the plane on the basis of the identification marks for road traffic, which are stationary objects. Specifically, the pedestrian crossing 56 is used for alignment in the vertical direction of the figure and the traffic division 55 for alignment in the horizontal direction. FIG. 15(c) shows the aligned planes after integration.

By integrating the per-frame projection results with stationary objects as the reference in this way, a display image can be created from any line-of-sight (projection plane) direction, such as that of a witness on the road as well as that of the driver, even when the stereo camera 21 that captures the time-series images is moving, that is, even when the stereo camera 21 is an in-vehicle camera such as the drive recorder 4.

(Embodiment 3)
FIG. 16 is a block diagram showing the electrical configuration of an accident verification system that includes an image processing apparatus 1c according to a third embodiment of the present invention. This system is similar to the accident verification systems shown in FIG. 1 and FIG. 2; corresponding parts are given the same reference numerals and their description is omitted. The notable point of this embodiment is that the moving object extraction unit 12a also extracts stationary objects and its output is fed to a surface setting unit 13c, which sets a virtual projection plane based on the extraction result in a certain frame. A three-dimensional position information projection unit 14c then integrates, from the output of the moving object extraction unit 12a, the three-dimensional positions of the moving object in each frame with the stationary objects as the reference, and projects them onto the virtual projection plane set by the surface setting unit 13c.

FIG. 17 illustrates the method of projecting onto such a virtual projection plane. In this embodiment as well, alignment is equally possible when the virtual projection surface is curved, but a plane is assumed to simplify the description. In FIG. 15 described above, the moving object is projected onto the plane set for each frame and the planes are then aligned to create a plan view that gives an overview of the whole scene. In this embodiment, by contrast, a virtual plane that gives an overview of the whole scene is prepared: among the virtual planes of the individual images shown in FIG. 17(a), the three-dimensional position information projection unit 14c creates it by joining the planes set for the remaining frames to the virtual plane set for the reference frame, as shown in FIG. 17(b).

On an uphill road, the virtual plane becomes as shown in FIG. 17(c), so the positional accuracy of the projected moving object decreases. In such a case, GPS information from a navigation system, for example, can be used to determine whether the road is a slope, and the accuracy can be improved by changing the inclination of the virtual plane accordingly. To improve the accuracy further, a virtual curved surface may be set as shown in FIG. 17(d).

By integrating the position information of the moving object across frames with stationary objects as the reference and then projecting onto the virtual projection plane in this way, a display image can be created from any line-of-sight (projection plane) direction, such as that of a witness on the road as well as that of the driver, even when the stereo camera 21 is moving, that is, even when the stereo camera 21 is an in-vehicle camera such as the drive recorder 4.

FIG. 18 illustrates the three-dimensional calculation (distance calculation) performed by the three-dimensional position information calculation unit 11 on the output images of the stereo camera 21 (21-1, 21-2). To simplify the description, the aberrations of the stereo cameras 21-1 and 21-2 are assumed to be well corrected and the cameras are assumed to be mounted in parallel. Even if the actual hardware does not satisfy these conditions, the images can be converted into equivalent images by image processing. The advantage of using images rectified by hardware or by image processing is that the search region for corresponding points can be restricted to one dimension, as described later with reference to FIG. 19(c) and FIG. 20. With a method for which a two-dimensional search is easy, such as the phase-only correlation method described later, it is also possible to establish correspondences on unrectified stereo images and convert the resulting corresponding points directly into three dimensions. Rectification by image processing superimposes noise on the image, so matching on rectified images lowers accuracy; finding corresponding points directly on the unrectified images and converting to three dimensions only at the end keeps the influence of noise to a minimum.

The stereo cameras 21-1 and 21-2 have at least the same focal length (f), the same number of pixels on the imaging surfaces (CCDs) S1 and S2, and the same pixel size (μ). When they are placed left and right a predetermined baseline length (B) apart with their optical axes L1 and L2 parallel and a subject P is photographed, and the parallax (number of shifted pixels) on the imaging surfaces S1 and S2 is Δd (= d1 + d2), the distance (D) to the subject P is given by
\[ D = \frac{f \cdot B}{\Delta d}. \]

The three-dimensional position (X, Y, Z) of each part of the subject P is calculated as follows, where x and y are the position on the pixel grid.

\[ X = \frac{x \cdot D}{f}, \qquad Y = \frac{y \cdot D}{f}, \qquad Z = D \]

An in-vehicle stereo camera, for example, must measure the distance to a distant preceding vehicle with high accuracy, as described above, and must also be small enough to install easily. The depth resolution ΔZ of a stereo camera is expressed as
\[ \Delta Z = \frac{D^2}{B} \cdot \frac{1}{f} \cdot \Delta d \]
so accuracy can be improved by increasing the focal length f or by increasing the baseline length B. As described above, however, the former narrows the field of view and the latter enlarges the apparatus. Sub-pixel matching improves accuracy without these drawbacks: performing the matching computation at a resolution finer than one pixel reduces the parallax resolution Δd and thus refines the resolution of stereo three-dimensional measurement.

FIG. 19 illustrates how the three-dimensional position information calculation unit 11 searches the reference image (stereo camera 21-2) for the point corresponding to a point in the base image (stereo camera 21-1) in order to obtain the parallax Δd. First, as shown in FIG. 19(a), a two-dimensional window W1 of a predetermined size is set on the base image F1 with the point of interest P at its center or centroid. Similarly, as shown in FIG. 19(b), many windows W2 of the predetermined size are set on the reference image F2 at every conceivable position. When the base image F1 and the reference image F2 are arranged almost in parallel, as described above, the corresponding position in the reference image F2 can be assumed to lie on the Y coordinate Py of the point of interest P in the base image F1, as shown in FIG. 19(c). The windows W2 therefore need to be set only on this line (basically, the window W2 is shifted one pixel at a time).

Furthermore, when the base image F1 and the reference image F2 are arranged almost in parallel and the parallax Δd between the point of interest P in the base image F1 and the corresponding position in the reference image F2 is known approximately, the windows W2 need to be set only within the range Δd' of that parallax, as shown in FIG. 19(d).
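The one-dimensional window search of FIG. 19(c) and its restriction to a known parallax range as in FIG. 19(d) can be sketched as below. The similarity measure is not specified at this point in the description, so the sum of absolute differences (SAD) is used purely as a stand-in. The function name `match_along_row`, the parameter names, and the convention that the corresponding point lies at px - d in the reference image are all assumptions.

```python
import numpy as np

def match_along_row(base, ref, px, py, half=3, d_min=0, d_max=32):
    """Return the parallax d in [d_min, d_max] with the lowest SAD cost.

    base, ref: 2D grayscale arrays (rectified, so rows correspond).
    (px, py): point of interest in the base image; the window W1 is the
    (2*half+1)-square around it and must lie inside the image.
    """
    win = base[py - half:py + half + 1, px - half:px + half + 1].astype(float)
    best_d, best_cost = None, np.inf
    for d in range(d_min, d_max + 1):      # shift window W2 one pixel at a time
        cx = px - d
        if cx - half < 0 or cx + half >= ref.shape[1]:
            continue                       # window W2 would leave the image
        cand = ref[py - half:py + half + 1, cx - half:cx + half + 1].astype(float)
        cost = np.abs(win - cand).sum()
        if cost < best_cost:
            best_d, best_cost = d, cost
    return best_d
```

Passing a narrow `d_min` to `d_max` interval corresponds to setting windows only within the range Δd' when the parallax is roughly known.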

In the corresponding-point search, window setting based on a multi-resolution strategy may be used as a method suited to finding many corresponding points or to finding corresponding points in a high-resolution image in a short time. FIG. 20 illustrates the multi-resolution strategy under the assumption that the base image F1 and the reference image F2 are arranged almost in parallel. In FIG. 20(a), as in FIGS. 19(a) and 19(c), a plurality of windows W2 are set in the reference image F2 on the Y coordinate Py of the point of interest P in the base image F1. Next, however, as shown in FIG. 20(b), the resolution of the images F1 and F2 is converted to create low-resolution images F1' and F2', and matching is performed between these low-resolution images. Because the low-resolution images F1' and F2' contain fewer pixels, fewer pixels have to be searched; for example, halving the resolution halves the number of pixels searched along the line.

Once the corresponding position P' has been found at low resolution, the search returns to the high-resolution images F1 and F2, as shown in FIG. 20(c). Because the corresponding position found at low resolution gives the approximate search range, the high-resolution search needs to cover only a very narrow range. Narrowing the search range ΔW in this way makes it possible, within the same time, to find many corresponding points or to search a high-resolution image, as described above.

In the description above, a low-resolution image is created at only one level, but images may be created at several levels so that the search position is narrowed step by step. For example, when the input image is 1280 × 960 pixels, three low-resolution images are created, 640 × 480 pixels at the first level, 320 × 240 pixels at the second level, and 160 × 120 pixels at the third level, and corresponding positions are searched in order starting from the 160 × 120 pixel image.
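The multi-level, coarse-to-fine search can be sketched as follows. This is an illustration only: the resolution conversion is assumed to be 2 × 2 averaging, the cost is SAD as a stand-in for whatever similarity measure is actually used, and the names `downsample`, `sad_search`, `coarse_to_fine_disparity`, and `margin` are hypothetical.

```python
import numpy as np

def downsample(img):
    """Halve the resolution by averaging 2x2 blocks."""
    h, w = img.shape[0] // 2 * 2, img.shape[1] // 2 * 2
    img = img[:h, :w].astype(float)
    return (img[0::2, 0::2] + img[1::2, 0::2] + img[0::2, 1::2] + img[1::2, 1::2]) / 4.0

def sad_search(base, ref, px, py, half, candidates):
    """Best parallax among candidates on row py (SAD cost), or None."""
    h, w = base.shape
    if py - half < 0 or py + half >= h or px - half < 0 or px + half >= w:
        return None
    win = base[py - half:py + half + 1, px - half:px + half + 1]
    best_d, best_cost = None, np.inf
    for d in candidates:
        cx = px - d
        if cx - half < 0 or cx + half >= w:
            continue
        cost = np.abs(win - ref[py - half:py + half + 1, cx - half:cx + half + 1]).sum()
        if cost < best_cost:
            best_d, best_cost = d, cost
    return best_d

def coarse_to_fine_disparity(base, ref, px, py, levels=2, half=3, d_max=32, margin=2):
    """Search the coarsest level fully, then refine within +/- margin per level."""
    pyramid = [(base.astype(float), ref.astype(float))]
    for _ in range(levels):
        b, r = pyramid[-1]
        pyramid.append((downsample(b), downsample(r)))
    scale = 2 ** levels
    b, r = pyramid[-1]
    d = sad_search(b, r, px // scale, py // scale, half, range(0, d_max // scale + 1))
    for level in range(levels - 1, -1, -1):
        if d is None:
            return None
        scale = 2 ** level
        b, r = pyramid[level]
        center = 2 * d                 # parallax doubles when resolution doubles
        d = sad_search(b, r, px // scale, py // scale, half,
                       range(max(0, center - margin), center + margin + 1))
    return d
```

With `levels=3` and a 1280 × 960 input, the pyramid matches the 640 × 480, 320 × 240, and 160 × 120 example in the text, and only the 160 × 120 level is searched over the full parallax range.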

As another method of corresponding-point search in the three-dimensional position information calculation unit 11, a correlation method with suppressed amplitude components, known as a robust pattern-similarity computation, can be used. Such a method computes similarity using only the phase components of the frequency-decomposed signals of the patterns, with the amplitude components suppressed. It is therefore little affected by differences in imaging conditions between the cameras of the stereo camera 21 or by noise, which makes the correlation computation robust. Unlike conventional two-dimensional correlation methods and feature extraction methods that use grayscale data, it is resistant to disturbances and computes accurately even on images of low brightness or low contrast. Known techniques for computing the frequency-decomposed signal of a pattern include the Fourier transform, the discrete cosine (sine) transform, the wavelet transform, and the Hadamard transform. For the discrete cosine transform (DCT) sign-only correlation method, see, for example, "Fusion of Image Signal Processing and Image Pattern Recognition: DCT Sign-Only Correlation and Its Applications" (Hitoshi Kiya, Faculty of System Design, Tokyo Metropolitan University, Dynamic Image Processing for Real Application Workshop 2007, March 8-9, 2007).

The phase-only correlation (POC) method, a representative correlation method that enables robust correlation computation, uses the Fourier transform and performs correlation using only the phase components, with the amplitude components of the Fourier series suppressed. The POC method is described in detail below.

First, let the two images F1 and F2 of size \(N_1 \times N_2\) pixels be \(f(n_1, n_2)\) and \(g(n_1, n_2)\). For convenience of formulation, let the discrete-space indices be \(n_1 = -M_1, \ldots, M_1\) and \(n_2 = -M_2, \ldots, M_2\), so that the image size is \(N_1 = 2M_1 + 1\) pixels by \(N_2 = 2M_2 + 1\) pixels. The two-dimensional discrete Fourier transforms (2D DFT) of these images are then given by

\[ F(k_1, k_2) = \sum_{n_1, n_2} f(n_1, n_2)\, e^{-j 2\pi k_1 n_1 / N_1}\, e^{-j 2\pi k_2 n_2 / N_2} = A_F(k_1, k_2)\, e^{j\theta_F(k_1, k_2)} \]
\[ G(k_1, k_2) = \sum_{n_1, n_2} g(n_1, n_2)\, e^{-j 2\pi k_1 n_1 / N_1}\, e^{-j 2\pi k_2 n_2 / N_2} = A_G(k_1, k_2)\, e^{j\theta_G(k_1, k_2)} \]

where \(k_1 = -M_1, \ldots, M_1\) and \(k_2 = -M_2, \ldots, M_2\), and \(\sum_{n_1, n_2}\) denotes

\[ \sum_{n_1 = -M_1}^{M_1} \sum_{n_2 = -M_2}^{M_2}. \]

\(A_F(k_1, k_2)\) and \(A_G(k_1, k_2)\) are the amplitude components, and \(e^{j\theta_F(k_1, k_2)}\) and \(e^{j\theta_G(k_1, k_2)}\) are the phase components.

The POC method then performs the correlation calculation using only the phase components, with the amplitude components of the Fourier series obtained above suppressed. To this end, the combined phase spectrum $\hat{R}(k_1, k_2)$ of the patterns $f$ and $g$ is first defined as

$$\hat{R}(k_1, k_2) = \frac{F(k_1, k_2)\, \overline{G(k_1, k_2)}}{\left| F(k_1, k_2)\, \overline{G(k_1, k_2)} \right|} = e^{j\theta(k_1, k_2)}$$

Here, the overline denotes the complex conjugate of $G(k_1, k_2)$, and $\theta(k_1, k_2) = \theta_F(k_1, k_2) - \theta_G(k_1, k_2)$.

The correlation calculation is carried out by applying the inverse Fourier transform to this combined phase spectrum $\hat{R}(k_1, k_2)$. That is, the POC function $\hat{r}(n_1, n_2)$ is the two-dimensional inverse discrete Fourier transform (2D IDFT) of $\hat{R}(k_1, k_2)$ and is defined by

$$\hat{r}(n_1, n_2) = \frac{1}{N_1 N_2} \sum_{k_1, k_2} \hat{R}(k_1, k_2)\, e^{j 2\pi k_1 n_1 / N_1}\, e^{j 2\pi k_2 n_2 / N_2}$$

where $\sum_{k_1, k_2}$ denotes

$$\sum_{k_1 = -M_1}^{M_1} \sum_{k_2 = -M_2}^{M_2}$$

As shown in FIG. 21, the POC value obtained by the above POC function is known to have a sharp similarity peak at the coordinates corresponding to the displacement between the images (the base window and the reference window), and is therefore highly robust for image matching. The height of the POC peak indicates the pattern similarity. The three-dimensional position information calculation unit 11 estimates the positional shift (= parallax $d_{sub}$) by estimating the position of the POC peak. Because the POC function is obtained only at discrete points, estimating the peak position by sub-pixel interpolation yields corresponding-region coordinates with high resolution. The peak position can be interpolated by fitting a function such as a parabola. The positional shift $\Delta d$ between the candidate regions is then the pixel-level shift $d_{pixel}$ between the candidate regions plus the sub-pixel shift $d_{sub}$ obtained by the POC method.
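The POC computation and the parabolic peak interpolation described above can be sketched as follows. This is a minimal illustration that assumes NumPy; the function names `poc_surface`, `parabola_offset`, and `poc_peak` are hypothetical and not taken from the original. NumPy's FFT indexes samples from 0 to N-1 rather than from -M to M, so `fftshift` is used to place zero displacement at the window centre, and a small `eps` is added to avoid division by zero.

```python
import numpy as np

def poc_surface(f, g, eps=1e-12):
    """Phase-only correlation surface of two equally sized 2-D windows."""
    F = np.fft.fft2(f)
    G = np.fft.fft2(g)
    cross = F * np.conj(G)
    R = cross / (np.abs(cross) + eps)   # combined phase spectrum (amplitude suppressed)
    r = np.real(np.fft.ifft2(R))        # POC function
    return np.fft.fftshift(r)           # zero displacement at index (h // 2, w // 2)

def parabola_offset(y_minus, y_zero, y_plus):
    """Vertex offset of the parabola through samples at -1, 0, +1."""
    denom = y_minus - 2.0 * y_zero + y_plus
    return 0.0 if denom == 0.0 else 0.5 * (y_minus - y_plus) / denom

def poc_peak(f, g):
    """Return ((dy, dx), height): sub-pixel peak position relative to the
    window centre, and the peak height used as the similarity measure."""
    r = poc_surface(f, g)
    h, w = r.shape
    py, px = np.unravel_index(np.argmax(r), r.shape)
    # Fit a parabola through the peak and its two neighbours along each axis.
    sub_y = parabola_offset(r[(py - 1) % h, px], r[py, px], r[(py + 1) % h, px])
    sub_x = parabola_offset(r[py, (px - 1) % w], r[py, px], r[py, (px + 1) % w])
    return (py - h // 2 + sub_y, px - w // 2 + sub_x), float(r[py, px])
```

For two windows that differ only by a circular shift of a few pixels, the peak appears that many pixels away from the centre with a height close to 1. The sign of the reported shift depends on which window is taken as the base.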

A specific example of the corresponding point search using the phase-only correlation method is therefore as follows. As described above, the POC value is known to have a sharp correlation peak at the coordinates corresponding to the displacement between the images (the base window W1 and the reference window W2). Accordingly, as shown in FIG. 22(a), let P' be the point on the reference image F2 that corresponds to a point P on the base image F1. If the windows W1 and W2 are set so that the points P and P' lie at their respective centers of gravity, the POC value between the windows W1 and W2 peaks at the center of gravity of the windows. If the reference window W2 is instead set so that the point P' is shifted by one pixel in the horizontal direction from the center of gravity, the POC peak also appears one pixel away from the center of gravity. Similarly, as shown in FIG. 22(b), if the reference window W2 is set so that the point P' is shifted by two pixels in the horizontal direction from the center of gravity, the POC peak also appears two pixels away from the center of gravity. Consequently, the window W2 set on the reference image F2, as described with reference to FIG. 19, does not need to be shifted one pixel at a time and may be set at a certain sampling interval. The appropriate sampling interval depends on the searchable range W3, which is generally said to be about half the window size (about ±1/4 of the window size around the center of gravity), as shown in FIG. 23. The sampling interval may therefore be set so that, for example, adjacent windows overlap by about half the window size. Assuming that the maximum parallax between the base image F1 and the reference image F2 is 128 pixels, the window size is 31 × 31, and the range searchable by POC is ±8 pixels around the center of gravity, a parallax of up to 128 pixels can be searched by shifting the window 16 pixels at a time, so that eight windows suffice.
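The window-count arithmetic in the example above can be checked with a few lines of Python. The function name `reference_window_centres` and the choice of placing the first window centre at one half-range from zero parallax are assumptions made for illustration only.

```python
import math

def reference_window_centres(max_disparity, search_half_range):
    """Centres (in pixels of parallax) of the reference windows needed to
    cover 0..max_disparity when each window can find a peak within
    +/- search_half_range of its centre."""
    step = 2 * search_half_range                 # each window covers 2 * half-range
    count = math.ceil(max_disparity / step)      # number of windows required
    return [search_half_range + i * step for i in range(count)]

centres = reference_window_centres(max_disparity=128, search_half_range=8)
print(len(centres), centres)
```

With a maximum parallax of 128 pixels and a searchable range of ±8 pixels, the step is 16 pixels and eight windows are obtained, matching the figure given in the text.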

Furthermore, with the multi-resolution strategy, the search range can also be reduced by reducing the image size, as described with reference to FIG. 20. Specifically, in the example of FIG. 22 above, eight windows had to be set on the reference image F2 for each point on the base image F1; if the images are reduced to 1/2, only half as many, that is, four windows are needed. Reducing by a further 1/2 leaves two windows, and by a further 1/2, one window. In other words, when the maximum parallax is 128 pixels as above, reducing the images to (1/2)^4 = 1/16 brings the maximum parallax down to 8 pixels, so that the search can be performed with a single window. Accordingly, the corresponding position is first found on the image reduced to 1/16; that result is then used as the initial position on the image reduced to 1/8, where a single window is set and the corresponding position is found again; and this procedure is repeated in sequence thereafter.
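The coarse-to-fine procedure can be sketched as below. Both function names are hypothetical, and `estimate_residual` stands in for whatever single-window matcher (for example, POC) is applied at each pyramid level; it is an assumed callback, not an interface defined in the original.

```python
def pyramid_levels(max_disparity, search_half_range):
    """Number of successive 1/2 reductions needed until the maximum
    parallax fits within the range one window can search."""
    levels = 0
    while max_disparity / (2 ** levels) > search_half_range:
        levels += 1
    return levels

def coarse_to_fine(estimate_residual, levels):
    """Refine a disparity from the coarsest level down to full resolution.

    estimate_residual(level, initial) must return the remaining disparity
    found by one window placed at `initial` on the image reduced by 2**level.
    """
    disparity = 0.0
    for level in range(levels, -1, -1):
        disparity += estimate_residual(level, disparity)
        if level > 0:
            disparity *= 2.0   # carry the result to the next finer level
    return disparity

print(pyramid_levels(128, 8))
```

For a maximum parallax of 128 pixels and a searchable range of ±8 pixels, `pyramid_levels` gives four reductions, that is, a 1/16-scale image, as in the text.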

As still other corresponding point search methods in the three-dimensional position information calculation unit 11, the SAD (sum of absolute intensity differences) method, the SSD (sum of squared intensity differences) method, the NCC (normalized cross-correlation) method, and the like can also be used.
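For reference, the three window similarity measures named above can be written as follows. This is a minimal sketch assuming NumPy and equally sized windows; the NCC shown is the plain form without mean subtraction, which is one common definition and may differ from the variant the original has in mind.

```python
import numpy as np

def sad(a, b):
    """Sum of absolute differences; smaller means more similar."""
    # Convert to float first so that unsigned integer images do not wrap around.
    d = np.asarray(a, dtype=np.float64) - np.asarray(b, dtype=np.float64)
    return float(np.sum(np.abs(d)))

def ssd(a, b):
    """Sum of squared differences; smaller means more similar."""
    d = np.asarray(a, dtype=np.float64) - np.asarray(b, dtype=np.float64)
    return float(np.sum(d * d))

def ncc(a, b):
    """Normalized cross-correlation; closer to 1 means more similar."""
    a = np.asarray(a, dtype=np.float64)
    b = np.asarray(b, dtype=np.float64)
    denom = np.sqrt(np.sum(a * a) * np.sum(b * b))
    return float(np.sum(a * b) / denom) if denom > 0 else 0.0
```

SAD and SSD are sensitive to brightness changes between the two cameras, whereas NCC is unaffected by a uniform gain, which is one reason the three are usually offered as alternatives.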

The motion vector calculation by the Lucas-Kanade method mentioned above is described below. The apparent motion between two images, such as time-series images, is called a motion vector (optical flow). Assuming that the same point has the same luminance in both images, the following equation holds for the motion vector.

Here, I is the luminance of the image, x and y are the coordinates on the image, and vx and vy are the components of the motion vector.

A Taylor expansion of the above equation yields the following equation.

Rearranging this equation gives

This equation is called the optical flow constraint equation.
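The three equations referred to in this passage are not reproduced in the text. Under the usual brightness-constancy formulation, and assuming a frame interval $\Delta t$ (a symbol not defined in the original), they would plausibly read as follows. This is a sketch of the standard derivation, not necessarily the notation of the original equations.

$$I(x, y, t) = I(x + v_x,\; y + v_y,\; t + \Delta t)$$

$$I(x + v_x,\; y + v_y,\; t + \Delta t) \approx I(x, y, t) + \frac{\partial I}{\partial x} v_x + \frac{\partial I}{\partial y} v_y + \frac{\partial I}{\partial t} \Delta t$$

$$\frac{\partial I}{\partial x} v_x + \frac{\partial I}{\partial y} v_y + \frac{\partial I}{\partial t} \Delta t = 0$$

The last line is one linear equation in the two unknowns $v_x$ and $v_y$, which is why a single pixel does not determine the motion vector.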

A single instance of the above equation, however, is not enough to determine the motion vector at a point (x, y) on the image. The Lucas-Kanade method therefore sets a window around the point (x, y) and, on the assumption that the motion vector does not change within the window, combines the weighted constraint equations of the pixels in the window into a simultaneous system to compute the motion vector at (x, y). Specifically, this is done by solving the following equation.
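The equation to be solved is not reproduced in the text. The sketch below assumes the standard weighted least-squares form of Lucas-Kanade, in which the 2 × 2 normal equations built from the spatial gradients Ix, Iy and the temporal difference It are solved inside the window. The function name `lucas_kanade_at`, the window half-size, and the use of NumPy are assumptions for illustration.

```python
import numpy as np

def lucas_kanade_at(img0, img1, x, y, half=7, weights=None):
    """Estimate the motion vector (vx, vy) at pixel (x, y) from two frames.

    Solves  sum(w * [[Ix*Ix, Ix*Iy], [Ix*Iy, Iy*Iy]]) @ v = -sum(w * [Ix*It, Iy*It])
    over a (2*half + 1)-pixel square window, assuming constant motion in it.
    """
    i0 = np.asarray(img0, dtype=np.float64)
    i1 = np.asarray(img1, dtype=np.float64)
    iy_full, ix_full = np.gradient(i0)   # derivatives along rows (y) and columns (x)
    it_full = i1 - i0                    # temporal difference for a unit frame interval
    win = (slice(y - half, y + half + 1), slice(x - half, x + half + 1))
    ix, iy, it = ix_full[win].ravel(), iy_full[win].ravel(), it_full[win].ravel()
    w = np.ones_like(ix) if weights is None else np.asarray(weights, dtype=np.float64).ravel()
    a = np.stack([ix, iy], axis=1)       # one constraint row per pixel
    lhs = a.T @ (w[:, None] * a)         # weighted 2x2 structure matrix
    rhs = -a.T @ (w * it)
    vx, vy = np.linalg.solve(lhs, rhs)
    return float(vx), float(vy)
```

The 2 × 2 matrix is singular in textureless regions or along a straight edge, so a practical implementation would check its conditioning before solving. The estimate is also only valid for small displacements, which is why the method is commonly combined with an image pyramid.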

Furthermore, the ICP algorithm mentioned above is described below. The ICP algorithm minimizes the error between corresponding points by iterative calculation, and its processing flow is as follows. As shown in FIG. 24, suppose that a point group $T = \{t_i \mid i \in N_t\}$ consisting of $N_t$ points is to be aligned with a different point group $S = \{s_i \mid i \in N_s\}$ consisting of $N_s$ points. The distance from each point $s_i$ of the point group S to the point group T is then defined as follows.

Letting $m_i \in T$ be the point corresponding to each point $s_i$, the group M of points corresponding to the points $s_i$ is
M = C(S, T)
where C is a function that finds the nearest point.

Once the corresponding point group M of the point group S has been obtained in this way, the alignment parameters (rotation matrix R and translation vector t) are found by minimizing the following expression.

Alignment is achieved by repeating this process until the error becomes sufficiently small.
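The distance and error expressions referred to above are not reproduced in the text. The sketch below assumes the standard ICP formulation: the distance from $s_i$ to T is the distance to its nearest point in T, and the quantity minimized is the sum over i of $\|m_i - (R s_i + t)\|^2$. The closed-form solution via singular value decomposition is a common choice rather than something stated in the original, and all function names are hypothetical.

```python
import numpy as np

def nearest_points(S, T):
    """C(S, T): for each point s_i in S, the closest point m_i in T (brute force)."""
    d2 = np.sum((S[:, None, :] - T[None, :, :]) ** 2, axis=2)
    return T[np.argmin(d2, axis=1)]

def best_rigid_transform(S, M):
    """Rotation R and translation t minimizing sum ||m_i - (R s_i + t)||^2."""
    cs, cm = S.mean(axis=0), M.mean(axis=0)
    H = (S - cs).T @ (M - cm)                 # cross-covariance of centred points
    U, _, Vt = np.linalg.svd(H)
    D = np.eye(H.shape[0])
    D[-1, -1] = np.sign(np.linalg.det(Vt.T @ U.T))   # guard against reflections
    R = Vt.T @ D @ U.T
    return R, cm - R @ cs

def icp(S, T, max_iter=50, tol=1e-12):
    """Align point group S to point group T; returns (R, t, mean squared error)."""
    dim = S.shape[1]
    R_total, t_total = np.eye(dim), np.zeros(dim)
    current = np.array(S, dtype=np.float64)
    err = np.inf
    for _ in range(max_iter):
        M = nearest_points(current, T)                 # correspondence step
        R, t = best_rigid_transform(current, M)        # minimization step
        current = current @ R.T + t
        R_total, t_total = R @ R_total, R @ t_total + t
        residual = nearest_points(current, T) - current
        new_err = float(np.mean(np.sum(residual ** 2, axis=1)))
        converged = new_err < tol or abs(err - new_err) < tol
        err = new_err
        if converged:                                  # error is sufficiently small
            break
    return R_total, t_total, err
```

Because the correspondences are nearest neighbours, the iteration converges to the correct alignment only when the initial misalignment is small relative to the point spacing. The brute-force search is quadratic in the number of points, so a spatial index such as a k-d tree would normally replace it for large point groups.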

1, 1a, 1b, 1c Image processing apparatus
2 Fixed-point camera
3 Display device
4 Drive recorder
11 Three-dimensional position information calculation unit
12, 12a Moving object extraction unit
13, 13c Surface setting unit
14, 14c Three-dimensional position information projection unit
15 Three-dimensional information integration unit
16 Three-dimensional position information calculation unit
17 Projection image integration unit
21; 21-1, 21-2 Stereo camera
22 Image recording unit
41 Temporary storage unit
42 Trigger generation unit
51 Road surface
52 Bicycle
53 Trajectory
54 Line-of-sight direction
55 Traffic lane division
56 Crosswalk
F1 Base image
F2 Reference image
m1, m2 Automobile
m3 Pedestrian
M1, M2, M3 Image portions corresponding to moving objects

Claims

An image processing apparatus that receives, as input, image data consisting of time-series captured images from which three-dimensional position information of a subject in the captured images can be acquired, and that processes the input image data for display on a display device, the apparatus comprising:
a moving object extraction unit that extracts the same moving object and a stationary object from the time-series captured images;
a surface setting unit that sets a projection plane for the display; and
a three-dimensional position information projection unit that creates a display image in which the three-dimensional position information of the moving object extracted by the moving object extraction unit is projected, on the basis of the stationary object, onto the projection plane set by the surface setting unit.

The image processing apparatus according to claim 1, wherein the input image data is time-series captured images from a stereo camera input from a drive recorder,
the apparatus further comprising a three-dimensional position information calculation unit that acquires the three-dimensional position information of the subject in the captured images by performing corresponding point search processing between the left and right images of the stereo camera.

The image processing apparatus according to claim 1, wherein the input image data includes a time series captured image of a monocular camera and radar distance information.

The image processing apparatus according to claim 1, further comprising:
a three-dimensional information integration unit that, from the output of the moving object extraction unit, integrates the three-dimensional positions of the moving object across frames on the basis of the stationary object; and
a three-dimensional position information calculation unit that, for the frames integrated by the three-dimensional information integration unit, calculates the three-dimensional position of the moving object in the remaining frames with reference to the three-dimensional position of the moving object in a certain frame, and outputs the result to the three-dimensional position information projection unit.

The image processing apparatus according to any one of claims 1 to 3, wherein the three-dimensional position information projection unit projects, for each frame, the three-dimensional position information of the moving object extracted by the moving object extraction unit and of the stationary object onto the projection plane,
the apparatus further comprising a projection image integration unit that integrates the projection planes by aligning the stationary object in the projected frame images.

The image processing apparatus according to claim 1, wherein the surface setting unit receives the output of the moving object extraction unit and sets a virtual projection plane with reference to the extraction result in a certain frame, and
the three-dimensional position information projection unit integrates, from the output of the moving object extraction unit, the three-dimensional positions of the moving object in the respective frames on the basis of the stationary object, and projects them onto the virtual projection plane set by the surface setting unit.

The image processing apparatus according to any one of the preceding claims, wherein the three-dimensional position information projection unit creates the display image by combining, with the three-dimensional position information, information on the motion of each moving object extracted by the moving object extraction unit.

The image processing apparatus according to claim 1, wherein, in the three-dimensional position information projection unit, the projection plane and the three-dimensional position information projected thereon are real images obtained by converting the input image data into an image viewed from an angle set by the surface setting unit.

The image processing apparatus according to claim 8, wherein, in the three-dimensional position information projection unit, the projection plane is a schematic road surface viewed from an angle set by the surface setting unit, and the three-dimensional position information is a real image extracted from the input image data.

8. The projection plane and the three-dimensional position information projected thereon comprise a schematic drawing viewed from an angle set by the surface setting unit. An image processing apparatus according to 1.

The image processing apparatus according to claim 10, wherein the schematic drawing is formed by combining identification symbols relating to road traffic with the schematic road surface.

A method that takes, as input, image data consisting of time-series captured images from which three-dimensional position information of a subject in the captured images can be acquired, and that creates a display image by processing the input image data, the method comprising:
extracting the same moving object and a stationary object from the time-series captured images;
setting a projection plane for the display; and
creating a display image by projecting the three-dimensional position information of the extracted moving object onto the set projection plane on the basis of the stationary object.