JP2010244207A

JP2010244207A - Moving object tracking device, moving object tracking method, and moving object tracking program

Info

Publication number: JP2010244207A
Application number: JP2009090489A
Authority: JP
Inventors: Shunsuke Kamijo; 俊介上條
Original assignee: University of Tokyo NUC
Current assignee: University of Tokyo NUC
Priority date: 2009-04-02
Filing date: 2009-04-02
Publication date: 2010-10-28
Anticipated expiration: 2029-04-02
Also published as: WO2010113417A1; JP5371040B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide a moving object tracking device which detects a moving object in an image accurately, even in an image whose background changes. <P>SOLUTION: A moving object tracking device, which is a device for tracking a moving object in an image, detects the moving object in the image by applying image processing to time series images, and includes an object map storage unit, a contour extraction unit, and a correction unit. The object map storage unit allows each image in the time series images to be divided into a plurality of blocks, attaches an identification code indicating the moving object in the image to a block corresponding to the moving object, and stores the identification code. The contour extraction unit extracts the contour of the moving object from the time series images. The correction unit corrects the identification code of the block stored in the object map storage unit based on the contour extracted by the contour extraction unit. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は、時系列画像を画像処理して画像中の移動物体（車、自転車、動物等の移動可能なもの）を追跡する移動物体追跡装置、移動物体追跡方法および移動物体追跡プログラムに関する。 The present invention relates to a moving object tracking device, a moving object tracking method, and a moving object tracking program that process time-series images and track moving objects (movable objects such as cars, bicycles, and animals) in the images.

近年、カメラで撮像された画像を画像処理して、画像中の移動物体を正確に検出する技術が求められている。たとえば、交通事故の早期発見は、迅速な救助活動により人命救助の成功率を高めるだけでなく、警察の実地検分などを迅速にすることにより事故渋滞を緩和することもできるので、様々な交通事故の認識自動化が期待されている。このように交通事故の認識率を高めるためには、カメラで撮像された画像を画像処理して移動物体を正確に検出する必要がある。 In recent years, there has been a demand for a technique for accurately detecting a moving object in an image by processing an image captured by a camera. For example, early detection of traffic accidents can not only increase the success rate of lifesaving through quick rescue activities, but also reduce traffic jams by speeding up on-site inspection by the police. Recognition automation is expected. Thus, in order to increase the recognition rate of traffic accidents, it is necessary to accurately detect a moving object by performing image processing on an image captured by a camera.

図２０は、高速道路の中央線上方に設置されたカメラで撮像された時刻ｔ＝１〜４の画像を模式的に示す。 FIG. 20 schematically shows an image at time t = 1 to 4 captured by a camera installed above the center line of the expressway.

画像上で車両同士が頻繁に重なるので、画像処理により各車両を追跡するのが困難になる。この問題を解決するには、道路に沿って複数台のカメラを設置し、それらの画像を総合的に画像処理する必要がある。 Since vehicles frequently overlap each other on an image, it is difficult to track each vehicle by image processing. In order to solve this problem, it is necessary to install a plurality of cameras along the road and comprehensively process these images.

しかし、カメラ及び画像処理装置を複数台備える必要があるので、コスト高になる。また、各カメラの撮影画像を関係付けて総合的に画像処理しなければならないので、画像処理が複雑になる。 However, since it is necessary to provide a plurality of cameras and image processing apparatuses, the cost increases. In addition, the image processing becomes complicated because the captured images of the respective cameras must be related to each other and comprehensively processed.

そこで、本願発明者らは、次のように時間を遡って移動物体を検出する方法でこの問題を解決した（特許文献１および特許文献２参照。）。 Therefore, the inventors of the present application solved this problem by detecting the moving object by going back in time as follows (see Patent Document 1 and Patent Document 2).

たとえば、時刻ｔ＝１〜４の時系列画像を一時記憶しておき、時刻ｔ＝４から出発して車両Ｍ１とＭ２を識別し、車両Ｍ１とＭ２の動きベクトルを求め、この動きベクトルで時刻ｔ＝４の画像中の車両Ｍ１とＭ２を移動させて、車両Ｍ１とＭ２が識別されている時刻ｔ＝３の画像を想定し、これと時刻ｔ＝３の実際の画像との相関関係から、時刻ｔ＝３の画像中の車両Ｍ１とＭ２を識別する。 For example, time-series images at time t = 1 to 4 are temporarily stored, vehicles M1 and M2 are identified starting from time t = 4, and motion vectors of the vehicles M1 and M2 are obtained. Assuming an image at time t = 3 in which the vehicles M1 and M2 are identified by moving the vehicles M1 and M2 in the image at t = 4, from the correlation between this and the actual image at time t = 3 The vehicles M1 and M2 in the image at time t = 3 are identified.

次に、時刻ｔ＝３と時刻ｔ＝２との画像について同様の画像処理により、時刻ｔ＝２の画像中の車両Ｍ１とＭ２を識別する。次に、時刻ｔ＝２と時刻ｔ＝１の画像について同様の画像処理により、時刻ｔ＝１の画像中の車両Ｍ１とＭ２を識別する。 Next, vehicles M1 and M2 in the image at time t = 2 are identified by similar image processing for the images at time t = 3 and time t = 2. Next, vehicles M1 and M2 in the image at time t = 1 are identified by similar image processing for the images at time t = 2 and time t = 1.

このような画像処理により、１台のカメラで車両Ｍ１とＭ２とを追跡することが可能となる。 Such image processing makes it possible to track the vehicles M1 and M2 with a single camera.

特開２００２−１３３４２１号公報JP 2002-133421 A 特開２００４−２０７７８６号公報Japanese Patent Laid-Open No. 2004-207786

このような従来技術においては、カメラが固定されている状態で撮像された画像、すなわち、背景が固定されている画像を画像処理して、画像中の移動物体を正確に検出していた。 In such a conventional technique, an image picked up with a camera fixed, that is, an image with a fixed background is subjected to image processing to accurately detect a moving object in the image.

これに対して、カメラをパンニングまたはズーミングした場合には、それまで固定されていた背景画像も、カメラをパンニングまたはズーミングに応じて変動してしまう。このように背景が変動する画像を画像処理した場合には、移動物体の画像領域と背景画像との境界が明確でなくなり、画像中の移動物体を正確に検出できないという問題があった。 On the other hand, when the camera is panned or zoomed, the background image that has been fixed so far also changes according to the panning or zooming of the camera. When image processing is performed on an image whose background changes in this way, there is a problem that the boundary between the image area of the moving object and the background image is not clear, and the moving object in the image cannot be accurately detected.

本発明は、このような事情に鑑みてなされたもので、その目的は、背景が変動する画像からでも、画像中の移動物体を正確に検出することができる移動物体追跡装置、移動物体追跡方法および移動物体追跡プログラムを提供することにある。 The present invention has been made in view of such circumstances, and an object thereof is a moving object tracking device and a moving object tracking method capable of accurately detecting a moving object in an image even from an image with a changing background. And providing a moving object tracking program.

この発明は上述した課題を解決するためになされたもので、請求項１に記載の発明は、時系列画像を画像処理して画像中の移動物体を検出する画像上移動物体追跡装置であって、前記時系列画像の各画像が複数のブロックに分割され、当該画像中の移動物体を示す識別符号が前記移動物体に対応する前記ブロックに付けられて記憶されているオブジェクトマップ記憶部と、前記時系列画像の画像から前記移動物体の輪郭を抽出する輪郭抽出部と、前記輪郭抽出部が抽出した輪郭に基づいて、前記オブジェクトマップ記憶部に記憶されている前記ブロックの識別符号を補正する補正部と、を有することを特徴とする移動物体追跡装置である。 The present invention has been made to solve the above-described problems, and the invention according to claim 1 is an on-image moving object tracking device that detects a moving object in an image by processing a time-series image. Each of the time-series images is divided into a plurality of blocks, and an identification code indicating a moving object in the image is attached to the block corresponding to the moving object and stored therein, and the object map storage unit, A contour extracting unit that extracts the contour of the moving object from a time-series image, and a correction that corrects the block identification code stored in the object map storage unit based on the contour extracted by the contour extracting unit. And a moving object tracking device.

請求項２に記載の発明は、前記輪郭抽出部が、前記オブジェクトマップ記憶部に記憶されている前記移動物体のブロックに対応する画像領域に基づいて、前記移動物体の輪郭を抽出する対象領域を設定する対象領域設定部と、前記対象領域設定部が設定した対象領域に対して、前記時系列画像の画像から前記移動物体の輪郭を抽出する輪郭抽出処理部と、を有することを特徴とする請求項１に記載の移動物体追跡装置である。 According to a second aspect of the present invention, the contour extracting unit extracts a target region from which the contour of the moving object is extracted based on an image region corresponding to the block of the moving object stored in the object map storage unit. A target region setting unit to be set; and a contour extraction processing unit that extracts a contour of the moving object from the image of the time-series image with respect to the target region set by the target region setting unit. A moving object tracking device according to claim 1.

請求項３に記載の発明は、前記輪郭抽出部が、前記時系列画像の画像をエッジ抽出処理した画像に対して、前記対象領域設定部が設定した対象領域内でエッジに対応する画素数の個数についてのヒストグラムを座標軸毎に射影して生成し、当該座標軸毎に生成したヒストグラムに基づいて前記対象領域設定部が設定した対象領域を前記座標軸毎に補正する対象領域補正部、を有し、前記輪郭抽出処理部が、前記対象領域補正部が補正した対象領域に対して、前記時系列画像の画像から前記移動物体の輪郭を抽出する、ことを特徴とする請求項２に記載の移動物体追跡装置である。 According to a third aspect of the present invention, the contour extraction unit has the number of pixels corresponding to an edge in the target region set by the target region setting unit with respect to an image obtained by performing edge extraction processing on the time-series image. A target area correction unit that generates a histogram for the number of projections for each coordinate axis and corrects the target area set by the target area setting unit based on the histogram generated for each coordinate axis, for each coordinate axis, The moving object according to claim 2, wherein the contour extraction processing unit extracts a contour of the moving object from the image of the time-series image with respect to the target region corrected by the target region correction unit. A tracking device.

請求項４に記載の発明は、前記輪郭抽出部が、オクルージョンが発生している複数の移動物体を一体の移動物体とし、前記時系列画像の画像から前記一体とした移動物体の輪郭を抽出する、ことを特徴とする請求項１から請求項３のいずれかに記載の移動物体追跡装置である。 According to a fourth aspect of the present invention, the contour extracting unit extracts a plurality of moving objects in which occlusion occurs as an integral moving object, and extracts the contour of the integral moving object from the images of the time-series images. The moving object tracking device according to claim 1, wherein the moving object tracking device is a moving object tracking device according to claim 1.

請求項５に記載の発明は、前記補正部は、前記オブジェクトマップ記憶部に記憶されている前記オクルージョンが発生している複数の移動物体に対応する識別符号に基づいた当該複数の移動物体の境界を示す情報と、前記輪郭抽出部により前記オクルージョンが発生している複数の移動物体を一体とした抽出された移動物体の輪郭とに基づいて、前記オブジェクトマップ記憶部に記憶されている前記ブロックの識別符号を補正する、ことを特徴とする請求項４に記載の移動物体追跡装置である。 According to a fifth aspect of the present invention, the correction unit includes a boundary between the plurality of moving objects based on identification codes corresponding to the plurality of moving objects in which the occlusion occurs, which is stored in the object map storage unit. Of the block stored in the object map storage unit based on the information indicating the contour of the moving object extracted by integrating the plurality of moving objects in which the occlusion is generated by the contour extraction unit. 5. The moving object tracking device according to claim 4, wherein the identification code is corrected.

請求項６に記載の発明は、背景画像が変動しているか否かを判定する判定部と、前記判定部が背景画像が変動していると判定した場合に、前記輪郭抽出部を制御して輪郭を抽出させ、前記補正部を制御して前記オブジェクトマップ記憶部に記憶されている前記ブロックの識別符号を補正させる第１の制御部と、を有することを特徴とする請求項１から請求項５に記載の移動物体追跡装置である。 According to a sixth aspect of the present invention, when the determination unit determines whether or not the background image is fluctuating, and the determination unit determines that the background image is fluctuating, the contour extraction unit is controlled. A first control unit that extracts a contour and controls the correction unit to correct the identification code of the block stored in the object map storage unit. 5. The moving object tracking device according to 5.

請求項７に記載の発明は、前記オブジェクトマップ記憶部に記憶されている移動物体のサイズまたは移動量の変動量を、前記識別符号に基づいて単位時間毎に検出する移動物体変動量検出部と、前記移動物体変動量検出部により検出された単位時間毎の前記移動物体のサイズまたは移動量の変動量が、予め定められているサイズまたは移動量の変動量よりも大きい場合に、前記輪郭抽出部を制御して輪郭を抽出させ、前記補正部を制御して前記オブジェクトマップ記憶部に記憶されている前記ブロックの識別符号を補正させる第２の制御部と、を有することを特徴とする請求項１から請求項６に記載の移動物体追跡装置である。 According to a seventh aspect of the present invention, there is provided a moving object fluctuation amount detection unit that detects a fluctuation amount of the size or movement amount of the moving object stored in the object map storage unit per unit time based on the identification code. The contour extraction is performed when the moving object size or moving amount variation amount per unit time detected by the moving object variation detecting unit is larger than a predetermined size or moving amount variation amount. And a second control unit that controls the unit to extract a contour and controls the correction unit to correct the identification code of the block stored in the object map storage unit. The moving object tracking device according to any one of claims 1 to 6.

請求項８に記載の発明は、前記オブジェクトマップ記憶部には、前記識別符号が前記移動物体に対応する前記ブロックに付けられているとともに、前記ブロックに対応する前記移動物体の動きベクトルが前記ブロックに付けられて記憶されており、前記時系列画像を画像処理した結果に基づいて、前記オブジェクトマップ記憶部に記憶されているブロックの識別符号と動きベクトルとを更新する移動物体追跡部、を有し、前記移動物体追跡部は、前記時系列画像のうち連続するＮ画像（Ｎ≧２）の各々について、隣り合うブロックの動きベクトルの差の絶対値が所定値以内のブロックに同一の識別符号を付けることにより、画像上で互いに重なった移動物体に互いに異なる識別符号を付ける識別符号手順と、前記Ｎ画像の各々において、第１識別符号が付けられたブロック群である第１オブジェクトと第２識別符号が付けられたブロック群である第２オブジェクトとが接し、かつ、前記Ｎ画像について時間的に隣り合う画像の第１オブジェクト間の相関度が所定値以上であるか否かを判定する判定手順と、前記判定手順で肯定と判定された後に、時間を遡って前記第１オブジェクトと前記第２オブジェクトとを追跡する追跡手順と、前記追跡手順により時間を遡って追跡された前記第１オブジェクトと前記第２オブジェクトとに基づいて、前記オブジェクトマップ記憶部に記憶されているブロックの識別符号と動きベクトルとを更新する更新手順と、を実行することを特徴とする請求項１から請求項７に記載の移動物体追跡装置である。 In the invention according to claim 8, in the object map storage unit, the identification code is attached to the block corresponding to the moving object, and the motion vector of the moving object corresponding to the block is the block. And a moving object tracking unit that updates a block identification code and a motion vector stored in the object map storage unit based on the result of image processing of the time-series image. The moving object tracking unit, for each of the consecutive N images (N ≧ 2) of the time-series images, has the same identification code as the block whose absolute value of the motion vector difference between adjacent blocks is within a predetermined value. In each of the N images, an identification code procedure for attaching different identification codes to moving objects that overlap each other on the image, A first object that is a group of blocks to which another code is attached and a second object that is a group of blocks to which a second identification code is attached, and between the first objects of images that are temporally adjacent to the N images A determination procedure for determining whether or not the degree of correlation is greater than or equal to a predetermined value, and a tracking procedure for tracking the first object and the second object retroactively after it is determined affirmative in the determination procedure; An update procedure for updating a block identification code and a motion vector stored in the object map storage unit based on the first object and the second object tracked back in time by the tracking procedure; The moving object tracking device according to claim 1, wherein:

請求項９に記載の発明は、時系列画像を画像処理して画像中の移動物体を検出する画像上移動物体追跡装置において用いられる画像上移動物体追跡方法であって、輪郭抽出部が、前記時系列画像の画像から前記移動物体の輪郭を抽出する輪郭抽出手順と、補正部が、前記時系列画像の各画像が複数のブロックに分割され、当該画像中の移動物体を示す識別符号が前記移動物体に対応する前記ブロックに付けられて記憶されているオブジェクトマップ記憶部に記憶されている前記ブロックの識別符号を、前記輪郭抽出部が抽出した輪郭に基づいて補正する補正手順と、を有することを特徴とする移動物体追跡方法である。 The invention according to claim 9 is an on-image moving object tracking method used in an on-image moving object tracking device that performs time-series image processing to detect a moving object in an image, wherein the contour extraction unit includes the contour extracting unit, The contour extraction procedure for extracting the contour of the moving object from the image of the time-series image, and the correction unit divides each image of the time-series image into a plurality of blocks, and the identification code indicating the moving object in the image is A correction procedure for correcting the block identification code stored in the object map storage unit attached to the block corresponding to the moving object based on the contour extracted by the contour extraction unit. This is a moving object tracking method.

請求項１０に記載の発明は、時系列画像を画像処理して画像中の移動物体を検出する画像上移動物体追跡装置としてのコンピュータに、輪郭抽出部が、前記時系列画像の画像から前記移動物体の輪郭を抽出する輪郭抽出手順を実行し、補正部が、前記時系列画像の各画像が複数のブロックに分割され、当該画像中の移動物体を示す識別符号が前記移動物体に対応する前記ブロックに付けられて記憶されているオブジェクトマップ記憶部に記憶されている前記ブロックの識別符号を、前記輪郭抽出部が抽出した輪郭に基づいて補正する補正手順を実行する、ようにさせるための移動物体追跡プログラムである。 According to a tenth aspect of the present invention, in a computer as an on-image moving object tracking device that performs image processing on a time-series image and detects a moving object in the image, a contour extraction unit includes the movement from the image of the time-series image. A contour extraction procedure for extracting the contour of the object is executed, and the correction unit divides each image of the time-series image into a plurality of blocks, and an identification code indicating the moving object in the image corresponds to the moving object. Movement for causing the block identification code stored in the object map storage unit attached to the block to be corrected based on the contour extracted by the contour extraction unit It is an object tracking program.

この本発明によれば、抽出した輪郭に基づいて、オブジェクトマップ記憶部に記憶されているブロックの識別符号を補正することにより、背景が変動する画像からでも、画像中の移動物体を正確に検出することができるという効果を奏する。 According to the present invention, by correcting the block identification code stored in the object map storage unit based on the extracted contour, it is possible to accurately detect a moving object in the image even from an image whose background changes. There is an effect that can be done.

交差点及びこれに配置された本発明の移動物体追跡装置を用いた移動物体追跡システムの概略を示すブロック図である。It is a block diagram which shows the outline of the moving object tracking system using the intersection and the moving object tracking apparatus of this invention arrange | positioned at this. 図１の移動物体追跡装置の一例としての構成を示すブロック図である。It is a block diagram which shows the structure as an example of the moving object tracking apparatus of FIG. フレーム画像内の、交差点への４つの入口及び交差点からの４つの出口にそれぞれ設定されたスリット及びブロックに付与された移動物体のＩＤを示す説明図である。It is explanatory drawing which shows ID of the moving object provided to the slit and block which were each set to the four entrances to an intersection and the four exits from an intersection in a frame image. （Ａ）及び（Ｂ）はそれぞれ時刻ｔ−１及びｔの画像をブロック境界線とともに模式的に示す図である。(A) And (B) is a figure which shows typically the image of the time t-1 and t with a block boundary line, respectively. （Ａ）及び（Ｂ）はそれぞれ時刻ｔ−１及びｔの画像を画素境界線とともに模式的に示す図である。(A) And (B) is a figure which shows typically the image of the time t-1 and t with a pixel boundary line, respectively. （Ａ）及び（Ｂ）はそれぞれ時刻ｔ−１及びｔの画像を、ブロックに付与された動きベクトルとともに模式的に示す図である。(A) And (B) is a figure which shows typically the image of the time t-1 and t with the motion vector provided to the block, respectively. （Ａ）及び（Ｂ）はそれぞれ時刻ｔ−１及びｔのオブジェクトマップに付与された動きベクトル及びオブジェクト境界を模式的に示す図である。(A) and (B) are diagrams schematically showing motion vectors and object boundaries assigned to the object maps at times t-1 and t, respectively. 未定動きベクトルの推定方法を示すフローチャートである。It is a flowchart which shows the estimation method of an undetermined motion vector. （Ａ）及び（Ｂ）は図８の処理を説明するための、オブジェクトマップに付与された動きベクトル及びオブジェクト境界を模式的に示す図である。(A) And (B) is a figure which shows typically the motion vector and object boundary provided to the object map for demonstrating the process of FIG. （Ａ）〜（Ｃ）は図８の処理を説明するための、オブジェクトマップに付与された動きベクトル及びオブジェクト境界を模式的に示す図である。(A)-(C) is a figure which shows typically the motion vector and object boundary provided to the object map for demonstrating the process of FIG. 移動物体追跡装置２０におけるモードの状態遷移を示す状態遷移図である。FIG. 6 is a state transition diagram showing state transition of modes in the moving object tracking device 20. 図１１の補正モードにおける移動物体追跡装置２０の動作を示すフローチャート図である。It is a flowchart figure which shows operation | movement of the moving object tracking device 20 in the correction mode of FIG. 面積項Ｅａｒｅａを示す説明図である。It is explanatory drawing which shows the area term Earea. 一例としてのＳｎａｋｅｓ処理結果を示す説明図である。It is explanatory drawing which shows the Snakes process result as an example. 一例としてのエッジ分布およびヒストグラムの生成を示す説明図である。It is explanatory drawing which shows the generation of edge distribution and a histogram as an example. Ｓｎａｋｅｓによるオブジェクトマップの修正例を示す説明図である。It is explanatory drawing which shows the example of correction of the object map by Snakes. カメラ１０が変動する場合において、階層間協調アルゴリズムが無い場合における移動物体検出の結果を示す結果図である。It is a result figure which shows the result of a moving object detection in case the camera 10 fluctuates, when there is no cooperation algorithm between layers. カメラ１０が変動する場合において、階層間協調アルゴリズムがある場合における移動物体検出の結果を示す結果図である。It is a result figure which shows the result of a moving object detection in case the camera 10 fluctuates, when there exists a cooperation algorithm between layers. カメラ１０が変動する場合において、オクルージョンが発生している場合における移動物体検出の結果を示す結果図である。It is a result figure which shows the result of a moving object detection in case the occlusion has generate | occur | produced when the camera 10 fluctuates. 高速道路の中央線上方に設置されたカメラで撮像された時刻ｔ＝１〜４の画像を模式的に示す図である。It is a figure which shows typically the image of the time t = 1-4 imaged with the camera installed above the center line of the highway.

以下、図面を参照して、本発明の実施の形態について説明する。図１は、この発明の一実施形態による移動物体追跡装置２０を用いた移動物体追跡システムの構成を示す概略ブロック図である。この移動物体追跡システムは、一例として図１に示すように、交差点を撮像して画像信号を出力する電子カメラ１０と、その画像を処理して移動物体を追跡する移動物体追跡装置２０とを備えている。 Embodiments of the present invention will be described below with reference to the drawings. FIG. 1 is a schematic block diagram showing a configuration of a moving object tracking system using a moving object tracking device 20 according to an embodiment of the present invention. As shown in FIG. 1 as an example, this moving object tracking system includes an electronic camera 10 that captures an intersection and outputs an image signal, and a moving object tracking device 20 that processes the image and tracks the moving object. ing.

電子カメラ１０で撮影された時系列画像は、例えば１２フレーム／秒のレートで、移動物体追跡装置２０が有する後述する画像メモリ２１に格納され、最も古いフレームが新しいフレーム画像で書き換えられる。この電子カメラ１０は、パンニングまたはズーミングされ、撮影する画像領域を変更することができる。この電子カメラ１０に対するパンニングまたはズーミングは、たとえば、移動物体追跡装置２０が制御してもよいし、当該移動物体追跡システムを制御する上位制御装置が制御してもよい。 A time-series image captured by the electronic camera 10 is stored in an image memory 21 (described later) of the moving object tracking device 20 at a rate of 12 frames / second, for example, and the oldest frame is rewritten with a new frame image. The electronic camera 10 can be panned or zoomed to change the image area to be photographed. The panning or zooming for the electronic camera 10 may be controlled by, for example, the moving object tracking device 20 or may be controlled by a host control device that controls the moving object tracking system.

移動物体追跡装置２０は、電子カメラ１０で撮影された時系列画像（後述する画像メモリ２１に格納された時系列画像）を画像処理して、時系列画像を画像処理して画像中の移動物体を検出する。 The moving object tracking device 20 performs image processing on a time-series image (a time-series image stored in an image memory 21 to be described later) taken by the electronic camera 10 and performs image processing on the time-series image to move the moving object in the image. Is detected.

次に図２を用いて、移動物体追跡装置２０の一例としての構成について説明する。 Next, a configuration as an example of the moving object tracking device 20 will be described with reference to FIG.

画像変換部２２は、画像メモリ２１内の各フレーム画像について、これをフレームバッファメモリ２３にコピーし、コピーされた画像のデータを用いて画像メモリ２１内の対応するフレーム画像を空間的差分フレーム画像に変換する。この変換は２段階で行われる。 The image conversion unit 22 copies each frame image in the image memory 21 to the frame buffer memory 23, and uses the copied image data to convert the corresponding frame image in the image memory 21 into a spatial difference frame image. Convert to This conversion is performed in two stages.

元のフレーム画像の第ｉ行第ｊ列の画素値（輝度値）をＧ（ｉ，ｊ）とすると、第１段階における変換後の第ｉ行第ｊ列の画素値Ｈ（ｉ，ｊ）は、次式で表される。 If the pixel value (luminance value) of the i-th row and j-th column of the original frame image is G (i, j), the pixel value H (i, j) of the i-th row and j-th column after the conversion in the first stage. Is expressed by the following equation.

Ｈ（ｉ，ｊ）＝Σneighberpixcels｜Ｇ（ｉ＋ｄｉ，ｊ＋ｄｊ）−Ｇ（ｉ，ｊ）｜・・・（式１） H (i, j) = Σneighberpixcels | G (i + di, j + dj) −G (i, j) | (Expression 1)

ここにΣneighberpixcelsは、ｃを自然数とすると、ｄｉ＝−ｃ〜ｃ及びｄｊ＝−ｃ〜ｃにわたっての総和を意味し、例えばｃ＝１のとき、第ｉ行第ｊ列の画素と隣り合う８画素にわたる総和である。照度が変化すると、画素値Ｇ（ｉ，ｊ）とその付近の画素値Ｇ（ｉ＋ｄｉ，ｊ＋ｄｊ）が同様に変化するので、Ｈ（ｉ，ｊ）の画像は、照度の変化に対し不変である。 Here, Σneighberpixcels means the sum of di = −c to c and dj = −c to c, where c is a natural number. For example, when c = 1, 8 adjacent to the pixel in the i-th row and j-th column. The sum over the pixels. When the illuminance changes, the pixel value G (i, j) and the neighboring pixel value G (i + di, j + dj) change in the same manner, so the image of H (i, j) is invariant to the change in illuminance. .

ここで、隣り合う画素の差分の絶対値は、一般に画素値が大きいほど大きい。移動物体追跡の成功率を高めるには、画素値が小さくて差分が小さい場合も、画素値及び差分が大きい場合とほぼ等価にエッジ情報を取得した方が好ましい。そこで、Ｈ（ｉ，ｊ）を次のように規格化する。 Here, the absolute value of the difference between adjacent pixels is generally larger as the pixel value is larger. In order to increase the success rate of moving object tracking, it is preferable to acquire edge information almost equivalently to the case where the pixel value and the difference are large even when the pixel value is small and the difference is small. Therefore, H (i, j) is normalized as follows.

Ｈ（ｉ，ｊ）＝Σneighberpixcels｜Ｇ（ｉ＋ｄｉ，ｊ＋ｄｊ）−Ｇ（ｉ，ｊ）｜／（Ｇｉ，ｊ，max／Ｇmax）・・・（式２） H (i, j) = Σneighberpixcels | G (i + di, j + dj) −G (i, j) | / (Gi, j, max / Gmax) (Expression 2)

ここに、Ｇｉ，ｊ，maxは、Ｈ（ｉ，ｊ）の計算に用いた元の画素の値の最大値であり、例えばｃ＝１の場合、第ｉ行第ｊ列の画素を中心とする３×３画素の値の最大値であり、Ｇmaxは画素値Ｇ（ｉ，ｊ）の取りうる最大値、例えば画素値が８ビットで表される場合、２５５である。以下、ｃ＝１、Ｇmax＝２５５である場合について説明する。 Here, Gi, j, max is the maximum value of the original pixel value used in the calculation of H (i, j). For example, when c = 1, the pixel in the i-th row and j-th column is the center. Gmax is the maximum value that can be taken by the pixel value G (i, j), for example, 255 when the pixel value is represented by 8 bits. Hereinafter, a case where c = 1 and Gmax = 255 will be described.

Ｈ（ｉ，ｊ）の取りうる最大値は、移動物体毎に異なる。例えばＧ（ｉ，ｊ）＝Ｇmaxで第ｉ行第ｊ列の画素に隣り合う８画素の値がいずれも０で有る場合、Ｈ（ｉ，ｊ）＝８Ｇmaxとなって、Ｈ（ｉ，ｊ）を８ビットで表すことができない。 The maximum value that H (i, j) can take differs for each moving object. For example, if G (i, j) = Gmax and the values of 8 pixels adjacent to the pixel in the i-th row and j-th column are all 0, H (i, j) = 8Gmax and H (i, j ) Cannot be represented by 8 bits.

一方、移動物体のエッジ部のＨ（ｉ，ｊ）の値のヒストグラムを作成してみると、頻度の大部分がＨ＝５０〜１１０の範囲に含まれることがわかった。すなわち、Ｈの値が約１１０より大きくなるほど移動物体追跡のためのエッジ情報の数が少ないので、重要度が低い。 On the other hand, when a histogram of the value of H (i, j) at the edge portion of the moving object was created, it was found that most of the frequency was included in the range of H = 50 to 110. That is, since the number of edge information for tracking a moving object is smaller as the value of H is larger than about 110, the importance is lower.

したがって、Ｈの値の大きい部分を抑圧して変換画素のビット長を短くすることにより、画像処理を高速に行なったほうが好ましい。そこで、第２段階として、このＨ（ｉ，ｊ）を、シグモイド関数を用いた次式により、Ｉ（ｉ，ｊ）に変換する。 Therefore, it is preferable to perform image processing at high speed by suppressing the portion with a large value of H and shortening the bit length of the converted pixel. Therefore, as a second stage, this H (i, j) is converted to I (i, j) by the following equation using a sigmoid function.

Ｉ＝Ｇmax／｛１＋ｅｘｐ〔−β（Ｈ−α）〕｝・・・（式３） I = Gmax / {1 + exp [−β (H−α)]} (Equation 3)

シグモイド関数はＨ＝αの付近で線形性が良い。そこで、閾値αの値を、エッジ情報を持つＨの度数分布の最頻値、例えば８０にする。 The sigmoid function has good linearity in the vicinity of H = α. Therefore, the threshold value α is set to the most frequent value of the frequency distribution of H having edge information, for example, 80.

画像変換部２２は、上式（２）及び（３）に基づいて、画素値Ｇ（ｉ，ｊ）の画像を画素値Ｉ（ｉ，ｊ）の空間的差分フレーム画像に変換し、これを画像メモリ２１に格納する。 Based on the above equations (2) and (3), the image conversion unit 22 converts the image having the pixel value G (i, j) into a spatial difference frame image having the pixel value I (i, j). Store in the image memory 21.

背景画像生成部２４、ＩＤ生成／消滅部２５及び移動物体追跡部２７は、画像メモリ２１中の空間的差分フレーム画像に基づいて処理を行う。以下においては、空間的差分フレーム画像を単にフレーム画像と称す。 The background image generation unit 24, the ID generation / annihilation unit 25, and the moving object tracking unit 27 perform processing based on the spatial difference frame image in the image memory 21. Hereinafter, the spatial difference frame image is simply referred to as a frame image.

背景画像生成部２４は、記憶部と処理部とを備え、処理部は、画像メモリ２１をアクセスし、例えば過去１０分間の全てのフレーム画像の対応する画素について画素値のヒストグラムを作成し、その最頻値（モード）をその画素の画素値とする画像を、移動物体が存在しない背景画像として生成し、これを該記憶部に格納する。背景画像は、この処理が定期的に行われて更新される。 The background image generation unit 24 includes a storage unit and a processing unit. The processing unit accesses the image memory 21 and creates a histogram of pixel values for pixels corresponding to all frame images for the past 10 minutes, for example. An image having the mode value (mode) as the pixel value of the pixel is generated as a background image in which no moving object exists, and is stored in the storage unit. The background image is updated by performing this process periodically.

ＩＤ生成／消滅部２５には、図３に示す如くフレーム画像内の、交差点への４つの入口及び交差点からの４つの出口にそれぞれ配置されるスリットＥＮ１〜ＥＮ４及びＥＸ１〜ＥＸ４の位置及びサイズのデータが予め設定されている。ＩＤ生成／消滅部２５は、画像メモリ２１から入口スリットＥＮ１〜ＥＮ４内の画像データを読み込み、これら入口スリット内に移動物体が存在するかどうかをブロック単位で判定する。図３中のメッシュの升目はブロックであり、１ブロックは例えば８×８画素であり、１フレームが４８０×６４０画素の場合、１フレームは６０×８０ブロックに分割される。あるブロックに移動物体が存在するかどうかは、このブロック内の各画素と背景画像の対応する画素との差の絶対値の総和が所定値以上であるかどうかにより判定する。この判定は、移動物体追跡部２７においても行われる。 As shown in FIG. 3, the ID generation / annihilation unit 25 has positions and sizes of slits EN1 to EN4 and EX1 to EX4 arranged at four entrances to the intersection and four exits from the intersection, respectively, in the frame image. Data is preset. The ID generation / annihilation unit 25 reads the image data in the entrance slits EN1 to EN4 from the image memory 21, and determines in block units whether or not a moving object exists in these entrance slits. The meshes in FIG. 3 are blocks, and one block is, for example, 8 × 8 pixels. When one frame is 480 × 640 pixels, one frame is divided into 60 × 80 blocks. Whether there is a moving object in a certain block is determined by whether the sum of absolute values of differences between the pixels in the block and the corresponding pixels in the background image is equal to or greater than a predetermined value. This determination is also performed in the moving object tracking unit 27.

ＩＤ生成／消滅部２５は、ブロック内に移動物体が存在すると判定すると、このブロックに新たなオブジェクト識別符号（ＩＤ）を付与する。ＩＤ生成／消滅部２５は、ＩＤ付与済ブロックと隣接しているブロックに移動物体が存在すると判定すると、この隣接ブロックに付与済ブロックと同一のＩＤを付与する。このＩＤ付与済ブロックは入口スリットに隣接しているブロックも含まれる。例えば図３中の入口スリットＥＮ１内のブロックにはＩＤ＝１が付与される。 If the ID generation / annihilation unit 25 determines that there is a moving object in the block, the ID generation / annihilation unit 25 assigns a new object identification code (ID) to the block. If the ID generation / annihilation unit 25 determines that a moving object is present in a block adjacent to the ID-assigned block, the ID generation / annihilation unit 25 assigns the same ID as the assigned block to this adjacent block. This ID-added block includes a block adjacent to the entrance slit. For example, ID = 1 is assigned to the block in the entrance slit EN1 in FIG.

ＩＤの付与は、オブジェクトマップ記憶部２６内の対応するブロックに対して行われる。オブジェクトマップ記憶部２６は、上述の場合６０×８０ブロックのオブジェクトマップを記憶するためのものであり、各ブロックには、ＩＤが付与されているかどうかのフラグ、ＩＤが付与されている場合にはその番号と後述のブロック動きベクトルとがブロック情報として付与される。なお、該フラグを用いずに、ＩＤ＝０のときＩＤが付与されていないと判定してもよい。また、ＩＤの最上位ビットをフラグとしてもよい。 The ID is assigned to the corresponding block in the object map storage unit 26. The object map storage unit 26 is for storing an object map of 60 × 80 blocks in the above-described case, and each block has a flag indicating whether or not an ID is assigned. The number and a block motion vector described later are given as block information. Instead of using the flag, it may be determined that no ID is assigned when ID = 0. The most significant bit of the ID may be used as a flag.

入口スリットを通過したクラスタに対しては、移動物体追跡部２７により、移動方向のブロックに対するＩＤの付与及び移動と反対方向のブロックに対するＩＤの消滅、すなわちクラスタの追跡処理が行われる。移動物体追跡部２７による追跡処理は、各クラスタについて出口スリット内まで行われる。 For the cluster that has passed through the entrance slit, the moving object tracking unit 27 assigns an ID to the block in the moving direction and extinguishes the ID of the block in the direction opposite to the movement, that is, a cluster tracking process. The tracking processing by the moving object tracking unit 27 is performed up to the inside of the exit slit for each cluster.

ＩＤ生成／消滅部２５はさらに、オブジェクトマップ記憶部２６の内容に基づき出口スリットＥＸ１〜ＥＸ４内のブロックにＩＤが付与されているかどうかを調べ、付与されていれば、出口スリットをクラスタが通過したときにそのＩＤを消滅させる。例えば図３中の出口スリットＥＸ１内のブロックにＩＤ＝３が付されている状態から、ＩＤが付されない状態に変化したときに、ＩＤ＝３を消滅させる。消滅ＩＤは、次の生成ＩＤとして用いることができる。 The ID generation / annihilation unit 25 further checks whether an ID is assigned to the blocks in the exit slits EX1 to EX4 based on the contents of the object map storage unit 26, and if so, the cluster has passed through the exit slit. Sometimes the ID disappears. For example, when the block in the exit slit EX1 in FIG. 3 changes from a state where ID = 3 is assigned to a block where no ID is assigned, ID = 3 is extinguished. The disappearance ID can be used as the next generation ID.

移動物体追跡部２７は、オブジェクトマップ記憶部２６に格納されている時刻（ｔ−１）のオブジェクトマップと、画像メモリ２１に格納されている時刻（ｔ−１）及びｔのフレーム画像とに基づいて、時刻ｔのオブジェクトマップをオブジェクトマップ記憶部２６内に作成する。以下、これを説明する。 The moving object tracking unit 27 is based on the object map at time (t−1) stored in the object map storage unit 26 and the frame image at time (t−1) and t stored in the image memory 21. Thus, an object map at time t is created in the object map storage unit 26. This will be described below.

図４〜図７はいずれも、時刻ｔ−１とｔの画像を模式的に示す。図４、図６及び図７中の点線はブロックの境界線であり、図５中の点線は画素の境界線である。 4 to 7 schematically show images at times t-1 and t. The dotted lines in FIGS. 4, 6 and 7 are block boundaries, and the dotted lines in FIG. 5 are pixel boundaries.

第ｉ行第ｊ列のブロックをＢ（ｉ，ｊ）、時刻ｔでの第ｉ行第ｊ列のブロックをＢ（ｔ：ｉ，ｊ）と表記する。ブロックＢ（ｔ−１：１，４）の動きベクトルがＭＶであるとする。ブロックＢ（ｔ−１：１，４）をＭＶ移動させた領域に最も対応する、時刻ｔのブロックを見つける。図４（Ｂ）の場合、このブロックはＢ（ｔ：１，５）である。図５に示すように、ブロックＢ（ｔ：１，５）の画像と、時刻ｔ−１のブロックサイズの領域ＡＸの画像との相関度を、所定範囲ＡＭ内で領域ＡＸを１画素移動させる毎に求める（ブロックマッチング）。 The block in the i-th row and the j-th column is denoted as B (i, j), and the block in the i-th row and the j-th column at the time t is denoted as B (t: i, j). It is assumed that the motion vector of the block B (t−1: 1, 4) is MV. Find the block at time t that most corresponds to the area where block B (t−1: 1, 4) has been moved by MV. In the case of FIG. 4B, this block is B (t: 1, 5). As shown in FIG. 5, the correlation between the image of the block B (t: 1, 5) and the image of the block size area AX at time t−1 is moved by one pixel within the predetermined range AM. Calculate every block (block matching).

範囲ＡＭはブロックよりも大きく、その一辺は例えばブロックの一辺の画素数の１．５倍である。範囲ＡＭの中心は、ブロックＢ（ｔ：１，５）の中心を略−ＭＶ移動させた位置にある画素である。 The range AM is larger than the block, and one side thereof is, for example, 1.5 times the number of pixels on one side of the block. The center of the range AM is a pixel at a position obtained by moving the center of the block B (t: 1, 5) by approximately −MV.

相関度は、例えば時空的テクスチャ相関度であり、ブロックＢ（ｔ：１，５）と領域ＡＸの対応する画素値の差の絶対値の総和である評価値ＵＤが小さいほど、大きいとする。 The correlation degree is, for example, a spatio-temporal texture correlation degree, and it is assumed that the smaller the evaluation value UD, which is the sum of absolute values of the difference between corresponding pixel values in the block B (t: 1, 5) and the region AX, is larger.

範囲ＡＭ内で相関度が最大になる領域ＡＸを求め、この領域の中心を始点としブロックＢ（１，５）の中心を終点とするベクトルを、ブロックＢ（ｔ：１，５）の動きベクトルと決定する。また、相関度が最大になる領域ＡＸに最も近い、時刻ｔ−１のブロックのＩＤを、ブロックＢ（ｔ：１，５）のＩＤと決定する。 An area AX having the maximum degree of correlation within the range AM is obtained, and a vector starting from the center of this area and ending at the center of the block B (1, 5) is defined as a motion vector of the block B (t: 1, 5). And decide. Further, the ID of the block at time t−1 that is closest to the region AX where the degree of correlation is maximum is determined as the ID of the block B (t: 1, 5).

移動物体追跡部２７は、隣り合うブロックの動きベクトルの差の絶対値が所定値以下のブロックに同一のＩＤを付与する。これにより、１つのクラスタであっても、互いに異なるＩＤをもつ複数のオブジェクト（移動物体）に分割される。図６では、オブジェクト間の境界を太線で示している。 The moving object tracking unit 27 assigns the same ID to blocks whose absolute value of the difference between the motion vectors of adjacent blocks is a predetermined value or less. Thereby, even one cluster is divided into a plurality of objects (moving objects) having different IDs. In FIG. 6, the boundary between objects is indicated by a bold line.

オブジェクトマップ上には移動物体の画像が存在しないが、図６では理解を容易にするためにオブジェクトマップ上に移動物体が模式的に描かれている。図７は、オブジェクトマップにオブジェクトの境界を太線で示したものであり、図６に対応している。 Although an image of a moving object does not exist on the object map, in FIG. 6, the moving object is schematically drawn on the object map for easy understanding. FIG. 7 shows the boundary of an object in the object map with a thick line, and corresponds to FIG.

例えば図３の入口スリットＥＮ１で１つのクラスタが検出され、複数のオブジェクトに分割されず、その後、時刻ｔ１に上記のようにして複数のオブジェクトに分割された場合、時刻ｔ１から時間を遡って、時間が正方向の場合と同様にオブジェクトマップを求めることにより、時刻ｔ１よりも前のオブジェクトマップに対し、複数のオブジェクトに分割する。これにより、分割できなかったオブジェクトを分割して認識することができ、個々のオブジェクトを追跡することができる。 For example, when one cluster is detected at the entrance slit EN1 of FIG. 3 and is not divided into a plurality of objects, and then divided into a plurality of objects as described above at time t1, the time goes back from time t1, By obtaining the object map in the same way as when the time is in the positive direction, the object map before time t1 is divided into a plurality of objects. Thereby, an object that could not be divided can be divided and recognized, and individual objects can be tracked.

特許文献１では、１つのクラスタが複数のクラスタに分離してから時間を遡って個々のオブジェクトを追跡していたが、本実施形態によれば、複数のクラスタに分離する前に、例えば図２０のｔ＝４より前のｔ＝２から、時間を遡って個々のオブジェクトを追跡することができるので、画像メモリ２１の記憶容量を低減でき、また、画像処理量を低減してＣＰＵの負担を軽くすることができる。 In Patent Document 1, individual objects are traced back in time after one cluster is separated into a plurality of clusters. However, according to the present embodiment, before separation into a plurality of clusters, for example, FIG. Since t = 2 before t = 4, each object can be traced back in time, so that the storage capacity of the image memory 21 can be reduced, and the image processing amount can be reduced to reduce the burden on the CPU. Can be lightened.

上記説明では、クラスタ内のブロックの動きベクトルが求まる場合について説明したが、図９（Ａ）に示すように、動きベクトルが求まらないブロックが存在する場合、その位置によっては該ブロックがどちらのオブジェクトに属するのか不明である。ある移動物体に属するブロック内の各画素の色がほぼ同一であった場合、上述のブロックマッチングにより動きベクトルを決定することができない。例えば、画像（空間的差分フレーム画像）を２値画像に変換し、ブロック内に‘１’の画素の数が所定値以下であれば、上記方法により動きベクトルを求めるのに適しないブロックと判定する。 In the above description, the case where the motion vector of the block in the cluster is obtained has been described. However, as shown in FIG. 9A, when there is a block for which the motion vector cannot be obtained, which block depends on the position. It is unknown whether it belongs to any object. When the color of each pixel in a block belonging to a certain moving object is almost the same, the motion vector cannot be determined by the block matching described above. For example, if an image (spatial difference frame image) is converted into a binary image, and the number of “1” pixels in the block is equal to or less than a predetermined value, the block is determined to be unsuitable for obtaining a motion vector by the above method. To do.

このようなブロックの動きベクトルを、図８に示す方法で推定する。 The motion vector of such a block is estimated by the method shown in FIG.

（Ｓ１）未定の動きベクトルが存在すればステップＳ２へ進み、そうでなければ未定動きベクトル推定処理を終了する。 (S1) If there is an undetermined motion vector, the process proceeds to step S2, and if not, the undetermined motion vector estimation process is terminated.

（Ｓ２）動きベクトルが未定のブロックＢ（ｉ，ｊ）の回りの８個のブロックのうち、決定されている動きベクトルＭＶ１〜ＭＶｎを取り出す。 (S2) The determined motion vectors MV1 to MVn are extracted from the eight blocks around the block B (i, j) whose motion vector is undetermined.

（Ｓ３）ステップＳ２で決定済動きベクトルが存在すればステップＳ４へ進み、そうでなければステップＳ６へ進む。 (S3) If there is a motion vector determined in step S2, the process proceeds to step S4, and if not, the process proceeds to step S6.

（Ｓ４）動きベクトルＭＶ１〜ＭＶｎを、ベクトル間の差の絶対値が所定値以内のグループに分ける。 (S4) The motion vectors MV1 to MVn are divided into groups in which the absolute value of the difference between the vectors is within a predetermined value.

（Ｓ５））動きベクトル数が最大のグループの動きベクトルの平均値を、ブロックＢ（ｉ，ｊ）の動きベクトルと推定する。動きベクトル数が最大のグループが複数存在する場合、任意の１つのグループの動きベクトルの平均値を、ブロックＢ（ｉ，ｊ）の動きベクトルと推定する。次にステップＳ１へ戻る。 (S5)) The average value of the motion vectors of the group having the largest number of motion vectors is estimated as the motion vector of the block B (i, j). When there are a plurality of groups having the largest number of motion vectors, the average value of the motion vectors of any one group is estimated as the motion vector of the block B (i, j). Next, the process returns to step S1.

なお、同一グループの動きベクトルは互いに略等しいので、該同一グループの動きベクトルの任意の１つをブロックＢ（ｉ，ｊ）の動きベクトルと推定してもよい。 Since the motion vectors of the same group are substantially equal to each other, any one of the motion vectors of the same group may be estimated as the motion vector of the block B (i, j).

（Ｓ６）ステップＳ５で推定された動きベクトルを、決定された動きベクトルとみなし、ステップＳ１へ戻る。 (S6) The motion vector estimated in step S5 is regarded as the determined motion vector, and the process returns to step S1.

このような処理により、未定動きベクトルを一意的に推定することができる。 By such processing, the undetermined motion vector can be estimated uniquely.

次に具体例を説明する。図９（Ａ）において、第ｉ行第ｊ列のブロックＢ（ｉ，ｊ）の動きベクトルをＭＶ（ｉ，ｊ）と表記する。図９（Ａ）では、ブロックＢ（２，２）、Ｂ（２，４）及びＢ（３，３）の動きベクトルが未定である。 Next, a specific example will be described. In FIG. 9A, the motion vector of the block B (i, j) in the i-th row and j-th column is denoted as MV (i, j). In FIG. 9A, the motion vectors of the blocks B (2,2), B (2,4) and B (3,3) are undetermined.

ブロックＢ（２，２）の回りのブロックの動きベクトルは、ＭＶ（２，１）、ＭＶ（３，１）、ＭＶ（３，２）及びＭＶ（２，３）のグループと、ＭＶ（１，２）及びＭＶ（１，３）のグループに分けられるので、前者のグループを選択し、
ＭＶ（２，２）＝（ＭＶ（２，１）＋ＭＶ（３，１）＋ＭＶ（３，２）＋ＭＶ（２，３））／４
と推定する。 The motion vectors of the blocks around the block B (2, 2) are the group of MV (2, 1), MV (3, 1), MV (3, 2) and MV (2, 3), and MV (1 , 2) and MV (1, 3), so select the former group,
MV (2,2) = (MV (2,1) + MV (3,1) + MV (3,2) + MV (2,3)) / 4
Estimated.

ブロックＢ（２，４）の回りのブロックの動きベクトルは、ＭＶ（２，３）、ＭＶ（３，４）及びＭＶ（３，５）のグループと、ＭＶ（１，３）、ＭＶ（１，４）、ＭＶ（１，５）及びＭＶ（２，５）のグループに分けられるので、後者のグループを選択し、
ＭＶ（２，４）＝（ＭＶ（１，３）＋ＭＶ（１，４）＋ＭＶ（１，５）＋ＭＶ（２，５））／４
と推定する。 The motion vectors of the blocks around the block B (2, 4) are a group of MV (2, 3), MV (3, 4) and MV (3, 5), and MV (1, 3), MV (1 4), MV (1,5) and MV (2,5), so select the latter group,
MV (2,4) = (MV (1,3) + MV (1,4) + MV (1,5) + MV (2,5)) / 4
Estimated.

ブロックＢ（３，３）の回りのブロックの動きベクトルは、ＭＶ（２，３）、ＭＶ（３，２）、ＭＶ（４，２）、ＭＶ（４，４）及びＭＶ（３，４）の１グループであるので、
ＭＶ（３，３）＝（ＭＶ（２，３）＋ＭＶ（３，２）＋ＭＶ（４，２）＋ＭＶ（４，４）＋ＭＶ（３，４））／５
と推定する。 The block motion vectors around block B (3, 3) are MV (2, 3), MV (3, 2), MV (4, 2), MV (4, 4) and MV (3,4). Because it is one group of
MV (3,3) = (MV (2,3) + MV (3,2) + MV (4,2) + MV (4,4) + MV (3,4))) / 5
Estimated.

このようにして、図９（Ｂ）に示すようなオブジェクトマップが生成される。図９（Ｂ）では、オブジェクトの境界を太線で示している。 In this way, an object map as shown in FIG. 9B is generated. In FIG. 9B, the boundary of the object is indicated by a bold line.

図１０（Ａ）のように未定動きベクトルの数が多い場合であっても、ステップＳ３で否定判定されるまで、ステップＳ１〜Ｓ５を繰り返すと、一意的に動きベクトルが推定されて図１０（Ｂ）のようになる。次に、ステップＳ６で推定動きベクトルを、決定された動きベクトルとみなして、再度ステップＳ１〜Ｓ５を実行することにより、ブロックＢ（３，４）の動きベクトルが一意的に推定されて、図１０（Ｃ）のようになる。次に、隣り合うブロックの動きベクトルの差の絶対値が所定値以下のブロックに同一のＩＤを付与することにより、１つのクラスタが、互いに異なるＩＤをもつ複数のオブジェクトに分割される。 Even when the number of undetermined motion vectors is large as shown in FIG. 10A, when steps S1 to S5 are repeated until a negative determination is made in step S3, motion vectors are uniquely estimated and FIG. B). Next, in step S6, the estimated motion vector is regarded as the determined motion vector, and by executing steps S1 to S5 again, the motion vector of the block B (3,4) is uniquely estimated. 10 (C). Next, by assigning the same ID to blocks whose absolute values of motion vector differences between adjacent blocks are equal to or less than a predetermined value, one cluster is divided into a plurality of objects having different IDs.

なお、移動物体追跡部２７は、オブジェクトマップ記憶部２６に格納されているオブジェクトマップの時系列を、追跡結果として不図示のハードディスクに格納する。 The moving object tracking unit 27 stores the time series of the object map stored in the object map storage unit 26 in a hard disk (not shown) as a tracking result.

上述したような処理により、オブジェクトマップ記憶部２６には、時系列画像の各画像が複数のブロックに分割され、当該画像中の移動物体を示す識別符号が移動物体に対応するブロックに付けられているとともに、ブロックに対応する移動物体の動きベクトルがブロックに付けられて記憶されている。 Through the processing described above, each image of the time-series image is divided into a plurality of blocks in the object map storage unit 26, and an identification code indicating the moving object in the image is attached to the block corresponding to the moving object. In addition, the motion vector of the moving object corresponding to the block is attached to the block and stored.

そして移動物体追跡部２７は、上述したように時系列画像を画像処理した結果に基づいて、オブジェクトマップ記憶部２６に記憶されているブロックの識別符号と動きベクトルとを更新する。具体的には、移動物体追跡部２７は、次の（１）から（４）の手順により、オブジェクトマップ記憶部２６に記憶されているブロックの識別符号と動きベクトルとを更新する。 Then, the moving object tracking unit 27 updates the block identification code and the motion vector stored in the object map storage unit 26 based on the result of image processing of the time-series image as described above. Specifically, the moving object tracking unit 27 updates the block identification code and the motion vector stored in the object map storage unit 26 according to the following procedures (1) to (4).

（１）時系列画像のうち連続するＮ画像（Ｎ≧２）の各々について、隣り合うブロックの動きベクトルの差の絶対値が所定値以内のブロックに同一の識別符号を付けることにより、画像上で互いに重なった移動物体に互いに異なる識別符号を付ける識別符号手順。 (1) For each of consecutive N images (N ≧ 2) of time-series images, by adding the same identification code to blocks whose absolute values of motion vector differences between adjacent blocks are within a predetermined value, An identification code procedure for attaching different identification codes to moving objects that overlap each other.

（２）Ｎ画像の各々において、第１識別符号が付けられたブロック群である第１オブジェクトと第２識別符号が付けられたブロック群である第２オブジェクトとが接し、かつ、Ｎ画像について時間的に隣り合う画像の第１オブジェクト間の相関度が所定値以上であるか否かを判定する判定手順。 (2) In each of the N images, a first object that is a block group to which a first identification code is attached is in contact with a second object that is a block group to which a second identification code is attached, and the time for the N image A determination procedure for determining whether or not the degree of correlation between first objects of adjacent images is equal to or greater than a predetermined value.

（３）判定手順で肯定と判定された後に、時間を遡って第１オブジェクトと第２オブジェクトとを追跡する追跡手順。 (3) A tracking procedure in which the first object and the second object are traced back in time after it is determined affirmative in the determination procedure.

（４）追跡手順により時間を遡って追跡された第１オブジェクトと第２オブジェクトとに基づいて、オブジェクトマップ記憶部２６に記憶されているブロックの識別符号と動きベクトルとを更新する更新手順。 (4) An update procedure for updating the block identification code and the motion vector stored in the object map storage unit 26 based on the first object and the second object tracked back in time by the tracking procedure.

輪郭抽出部３０は、時系列画像の画像から移動物体の輪郭を抽出する。また、この輪郭抽出部３０は、オクルージョンが発生している複数の移動物体を一体の移動物体とし、時系列画像の画像から一体とした移動物体の輪郭を抽出する。 The contour extraction unit 30 extracts the contour of the moving object from the time-series image. The contour extracting unit 30 extracts a plurality of moving objects having occlusions as a single moving object, and extracts a contour of the moving object as a single unit from a time-series image.

この輪郭抽出部３０は、対象領域設定部３０１と、対象領域補正部３０２と、輪郭抽出処理部３０３とを、有している。 The contour extraction unit 30 includes a target region setting unit 301, a target region correction unit 302, and a contour extraction processing unit 303.

対象領域設定部３０１は、オブジェクトマップ記憶部２６に記憶されている移動物体のブロックに対応する画像領域に基づいて、移動物体の輪郭を抽出する対象領域を設定する。 The target area setting unit 301 sets a target area for extracting the contour of the moving object based on the image area corresponding to the block of the moving object stored in the object map storage unit 26.

輪郭抽出処理部３０３は、対象領域設定部３０１が設定した対象領域に対して、時系列画像の画像から移動物体の輪郭を抽出する。 The contour extraction processing unit 303 extracts the contour of the moving object from the time-series image for the target region set by the target region setting unit 301.

対象領域補正部３０２は、時系列画像の画像をエッジ抽出処理した画像に対して、対象領域設定部３０１が設定した対象領域内でエッジに対応する画素数の個数についてのヒストグラムを座標軸毎に射影して生成し、当該座標軸毎に生成したヒストグラムに基づいて対象領域設定部３０１が設定した対象領域を座標軸毎に補正する。 The target area correction unit 302 projects, for each coordinate axis, a histogram of the number of pixels corresponding to the edges in the target area set by the target area setting unit 301 with respect to an image obtained by performing edge extraction processing on a time-series image. The target area set by the target area setting unit 301 is corrected for each coordinate axis based on the histogram generated for each coordinate axis.

そして上述した輪郭抽出処理部３０３は、対象領域補正部３０２が補正した対象領域に対して、時系列画像の画像から移動物体の輪郭を抽出してもよい。 The contour extraction processing unit 303 described above may extract the contour of the moving object from the time-series image for the target region corrected by the target region correction unit 302.

補正部３１は、輪郭抽出部３０の輪郭抽出処理部３０３が抽出した輪郭に基づいて、オブジェクトマップ記憶部２６に記憶されているブロックの識別符号および動きベクトルを補正する。 The correction unit 31 corrects the block identification code and the motion vector stored in the object map storage unit 26 based on the contour extracted by the contour extraction processing unit 303 of the contour extraction unit 30.

また、この補正部３１は、オブジェクトマップ記憶部２６に記憶されているオクルージョンが発生している複数の移動物体に対応する識別符号に基づいた当該複数の移動物体の境界を示す情報と、輪郭抽出部３０によりオクルージョンが発生している複数の移動物体を一体とした抽出された移動物体の輪郭とに基づいて、オブジェクトマップ記憶部２６に記憶されているブロックの識別符号および動きベクトルを補正する。 In addition, the correction unit 31 includes information indicating boundaries of the plurality of moving objects based on identification codes corresponding to the plurality of moving objects in which occlusion occurs and stored in the object map storage unit 26, and contour extraction The block identification code and the motion vector stored in the object map storage unit 26 are corrected based on the extracted outline of the moving object obtained by integrating the plurality of moving objects in which occlusion has occurred.

判定部３２は、背景画像が変動しているか否かを判定する。たとえば判定部３２は、カメラ１０がパンニングまたはズーミングされたことを示す信号がカメラ１０から入力され、この信号に基づいて、背景画像が変動しているか否かを判定する。 The determination unit 32 determines whether or not the background image is changing. For example, the determination unit 32 receives a signal indicating that the camera 10 has been panned or zoomed from the camera 10, and determines whether or not the background image has changed based on this signal.

または判定部３２は、背景画像生成部２４が生成した背景画像と、カメラ１０から入力される画像とを比較して、背景画像が変動しているか否かを判定してもよい。または、背景となる領域に予めマーカが埋め込まれており、判定部３２は、カメラ１０から入力される画像に含まれているマーカの画像における位置の変動を検出することにより、背景画像が変動しているか否かを判定してもよい。 Alternatively, the determination unit 32 may compare the background image generated by the background image generation unit 24 with the image input from the camera 10 and determine whether or not the background image has changed. Alternatively, a marker is embedded in the background region in advance, and the determination unit 32 detects a change in the position of the marker image included in the image input from the camera 10, thereby changing the background image. It may be determined whether or not.

制御部３４（第１の制御部）は、判定部３２が背景画像が変動していると判定した場合に、たとえば予め定められている所定期間毎にまたは所定フレーム毎に、輪郭抽出部３０を制御して輪郭を抽出させ、補正部３１を制御してオブジェクトマップ記憶部２６に記憶されているブロックの識別符号および動きベクトルを補正させる。 When the determination unit 32 determines that the background image is fluctuating, the control unit 34 (first control unit) controls the contour extraction unit 30 for example every predetermined period or every predetermined frame. The contour is extracted by control, and the correction unit 31 is controlled to correct the block identification code and the motion vector stored in the object map storage unit 26.

移動物体変動量検出部３３については、後述する。 The moving object fluctuation amount detection unit 33 will be described later.

＜背景ブロック＞
上記の説明においては、ブロック単位で背景画像と比較することにより、オブジェクトが存在するかどうかを調べているので、背景画像を特別扱いしなければならない。また、例えば過去１０分間の撮影画像に基づいて背景画像を生成しているので、カメラが揺れた場合には、この揺れを背景画像に反映させることができない。 <Background block>
In the above description, whether or not an object exists is examined by comparing it with the background image in units of blocks, so the background image must be treated specially. Further, for example, since the background image is generated based on the photographed images for the past 10 minutes, when the camera shakes, this shake cannot be reflected in the background image.

そこで、背景画像もオブジェクトとみなしてオブジェクトマップを作成してもよい。オブジェクトマップ生成方法は、背景画像と比較してブロックに移動物体が存在するかどうかを判定する点のみが異なる。背景画像もオブジェクトとみなすので、全てのブロックについて、ブロックマッチングを行うことによりＩＤを付与しＭＶを決定する。 Therefore, the object map may be created by regarding the background image as an object. The object map generation method is different from the background image only in determining whether or not a moving object exists in the block. Since the background image is also regarded as an object, block matching is performed on all the blocks to assign IDs and determine MVs.

なお、この背景画像には、背景画像に対して予め定められているＩＤを付与するようにしてもよい。この予め定められているＩＤにより背景画像と、移動物体とが識別しやすくなる。 The background image may be given a predetermined ID for the background image. The background ID and the moving object can be easily identified by this predetermined ID.

このように、背景画像を１つのブロックとして、背景画像に対してＩＤを付与するようにしても、背景画像と移動物体との間で図４から図７に示したようにブロックの属する画像を判定することが可能である。 As described above, even if the background image is set as one block and an ID is given to the background image, the image to which the block belongs is displayed between the background image and the moving object as shown in FIGS. It is possible to determine.

このように、背景画像を１つのブロックとすることにより、カメラがパンニングまたはズーミングされたことに応じて背景画像が変動する場合においても、背景画像が固定されていた場合と同様に処理できるようになる。 In this way, by making the background image into one block, even when the background image fluctuates according to the panning or zooming of the camera, it can be processed in the same manner as when the background image is fixed. Become.

＜移動物体追跡装置２０に動作＞
次に図１１と図１２とを用いて、移動物体追跡装置２０の動作ついて説明する。まず図１１を用いて、移動物体追跡装置２０に動作モードについて説明する。 <Operation on Moving Object Tracking Device 20>
Next, the operation of the moving object tracking device 20 will be described with reference to FIGS. 11 and 12. First, the operation mode of the moving object tracking device 20 will be described with reference to FIG.

まず、移動物体追跡装置２０は、カメラ固定モードの状態にあるとする。このカメラ固定モードにおいては、背景画像はオブジェクトとしてのＩＤが付与されることなしに、背景画像生成部２４により生成されている画像を背景画像として用いている。これはカメラが固定されているため、背景画像に変動がないためである。 First, it is assumed that the moving object tracking device 20 is in the camera fixing mode. In this camera fixing mode, an image generated by the background image generation unit 24 is used as a background image without being given an ID as an object. This is because the camera is fixed and the background image does not change.

このカメラ固定モードの状態にある場合には、背景画像が固定されているために、移動物体追跡部２７により移動物体を識別されている。 In the camera fixing mode, the moving object tracking unit 27 identifies the moving object because the background image is fixed.

次にカメラがパンニングまたはズーミングされたことに応じて、判定部３２が、背景画像が変動していることを判定する。この判定に応じて制御部３４が、カメラ固定モードから、カメラ変動モードに遷移させるとともに（ステップＳ１）、背景画像をオブジェクトに登録してＩＤを付与する。 Next, in response to the panning or zooming of the camera, the determination unit 32 determines that the background image has changed. In response to this determination, the control unit 34 shifts from the camera fixing mode to the camera variation mode (step S1), registers the background image in the object, and assigns an ID.

カメラ変動モードにおいて、予め定められている所定期間毎にまたは所定フレーム毎に、制御部３４は、補正モードに遷移させる（ステップＳ２）。この補正モードにおいて、制御部３４は、輪郭抽出部３０を制御して輪郭を抽出させ、補正部３１を制御してオブジェクトマップ記憶部２６に記憶されているブロックの識別符号および動きベクトルを補正させる。 In the camera variation mode, the control unit 34 makes a transition to the correction mode every predetermined period or every predetermined frame (step S2). In this correction mode, the control unit 34 controls the contour extraction unit 30 to extract a contour, and controls the correction unit 31 to correct the block identification code and motion vector stored in the object map storage unit 26. .

制御部３４は、補正モードにおいて補正が完了すると、補正モードからカメラ変動モードに遷移させる（ステップＳ３）。 When the correction is completed in the correction mode, the control unit 34 transitions from the correction mode to the camera fluctuation mode (step S3).

その後、カメラ１０が変動している期間においては、制御部３４は、カメラ変動モードと補正モードとを交互に遷移させる。 Thereafter, during a period in which the camera 10 is changing, the control unit 34 alternately changes between the camera change mode and the correction mode.

その後、カメラ１０の変動が停止したことを判定部３２が判定したことに応じて、制御部３４は、背景画像に付与されているＩＤを削除し、背景画像生成部２４により生成されている画像を背景画像として用いる。 Thereafter, in response to the determination unit 32 determining that the fluctuation of the camera 10 has stopped, the control unit 34 deletes the ID assigned to the background image, and the image generated by the background image generation unit 24. Is used as a background image.

なお、背景画像生成部２４により背景画像が生成されるには、たとえば１０分などの所定の期間を要する。そのため、背景画像生成部２４により背景画像が生成されるまでは、背景画像をオブジェクトとして用いることが望ましい。 In addition, for the background image generation unit 24 to generate the background image, a predetermined period such as 10 minutes is required. Therefore, it is desirable to use the background image as an object until the background image is generated by the background image generation unit 24.

次に図１２を用いて、図１１の補正モードにおいて、制御部３４が、輪郭抽出部３０を制御して輪郭を抽出させ、補正部３１を制御してオブジェクトマップ記憶部２６に記憶されているブロックの識別符号および動きベクトルを補正させる動作について説明する。 Next, referring to FIG. 12, in the correction mode of FIG. 11, the control unit 34 controls the contour extraction unit 30 to extract a contour, controls the correction unit 31, and is stored in the object map storage unit 26. An operation for correcting the block identification code and the motion vector will be described.

まず、オクルージョン検出部３５が、オブジェクトマップ記憶部２６に記憶されているオブジェクトマップに基づいて、オクルージョンの発生の有無を判定する（ステップＳ１２０１）。 First, the occlusion detection unit 35 determines whether or not occlusion has occurred based on the object map stored in the object map storage unit 26 (step S1201).

このステップＳ１２０１でオクルージョンの発生が無いと判定された場合には、対象領域設定部３０１が、オクルージョンが無い移動物体に対して対象領域を設定する（ステップＳ１２０２）。 If it is determined in step S1201 that no occlusion has occurred, the target area setting unit 301 sets a target area for a moving object without occlusion (step S1202).

次に、対象領域補正部３０２が、上述したヒストグラムを生成し、生成したヒストグラムに基づいて対象領域設定部３０１が設定した対象領域を座標軸毎に補正する（ステップＳ１２０３）。 Next, the target region correction unit 302 generates the above-described histogram, and corrects the target region set by the target region setting unit 301 for each coordinate axis based on the generated histogram (step S1203).

次に輪郭抽出処理部３０３は、対象領域補正部３０２が補正した対象領域に対して、時系列画像の画像から移動物体の輪郭を抽出する（ステップＳ１２０４）。 Next, the contour extraction processing unit 303 extracts the contour of the moving object from the time-series image for the target region corrected by the target region correction unit 302 (step S1204).

次に補正部３１は、輪郭抽出部３０が抽出した輪郭に基づいて、オブジェクトマップ記憶部２６に記憶されているブロックの識別符号および動きベクトルを補正する（ステップＳ１２０５）。 Next, the correction unit 31 corrects the block identification code and the motion vector stored in the object map storage unit 26 based on the contour extracted by the contour extraction unit 30 (step S1205).

一方ステップＳ１２０１でオクルージョンの発生があると判定された場合には、輪郭抽出部３０の対象領域設定部３０１が、オクルージョンが発生している複数の移動物体を一体の移動物体とする（ステップＳ１２１２）。 On the other hand, if it is determined in step S1201 that occlusion has occurred, the target area setting unit 301 of the contour extraction unit 30 sets a plurality of moving objects in which occlusion has occurred as an integrated moving object (step S1212). .

次に対象領域設定部３０１が、一体とした移動物体に対して対象領域を設定する（ステップＳ１２１３）。 Next, the target area setting unit 301 sets a target area for an integrated moving object (step S1213).

次に、対象領域補正部３０２が、一体とした移動物体に対して上述したヒストグラムを生成し、生成したヒストグラムに基づいて対象領域設定部３０１が設定した対象領域を座標軸毎に補正する（ステップＳ１２１４）。 Next, the target area correction unit 302 generates the above-described histogram for the moving object integrated, and corrects the target area set by the target area setting unit 301 for each coordinate axis based on the generated histogram (step S1214). ).

次に輪郭抽出処理部３０３は、対象領域補正部３０２が補正した対象領域に対して、時系列画像の画像から、一体とした移動物体移動物体の輪郭を抽出する（ステップＳ１２１５）。 Next, the contour extraction processing unit 303 extracts the contour of the moving object integrated with the target region corrected by the target region correction unit 302 from the image of the time-series image (step S1215).

次に補正部３１は、図４から図７を用いて説明したようにして検出されているオクルージョンが生じている移動物体の境界と、輪郭抽出部３０が抽出した輪郭とに基づいて、オブジェクトマップ記憶部２６に記憶されているブロックの識別符号および動きベクトルを補正する（ステップＳ１２１６）。 Next, the correction unit 31 performs an object map based on the boundary of the moving object where the occlusion detected as described with reference to FIGS. 4 to 7 and the contour extracted by the contour extraction unit 30 are performed. The block identification code and the motion vector stored in the storage unit 26 are corrected (step S1216).

上記に説明した処理が、オブジェクトマップ記憶部２６に記憶されている全ての移動物体に対して終了した後、制御部３４は、補正モードからカメラ変動モードへと遷移させる。 After the processing described above is completed for all moving objects stored in the object map storage unit 26, the control unit 34 transitions from the correction mode to the camera variation mode.

以上図１１と図１２とを用いて説明したようにして、移動物体追跡装置２０は、カメラが固定している場合でも変動している場合でも、移動物体を追跡することができる。なおカメラ固定モードでは背景画像をオブジェクトに登録していないことにより、図４から図７に示すような処理を背景画像のオブジェクトに対して実行する必要がないために、移動物体追跡装置２０における処理量または負荷を軽減することができる。 As described above with reference to FIGS. 11 and 12, the moving object tracking device 20 can track a moving object regardless of whether the camera is fixed or fluctuating. In the camera fixed mode, since the background image is not registered in the object, it is not necessary to execute the processes shown in FIGS. 4 to 7 on the object of the background image. The amount or load can be reduced.

以降、オブジェクトマップ記憶部２６に記憶されているブロック毎の識別符号および動きベクトルを、「時空間ＭＲＦ（ＭａｒｋｏｖＲａｎｄｏｍＦｉｅｌｄ）」と称して説明する。次に、上記図１２を用いて説明した動作とその結果について、図１３から図１９を用いて具体的に説明する。 Hereinafter, the identification code and motion vector for each block stored in the object map storage unit 26 will be referred to as “space-time MRF (Markov Random Field)”. Next, the operation and the result described with reference to FIG. 12 will be specifically described with reference to FIGS.

＜Ｓｎａｋｅｓ＞
まず、輪郭抽出部３０による画像から移動物体の輪郭を抽出について説明する。輪郭抽出部３０の構成のうち、まず、輪郭抽出処理部３０３による移動物体の輪郭を抽出する一例としての技術について詳述する。ここでは、輪郭を抽出する技術としてＳｎａｋｅｓ（文献１参照）を用いる場合について説明する。 <Snakes>
First, extraction of the contour of the moving object from the image by the contour extraction unit 30 will be described. Of the configuration of the contour extraction unit 30, first, a technique as an example of extracting the contour of a moving object by the contour extraction processing unit 303 will be described in detail. Here, the case where Snakes (refer literature 1) is used as a technique which extracts an outline is demonstrated.

（文献１）Kass et.al “Snakes: Active contour models”, Proc. of 1st ICCV, pp.259-268, 1987 (Reference 1) Kass et.al “Snakes: Active contour models”, Proc. Of 1st ICCV, pp.259-268, 1987

最初に、このＳｎａｋｅｓの概要について説明する。一般にＳｎａｋｅｓは、画像平面（ｘ，ｙ）上で媒介変数表現されたスプライン（制御点の集合）ｖ（ｓ）＝（ｘ（ｓ）、ｙ（ｓ））（０≦ｓ≦１）を、次式（４）で定義されるエネルギー関数を最小化するように変形し、エネルギーの極小状態としてその形状が決まる輪郭抽出のモデルである。 First, an outline of the Snakes will be described. In general, Snakes expresses a spline (set of control points) v (s) = (x (s), y (s)) (0 ≦ s ≦ 1) expressed as a parameter on the image plane (x, y), This is a contour extraction model that is deformed so as to minimize the energy function defined by the following equation (4) and whose shape is determined as a minimum state of energy.

この式（４）の第一項Ｅｉｎｔは、内部エネルギーと呼ばれ、これにより、Ｓｎａｋｅｓのスプラインが凸型に滑らかに収縮する性質をもつ。理論上の定義は次式（５）で表され、式（５）における第一項によりスプラインが凸型に滑らかになり、第二項によりスプラインが収縮する性質をもつ。 The first term Eint of the equation (4) is called internal energy, and has the property that the Snakes spline contracts smoothly into a convex shape. The theoretical definition is expressed by the following equation (5). The spline is smoothed by the first term in equation (5), and the spline is contracted by the second term.

次に、式（４）の第二項Ｅｉｍａｇｅは画像エネルギーと呼ばれ、スプライン全体として、エッジ（輪郭などの輝度の勾配が大きい箇所）上に存在する割合が大きいほど、その値が小さくなるという性質をもつ。本エネルギーは画像の輝度Ｉ（ｖ（ｓ））により次式（６）で定義する。今回、照度によらず安定して輪郭エッジを抽出するために、これまで我々が開発してきた照度不変フィルタ画像を画像エネルギーとして用いた。 Next, the second term Eimage of Equation (4) is called image energy, and the larger the ratio of the spline as a whole on the edge (the portion where the luminance gradient such as the outline is large), the smaller the value. Has properties. This energy is defined by the following equation (6) according to the luminance I (v (s)) of the image. This time, in order to extract contour edges stably regardless of illuminance, we used the illuminance invariant filter image that we have developed so far as image energy.

ここで、照度不変フィルタ画像とは、画像変換部２２により上述した式１から式３により変換された画像のことである。 Here, the illuminance invariant filter image is an image converted by the image conversion unit 22 according to Expressions 1 to 3 described above.

そして、式（４）の第三項Ｅｃｏｎは外部エネルギーと言われ、Ｓｎａｋｅｓに外部から強制的に力を働かせる場合に用いる。このエネルギーは必要に応じて定義できる。今回は、内部エネルギーに用いられるＥｉｎｔの影響で抽出が困難であった凹形状の輪郭抽出用に提案された面積項（文献２参照）を外部エネルギーとして定義した。面積項Ｅａｒｅａは次式（７）で導出される（図１３参照）。 The third term Econ in the formula (4) is called external energy, and is used when force is applied to Snakes from the outside. This energy can be defined as needed. This time, the area term proposed for extracting the contour of the concave shape, which was difficult to extract due to the influence of Eint used for internal energy (see Document 2), was defined as external energy. The area term Earea is derived by the following equation (7) (see FIG. 13).

（文献２）荒木昭一, 横矢直和, 岩佐英彦, 竹村治雄:“複数物体の抽出を目的とした交差判定により分裂する動的輪郭モデル”, 電子情報通信学会論文誌(D-II)Vol.J79-DII, No.10,pp1704-1711(Oct,1996) (Reference 2) Shoichi Araki, Naokazu Yokoya, Hidehiko Iwasa, Haruo Takemura: “A dynamic contour model that splits by intersection detection for the purpose of extracting multiple objects”, IEICE Transactions (D-II) Vol. J79-DII, No.10, pp1704-1711 (Oct, 1996)

図１４にＳｎａｋｅｓ処理結果を示す。まず、輪郭を抽出したい対象の周囲に、初期制御点を配置する（図１４（ａ））。次に、Ｓｎａｋｅｓは収縮を始め（図１４（ｂ））、最後に輪郭線付近で収縮が止まる（図１４（ｃ））。 FIG. 14 shows the Snakes processing result. First, initial control points are arranged around the object whose contour is to be extracted (FIG. 14A). Next, Snakes starts to contract (FIG. 14B), and finally stops contracting near the contour line (FIG. 14C).

ここで、この図１４に示す初期制御点が、対象領域設定部３０１が設定した対象領域に対応する。 Here, the initial control points shown in FIG. 14 correspond to the target area set by the target area setting unit 301.

＜局所領域におけるエッジ分布の解析＞
次に、局所領域におけるエッジ分布の解析について説明する。Ｓｎａｋｅｓは，エネルギー関数Ｅ_{ｓｎａｋｅｓ}を最小化するようにスプラインを変形し，極小状態をもってその探索を終えるアルゴリズムである。背景エッジが多く含まれる場合に初期制御点を輪郭から離れた場所に配置すると、スプラインがオブジェクトの輪郭線に収束する前に背景エッジに捉われ、エネルギーが極小状態になり、オブジェクトの輪郭線抽出に失敗してしまうことがある。 <Analysis of edge distribution in local region>
Next, analysis of edge distribution in the local region will be described. Snakes is an algorithm that transforms a spline so as to minimize the energy function _{E_snakes} and ends the search with a minimal state. If there are many background edges and the initial control points are placed away from the outline, the spline will be caught by the background edges before converging on the object outline, and the energy will be minimal, extracting the object outline May fail.

そのため、初期制御点をある程度オブジェクトの輪郭付近に配置しなければならない。一方、時空間ＭＲＦで得られる人物の局所領域は、パンニング中は背景オブジェクトと人物オブジェクトの境界が曖昧になることが時折見受けられる。 Therefore, the initial control points must be arranged to some extent near the contour of the object. On the other hand, in the local region of the person obtained by the spatio-temporal MRF, it is sometimes seen that the boundary between the background object and the person object becomes ambiguous during panning.

そこで、オブジェクトマップによって得られる人物オブジェクトの周辺矩形内におけるエッジ分布（照度不変フィルタ画像の２値分布）を解析し、人物領域を推定し、その周囲に初期制御点を配置することで、輪郭抽出の精度を向上する。エッジ分布の解析は、水平軸、鉛直軸にそれぞれ射影し、ヒストグラムを生成することで解析を行う（図１５参照）。 Therefore, by analyzing the edge distribution (binary distribution of the illuminance invariant filter image) in the surrounding rectangle of the human object obtained by the object map, estimating the human area and placing the initial control points around it, the contour extraction Improve accuracy. The analysis of the edge distribution is performed by projecting onto the horizontal axis and the vertical axis and generating a histogram (see FIG. 15).

次に、一例としてのエッジ分布解析の手順について説明する。エッジ分布解析は、次のＳＴＥＰ１からＳＴＥＰ３の手順により実行される。 Next, an example of an edge distribution analysis procedure will be described. The edge distribution analysis is executed by the following steps STEP1 to STEP3.

（ＳＴＥＰ１：エッジ分布の前処理）
エッジ画像をラベリングし、面積の小さいものをノイズとして除外する。 (STEP 1: Preprocessing of edge distribution)
The edge image is labeled, and a small area is excluded as noise.

（ＳＴＥＰ２：水平軸ヒストグラム解析）
ＳＴＥＰ１で得られたエッジ分布から水平軸ヒストグラムを生成する。人物は縦に連続する長いエッジを持つことから、縦方向の連続性の弱いエッジを除去した後、一度水平軸に正射影し、縦方向の強い分布から人物の水平領域を絞る。 (STEP 2: Horizontal axis histogram analysis)
A horizontal axis histogram is generated from the edge distribution obtained in STEP 1. Since a person has long continuous edges, after removing edges with low vertical continuity, the person is orthogonally projected once to narrow down the horizontal region of the person from a strong vertical distribution.

その後、絞られた領域内において、得られたエッジ分布を水平軸に射影し、ヒストグラムを生成する。そして、得られた水平軸ヒストグラムにおいてウインドウをスキャンし、連続して分布する領域を求めることで、水平成分における人物エッジの分布領域の推定を行う（図１５（ａ））。 Thereafter, the obtained edge distribution is projected onto the horizontal axis in the narrowed region, and a histogram is generated. Then, a window is scanned in the obtained horizontal axis histogram to obtain a continuously distributed region, thereby estimating a human edge distribution region in the horizontal component (FIG. 15A).

（ＳＴＥＰ３：鉛直軸ヒストグラム解析）
ＳＴＥＰ１で得られたエッジ分布から鉛直軸ヒストグラムを生成する。人物は横方向にもある程度長い連続エッジを持つことから、横方向の連続性の弱いエッジを除去した後、一度鉛直軸に射影し、横方向の分布から人物の鉛直領域を絞る。 (STEP3: Vertical axis histogram analysis)
A vertical axis histogram is generated from the edge distribution obtained in STEP 1. Since a person has continuous edges that are somewhat long in the horizontal direction, after removing edges with low horizontal continuity, the person is projected once onto the vertical axis, and the vertical region of the person is narrowed down from the distribution in the horizontal direction.

その後、絞られた領域内において、エッジ分布を鉛直軸に射影し、ヒストグラムを生成する。そして、得られた水平軸ヒストグラムにおいてウインドウをスキャンし、連続して分布する領域を求めることで、水平成分における人物エッジの分布領域の推定を行う（図１５（ｂ））。 Thereafter, within the narrowed region, the edge distribution is projected onto the vertical axis, and a histogram is generated. Then, a window is scanned in the obtained horizontal axis histogram to obtain a continuously distributed region, thereby estimating a human edge distribution region in the horizontal component (FIG. 15B).

なお、このＳＴＥＰ３において、ＳＴＥＰ１で得られたエッジ分布から鉛直軸ヒストグラムを生成しているが、鉛直軸ヒストグラムは、ＳＴＥＰ２で狭められた局所領域内のエッジ分布から生成することもできる。 In this STEP 3, the vertical axis histogram is generated from the edge distribution obtained in STEP 1. However, the vertical axis histogram can also be generated from the edge distribution in the local region narrowed in STEP 2.

以上のステップにより、エッジの分布情報を参照することで、より正確な人物の外接矩形領域を得ることができる（図１５（ｃ））。 By referring to the edge distribution information through the above steps, a more accurate circumscribed rectangular region of the person can be obtained (FIG. 15C).

なお、ヒストグラム閾値（初期制御点）の設定方法として次のようにしてもよい。
まず、ヒストグラムの頻度値を２つのグループにクラスタリングする。クラスタリング方法はk-mean法（１次元）など、いずれの方法でもよい。これにより、高頻度クラスタと低頻度クラスタに分かれることになる。 Note that the histogram threshold (initial control point) may be set as follows.
First, the histogram frequency values are clustered into two groups. The clustering method may be any method such as a k-mean method (one-dimensional). Thereby, it will be divided into a high frequency cluster and a low frequency cluster.

次に、画像の両端から内側へ探索して、初めて高頻度クラスタに属する頻度値に当たった場合、直前の低頻度の位置と当該高頻度の位置との間を境界とする。この場合、高頻度のさらに内側に低頻度のものがあったとしても問題ない。 Next, when searching from the both ends of the image to the inside and hitting a frequency value belonging to the high frequency cluster for the first time, a boundary is set between the immediately preceding low frequency position and the high frequency position. In this case, there is no problem even if there is a low frequency inside the high frequency.

＜階層間協調アルゴリズム＞
ここで、時空間ＭＲＦとＳｎａｋｅｓとの階層間協調によるトラッキングの処理ステップを述べる。以降、時空間ＭＲＦとＳｎａｋｅｓとを相互に用いてオブジェクトマップを修正することを、階層間協調と称して説明する。 <Inter-layer cooperation algorithm>
Here, a processing step of tracking by cooperation between layers of the spatio-temporal MRF and Snakes will be described. Hereinafter, correcting the object map using the spatio-temporal MRF and Snakes will be referred to as inter-layer cooperation.

まず、Ｓｎａｋｅｓによるオブジェクトマップの修正（オクルージョンなし）の場合、すなわち、図１２のステップＳ１２０２からＳ１２０５に対応する処理について説明する。 First, in the case of object map correction (without occlusion) by Snakes, that is, processing corresponding to steps S1202 to S1205 in FIG. 12 will be described.

ＳＴＥＰ１：時空間ＭＲＦの出力として、オブジェクトマップを受け取り、各オブジェクトにおける外接矩形領域の情報を得る。 STEP 1: An object map is received as an output of the spatio-temporal MRF, and information on a circumscribed rectangular area in each object is obtained.

ＳＴＥＰ２：各オブジェクトにおいて、ＳＴＥＰ２で得られた局所領域においてエッジの分布解析を行い、オブジェクトの輪郭周辺にＳｎａｋｅｓの初期制御点を配置する。 STEP 2: For each object, an edge distribution analysis is performed in the local region obtained in STEP 2, and Snakes initial control points are placed around the contour of the object.

ＳＴＥＰ３：各オブジェクトにおいて、Ｓｎａｋｅｓを実行する。ＳＴＥＰ１で得られた外接矩形の大きさと比較し、スプラインが収縮しすぎてしまったオブジェクトに関してはオブジェクトマップを修正せず、それ以外のオブジェクトに対してはＳｎａｋｅｓの処理結果を反映し、オブジェクトマップを修正する。 STEP 3: Execute Snakes in each object. Compared with the size of the circumscribed rectangle obtained in STEP 1, the object map is not modified for objects whose splines are contracted too much, and the object map is reflected for the other objects. Correct it.

次に、Ｓｎａｋｅｓによるオブジェクトマップの修正（オクルージョンあり）の場合、すなわち、図１２のステップＳ１２１２からＳ１２１５に対応する処理について説明する。 Next, in the case of object map correction by Snakes (with occlusion), that is, processing corresponding to steps S1212 to S1215 in FIG. 12 will be described.

ＳＴＥＰ１：時空間ＭＲＦの出力として、オブジェクトマップを受け取り、各オブジェクトにおける外接矩形領域の情報を得る。オクルージョンが検知された（他のオブジェクトの外接矩形と重複領域をもつ）オブジェクトに対しては、オクルージョンし合っているものを一つのグループとして外接矩形領域を求める。 STEP 1: An object map is received as an output of the spatio-temporal MRF, and information on a circumscribed rectangular area in each object is obtained. For objects in which occlusion is detected (having overlapping areas with circumscribed rectangles of other objects), the circumscribed rectangular area is obtained with the objects that are occluded as one group.

ＳＴＥＰ２：ＳＴＥＰ２で得られた局所領域においてエッジの分布解析を行い、オブジェクトの輪郭周辺にＳｎａｋｅｓの初期制御点を配置する。 STEP 2: An edge distribution analysis is performed in the local region obtained in STEP 2, and initial control points of Snakes are arranged around the contour of the object.

ＳＴＥＰ３：Ｓｎａｋｅｓを実行する。ＳＴＥＰ１で得られた外接矩形の大きさと比較し、スプラインが収縮しすぎてしまったオブジェクトに関してはオブジェクトマップを修正せず、それ以外のオブジェクトに対してはＳｎａｋｅｓの処理結果を反映し、オブジェクトマップを修正する。その際、Ｓｎａｋｅｓで抽出された輪郭に囲まれた内部領域の各オブジェクトのＩＤ番号のラベリングにおいては、時空間ＭＲＦモデルの出力結果によりラベリングを行なう。ただし、当該輪郭内部領域において、背景オブジェクトと認識されたブロックについては、ラベルを背景オブジェクトから不定へ変更することで、次フレームにおいて時空間ＭＲＦモデルによるＩＤ割り当てが行われる。あるいは、現フレームにおいて、当該ブロックのみ時空間ＭＲＦモデルによるＩＤ割り当てを再度行なうことも可能である。 STEP3: Execute Snakes. Compared to the size of the circumscribed rectangle obtained in STEP 1, the object map is not modified for objects whose splines are contracted too much, and the object map is reflected for the other objects. Correct it. At that time, in the labeling of the ID number of each object in the inner region surrounded by the contour extracted by Snakes, the labeling is performed based on the output result of the spatio-temporal MRF model. However, for the block recognized as a background object in the contour inner region, the ID is assigned by the spatiotemporal MRF model in the next frame by changing the label from the background object to indefinite. Alternatively, in the current frame, it is also possible to perform ID assignment by the spatiotemporal MRF model only for that block.

図１６にＳｎａｋｅｓによるオブジェクトマップの修正例を示す。図１６（１−ａ）と（１−ｂ）はオクルージョンのない場合の例、図１６（２−ａ）と（２−ｂ）はオクルージョン時の例である。 FIG. 16 shows a modification example of the object map by Snakes. 16 (1-a) and (1-b) are examples in the case where there is no occlusion, and FIGS. 16 (2-a) and (2-b) are examples in the case of occlusion.

図１６（２−ａ）と（２−ｂ）では、一度ＩＤ番号６とＩＤ番号７の人物を一つのグループとし、Ｓｎａｋｅｓによりグループの輪郭線（背景と人物との境界）を求めている。そして、グループ内の領域分割は、時空間ＭＲＦによる出力情報を反映している。 In FIGS. 16 (2-a) and (2-b), the persons with ID numbers 6 and 7 are once made into one group, and the outline of the group (the boundary between the background and the person) is obtained by Snakes. The area division within the group reflects the output information by the spatiotemporal MRF.

＜Ｓｎａｋｅｓによるオブジェクトマップの修正＞
次に図１７と図１８とを用いて、カメラ１０が変動する場合における階層間協調アルゴリズムによる効果を示す。ここでは図１７と図１８とに示すように、同じシーンの同じフレームに対して、階層間協調なしと階層間協調ありとの場合における移動物体検出の処理結果を対比して説明する。 <Correction of object map by Snakes>
Next, FIG. 17 and FIG. 18 are used to show the effect of the inter-layer cooperation algorithm when the camera 10 changes. Here, as shown in FIGS. 17 and 18, the processing results of moving object detection in the case of no inter-layer cooperation and inter-layer cooperation for the same frame of the same scene will be described.

図１７（ａ）は階層間協調なしの処理結果であり、図１７（ｂ）はそのオブジェクトマップである。図１８（ａ）は階層間協調ありの処理結果であり、図１８（ｂ）はそのオブジェクトマップである。 FIG. 17A shows a processing result without cooperation between layers, and FIG. 17B shows an object map thereof. FIG. 18A shows a processing result with cooperation between layers, and FIG. 18B shows an object map thereof.

カメラのパンニングはフレーム番号８０の直後から始まる。階層間協調アルゴリズムを行わない場合、２０フレーム超のトラッキングに成功しているものの、人物オブジェクトと背景オブジェクトとの境界は、その徐々に曖昧になってしまっている（フレーム９５およびフレーム１０７）。 Camera panning begins immediately after frame number 80. When the inter-layer cooperation algorithm is not performed, although the tracking of more than 20 frames is successful, the boundary between the person object and the background object gradually becomes ambiguous (frame 95 and frame 107).

その一方で、Ｓｎａｋｅｓにより協調させた場合、時空間ＭＲＦの出力を参照することでオブジェクト間の境界を補正することができており、長時間のトラッキングが可能となることがわかる。また、ズーミングした場合においても、階層間協調アルゴリズムの効果があることがわかる（フレーム１４２およびフレーム１５３の右２列）。 On the other hand, when coordinated by Snakes, it is understood that the boundary between objects can be corrected by referring to the output of the spatiotemporal MRF, and tracking for a long time is possible. In addition, even when zooming, it can be seen that there is an effect of the cooperation algorithm between layers (the right two columns of the frame 142 and the frame 153).

図１９はオクルージョンが発生している場合に、カメラ１０が変動した場合に移動物体を検出した結果である。図１９（ａ）ではオクルージョンが発生しておらず、その後、図１９（ｂ）でオクルージョンが発生している。いずれの場合においても、本実施形態による移動物体追跡装置２０は、移動物体を追跡できていることがわかる。 FIG. 19 shows the result of detecting a moving object when the camera 10 fluctuates when occlusion occurs. In FIG. 19A, no occlusion has occurred, and thereafter, in FIG. 19B, occlusion has occurred. In any case, it can be seen that the moving object tracking device 20 according to the present embodiment can track the moving object.

以上説明したように、本実施形態による移動物体追跡装置２０は、抽出した輪郭に基づいて、オブジェクトマップ記憶部に記憶されているブロックの識別符号および動きベクトルを補正することにより、背景が変動する画像からでも、画像中の移動物体を正確に検出することができる。 As described above, the moving object tracking device 20 according to the present embodiment changes the background by correcting the block identification code and the motion vector stored in the object map storage unit based on the extracted contour. Even from an image, a moving object in the image can be accurately detected.

なお、上記の実施形態の説明においては、オブジェクトマップ記憶部に記憶されているブロックの識別符号とともに動きベクトルを補正するものとして説明したが、識別符号のみを補正してもよい。このようにしても、同様に、背景が変動する画像からでも、画像中の移動物体を正確に検出することができる。 In the above description of the embodiment, the motion vector is corrected together with the block identification code stored in the object map storage unit. However, only the identification code may be corrected. Even in this case, similarly, a moving object in an image can be accurately detected from an image with a changing background.

なお、上述した移動物体変動量検出部３３は、オブジェクトマップ記憶部２６に記憶されている移動物体のサイズまたは移動量の変動量を、識別符号または動きベクトルに基づいて単位時間毎に検出する。 The moving object variation detection unit 33 described above detects the size of the moving object or the variation of the movement amount stored in the object map storage unit 26 based on the identification code or the motion vector for each unit time.

そして制御部３４（第２の制御部）は、移動物体変動量検出部３３により検出された単位時間毎の移動物体のサイズまたは移動量の変動量が、予め定められているサイズまたは移動量の変動量よりも大きい場合に、輪郭抽出部３０を制御して輪郭を抽出させ、補正部３１を制御してオブジェクトマップ記憶部２６に記憶されているブロックの識別符号および動きベクトルを補正させる。 Then, the control unit 34 (second control unit) determines that the size of the moving object or the amount of movement of the moving object per unit time detected by the moving object fluctuation amount detection unit 33 is a predetermined size or movement amount. When it is larger than the fluctuation amount, the contour extraction unit 30 is controlled to extract the contour, and the correction unit 31 is controlled to correct the block identification code and the motion vector stored in the object map storage unit 26.

このようにオブジェクトマップ記憶部２６に記憶されている移動物体のサイズまたは移動量の変動量も応じて、オブジェクトマップ記憶部２６に記憶されているブロックの識別符号および動きベクトルを補正させてもよい。 As described above, the block identification code and the motion vector stored in the object map storage unit 26 may be corrected in accordance with the size of the moving object stored in the object map storage unit 26 or the variation amount of the movement amount. .

これにより制御部３４が、単に予め定められている所定期間毎にまたは所定フレーム毎にオブジェクトマップを補正する場合に対比して、移動物体を検出することに失敗しそうなタイミングで、オブジェクトマップを補正することが可能となる。そのために、移動物体をより的確に検出して追跡することが可能となる。 As a result, the control unit 34 corrects the object map at a timing at which it is likely to fail to detect the moving object, as compared with the case where the control unit 34 simply corrects the object map every predetermined period or every predetermined frame. It becomes possible to do. Therefore, it becomes possible to detect and track a moving object more accurately.

なお上述した図２のフレームバッファメモリ３、画像メモリ２１またはオブジェクトマップ記憶部２６などの記憶部は、ハードディスク装置や光磁気ディスク装置、フラッシュメモリ等の不揮発性のメモリや、ＣＤ−ＲＯＭ等の読み出しのみが可能な記憶媒体、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）のような揮発性のメモリ、あるいはこれらの組み合わせにより構成されるものとする。 The storage unit such as the frame buffer memory 3, the image memory 21, or the object map storage unit 26 of FIG. 2 described above is a non-volatile memory such as a hard disk device, a magneto-optical disk device, a flash memory, or a CD-ROM. Only a storage medium that can only be used, a volatile memory such as a RAM (Random Access Memory), or a combination thereof.

また図２における画像変換部２２、背景生成部２４、ＩＤ生成／消滅部２５、移動物体追跡部２７、輪郭抽出部３０、補正部３１、判定部３２、移動物体変動量検出部３３、制御部３４またはオクルージョン検出部３５という処理部は、専用のハードウェアにより実現されるものであってもよく、また、この処理部はメモリおよびＣＰＵ（中央演算装置）により構成され、処理部の機能を実現するためのプログラムをメモリにロードして実行することによりその機能を実現させるものであってもよい。 2, the image conversion unit 22, the background generation unit 24, the ID generation / annihilation unit 25, the moving object tracking unit 27, the contour extraction unit 30, the correction unit 31, the determination unit 32, the moving object fluctuation amount detection unit 33, and the control unit. The processing unit 34 or the occlusion detection unit 35 may be realized by dedicated hardware, and this processing unit is configured by a memory and a CPU (central processing unit) to realize the function of the processing unit. The function may be realized by loading a program to be executed into a memory and executing the program.

また、図１における画像変換部２２、背景生成部２４、ＩＤ生成／消滅部２５、移動物体追跡部２７、輪郭抽出部３０、補正部３１、判定部３２、移動物体変動量検出部３３、制御部３４またはオクルージョン検出部３５という処理部の機能を実現するためのプログラムをコンピュータ読み取り可能な記録媒体に記録して、この記録媒体に記録されたプログラムをコンピュータシステムに読み込ませ、実行することにより、この処理部による処理を実行してもよい。なお、ここでいう「コンピュータシステム」とは、ＯＳや周辺機器等のハードウェアを含むものとする。 Also, the image conversion unit 22, the background generation unit 24, the ID generation / annihilation unit 25, the moving object tracking unit 27, the contour extraction unit 30, the correction unit 31, the determination unit 32, the moving object variation detection unit 33, and the control in FIG. By recording a program for realizing the function of the processing unit called the unit 34 or the occlusion detection unit 35 on a computer-readable recording medium, causing the computer system to read and execute the program recorded on the recording medium, You may perform the process by this process part. The “computer system” here includes an OS and hardware such as peripheral devices.

また、「コンピュータシステム」は、ＷＷＷシステムを利用している場合であれば、ホームページ提供環境（あるいは表示環境）も含むものとする。
また、「コンピュータ読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ＲＯＭ、ＣＤ−ＲＯＭ等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶装置のことをいう。さらに「コンピュータ読み取り可能な記録媒体」とは、インターネット等のネットワークや電話回線等の通信回線を介してプログラムを送信する場合の通信線のように、短時間の間、動的にプログラムを保持するもの、その場合のサーバやクライアントとなるコンピュータシステム内部の揮発性メモリのように、一定時間プログラムを保持しているものも含むものとする。また上記プログラムは、前述した機能の一部を実現するためのものであっても良く、さらに前述した機能をコンピュータシステムにすでに記録されているプログラムとの組み合わせで実現できるものであっても良い。 Further, the “computer system” includes a homepage providing environment (or display environment) if a WWW system is used.
The “computer-readable recording medium” refers to a portable medium such as a flexible disk, a magneto-optical disk, a ROM, and a CD-ROM, and a storage device such as a hard disk built in the computer system. Furthermore, the “computer-readable recording medium” dynamically holds a program for a short time like a communication line when transmitting a program via a network such as the Internet or a communication line such as a telephone line. In this case, a volatile memory in a computer system serving as a server or a client in that case, and a program that holds a program for a certain period of time are also included. The program may be a program for realizing a part of the functions described above, and may be a program capable of realizing the functions described above in combination with a program already recorded in a computer system.

以上、この発明の実施形態を図面を参照して詳述してきたが、具体的な構成はこの実施形態に限られるものではなく、この発明の要旨を逸脱しない範囲の設計等も含まれる。 The embodiment of the present invention has been described in detail with reference to the drawings. However, the specific configuration is not limited to this embodiment, and includes design and the like within a scope not departing from the gist of the present invention.

１０…カメラ、２０…移動物体追跡装置、２１…画像メモリ、２２…画像変換部、２３…フレームバッファメモリ、２４…背景画像生成部、２５…ＩＤ生成／消滅部、２６…オブジェクトマップ記憶部、２７…移動物体追跡部、３０…輪郭抽出部、３１…補正部、３２…判定部、３３…移動物体変動量検出部、３４…制御部、３５…オクルージョン検出部、３０１…対象領域設定部、３０２…対象領域補正部、３０３…輪郭抽出処理部 DESCRIPTION OF SYMBOLS 10 ... Camera, 20 ... Moving object tracking apparatus, 21 ... Image memory, 22 ... Image conversion part, 23 ... Frame buffer memory, 24 ... Background image generation part, 25 ... ID production | generation / annihilation part, 26 ... Object map storage part, 27 ... Moving object tracking unit, 30 ... Contour extraction unit, 31 ... Correction unit, 32 ... Determination unit, 33 ... Moving object fluctuation amount detection unit, 34 ... Control unit, 35 ... Occlusion detection unit, 301 ... Target region setting unit, 302: Target area correction unit, 303 ... Contour extraction processing unit

Claims

A moving object tracking device on an image that detects a moving object in an image by processing a time-series image,
Each image of the time-series image is divided into a plurality of blocks, and an identification code indicating a moving object in the image is attached to the block corresponding to the moving object, and is stored.
A contour extracting unit that extracts a contour of the moving object from the image of the time series image;
A correction unit that corrects the identification code of the block stored in the object map storage unit based on the contour extracted by the contour extraction unit;
A moving object tracking device comprising:

The contour extraction unit
A target area setting unit that sets a target area for extracting the contour of the moving object based on an image area corresponding to the block of the moving object stored in the object map storage unit;
A contour extraction processing unit that extracts a contour of the moving object from the image of the time-series image with respect to the target region set by the target region setting unit;
The moving object tracking device according to claim 1, comprising:

The contour extraction unit
A histogram of the number of pixels corresponding to the edge in the target region set by the target region setting unit is generated by projecting the image of the time series image by edge extraction processing for each coordinate axis, A target region correction unit that corrects the target region set by the target region setting unit for each coordinate axis based on a histogram generated for each coordinate axis;
Have
The contour extraction processing unit
For the target region corrected by the target region correction unit, the contour of the moving object is extracted from the image of the time series image.
The moving object tracking device according to claim 2.

The contour extraction unit
A plurality of moving objects in which occlusion occurs are defined as an integral moving object, and the contour of the integral moving object is extracted from the image of the time series image.
The moving object tracking device according to claim 1, wherein the moving object tracking device is a moving object tracking device.

The correction unit is
Information indicating boundaries of the plurality of moving objects based on identification codes corresponding to the plurality of moving objects in which the occlusion occurs, which is stored in the object map storage unit, and the occlusion is generated by the contour extraction unit Correcting the identification code of the block stored in the object map storage unit based on the outline of the extracted moving object integrated with a plurality of moving objects that are integrated,
The moving object tracking device according to claim 4.

A determination unit that determines whether or not the background image is fluctuating;
When the determination unit determines that the background image is fluctuating, the contour extraction unit is controlled to extract a contour, and the correction unit is controlled to store the block stored in the object map storage unit. A first control unit for correcting the identification code;
The moving object tracking device according to claim 1, further comprising:

A moving object fluctuation amount detection unit that detects a fluctuation amount of the size or movement amount of the moving object stored in the object map storage unit per unit time based on the identification code;
The contour extracting unit when the moving object size or moving amount variation amount per unit time detected by the moving object variation detecting unit is larger than a predetermined size or moving amount variation amount. A second control unit that extracts the contour by controlling the correction unit, and controls the correction unit to correct the identification code of the block stored in the object map storage unit;
The moving object tracking device according to claim 1, further comprising:

In the object map storage unit,
The identification code is attached to the block corresponding to the moving object, and a motion vector of the moving object corresponding to the block is attached to the block and stored.
A moving object tracking unit that updates a block identification code and a motion vector stored in the object map storage unit based on a result of image processing of the time-series image;
Have
The moving object tracking unit includes:
For each of consecutive N images (N ≧ 2) of the time-series images, by attaching the same identification code to blocks whose absolute values of motion vector differences between adjacent blocks are within a predetermined value, An identification code procedure for attaching different identification codes to overlapping moving objects;
In each of the N images, a first object that is a block group to which a first identification code is attached is in contact with a second object that is a block group to which a second identification code is attached, and the N image is temporally related. A determination procedure for determining whether or not the degree of correlation between the first objects of images adjacent to each other is equal to or greater than a predetermined value;
A tracking procedure for tracking the first object and the second object retroactively after being determined affirmative in the determination procedure;
An update procedure for updating a block identification code and a motion vector stored in the object map storage unit based on the first object and the second object tracked back in time by the tracking procedure;
The moving object tracking device according to claim 1, wherein:

An on-image moving object tracking method used in an on-image moving object tracking device that detects time-series images and detects a moving object in the image,
A contour extracting unit that extracts a contour of the moving object from the image of the time-series image;
An object map storage unit in which each image of the time-series image is divided into a plurality of blocks and an identification code indicating the moving object in the image is attached to the block corresponding to the moving object and stored in the correction unit A correction procedure for correcting the identification code of the block stored in the image based on the contour extracted by the contour extraction unit;
A moving object tracking method comprising:

To the computer as a moving object tracking device on the image that detects the moving object in the image by processing the time series image,
A contour extraction unit executes a contour extraction procedure for extracting the contour of the moving object from the image of the time series image,
An object map storage unit in which each image of the time-series image is divided into a plurality of blocks and an identification code indicating the moving object in the image is attached to the block corresponding to the moving object and stored in the correction unit Executing a correction procedure for correcting the identification code of the block stored in the block based on the contour extracted by the contour extraction unit;
Moving object tracking program to make