JPH04260979A

JPH04260979A - Detecting and tracking system for mobile objection

Info

Publication number: JPH04260979A
Application number: JP3314304A
Authority: JP
Inventors: Hidenori Inai; 秀則井内; Piitaa Makurokurin Nairu; ナイル・ピーター・マクロクリン
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1990-11-28
Filing date: 1991-11-28
Publication date: 1992-09-16
Also published as: GB9025797D0; GB9125376D0; GB2253109A

Abstract

PURPOSE:To offer a system for detecting the motion of plural mobile objections included in a continued image frame and tracking an intended specific mobile objection. CONSTITUTION:After calculating an optical flow from the luminance information of respective picture elements in a continued image frame by a simplified MOC filter 104, motion information relating to the boundary of a mobile objection identified by the optical flow is converted by a time space normalizing filter 105 for time spatially averaging and regulating the motion information so as to be concentrated into the center of the mobile objection, projection distribution is applied to the converted data by a feature extracting filter 106 and the distributed data are converted into one-dimensional speed information and then inputted to a competition type neural network 102 having three-layer structure. A box (window) including the specific intended mobile objection is defined from the center information and contour information of the mobile objection is defined in order to track the mobile objection, the shape information of the mobile objection in the box is extracted by a feature extracting filter 108 and inputted to a novel filter neural network 109 having one-layer structure to recognize the shape of the mobile objection.

Description

[Detailed description of the invention]

【０００１】0001

【産業上の利用分野】本発明は移動物体の検出、追跡を
行なうシステムに関する。さらに詳しくは、複数の移動
物体の検出、追跡を人間の視覚機能を模倣したニューラ
ルネットワーク技術を応用することにより実時間で行な
うことを可能にする移動物体の検出、追跡を行なうシス
テムに関する。物流システム管理、自立型ロボットの視
覚などの応用が考えられる。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a system for detecting and tracking moving objects. More specifically, the present invention relates to a system for detecting and tracking moving objects that can detect and track multiple moving objects in real time by applying neural network technology that imitates human visual function. Possible applications include logistics system management and autonomous robot vision.

【０００２】0002

【従来の技術】ビデオ映像上の明点の追跡技術について
は、米国特許第４６４４３９７号「ビデオ映像に於ける
明点を他の明点の干渉を除外しながら意図する明点のみ
を追跡する方法」（ＭＥＴＨＯＤ　ＯＦ　ＰＲＯＣＥＳ
ＳＩＮＧ　Ａ　ＶＩＤＥＯ　ＩＭＡＧＥ　ＴＯ　ＴＲＡ
ＣＫ　Ａ　ＢＲＩＧＨＴ　ＳＰＯＴ　ＡＮＤ　ＴＯ　Ｐ
ＲＯＴＥＣＴ　ＳＡＩＤ　ＴＲＡＣＫＩＮＧ　ＦＲＯＭ
　ＩＮＴＥＲＦＥＲＥＮＣＥ　ＦＲＯＭ　ＯＴＨＥＲ　
ＢＲＩＧＨＴ　ＳＰＯＴＳ）が挙げられる。従来の技術
は、ほぼ円形に近い移動物体に対して、それを包含する
ような仮想的なウィンドウを設け、この仮想ウィンドウ
内での移動前後の二つの画像の相関が最も大きくなる方
向に仮想ウィンドウを移動することによって追跡を行な
っていた。[Prior Art] Regarding a technology for tracking bright points on a video image, see US Pat. ”(METHOD OF PROCES
SING A VIDEO IMAGE TO TRA
CK A BRIGHT SPOT AND TO P
ROTECT SAID TRACKING FROM
INTERFERENCE FROM OTHER
BRIGHT SPOTS). In conventional technology, a virtual window is created to encompass a nearly circular moving object, and the virtual window is opened in the direction that maximizes the correlation between the two images before and after the movement within this virtual window. Tracking was done by moving the .

【０００３】0003

【発明が解決しようとする課題】移動物体の輪郭点に関
する速度情報（オプティカルフロー）は物体の輝度が一
様である場合には物体の輪郭点に於いてのみ求められる
。速度情報は、速さと移動方向の２成分からなるが、一
つの塊としての、移動物体についての速度情報を求める
ためには物体の輪郭点における速度情報を何らかの手段
により統合する必要がある。しかしながら、一般的には
、この目的のためには最初に移動物体を他の移動物体、
あるいは静止物体から分離することが必要となる。[Problem to be Solved by the Invention] Velocity information (optical flow) regarding the contour points of a moving object can be obtained only at the contour points of the object when the brightness of the object is uniform. Velocity information consists of two components, speed and moving direction, but in order to obtain velocity information about a moving object as a single mass, it is necessary to integrate velocity information at contour points of the object by some means. However, in general, for this purpose, a moving object must first be connected to another moving object,
Or it may be necessary to separate it from a stationary object.

【０００４】上記従来技術は、予め設けられた仮想ウィ
ンドウ内に入ってくる他の移動点に対しては、これを排
除するための別のサブウインドウを用意するという手段
を用いて追跡すべき移動点に対してサブウインドウ内の
移動点が与える影響を除外していた。このために追跡す
べき移動点の数が増大すると新たなサブウィンドウを計
算し設定するための処理時間が増大するという問題があ
った。[0004] In the above-mentioned prior art, the movement to be tracked is determined by preparing another sub-window to exclude other moving points that enter a pre-provided virtual window. The influence of moving points within the subwindow on points was excluded. For this reason, there is a problem in that as the number of moving points to be tracked increases, the processing time for calculating and setting a new sub-window increases.

【０００５】本発明の目的は、画面上の移動物体の数に
よらず追跡に要する処理を一定時間で実行する移動物体
の検出追跡システム、及び移動物体の画面上での他の移
動物体との衝突、他の物体による隠蔽に影響を受けない
移動物体の検出追跡システムを提供することにある。An object of the present invention is to provide a moving object detection and tracking system that executes the processing required for tracking in a fixed time regardless of the number of moving objects on the screen, and a system for detecting and tracking moving objects that performs the processing required for tracking in a fixed time regardless of the number of moving objects on the screen. An object of the present invention is to provide a moving object detection and tracking system that is not affected by collision or concealment by other objects.

【０００６】[0006]

【課題を解決するための手段】上記目的を達成するため
に、まず、連続する２つの画像フレームに対して簡単化
ＭＯＣフィルタを適用することにより移動物体の輪郭に
関する動き情報（オプティカルフロー）を計算する。簡
単化ＭＯＣフィルタは３つの処理段階（空間フィルタに
よる方向別エッジ検出部、時間フィルタによる輝度差検
出部、空間フィルタと時間フィルタの積和組合せ部）か
ら構成されるオプティカルフロー算出フィルタである。次に、オプティカルフローに対して時空間規格化フィル
タを移動方向毎に適用することにより、移動物体に関す
る移動方向毎の速さ情報を規格化して抽出する。移動物
体に関する速さ情報の移動方向、及び移動方向に垂直な
方向への一次元投影分布をとった後、方向（移動方向、
投影分布方向）ごとの局所極大値検出ニューラルネット
ワークに入力する。局所極大値検出ニューラルネットワ
ークは３層構造の競合型ニューラルネットワークである
。[Means for solving the problem] In order to achieve the above object, first, motion information (optical flow) regarding the outline of a moving object is calculated by applying a simplified MOC filter to two consecutive image frames. do. The simplified MOC filter is an optical flow calculation filter that includes three processing stages (directional edge detection section using a spatial filter, brightness difference detection section using a temporal filter, and product-sum combination section of a spatial filter and a temporal filter). Next, by applying a spatio-temporal normalization filter to the optical flow for each direction of movement, speed information regarding the moving object for each direction of movement is standardized and extracted. After taking the one-dimensional projection distribution of the velocity information regarding the moving object in the direction of movement and in the direction perpendicular to the direction of movement, the direction (direction of movement,
input to the local maximum detection neural network for each projection distribution direction). The local maximum value detection neural network is a competitive neural network with a three-layer structure.

【０００７】次に、局所極大値検出ニューラルネットワ
ークにより検出された移動物体の中心、輪郭情報から移
動物体を包含する箱を定義し、この箱の中で新奇性フィ
ルタニューラルネットワークにより物体認識を行なう。Next, a box containing the moving object is defined from the center and contour information of the moving object detected by the local maximum value detection neural network, and object recognition is performed within this box by the novelty filter neural network.

【０００８】[0008]

【作用】連続する２つの画像フレームに対して簡単化Ｍ
ＯＣフィルタを適用することにより移動物体の輪郭に関
する動き情報（オプティカルフロー）が計算される。簡
単化ＭＯＣフィルタは物体の輝度によらない速度情報の
計算が可能であることが特徴である。オプティカルフロ
ー情報を移動方向毎に時空間規格化フィルタに入力する
ことにより一つの塊としての、移動物体に関する規格化
された速さ情報を抽出することができる。移動物体に関
する速度情報の一次元投影分布波形を方向（移動方向、
投影分布方向）ごとに局所極大値検出ニューラルネット
ワークに入力して投影分布方向ごとに、各移動方向に対
応する局所極大値検出ニューラルネットワークの出力を
比較し、最大値を検出することにより各移動物体の中心
の座標、及び移動方向を決定することができる。この競
合型の局所極大値検出ニューラルネットワークは、外部
入力信号の局所極大値を時間の経過と共に先鋭化する、
ダイナミック動作をする短期メモリとして動作する。競
合型の局所極大値検出ニューラルネットワーク群により
求められる移動物体の中心情報、及びオプティカルフロ
ーにより求められる輪郭情報を用いることにより移動物
体を包含する箱（ウィンドウ）を定義することができる
。この箱（ウィンドウ）の中の移動物体に対して新奇性
フィルタニューラルネットワークを適用する。新奇性フ
ィルタニューラルネットワークは、学習したパターンに
対して新奇なパターン部分を出力するニューラルネット
ワークである。新奇性フィルタの持つこの性質により他
の移動物体との衝突、他の物体による隠蔽が発生した場
合にも、追跡すべき移動物体が学習パターンとして記憶
されていれば、追跡すべき移動物体の形状を連想出力で
きる。このために他の移動物体との衝突、他の物体によ
る隠蔽に影響を受けない物体認識が可能になる。[Action] Simplify M for two consecutive image frames
Motion information (optical flow) regarding the contour of the moving object is calculated by applying the OC filter. The simplified MOC filter is characterized in that velocity information can be calculated without depending on the brightness of the object. By inputting the optical flow information to the spatio-temporal normalization filter for each direction of movement, it is possible to extract normalized speed information regarding the moving object as a single block. The one-dimensional projected distribution waveform of velocity information about a moving object is expressed in the direction (moving direction,
Each moving object is The coordinates of the center of and the direction of movement can be determined. This competitive local maximum value detection neural network sharpens the local maximum value of the external input signal over time.
It acts as a dynamic short-term memory. A box (window) containing the moving object can be defined by using the center information of the moving object obtained by the competitive local maximum detection neural network group and the contour information obtained by the optical flow. A novelty filter neural network is applied to the moving object within this box (window). A novelty filter neural network is a neural network that outputs a novel pattern part for a learned pattern. Due to this property of the novelty filter, even if a collision with another moving object or concealment by another object occurs, if the moving object to be tracked is memorized as a learning pattern, the shape of the moving object to be tracked is can be output associatively. This makes it possible to recognize objects that are not affected by collisions with other moving objects or concealment by other objects.

【０００９】[0009]

【実施例】以下、本発明の一実施例を詳細に説明する。図１は本発明に基づく移動物体の検出追跡システムの構
成図を示している。センサ１０１はビットマップ画像を
動き検出サブシステム（ＩＰＬＳ）１０２、動き追跡サ
ブシステム（ＭＴＳ）１０３に供給する。動き検出サブ
システム（ＩＰＬＳ）１０２では、センサ１０１により
得られる画像に対して簡単化ＭＯＣフィルタ１０４、及
び時空間規格化フィルタ１０５により動き情報を抽出す
るための画像情報処理を行なった後、特徴抽出１フィル
タ１０６により一次元動き情報特徴を抽出して局所極大
値検出ニューラルネットワーク群１０７で移動物体の中
心情報を検出する。[Embodiment] An embodiment of the present invention will be described in detail below. FIG. 1 shows a block diagram of a moving object detection and tracking system according to the present invention. The sensor 101 provides bitmap images to a motion detection subsystem (IPLS) 102 and a motion tracking subsystem (MTS) 103. A motion detection subsystem (IPLS) 102 performs image information processing to extract motion information using a simplified MOC filter 104 and a spatio-temporal normalization filter 105 on the image obtained by the sensor 101, and then performs feature extraction. 1 filter 106 extracts one-dimensional motion information features, and a local maximum value detection neural network group 107 detects center information of the moving object.

【００１０】動き追跡サブシステム（ＭＴＳ）１０３で
は、センサ１０１により得られる画像から動き検出サブ
システム（ＩＰＬＳ）１０２で得られる移動物体の中心
情報を用いることにより移動物体を包含する箱領域を切
りだし、箱領域内の画像から特徴抽出２フィルタ１０８
により移動物体に関する形状情報を抽出した後、新奇性
フィルタニューラルネッワーク１０９により追跡すべき
移動物体認識を行ない、この結果を用いて意図する移動
物体を視野の中心に捉えるようにセンサ駆動部１１０を
駆動する。The motion tracking subsystem (MTS) 103 cuts out a box region containing the moving object from the image obtained by the sensor 101 by using the center information of the moving object obtained by the motion detection subsystem (IPLS) 102. , Feature extraction 2 filter 108 from the image within the box area
After extracting shape information regarding the moving object, the novelty filter neural network 109 recognizes the moving object to be tracked, and uses this result to drive the sensor drive unit 110 to capture the intended moving object at the center of the field of view. do.

【００１１】まず、動き検出サブシステム（ＩＰＬＳ）
１０２について詳細に説明する。センサ１０１により得
られる連続画像フレームの画素輝度情報をＩｉｊ（ｔ１
），Ｉｉｊ（ｔ２），．．．と記述する。ここでｉ，ｊ
は画素の位置を、ｔ１、ｔ２は時刻を示す。これらの画
素輝度情報は簡単化ＭＯＣフィルタ１０４に入力される
。簡単化ＭＯＣフィルタ１０４は３層の構造を持ち、連
続する画像フレームの同一位置の画素における方向毎の
移動量、即ちオプティカルフローを計算する。このオプ
ティカルフロー算出フィルタは、Ｓ．Ｇｒｏｓｓｂｅｒ
ｇ，Ｍ．Ｒｕｄｄ著「Ａ　Ｎｅｕｒａｌ　Ａｒｃｈｉｔ
ｅｃｔｕｒｅ　ｆｏｒ　Ｖｉｓｕａｌ　Ｍｏｔｉｏｎ　
Ｐｅｒｃｅｐｔｉｏｎ＊　Ｇｒｏｕｐａｎｄ　Ｅｌｅｍ
ｅｎｔ　Ａｐｐａｒｅｎｔ　Ｍｏｔｉｏｎ」，Ｎｅｕｒ
ａｌ　Ｎｅｔｗｏｒｋ　Ｖｏｌ２，ｐｐ４２１−４５０
（１９８９）、またはＳ．Ｇｒｏｓｓｂｅｒｇ著「Ｔｈ
ｅ　Ａｄａｐｔｉｖｅ　Ｂｒａｉｎ　ＩＩ」，Ｅｌｓｅ
ｖｉｒ（１９８７）に詳細に記述されている。簡単化Ｍ
ＯＣフィルタ１０４の出力信号をＵｉｊ（ｋ，ｔ）と表
す。ここでｉ，ｊは画素の位置を、ｋは方向を、ｔは時
刻を示す。方向分解能は８であるとする。簡単化ＭＯＣ
フィルタ１０４は、現在取り込んでいる画像のエッジを
検出し、かつ現在取り込んでいる画像と直前に取り込ん
だ画像の輝度情報とを比較することにより、ｋ＝０〜７
に対応する８方向の輪郭速度情報、即ちオプティカルフ
ローＵｉｊ（ｋ，ｔ）を算出する。オプティカルフロー
Ｕｉｊ（ｋ，ｔ）は、時空間規格化フィルタ１０５によ
り時空間的に平均化、規格化されてＶｉｊ（ｋ，ｔ）に
変換される。変換式は次式（１）に基づく。First, the motion detection subsystem (IPLS)
102 will be explained in detail. The pixel brightness information of continuous image frames obtained by the sensor 101 is expressed as Iij(t1
), Iij(t2), . ．．．． It is described as follows. Here i, j
indicates the pixel position, and t1 and t2 indicate the time. These pixel brightness information are input to simplified MOC filter 104. The simplified MOC filter 104 has a three-layer structure and calculates the amount of movement in each direction of pixels at the same position in consecutive image frames, that is, the optical flow. This optical flow calculation filter is based on S. Grossber
g, M. “A Neural Archit” by Rudd.
ecture for Visual Motion
Perception* Group Elem
ENT APPARENT MOTION”, Neur
al Network Vol2, pp421-450
(1989), or S. Grossberg, “Th
e Adaptive Brain II”, Else
vir (1987). Simplification M
The output signal of the OC filter 104 is expressed as Uij(k,t). Here, i and j indicate the position of the pixel, k indicates the direction, and t indicates the time. It is assumed that the directional resolution is 8. Simplified MOC
The filter 104 detects the edge of the currently captured image and compares the brightness information of the currently captured image with the image captured immediately before, so that k=0 to 7.
Contour velocity information in eight directions corresponding to , that is, optical flow Uij (k, t) is calculated. The optical flow Uij (k, t) is spatiotemporally averaged and normalized by the spatiotemporal normalization filter 105 and converted into Vij (k, t). The conversion formula is based on the following formula (1).

【００１２】0012

【数１】 εｄＶｉｊ（ｋ，ｔ）／ｄｔ　＝　−αＶｉｊ（ｋ，ｔ
）　＋　（Ａ−Ｖｉｊ（ｋ，ｔ））ＮＥｉｊ　−　Ｖｉ
ｊ（ｋ，ｔ）ＮＩｉｊ　　　　（１）ここで、 εは時定数、ＮＥｉｊ　　＝　　Ｕｉｊ（ｋ，ｔ）、ＮＩｉｊ　＝　
ΣＧσ（ｕ，ｖ）Ｕ（ｉ＋ｕ）（ｊ＋ｖ）（ｋ，ｔ）、
（ｕ，ｖ）≠（０，０）Ｇσ（ｕ，ｖ）＝（１／２πσ）ｅｘｐ（−（ｕ２＋ｖ
２）／２σ２）である。[Equation 1] εdVij (k, t)/dt = −αVij (k, t
) + (A-Vij(k,t))NEij-Vi
j (k, t) NIij (1) where ε is a time constant, NEij = Uij (k, t), NIij =
ΣGσ(u,v)U(i+u)(j+v)(k,t),
(u, v)≠(0,0) Gσ(u,v)=(1/2πσ)exp(−(u2+v
2)/2σ2).

【００１３】式（１）は、Ｖｉｊ（ｋ，ｔ）の値が０と
Ａとの間に入るように速度を規格化する。即ち飛び抜け
て大きい速度成分を有する移動物体が他の移動物体の速
度の大きさ情報に与える影響が抑止される。さらに、時
空間規格化フィルタ１０５は簡単化ＭＯＣフィルタ１０
４の出力信号Ｕｉｊ（ｋ，ｔ）の局所的な時空間平均化
を行なう、非常に短期の記憶メモリとしても役割を果た
す。上記の処理過程を図２、３、４を用いて図式的に示
す。Equation (1) normalizes the speed so that the value of Vij(k,t) falls between 0 and A. That is, the influence of a moving object having an extremely large velocity component on velocity magnitude information of other moving objects is suppressed. Furthermore, the spatio-temporal normalization filter 105 is the simplified MOC filter 10
It also serves as a very short-term storage memory that performs local spatio-temporal averaging of the output signals Uij(k,t) of 4. The above processing steps are schematically illustrated using FIGS. 2, 3, and 4.

【００１４】図２に於いて、３個の物体がシーンに存在
しており、その中の１個の物体は静止物体であり、他の
２個の物体は異なる速度で右方向に移動している。Ｖｉ
ｊ（ｋ，ｔ）は特徴抽出１フィルタ１０６で８方向毎に
それぞれ、移動方向、及び移動方向に垂直な方向に沿っ
た投影分布がとられ、Ｓｘ，ｋ、Ｓｙ，ｋに変換される
。In FIG. 2, three objects exist in the scene, one of which is stationary, and the other two objects are moving to the right at different speeds. There is. Vi
The feature extraction 1 filter 106 calculates the projection distribution of j(k,t) along the movement direction and the direction perpendicular to the movement direction for each of the eight directions, and converts it into Sx,k and Sy,k.

【００１５】図４にこの投影分布の計算過程を図式的に
示す。この結果、２個の移動物体については中心情報は
Ｓｘ，ｋ、Ｓｙ，ｋの極大値に反映されることになる。FIG. 4 schematically shows the process of calculating this projection distribution. As a result, for the two moving objects, the center information will be reflected in the maximum values of Sx,k and Sy,k.

【００１６】図５に移動物体中心情報を出力する局所極
大値検出ニューラルネットワーク群１０７の中のひとつ
のニューラルネットワークの詳細な構造を示す。局所極
大値検出ニューラルネットワークは、Ｆ１、Ｆ２、Ｆ３
で示す３層構造を持つ。入力信号ＳｋはＦ１、Ｆ３層に
同時に入力される。Ｆ２、Ｆ３層は同じ層の他のノード
（ニューロン）から負の抑制性の入力を、自身のノード
（ニューロン）からは正の興奮性の入力を受ける競合型
の構造を持つ。これら同一層内の結合の値は変化しない
。Ｘｋ、Ｙｋは移動物体の中心座標を示しておりＦ２層
に出力される。インスターと呼ぶＦ１層からＦ２層に至
る結合、及び上記の同一層内での結合によってＦ２層の
ノード活性値Ｘｊは次式（２）により決定される。FIG. 5 shows the detailed structure of one of the local maximum value detection neural networks 107 that outputs moving object center information. The local maximum value detection neural network is F1, F2, F3
It has a three-layer structure shown in . The input signal Sk is simultaneously input to the F1 and F3 layers. The F2 and F3 layers have a competitive structure in which they receive negative inhibitory inputs from other nodes (neurons) in the same layer and positive excitatory inputs from their own nodes (neurons). The values of these connections within the same layer do not change. Xk and Yk indicate the center coordinates of the moving object and are output to the F2 layer. The node activation value Xj of the F2 layer is determined by the following equation (2) by the connections from the F1 layer to the F2 layer called instars and the connections within the same layer described above.

【００１７】[0017]

【数２】　　　　　　　　　　　　　　　　　　　　　　　　　
　ｎ　　ｄＸｊ／ｄｔ　＝　−αＸｊ　＋　（Ａ　−　
Ｘｊ）［ΣＷｊｉＳｉ　＋　ｆ（Ｘｊ）］　−　ＸｊΣ
ｆ（Ｘｉ）　　　　　　　　（２）　　　　　　　　　
　　　　　　　　　　　　　　　　　ｉ＝０　　　　　
　　　　　　　　　　　　ｉ≠ｊ　　Ｆ１層からＦ２層に至る結合値ｗｉｊは次式（３）
で与えられるインスター学習則により決定される。[Math 2]
ndXj/dt = −αXj + (A −
Xj) [ΣWjiSi + f(Xj)] − XjΣ
f(Xi) (2)
i=0
i≠j The coupling value wij from the F1 layer to the F2 layer is expressed by the following formula (3)
It is determined by the instar learning rule given by .

【００１８】[0018]

【数３】ｄｗｉｊ／ｄｔ　＝ｅ（ｔ）（Ｓｉ−ｗｉｊ）Ｘｊ　　
　　　　　　　　　　　　　　　　　　　　　　　　　
　　　　　　　　　（３）ここで、ｅ（ｔ）は時間の経
過と共に減衰する単調減衰関数である。[Formula 3] dwij/dt = e(t)(Si-wij)Xj

(3) Here, e(t) is a monotonous decay function that decays over time.

【００１９】アウトスター学習則と呼ぶＦ２からＦ３層
に至る結合、及び上記の同一層内での結合によってＦ３
層のノード活性値Ｙｋは次式（４）により決定される。[0019] Through the connection from F2 to F3 layer called the outstar learning rule and the connection within the same layer described above,
The node activation value Yk of the layer is determined by the following equation (4).

【００２０】[0020]

【数４】　　　　　　　　　　　　　　　　　　　　　　　　　
　ｎｄＹｋ／ｄｔ　＝　−αＹｋ　＋　（Ａ　−　Ｙｋ
）［ΣＺｋｊＸｊ　＋　ｇ（Ｘｊ）　＋　Ｓｋ］　−　
ＹｋΣｇ（Ｙｉ）　　　（４）　　　　　　　　　　　
　　　　　　　　　　　　　　　ｊ＝０　　　　　　　
　　　　　　　　　　　　　　　ｉ≠ｋ　　Ｆ２からＦ３層に至る結合値ｚｋｊは次式（５）で
与えられるアウトスター学習則により決定される。[Math 4]
ndYk/dt = −αYk + (A − Yk
) [ΣZkjXj + g(Xj) + Sk] −
YkΣg(Yi) (4)
j=0
i≠k The connection value zkj from F2 to F3 layer is determined by the outstar learning rule given by the following equation (5).

【００２１】[0021]

【数５】ｄｚｋｊ／ｄｔ　＝ｅ（ｔ）（Ｙｋ−ｚｋｊ）Ｘｊ　　
　　　　　　　　　　　　　　　　　　　　　　　　　
　　　　　　　　　（５）ここで、ｅ（ｔ）は式（３）
と同様の時間の経過と共に減衰する単調減衰関数である
。[Formula 5] dzkj/dt = e(t)(Yk-zkj)Xj

(5) Here, e(t) is the formula (3)
It is a monotonous decay function that decays over time, similar to .

【００２２】以上の式に基づいて、局所極大値検出ニュ
ーラルネットワーク群１０７は追跡されるべき各移動物
体の動き分布の学習を行なう。Ｆ１層からＦ２層の結合
はいわゆる整合フィルタとして動作し、Ｆ１層のノード
活性値から形成される信号のピークを尖鋭化すると同時
に不要なノイズ成分を除去する。Ｆ２層は短期メモリと
して動作し、信号Ｓｉのピークを短期的に保持すると同
時にＦ３層からのフィードバック結合によって長期メモ
リとしても動作する。この結果、追跡すべき移動物体が
一時的に他の移動物体、あるいは静止物体によって隠滅
した場合にも、中心座標は式（２）で形成される短期メ
モリ内に記憶維持されているため、後で再び出現した場
合には、すぐさま追跡を開始することが可能になる。局
所極大値検出ニューラルネットワークはＡＲＴ１ネット
ワークのサブセットであると考えられる。ＡＲＴ１ニュ
ーラルネットワークの動作の詳細は、「Ｔｈｅ　Ａｕｇ
ｍｅｎｔｅｄ　ＡＲＴ１　ＮＥｕｒａｌ　Ｎｅｔｗｏｒ
ｋ」，ＩＪＣＮＮ−９１（Ｓｅａｔｔｌｅ），ｐｒｏｃ
ｅｅｄｉｎｇｓ，ＶｏｌＩＩ，ｐｐ４６９−４７２．に
記述されている。Based on the above equations, the local maximum value detection neural network group 107 learns the motion distribution of each moving object to be tracked. The combination of the F1 layer and the F2 layer operates as a so-called matched filter, sharpening the peak of the signal formed from the node activation values of the F1 layer and removing unnecessary noise components. The F2 layer operates as a short-term memory, holding the peak of the signal Si in the short term, and at the same time operates as a long-term memory through feedback coupling from the F3 layer. As a result, even if the moving object to be tracked is temporarily obscured by another moving object or a stationary object, the center coordinates are stored in the short-term memory formed by equation (2), so that they can be tracked later. If it reappears, you can immediately start tracking it. The local maximum detection neural network is considered to be a subset of the ART1 network. For more information on how the ART1 neural network works, see “The Aug.
Mented ART1 NEural Network
k”, IJCNN-91 (Seattle), proc
eedings, Vol II, pp469-472. It is described in

【００２３】次に、図１に於いて、動き追跡サブシステ
ム（ＭＴＳ）１０３はセンサ１０１により得られる画像
から局所極大値検出ニューラルネットワーク群１０７に
より出力される中心情報Ｘｋ、Ｙｋを利用して切りだし
た画像内に於いて、特徴抽出２フィルタ１０８を用いる
ことにより物体の形状に関する特徴を抽出する。物体の
形状を認識するように教育された新奇性フィルタニュー
ラルネットワーク１０９で物体の形状の新奇部分を連想
出力することにより、他の移動物体との衝突、あるいは
他の移動物体、静止物体による隠滅が発生した場合にも
、意図する追跡移動物体の形状を連想出力することがで
きる。このために追跡を確実に続行することが可能にな
る。エッジ検出、フーリエ変換、一般化ハフ変換などの
方法が、形状、輪郭記述のため用いられる。Next, in FIG. 1, the motion tracking subsystem (MTS) 103 cuts the image obtained by the sensor 101 using the center information Xk, Yk output by the local maximum value detection neural network group 107. In the captured image, features related to the shape of the object are extracted by using the feature extraction 2 filter 108. The novelty filter neural network 109, trained to recognize the shape of an object, associates and outputs novel parts of the shape of the object, thereby preventing collisions with other moving objects, or being obscured by other moving or stationary objects. Even when this occurs, the shape of the intended tracked moving object can be output associatively. This makes it possible to continue tracking reliably. Methods such as edge detection, Fourier transform, and generalized Hough transform are used for shape and contour description.

【００２４】図６に局所極大値検出ニューラルネットワ
ーク群１０７から出力される移動物体に関する中心情報
Ｘｋ、Ｙｋを中心として移動物体を包含する箱を決定す
る過程を示す。この図の中の物体は図２のそれに対応し
ている。２個の移動物体の中心情報６０１、６０２は物
体を分離するための箱６０２、６０３によって分離でき
る。物体６０５はセンサ１０１から得られる画像中に含
まれているが、静止しているために動きの情報は得られ
ず、これを包含する箱も生成されない。FIG. 6 shows the process of determining a box containing a moving object centered on the center information Xk, Yk regarding the moving object output from the local maximum value detection neural network group 107. The objects in this figure correspond to those in FIG. Center information 601 and 602 of two moving objects can be separated by boxes 602 and 603 for separating the objects. Although the object 605 is included in the image obtained from the sensor 101, since it is stationary, no information about its movement can be obtained, and a box containing it is not generated.

【００２５】図７に新奇性フィルタニューラルネットワ
ーク１０９の構成を示す。新奇性フィルタの原理、応用
については、Ｔ．Ｋｏｈｏｎｅｎ著「Ｓｅｌｆ−ｏｒｇ
ａｎｉｚａｔｉｏｎ　ａｎｄ　Ａｓｓｏｃｉａｔｉｖｅ
　Ｍｅｍｏｒｙ」（Ｓｐｒｉｎｇｅｒ−Ｖｅｒｌａｇ，
１９８９）、またはＥ．Ａｒｄｉｚｚｏｒｅ著「Ａｐｐ
ｌｉｃａｔｉｏｎ　ｏｆ　ｔｈｅ　Ｎｏｖｅｌｔｙ　Ｆ
ｉｌｔｅｒ　ｔｏ　Ｍｏｔｉｏｎ　Ａｎａｌｙｓｉｓ」
（ＩＮＮＣ１９９０，Ｐａｒｉｓ，ｐｐ４６−４９）に
記述されている。新奇性フィルタニューラルネットワー
クは互いに相互結合された一層構造の完全相互結合型の
ニューラルネットワークであり、各ノードの活性値Ｘｉ
（ｔ）は式（６）に従って変化する。FIG. 7 shows the configuration of the novelty filter neural network 109. Regarding the principles and applications of novelty filters, see T. “Self-org” by Kohonen
anization and association
Memory” (Springer-Verlag,
1989), or E. “App” by Ardizzore
lication of the Novelty F
Ilter to Motion Analysis”
(INNC 1990, Paris, pp46-49). The novelty filter neural network is a fully interconnected neural network with a single layer structure, and the activation value Xi of each node is
(t) varies according to equation (6).

【００２６】[0026]

【数６】　Ｘｉ（ｔ＋１）　＝　Ｘｉ（ｔ）　＋　ΣＷｊｉ（ｔ
）Ｘｊ（ｔ）　　　　　　　　　　　　　　　　　　　
　　　　　　　　　　　　　（６）　　　　　　　　　
　　　　　　　　　ｉ≠ｊ　　学習則は次式（７）で与えられる。[Formula 6] Xi(t+1) = Xi(t) + ΣWji(t
)Xj(t)
(6)
i≠j The learning rule is given by the following equation (7).

【００２７】[0027]

【数７】ｄｗｉｊ／ｄｔ　＝　−αＸｉＸｊ　　　　　　　　　
　　　　　　　　　　　　　　　　　　　　　　　　　
　　　　　　　　　　　　　（７）上記の式（７）より
、入力形状パターンが長時間ネットワークに提示されて
いると各ノードの活性値出力Ｘｉ（ｔ）は徐々に０に近
づく。学習モード時には、学習すべき形状パターンはあ
る一定の時間（ｔ＝ｔ１）の間だけ提示される。。この
間、重みの値ｗｉｊは入力形状パターンを学習し、逆に
各ノードの活性値出力Ｘｉ（ｔ）は徐々に０に近づく。ｔ１の値を適切な値に選ぶことによって一回のパターン
提示により学習は収束する。学習モード時以外の通常時
には、パターン提示時間はｔ１からｔ２（例えば、ｔ１
の１以上の正の整数倍）へと減少させる。この結果、提
示された形状パターンが学習済みのパターンに等しい場
合には、各ノードの活性値出力Ｘｉ（ｔ）は徐々に０に
近づく。また、完全な学習パターンに対して一部分が欠けている
パターンが提示された場合には、欠けている部分のパタ
ーンが新奇パターンとして連想出力される。追跡すべき
物体の形状の学習が終了すると、物体どうしの重なりな
どの影響を受けずに追跡が可能となる。[Formula 7] dwij/dt = −αXiXj

(7) From the above equation (7), if the input shape pattern is presented to the network for a long time, the activation value output Xi(t) of each node gradually approaches 0. In the learning mode, the shape pattern to be learned is presented only for a certain period of time (t=t1). . During this time, the weight value wij learns the input shape pattern, and conversely, the activation value output Xi(t) of each node gradually approaches zero. By selecting an appropriate value for t1, learning converges with one pattern presentation. In normal times other than learning mode, the pattern presentation time varies from t1 to t2 (for example, t1
(a positive integer multiple of 1 or more). As a result, if the presented shape pattern is equal to the learned pattern, the activation value output Xi(t) of each node gradually approaches zero. Furthermore, when a pattern with a part missing from a complete learning pattern is presented, the pattern with the missing part is output as an associated novel pattern. Once the learning of the shape of the object to be tracked is completed, tracking becomes possible without being affected by objects overlapping each other.

【００２８】図８を用いて原理を説明する。８０１は左
から右に移動する物体を、８０２は物体８０１の前を通
過する物体で物体８０１の前部の一部分を隠している。この重なりあう時間が非常に短い時間の場合には、新奇
性フィルタの性質から物体８０１の前部の輪郭を連想出
力することができるために、物体８０１の完全な形状を
再現することが可能である。The principle will be explained using FIG. 801 is an object moving from left to right, and 802 is an object passing in front of the object 801, which partially hides the front part of the object 801. If this overlapping time is very short, the front outline of the object 801 can be output associatively due to the nature of the novelty filter, so it is possible to reproduce the complete shape of the object 801. be.

【００２９】次に、図１に示した移動物体の検出追跡シ
ステムの一実装例を示す。図９に於いて、９０１は画像
センサとしての一般的なＣＣＤカメラ、９０２はＭｅｉ
ｋｏ社のＭ１０筐体に内蔵された画像取り込みのための
ボード、９０３はＭ１０筐体に内蔵された画像表示のた
めのボードである。取り込まれた画像フレームはトラン
スヒ゜ュータバス９０４を介して４枚構成のＭｅｉｋｏ
社のＩｎ−Ｓｕｎトランスヒ゜ュータボード９０５上の
インモス社製Ｔ８００トランスヒ゜ュータにより構成さ
れるトランスヒ゜ュータネットワーク内で処理される。１枚のボードには１６台のインモス社製トランスヒ゜ュ
ータが搭載されているので、全部で６４台のＴ８００ト
ランスヒ゜ュータを処理モジュールとして使用すること
ができる。図１における、動き検出サブシステム（ＩＰ
ＬＳ）１０２、動き追跡サブシステム（ＭＴＳ）１０３
は上記Ｉｎ−Ｓｕｎトランスヒ゜ュータネットワーク上
に実装された。物体追跡の結果は画像表示ボード９０７
に直結されたモニタスクリーン９０６、Ｓｕｎ４／３７
０のモニタスクリーン９０８、Ｍａｃｉｎｔｏｓｈのモ
ニタスクリーン９０９に表示される。Ｓｕｎ４／３７０
のモニタスクリーン９０８、Ｍａｃｉｎｔｏｓｈのモニ
タスクリーン９０９への表示については、Ｘ−Ｗｉｎｄ
ｏｗを用いる。２台の模型列車９１１の画像をＣＣＤカ
メラ９０１で撮ることにより画像の入力を行なう。橋や
トンネルを動いている模型列車９１１のそばに置くこと
によって衝突、隠蔽の時間制御を行なう。Next, an implementation example of the moving object detection and tracking system shown in FIG. 1 will be described. In FIG. 9, 901 is a general CCD camera as an image sensor, 902 is a Mei
A board 903 is a board for displaying images built into the M10 casing manufactured by Ko Corporation for capturing images. The captured image frames are sent to a four-frame Meiko via a transducer bus 904.
The data are processed within a transducer network comprised of Inmos T800 transducers on In-Sun transducer boards 905. Since 16 Inmos transducers are mounted on one board, a total of 64 T800 transducers can be used as processing modules. The motion detection subsystem (IP
LS) 102, motion tracking subsystem (MTS) 103
was implemented on the In-Sun transducer network described above. The object tracking results are displayed on the image display board 907
Monitor screen 906 directly connected to Sun4/37
0 monitor screen 908 and Macintosh monitor screen 909. Sun4/370
Regarding the display on the monitor screen 908 and Macintosh monitor screen 909, please refer to the X-Wind
Use ow. Images are input by taking images of two model trains 911 with a CCD camera 901. Collision and concealment time can be controlled by placing bridges and tunnels near the moving model train 911.

【００３０】図１０に実装した、動き検出サブシステム
（ＩＰＬＳ）１０２、動き追跡サブシステム（ＭＴＳ）
１０３のモジュールブロック図を示す。図中の箱はＴ８
００トランスヒ゜ュータで動くコンパイル済のコードモ
ジュールを示す。例えば、コードモジュール１００５、
１００６、１００７、１００８ａ、１００８ｂなどは標
準ＯＣＣＡＭコンパイラでコンパイルしたものであり、
コードモジュール１００２、１００３、１００４は標準
Ｃ言語コンパイラでコンパイルしたものである。シリン
ダ１００９ａ、１００９ｂ、１０１１はパイプライン状
に接続したＴ８００トランスヒ゜ュータを示す。各ブロ
ックを接続する矢線は、各コードモジュール間の双方向
通信路を示す。Motion Detection Subsystem (IPLS) 102, Motion Tracking Subsystem (MTS), implemented in FIG.
103 shows a module block diagram of 103. The box in the diagram is T8
A compiled code module running on a 00 transducer is shown. For example, code module 1005,
1006, 1007, 1008a, 1008b, etc. were compiled with the standard OCCAM compiler,
Code modules 1002, 1003, and 1004 were compiled with a standard C language compiler. Cylinders 1009a, 1009b, and 1011 represent T800 transducers connected in a pipeline. Arrows connecting each block indicate bidirectional communication paths between each code module.

【００３１】シリンダ１００１はＳｕｎＯＳ４．０３に
基づくＵＮＩＸファイルシステムである。ＣＳ＿ＨＯＳ
Ｔモジュール１００２を介してトランスヒ゜ュータネッ
トワークとＵＮＩＸファイルシステム１００１との通信
を行なう。Ｓｕｎ４／３７０のモニタスクリーン９０８
、Ｍａｃｉｎｔｏｓｈのモニタスクリーン９０９、ＵＮ
ＩＸファイルシステム１００１との通信のためににＣＳ
＿ＨＯＳＴモジュール１００２が使用される。Cylinder 1001 is a UNIX file system based on SunOS 4.03. CS_HOS
Communication between the transmitter network and the UNIX file system 1001 is performed via the T module 1002. Sun4/370 monitor screen 908
, Macintosh monitor screen 909, UN
CS for communication with IX file system 1001
_HOST module 1002 is used.

【００３２】ＴＴＹモジュール１００３は、ＦＧモジュ
ール１００５が標準ビデオ信号を正しくサンプリングで
きるように初期化するためのパラメータ設定を行なうた
めのモジュールである。ＦＧモジュール１００５は、Ｐ
ＡＬ、ＮＴＳＣなどの如何なる標準ビデオ信号を表示、
処理画面の大きさによらずに正しくサンプリングする。ＴＴＹモジュール１００３、ＧＦＸモジュール１００４
はＣＳ＿ＨＯＳＴモジュール１００２の機能を補完する
制御モジュールであり、通常はＣＳ＿ＨＯＳＴモジュー
ル１００２を介してＦＧモジュール１００５、ＦＤモジ
ュール１００７などから構成されるトランスヒ゜ュータ
ネットワークとＵＮＩＸファイルシステム１００１との
入出力が実行される。なお、これらＴＴＹモジュール１
００３、ＧＦＸモジュール１００４はＣ言語で記述され
るが、標準添付される英国Ｍｅｉｋｏ社のＣＳ−ＴＯＯ
ＬＳライブラリによりトランスヒ゜ュータネットワーク
、ＵＮＩＸファイルシステム１００１との双方向通信が
可能である。ＧＦＸモジュール１００４は必要に応じて
ＦＤモジュール１００７のグラフィックモードを切り替
えるために使用される。例えば、追跡が成功している場
合に、これを通知する目的で追跡移動物体をアイコン描
画するためには、ＧＦＸモジュール１００４からのコマ
ンドによりグラフィックモードを切り替えることで行な
う。なお、ＦＤモジュール１００７は上記二つのモード
（通常表示モード、アイコン描画モード）での動作を可
能にするために、ＦＧモジュール１００５から送られる
画像を更新しながらアイコン画像との表示切り替えを行
なうための２画面切り換え機能を持つ。The TTY module 1003 is a module for setting parameters for initializing the FG module 1005 so that it can correctly sample a standard video signal. The FG module 1005 is P
Displays any standard video signal such as AL, NTSC, etc.
To correctly sample regardless of the size of a processing screen. TTY module 1003, GFX module 1004
is a control module that complements the functions of the CS_HOST module 1002, and normally input/output between the UNIX file system 1001 and a transfer network consisting of the FG module 1005, FD module 1007, etc. is performed via the CS_HOST module 1002. . In addition, these TTY modules 1
003, the GFX module 1004 is written in C language, but the CS-TOO from Meiko Co., Ltd., which is included as standard, is written in C language.
The LS library allows bidirectional communication with the transfer network and the UNIX file system 1001. The GFX module 1004 is used to switch the graphic mode of the FD module 1007 as necessary. For example, in order to draw an icon of the tracked moving object in order to notify when the tracking is successful, the graphic mode is switched by a command from the GFX module 1004. Note that in order to enable operation in the above two modes (normal display mode, icon drawing mode), the FD module 1007 uses a function to switch the display between the icon image and the icon image while updating the image sent from the FG module 1005. It has a 2-screen switching function.

【００３３】ＦＧモジュール１００５は画像の取り込み
を制御するための制御ソフトウェア、及び他のモジュー
ル（１００３，１００４，１００６）との通信を行なう
ための通信コードからなる。ＦＧモジュール１００５は
カメラから画像を取り込んでＢＵＦＦモジュール１００
６に画像を転送する。また、システムの動作開始時には
ＴＴＹモジュール１００３からの命令に基づいて画像取
り込みのための各種のパラメータ設定（標準ビデオ信号
の選択、表示、処理画面の大きさなどの設定）を行なう
。画像の取り込みが開始されると画像を水平に２分割し
て、２本のチャンネルを介してＢＵＦＦモジュール１０
０６に画像の転送を行なう。他のモジュールと同様にＦ
Ｇモジュール１００５は、数多くの並行プロセスの集ま
りにより実装されている。例えばそのうちの一つは画像
の取り込みのためのハードウェア制御プロセスであり、
他の大部分のプロセスは他のモジュール（ＴＴＹモジュ
ール１００３、ＧＦＸモジュール１００４、ＢＵＦＦモ
ジュール１００６）とのデータ通信を効率的に並行に実
行するためのものである。例えば、データ通信ルーチンはＴＣＰ／ＩＰなどの標準
ネットワークプロトコルを使用するように実装されてい
る。ＢＵＦＦモジュール１００６も同様に、数多くの並
行プロセスがリンクを介しての高速通信を実現している
。なお本文では、リンクと通信チャネルを同義語として
使用する。ＢＵＦＦモジュール１００６の主たる目的は
、ＦＧモジュール１００５からの出力をバッファリング
してＭａｓｔｅｒ１モジュール（１００８ａ）、Ｍａｓ
ｔｅｒ２モジュール（１００８ｂ）に二つに分割して転
送することである。二つのモジュールに処理を分割する
理由は、なるべく多くのＴ８００トランスヒ゜ュータで
同一処理を同時間に並列実行することにより画像の転送
に要するオーバヘッド時間を最小限に抑えつつ、全体と
しての処理時間を短縮することにある。前述のようにＢ
ＵＦＦモジュール１００６は、画像を二つに等分割して
、各々の分割画像をＭａｓｔｅｒ１モジュール（１００
８ａ）、Ｍａｓｔｅｒ２モジュール（１００８ｂ）に転
送する。ＢＵＦＦモジュール１００６はまた、Ｍａｓｔ
ｅｒ１モジュール（１００８ａ）、Ｍａｓｔｅｒ２モジ
ュール（１００８ｂ）から画像データ、及びグラフィッ
ク命令を受け取る。前記の通り、ＦＧモジュール１００
５からのグラフィックモードを切り替えのための命令、
Ｍａｓｔｅｒ１モジュール（１００８ａ）、Ｍａｓｔｅ
ｒ２モジュール（１００８ｂ）からの画像データを同一
のリンクを通して自動的に切り替えて転送する必要上、
厳密なプロトコルを設定した。このモジュールでも上記
二種類の通信は並行プロセスとして実行することにより
実装された。ＦＤモジュール１００７は画像表示用モニ
タスクリーン９０６の画面表示動作の制御を実行する。The FG module 1005 consists of control software for controlling image capture and communication codes for communicating with other modules (1003, 1004, 1006). The FG module 1005 takes in images from the camera and sends them to the BUFF module 100.
Transfer the image to 6. Furthermore, when the system starts operating, various parameter settings for image capture (setting of standard video signal selection, display, processing screen size, etc.) are performed based on commands from the TTY module 1003. When image capture starts, the image is divided into two horizontally and sent to the BUFF module 10 via two channels.
06, the image is transferred. F like other modules
G module 1005 is implemented by a collection of many parallel processes. For example, one of them is a hardware-controlled process for image capture,
Most of the other processes are for efficiently performing data communication with other modules (TTY module 1003, GFX module 1004, BUFF module 1006) in parallel. For example, data communication routines are implemented using standard network protocols such as TCP/IP. Similarly, the BUFF module 1006 has many parallel processes achieving high-speed communication via links. Note that in this text, link and communication channel are used synonymously. The main purpose of the BUFF module 1006 is to buffer the output from the FG module 1005 and send it to the Master1 module (1008a) and the Master1 module (1008a).
The data is divided into two parts and transferred to the ter2 module (1008b). The reason for dividing the processing into two modules is to minimize the overhead time required for image transfer and shorten the overall processing time by executing the same processing in parallel at the same time on as many T800 transducers as possible. It's about doing. As mentioned above, B
The UFF module 1006 divides the image into two equal parts and sends each divided image to the Master1 module (100
8a) and transferred to the Master2 module (1008b). BUFF module 1006 also
Image data and graphic instructions are received from the er1 module (1008a) and the Master2 module (1008b). As mentioned above, the FG module 100
Instructions for switching graphics modes from 5,
Master1 module (1008a), Master
Due to the need to automatically switch and transfer image data from the r2 module (1008b) through the same link,
Strict protocols were set. In this module as well, the above two types of communication were implemented by executing them as parallel processes. The FD module 1007 controls the screen display operation of the image display monitor screen 906.

【００３４】Ｍａｓｔｅｒ１モジュール（１００８ａ）
、Ｍａｓｔｅｒ２モジュール（１００８ｂ）は画像を簡
単化ＭＯＣフィルタ１０４の各パイプライン（１００９
ａ、１００９ｂ）の中のＴ８００トランスヒ゜ュータの
数と同じ数の小さいブロック画像に分割する。図１０に
於いて、実際には各パイプラインは、２０台のＴ８００
トランスヒ゜ュータから構成されている。Ｍａｓｔｅｒ
１モジュール（１００８ａ）、Ｍａｓｔｅｒ２モジュー
ル（１００８ｂ）は同一構成のモジュールであるので、
便宜上Ｍａｓｔｅｒ１モジュール（１００８ａ）につい
てのみ説明する。Ｍａｓｔｅｒ１モジュール（１００８
ａ）は、簡単化ＭＯＣフィルタパイプラインモジュール
（１００９ａ）に２０分割された画像を送り、後に表示
の必要がある場合には、簡単化ＭＯＣフィルタの処理結
果画像Ｖｉｊ（ｋ，ｔ），ｎを受け取るが、通常は一次
元部分投影分布量Ｓｘ，ｋ，ｎ、Ｓｙ，ｋ，ｎを受け取
る。ここで添字ｎは各分割画像に対し各部分量が得られ
るということを反映した、各分割画像に対応する添字を
示す。上記一次元部分投影分布量はＭａｓｔｅｒ１モジ
ュール（１００８ａ）の制御により、リンクを介してＢ
ＵＦＦ２モジュール１０１０に集計されて一次元投影分
布量Ｓｘ，ｋ、Ｓｙ，ｋが求められる。この同じリンク
を介して２分割された画像データＩｉｊ（ｔ）もＢＵＦ
Ｆ２モジュール１０１０に送られて、新奇性フィルタニ
ューラルネットワークモジュール１０１２での処理に用
いられる。また、全てのリンクは双方向性の通信路であ
るためにグラフィックコマンドの転送路、ＢＵＦＦ２モ
ジュール１０１０、局所極大値検出ニューラルネットワ
ークモジュール１０１１、あるいは新奇性フィルタニュ
ーラルネットワークモジュール１０１２の出力画像の転
送路としても使用される。簡単化ＭＯＣフィルタパイプ
ラインモジュール（簡単化ＭＯＣフィルタモジュール（
１００９ａ）、簡単化ＭＯＣフィルタモジュール（１０
０９ｂ））もＭａｓｔｅｒ１モジュール（１００８ａ）
、Ｍａｓｔｅｒ２モジュール（１００８ｂ）と同様に、
同一構成のモジュールであり、それぞれ２０台のＴ８０
０トランスヒ゜ュータからなる。各Ｔ８００トランスヒ
゜ュータでは全く同じスレーブモジュール１００９ｃが
その上で走る。各スレーブモジュール１００９ｃは各々
最初のマスタモジュール（Ｍａｓｔｅｒ１モジュール（
１００８ａ）、Ｍａｓｔｅｒ２モジュール（１００８ｂ
））に直接接続されているモジュール、及び最後尾のモ
ジュールを除いては、互いに両隣のモジュールに接続さ
れている。Master1 module (1008a)
, the Master2 module (1008b) applies each pipeline (1009) of the MOC filter 104 to simplify the image.
The image is divided into as many small block images as the number of T800 transducers in (a, 1009b). In Figure 10, each pipeline actually consists of 20 T800 units.
It consists of a transformer. Master
Since the Master 1 module (1008a) and Master 2 module (1008b) have the same configuration,
For convenience, only the Master1 module (1008a) will be described. Master1 module (1008
In a), the 20-divided image is sent to the simplified MOC filter pipeline module (1009a), and if it is necessary to display it later, the simplified MOC filter processing result image Vij (k, t),n is sent to the simplified MOC filter pipeline module (1009a). Usually, one-dimensional partial projection distribution amounts Sx,k,n and Sy,k,n are received. Here, the subscript n indicates a subscript corresponding to each divided image, reflecting the fact that each partial amount is obtained for each divided image. The one-dimensional partial projection distribution amount is transmitted to B via a link under the control of the Master 1 module (1008a).
The data are aggregated in the UFF2 module 1010 to obtain one-dimensional projection distribution amounts Sx,k and Sy,k. Image data Iij(t) divided into two via this same link is also BUF
It is sent to the F2 module 1010 and used for processing in the novelty filter neural network module 1012. In addition, since all links are bidirectional communication paths, they can be used as transfer paths for graphic commands, output images of the BUFF2 module 1010, the local maximum value detection neural network module 1011, or the novelty filter neural network module 1012. is also used. Simplified MOC filter pipeline module (Simplified MOC filter module (
1009a), simplified MOC filter module (10
09b)) also Master1 module (1008a)
, similar to the Master2 module (1008b),
Modules with the same configuration, each with 20 T80
Consists of 0 transducers. Each T800 transducer has an identical slave module 1009c running on it. Each slave module 1009c is the first master module (Master1 module).
1008a), Master2 module (1008b
)) and the last module are connected to modules on both sides of each other.

【００３５】各スレーブモジュール１００９ｃは、数多
くの並行プロセスから構成されるが、特に入力される画
像と処理結果Ｖｉｊ（ｋ，ｔ），ｎをバッファリングす
るためのリング状の入出力バッファを持つことにより効
率的なバッファリングを行なっている。スレーブモジュ
ール１００９ｃは、入力リングバッファから取り出した
画像Ｕｉｊ（ｋ，ｔ）ｎに対して逐次、簡単化ＭＯＣフ
ィルタ１０４処理、時空間規格化フィルタ１０５処理を
行なうことによりＶｉｊ（ｋ，ｔ），ｎを計算する。さ
らにＶｉｊ（ｋ，ｔ），ｎに対して、特徴抽出１フィル
タ１０６処理を行なうことによりＳｘ，ｋ，ｎ、Ｓｙ，
ｋ，ｎを計算する。各スレーブモジュール１００９ｃは、出力結果データを
隣あうモジュールのリンクに出力することにより自動的
にこれらの処理結果がマスタモジュール（Ｍａｓｔｅｒ
１モジュール（１００８ａ）、Ｍａｓｔｅｒ２モジュー
ル（１００８ｂ））まで転送されるように設計されてい
る。ＢＵＦＦ２モジュール１０１０は、Ｍａｓｔｅｒ１
モジュール１００９ａ、及びＭａｓｔｅｒ２モジュール
１００９ｂを介して、全ての一次元部分投影分布量Ｓｘ
，ｋ，ｎ、Ｓｙ，ｋ，ｎを受け取ることにより一次元投
影分布量Ｓｘ，ｋ、Ｓｙ，ｋを計算する。また、Ｍａｓ
ｔｅｒ１モジュール１００９ａ、及びＭａｓｔｅｒ２モ
ジュール１００９ｂから送られる、半分に分割された画
像を統合することで原画像Ｉｉｊを得る。Each slave module 1009c is composed of many parallel processes, and in particular has a ring-shaped input/output buffer for buffering input images and processing results Vij(k,t),n. This provides more efficient buffering. The slave module 1009c sequentially performs the simplified MOC filter 104 processing and the spatio-temporal normalization filter 105 processing on the image Uij(k,t)n extracted from the input ring buffer, thereby obtaining Vij(k,t),n. Calculate. Furthermore, by performing feature extraction 1 filter 106 processing on Vij (k, t), n, Sx, k, n, Sy,
Calculate k and n. Each slave module 1009c automatically transfers these processing results to the master module by outputting the output result data to the link of the adjacent module.
1 module (1008a), Master 2 module (1008b)). BUFF2 module 1010 is Master1
Through the module 1009a and the Master2 module 1009b, all one-dimensional partial projection distribution amounts Sx
, k, n, Sy, k, n, one-dimensional projection distribution amounts Sx, k, Sy, k are calculated. Also, Mas
The original image Iij is obtained by integrating the half-divided images sent from the ter1 module 1009a and the Master2 module 1009b.

【００３６】ＢＵＦＦ２モジュール１０１０は、局所極
大値検出ニューラルネットワークモジュール１０１１に
対して投影分布量Ｓｘ，ｋ、Ｓｙ，ｋを送る。局所極大
値検出ニューラルネットワークモジュール１０１１に於
いて、投影分布方向（Ｘ、Ｙ）ごとに、各移動方向に対
応する局所極大値検出ニューラルネットワーク１０７か
らの８つの出力を比較し、最大値を検出することにより
移動物体の中心情報（移動方向、中心座標）が求められ
る。ＢＵＦＦ２モジュール１０１０は、新奇性フィルタ
ニューラルネットワークモジュール１０１２に対して原
画像Ｉｉｊを送る。新奇性フィルタニューラルネットワ
ークモジュール１０１２に於いて、原画像Ｉｉｊから移
動物体を包含する箱（ウィンドウ）領域が求められる。さらに、この箱（ウィンドウ）領域内で特徴抽出２フィ
ルタ１０８により求められた、形状に関する特徴量は新
奇性フィルタ１０９に入力される。ＢＵＦＦ２モジュー
ル１０１０は新奇性フィルタ１０９の出力をデコード解
釈することによりセンサ駆動回路１１０を駆動する。実
際の実装では、局所極大値検出ニューラルネットワーク
モジュール１０１１は、１６台のＴ８００トランスヒ゜
ュータから構成されている。局所極大値検出ニューラルネットワークモジュール１０
１１は１６個の局所極大値検出ニューラルネットワーク
１０７から構成されている。新奇性フィルタモジュール
１０１２は１台のＴ８００トランスヒ゜ュータから構成
されている。The BUFF2 module 1010 sends the projection distribution amounts Sx,k and Sy,k to the local maximum value detection neural network module 1011. In the local maximum value detection neural network module 1011, eight outputs from the local maximum value detection neural network 107 corresponding to each movement direction are compared for each projection distribution direction (X, Y), and the maximum value is detected. As a result, center information (moving direction, center coordinates) of the moving object can be obtained. The BUFF2 module 1010 sends the original image Iij to the novelty filter neural network module 1012. In the novelty filter neural network module 1012, a box (window) region containing the moving object is determined from the original image Iij. Further, the feature amount related to the shape obtained by the feature extraction 2 filter 108 within this box (window) area is input to the novelty filter 109. The BUFF2 module 1010 drives the sensor drive circuit 110 by decoding and interpreting the output of the novelty filter 109. In the actual implementation, the local maximum detection neural network module 1011 consists of 16 T800 transducers. Local maximum value detection neural network module 10
11 is composed of 16 local maximum value detection neural networks 107. Novelty filter module 1012 consists of one T800 transformer.

【００３７】[0037]

【発明の効果】局所極大値検出ニューラルネットワーク
を用いることにより、移動物体の中心の移動方向を容易
に検出できるという効果、新奇性フィルタニューラルネ
ットワークを用いることにより他の静止、移動物体が追
跡されるべき移動物体に与える、衝突、隠蔽などの短期
的影響を排除することにより追跡能力が向上するという
効果がある。[Effects of the invention] By using the local maximum value detection neural network, the moving direction of the center of a moving object can be easily detected, and by using the novelty filter neural network, other stationary and moving objects can be tracked. This has the effect of improving tracking ability by eliminating short-term effects such as collisions and concealment on moving objects.

[Brief explanation of the drawing]

【図１】図１は本発明による移動物体の検出追跡システ
ムのブロック図である。FIG. 1 is a block diagram of a moving object detection and tracking system according to the present invention.

【図２】図２は移動物体の速度成分が生成される様子を
説明するためのブロック図である。FIG. 2 is a block diagram for explaining how a velocity component of a moving object is generated.

【図３】図３は移動物体の速度成分が生成される様子を
説明するためのブロック図である。FIG. 3 is a block diagram for explaining how a velocity component of a moving object is generated.

【図４】図４は移動物体の速度成分が生成される様子を
説明するためのブロック図である。FIG. 4 is a block diagram for explaining how a velocity component of a moving object is generated.

【図５】図５は、入力の局所極大値を検出する３層構造
のニューラルネットワークの構成を示した図である。FIG. 5 is a diagram showing the configuration of a three-layer neural network that detects local maximum values of input.

【図６】図６は移動物体を速度情報によって分離する過
程を示した模式図である。FIG. 6 is a schematic diagram showing a process of separating moving objects based on velocity information.

【図７】図７は新奇性フィルタとして動作する１層構造
のニューラルネットワークの構成を示した図である。FIG. 7 is a diagram showing the configuration of a one-layer neural network that operates as a novelty filter.

【図８】図８は追跡移動物体と他の移動物体の衝突の様
子を示した図である。FIG. 8 is a diagram showing a collision between a tracked moving object and another moving object.

【図９】図９は本発明に基づいて実際に実装されたシス
テムのブロック図を示す。FIG. 9 shows a block diagram of a system actually implemented according to the invention.

【図１０】図１０は図９をより詳細に説明するための図
である。FIG. 10 is a diagram for explaining FIG. 9 in more detail.

[Explanation of symbols]

１０１…センサ１０２…動き検出サブシステム（ＩＰＬＳ，Ｉｎｔｅｒ
ｅｓｔｉｎｇＰｏｉｎｔ　Ｌｏｃａｔｏｒ　Ｓｕｂｓｙ
ｓｔｅｍ）１０３…動き追跡サブシステムモジュール（ＭＴＳ，Ｍ
ｏｔｉｏｎ　Ｔｒａｃｋｉｎｇ　Ｓｕｂｓｙｓｔｅｍ）１０４…簡単化ＭＯＣ（Ｍｏｔｉｏｎ　Ｏｒｉｅｎｔｅ
ｄ　Ｃｏｎｔｒａｓｔ）フィルタ１０５…時空間規格化
フィルタ１０６…特徴抽出１フィルタ１０７…局所極大値検出ニューラルネットワーク群（Ｃ
ＤＮＮ，Ｃｅｎｔｒｏｉｄ　Ｄｅｔｅｃｔｉｏｎ　Ｎｅ
ｕｒａｌ　Ｎｅｔｗｗｏｒｋ）１０８…特徴抽出２フィ
ルタ１０９…新奇性フィルタニューラルネットワーク１１０
…センサ駆動回路６０１…移動物体１（矩形）６０２…移動物体２（円形）６０３…移動物体１を包含する箱（ウィンドウ）６０４
…移動物体２を包含する箱（ウィンドウ）６０５…静止
物体８０１…追跡移動物体８０２…他の静止、または移動物体９０１…ＣＣＤカメラ９０２…画像取り込みボード９０３…Ｍｅｉｋｏ社製Ｍ１０筐体９０４…Ｍｅｉｋｏ社製Ｍ１０筐体（画像取り込みボー
ド、画像表示ボードを内蔵）とＳｕｎ４／３７０（６４台のトランスヒ゜ュータを内蔵）を結合す
るトランスヒ゜ュータバス９０５…Ｓｕｎ４／３７０（６４台のトランスヒ゜ュー
タ）９０６…画像表示用のモニタスクリーン９０７…画
像表示ボード９０８…Ｓｕｎ４／３７０のモニタスクリーン（Ｘ−Ｗ
ｉｎｄｏｗ）９０９…Ｍａｃｉｎｔｏｓｈのモニタスク
リーン（Ｘ−Ｗｉｎｄｏｗ）９１０…イーサーネット９１１…２台の模型列車１００１…Ｕｎｉｘファイルシステム１００２…トランスヒ゜ュータネットワークホスト（Ｃ
Ｓ＿ＨＯＳＴ）モジュール１００３…ＴＴＹ（端末画面）モジュール１００４…Ｇ
ＦＸ（グラフィック）モジュール１００５…ＦＧ（画像
取り込み）モジュール１００６…ＢＵＦＦモジュール１００７…ＦＤ（画像表示）モジュール１００８ａ…Ｍ
ａｓｔｅｒ１モジュール１００８ｂ…Ｍａｓｔｅｒ２モ
ジュール１００９ａ…簡単化ＭＯＣフィルタパイプライ
ン１１００９ｂ…簡単化ＭＯＣフィルタパイプライン２
１００９ｃ…簡単化ＭＯＣフィルタパイプラインを構成
するスレーブモジュール１０１０…ＢＵＦＦ２モジュール１０１１…局所極大値検出ニューラルネットワークモジ
ュール（ＣＤＮＮ，Ｃｅｎｔｒｏｉｄ　Ｄｅｔｅｃｔｉ
ｏｎ　Ｎｅｕｒａｌ　Ｎｅｔｗｗｏｒｋ）１０１２…新
奇性フィルタニューラルネットワークモジュール101...Sensor 102...Motion detection subsystem (IPLS, Inter
estingPoint Locator Subsy
stem) 103...Motion tracking subsystem module (MTS, M
104...Simplified MOC (Motion Tracking Subsystem) 104...Simplified MOC (Motion Tracking Subsystem)
d Contrast) filter 105...Spatio-temporal normalization filter 106...Feature extraction 1 filter 107...Local maximum value detection neural network group (C
DNN, Centroid Detection Ne
ural Network) 108... Feature extraction 2 filter 109... Novelty filter neural network 110
...Sensor drive circuit 601...Moving object 1 (rectangle) 602...Moving object 2 (circular) 603...Box (window) containing moving object 1 604
…Box (window) containing moving object 2 605…Stationary object 801…Tracking moving object 802…Other stationary or moving object 901…CCD camera 902…Image capture board 903…Meiko M10 housing 904…Meiko Corporation A transformer bus 905 that connects a Sun4/370 (built-in 64 transformers) with a M10 chassis (with built-in image capture board and image display board) made by Manufacturer Co., Ltd....Sun4/370 (64 transformers) 906...image display Monitor screen 907 for...Image display board 908...Monitor screen for Sun4/370 (X-W
indow) 909...Macintosh monitor screen (X-Window) 910...Ethernet 911...Two model trains 1001...Unix file system 1002...Transfer network host (C
S_HOST) module 1003...TTY (terminal screen) module 1004...G
FX (graphics) module 1005...FG (image capture) module 1006...BUFF module 1007...FD (image display) module 1008a...M
aster1 module 1008b...Master2 module 1009a...Simplified MOC filter pipeline 11009b...Simplified MOC filter pipeline 2
1009c...Slave module configuring the simplified MOC filter pipeline 1010...BUFF2 module 1011...Local maximum value detection neural network module (CDNN, Centroid Detection)
on Neural Network) 1012...Novelty Filter Neural Network Module

Claims

[Claims]

Claims: 1. A method for determining motion information (optical flow) for each moving direction at a boundary point of a moving object, which is determined by the luminance change of each pixel on an image of consecutive image frames captured by an image input device. one means, a second means for extracting velocity information regarding an object from motion information (optical flow), a third means for taking a projection distribution of velocity information obtained from the second means, a third means In order to output the local maximum value of the projected distribution amount by inputting the projected distribution amount obtained by The fifth means is to extract shape information, the sixth means consists of a single-layer neural network for associative memory of shape information, and the seventh means is to move a specific moving object to the center of the field of view. What is claimed is: 1. A moving object detection and tracking system, characterized by having the system as an object tracking means.

Claim 2: Motion information obtained by the first means (
2. The moving object detection and tracking system according to claim 1, wherein said second means inputs said optical flow to said third means after being averaged and normalized spatiotemporally.

3. In the fifth means, image information regarding the moving object is cut out by providing a box containing the moving object centered on the center information obtained by the fourth means. 2. The moving object detection and tracking system according to item 1.

4. The movement according to claim 1, wherein the seventh means includes a driving device for driving an image input device to track a specific object among the moving objects. Object detection and tracking system.