JP2005202938A - Image search apparatus and image search method

Info

Publication number: JP2005202938A
Application number: JP2004358526A
Authority: JP
Inventors: Noriko Tanaka (田中則子); Masaaki Sato (佐藤正章)
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2003-12-19
Filing date: 2004-12-10
Publication date: 2005-07-28

Abstract

PROBLEM TO BE SOLVED: To provide an image search apparatus and an image search method that realize proper image search and effective search by easily designating a search object.

SOLUTION: The image search apparatus has: a moving area extraction part 102 for extracting a storage object moving area within an image; an area division part 104 for dividing the storage object moving area into storage object block areas; a representative color calculation part 105 for deriving a representative color of each of the storage object block areas constituting the storage object moving area; a DB 106 for storing the representative colors of the respective storage object block areas constituting the storage object moving area; a search area designation part 108 for extracting a search object area within the image; an area division part 109 for dividing the search object area into search object block areas; a representative color calculation part 110 for deriving a representative color of each of the search object block areas constituting the search object area; and a comparison part 111 for comparing the representative colors of the respective storage object block areas stored in the DB 106 with the representative colors of the respective search object block areas constituting the search object area, and performing output according to the results of the comparison.

COPYRIGHT: (C)2005,JPO&NCIPI

Description

The present invention relates to a video search apparatus and a video search method used mainly for video surveillance.

Various video search devices (monitoring systems and personal identification devices) for video surveillance applications such as personal identification have been proposed (see, for example, Patent Documents 1 and 2).

FIG. 14 shows a block diagram of a conventional monitoring system. The monitoring system shown in FIG. 14 includes, within a digital recorder 501: a frame memory 512 that records, via an A/D conversion unit 511, monitoring images captured by an imaging device 502 installed at a predetermined position; a processing control unit 516 that sequentially reads the monitoring images recorded in the frame memory 512, designates at least a partial region of the read monitoring image in response to the user's operation of an operation unit 517, and searches for monitoring images in which motion occurs in the designated region; and a display memory 520, a D/A conversion unit 521, and a display device 505 that receive the monitoring images in the frame memory 512 found by the processing control unit 516 via a data compression unit 513, a write buffer 514, a data recording unit 515, a read buffer 518, and a data decompression unit 519, and reproduce them. In this monitoring system, only video with motion in the region designated by the user's operation of the operation unit 517 is searched for, extracted, and displayed at high speed.

FIG. 15 shows the operation of a conventional personal identification device. The personal identification device shown in FIG. 15 includes: an image input unit 611 that inputs a face image of a person to be identified; a feature point extraction unit 613 that acquires the face image input by the image input unit 611 via a primary face normalization unit 612 and extracts the feature points of the face image; a standard face image feature point database unit 614 that records the feature points of a standard face image; a reference face image feature point database unit 616 that records, for each person, a reference face image deformed so that its feature points match those of the standard face image; a secondary face normalization unit 615 that deforms the face image input by the image input unit 611 so that its feature points match the feature points of the standard face image recorded in the standard face image feature point database unit 614; an image correlation calculation unit 617 that successively obtains the correlation between the face image normalized by the secondary face normalization unit 615 and each reference face image recorded in the reference face image database unit 616; and a determination unit 618 that identifies the person corresponding to the face image input by the image input unit 611 based on the correlation values obtained by the image correlation calculation unit 617. This personal identification device searches for the reference face image with the highest correlation to the input face image and performs personal identification using that reference face image, which makes identification robust to changes such as the direction from which the face is photographed.

In conventional video search systems, the following methods are known for searching for, for example, a desired person image using color information. One method designates an entire image as the search target, extracts color information from that image, and compares it with color information extracted in the same way from the video data stored in a storage device to obtain similar images. Another, used when color information is specified directly as attribute information, has the user input a rectangular or similar region on the image displayed on a monitor via a GUI; the designated region is cut out of the displayed image, and color information is extracted from the cut-out image and used as the search target. The "HSV space" referred to in this specification is disclosed in Non-Patent Document 1, and "histogram intersection" is disclosed in Patent Document 3.

Patent Document 1: JP 2000-132669 A
Patent Document 2: Japanese Patent Laid-Open No. 11-161791
Patent Document 3: JP 2004-252748 A (paragraphs 0012-0013)
Non-Patent Document 1: Supervised by Mikio Takagi and Yoshihisa Shimoda, "Image Analysis Handbook", The University of Tokyo Press, January 1991

However, the conventional monitoring system described above searches for images to reproduce based only on the presence or absence of motion, so it is not necessarily sufficient for video search aimed at identifying a specific object, as in personal identification. The conventional personal identification device described above, on the other hand, cannot search for images and identify a person from the search results when it is unable to acquire images of specific parts of the face such as the eyes, nose, and mouth. There is therefore a need to perform video search appropriately.

Further, in conventional video search systems, a method that designates an image itself as the search key cannot perform a search unless the image to be searched for is already at hand. When color information is specified directly as attribute information, a method in which the user designates the search target region for color extraction by inputting a rectangular region on the image displayed on a monitor via a GUI yields regions that differ from user to user, so it may be difficult to obtain consistent search results.

The present invention has been made to solve these conventional problems, and its object is to provide a video search apparatus and a video search method that can perform video search appropriately and can perform effective searches with a search target that is easy to designate.

The video search apparatus of the present invention comprises: accumulation target moving area extraction means for extracting an accumulation target moving area in a first video and outputting an accumulation target moving area video signal corresponding to the accumulation target moving area; accumulation target block area dividing means for dividing the accumulation target moving area into accumulation target block areas based on the accumulation target moving area video signal and outputting an accumulation target block area video signal corresponding to each of the accumulation target block areas constituting the accumulation target moving area; accumulation target block area representative color deriving means for deriving and outputting a representative color of each of the accumulation target block areas constituting the accumulation target moving area, based on the accumulation target block area video signal corresponding to each accumulation target block area; accumulation target block area representative color accumulating means for accumulating the representative color of each accumulation target block area; search target area extraction means for extracting a search target area in a second video and outputting a search target area video signal corresponding to the search target area; search target block area dividing means for dividing the search target area into search target block areas based on the search target area video signal and outputting a search target block area video signal corresponding to each of the search target block areas constituting the search target area; search target block area representative color deriving means for deriving and outputting a representative color of each of the search target block areas constituting the search target area, based on the search target block area video signal corresponding to each search target block area; and comparison means for comparing the representative color of each accumulation target block area with the representative color of each search target block area and producing an output according to the comparison result.

With this configuration, video search is performed based on the motion in the video and the color of each block constituting the moving area. Because the search uses not only the motion in the video but also the color of the moving area, video search can be performed more appropriately than when video is searched based only on the presence or absence of motion, or when the search requires video of a specific part of the object.
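The block division underlying this configuration can be sketched as follows. This is a minimal illustration only: the function name `split_into_blocks`, the representation of a region as nested lists of RGB tuples, and the uniform rows-by-columns grid are all assumptions, since the source does not fix how the moving area is partitioned.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]   # (R, G, B), each 0-255 (assumed pixel format)
Region = List[List[Pixel]]     # rows of pixels


def split_into_blocks(region: Region, rows: int, cols: int) -> List[Region]:
    """Divide a rectangular region into rows x cols blocks, in row-major order."""
    height, width = len(region), len(region[0])
    blocks = []
    for r in range(rows):
        # Integer arithmetic spreads any remainder pixels across the blocks.
        top, bottom = r * height // rows, (r + 1) * height // rows
        for c in range(cols):
            left, right = c * width // cols, (c + 1) * width // cols
            blocks.append([row[left:right] for row in region[top:bottom]])
    return blocks
```

Applying the same function to both the accumulation target moving area and the search target area keeps the block order aligned, so that block i of one can be compared with block i of the other.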

In the video search apparatus of the present invention, the comparison means may derive, as the comparison result, the difference between the representative color of each accumulation target block area and the representative color of each search target block area.

With this configuration, an accumulation target moving area whose colors approximate those of the search target area can be found among a plurality of accumulation target moving areas, so color-based video search can be performed appropriately.

In the video search apparatus of the present invention, the comparison means may derive the difference, according to a predetermined rule, from a value obtained by quantifying the representative color of each accumulation target block area and a value obtained by quantifying the representative color of each search target block area.
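One possible form of such a rule is sketched below. The source only says "a predetermined rule"; the choice here (sum over blocks of the Euclidean distance between numeric representative colors) and the names `color_difference` and `rank_by_difference` are assumptions made for illustration.

```python
from typing import Dict, List, Sequence, Tuple

Color = Tuple[float, float, float]  # numeric representative color of one block


def color_difference(stored: Sequence[Color], query: Sequence[Color]) -> float:
    """Sum of per-block Euclidean distances between representative colors."""
    if len(stored) != len(query):
        raise ValueError("both areas must have the same number of blocks")
    return sum(
        sum((a - b) ** 2 for a, b in zip(s, q)) ** 0.5
        for s, q in zip(stored, query)
    )


def rank_by_difference(
    database: Dict[str, Sequence[Color]], query: Sequence[Color]
) -> List[Tuple[str, float]]:
    """Return (id, difference) pairs, smallest difference first."""
    return sorted(
        ((key, color_difference(colors, query)) for key, colors in database.items()),
        key=lambda item: item[1],
    )
```

Sorting by ascending difference puts the accumulated moving areas whose colors are closest to the search target area at the top of the result list.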

The video search apparatus of the present invention may further include person determination means that determines, based on the accumulation target moving area video signal, whether the accumulation target moving area satisfies a predetermined condition defined in advance as the condition for being a person, and outputs the accumulation target moving area video signal to the accumulation target block area dividing means when the accumulation target moving area satisfies the predetermined condition.

With this configuration, only the colors of accumulation target moving areas corresponding to people are accumulated, which prevents the colors of unnecessary non-person moving areas from being accumulated when personal identification is performed.

In the video search apparatus of the present invention, the accumulation target block area representative color deriving means may derive, as the representative color of the accumulation target block area, either the color corresponding to the average of the values obtained by applying, to the colors appearing in the accumulation target block area, a conversion method that reduces the influence of luminance changes, or the color whose converted value appears most frequently.

With this configuration, the representative color of the accumulation target block area can be derived appropriately.
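The two ways of deriving a representative color (average value or most frequent value) can be sketched as below. The sketch assumes that the luminance-insensitive conversion is the hue component of the HSV space, which the specification mentions elsewhere; the function names, the 12-bin quantization, and the use of a circular mean are illustrative choices, not details taken from the source.

```python
import colorsys
import math
from collections import Counter


def hues(block):
    """Hue in [0, 1) of every pixel; V (brightness) is deliberately dropped."""
    return [colorsys.rgb_to_hsv(r / 255, g / 255, b / 255)[0]
            for row in block for (r, g, b) in row]


def mean_hue(block):
    """Circular mean of hue, so hues just below 1 and just above 0 average near 0."""
    hs = hues(block)
    x = sum(math.cos(2 * math.pi * h) for h in hs)
    y = sum(math.sin(2 * math.pi * h) for h in hs)
    return (math.atan2(y, x) / (2 * math.pi)) % 1.0


def mode_hue(block, bins=12):
    """Most frequent hue bin, returned as that bin's center hue."""
    counts = Counter(min(int(h * bins), bins - 1) for h in hues(block))
    best_bin, _ = counts.most_common(1)[0]
    return (best_bin + 0.5) / bins
```

A bright red pixel and a dark red pixel share the same hue, so both functions return the same representative value for a block regardless of how strongly it is lit. Gray pixels have no meaningful hue (`colorsys` reports 0 for them), so a practical version would likely treat low-saturation pixels separately.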

In the video search apparatus of the present invention, the search target block area representative color deriving means may derive, as the representative color of the search target block area, either the color corresponding to the average of the values obtained by applying, to the colors appearing in the search target block area, a conversion method that reduces the influence of luminance changes, or the color whose converted value appears most frequently.

With this configuration, the representative color of the search target block area can be derived appropriately.

The video search method of the present invention is a video search method for searching for a moving object in a first video based on information in a second video, and comprises: an accumulation target moving area extraction step of extracting an accumulation target moving area in the first video and outputting an accumulation target moving area video signal corresponding to the accumulation target moving area; an accumulation target block area dividing step of dividing the accumulation target moving area into accumulation target block areas based on the accumulation target moving area video signal and outputting an accumulation target block area video signal corresponding to each of the accumulation target block areas constituting the accumulation target moving area; an accumulation target block area representative color deriving step of deriving and outputting a representative color of each of the accumulation target block areas constituting the accumulation target moving area, based on the accumulation target block area video signal corresponding to each accumulation target block area; an accumulation target block area representative color accumulation control step of accumulating the representative color of each accumulation target block area in accumulation target block area accumulating means; a search target area extraction step of extracting a search target area in the second video and outputting a search target area video signal corresponding to the search target area; a search target block area dividing step of dividing the search target area into search target block areas based on the search target area video signal and outputting a search target block area video signal corresponding to each of the search target block areas constituting the search target area; a search target block area representative color deriving step of deriving and outputting a representative color of each of the search target block areas constituting the search target area, based on the search target block area video signal corresponding to each search target block area; and a comparison step of comparing the representative color of each accumulation target block area with the representative color of each search target block area and producing an output according to the comparison result.

In the video search method of the present invention, the comparison step may derive, as the comparison result, the difference between the representative color of each accumulation target block area and the representative color of each search target block area.

The video search method of the present invention may further include a person determination step of determining, based on the accumulation target moving area video signal, whether the accumulation target moving area satisfies a predetermined condition defined as the condition for being a person, and outputting the accumulation target moving area video signal when the accumulation target moving area satisfies the predetermined condition. In that case, when the accumulation target moving area video signal is output in the person determination step, the accumulation target block area dividing step divides the accumulation target moving area into accumulation target block areas based on that video signal and outputs an accumulation target block area video signal corresponding to each of the accumulation target block areas constituting the accumulation target moving area.

Further, the video search apparatus of the present invention comprises: accumulation target moving area extraction means for extracting an accumulation target moving area in a first video and outputting an accumulation target moving area video signal corresponding to the accumulation target moving area; accumulation target block area dividing means for dividing the accumulation target moving area into accumulation target block areas based on the accumulation target moving area video signal and outputting an accumulation target block area video signal corresponding to each of the accumulation target block areas constituting the accumulation target moving area; accumulation target block area color information generating means for extracting and outputting a color distribution from the video signal contained in each of the accumulation target block areas constituting the accumulation target moving area, based on the accumulation target block area video signal corresponding to each accumulation target block area; accumulation target block area color information accumulating means for accumulating the color information of each accumulation target block area; search target area extraction means for extracting a search target area in a second video and outputting a search target area video signal corresponding to the search target area; search target block area dividing means for dividing the search target area into search target block areas based on the search target area video signal and outputting a search target block area video signal corresponding to each of the search target block areas constituting the search target area; search target block area color information generating means for extracting and outputting a color distribution from the video signal contained in each of the search target block areas constituting the search target area, based on the search target block area video signal corresponding to each search target block area; and comparison means for comparing the color distribution of each accumulation target block area with the color distribution of the search target block areas and producing an output according to the comparison result.

With this configuration, video search is performed based on the motion in the video and the color of each block constituting the moving area. Because the search uses not only the motion in the video but also the color distribution of the moving area, video search can be performed more appropriately than when video is searched based only on the presence or absence of motion, or when the search requires video of a specific part of the object.

Further, in the video search apparatus of the present invention, the comparison means may derive, as the comparison result, the degree of coincidence in color appearance frequency between the color distribution of each accumulation target block area and the color distribution of each search target block area.
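A degree of coincidence between two color distributions can be computed with histogram intersection, which the specification names in its background discussion. The sketch below is illustrative only: it quantizes RGB coarsely rather than working in HSV space, and the function names and the `levels` parameter are assumptions.

```python
from collections import Counter


def color_histogram(block, levels=4):
    """Normalized histogram over coarsely quantized RGB colors."""
    step = 256 // levels
    counts = Counter((r // step, g // step, b // step)
                     for row in block for (r, g, b) in row)
    total = sum(counts.values())
    return {color: n / total for color, n in counts.items()}


def histogram_intersection(h1, h2):
    """Sum of bin-wise minima: 1.0 for identical distributions, 0.0 for disjoint ones."""
    return sum(min(p, h2.get(color, 0.0)) for color, p in h1.items())
```

Because the histograms are normalized, blocks of different pixel counts can be compared directly, and the result can be thresholded or used to rank accumulated moving areas.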

Further, in the video search apparatus of the present invention, the comparison means may derive, as the comparison result, the degree of coincidence in color appearance frequency between the color distribution of each accumulation target block area and any one of the color distributions of the search target block areas.
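Matching each accumulated block against any one of the search blocks, rather than only the block in the same position, might look like the following. This is a sketch under assumptions: distributions are plain dictionaries mapping a color label to its relative frequency, and taking the maximum intersection over all search blocks is one plausible reading of "any one of", not a rule stated in the source.

```python
def intersection(h1, h2):
    """Histogram intersection of two normalized color distributions."""
    return sum(min(p, h2.get(color, 0.0)) for color, p in h1.items())


def best_match_per_stored_block(stored_hists, query_hists):
    """For each stored block, the highest intersection with any query block."""
    return [max(intersection(s, q) for q in query_hists) for s in stored_hists]
```

This position-independent comparison tolerates cases where the block grid of the search target area does not line up with that of the accumulated moving area.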

Further, the video search apparatus of the present invention may include person determination means that determines, based on the accumulation target moving area video signal, whether the accumulation target moving area satisfies a predetermined condition defined as the condition for being a person, and outputs the accumulation target moving area video signal to the accumulation target block area dividing means when the accumulation target moving area satisfies the predetermined condition.

With this configuration, accumulation target moving areas in which the proportions of occurrences of each color are similar to those of the search target area can be found among a plurality of accumulation target moving areas, so color-based video search can be performed appropriately.

Further, the video search apparatus of the present invention comprises: accumulation target moving area extraction means for extracting an accumulation target moving area in a first video and outputting an accumulation target moving area video signal corresponding to the accumulation target moving area; accumulation target block area dividing means for dividing the accumulation target moving area into accumulation target block areas based on the accumulation target moving area video signal and outputting an accumulation target block area video signal corresponding to each of the accumulation target block areas constituting the accumulation target moving area; accumulation target block area color information generating means for extracting and outputting a color distribution from the video signal contained in each of the accumulation target block areas constituting the accumulation target moving area, based on the accumulation target block area video signal corresponding to each accumulation target block area; accumulation target block area color information accumulating means for accumulating the color information of each accumulation target block area; search target color information generating means for extracting one search target point in a second video and, based on the search target area video signal corresponding to the search target point, extracting and outputting color information of the search target point; and comparison means for generating representative color information from the color distribution of each accumulation target block area, comparing the representative color information with the search target color information, and producing an output according to the comparison result.

With this configuration, even when no image of the search target exists, the user only has to indicate one point in a color information image presented by the search apparatus, and video search is performed based not only on the motion in the captured video but also on the color of the moving area. Video search can therefore be performed more appropriately than when it is based only on the presence or absence of motion or when it requires video of a specific part of the object, and the search target can be designated easily.

Further, in the video search apparatus of the present invention, the comparison means derives, as the comparison result, the similarity in color appearance frequency between the representative color information generated from the color distribution of each accumulation target block area and the color information of the search target point.

With this configuration, accumulation target moving areas similar to the color information of the search target point can be found among a plurality of accumulation target moving areas, so color-based video search can be performed appropriately.
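A search from a single designated color point could be sketched as follows. Everything specific here is an assumption for illustration: each stored distribution is a dictionary mapping an RGB color to its relative frequency, the representative color is taken to be the most frequent entry, similarity is a normalized Euclidean distance in RGB, and the threshold of 0.9 is arbitrary. The source does not specify any of these.

```python
def representative_color(histogram):
    """Most frequent color of a block's distribution (ties: first seen)."""
    return max(histogram, key=histogram.get)


def point_similarity(point_color, rep_color, max_value=255):
    """1.0 for identical colors, 0.0 for opposite corners of the RGB cube."""
    dist = sum((a - b) ** 2 for a, b in zip(point_color, rep_color)) ** 0.5
    return 1.0 - dist / (3 ** 0.5 * max_value)


def search_by_point(point_color, stored, threshold=0.9):
    """Ids of stored moving areas with any block whose representative color is close."""
    return [
        key for key, hists in stored.items()
        if any(point_similarity(point_color, representative_color(h)) >= threshold
               for h in hists)
    ]
```

With this shape, the user only supplies one color (for example, picked from a palette image), and no query image or region is needed.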

Further, the video search apparatus of the present invention comprises: accumulation target moving area extraction means for extracting an accumulation target moving area in a video and outputting an accumulation target moving area video signal corresponding to the accumulation target moving area; person motion scene determination means for identifying the same person across consecutive frame images in the video based on the accumulation target moving area video signal, outputting the accumulation target moving area video signal, and outputting a motion scene end signal when the motion stops; accumulation target block area dividing means for dividing the accumulation target moving area into accumulation target block areas based on the accumulation target moving area video signal and outputting an accumulation target block area video signal corresponding to each of the accumulation target block areas constituting the accumulation target moving area; accumulation target block area color information generating means for extracting, for each identified person, a color feature amount from the video signal contained in each of the accumulation target block areas constituting the accumulation target moving area, based on the accumulation target block area video signal corresponding to each accumulation target block area, and, on receiving the motion scene end signal, generating and outputting, for each identified person, a color distribution corresponding to each accumulation target block area; search area setting means for setting a search target area for each identified person based on the accumulation target moving area video signal and, on receiving the motion scene end signal, outputting for each identified person a search target area video signal corresponding to the search target area; area color information accumulating means for accumulating the color information of each accumulation target block area and the search target area video signal corresponding to the search target area; person representative image list display means for acquiring the search target area video signal from the area color information accumulating means and outputting a display list; search target area color information generating means for generating and outputting a color distribution from the video signal contained in the search target area, based on the search target area video signal; and comparison means for comparing the color distribution of each accumulation target block area with the color distribution of the search target area and producing an output according to the comparison result.

With this configuration, the user can select the person to be searched for while visually checking one image per person, so the search target area can be designated easily. In addition, because selecting a person causes the search to use a search target area generated in advance, a color-based video search can be performed appropriately.
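The per-person accumulation described above can be illustrated with a minimal sketch. It assumes that each tracked person already has an identifier and that each frame supplies quantized color labels per block; the class and method names (`PersonColorAccumulator`, `add_frame`, `end_scene`) are hypothetical and do not appear in the specification.

```python
from collections import Counter, defaultdict


class PersonColorAccumulator:
    """Collects color features per person and per block until the scene ends."""

    def __init__(self):
        # person_id -> block name -> Counter of quantized colors
        self._features = defaultdict(lambda: defaultdict(Counter))

    def add_frame(self, person_id, block_colors):
        """block_colors maps a block name to the colors seen in this frame."""
        for block, colors in block_colors.items():
            self._features[person_id][block].update(colors)

    def end_scene(self, person_id):
        """On the motion scene end signal, emit one normalized distribution per block."""
        blocks = self._features.pop(person_id, {})
        result = {}
        for block, counts in blocks.items():
            total = sum(counts.values())
            result[block] = {color: n / total for color, n in counts.items()}
        return result
```

Calling `end_scene` plays the role of the motion scene end signal: only then is the color distribution of each block fixed for that person.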

The video search apparatus of the present invention may also be configured such that the accumulation target block area dividing means determines the division positions according to the shape of a person.

With this configuration, a person's clothing can be selectively designated as the search target area, so that video can be searched not only by color but also, more narrowly, by clothing.

The video search method of the present invention is a video search method for searching for a moving object in a first video based on information in a second video, the method comprising: an accumulation target moving area extraction step of extracting an accumulation target moving area in the first video and outputting an accumulation target moving area video signal corresponding to the accumulation target moving area; an accumulation target block area dividing step of dividing the accumulation target moving area into accumulation target block areas based on the accumulation target moving area video signal and outputting an accumulation target block area video signal corresponding to each of the accumulation target block areas constituting the accumulation target moving area; an accumulation target block area color information generation step of extracting and outputting, based on the accumulation target block area video signal corresponding to each of the accumulation target block areas, a color distribution from the video signal contained in each of the accumulation target block areas constituting the accumulation target moving area; an accumulation target block area color information storage control step of storing the color information of each of the accumulation target block areas in accumulation target block area storage means; a search target area extraction step of extracting a search target area in the second video and outputting a search target area video signal corresponding to the search target area; a search target block area dividing step of dividing the search target area into search target block areas based on the search target area video signal and outputting a search target block area video signal corresponding to each of the search target block areas constituting the search target area; a search target block area color information generation step of extracting and outputting, based on the search target block area video signal corresponding to each of the search target block areas, a color distribution from the video signal contained in each of the search target block areas constituting the search target area; and a comparison step of comparing the color distribution of each of the accumulation target block areas with the color distribution of the search target block areas and producing an output according to the comparison result.

The video search method of the present invention may also be configured such that the comparison step derives, as the comparison result, the degree of coincidence in color appearance frequency between the color distribution of each of the accumulation target block areas and the color distribution of each of the search target block areas.
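One common way to express a degree of coincidence between two color frequency distributions is histogram intersection. The sketch below assumes that measure; the specification does not name one. The function names, the 8-level quantization per RGB channel, and the pairing of block i with block i are likewise illustrative assumptions.

```python
from collections import Counter


def color_histogram(pixels, bins=8):
    """Quantize (r, g, b) pixels (0-255 each) and return normalized frequencies."""
    step = 256 // bins
    counts = Counter((r // step, g // step, b // step) for r, g, b in pixels)
    total = sum(counts.values())
    return {color: n / total for color, n in counts.items()}


def coincidence(hist_a, hist_b):
    """Histogram intersection: 1.0 for identical distributions, 0.0 for disjoint ones."""
    return sum(min(freq, hist_b.get(color, 0.0)) for color, freq in hist_a.items())


def blockwise_coincidence(stored_blocks, query_blocks):
    """Compare block i of the stored area with block i of the search target area."""
    return [coincidence(s, q) for s, q in zip(stored_blocks, query_blocks)]
```

The per-block scores could then be averaged or thresholded to decide whether a stored moving area matches; that choice is left open here.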

The video search method of the present invention may also be configured such that the comparison step derives, as the comparison result, the degree of coincidence in color appearance frequency between the color distribution of each of the accumulation target block areas and any one of the color distributions of the search target block areas.
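Matching against any one of the search target block areas can be sketched as taking, for each stored block, the best score over all query blocks. This is only an illustration: the intersection measure, the function names, and the assumption that at least one query block exists are not taken from the specification.

```python
def coincidence(hist_a, hist_b):
    """Histogram intersection of two normalized color frequency dictionaries."""
    return sum(min(freq, hist_b.get(color, 0.0)) for color, freq in hist_a.items())


def best_match_per_stored_block(stored_blocks, query_blocks):
    """For each stored block, return (best score, index of the best query block).

    Assumes query_blocks is non-empty.
    """
    results = []
    for stored in stored_blocks:
        scores = [coincidence(stored, query) for query in query_blocks]
        best = max(range(len(scores)), key=scores.__getitem__)
        results.append((scores[best], best))
    return results
```

Compared with a strict block-to-block pairing, this variant tolerates a search target area whose blocks are not aligned with the stored ones.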

The video search method of the present invention is a video search method for searching for a moving object in a first video based on information in a second video, the method comprising: an accumulation target moving area extraction step of extracting an accumulation target moving area in the first video and outputting an accumulation target moving area video signal corresponding to the accumulation target moving area; an accumulation target block area dividing step of dividing the accumulation target moving area into accumulation target block areas based on the accumulation target moving area video signal and outputting an accumulation target block area video signal corresponding to each of the accumulation target block areas constituting the accumulation target moving area; an accumulation target block area color information generation step of extracting and outputting, based on the accumulation target block area video signal corresponding to each of the accumulation target block areas, a color distribution from the video signal contained in each of the accumulation target block areas constituting the accumulation target moving area; an accumulation target block area color information storage control step of storing the color information of each of the accumulation target block areas in accumulation target block area storage means; a search target color information generation step of extracting one search target point in the second video and extracting and outputting color information of the search target point based on a search target area video signal corresponding to the search target point; and a comparison step of generating representative color information from the color distribution of each of the accumulation target block areas, comparing the representative color information with the search target color information, and producing an output according to the comparison result.

The video search method of the present invention may also be configured such that the comparison step derives, as the comparison result, the similarity in color appearance frequency between the representative color information generated from the color distribution of each of the accumulation target block areas and the color information of the search target point.
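As a rough illustration of comparing a block's representative color with the color of a single search target point, the sketch below scores two RGB colors by hue and saturation only. The specification does not define the similarity measure. The weighting, the 0.9 threshold, and the function names are assumptions made for this example.

```python
import colorsys


def hs_similarity(rgb_a, rgb_b):
    """Similarity in [0, 1] from hue and saturation, ignoring brightness."""
    h1, s1, _ = colorsys.rgb_to_hsv(*(c / 255 for c in rgb_a))
    h2, s2, _ = colorsys.rgb_to_hsv(*(c / 255 for c in rgb_b))
    dh = abs(h1 - h2)
    dh = min(dh, 1.0 - dh) * 2.0   # circular hue distance scaled to [0, 1]
    dh *= min(s1, s2)              # hue is unreliable for near-gray colors
    ds = abs(s1 - s2)
    return 1.0 - (dh + ds) / 2.0


def matching_blocks(representative_colors, point_color, threshold=0.9):
    """Indices of blocks whose representative color resembles the point color."""
    return [i for i, color in enumerate(representative_colors)
            if hs_similarity(color, point_color) >= threshold]
```

Ignoring brightness means a dark red and a bright red are treated as the same color, which suits footage recorded under varying lighting.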

The video search method of the present invention may also comprise: an accumulation target moving area extraction step of extracting an accumulation target moving area in a video and outputting an accumulation target moving area video signal corresponding to the accumulation target moving area; a person motion scene determination step of identifying, based on the accumulation target moving area video signal, the same person across consecutive frame images in the video, outputting the accumulation target moving area video signal, and outputting a motion scene end signal when the motion ceases; an accumulation target block area dividing step of dividing the accumulation target moving area into accumulation target block areas based on the accumulation target moving area video signal and outputting an accumulation target block area video signal corresponding to each of the accumulation target block areas constituting the accumulation target moving area; an accumulation target block area color information generation step of extracting, for each same person and based on the accumulation target block area video signal corresponding to each of the accumulation target block areas, a color feature amount from the video signal contained in each of the accumulation target block areas constituting the accumulation target moving area, and, on receiving the motion scene end signal, generating and outputting for each same person a color distribution corresponding to each of the accumulation target block areas; a search area setting step of setting a search target area for each same person based on the accumulation target moving area video signal and, on receiving the motion scene end signal, outputting for each same person a search target area video signal corresponding to the search target area; an area color information storage control step of storing, in area color information storage means, the color information of each of the accumulation target block areas and the search target area video signal corresponding to the search target area; a person representative image list display step of acquiring the search target area video signal and outputting a display list; a search target area color information generation step of generating and outputting, based on the search target area video signal, a color distribution from the video signal contained in the search target area; and a comparison step of comparing the color distribution of each of the accumulation target block areas with the color distribution of the search target area and producing an output according to the comparison result.

The video search apparatus of the present invention is also characterized in that there are a plurality of accumulation target moving areas. With this configuration, the movements of a plurality of people can be tracked and searched for simultaneously.

According to the present invention, the video search is performed based on motion in the video and on the color of each block constituting the area of that motion. The search can therefore be performed more appropriately than when it is based only on the presence or absence of motion, or when video of a specific part of the object is required for the search.

In addition, according to the present invention, when entering a search target the user need only select a person image while visually checking the person images displayed on a monitor. This makes it easy to designate a search target for efficiently retrieving video of persons wearing similarly colored clothing.

Hereinafter, a video search apparatus according to an embodiment of the present invention will be described with reference to the drawings.

FIG. 1 is a block diagram of a video search apparatus according to the first embodiment of the present invention. In FIG. 1, the video search apparatus 100 includes: video input units 101a to 101n (hereinafter collectively referred to as the "video input unit 101" where appropriate), each of which captures a subject and generates and outputs a video signal; moving area extraction units 102a to 102n (hereinafter collectively the "moving area extraction unit 102"), each of which receives the video signal from the corresponding video input unit 101, extracts a moving area in the video (accumulation target moving area), and outputs a video signal corresponding to that area (accumulation target moving area video signal); person determination units 103a to 103n (hereinafter collectively the "person determination unit 103"), each of which receives the accumulation target moving area video signal from the corresponding moving area extraction unit 102, determines whether the accumulation target moving area satisfies a predetermined condition for being a person, and outputs the accumulation target moving area video signal when the condition is satisfied; area dividing units 104a to 104n (hereinafter collectively the "area dividing unit 104"), each of which receives the accumulation target moving area video signal from the corresponding person determination unit 103, divides the accumulation target moving area into a plurality of block areas (accumulation target block areas), and outputs a video signal corresponding to each accumulation target block area (accumulation target block area video signal); representative color calculation units 105a to 105n (hereinafter collectively the "representative color calculation unit 105"), each of which receives the accumulation target block area video signals from the corresponding area dividing unit 104 and calculates and outputs a representative color of each accumulation target block area constituting the accumulation target moving area; a database (DB) 106 that stores the representative colors of the accumulation target block areas constituting the accumulation target moving area, received from each representative color calculation unit 105; a keyboard 107 with which the user enters instructions; a search target area designation unit 108 that receives the video signal from the video input unit 101, designates a predetermined area in the video to be searched (search target area) in accordance with the user's operation of the keyboard 107, and outputs a video signal corresponding to the search target area (search target area video signal); an area dividing unit 109 that receives the search target area video signal from the search target area designation unit 108, divides the search target area into a plurality of block areas (search target block areas), and outputs a video signal corresponding to each search target block area (search target block area video signal); a representative color calculation unit 110 that receives the search target block area video signals from the area dividing unit 109 and calculates and outputs a representative color of each search target block area constituting the search target area; a comparison unit 111 that compares the representative color of each search target block area constituting the search target area, received from the representative color calculation unit 110, with the representative color of each accumulation target block area constituting an accumulation target moving area in the DB 106, and outputs a search result according to the comparison result; a list display unit 112 that generates and outputs a list of the search results from the comparison unit 111 so that they can be displayed; a video selection unit 113 that selects and outputs, from among the video signals from the video input units 101, the video signal to be recorded; a compression unit 114 that compresses the video signal from the video selection unit 113; a storage 115 that stores the compressed video signal from the compression unit 114; a video display instruction unit 116 that instructs video display in accordance with the user's operation of the keyboard 107; a decompression unit 117 that decompresses the compressed video signal in the storage 115 in accordance with an instruction from the video display instruction unit 116; and a display unit 118 that displays the list of search results from the comparison unit 111 and the video corresponding to the video signal decompressed by the decompression unit 117.
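The role of the comparison unit 111 can be illustrated with a minimal sketch that ranks stored moving areas by how close their block representative colors are to those of the search target area. The distance measure (mean Euclidean RGB distance over corresponding blocks), the threshold value, and the names `region_distance` and `search` are assumptions for illustration only.

```python
import math


def region_distance(stored_colors, query_colors):
    """Mean Euclidean RGB distance over corresponding blocks."""
    pairs = list(zip(stored_colors, query_colors))
    return sum(math.dist(s, q) for s, q in pairs) / len(pairs)


def search(db, query_colors, max_distance=60.0):
    """Return ids of stored areas within max_distance, closest first.

    db maps an area id to its list of block representative colors.
    """
    hits = [(region_distance(colors, query_colors), area_id)
            for area_id, colors in db.items()]
    return [area_id for dist, area_id in sorted(hits) if dist <= max_distance]
```

The returned identifiers correspond to what a list display unit would turn into a result list.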

The operation of the video search apparatus 100 configured as described above will be described with reference to FIGS. 2 and 3. First, the operation when representative colors are accumulated will be described with reference to FIG. 2.

The video input unit 101 captures a subject and generates a video signal (S101). The video input unit 101 then outputs the generated video signal to the corresponding moving area extraction unit 102. For example, the video input unit 101a outputs the generated video signal to the moving area extraction unit 102a.

On receiving the video signal from the corresponding video input unit 101, the moving area extraction unit 102 performs background difference processing, which calculates the difference value between the video corresponding to that video signal and a background video held in advance (S102). Next, the moving area extraction unit 102 determines whether there is motion in the video corresponding to the input video signal (S103). Specifically, if the difference value obtained by the background difference processing in S102 is larger than a predetermined value, the moving area extraction unit 102 determines that there is motion in the video; otherwise, it determines that there is no motion. Alternatively, the moving area extraction unit 102 may perform processing other than background difference processing in S102 and determine in S103 whether there is motion in the video according to the result of that other processing.
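Steps S102 and S103 can be sketched as follows, under simplifying assumptions: frames are grayscale, represented as lists of rows, and motion is declared when enough pixels differ from the background by more than a threshold. The threshold values and function names are illustrative and are not specified in the text.

```python
def background_difference(frame, background):
    """Per-pixel absolute difference of two equally sized grayscale frames."""
    return [[abs(p - b) for p, b in zip(frame_row, bg_row)]
            for frame_row, bg_row in zip(frame, background)]


def has_motion(diff, pixel_threshold=30, min_changed_pixels=1):
    """True if at least min_changed_pixels differ by more than pixel_threshold."""
    changed = sum(1 for row in diff for d in row if d > pixel_threshold)
    return changed >= min_changed_pixels
```

When `has_motion` returns `False`, processing would return to capturing the next frame (S101).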

If there is no motion in the video corresponding to the video signal input to the moving area extraction unit 102, the operations from the capturing of the subject and generation of the video signal by the video input unit 101 (S101) onward are repeated.

On the other hand, if there is motion in the video corresponding to the video signal input to the moving area extraction unit 102, the moving area extraction unit 102 extracts the moving area in that video (accumulation target moving area) (S104). The moving area extraction unit 102 then outputs the accumulation target moving area video signal corresponding to the extracted accumulation target moving area to the corresponding person determination unit 103. For example, the moving area extraction unit 102a outputs the accumulation target moving area video signal corresponding to the extracted accumulation target moving area to the person determination unit 103a.
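A simple way to picture the extraction in S104 is to take the bounding box of all pixels that changed relative to the background. The text does not specify how the area is delimited, so the bounding-box approach, the threshold, and the names `moving_region` and `crop` are assumptions of this sketch.

```python
def moving_region(diff, pixel_threshold=30):
    """Return (top, left, bottom, right), inclusive, around changed pixels, or None."""
    rows = [y for y, row in enumerate(diff)
            if any(d > pixel_threshold for d in row)]
    cols = [x for row in diff for x, d in enumerate(row) if d > pixel_threshold]
    if not rows:
        return None
    return (min(rows), min(cols), max(rows), max(cols))


def crop(frame, box):
    """Cut the boxed area out of a frame given as a list of rows."""
    top, left, bottom, right = box
    return [row[left:right + 1] for row in frame[top:bottom + 1]]
```

The cropped pixels stand in for the accumulation target moving area video signal passed on to the person determination unit.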

On receiving the accumulation target moving area video signal from the corresponding moving area extraction unit 102, the person determination unit 103 performs elliptical Hough processing on the video of the accumulation target moving area corresponding to that signal (S105). Next, the person determination unit 103 determines whether the accumulation target moving area corresponding to the input accumulation target moving area video signal satisfies the condition for being a person (S106). Specifically, if the elliptical Hough processing in S105 detects an elliptical area resembling a human face in the video of the accumulation target moving area, the person determination unit 103 determines that the accumulation target moving area satisfies the condition for being a person; if no such elliptical area is detected, it determines that the condition is not satisfied. Alternatively, the person determination unit 103 may perform processing other than elliptical Hough processing in S105 (for example, processing that derives the shape, size, and so on of the entire accumulation target moving area) and determine in S106 whether the accumulation target moving area satisfies the condition for being a person according to the result of that other processing.

If the accumulation target moving area corresponding to the accumulation target moving area video signal does not satisfy the condition for being a person, the operations from the capturing of the subject and generation of the video signal by the video input unit 101 (S101) onward are repeated.

On the other hand, if the accumulation target moving area corresponding to the accumulation target moving area video signal satisfies the condition for being a person, the person determination unit 103 outputs the input accumulation target moving area video signal to the corresponding area dividing unit 104. For example, the person determination unit 103a outputs the input accumulation target moving area video signal to the area dividing unit 104a.

The area dividing unit 104 receives the accumulation target moving area video signal from the corresponding person determination unit 103 and divides the accumulation target moving area corresponding to that signal into a plurality of block areas (accumulation target block areas) (S107). For example, to divide the accumulation target moving area into four, the area dividing unit 104 counts the pixels of the accumulation target moving area in the vertical and horizontal directions and identifies the vertical midpoint and the horizontal midpoint. The area dividing unit 104 then divides the accumulation target moving area in two vertically, using the vertical midpoint as the division position, and in two horizontally, using the horizontal midpoint as the division position, thereby dividing the accumulation target moving area into four accumulation target block areas. The number of divisions of the accumulation target moving area and the shape of the accumulation target block areas are not particularly limited.
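The four-way division at the midpoints (S107) can be sketched directly. The area is assumed to be a rectangular grid of pixels given as a list of rows; the function name and the block labels are hypothetical.

```python
def split_into_four(region):
    """Split a 2-D pixel grid at its vertical and horizontal midpoints."""
    height = len(region)
    width = len(region[0]) if height else 0
    mid_y, mid_x = height // 2, width // 2
    top, bottom = region[:mid_y], region[mid_y:]
    return {
        "top_left": [row[:mid_x] for row in top],
        "top_right": [row[mid_x:] for row in top],
        "bottom_left": [row[:mid_x] for row in bottom],
        "bottom_right": [row[mid_x:] for row in bottom],
    }
```

With an odd height or width, the extra row or column goes to the bottom or right block in this sketch. The same routine would serve for dividing a search target area.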

The area dividing unit 104 then outputs the video signal corresponding to each accumulation target block area constituting the accumulation target moving area (accumulation target block area video signal) to the corresponding representative color calculation unit 105. For example, the area dividing unit 104a outputs the accumulation target block area video signals to the representative color calculation unit 105a.

On receiving, from the corresponding area dividing unit 104, the accumulation target block area video signals corresponding to the accumulation target block areas constituting the accumulation target moving area, the representative color calculation unit 105 calculates, based on these signals, the representative color of each accumulation target block area constituting the accumulation target moving area (S108). Specifically, the representative color calculation unit 105 calculates, as the representative color of an accumulation target block area, either the color corresponding to the average of values determined for the colors appearing in that block area according to a predetermined rule (a conversion rule that reduces the influence of luminance changes, such as expressing the RGB color system in the HSV color system), or the color that appears most frequently in that block area.
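Both options for the representative color (S108) can be sketched with the standard library. The sketch assumes that the brightness-insensitive value is the hue/saturation pair of HSV, that hue is averaged as an angle, and that the most frequent color is found after 8-level quantization per channel. These choices and the function names are illustrative, not taken from the specification.

```python
import colorsys
import math
from collections import Counter


def mean_hue_saturation(pixels):
    """Average hue (circular mean) and saturation of non-empty (r, g, b) pixels."""
    sin_sum = cos_sum = sat_sum = 0.0
    for r, g, b in pixels:
        h, s, _v = colorsys.rgb_to_hsv(r / 255, g / 255, b / 255)
        sin_sum += math.sin(2 * math.pi * h)
        cos_sum += math.cos(2 * math.pi * h)
        sat_sum += s
    hue = (math.atan2(sin_sum, cos_sum) / (2 * math.pi)) % 1.0
    return hue, sat_sum / len(pixels)


def most_frequent_color(pixels, bins=8):
    """Most common quantized (r, g, b) bin among the pixels."""
    step = 256 // bins
    counts = Counter((r // step, g // step, b // step) for r, g, b in pixels)
    return counts.most_common(1)[0][0]
```

Dropping the V component is what makes the averaged value less sensitive to lighting changes between cameras or times of day.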

The representative color calculation unit 105 then outputs the representative color of each accumulation target block area constituting the accumulation target moving area to the DB 106. The DB 106 stores the representative colors of the accumulation target block areas constituting the accumulation target moving area received from the representative color calculation unit 105 (S109). The DB 106 may also store, in association with the representative colors of the accumulation target block areas constituting the accumulation target moving area, the identification information (ID) of the camera 102 that captured the video containing that accumulation target moving area, the capture date and time, and data of a thumbnail video obtained by reducing the video corresponding to that accumulation target moving area.
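One possible shape for a DB 106 entry is sketched below as an in-memory record. The field names, the use of a Python list as the store, and the class names `RegionRecord` and `RepresentativeColorStore` are hypothetical; the specification only lists which items may be stored together.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List, Optional, Tuple


@dataclass
class RegionRecord:
    """Representative colors of one moving area plus optional metadata."""
    block_colors: List[Tuple[int, int, int]]  # one color per block, in block order
    camera_id: str = ""
    captured_at: Optional[datetime] = None
    thumbnail: bytes = b""


class RepresentativeColorStore:
    """Minimal in-memory stand-in for the DB."""

    def __init__(self):
        self._records = []

    def add(self, record):
        """Store a record and return its index as an identifier."""
        self._records.append(record)
        return len(self._records) - 1

    def all(self):
        return list(self._records)
```

Keeping the camera ID and capture time next to the colors lets a search result point back to the recorded video.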

By repeating the processing of S101 to S109 described above, the DB 106 accumulates, for each of a plurality of accumulation target moving areas, the representative color of each accumulation target block area constituting that accumulation target moving area.

Next, the operation at the time of video search will be described with reference to FIG. 3.
When the user, viewing the video displayed on the display unit 118, operates the keyboard 107 to give an instruction designating a person in that video as the search target (a search instruction), the search area designation unit 108 accepts the search instruction (S201). Next, the search area designation unit 108 receives the video signal corresponding to the video displayed on the display unit 118 from the image input unit 101 and extracts the moving area in that video designated by the user (the search target area) (S202). The search area designation unit 108 then outputs a search target area video signal corresponding to the extracted search target area to the area dividing unit 109.

The area dividing unit 109 receives the search target area video signal from the search area designation unit 108 and divides the search target area corresponding to that signal into a plurality of block areas (search target block areas) (S203). For example, when dividing the search target area into four, the area dividing unit 109, like the area dividing unit 104 described above, counts the pixels of the search target area in the vertical and horizontal directions and identifies the vertical midpoint and the horizontal midpoint. The area dividing unit 109 then divides the search target area in two vertically at the vertical midpoint and in two horizontally at the horizontal midpoint, thereby dividing the search target area into four search target block areas. The number of divisions of the search target area and the shape of the search target block areas are not particularly limited. Further, the area dividing unit 109 outputs a video signal corresponding to each search target block area constituting the search target area (a search target block area video signal) to the representative color calculation unit 110.
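The four-way division at the midpoints can be illustrated with a short sketch. It assumes the area is a rectangular list of pixel rows, and the function name is hypothetical. Integer division is used for the midpoints, so for odd sizes the lower and right blocks are one pixel larger.

```python
def split_into_four(area):
    """Split a rectangular area (list of pixel rows) at its vertical and horizontal midpoints."""
    height = len(area)        # count pixels in the vertical direction
    width = len(area[0])      # count pixels in the horizontal direction
    mid_y = height // 2
    mid_x = width // 2
    top, bottom = area[:mid_y], area[mid_y:]
    return [
        [row[:mid_x] for row in top],      # upper-left block
        [row[mid_x:] for row in top],      # upper-right block
        [row[:mid_x] for row in bottom],   # lower-left block
        [row[mid_x:] for row in bottom],   # lower-right block
    ]


# Hypothetical 4x4 area whose "pixels" are just labels.
area = [
    [1, 2, 3, 4],
    [5, 6, 7, 8],
    [9, 10, 11, 12],
    [13, 14, 15, 16],
]
blocks = split_into_four(area)
```

Since the text leaves the number and shape of the blocks open, this is one possible choice and not the only valid one.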

Upon receiving, from the area dividing unit 109, the search target block area video signals corresponding to the search target block areas constituting the search target area, the representative color calculation unit 110 calculates, based on these signals, the representative color of each search target block area constituting the search target area (S204). Specifically, like the representative color calculation unit 105 described above, the representative color calculation unit 110 calculates, as the representative color of a search target block area, either the color corresponding to the average of values determined according to a predetermined rule for the colors appearing in that block area, or the color that appears most frequently in that block area. Further, the representative color calculation unit 110 outputs the representative color of each search target block area constituting the search target area to the comparison unit 111.

Upon receiving, from the representative color calculation unit 110, the representative color of each search target block area constituting the search target area, the comparison unit 111 reads from the DB 106 the representative color of each accumulation target block area constituting one of the accumulation target moving areas (S205).

Further, the comparison unit 111 calculates the distance between the numerical values corresponding to the representative colors of the accumulation target block areas constituting the read accumulation target moving area and the numerical values corresponding to the representative colors of the search target block areas constituting the input search target area (S206).

As the representative color, a numerical value corresponding to RGB color information, or a numerical value corresponding to lightness, saturation, or the like, is used. The quantification is performed so that the more similar two colors are, the smaller the difference between their numerical values. For example, suppose that the accumulation target moving area consists of four accumulation target block areas (A1, A2, A3, A4) whose representative colors have the numerical values a1, a2, a3, and a4, respectively, and that the search target area consists of four search target block areas (B1 corresponding to A1, B2 corresponding to A2, B3 corresponding to A3, and B4 corresponding to A4) whose representative colors have the numerical values b1, b2, b3, and b4, respectively. In this case, the comparison unit 111 expresses the numerical values of the representative colors of the accumulation target block areas as the coordinates (a1, a2, a3, a4) in a four-dimensional space, expresses the numerical values of the representative colors of the search target block areas as the coordinates (b1, b2, b3, b4) in the same space, and calculates the Euclidean distance between these coordinates. Note that the comparison unit 111 may calculate a distance other than the Euclidean distance.
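The distance calculation of S206 for the four-block example can be sketched as follows. The function name and the sample values are hypothetical, and each representative color is assumed to have already been reduced to a single number per block.

```python
import math


def block_color_distance(stored, query):
    """Euclidean distance between (a1, ..., an) and (b1, ..., bn)."""
    if len(stored) != len(query):
        raise ValueError("both areas must be divided into the same number of blocks")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(stored, query)))


# Hypothetical values: a1..a4 for the stored area, b1..b4 for the search target area.
a = (0.0, 0.0, 0.0, 0.0)
b = (1.0, 1.0, 1.0, 1.0)
distance = block_color_distance(a, b)  # sqrt(1 + 1 + 1 + 1)
```

A different metric, for example the sum of absolute differences, could be substituted without changing the surrounding flow, which matches the remark that distances other than the Euclidean distance may be used.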

Next, the comparison unit 111 determines whether the calculated distance between the numerical values corresponding to the representative colors of the accumulation target block areas constituting the accumulation target moving area and the numerical values corresponding to the representative colors of the search target block areas constituting the search target area is within a predetermined threshold (S207). It is desirable that the user be able to set this threshold freely.

When the distance between the numerical values corresponding to the representative colors of the accumulation target block areas constituting the accumulation target moving area and the numerical values corresponding to the representative colors of the search target block areas constituting the search target area is within the predetermined threshold, the comparison unit 111 outputs, as a search result, the data related to the representative colors of the accumulation target block areas constituting that accumulation target moving area (for example, the identification information (ID) of the camera 102 that shot the video including the accumulation target moving area, the shooting date and time, and thumbnail video data obtained by reducing the video of the accumulation target moving area) to the list display unit 112. The list display unit 112 adds this data received from the comparison unit 111 to the search result list (S208).

After the list display unit 112 adds the data to the search result list (S208), or after the comparison unit 111 determines that the distance between the numerical values corresponding to the representative colors of the accumulation target block areas constituting the accumulation target moving area and the numerical values corresponding to the representative colors of the search target block areas constituting the search target area exceeds the predetermined threshold (negative determination in S207), the comparison unit 111 determines whether the representative colors of the accumulation target block areas constituting all of the accumulation target moving areas in the DB 106 have been read (S209). If the DB 106 still contains an accumulation target moving area whose block area representative colors have not been read, the operations from the reading of those representative colors by the comparison unit 111 (S205) onward are repeated.

On the other hand, when the comparison unit 111 has read the representative colors of the accumulation target block areas constituting all of the accumulation target moving areas in the DB 106, the operations from the acceptance of a search instruction by the search area designation unit 108 (S201) onward are repeated. In addition, the search result list generated by the list display unit 112 is displayed on the display unit 118, allowing the user to see the search results.
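The loop over S205 to S209 can be summarized in a brief sketch. The dictionary layout, the key names, and the sample data are assumptions made for illustration. The sketch presumes that each stored area carries one numeric representative color per block together with its associated metadata.

```python
import math


def search_by_block_colors(stored_areas, query_colors, threshold):
    """Return the metadata of every stored area whose block colors lie within the threshold."""
    results = []
    for area in stored_areas:  # S205, repeated until S209 finds nothing unread
        distance = math.sqrt(
            sum((a - b) ** 2 for a, b in zip(area["colors"], query_colors))
        )  # S206
        if distance <= threshold:  # S207
            results.append(area["meta"])  # S208: add to the search result list
    return results


# Hypothetical DB contents and query.
stored_areas = [
    {"colors": (0.10, 0.20, 0.30, 0.40), "meta": {"camera_id": "cam-1"}},
    {"colors": (0.90, 0.80, 0.70, 0.60), "meta": {"camera_id": "cam-2"}},
]
hits = search_by_block_colors(stored_areas, (0.10, 0.20, 0.30, 0.45), threshold=0.1)
```

Raising the user-settable threshold widens the result list, and lowering it narrows the list to closer color matches.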

Although FIG. 3 has been described for the case where the user designates a moving area as the search target area, the video search can be performed in the same way when the person to be searched for is stationary and the area of that stationary person is designated as the search target area. In this case, as in FIG. 3, the search area designation unit 108 extracts the area designated by the user (the search target area) and outputs a search target area video signal corresponding to the extracted search target area to the area dividing unit 109.

As described above, in the video search apparatus 100, when a moving area in the video shot by the video input unit 101 satisfies the condition of being a person, the moving area is divided into a plurality of accumulation target block areas, and the representative color of each accumulation target block area is calculated and accumulated in the DB 106. At the time of video search, the search target area designated by the user is divided into a plurality of search target block areas, the representative color of each search target block area is calculated, and the data related to the representative colors of the accumulation target block areas constituting an accumulation target area whose representative colors approximate those of the search target block areas constituting the search target area is presented to the user as the search result. That is, because the video search is based not only on motion in the captured video but also on the color of the moving area, the search can be performed more appropriately than when it is based only on the presence or absence of motion, or when video of a specific part of the object is required for the search.

Next, FIG. 4 shows a block diagram of a video search apparatus according to the second embodiment of the present invention. In FIG. 4, components that are the same as in the first embodiment are given the same reference numerals as in the video search apparatus 100, and their description is omitted.

The video search apparatus 400 includes: color information generation units 401a to 401n (hereinafter collectively referred to as the "color information generation unit 401" where appropriate), each of which receives the accumulation target block area video signals from the corresponding area dividing unit 104, extracts the color feature amount of each accumulation target block area constituting the accumulation target moving area, and generates and outputs a color distribution; a database (DB) 402 that accumulates the color distribution of each accumulation target block area constituting the accumulation target moving area received from each color information generation unit 401; a color information generation unit 403 that receives the search target block video signals from the area dividing unit 109, extracts the color feature amount of each search target block constituting the search target area, and generates and outputs a color distribution; and a comparison unit 404 that compares the color distribution of each search target block area constituting the search target area, received from the color information generation unit 403, with the color distribution of each accumulation target block area constituting an accumulation target moving area in the DB 402, and outputs a search result according to the comparison result.

The operation of the video search apparatus 400 configured as described above will be described with reference to FIGS. 5 and 6. First, the operation at the time of color distribution accumulation will be described with reference to FIG. 5. Processing steps that are the same as in the first embodiment are given the same reference numerals, and their description is simplified.

The video input unit 101 shoots a subject and generates a video signal (S101). The video input unit 101 then outputs the generated video signal to the corresponding moving area extraction unit 102.

Upon receiving the video signal from the corresponding video input unit 101, the moving area extraction unit 102 performs background difference processing, which calculates the difference between the video corresponding to that video signal and a background video held in advance (S102). Next, the moving area extraction unit 102 determines whether there is motion in the video corresponding to the input video signal (S103).
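A minimal sketch of the background difference processing (S102) and the motion decision (S103) follows. The text does not specify how the difference values are turned into a decision, so the per-pixel threshold, the minimum number of changed pixels, and the use of grayscale frames here are all assumptions, as is the function name.

```python
def has_motion(frame, background, pixel_threshold=30, min_changed_pixels=1):
    """Compare a grayscale frame with the stored background and report whether it moved.

    Both images are lists of rows of 0-255 intensities with identical dimensions.
    """
    changed = sum(
        1
        for frame_row, background_row in zip(frame, background)
        for value, reference in zip(frame_row, background_row)
        if abs(value - reference) > pixel_threshold  # per-pixel background difference
    )
    return changed >= min_changed_pixels


# Hypothetical 2x2 images: one pixel differs strongly from the background.
background = [[10, 10], [10, 10]]
frame = [[10, 10], [10, 200]]
moving = has_motion(frame, background)
```

In practice the changed pixels would also be grouped into a region so that the moving area itself can be extracted in S104; that step is omitted here.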

When there is motion in the video corresponding to the video signal input to the moving area extraction unit 102, the moving area extraction unit 102 extracts the moving area in that video (the accumulation target moving area) (S104). The moving area extraction unit 102 then outputs an accumulation target moving area video signal corresponding to the extracted accumulation target moving area to the corresponding person determination unit 103.

Upon receiving the accumulation target moving area video signal from the corresponding moving area extraction unit 102, the person determination unit 103 performs elliptical Hough processing on the video of the accumulation target moving area corresponding to that signal (S105). Next, the person determination unit 103 determines whether the accumulation target moving area corresponding to the input accumulation target moving area video signal satisfies the condition of being a person (S106).

When the accumulation target moving area corresponding to the accumulation target moving area video signal satisfies the condition of being a person, the person determination unit 103 outputs the input accumulation target moving area video signal to the corresponding area dividing unit 104.

The area dividing unit 104 receives the accumulation target moving area video signal from the corresponding person determination unit 103 and divides the accumulation target moving area corresponding to that signal into a plurality of block areas (accumulation target block areas) (S107).

Further, the area dividing unit 104 outputs a video signal corresponding to each accumulation target block area constituting the accumulation target moving area (an accumulation target block area video signal) to the corresponding color information generation unit 401.

Upon receiving, from the corresponding area dividing unit 104, the accumulation target block area video signals corresponding to the accumulation target block areas constituting the accumulation target moving area, the color information generation unit 401 generates, based on these signals, the color information of each accumulation target block area constituting the accumulation target moving area (S501). Specifically, the color information generation unit 401 acquires, as color information, the RGB value of each pixel of the accumulation target block area video, converts it into the HSV space (see Non-Patent Document 1), and creates a histogram using H and S (hereinafter referred to as a color distribution). Although the color space is converted from RGB to HSV here, other color spaces (for example, XYZ or YCrCb) can also be used in the usual way (by generating the histogram on the XY plane or the CrCb plane).
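The color distribution of S501 can be illustrated with the sketch below, which builds a two-dimensional H-S histogram from the pixels of one block. The bin counts and the function name are hypothetical; the text does not state how finely H and S are quantized.

```python
import colorsys


def hs_histogram(pixels, hue_bins=8, saturation_bins=4):
    """Build a 2-D histogram over hue and saturation from 8-bit (R, G, B) pixels."""
    histogram = [[0] * saturation_bins for _ in range(hue_bins)]
    for r, g, b in pixels:
        hue, saturation, _value = colorsys.rgb_to_hsv(r / 255.0, g / 255.0, b / 255.0)
        # Both components lie in [0, 1]; clamp so that 1.0 falls in the last bin.
        hue_index = min(int(hue * hue_bins), hue_bins - 1)
        saturation_index = min(int(saturation * saturation_bins), saturation_bins - 1)
        histogram[hue_index][saturation_index] += 1
    return histogram


# Hypothetical block: two pure-red pixels and one pure-blue pixel.
block = [(255, 0, 0), (255, 0, 0), (0, 0, 255)]
distribution = hs_histogram(block)
```

Ignoring V means that two pixels differing only in brightness land in the same bin, which is one plausible reason for building the histogram from H and S alone.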

Further, the color information generation unit 401 outputs the color distribution of each accumulation target block area constituting the accumulation target moving area to the DB 402. The DB 402 accumulates these color distributions received from the color information generation unit 401 (S502). The DB 402 may also accumulate, in association with the color distributions of the accumulation target block areas constituting an accumulation target moving area, the identification information (ID) of the camera 102 that shot the video including that accumulation target moving area, the shooting date and time, and thumbnail video data obtained by reducing the video corresponding to that accumulation target moving area.

By repeating the processing of S101 to S107, S501, and S502 described above, the DB 402 comes to hold, for each of a plurality of accumulation target moving areas, the color distribution of each accumulation target block area constituting that accumulation target moving area.

Next, the operation at the time of video search will be described with reference to FIG. 6.
When the user, viewing the video displayed on the display unit 118, operates the keyboard 107 to give an instruction designating a person in that video as the search target (a search instruction), the search area designation unit 108 accepts the search instruction (S201). Next, the search area designation unit 108 receives the video signal corresponding to the video displayed on the display unit 118 from the image input unit 101 and extracts the moving area in that video designated by the user (the search target area) (S202). The search area designation unit 108 then outputs a search target area video signal corresponding to the extracted search target area to the area dividing unit 109.

The area dividing unit 109 receives the search target area video signal from the search area designation unit 108 and divides the search target area corresponding to that signal into a plurality of block areas (search target block areas) (S203). Further, the area dividing unit 109 outputs a video signal corresponding to each search target block area constituting the search target area (a search target block area video signal) to the color information generation unit 403.

Upon receiving, from the area dividing unit 109, the search target area video signals corresponding to the search target area, the color information generation unit 403 generates, based on these signals, the color distribution of the search target area (S601). Specifically, like the color information generation unit 401 described above, the color information generation unit 403 acquires, as color information, the RGB value of each pixel of the accumulation target block area video, converts it into the HSV space, and creates a color distribution. Although the color space is converted from RGB to HSV here, other color spaces (for example, XYZ or YCrCb) can also be used in the usual way (by generating the histogram on the XY plane or the CrCb plane). Further, the color information generation unit 403 outputs the color distribution of the search target area to the comparison unit 404.

Upon receiving the color distribution of the search target area from the color information generation unit 403, the comparison unit 404 reads from the DB 402 the color distribution of each accumulation target block area constituting one of the accumulation target moving areas (S602).

Further, the comparison unit 404 compares the color distribution of each accumulation target block area constituting the read accumulation target moving area with the color distribution of each search target block area constituting the input search target area, and calculates a degree of coincidence (S603).

Because the color distribution is a two-dimensional histogram using H and S, the degree of coincidence may be calculated using, for example, histogram intersection (see Patent Document 3). With this method, when the color distributions match completely, the degree of coincidence equals the sum of all frequencies contained in the color distributions of the search target block areas constituting the search target area; when they do not match at all, the degree of coincidence is 0.
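Histogram intersection in its usual unnormalized form sums, over all bins, the smaller of the two frequencies. The sketch below assumes that form and a histogram stored as a list of rows; the function name and the sample data are hypothetical. It reproduces the two limiting cases stated above.

```python
def histogram_intersection(query_histogram, stored_histogram):
    """Sum of bin-wise minima of two 2-D histograms with identical bin layouts."""
    return sum(
        min(query_count, stored_count)
        for query_row, stored_row in zip(query_histogram, stored_histogram)
        for query_count, stored_count in zip(query_row, stored_row)
    )


# Hypothetical 2x2 color distributions.
query = [[3, 0], [0, 2]]
disjoint = [[0, 4], [1, 0]]

full_match = histogram_intersection(query, query)    # equals the total frequency, 5
no_match = histogram_intersection(query, disjoint)   # no shared bins, so 0
```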

The degree of coincidence between the color distribution of each accumulation target block area constituting the accumulation target moving area and the color distribution of each search target block area constituting the search target area is obtained; these values are referred to as partial degrees of coincidence. The degree of coincidence is calculated by summing all of the partial degrees of coincidence. Alternatively, depending on the purpose, the accumulation target block areas to be compared may be selected, partial degrees of coincidence calculated only for those block areas, and only those partial degrees of coincidence summed to obtain the degree of coincidence.
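The summation of partial degrees of coincidence, including the optional selection of block areas, might look like the following sketch. It assumes that block i of the search target area is compared with block i of the stored area and that histogram intersection is the partial measure; the function names and the `selected` parameter are hypothetical.

```python
def intersection(histogram_a, histogram_b):
    """Bin-wise minimum summed over two 2-D histograms (the partial degree of coincidence)."""
    return sum(
        min(a, b)
        for row_a, row_b in zip(histogram_a, histogram_b)
        for a, b in zip(row_a, row_b)
    )


def total_coincidence(query_histograms, stored_histograms, selected=None):
    """Sum the partial degrees of coincidence over all blocks, or only over `selected` indices."""
    indices = range(len(query_histograms)) if selected is None else selected
    return sum(intersection(query_histograms[i], stored_histograms[i]) for i in indices)


# Hypothetical two-block areas, each block holding a 1x2 color distribution.
query_blocks = [[[2, 0]], [[0, 3]]]
stored_blocks = [[[1, 1]], [[0, 5]]]

overall = total_coincidence(query_blocks, stored_blocks)                     # 1 + 3
second_only = total_coincidence(query_blocks, stored_blocks, selected=[1])   # 3
```

Restricting `selected` to, say, the blocks covering the torso would let a search focus on clothing color, which is one way to read "depending on the purpose".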

Although histogram intersection is used here, other methods of comparing two color distributions are conceivable. For example, color distribution positions whose frequency exceeds a preset threshold may be extracted, and the color distribution positions and frequency ratios compared among them.

Next, the comparison unit 404 determines whether the calculated degree of coincidence between the color distributions of the accumulation target block areas constituting the accumulation target moving area and the color distributions of the search target block areas constituting the search target area is within a predetermined threshold (S604). It is desirable that the user be able to set this threshold freely.

When the degree of coincidence between the color distributions of the accumulation target block areas constituting the accumulation target moving area and the color distributions of the search target block areas constituting the search target area is within the predetermined threshold, the comparison unit 404 outputs, as a search result, the data related to the accumulation target moving area having those accumulation target block areas (for example, the identification information (ID) of the camera 102 that shot the video including the accumulation target moving area, the shooting date and time, and thumbnail video data obtained by reducing the video of the accumulation target moving area) to the list display unit 112. The list display unit 112 adds this data received from the comparison unit 404 to the search result list (S605).

After the list display unit 112 adds the data to the search result list (S605), or after the comparison unit 404 determines that the degree of coincidence between the color distributions of the accumulation target block areas constituting the accumulation target moving area and the color distributions of the search target block areas constituting the search target area exceeds the predetermined threshold (negative determination in S604), the comparison unit 404 determines whether the color distributions of the accumulation target block areas constituting all of the accumulation target moving areas in the DB 402 have been read (S606). If the DB 402 still contains an accumulation target moving area whose block area color distributions have not been read, the operations from the reading of those color distributions by the comparison unit 404 (S602) onward are repeated.

On the other hand, when the comparison unit 404 has read the color distributions of the accumulation target block areas constituting all of the accumulation target moving areas in the DB 402, the operations from the acceptance of a search instruction by the search area designation unit 108 (S201) onward are repeated. In addition, the search result list generated by the list display unit 112 is displayed on the display unit 118, allowing the user to see the search results.

Although FIG. 6 has been described for the case where the user designates a moving area as the search target area, video search can be performed in the same way when the person to be searched for is stationary and the area of that stationary person is designated as the search target area. In this case, as in FIG. 6, the search area designation unit 108 extracts the area designated by the user (search target area) and outputs the search target area video signal corresponding to the extracted search target area to the color information generation unit 403.

As described above, in the video search apparatus 400, when a moving area in the video captured by the video input unit 101 satisfies the condition of being a person, the moving area is divided into a plurality of accumulation target block areas, and the color distribution of each accumulation target block area is generated and accumulated in the DB 402. At the time of video search, the color distribution of the search target area designated by the user is generated, and data related to an accumulation target area whose accumulation target block areas are similar to the color distribution of the search target area is presented to the user as a search result. That is, because video search is performed based not only on motion in the captured video but also on the color of the moving area, video search can be performed more appropriately than when it is based only on the presence or absence of motion, or when video of a specific part of the object is required for the search.

Next, FIG. 12 shows a video search apparatus according to a third embodiment of the present invention. In FIG. 12, components that are the same as in the second embodiment are given the same reference numerals as in the video search apparatus 400, and their description is omitted.

In FIG. 12, the video search apparatus 1100 according to this embodiment of the present invention has: a color information generation unit 1101 that, in response to the user's operation of the keyboard 107, extracts color information from the color to be searched for and outputs it; and a comparison unit 1102 that generates average color information for each continuous distribution region from the color distribution of each accumulation target block constituting an accumulation target moving area in the DB 402, compares the search target color information from the color information generation unit 1101 with the generated average color information of each accumulation target block, and outputs a search result according to the comparison result.

The operation of the video search apparatus 1100 configured as described above will be described with reference to FIG. 5 and FIG. 13. Processing steps that are the same as in the first and second embodiments are given the same reference numerals, and their description is simplified. The operation at the time of accumulating color information is the same as in the video search apparatus 400, so the description with reference to FIG. 5 is omitted.

The operation at the time of video search will be described with reference to FIG. 13.
When the user operates the keyboard 107 to issue a search start request, the color information generation unit 1101 requests the display unit 118 to display the color information image that it holds. The color information image may be, for example, a color palette of the kind commonly used for color settings in PC software, or a plurality of items of color information set in advance in the search apparatus. When the user views the displayed color information image and gives an instruction designating the color to be searched for (search instruction), the color information generation unit 1101 accepts this search instruction (S1201). The search instruction is given by designating one point on the color information image. The color information generation unit 1101 acquires the RGB value of the pixel corresponding to the point designated by the user in the image displayed on the display unit 118, converts it to HSV space to generate color information (S1202), and outputs it to the comparison unit 1102.

When the search target color information from the color information generation unit 1101 is input, the comparison unit 1102 reads from the DB 402 the color distribution of each accumulation target block area constituting one of the accumulation target moving areas (S602). The comparison unit 1102 then extracts, from the color distribution of each accumulation target block read from the DB 402, only the color information whose frequency is greater than a preset fixed value, and generates representative color information for each color distribution region that is continuous on the two-dimensional coordinates defined by H and S (S1203). The comparison unit 1102 further compares the input search target color information with the representative color information generated from the color distribution of each accumulation target block area constituting the read accumulation target moving area, and calculates a similarity (S1204). Because the search target color information and the representative color information are each a single point on the two-dimensional H-S coordinates, it suffices to obtain the distance between these two points.
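Steps S1202 to S1204 can be illustrated with a minimal sketch. It assumes that the color distribution is held as a dictionary mapping (H bin, S bin) to a count, that "continuous" means 4-connected bins, and that the representative color of a region is its count-weighted centroid; none of these choices is specified in the description, and the function names are hypothetical. The wraparound of hue at the ends of its range is ignored for simplicity.

```python
import colorsys
from math import hypot

def rgb_to_hs(r, g, b):
    """Convert an 8-bit RGB value to (hue, saturation), each in [0, 1]."""
    h, s, _v = colorsys.rgb_to_hsv(r / 255.0, g / 255.0, b / 255.0)
    return h, s

def representative_colors(hist, min_count):
    """Return one weighted centroid per connected region of frequent bins.

    hist maps (h_bin, s_bin) -> count. Bins whose count does not exceed
    min_count are discarded; the remaining bins are grouped into
    4-connected regions (an assumption about what "continuous" means).
    """
    strong = {bin_ for bin_, count in hist.items() if count > min_count}
    seen = set()
    reps = []
    for start in sorted(strong):
        if start in seen:
            continue
        seen.add(start)
        stack, region = [start], []
        while stack:  # flood fill over neighbouring frequent bins
            h, s = stack.pop()
            region.append((h, s))
            for nb in ((h + 1, s), (h - 1, s), (h, s + 1), (h, s - 1)):
                if nb in strong and nb not in seen:
                    seen.add(nb)
                    stack.append(nb)
        total = sum(hist[p] for p in region)
        reps.append((sum(p[0] * hist[p] for p in region) / total,
                     sum(p[1] * hist[p] for p in region) / total))
    return reps

def hs_distance(a, b):
    """Euclidean distance between two points on the H-S plane."""
    return hypot(a[0] - b[0], a[1] - b[1])
```

A smaller distance corresponds to a closer color match, which fits the later check that the similarity is "within" a threshold.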

Next, the comparison unit 1102 determines whether the similarity between the representative color information calculated from the color distribution of each accumulation target block area constituting the accumulation target moving area and the search target color information is within a predetermined threshold (S1205). If the similarity is within the predetermined threshold, the comparison unit 1102 outputs to the list display unit 112, as a search result, data related to the accumulation target moving area having those accumulation target block areas (for example, the identification information (ID) of the camera 102 that captured the video containing the accumulation target moving area, the capture date and time, and thumbnail video data obtained by reducing the video of the accumulation target moving area). The list display unit 112 adds the data related to the accumulation target moving area received from the comparison unit 1102 to the search result list (S605).

After the list display unit 112 adds an entry to the search result list (S605), or after the comparison unit 1102 determines that the similarity between the representative color information generated from the color distribution of each accumulation target block area constituting the accumulation target moving area and the search target color information exceeds the predetermined threshold (negative determination in S1205), the comparison unit 1102 determines whether the color distributions of the accumulation target block areas constituting all the accumulation target moving areas in the DB 402 have been read (S606). If the DB 402 still holds color distributions of accumulation target block areas constituting an accumulation target moving area that has not yet been read, the operations from the reading of those color distributions by the comparison unit 1102 (S602) onward are repeated.

On the other hand, when the comparison unit 1102 has read the color distributions of the accumulation target block areas constituting all the accumulation target moving areas in the DB 402, the operations from the reception of a search instruction by the color information generation unit 1101 (S1201) onward are repeated. In addition, the search result list generated by the list display unit 112 is displayed on the display unit 118, so that the user can see the search results.
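The loop over the DB 402 (S602, S1205, S605, S606) might be sketched as follows. The record layout (`camera_id`, `timestamp`, `block_colors`) is hypothetical, and the rule that a moving area matches when any representative color of any of its blocks lies within the threshold is an assumption; the description does not state how the per-block results are combined.

```python
from math import hypot

def search_by_color(query_hs, records, max_distance):
    """Return result entries for records whose colors are close to the query.

    query_hs: (hue, saturation) of the color chosen by the user.
    records: iterable of dicts, each with "camera_id", "timestamp" and
             "block_colors" (one list of (hue, saturation) points per block).
    """
    results = []
    for record in records:  # S602 / S606: visit every stored moving area
        distances = [
            hypot(query_hs[0] - h, query_hs[1] - s)
            for block in record["block_colors"]
            for (h, s) in block
        ]
        # S1205: keep the record if its closest color is within the threshold
        if distances and min(distances) <= max_distance:
            results.append({  # S605: append to the search result list
                "camera_id": record["camera_id"],
                "timestamp": record["timestamp"],
                "distance": min(distances),
            })
    return results
```

Records with no representative colors are skipped rather than treated as matches.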

Although FIG. 13 has been described for the case where the user designates, as the search target, a color from a color palette or from preset color information, the search target color information may instead be generated as follows: when the color distribution of each accumulation target block area constituting an accumulation target area is accumulated in the DB 402, the color distributions are also summed separately, and a preset fixed number of colors is selected in descending order of frequency of occurrence. The user selects one item of color information from the search target color information generated in this way and gives the search instruction.
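The alternative described above, in which candidate search colors are derived from the accumulated data, could be sketched as follows. The function name and the representation of each color distribution as a dictionary of bin counts are assumptions made for illustration.

```python
from collections import Counter

def candidate_query_colors(block_histograms, top_n):
    """Sum all stored histograms and return the top_n most frequent bins.

    block_histograms: iterable of dicts mapping (h_bin, s_bin) -> count.
    """
    total = Counter()
    for hist in block_histograms:
        total.update(hist)  # adds the counts bin by bin
    return [bin_ for bin_, _count in total.most_common(top_n)]
```

The returned bins would then be shown to the user in place of a generic color palette.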

As described above, in the video search apparatus 1100, when a moving area in the video captured by the video input unit 101 satisfies the condition of being a person, the moving area is divided into a plurality of accumulation target block areas, and the color distribution of each accumulation target block area is generated and accumulated in the DB 402. At the time of video search, when the user designates one item of search target color information, data related to an accumulation target area whose accumulation target block areas have a representative color similar to the search target color information is presented to the user as a search result. That is, even when no image of the search target is available, the user only needs to designate one point on the color information image presented by the search apparatus, and video search is performed based not only on motion in the captured video but also on the color of the moving area. Video search can therefore be performed more appropriately than when it is based only on the presence or absence of motion, or when video of a specific part of the object is required for the search.

Next, FIG. 7 shows a video search apparatus according to a fourth embodiment of the present invention. Components that are the same as in the first embodiment are given the same reference numerals as in the video search apparatus 100, and their description is omitted.

In FIG. 7, the video search apparatus 700 according to this embodiment of the present invention has the following components. Person motion scene determination units 701a to 701n (hereinafter collectively referred to as the "person motion scene determination unit 701" as appropriate) each receive the accumulation target moving area video signal for each moving object from the corresponding moving area extraction unit 102 and determine whether the accumulation target moving area satisfies a predetermined condition of being a person; if it does, the unit outputs the accumulation target moving area video signal for each moving object. The unit also determines whether successively input accumulation target moving areas satisfy a predetermined condition of being the same person; if they do, it tracks that person, and it outputs a motion scene end signal when the tracked person is no longer present in the video. Color information generation units 702a to 702n (hereinafter collectively referred to as the "color information generation unit 702" as appropriate) each receive the accumulation target block area video signals from the corresponding area dividing unit 104, extract the color feature amount of each accumulation target block area constituting the accumulation target moving area, and store it for each person; when a motion scene end signal is input from the corresponding person motion scene determination unit 701, the unit generates and outputs a color distribution for each person. Search area setting units 703a to 703n (hereinafter collectively referred to as the "search area setting unit 703" as appropriate) each receive the accumulation target moving area video signal from the corresponding person motion scene determination unit 701 and determine whether a predetermined search target area setting condition is satisfied; if it is, the unit stores the search target area video signal for each person, and it outputs the search target area video signal when a motion scene end signal is input from the corresponding person motion scene determination unit 701. A database (DB) 704 accumulates the color distribution for each person from each color information generation unit 702 and the search target area video signal from each search area setting unit 703. A user instruction input unit 705 allows the user to input instructions. A person representative image list display unit 706, in response to a search execution request from the user, reads search target area video signals from the DB 704 as person representative images, and generates and outputs a person representative image list. A search target selection unit 707 reads from the DB 704, and outputs, the search target area video signal corresponding to the person representative image selected by the user. A color information generation unit 708 receives the search target area video signal from the search target selection unit 707, extracts the color feature amount of the search target area, and generates and outputs a color distribution. A comparison unit 709 compares the color distribution of the search target area from the color information generation unit 708 with the color distribution of each accumulation target block area constituting an accumulation target moving area in the DB 704, and outputs a search result according to the comparison result.

The operation of the video search apparatus 700 configured as described above will be described with reference to FIGS. 8 to 10. Processing steps that are the same as in the first and second embodiments are given the same reference numerals, and their description is simplified. First, the operation at the time of accumulating color information will be described with reference to FIGS. 8 and 9.

As shown in FIG. 8, the video input unit 101 captures a subject and generates a video signal (S101). The video input unit 101 then outputs the generated video signal to the corresponding moving area extraction unit 102. On receiving the video signal from the corresponding video input unit 101, the moving area extraction unit 102 performs background difference processing, which calculates the difference between the video corresponding to that video signal and a background video held in advance (S102).

Next, the moving area extraction unit 102 determines whether there is motion in the video corresponding to the input video signal (S103). If there is no motion in that video, the moving area extraction unit 102 notifies the person motion scene determination unit 701 that there is no motion.
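The background difference processing (S102) and the motion check (S103) might look like the following minimal sketch. It assumes grayscale frames given as nested lists of intensities and a simple rule that a frame contains motion when enough pixels differ from the background; the thresholds and function names are hypothetical.

```python
def background_difference(frame, background, threshold):
    """Return a boolean mask marking pixels that differ from the background."""
    return [
        [abs(p - q) > threshold for p, q in zip(frame_row, bg_row)]
        for frame_row, bg_row in zip(frame, background)
    ]

def has_motion(mask, min_pixels):
    """Treat the frame as containing motion if enough pixels have changed."""
    changed = sum(sum(row) for row in mask)
    return changed >= min_pixels

background = [[10, 10, 10], [10, 10, 10]]
frame = [[10, 200, 10], [10, 210, 12]]
mask = background_difference(frame, background, threshold=30)
```

In this example the middle column of `mask` is marked as changed, so `has_motion(mask, min_pixels=2)` holds.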

When a signal notifying that there is no motion is input from the corresponding moving area extraction unit 102, the person motion scene determination unit 701 determines whether the tracking mode is on (S808).

If it is determined that the tracking mode is not on, the operations from the capture of the subject and generation of the video signal by the video input unit 101 (S101) onward are repeated.

On the other hand, if it is determined that the tracking mode is on, the person motion scene determination unit 701 outputs a motion scene end signal to the corresponding color information generation unit 702 and the corresponding search area setting unit 703.

The DB 704 accumulates the color distribution corresponding to each accumulation target block area for each person, as output by the color information generation unit 702 and the search area setting unit 703, and additionally accumulates the search target area in association with the corresponding search target area video (S809). The tracking mode is then turned off.

On the other hand, if there is motion in the video corresponding to the video signal input to the moving area extraction unit 102, the moving area extraction unit 102 extracts all the moving areas (accumulation target moving areas) in that video (S104). The moving area extraction unit 102 then outputs the accumulation target moving area video signals corresponding to all the extracted accumulation target moving areas to the corresponding person motion scene determination unit 701.

When all the accumulation target moving area video signals from the corresponding moving area extraction unit 102 have been input, the person motion scene determination unit 701 performs elliptical Hough processing on the video of the accumulation target moving area corresponding to each accumulation target moving area video signal (S105). Next, it determines whether the accumulation target moving areas corresponding to all the input accumulation target moving area video signals satisfy the condition of being a person (S106).

As shown in FIG. 9, if the accumulation target moving area corresponding to the accumulation target moving area video signal does not satisfy the condition of being a person, it is determined whether processing has finished for all the accumulation target moving areas (S810). If processing has finished for all the accumulation target moving areas, the operations from the capture of the subject and generation of the video signal by the video input unit 101 (S101) onward are repeated. If processing has not finished for all the accumulation target moving areas, the operations from the elliptical Hough processing by the person motion scene determination unit 701 (S105) onward are repeated.

On the other hand, if the accumulation target moving area corresponding to the accumulation target moving area video signal satisfies the condition of being a person, the person motion scene determination unit 701 determines whether the person in the accumulation target moving area corresponding to the successively input accumulation target moving area video signal satisfies the condition of being the same person as the person in the accumulation target moving area being tracked (S801). Specifically, using the stored position and movement information of the accumulation target area, it determines that the person is the same if the movement distance of the accumulation target moving area is within a preset distance that a person can move and the direction of successive movements has not changed markedly.
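The same-person test in S801 could be sketched as below. It assumes positions are (x, y) centroids, that the previous movement direction is kept as an angle in radians, and that "not changed markedly" means the turn angle stays under a limit; `max_move` and `max_turn` are hypothetical parameters standing in for the preset values.

```python
from math import atan2, hypot, pi

def is_same_person(prev_pos, prev_dir, new_pos, max_move, max_turn):
    """Decide whether a new moving area continues an existing track.

    prev_dir is the previous movement direction in radians, or None when
    the track has no movement history yet.
    """
    dx = new_pos[0] - prev_pos[0]
    dy = new_pos[1] - prev_pos[1]
    dist = hypot(dx, dy)
    if dist > max_move:  # moved farther than a person plausibly could
        return False
    if prev_dir is None or dist == 0:
        return True      # no direction available to compare
    new_dir = atan2(dy, dx)
    # smallest absolute angle between the two directions, in [0, pi]
    turn = abs((new_dir - prev_dir + pi) % (2 * pi) - pi)
    return turn <= max_turn
```

A track that fails this test would be handled as a new person, as described in the next step.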

If it is determined that the condition of being the same person is not satisfied, tracking is started for a new tracked person (S802). Specifically, an ID for identifying the person is created, and the centroid coordinates of the input accumulation target moving area are stored as position information in the video. Other information may be used in place of the centroid coordinates as long as it can represent the position of the area in the video. The tracking mode is also turned on (S8021).

On the other hand, if it is determined that the condition of being the same person is satisfied, the person motion scene determination unit 701 outputs the input accumulation target moving area video signal to the corresponding area dividing unit 104 and the corresponding search area setting unit 703, and stores, as person tracking information, the movement distance from the previous frame, the movement direction, the position information, and the like, derived from the stored position information of the accumulation target moving area (S803).

The area dividing unit 104 receives all the accumulation target moving area video signals from the corresponding person motion scene determination unit 701 and divides the accumulation target moving area corresponding to each accumulation target moving area video signal into a plurality of block areas (accumulation target block areas) (S107). The area dividing unit 104 then outputs the video signal corresponding to each accumulation target block area constituting the accumulation target moving area (accumulation target block area video signal) to the corresponding color information generation unit 702.

When the accumulation target block area video signals corresponding to the accumulation target block areas constituting all the accumulation target moving areas are input from the corresponding area dividing unit 104, the color information generation unit 702 extracts, on the basis of these signals, the color information of each accumulation target block area constituting the accumulation target moving area, and adds it to the color information stored for the corresponding accumulation target block area of the same person (S804). Specifically, the color information generation unit 702 acquires, as color information, the RGB value of each pixel of the accumulation target block area, converts it to HSV space, and creates a histogram using H and S (hereinafter called the color distribution); then, for each person being tracked, it accumulates the color distributions corresponding to the accumulation target block areas constituting the accumulation target moving area, block area by block area. Although the color space is converted from RGB to HSV here, other color spaces (for example, XYZ or YCrCb) can also be used in the usual way (generating the histogram on the XY plane or the CrCb plane).
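The per-person, per-block accumulation in S804 can be illustrated with the following sketch. The bin counts (16 by 16), the class name `PersonColorStore`, and the representation of a block as a list of 8-bit RGB tuples are assumptions made for the example only.

```python
import colorsys
from collections import Counter, defaultdict

H_BINS, S_BINS = 16, 16  # hypothetical histogram resolution

def hs_histogram(pixels):
    """Build an H-S histogram from an iterable of 8-bit (r, g, b) pixels."""
    hist = Counter()
    for r, g, b in pixels:
        h, s, _v = colorsys.rgb_to_hsv(r / 255.0, g / 255.0, b / 255.0)
        h_bin = min(int(h * H_BINS), H_BINS - 1)
        s_bin = min(int(s * S_BINS), S_BINS - 1)
        hist[(h_bin, s_bin)] += 1
    return hist

class PersonColorStore:
    """Accumulates one H-S histogram per (person, block index)."""

    def __init__(self):
        self._hists = defaultdict(Counter)

    def add_frame(self, person_id, blocks):
        """blocks: list of pixel lists, one per block of the moving area."""
        for index, pixels in enumerate(blocks):
            self._hists[(person_id, index)].update(hs_histogram(pixels))

    def histogram(self, person_id, block_index):
        return self._hists[(person_id, block_index)]
```

Calling `add_frame` once per frame while a person is tracked sums that person's histograms block by block, which is the integration step described above.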

Finally, the color information generation unit 702 stores, for each tracked person, the color distribution corresponding to each accumulation target block (S807). When a motion scene end signal is input from the corresponding person motion scene determination unit 701, the color information generation unit 702 normalizes the stored color distribution corresponding to each accumulation target block area for each tracked person and outputs it to the DB 704.
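The description does not say how the color distribution is normalized. One common choice, shown here purely as an assumption, is to divide each bin by the total count so that histograms accumulated over different numbers of frames become comparable.

```python
def normalize(hist):
    """Scale a histogram (bin -> count) so that its values sum to 1."""
    total = sum(hist.values())
    if total == 0:
        return {}  # nothing was accumulated for this block
    return {bin_: count / total for bin_, count in hist.items()}
```

With this choice, a person tracked for many frames and a person tracked for only a few yield distributions on the same scale.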

When all the accumulation target moving area video signals are input from the corresponding person motion scene determination unit 701, the search area setting unit 703 determines, according to preset rules, whether a search target area can be set (S805). Specifically, it uses the position and size of the accumulation target moving area corresponding to the input accumulation target moving area video signal and determines, against preset thresholds, whether a search target area can be set.
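The preset rules for S805 are not given in the description. The sketch below assumes one plausible form: the moving area must be at least a minimum size and must lie inside the frame with some margin, so that the person is neither too small nor cut off at the edge. All parameter names are hypothetical.

```python
def can_set_search_area(box, frame_size, min_size, margin):
    """Check position and size thresholds for a moving area.

    box: (x, y, width, height) of the moving area in pixels.
    frame_size: (width, height) of the video frame.
    min_size: (min_width, min_height) required for a usable area.
    margin: minimum distance the box must keep from the frame edges.
    """
    x, y, w, h = box
    frame_w, frame_h = frame_size
    large_enough = w >= min_size[0] and h >= min_size[1]
    inside = (x >= margin and y >= margin
              and x + w <= frame_w - margin
              and y + h <= frame_h - margin)
    return large_enough and inside
```

Other rules, such as requiring the area to be near the image center, would fit the same description equally well.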

If it determines that a search target area cannot be set, the search area setting unit 703 ends the processing; if it determines that a search target area can be set, it sets the search target area within the accumulation target moving area (S806). Finally, it stores, for each tracked person, the search target area and the corresponding search target area video signal (S807). At this point, the already stored search target area and its video signal may be compared with the new search target area and its video signal, and the one closer to a preset condition kept (for example, the one whose average luminance and saturation over the whole search target area video signal are closer to preset values); in this way the best search target area and corresponding video signal are always retained.
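The rule for keeping the better representative image could be sketched as follows. The sketch uses the HSV value channel as a stand-in for luminance and sums the two absolute deviations from the targets; both choices, as well as the default target values, are assumptions and not taken from the description. Pixel lists are assumed to be non-empty.

```python
import colorsys

def mean_saturation_value(pixels):
    """Average HSV saturation and value of a non-empty list of RGB pixels."""
    sat_sum = val_sum = 0.0
    for r, g, b in pixels:
        _h, s, v = colorsys.rgb_to_hsv(r / 255.0, g / 255.0, b / 255.0)
        sat_sum += s
        val_sum += v
    return sat_sum / len(pixels), val_sum / len(pixels)

def quality_gap(pixels, target_saturation, target_value):
    """Distance of an image's average saturation and brightness from the targets."""
    s, v = mean_saturation_value(pixels)
    return abs(s - target_saturation) + abs(v - target_value)

def keep_better(stored, candidate, target_saturation=0.5, target_value=0.6):
    """Return whichever image is closer to the preset targets."""
    if stored is None:
        return candidate
    if (quality_gap(candidate, target_saturation, target_value)
            < quality_gap(stored, target_saturation, target_value)):
        return candidate
    return stored
```

Applying `keep_better` each time a new search target area is set leaves only the best image seen so far for each tracked person.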

By repeating the processing of S101 to S106 and S801 to S810 described above, the DB 704 accumulates, for each of a plurality of accumulation target moving areas, the color distribution of each accumulation target block area constituting that accumulation target moving area, together with the search target area and the corresponding search target area video.

Next, the operation at the time of video search will be described with reference to FIG. 10.
When the user makes a search request using the user instruction input unit 705, the person representative image list display unit 706 accepts the request (S901). The person representative image list display unit 706 acquires the search target area video signals accumulated in the DB 704 as person representative images, creates a list (person representative image list), and outputs it to the display unit 118, which displays the person representative image list. The user views the person representative image list displayed on the display unit 118 and inputs, from the user instruction input unit 705, an instruction selecting one of the person representative images.

When an instruction to select a person representative image is input from the user instruction input unit 705, the search target selection unit 707 acquires from the DB 704 the search target area and the search target area video signal corresponding to the person representative image selected by the user, and outputs them (S902).

When the search target area video signal is input from the search target selection unit 707, the color information generation unit 708 extracts color information of the search target area (S903). Specifically, the color information generation unit 702 acquires, as color information, the RGB value of each pixel in the accumulation target block area, converts it into HSV space, and creates a histogram (color distribution) using H and S. Further, for each same person being tracked, the histograms corresponding to the plurality of accumulation target block areas constituting the accumulation target area are accumulated for each accumulation target block area. Although the color space is converted from RGB to HSV here, the values may instead be converted into another color space (for example, XYZ or YCrCb) and the histogram generated on the corresponding plane (the XY plane or the CrCb plane). The color information generation unit 708 then outputs the color distribution to the comparison unit 709.
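As a rough illustration of the color distribution described above, the following sketch converts RGB pixels to HSV and counts them in a two-dimensional histogram over H and S. The bin counts (16 x 8), the function names, and the representation of pixels as 8-bit RGB tuples are assumptions; the text does not specify them.

```python
import colorsys


def hs_histogram(pixels, h_bins=16, s_bins=8):
    """Build a 2-D hue/saturation histogram from 8-bit RGB pixels.

    The V component is discarded, which makes the histogram less
    sensitive to brightness changes. Bin counts are assumed values.
    """
    hist = [[0] * s_bins for _ in range(h_bins)]
    for r, g, b in pixels:
        h, s, _v = colorsys.rgb_to_hsv(r / 255.0, g / 255.0, b / 255.0)
        # h and s are in [0, 1]; clamp so that 1.0 falls in the last bin.
        h_index = min(int(h * h_bins), h_bins - 1)
        s_index = min(int(s * s_bins), s_bins - 1)
        hist[h_index][s_index] += 1
    return hist


def accumulate(total, hist):
    """Add hist into total in place (same shape), e.g. across frames."""
    for total_row, row in zip(total, hist):
        for i, count in enumerate(row):
            total_row[i] += count
    return total
```

`accumulate` corresponds to summing the histograms of one block area over the frames in which the same person is tracked.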

When the color distribution of the search target area is input from the color information generation unit 708, the comparison unit 709 reads from the DB 704 the color distribution of each accumulation target block area constituting one of the accumulation target moving areas (S602).

Further, the comparison unit 709 compares the color distribution of each accumulation target block area constituting the read accumulation target moving area with the input color distribution of the search target area, and calculates the degree of coincidence (S904).

Since the color distribution is a two-dimensional histogram over H and S, the degree of coincidence may be calculated using, for example, histogram intersection. When the color distributions match completely, the degree of coincidence equals the sum of all frequencies in the color distribution; when they do not match at all, the degree of coincidence is 0.
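Histogram intersection is the sum of the bin-wise minima of two histograms. A minimal sketch for two-dimensional histograms held as nested lists is given below; the function name and the list representation are choices made for this example only.

```python
def histogram_intersection(hist_a, hist_b):
    """Sum of bin-wise minima of two 2-D histograms with the same shape."""
    if len(hist_a) != len(hist_b):
        raise ValueError("histograms must have the same number of rows")
    score = 0
    for row_a, row_b in zip(hist_a, hist_b):
        if len(row_a) != len(row_b):
            raise ValueError("histogram rows must have the same length")
        score += sum(min(a, b) for a, b in zip(row_a, row_b))
    return score
```

Comparing a histogram with itself returns the sum of all its frequencies, and comparing two histograms that share no occupied bin returns 0, which matches the two limiting cases stated in the text.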

The degree of coincidence (partial coincidence) between the color distribution of each accumulation target block area constituting the accumulation target moving area and the color distribution of the search target area is obtained, and all the partial coincidences are summed to give the degree of coincidence. Although all partial coincidences are summed here, other approaches are conceivable depending on the purpose: the accumulation target block areas to be compared may be selected, partial coincidences calculated only for them, and only those summed; or all partial coincidences may be calculated and the highest one taken as the degree of coincidence.
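The three ways of combining partial coincidences mentioned above (sum over all blocks, sum over selected blocks, maximum) can be sketched as follows. For brevity the histograms are flattened to one-dimensional lists, and the names `match_score`, `mode`, and `selected` are hypothetical.

```python
def partial_match(block_hist, query_hist):
    """Histogram intersection of two flattened histograms of equal length."""
    return sum(min(a, b) for a, b in zip(block_hist, query_hist))


def match_score(block_hists, query_hist, mode="sum", selected=None):
    """Combine the partial coincidences of the block areas into one score.

    mode="sum" adds the partial coincidences, mode="max" takes the highest.
    selected optionally restricts the comparison to some block indices.
    """
    indices = range(len(block_hists)) if selected is None else selected
    partials = [partial_match(block_hists[i], query_hist) for i in indices]
    if mode == "sum":
        return sum(partials)
    if mode == "max":
        return max(partials, default=0)
    raise ValueError("mode must be 'sum' or 'max'")
```

For example, with three block histograms whose partial coincidences against the query are 1, 1 and 2, `mode="sum"` gives 4, `mode="max"` gives 2, and restricting the sum to the first and third blocks gives 3.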

Next, the comparison unit 709 determines whether or not the calculated degree of coincidence between the color distribution of each accumulation target block area constituting the accumulation target moving area and the color distribution of the search target area is within a predetermined threshold (S604). It is desirable that the user be able to set this threshold freely.

If the degree of coincidence between the color distribution of each accumulation target block area constituting the accumulation target moving area and the color distribution of the search target area is within the predetermined threshold, the comparison unit 709 outputs data related to the accumulation target moving area having those accumulation target block areas to the list display unit 112 as a search result. Such data includes, for example, the identification information (ID) of the camera 102 that captured the video containing the accumulation target moving area, the capture date and time, and thumbnail video data obtained by reducing the video of the accumulation target moving area. The list display unit 112 adds the data received from the comparison unit 709 to the search result list (S605).

After the list display unit 112 adds to the search result list (S605), or after the comparison unit 709 determines that the degree of coincidence between the color distribution of each accumulation target block area constituting the accumulation target moving area and the color distribution of the search target area exceeds the predetermined threshold (negative determination in S604), the comparison unit 709 determines whether or not it has read the color distributions of the accumulation target block areas constituting all the accumulation target moving areas in the DB 704 (S606). If the DB 704 still contains color distributions for an accumulation target moving area that has not yet been read, the comparison unit 709 reads the color distribution of each accumulation target block area constituting that unread accumulation target moving area (S602), and the subsequent operations are repeated.
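The loop over S602, S904, S604, S605 and S606 can be summarized in a short sketch. Several points are assumptions: histograms are flattened lists, each stored record is a dictionary with the hypothetical keys `camera_id`, `captured_at` and `block_hists`, and a record is accepted when its score is at least `min_score`. The text says only that the degree of coincidence must be "within a predetermined threshold", so the direction of the comparison used here is an interpretation based on histogram intersection growing with similarity.

```python
def histogram_intersection(a, b):
    """Sum of element-wise minima of two equally sized flat histograms."""
    return sum(min(x, y) for x, y in zip(a, b))


def search(records, query_hist, min_score):
    """Scan every stored moving area and collect those that match.

    records: list of dicts with hypothetical keys "camera_id",
    "captured_at" and "block_hists" (one flat histogram per block area).
    """
    results = []
    for record in records:              # S602: read the next stored moving area
        score = sum(                    # S904: sum of partial coincidences
            histogram_intersection(block, query_hist)
            for block in record["block_hists"]
        )
        if score >= min_score:          # S604: comparison direction is assumed
            results.append({            # S605: add to the search result list
                "camera_id": record["camera_id"],
                "captured_at": record["captured_at"],
                "score": score,
            })
    return results                      # S606: every stored area has been read
```

Passing `min_score` as a parameter reflects the statement that the threshold should be freely adjustable by the user.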

On the other hand, when the comparison unit 709 has read the color distributions of the accumulation target block areas constituting all the accumulation target moving areas in the DB 704, the operations from the acceptance of a search instruction by the person representative image list display unit 706 (S901) onward are repeated. In addition, the search result list generated by the list display unit 112 is displayed on the display unit 118, so that the user can view the search results.

As described above, in the video search apparatus 700, when a moving area in the video captured by the video input unit 101 satisfies the condition for being a person, the moving area is divided into a plurality of accumulation target block areas and the color information of each accumulation target block area is extracted. When the moving areas are further determined to belong to the same person, the color distribution of the accumulation target block areas constituting each moving area in that person's series of motion scenes is generated, and at the same time the best search target area from that series of motion scenes and the corresponding search target area video signal are accumulated in the DB 704. At the time of video search, when the user designates the person image to be searched from the person representative image list displayed on the display unit 118, the color distribution of that person's search target area is generated automatically, and data related to accumulation target areas whose accumulation target block areas are similar to the color distribution of the search target area are presented to the user as search results.
In other words, not only can the search be performed based on motion in the captured video and the color of the moving area, but the search also uses a single search target area set within a person's series of motion scenes. Video search can therefore be performed more appropriately than when it is based only on the presence or absence of motion, or when video of a specific part of the target object is required. Furthermore, the user can designate a desired person image as the search target area and run a search without having to look for the desired search target area while playing back video when setting the search area, and without having to watch video of the same person repeatedly to select a search target area. The search results are also always stable, rather than differing each time a search is executed.

In the above description, at the time of video search the search target selection unit 707 acquires from the DB 704 the search target area and the search target area video signal corresponding to the person representative image designated by the user, and the color distribution is then generated. However, the same result can be obtained if, at the time of video recording, the search area setting unit 703 carries the processing through to generation of the color distribution of the search target area, and the color distribution and the search target area video signal are accumulated in the DB 704 in association with each other.

In the first to fourth embodiments, the following configuration, shown in FIG. 11, is also possible. The area division unit 104 treats the accumulation target area output by the person motion scene determination unit 701 as the person area (a). The person motion scene determination unit 701 outputs the ellipse position and size obtained by elliptical Hough processing, and based on this ellipse position and size the area division unit 104 divides the area as shown in (b) and outputs the accumulation target block areas and accumulation target block area video signals. The color information generation unit 702 then generates and outputs a color distribution for each accumulation target block area. Similarly, the search area setting unit 703 divides the search target area as shown in FIG. 11 and outputs search target block area video signals corresponding to the search target area, and the DB 704 stores, for one motion scene, four color distributions in association with the search target area and the corresponding search target block area video signals. The comparison unit 709 compares the color distribution of each accumulation target block area with that of the corresponding search target block area. If, when selecting a person representative image as the search target on the monitor, the user also specifies a divided area such as the head, upper body, lower body, or shoes, it becomes possible to search by the color of part of a person's clothing, or by a combination of the clothing colors of a plurality of persons.

In the fourth embodiment, the size of the accumulation target block may be fixed while the tracking mode is ON.

As described above, the video search apparatus and video search method according to the present invention can perform video search appropriately and make it easy to designate a search target for efficiently searching for video of persons with similar clothing colors. They are useful as a video search apparatus and video search method for video surveillance applications such as personal identification.

Block diagram of the video search apparatus in the first embodiment of the present invention
Flowchart of the operation during representative color accumulation of the video search apparatus in the first embodiment of the present invention
Flowchart of the operation during video search of the video search apparatus in the first embodiment of the present invention
Block diagram of the video search apparatus in the second embodiment of the present invention
Flowchart of the operation during representative color accumulation of the video search apparatus in the second embodiment of the present invention
Flowchart of the operation during video search of the video search apparatus in the second embodiment of the present invention
Block diagram of the video search apparatus in the fourth embodiment of the present invention
Flowchart of the operation during representative color accumulation of the video search apparatus in the fourth embodiment of the present invention
Flowchart of the operation during representative color accumulation of the video search apparatus in the fourth embodiment of the present invention
Flowchart of the operation during video search of the video search apparatus in the fourth embodiment of the present invention
(a) Schematic diagram showing a person area in the fourth embodiment of the present invention; (b) schematic diagram showing a person area divided into color information extraction units in the fourth embodiment of the present invention
Block diagram of the video search apparatus in the third embodiment of the present invention
Flowchart of the operation during video search of the video search apparatus in the third embodiment of the present invention
Block diagram of a conventional monitoring system
Block diagram of a conventional personal identification device

Explanation of symbols

100, 400, 700, 1100 Video search apparatus
101a to 101n Video input unit
102a to 102n Moving area extraction unit
103a to 103n Person determination unit
104a to 104n, 109 Area division unit
105a to 105n, 110 Representative color calculation unit
106, 402, 704 DB
107 Keyboard
108 Search area designation unit
111, 404, 709, 1102 Comparison unit
112 List display unit
113 Video selection unit
114 Compression unit
115 Storage
116 Video display instruction unit
117 Decompression unit
118 Display unit
401a to 401n, 403, 702a to 702n, 708, 1101 Color information generation unit
701a to 701n Person motion scene determination unit
703a to 703n Search area setting unit
705 User instruction input unit
706 Person representative image list display unit
707 Search target selection unit

Claims

A video search apparatus comprising:
accumulation target moving area extracting means for extracting an accumulation target moving area in a first video and outputting an accumulation target moving area video signal corresponding to the accumulation target moving area;
accumulation target block area dividing means for dividing the accumulation target moving area into accumulation target block areas based on the accumulation target moving area video signal, and outputting accumulation target block area video signals corresponding to the accumulation target block areas constituting the accumulation target moving area;
accumulation target block area representative color deriving means for deriving and outputting a representative color of each of the accumulation target block areas constituting the accumulation target moving area, based on the accumulation target block area video signal corresponding to each of the accumulation target block areas;
accumulation target block area representative color accumulation means for accumulating the representative color of each of the accumulation target block areas;
search target area extracting means for extracting a search target area in a second video and outputting a search target area video signal corresponding to the search target area;
search target block area dividing means for dividing the search target area into search target block areas based on the search target area video signal, and outputting search target block area video signals corresponding to the search target block areas constituting the search target area;
search target block area representative color deriving means for deriving and outputting a representative color of each of the search target block areas constituting the search target area, based on the search target block area video signal corresponding to each of the search target block areas; and
comparison means for comparing the representative colors of the accumulation target block areas with the representative colors of the search target block areas and performing output according to the comparison result.

The video search apparatus according to claim 1, wherein the comparison means derives, as the comparison result, a difference between each representative color of the accumulation target block areas and each representative color of the search target block areas.

The video search apparatus according to claim 1, further comprising person determination means for determining, based on the accumulation target moving area video signal, whether or not the accumulation target moving area satisfies a predetermined condition defined as a condition for being a person, and outputting the accumulation target moving area video signal to the accumulation target block area dividing means when the accumulation target moving area satisfies the predetermined condition.

The video search apparatus according to claim 1, wherein the accumulation target block area representative color deriving means derives, as the representative color of the accumulation target block area, either a color corresponding to the average of the values obtained by applying, to the colors appearing in the accumulation target block area, a conversion method that reduces the influence of luminance changes, or the color whose obtained value appears most frequently.

The video search apparatus according to claim 1, wherein the search target block area representative color deriving means derives, as the representative color of the search target block area, either a color corresponding to the average of the values obtained by applying, to the colors appearing in the search target block area, a conversion method that reduces the influence of luminance changes, or the color whose obtained value appears most frequently.

A video search method for searching for a moving object in a first video based on information in a second video, the method comprising:
an accumulation target moving area extracting step of extracting an accumulation target moving area in the first video and outputting an accumulation target moving area video signal corresponding to the accumulation target moving area;
an accumulation target block area dividing step of dividing the accumulation target moving area into accumulation target block areas based on the accumulation target moving area video signal, and outputting accumulation target block area video signals corresponding to the accumulation target block areas constituting the accumulation target moving area;
an accumulation target block area representative color deriving step of deriving and outputting a representative color of each of the accumulation target block areas constituting the accumulation target moving area, based on the accumulation target block area video signal corresponding to each of the accumulation target block areas;
an accumulation target block area representative color accumulation control step of accumulating the representative color of each of the accumulation target block areas in an accumulation target block area accumulation unit;
a search target area extracting step of extracting a search target area in the second video and outputting a search target area video signal corresponding to the search target area;
a search target block area dividing step of dividing the search target area into search target block areas based on the search target area video signal, and outputting search target block area video signals corresponding to the search target block areas constituting the search target area;
a search target block area representative color deriving step of deriving and outputting a representative color of each of the search target block areas constituting the search target area, based on the search target block area video signal corresponding to each of the search target block areas; and
a comparison step of comparing the representative colors of the accumulation target block areas with the representative colors of the search target block areas and performing output according to the comparison result.

The video search method according to claim 6, wherein the comparison step derives, as the comparison result, a difference between each representative color of the accumulation target block areas and each representative color of the search target block areas.

The video search method according to claim 6, further comprising a person determination step of determining, based on the accumulation target moving area video signal, whether or not the accumulation target moving area satisfies a predetermined condition defined as a condition for being a person, and outputting the accumulation target moving area video signal when the accumulation target moving area satisfies the predetermined condition,
wherein, when the accumulation target moving area video signal is output in the person determination step, the accumulation target block area dividing step divides the accumulation target moving area into accumulation target block areas based on the accumulation target moving area video signal and outputs the accumulation target block area video signals corresponding to the accumulation target block areas constituting the accumulation target moving area.

A video search apparatus comprising:
accumulation target moving area extracting means for extracting an accumulation target moving area in a first video and outputting an accumulation target moving area video signal corresponding to the accumulation target moving area;
accumulation target block area dividing means for dividing the accumulation target moving area into accumulation target block areas based on the accumulation target moving area video signal, and outputting accumulation target block area video signals corresponding to the accumulation target block areas constituting the accumulation target moving area;
accumulation target block area color information generating means for extracting and outputting a color distribution from the video signals included in the accumulation target block areas constituting the accumulation target moving area, based on the accumulation target block area video signal corresponding to each of the accumulation target block areas;
accumulation target block area color information accumulation means for accumulating the color information of each of the accumulation target block areas;
search target area extracting means for extracting a search target area in a second video and outputting a search target area video signal corresponding to the search target area;
search target block area dividing means for dividing the search target area into search target block areas based on the search target area video signal, and outputting search target block area video signals corresponding to the search target block areas constituting the search target area;
search target block area color information generating means for extracting and outputting a color distribution from the video signals included in each of the search target block areas constituting the search target area, based on the search target block area video signal corresponding to each of the search target block areas; and
comparison means for comparing each color distribution of the accumulation target block areas with the color distributions of the search target block areas and performing output according to the comparison result.

The video search apparatus according to claim 9, wherein the comparison means derives, as the comparison result, a degree of coincidence of color appearance frequencies between each color distribution of the accumulation target block areas and each color distribution of the search target block areas.

The video search apparatus according to claim 9, wherein the comparison means derives, as the comparison result, a degree of coincidence of color appearance frequencies between each color distribution of the accumulation target block areas and one of the color distributions of the search target block areas.

The video search apparatus according to claim 9, further comprising person determination means for determining, based on the accumulation target moving area video signal, whether or not the accumulation target moving area satisfies a predetermined condition defined as a condition for being a person, and outputting the accumulation target moving area video signal to the accumulation target block area dividing means when the accumulation target moving area satisfies the predetermined condition.

A video search apparatus comprising:
accumulation target moving area extracting means for extracting an accumulation target moving area in a first video and outputting an accumulation target moving area video signal corresponding to the accumulation target moving area;
accumulation target block area dividing means for dividing the accumulation target moving area into accumulation target block areas based on the accumulation target moving area video signal, and outputting accumulation target block area video signals corresponding to the accumulation target block areas constituting the accumulation target moving area;
accumulation target block area color information generating means for extracting and outputting a color distribution from the video signals included in the accumulation target block areas constituting the accumulation target moving area, based on the accumulation target block area video signal corresponding to each of the accumulation target block areas;
accumulation target block area color information accumulation means for accumulating the color information of each of the accumulation target block areas;
search target color information generating means for extracting one search target point in a second video, and extracting and outputting color information of the search target point based on a search target area video signal corresponding to the search target point; and
comparison means for generating representative color information from each color distribution of the accumulation target block areas, comparing the representative color information with the search target color information, and performing output according to the comparison result.

The video search apparatus according to claim 13, wherein the comparison means derives, as the comparison result, the similarity in color appearance frequency between the representative color information generated from each color distribution of the accumulation target block areas and the color information of the search target point.

An accumulation target moving area extracting means for extracting an accumulation target moving area in a video and outputting an accumulation target moving area video signal corresponding to the accumulation target moving area;
Person motion scene determination means for identifying the same person across consecutive frame images in the video based on the accumulation target moving area video signal, outputting the accumulation target moving area video signal, and outputting a motion scene end signal when there is no further movement;
Accumulation target block area dividing means for dividing the accumulation target moving area into accumulation target block areas based on the accumulation target moving area video signal, and outputting accumulation target block area video signals corresponding to the accumulation target block areas constituting the accumulation target moving area;
Accumulation target block area color information generating means for extracting, for each same person, a color feature amount from the video signal included in each of the accumulation target block areas constituting the accumulation target moving area, based on the accumulation target block area video signal corresponding to each of the accumulation target block areas, and generating, in response to the motion scene end signal, a color distribution corresponding to each of the accumulation target block areas for each same person;
Search target area setting means for setting a search target area for each person based on the accumulation target moving area video signal, and outputting, in response to the motion scene end signal, a search target area video signal corresponding to the search target area for each same person;
Area color information storage means for storing the color information of each of the accumulation target block areas and the search target area video signal corresponding to the search target area;
A person representative image list display means for acquiring the search target area video signal from the area color information storage means and outputting a display list;
Search target area color information generating means for generating and outputting a color distribution from a video signal included in the search target area based on the search target area video signal;
A video search apparatus comprising: comparison means for comparing each color distribution of the accumulation target block areas with the color distribution of the search target area and performing output according to the comparison result.

The video search apparatus according to claim 9, wherein the accumulation target block area dividing unit determines a division position based on a shape of a person.
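A minimal sketch of choosing division positions from the shape of a person is given below. It assumes the moving area is an upright bounding box `(x, y, w, h)` and splits it into head, torso, and leg blocks; the function name and the 15/40/45 percent ratios are hypothetical values chosen for illustration, not figures taken from this document.

```python
def divide_by_person_shape(box, ratios=(0.15, 0.40, 0.45)):
    """Split a person bounding box (x, y, w, h) into vertically stacked blocks.

    The ratios (head, torso, legs) are illustrative assumptions.
    """
    x, y, w, h = box
    blocks = []
    top = y
    for i, ratio in enumerate(ratios):
        if i == len(ratios) - 1:
            # The last block takes the remainder so the blocks cover the box exactly.
            block_height = y + h - top
        else:
            block_height = round(h * ratio)
        blocks.append((x, top, w, block_height))
        top += block_height
    return blocks

# Hypothetical person area: 40 pixels wide, 100 pixels tall.
blocks = divide_by_person_shape((10, 20, 40, 100))
```

Dividing at body-part boundaries rather than at fixed grid lines keeps, for example, shirt colors and trouser colors in separate block areas.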

A video search method for searching for a moving object in a first video based on information in a second video,
An accumulation target moving area extracting step of extracting an accumulation target moving area in the first video and outputting an accumulation target moving area video signal corresponding to the accumulation target moving area;
An accumulation target block area dividing step of dividing the accumulation target moving area into accumulation target block areas based on the accumulation target moving area video signal, and outputting accumulation target block area video signals corresponding to the accumulation target block areas constituting the accumulation target moving area;
An accumulation target block area color information generation step of extracting and outputting, based on the accumulation target block area video signal corresponding to each of the accumulation target block areas, a color distribution from the video signal included in each of the accumulation target block areas constituting the accumulation target moving area;
An accumulation target block area color information accumulation control step of accumulating each color information of the accumulation target block area in an accumulation target block area accumulation unit;
A search target area extracting step of extracting a search target area in the second video and outputting a search target area video signal corresponding to the search target area;
A search target block area dividing step of dividing the search target area into search target block areas based on the search target area video signal, and outputting search target block area video signals corresponding to the search target block areas constituting the search target area;
A search target block area color information generation step of extracting and outputting, based on the search target block area video signal corresponding to each of the search target block areas, a color distribution from the video signal included in each of the search target block areas constituting the search target area;
A video search method comprising: a comparison step of comparing each color distribution of the accumulation target block areas with the color distribution of the search target block areas and performing output according to the comparison result.

The video search method according to claim 17, wherein the comparison step derives, as the comparison result, a degree of coincidence of color appearance frequencies between each color distribution of the accumulation target block areas and each color distribution of the search target block areas.
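One common way to express a degree of coincidence between two color appearance frequency distributions is histogram intersection. The sketch below uses it as an assumed stand-in, since the claim does not name a specific measure; the function names, the pairing of blocks by position, and the color labels are likewise hypothetical.

```python
def coincidence(dist_a, dist_b):
    """Histogram intersection of two normalized color distributions (0.0 to 1.0)."""
    return sum(min(freq, dist_b.get(color, 0.0)) for color, freq in dist_a.items())

def block_coincidences(stored_blocks, query_blocks):
    """Degree of coincidence for each pair of corresponding block areas.

    Pairing blocks by position is an assumption made for this sketch.
    """
    return [coincidence(s, q) for s, q in zip(stored_blocks, query_blocks)]

# Hypothetical distributions: one dict of color label -> frequency per block area.
stored = [{"red": 0.5, "white": 0.5}, {"blue": 1.0}]
query = [{"red": 0.75, "black": 0.25}, {"blue": 0.5, "gray": 0.5}]
scores = block_coincidences(stored, query)
```

A score of 1.0 means the two block areas have identical color frequencies, and 0.0 means they share no colors.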

The video search method according to claim 17, wherein the comparison step derives, as the comparison result, a degree of coincidence of color appearance frequencies between each color distribution of the accumulation target block areas and one of the color distributions of the search target block areas.

The video search method according to claim 17, further comprising a person determination step of determining, based on the accumulation target moving area video signal, whether or not the accumulation target moving area satisfies a predetermined condition defined as a condition for the accumulation target moving area to be a person, and outputting the accumulation target moving area video signal when the accumulation target moving area satisfies the predetermined condition,
wherein, when the accumulation target moving area video signal is output in the person determination step, the accumulation target block area dividing step divides the accumulation target moving area into accumulation target block areas based on the accumulation target moving area video signal and outputs accumulation target block area video signals corresponding to the accumulation target block areas constituting the accumulation target moving area.
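The predetermined condition for treating a moving area as a person is not spelled out in the claim. Purely as an assumed example, the sketch below tests the height and the height-to-width ratio of a bounding box `(x, y, w, h)`; the thresholds and function names are hypothetical.

```python
def is_person(box, min_height=40, min_aspect=1.5, max_aspect=4.0):
    """Assumed condition: tall enough and taller than wide within a plausible range."""
    x, y, w, h = box
    if w <= 0 or h < min_height:
        return False
    return min_aspect <= h / w <= max_aspect

def person_determination(moving_areas):
    """Pass on only the moving areas that satisfy the person condition."""
    return [box for box in moving_areas if is_person(box)]

# Hypothetical moving areas: a standing person, a wide vehicle-like area, a small blob.
areas = [(0, 0, 30, 90), (0, 0, 120, 60), (5, 5, 10, 20)]
people = person_determination(areas)
```

Only the areas that pass this filter would then be divided into block areas, so non-person motion never reaches the stored color information.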

A video search method for searching for a moving object in a first video based on information in a second video,
An accumulation target moving area extracting step of extracting an accumulation target moving area in the first video and outputting an accumulation target moving area video signal corresponding to the accumulation target moving area;
An accumulation target block area dividing step of dividing the accumulation target moving area into accumulation target block areas based on the accumulation target moving area video signal, and outputting accumulation target block area video signals corresponding to the accumulation target block areas constituting the accumulation target moving area;
An accumulation target block area color information generation step of extracting and outputting, based on the accumulation target block area video signal corresponding to each of the accumulation target block areas, a color distribution from the video signal included in each of the accumulation target block areas constituting the accumulation target moving area;
An accumulation target block area color information accumulation control step of accumulating each color information of the accumulation target block area in an accumulation target block area accumulation unit;
A search target color information generation step of extracting one search target point in the second video, and extracting and outputting color information of the search target point based on a search target area video signal corresponding to the search target point; and
A comparison step of generating representative color information from each color distribution of the accumulation target block areas, comparing the representative color information with the search target color information, and performing output according to the comparison result, the video search method being characterized by comprising the foregoing steps.

The video search method according to claim 21, wherein the comparison step derives, as the comparison result, the similarity of color appearance frequency between the representative color information generated from each color distribution of the accumulation target block areas and the color information of the search target point.

An accumulation target moving area extracting step of extracting an accumulation target moving area in the video and outputting an accumulation target moving area video signal corresponding to the accumulation target moving area;
A person motion scene determination step of identifying the same person across consecutive frame images in the video based on the accumulation target moving area video signal, outputting the accumulation target moving area video signal, and outputting a motion scene end signal when there is no further movement;
An accumulation target block area dividing step of dividing the accumulation target moving area into accumulation target block areas based on the accumulation target moving area video signal, and outputting accumulation target block area video signals corresponding to the accumulation target block areas constituting the accumulation target moving area;
An accumulation target block area color information generating step of extracting, for each same person, a color feature amount from the video signal included in each of the accumulation target block areas constituting the accumulation target moving area, based on the accumulation target block area video signal corresponding to each of the accumulation target block areas, and generating, in response to the motion scene end signal, a color distribution corresponding to each of the accumulation target block areas for each same person;
A search target area setting step of setting a search target area for each person based on the accumulation target moving area video signal, and outputting, in response to the motion scene end signal, a search target area video signal corresponding to the search target area for each same person;
An area color information accumulation control step of accumulating in the area color information accumulation means the color information of each of the accumulation target block areas and the search target area video signal corresponding to the search target area;
A person representative image list display step of acquiring the search target area video signal and outputting a display list;
A search target area color information generation step for generating and outputting a color distribution from a video signal included in the search target area based on the search target area video signal;
A video search method comprising: a comparison step of comparing each color distribution of the accumulation target block areas with the color distribution of the search target area and performing output according to the comparison result.
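The per-person accumulation described above, in which color features are gathered frame by frame and a color distribution is generated only when the motion scene end signal arrives, might be sketched as follows. The class name, the person identifiers, and the use of plain color labels are assumptions for illustration; how the same person is identified across frames is outside this sketch.

```python
from collections import Counter, defaultdict

class PersonSceneAccumulator:
    """Accumulates per-block color counts for each tracked person until the scene ends."""

    def __init__(self):
        # person_id -> block index -> Counter of color labels
        self._counts = defaultdict(lambda: defaultdict(Counter))

    def add_frame(self, person_id, block_colors):
        """block_colors holds one list of quantized color labels per block area."""
        for index, colors in enumerate(block_colors):
            self._counts[person_id][index].update(colors)

    def end_scene(self, person_id):
        """On the motion scene end signal, return normalized distributions and reset."""
        blocks = self._counts.pop(person_id, {})
        result = []
        for index in sorted(blocks):
            total = sum(blocks[index].values())
            result.append({c: n / total for c, n in blocks[index].items()})
        return result

# Hypothetical person "p1" observed in two frames, with two block areas each.
acc = PersonSceneAccumulator()
acc.add_frame("p1", [["red", "red"], ["blue"]])
acc.add_frame("p1", [["red", "white"], ["blue", "blue", "gray"]])
dists = acc.end_scene("p1")
```

Generating one distribution per person per motion scene, instead of one per frame, keeps the stored color information compact and less sensitive to a single badly lit frame.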