JPH05174149A

JPH05174149A - Picture recognition device

Info

Publication number: JPH05174149A
Application number: JP3344489A
Authority: JP
Inventors: Makoto Kosugi; 信小杉
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1991-12-26
Filing date: 1991-12-26
Publication date: 1993-07-13
Anticipated expiration: 2014-01-20
Also published as: JP2849256B2

Abstract

PURPOSE:To enable the recognition after an object picture is easily found and the position and size are obtained by making a recognition object picture and an unknown picture in a mosaic state and searching/recognizing them. CONSTITUTION:A recognition object picture input part 1 fetches an object to be a recognition object, for instance, a typical human face picture if the object is human by using a TV camera or a scanner, etc., and inputs it in a rough search part 100. The object picture to be a recognition object is made in a rough mosaic state and rough position and size are found by scanning an unknown picture by using this picture as the feature of the object picture at the time of a search. Next, the object picture to be the recognition object is made in a fine mosaic state and accurate position and size are found by scanning around the already obtained rough position by using this picture. Further, recognition object picture is made in a fine mosaic and the object picture is recognized by performing the matching of the mosaic of the area obtaining the position and size in the already obtained unknown picture and the data which the previous recognition object picture is made in a finer mosaic.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、入力された自然画像の
中から目的とする画像を精度よく探索し、対象を特定す
るようにした画像認識装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image recognition apparatus for accurately searching for a target image from input natural images and specifying a target.

【０００２】[0002]

【従来の技術】代表的な自然画像は濃淡あるいはカラー
画像であるが、自然画像の中から目的とする画像を検出
し認識するため、従来は、画像の形状に注目し、形状情
報を特徴として入力画像内から特徴の一致する候補を探
出・認識するものであった。しかし、自然画像の中から
正しい形状を抽出することは至難であり、従来は、対象
物体と背景の切り分けのため、背景を事前学習したりす
ること、あるいは背景は一様なものに制限すること、線
分を抽出し易くするため人工的な剛物体などに対象を制
限すること、など種々の制約条件を課していた。2. Description of the Related Art A typical natural image is a grayscale or color image. In order to detect and recognize a target image from the natural image, conventionally, attention has been paid to the shape of the image, and the shape information is used as a feature. It was to find out and recognize candidates with matching features from the input image. However, it is very difficult to extract the correct shape from the natural image. Conventionally, in order to separate the target object and the background, it is necessary to pre-learn the background or limit the background to a uniform background. However, various restrictions have been imposed, such as limiting the target to an artificial rigid object in order to facilitate the extraction of line segments.

【０００３】[0003]

【発明が解決しようとする課題】目的の画像の有無の検
知でさえこのような困難が伴っており、まして自然画像
において対象物の正確な位置や大きさをつかむことは不
可能に近かった。Even the detection of the presence or absence of the target image is accompanied by such difficulty, and it is almost impossible to grasp the exact position and size of the object in the natural image.

【０００４】即ち、任意の自然画像の中から対象物を正
確に抽出することは至難のことであり、このため、実用
上、制約が多く有用性に難があった。本発明は、上記の
さまざまな制約条件を緩和し、目的とする画像を画面内
のどこにあるか正確に求め、この結果に基づき対象を認
識できるようにすることを目的としている。That is, it is extremely difficult to accurately extract an object from an arbitrary natural image, and therefore, there are many practical restrictions and difficulty in usefulness. An object of the present invention is to relax the above-mentioned various constraints, accurately find a target image on the screen, and recognize an object based on the result.

【０００５】[0005]

【課題を解決するための手段】本発明においては、対象
の特徴を形状ではなく粗い解像度の濃淡あるいはカラー
情報を用いる。即ち、リンカーンのモザイク画像からリ
ンカーンが識別できるように、解像度を大きく落とした
画像でも、対象の特徴を表現できることを根拠としてい
る。SUMMARY OF THE INVENTION In the present invention, the characteristics of the object are not the shape but the grayscale or color information of coarse resolution. That is, it is based on the fact that even if the image has a significantly reduced resolution so that Lincoln can be identified from the Lincoln mosaic image, the target feature can be expressed.

【０００６】具体的には、認識対象となる物体画像を粗
くモザイク化し、これを探索時の対象画像の特徴として
用いて、未知画像を走査しおおよその位置と大きさとを
見つける。次に、認識対象となる物体画像を細かくモザ
イク化し、これを用いて、既に得られた大まかな位置付
近を走査し正確な位置と大きさとを見つける。さらに、
認識対象物体画像を細かくモザイク化し、すでに得られ
た未知画像中の位置と大きさとを得ている領域のモザイ
クと上記先の認識対象物体画像を細かくモザイクしたデ
ータのマッチングにより対象画像を認識する。Specifically, the object image to be recognized is roughly mosaiced, and this is used as a feature of the target image at the time of searching to scan the unknown image to find its approximate position and size. Next, the object image to be recognized is finely mosaiced, and using this, the vicinity of the roughly obtained position is scanned to find the accurate position and size. further,
The recognition target object image is finely mosaiced, and the target image is recognized by matching the mosaic of the region in which the position and the size have already been obtained in the unknown image with the data obtained by finely mosaicing the above recognition target object image.

【０００７】[0007]

【作用】このように画像を数段階でモザイク化すること
により、任意の画面中から対象を正確に探し出し、さら
に認識することが容易に可能となる。In this way, by mosaicing the image in several stages, it is possible to easily find the object from any screen and further recognize it.

【０００８】[0008]

【実施例】以下、本発明の一実施例について図面により
説明する。図１は本発明における画像認識を行う一実施
例を示す図であり、図２は図１に示す粗探索部の構成、
図３は図１に示す詳細探索部の構成、図４は図１に示す
認識部の構成を示す。また図５と図６とは一緒になって
１つの図を表しており探索対象画像あるいは未知画像を
モザイク化し、未知画像の中から目的の対象を探索・認
識する過程を説明する図である。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described below with reference to the drawings. FIG. 1 is a diagram showing an embodiment for performing image recognition according to the present invention, and FIG. 2 is a configuration of a rough search unit shown in FIG.
FIG. 3 shows the configuration of the detail search unit shown in FIG. 1, and FIG. 4 shows the configuration of the recognition unit shown in FIG. FIG. 5 and FIG. 6 together represent one figure and are diagrams for explaining a process of mosaicing a search target image or an unknown image and searching / recognizing a target target from the unknown image.

【０００９】図１において１は認識対象画像入力部であ
り、ＴＶカメラあるいはスキャナなどを用い、認識対象
となる物体、例えば人物であれば典型的な人物の顔画像
を取り込み、粗探索部１００へ、具体的には、図２の粗
探索対象画像バッファ１０１へ入力する。したがって、
当該バッファ１０１の内容は濃淡あるいはカラーで表さ
れる画素の集合である。In FIG. 1, reference numeral 1 denotes a recognition target image input unit, which uses a TV camera, a scanner or the like to capture an object to be recognized, for example, a face image of a typical person in the case of a person, and to a rough search unit 100. Specifically, it is input to the rough search target image buffer 101 in FIG. Therefore,
The content of the buffer 101 is a set of pixels represented by shading or color.

【００１０】この認識対象画像を行列Ｆ＝［ｆ］で表
し、図５、図６のように、これをＷａ画素×Ｗａ画素の
サイズのブロックでＭ×Ｎに分割し、このモザイク画像
ＦＣの各ブロックをブロック内の代表値、例えば濃淡な
らばブロック内平均値、カラーならばブロック内で最大
の頻度を有する色で表す。This image to be recognized is represented by a matrix F = [f]. As shown in FIGS. 5 and 6, this image is divided into M × N blocks each having a size of Wa pixels × Wa pixels, and the mosaic image FC is divided into blocks. Each block is represented by a representative value in the block, for example, an average value in the block if it is a shade, and a color having the maximum frequency in the block if it is a color.

【００１１】粗探索対象画像モザイク化部１０２は、前
記バッファ１０１の画像をモザイク化し、粗探索対象画
像辞書１０３に蓄積する。なお、必要ならば、複数の探
索対象、この例では複数の顔画像の平均値をとり、代表
的な顔のモザイクデータとするか、複数の粗探索対象画
像を用意してもよい。The coarse search target image mosaic unit 102 mosaics the image in the buffer 101 and stores it in the coarse search target image dictionary 103. If necessary, a plurality of search targets, in this example, an average value of a plurality of face images may be taken as representative face mosaic data, or a plurality of rough search target images may be prepared.

【００１２】一方、未知画像Ｕは図１の未知画像入力部
２を介して粗探索部１００の中の、図２に示す未知画像
バッファ１０４に取り込まれる。そこで未知画像Ｕに対
して、やはり図５のように、未知画像モザイク化部１０
５でＷａ画素×Ｗａ画素のブロックサイズでＰ×Ｑに分
割し、ブロックごとに代表値を算出しモザイク画像ＵＣ
を得る。なお、未知画像入力部２は前記の認識対象画像
入力部１と同一でもよい。On the other hand, the unknown image U is taken into the unknown image buffer 104 shown in FIG. 2 in the rough search unit 100 via the unknown image input unit 2 of FIG. Therefore, for the unknown image U, as shown in FIG.
In 5, a block size of Wa pixels × Wa pixels is divided into P × Q, a representative value is calculated for each block, and a mosaic image UC is calculated.
To get The unknown image input unit 2 may be the same as the recognition target image input unit 1 described above.

【００１３】次に、図５のように、未知画像のモザイク
を走査して粗探索対象画像モザイクと一致する場所を探
索する。即ち、次の原理に基づく。未知画像のＰ×Ｑの
モザイクデータのうち、任意のＭ×Ｎのモザイクデータ
をＵｉ（斜線が存在する領域で示す）としたとき、Ｕｉ
と粗探索対象モザイクデータＦＣとの距離Ｄｉが最小値
となるときのＵｉの位置を粗探索結果とする。この距離
Ｄｉとして、種々の出し方があるが代表的なものとして
ユークリッド距離がある。Next, as shown in FIG. 5, the mosaic of the unknown image is scanned to search for a location that matches the coarse-search target image mosaic. That is, it is based on the following principle. When the arbitrary M × N mosaic data among the P × Q mosaic data of the unknown image is set to Ui (indicated by a shaded area), Ui
The position of Ui when the distance Di between the target mosaic data FC and the rough search target mosaic data FC is the minimum value is the rough search result. As the distance Di, there are various ways to output, but a typical one is the Euclidean distance.

【００１４】具体的には次のとおりである。図２におい
て、１０６は候補選択部であり、未知画像モザイク化部
１０５の中からＭ×Ｎブロック分の枠に入る候補データ
Ｕｉ（ｉ＝１，２…）を選択する。一般的には、はじめ
に未知画像の左上のＭ×Ｎブロック分がＵ１として選択
される。このモザイクデータＵ１との粗探索対象画像辞
書１０３からの粗探索対象モザイクデータＦＣとが距離
算出部１０７に入力され、上記の距離が算出される。こ
の結果は、このときの未知画像モザイクデータＵＣにお
けるＭ×Ｎの位置データ、即ちＵ１の位置データ、なら
びにモザイクのブロックサイズＷａとともに位置・サイ
ズ・距離蓄積部１０８に蓄えられる。Specifically, it is as follows. In FIG. 2, reference numeral 106 denotes a candidate selection unit that selects candidate data Ui (i = 1, 2 ...) That falls within the frame of M × N blocks from the unknown image mosaic unit 105. Generally, the upper left M × N block of the unknown image is generally selected as U1. The mosaic data U1 and the rough search target mosaic data FC from the rough search target image dictionary 103 are input to the distance calculation unit 107, and the distance is calculated. The result is stored in the position / size / distance storage unit 108 together with the M × N position data in the unknown image mosaic data UC at this time, that is, the U1 position data and the mosaic block size Wa.

【００１５】さらに、候補選択部１０６は未知画像モザ
イク化部１０５から次の候補Ｕ２を選択し、距離算出部
１０７はこの時の距離を算出して、候補Ｕ２の位置、ブ
ロックサイズとともに位置・サイズ・距離蓄積部１０８
に蓄える。同様に、未知画像モザイクの全領域に対して
Ｍ×Ｎの候補Ｕｉが順次選択され、選択された位置、ブ
ロックサイズと算出された距離はすべて位置・サイズ・
距離蓄積部１０８に蓄えられる。Further, the candidate selection unit 106 selects the next candidate U2 from the unknown image mosaic unit 105, and the distance calculation unit 107 calculates the distance at this time, and the position and block size of the candidate U2 as well as the position / size.・ Distance storage unit 108
Store in. Similarly, M × N candidate Ui are sequentially selected for all regions of the unknown image mosaic, and the selected position, block size, and calculated distance are all position / size /
It is stored in the distance storage unit 108.

【００１６】ところで、以上では未知画像のモザイク化
において、探索対象画像と同じブロックサイズ、Ｗａ画
素×Ｗａ画素を用いた。探索対象画像と未知画像内の探
索対象画像が同じサイズであればこのままでよいが、一
般的には、未知画像内における探索対象画像のサイズも
また未知である。このため、未知画像モザイク化部１０
５は、ブロックサイズの画素数を、例えば、Ｗａの１／
５〜１０／５などのＷａ’に変化させる。そこで、一つ
一つのブロックサイズＷａ’ごとに上記の手順を繰り返
して、得られた距離とそのときの位置ならびにブロック
サイズＷａ’を位置・サイズ・距離蓄積部１０８に蓄え
る。By the way, in the above, in the mosaicing of the unknown image, the same block size as the search target image, Wa pixels × Wa pixels, is used. If the search target image and the search target image in the unknown image have the same size, this size may be left unchanged, but generally, the size of the search target image in the unknown image is also unknown. Therefore, the unknown image mosaic unit 10
5 indicates the number of pixels of the block size, for example, 1 / Wa
Change to Wa 'such as 5 to 10/5. Therefore, the above procedure is repeated for each block size Wa ′, and the obtained distance, the position at that time, and the block size Wa ′ are stored in the position / size / distance storage unit 108.

【００１７】また、複数の探索対象画像を辞書においた
場合も、同様にして、一つ一つの探索対象画像ごとに上
記の手順を繰り返し、得られた距離とそのときの位置お
よびブロックサイズを位置・サイズ・距離蓄積部１０８
に蓄える。Also, when a plurality of search target images are stored in the dictionary, similarly, the above procedure is repeated for each search target image, and the obtained distance and the position and block size at that time are determined.・ Size / distance storage unit 108
Store in.

【００１８】こうして、全部の場合の距離が位置・サイ
ズ・距離蓄積部１０８に蓄えられると、最小距離検出部
１０９と、位置・サイズ・距離蓄積部１０８の中の距離
データのうち、最小値を取るものを検出する。この値は
認識対象有無判定部１１０に送られ、この値がある閾値
以下の時は未知画像内に対象画像があると判断し、その
位置およびブロックサイズを位置・サイズ・距離蓄積部
１０８から取り出して結果を粗探索結果出力部１１１に
出力する。粗探索結果出力部１１１は、ブロックサイズ
Ｗａから未知画像内の探索対象画像の大きさを［Ｍ×Ｗ
ａ］×［Ｎ×Ｗａ］より算出する。In this way, when the distances in all cases are stored in the position / size / distance storage unit 108, the minimum value of the minimum distance detection unit 109 and the distance data in the position / size / distance storage unit 108 is calculated. Detect what you take. This value is sent to the recognition target presence / absence determination unit 110, and when this value is less than or equal to a threshold value, it is determined that the target image exists in the unknown image, and the position and block size thereof are extracted from the position / size / distance accumulation unit 108. And outputs the result to the rough search result output unit 111. The rough search result output unit 111 calculates the size of the search target image in the unknown image from the block size Wa by [M × W
It is calculated from a] × [N × Wa].

【００１９】一方、最小距離値がある閾値を越えたとき
は未知画像内に対象画像が無いと判断し、「無い」こと
を粗探索結果出力部１１１に出力する。粗探索結果出力
部１１１はこれらの結果を直ちに詳細探索部２００に通
知する。On the other hand, when the minimum distance value exceeds a certain threshold value, it is determined that the target image does not exist in the unknown image, and "no" is output to the rough search result output unit 111. The coarse search result output unit 111 immediately notifies the detailed search unit 200 of these results.

【００２０】こうして粗探索により、認識対象のおおよ
その位置と大きさがわかると、モザイクのブロックを小
さくして、詳細にその位置を探索する。なお、詳細探索
では粗探索に比べ対象画像の領域を一部分に絞ってもよ
いが、ここでは、同一領域を用いる場合をのべる。In this way, when the rough position and size of the recognition target are found by the rough search, the block of the mosaic is made small and the position is searched in detail. In the detailed search, the area of the target image may be narrowed down to a part as compared with the coarse search, but here, the same area is used.

【００２１】まず、予め、図６のように、認識対象をＷ
ｂ画素×Ｗｂ画素（Ｗｂ＜Ｗａ）のサイズのブロックで
Ｍ’×Ｎ’に分割し、各ブロックをブロック内の代表値
で表す。First, as shown in FIG. 6, the recognition target is set to W in advance.
A block having a size of b pixels × Wb pixels (Wb <Wa) is divided into M ′ × N ′, and each block is represented by a representative value in the block.

【００２２】具体的には、詳細探索対象画像バッファ２
０１は、認識対象画像入力部１から認識対象画像データ
を取り込み、これを詳細探索対象画像モザイク化部２０
２に送る。詳細探索対象画像モザイク化部２０２は、こ
の画像をモザイク化（ＦＤ）し、詳細探索対象画像辞書
２０３に蓄積する。Specifically, the detailed search target image buffer 2
Reference numeral 01 captures recognition target image data from the recognition target image input unit 1 and stores the recognition target image data in the detailed search target image mosaic unit 20.
Send to 2. The detailed search target image mosaicking unit 202 mosaics (FD) this image and stores it in the detailed search target image dictionary 203.

【００２３】なお、必要ならば、複数の詳細探索対象画
像の平均値をとるか、複数の詳細探索対象画像を用意す
ればよいのは、粗探索の場合と同じである。一方、上記
の粗探索結果が粗探索結果通知部２２０に通知される
と、未知画像バッファ２０４はこれを知って、図１の未
知画像入力部２から、未知画像Ｕの中の認識対象画像の
該当部分を取り込む。そこで、未知画像モザイク化部２
０５は、これに対して、やはり図６のように、Ｗｂ画素
×Ｗｂ画素のブロックサイズでＰ’×Ｑ’に分割し、ブ
ロックごとに代表値を算出しモザイクＵＤを得る。If necessary, the average value of a plurality of detailed search target images may be taken or a plurality of detailed search target images may be prepared, as in the case of the rough search. On the other hand, when the above-described rough search result is notified to the rough search result notification unit 220, the unknown image buffer 204 knows this, and the unknown image input unit 2 of FIG. Capture the relevant part. Therefore, the unknown image mosaic unit 2
On the other hand, as shown in FIG. 6, 05 is divided into P ′ × Q ′ with a block size of Wb pixels × Wb pixels, a representative value is calculated for each block, and a mosaic UD is obtained.

【００２４】そこで、粗探索の場合と同様、図６のよう
に、未知画像の該当部分画像のモザイクを走査して詳細
探索対象モザイクＦＤと一致する場所を探索する。すな
わち、部分画像のＰ’×Ｑ’のモザイクＵＤのうち任意
のＭ’×Ｎ’のモザイクをＵｊ（斜線が存在する領域で
示す）としたとき、Ｕｊと探索対象のモザイクＦＤとの
距離Ｄｊを最小となるときのＵｊの位置を詳細探索結果
とする。Therefore, as in the case of the rough search, as shown in FIG. 6, the mosaic of the corresponding partial image of the unknown image is scanned to search for a location that matches the detailed search target mosaic FD. That is, when an arbitrary M ′ × N ′ mosaic of the P ′ × Q ′ mosaic UD of the partial image is Uj (indicated by a shaded area), the distance Dj between Uj and the search target mosaic FD The position of Uj when is minimized is the detailed search result.

【００２５】具体的には、図３において、候補選択部２
０６は、未知画像モザイク化部２０５の中からＭ’×
Ｎ’分の枠に入る候補Ｕｊを取り出す。距離算出部２０
７は、このＵｊと詳細探索対象画像辞書２０３からの詳
細探索対象モザイクＦＤの距離Ｄｊを算出し、このとき
のＵｊの位置データ、ならびにモザイクのブロックサイ
ズＷｂとともに位置・サイズ・距離蓄積部２０８に送
る。Specifically, in FIG. 3, the candidate selection unit 2
06 represents M ′ × out of the unknown image mosaicking unit 205.
A candidate Uj that falls within the N ′ frame is taken out. Distance calculator 20
7 calculates the distance Dj of this Uj and the detailed search target mosaic FD from the detailed search target image dictionary 203, and stores the position data of Uj at this time and the block size Wb of the mosaic in the position / size / distance accumulating unit 208. send.

【００２６】ここで、必要に応じ、ブロックサイズＷｂ
を変化させ（Ｗｂ’）たり、複数の詳細探索対象画像を
用いたりすることも、粗探索の場合と同様である。こう
して、最小距離検出部２０９で最小値を検出すると、こ
の時のＵｊの位置データならびにブロックサイズＷｂが
詳細探索結果出力部２１０に送られる。詳細探索結果出
力部２１０はブロックサイズＷｂより探索対象画像の正
確な大きさ、即ち、［Ｍ’×Ｗｂ］×［Ｎ’×Ｗｂ］を
算出し、位置データとともに認識部３００に送出する。Here, if necessary, the block size Wb
Is changed (Wb ′) or a plurality of detailed search target images are used, as in the case of the rough search. Thus, when the minimum distance detection unit 209 detects the minimum value, the position data of Uj and the block size Wb at this time are sent to the detailed search result output unit 210. The detailed search result output unit 210 calculates an accurate size of the search target image from the block size Wb, that is, [M ′ × Wb] × [N ′ × Wb], and sends it to the recognition unit 300 together with the position data.

【００２７】こうして詳細探索により、認識対象の正確
な位置と大きさがわかると、モザイクのブロックを小さ
くして、対象が何かを認識する。まず、予め、図６のよ
うに、認識対象をＷｃ画素×Ｗｃ画素（Ｗｃ＜Ｗｂ）の
サイズのブロックでＭ”×Ｎ”に分割する。このモザイ
クの各ブロックを、前記の場合と同様にブロック内の代
表値で表す。なお、認識対象の特徴が部分領域に集中し
ていれば、その部分をＭ”×Ｎ”に分割してもよい。す
なわち、探索の過程では対象に普遍的な特徴が必要であ
るが、認識の過程では、個々を区別する特徴が必要であ
り、より細かなモザイクや部分領域が有効となる。In this way, when the precise position and size of the recognition target are found by the detailed search, the mosaic block is reduced to recognize what the target is. First, as shown in FIG. 6, the recognition target is divided into M ″ × N ″ in advance by a block having a size of Wc pixels × Wc pixels (Wc <Wb). Each block of this mosaic is represented by a representative value in the block as in the above case. If the features to be recognized are concentrated in the partial area, that portion may be divided into M ″ × N ″. That is, in the process of searching, objects need universal features, but in the process of recognition, features that distinguish them are required, and finer mosaics and partial regions are effective.

【００２８】ここでは、この部分領域を用いる場合をの
べる。具体的には、認識対象画像バッファ３０１は、認
識対象画像入力部１から認識対象画像の部分領域のデー
タを取り込み、これを認識対象画像モザイク化部３０２
に送る。認識対象画像モザイク化部３０２は、この画像
をモザイク化（ＦＤＤ）し、認識対象画像辞書３０３に
蓄積する。Here, the case of using this partial area will be described. Specifically, the recognition target image buffer 301 takes in the data of the partial area of the recognition target image from the recognition target image input unit 1 and stores it in the recognition target image mosaicking unit 302.
Send to. The recognition target image mosaicking unit 302 mosaics (FDD) this image and stores it in the recognition target image dictionary 303.

【００２９】なお、通常、多くの認識対象画像について
モザイク化し、認識対象画像辞書３０３に蓄積する。一
方、上記の詳細探索結果が詳細探索部２００より詳細探
索結果通知部３２０に通知されると、未知画像バッファ
３０４はこれを知って、図１の未知画像入力部２から、
未知画像Ｕの中の認識対象画像の正確な位置と大きさの
画像を取り込む。画像切り出し・モザイク化部３０５
は、さらにその中の部分領域に対して、やはり図６のよ
うにＷｃ画素×Ｗｃ画素のブロックサイズでＭ”×Ｎ”
に分割し、ブロックごとに代表値を算出しモザイクデー
タＵＤＤを得る。Note that, usually, many recognition target images are mosaiced and stored in the recognition target image dictionary 303. On the other hand, when the detailed search result is notified from the detailed search unit 200 to the detailed search result notification unit 320, the unknown image buffer 304 knows this and the unknown image input unit 2 in FIG.
An image having an accurate position and size of the recognition target image in the unknown image U is captured. Image clipping / mosaicing unit 305
Is a block size of Wc pixels × Wc pixels as shown in FIG.
And the representative value is calculated for each block to obtain mosaic data UDD.

【００３０】そこで、このモザイクデータＵＤＤと認識
対象画像モザイクデータＦＤＤの距離を算出し、これが
或る閾値以下であれば、未知画像は認識対象画像である
と判定する。あるいは、認識対象画像が複数あれば、Ｆ
ＤＤとの距離の最小となるものを認識対象として判定す
る。ここでは、一般的な、後者の場合を例にのべる。Therefore, the distance between the mosaic data UDD and the recognition target image mosaic data FDD is calculated, and if this is less than a certain threshold value, it is determined that the unknown image is the recognition target image. Alternatively, if there are a plurality of recognition target images, F
The object with the smallest distance from DD is determined as the recognition target. Here, a general case of the latter case will be described as an example.

【００３１】具体的には、図４において、距離算出部３
０６は、画像切り出し・モザイク化部３０５の出力であ
る未知画像モザイクデータＵＤＤと認識対象画像辞書３
０３からの認識対象画像モザイクデータＦＤＤの距離を
算出し、このときの距離と認識対象画像の符号を距離蓄
積部３０７に送る。同様に、距離算出部３０６は、認識
対象画像辞書３０３から次々と認識対象画像モザイクデ
ータを取り出しては、未知画像モザイクデータＵＤＤと
の距離を算出し、この値と認識対象画像の符号を距離蓄
積部３０７に送る。Specifically, in FIG. 4, the distance calculation unit 3
Reference numeral 06 denotes the unknown image mosaic data UDD output from the image clipping / mosaicing unit 305 and the recognition target image dictionary 3
The distance of the recognition target image mosaic data FDD from 03 is calculated, and the distance at this time and the code of the recognition target image are sent to the distance storage unit 307. Similarly, the distance calculation unit 306 sequentially extracts the recognition target image mosaic data from the recognition target image dictionary 303, calculates the distance to the unknown image mosaic data UDD, and accumulates this value and the code of the recognition target image. Send to section 307.

【００３２】つぎに、最小距離検出部３０８は、距離蓄
積部３０７の中の最小値を検出すると、この時の認識対
象画像の符号を認識結果出力部３０９へ送出する。この
結果はさらに出力部３へ送出される。Next, when the minimum distance detection unit 308 detects the minimum value in the distance storage unit 307, the minimum distance detection unit 308 sends the code of the recognition target image at this time to the recognition result output unit 309. The result is further sent to the output unit 3.

【００３３】一方、粗探索の結果、対象画像が「無し」
の場合は、このことが出力部３に通知される。こうし
て、未知画像Ｕの中から、まず認識対象画像Ｆのおおよ
その位置がみつかり、これを手がかりに正確な位置と大
きさが見つけられ、さらに対象が特定される。On the other hand, as a result of the rough search, the target image is "none".
In the case of, this is notified to the output unit 3. In this way, the approximate position of the recognition target image F is first found in the unknown image U, and the accurate position and size are found by using this as a clue, and the target is further specified.

【００３４】[0034]

【発明の効果】本発明によれば、認識対象画像および未
知画像をモザイク化し探索・認識することによって、背
景に対する一様性など特殊な条件を与えることなく、ま
た、線分を用いた場合のノイズによるエラーを起すこと
なく、容易に対象画像を見いだしてその位置や大きさを
得た上、認識することが可能となる。とくに、形状や色
が似通った対象、例えば、人間の顔や蝶などには効果的
である。According to the present invention, by performing mosaic search and recognition on the recognition target image and the unknown image, no special condition such as uniformity with respect to the background is given, and when line segments are used. It is possible to easily find the target image, obtain its position and size, and recognize the target image without causing an error due to noise. In particular, it is effective for objects having similar shapes and colors, such as human faces and butterflies.

[Brief description of drawings]

【図１】本発明の全体を説明するブロック図である。FIG. 1 is a block diagram illustrating the entire invention.

【図２】粗探索部１００の内容を示す図である。2 is a diagram showing the contents of a coarse search unit 100. FIG.

【図３】詳細探索部２００の内容を示す図である。FIG. 3 is a diagram showing the contents of a detail search unit 200.

【図４】認識部３００の内容を示す図である。FIG. 4 is a diagram showing the contents of a recognition unit 300.

【図５】認識対象画像ならびに未知画像をモザイク化
し、未知画像の中から認識対象を段階的に探索・認識す
る過程を説明する図である。FIG. 5 is a diagram illustrating a process in which a recognition target image and an unknown image are mosaiced, and a recognition target is searched and recognized in stages from the unknown image.

【図６】図５と一緒になって１つの図を構成し、認識対
象画像ならびに未知画像をモザイク化し、未知画像の中
から認識対象を段階的に探索・認識する過程を説明する
図である。FIG. 6 is a diagram illustrating a process of forming a single figure together with FIG. 5, mosaicizing a recognition target image and an unknown image, and gradually searching for and recognizing the recognition target from the unknown image. ..

[Explanation of symbols]

１認識対象画像入力部２未知画像入力部３出力部１００粗探索部２００詳細探索部３００認識部 1 Recognition Target Image Input Section 2 Unknown Image Input Section 3 Output Section 100 Coarse Search Section 200 Detailed Search Section 300 Recognition Section

Claims

[Claims]

1. A first means for dividing a natural image to be recognized into M × N in advance with a large block size, calculating a representative value of the shade or color of each block, and storing the representative value as dictionary data, , A second means for previously dividing the natural image to be recognized into M ′ × N ′ with a small block size, calculating a representative value of the shade or color of each block, and storing it as dictionary data. A third means for dividing the natural image to be recognized into M ″ × N ″ with a smaller block size, calculating a representative value of the grayscale or color of each block and storing the dictionary data, and an unknown natural image When given as an input, the input image is divided into P × Q with a large block size to calculate a representative value of each block, and an arbitrary M × N area is taken out from this to obtain a first means. The fourth means for calculating and accumulating the distance formed by the representative value group obtained in step (4) and the fourth means over the entire area of P × Q are repeatedly applied to obtain the minimum value from the obtained distances. When this minimum value is smaller than the threshold value given in advance, it is determined that there is an image corresponding to the recognition target image at the position where the minimum value is derived in the input image. On the other hand, when the minimum value is larger than the threshold value, The fifth means for determining that there is no image corresponding to the recognition target image, and when the fifth means determines that the recognition target is present at a certain position, M × in the vicinity of the corresponding position in the input image The N area is divided into P ′ × Q ′ by small blocks, the representative value of each block is calculated, and an arbitrary M ′ × N ′ area is taken out from this and a representative value group obtained by the second means is obtained. A sixth means for calculating the distance formed by and accumulating it, and for the entire area of P'x The sixth means is repeatedly applied, the minimum value is obtained from the obtained distances, and the position and size of the M ′ × N ′ image at this time are set as the position and size of the recognition target image. And a region of the corresponding position and size in the input image obtained by the seventh means is divided into M ″ × N ″ with a smaller block size to calculate the representative value of each block, and the third means An image recognition apparatus comprising: an eighth means for calculating a distance formed by the representative value group obtained in step (4), and if the distance is equal to or less than a certain threshold value, the image is recognized as a recognition target image.

2. In the first means, a target image is previously set as a plurality of similar image groups, which are not unique, and each image is divided into M × N to calculate a representative value of each block. The image recognition apparatus according to claim 1, wherein an average value is calculated for each corresponding block among a plurality of images, and the average value is stored as a dictionary.

3. In the second means, a target image is previously set as a plurality of not only unique but similar image groups, each image is divided into M ′ × N ′, and a representative value of each block is represented. The image recognition apparatus according to claim 1, wherein the average value is calculated for each corresponding block among a plurality of images, and the average value is stored as a dictionary.

4. In the third means, a target image is not a unique image but a plurality of image groups, and each image is divided into M ″ × N ″ to calculate a representative value of each block. The image recognition apparatus according to claim 1, wherein the image recognition apparatus stores the dictionary as a dictionary.

5. The second means, wherein a partial image of the recognition target image is M ′ × N ′ in advance with a small block size.
And the representative value of each block is calculated and stored as dictionary data. In the sixth means and the seventh means, the partial area of the M × N area obtained by the fifth means is P ′. 2. The distance to the relevant dictionary data is calculated for each of M ′ × N ′ among them divided into × Q ′.
The image recognition device described.

6. The third means, wherein the partial image of the recognition target image is M ″ with a smaller block size in advance.
× N ″, and the representative value of each block is calculated and stored as dictionary data.
M × N 'area with a smaller block size
The image recognition apparatus according to claim 1, wherein the image recognition apparatus is divided into

7. The fourth means is characterized in that the block size used for division is variable.
The image recognition device described.

8. The sixth means, wherein the block size used for division is variable.
The image recognition device described.