JP4285640B2

JP4285640B2 - Object identification method, apparatus and program

Info

Publication number: JP4285640B2
Application number: JP2003282698A
Authority: JP
Inventors: 貞登赤堀
Original assignee: Fujifilm Corp
Current assignee: Fujifilm Corp
Priority date: 2002-07-30
Filing date: 2003-07-30
Publication date: 2009-06-24
Anticipated expiration: 2023-07-30
Also published as: JP2004078939A

Description

本発明は、画像を構成するオブジェクトの種類を自動的に識別するオブジェクト識別方法および装置ならびにプログラムに関するものである。 The present invention relates to an object identification method, apparatus, and program for automatically identifying the types of objects constituting an image.

デジタルカメラ等で撮像した画像情報において、画像情報にどのような画像が撮像されているかが識別することができれば、たとえば画像に含まれるオブジェクトの種類毎に分類、検索もしくは画像処理などをすることができる。 If image information captured by a digital camera or the like can identify what image is captured in the image information, for example, classification, search, or image processing may be performed for each type of object included in the image. it can.

たとえば画像の分類・検索をする場合、画像に含まれる物理的特徴量を用いて類似度を判断する画像検索システムが提案されている。すなわち、入力画像の局所領域を抽出して、その局所領域が位置と大きさを変化させながら参照画像と照合されて、画像の分類・検索を行う手法がある。また上記手法において、局所領域の色ヒストグラムを利用してヒストグラムを参照画像の色ヒストグラムと照合することにより物体の位置を検出して、画像の分類・検索を効率よく行う手法がある（たとえば非特許文献１参照）。しかし、上述したいずれの方法においても、画像の物理的特徴量で類似度を識別しているため、種類的には似ていないものが物理量の類似性により似ていると判断されてしまう場合があり、検索の精度が悪いという問題がある。 For example, when classifying and searching for images, an image search system has been proposed in which similarity is determined using physical feature amounts included in images. That is, there is a technique of extracting and localizing an input image, collating the reference region with a reference image while changing the position and size of the local region, and classifying and searching for the image. Further, in the above method, there is a method for efficiently classifying and searching images by detecting the position of an object by using a color histogram of a local region and comparing the histogram with a color histogram of a reference image (for example, non-patent) Reference 1). However, in any of the above-described methods, since the similarity is identified by the physical feature amount of the image, it may be determined that what is not similar in kind is similar due to the similarity of the physical amount. There is a problem that the accuracy of the search is poor.

また、画像処理を行う場合、高画質化処理の一例として特定色領域を識別して異なる処理をする方法が知られている（たとえば特許文献１参照）。これは、雑音成分が目立ちやすい領域を色で識別して、雑音除去を行うものである。しかし、色のみに基づいて識別しているため、たとえば肌と砂等を混同してしまう場合がある。そして、砂の領域を肌の領域と誤って認識して、砂の領域に雑音除去を行ってしまうと、テクスチャが失われて不自然な画像になるおそれがある。
特公平５−６２８７９号公報電子情報通信学会誌、ｖｏｌ．ｊ８１−ＤII，ｎｏ．９，ｐｐ．２０３５−２０４２，１９９８ When performing image processing, a method of identifying a specific color region and performing different processing is known as an example of high image quality processing (see, for example, Patent Document 1). In this method, an area where a noise component is conspicuous is identified by color, and noise is removed. However, since the identification is based only on the color, for example, skin and sand may be confused. If the sand area is mistakenly recognized as a skin area and noise is removed from the sand area, the texture may be lost, resulting in an unnatural image.
Japanese Patent Publication No. 5-62879 The Institute of Electronics, Information and Communication Engineers, vol. j81-DII, no. 9, pp. 2035-2042, 1998

上述のように、画像から直接得られる情報に基づいて画像の分類、検索もしくは画像処理を行う場合、ユーザーに適切な情報を提供することができない。これを解決する手法の１つとして、オブジェクトの種類を識別した上で、画像の分類、検索もしくは画像処理を行うことが考えられる。すると、画像の分類・検索においては、識別した種類に応じて分類・検索を行うことができるため、画像の分類・検索を容易に精度よく行うことができる。また、画像処理をする場合においても、そのオブジェクトにあった画像処理条件を用いて画像処理を行うことができる。 As described above, when image classification, retrieval, or image processing is performed based on information obtained directly from an image, appropriate information cannot be provided to the user. As one method for solving this, it is conceivable to classify, search or perform image processing after identifying the type of object. Then, in the image classification / search, the image can be classified / searched according to the identified type. Therefore, the image classification / search can be easily and accurately performed. Even when image processing is performed, image processing can be performed using image processing conditions suitable for the object.

上述した画像に含まれるオブジェクトの種類の識別は、画像に含まれるオブジェクト領域を抽出して、各オブジェクト領域毎に種類を識別する必要がある。このとき、たとえばユーザーが画面を見ながら画像内のオブジェクト領域を抽出して、各オブジェクト毎に種類を入力することも考えられる。しかし、ユーザーによるオブジェクト領域の種類の付与は作業の手間がかかるという問題がある。 To identify the type of object included in the image described above, it is necessary to extract the object area included in the image and identify the type for each object area. At this time, for example, the user may extract the object area in the image while looking at the screen and input the type for each object. However, there is a problem that it takes time and effort to give the object region type by the user.

そこで、本発明は、画像に含まれるオブジェクトの種類を自動的に識別することができるオブジェクト識別方法および装置ならびにプログラムを提供することを目的とする。 Therefore, an object of the present invention is to provide an object identification method, apparatus, and program that can automatically identify the type of an object included in an image.

本発明のオブジェクト識別方法は、画像に含まれるオブジェクトの種類を識別するオブジェクト識別方法において、前記画像を前記オブジェクト毎に領域分割したオブジェクト領域と、前記画像を設定画素数からなる、前記オブジェクト領域より小さい多数の領域に分割した複数のブロック領域とを生成するステップと、生成した複数の前記各ブロック領域毎にそれぞれ種類を識別するステップと、識別した前記ブロック領域の種類を前記各オブジェクト領域毎に集計するステップと、集計した結果を用いて前記オブジェクト領域の種類を識別するステップとを有することを特徴とする。 The object identification method of the present invention is an object identification method for identifying the type of an object included in an image, the object region comprising: an object region obtained by dividing the image into regions for each object; and the image comprising a set number of pixels. Generating a plurality of block areas divided into a plurality of small areas, identifying a type for each of the plurality of generated block areas, and identifying the type of the identified block area for each object area The method includes a step of counting, and a step of identifying the type of the object region using the totaled result.

本発明のオブジェクト識別装置は、画像に含まれるオブジェクトの種類を識別するオブジェクト識別装置において、前記画像を前記オブジェクト毎に領域分割して複数のオブジェクト領域を生成するオブジェクト領域生成手段と、前記画像を設定画素数からなる、前記オブジェクト領域より小さい多数の領域に分割して複数のブロック領域を生成するブロック領域生成手段と、生成された複数の前記ブロック領域毎にそれぞれ種類を識別するブロック領域識別手段と、前記各ブロック領域毎に識別された前記ブロック領域の種類を前記オブジェクト領域毎に集計し、集計した結果を用いて前記オブジェクトの種類を識別するオブジェクト識別手段とを有することを特徴とする。 The object identification device of the present invention is an object identification device for identifying the type of an object included in an image, an object region generation means for dividing the image into regions for each object to generate a plurality of object regions, and the image A block area generating unit configured to generate a plurality of block areas by dividing the pixel area into a plurality of areas smaller than the object area, and a block area identifying unit for identifying a type for each of the generated block areas And object identifying means for totalizing the types of the block areas identified for each of the block areas for each object area and identifying the types of the objects using the totaled results.

本発明のオブジェクト識別プログラムは、コンピュータに、画像をオブジェクト毎に領域分割したオブジェクト領域と、前記画像を設定画素数からなる、前記オブジェクト領域より小さい多数の領域に分割した複数のブロック領域とを生成する手順と、生成した複数の前記各ブロック領域毎にそれぞれ種類を識別する手順と、識別した前記ブロック領域の種類を前記各オブジェクト領域毎に集計する手順と、集計した結果を用いて前記オブジェクト領域の種類を識別する手順とを実行させることを特徴とするものである。 The object identification program according to the present invention generates, on a computer, an object area obtained by dividing an image into areas for each object, and a plurality of block areas obtained by dividing the image into a plurality of areas smaller than the object area, each having a set number of pixels. A procedure for identifying the type for each of the plurality of generated block regions, a procedure for counting the types of the identified block regions for each object region, and the object region using the tabulated result And a procedure for identifying the type of the program.

ここで、「オブジェクト」はたとえば人物、空、海、木、建物等の画像に含まれる被写体を意味し、「オブジェクト領域」は被写体が画像内に占める領域を意味する。 Here, “object” means a subject included in an image such as a person, sky, sea, tree, building, etc., and “object region” means a region occupied by the subject in the image.

「オブジェクトの種類を識別する」とは、画像内のオブジェクトについてたとえば「山」、「海」、「花」、「空」等の種類であることを特定することを意味し、さらにオブジェクトの種類がわからない場合に「不明」であることを特定することも含む。 “Identify the type of object” means that the object in the image is identified as a type such as “mountain”, “sea”, “flower”, “sky”, etc. It also includes specifying “unknown” when not sure.

また、「ブロック領域識別手段」は、ブロック領域毎に種類を識別するものであればよく、ブロック領域から複数のブロック特徴量を抽出する特徴量抽出手段と、抽出された複数の前記ブロック特徴量を２次元空間上に写像する写像手段と、２次元空間上の座標毎に種類を定義した種類頻度分布マップを有し、写像された２次元空間上の座標が種類頻度分布マップ上で示す種類をブロック領域の種類として出力する種類出力手段とを有するようにしてもよい。 In addition, the “block area identifying unit” may be any unit that identifies a type for each block area, and a feature amount extracting unit that extracts a plurality of block feature amounts from the block region, and a plurality of the extracted block feature amounts. A mapping means for mapping an image on a two-dimensional space, a type frequency distribution map in which a type is defined for each coordinate in the two-dimensional space, and a type in which the mapped coordinates in the two-dimensional space are indicated on the type frequency distribution map May be provided as a type of block area.

「２次元空間」は、学習機能を有する複数のニューロンをマトリックス状に配置した自己組織化マップであってもよい。 The “two-dimensional space” may be a self-organizing map in which a plurality of neurons having a learning function are arranged in a matrix.

また、「種類出力手段」は、識別したブロック領域の種類に関する情報を出力するものであればよく、識別した１つの種類を出力するものでもよいし、自己組織化マップの座標毎に種類の頻度値を種類の指標として定めた種類頻度分布マップを種類毎に有し、写像手段により検出された座標が各種類頻度分布マップ上で示す複数の頻度値をベクトル成分とした種類ベクトルを出力するものであってもよい。 The “type output unit” may be any unit that outputs information regarding the type of the identified block area, and may output one identified type, and the frequency of the type for each coordinate of the self-organizing map. Each type has a type frequency distribution map that defines the value as a type index, and outputs a type vector whose vector component is a plurality of frequency values whose coordinates detected by the mapping means are indicated on each type frequency distribution map It may be.

なお、「種類出力手段」は、種類ベクトルのベクトル成分のうち、最も大きい最大ベクトル成分となる種類を出力するものであってもよい。 The “type output unit” may output a type that is the largest maximum vector component among the vector components of the type vector.

また、「種類出力手段」は、種類ベクトルのうちベクトル成分の大きさが最大となる最大ベクトル成分が所定の最大成分しきい値よりも小さいときには、種類が不明である旨の出力を行うようにしてもよい。 Further, the “type output means” outputs that the type is unknown when the maximum vector component having the maximum vector component size among the type vectors is smaller than a predetermined maximum component threshold value. May be.

さらに、「特徴量抽出手段」は、画像の特徴を示す複数の特徴量を抽出するものであればよく、ブロック領域の色成分と明度成分と像的特徴成分をブロック特徴量として抽出するものであってもよいし、たとえば画像の各画素に割り当てられた成分信号値の１方向に沿った変化の規則性の程度を示す相関特徴量を抽出する相関特徴量抽出手段の他に、画像のエッジの特徴を示すエッジ特徴量を抽出するエッジ特徴量抽出手段や画像の色の特徴を示す色特徴量を抽出する色特徴量抽出手段を含むものであってもよい。 Further, the “feature amount extraction means” may be any means that can extract a plurality of feature amounts indicating image features, and extracts the color component, brightness component, and image feature component of the block area as block feature amounts. For example, in addition to the correlation feature quantity extraction means for extracting the correlation feature quantity indicating the degree of regularity of change along one direction of the component signal value assigned to each pixel of the image, the edge of the image The image processing apparatus may include an edge feature amount extracting unit that extracts an edge feature amount indicating a feature of the image, and a color feature amount extracting unit that extracts a color feature amount indicating a color feature of the image.

なお、「相関特徴量抽出手段」は、たとえば画像の縦方向に沿った相関特徴量、画像の横方向に沿った相関特徴量、もしくは画像の斜め方向に沿った相関特徴量を抽出する等の画像の少なくとも１方向の相関特徴量を抽出するものであればよい。 The “correlation feature extraction means” extracts, for example, a correlation feature along the vertical direction of the image, a correlation feature along the horizontal direction of the image, or a correlation feature along the diagonal direction of the image. What is necessary is just to extract the correlation feature quantity of at least one direction of the image.

さらに、「相関特徴量抽出手段」は、画像において同一方向に形成された２つの画素ラインを構成する複数の画素の成分信号値から、２つの画素ラインの相関関係を示す相関値を出力する所定の相互相関関数を有し、２つの画素ラインのいずれか一方を１画素ずつ画素ラインの形成方向にずらしながら画素の成分信号値を相互相関関数に入力することにより複数の相関値を取得し、取得した複数の相関値から最も大きい最大相関値を算出するものであり、画像の同一方向に形成された画素ラインのすべての組み合わせについて最大相関値を算出し、算出されたすべての最大相関値の平均値および標準偏差を相関特徴量として抽出するものであってもよい。 Further, the “correlation feature amount extraction unit” outputs a correlation value indicating a correlation between two pixel lines from component signal values of a plurality of pixels constituting two pixel lines formed in the same direction in the image. A plurality of correlation values are obtained by inputting the component signal value of the pixel into the cross-correlation function while shifting one of the two pixel lines in the pixel line forming direction one pixel at a time. The largest maximum correlation value is calculated from a plurality of acquired correlation values, the maximum correlation value is calculated for all combinations of pixel lines formed in the same direction of the image, and all the calculated maximum correlation values are calculated. The average value and the standard deviation may be extracted as the correlation feature amount.

また、「ブロック領域生成手段」は、たとえば前記画像をメッシュ状に区切った複数の第１ブロック領域と、複数の第１ブロック領域とメッシュ状に区切る位相をずらした第２ブロック領域とを生成するものや、オブジェクト領域内に設定画素数からなる切取枠を走査させて、切取枠により囲まれた画像を前記ブロック領域として生成するもののような、設定画素数からなるブロック領域を生成するものであればよい。 Further, the “block region generating means” generates, for example, a plurality of first block regions obtained by dividing the image into a mesh shape and a plurality of first block regions and a second block region having a phase shifted from each other in a mesh shape. Or a block area having a set number of pixels, such as an object area that scans a cut frame having a set number of pixels and generating an image surrounded by the cut frame as the block area. That's fine.

さらに、「ブロック領域生成手段」は、画像から解像度の異なる複数の解像度変換画像を生成する機能を有し、生成した複数の解像度変換画像からそれぞれブロック領域を生成するものであってもよい。 Furthermore, the “block area generation unit” may have a function of generating a plurality of resolution conversion images having different resolutions from an image, and may generate a block area from each of the generated resolution conversion images.

本発明のオブジェクト識別装置は、画像に含まれるオブジェクトの種類を識別するオブジェクト識別装置において、前記画像を前記オブジェクト毎に領域分割して複数のオブジェクト領域を生成するオブジェクト領域生成手段と、該オブジェクト領域生成手段により生成された前記オブジェクト領域から複数のオブジェクト特徴量を抽出する特徴量抽出手段と、該特徴量抽出手段により抽出されたオブジェクト特徴量を用いて、前記オブジェクト領域の種類を識別するオブジェクト識別手段とを有することを特徴とするものである。 The object identification device of the present invention is an object identification device for identifying the type of an object included in an image, an object region generation means for generating a plurality of object regions by dividing the image into regions for each object, and the object region Feature quantity extraction means for extracting a plurality of object feature quantities from the object area generated by the generation means, and object identification for identifying the type of the object area using the object feature quantities extracted by the feature quantity extraction means Means.

さらに、オブジェクト識別装置は、オブジェクト領域の外接矩形画像を規格化した規格化オブジェクト領域を生成する規格化手段を備えるものであってもよい。 Furthermore, the object identification device may include a normalizing unit that generates a standardized object area obtained by standardizing a circumscribed rectangular image of the object area.

なお、「特徴量抽出手段」は、規格化オブジェクト領域から特徴量を抽出する機能を有するものであってもよい。 Note that the “feature amount extraction means” may have a function of extracting a feature amount from the standardized object region.

本発明のオブジェクト識別方法および装置ならびにプログラムによれば、オブジェクト領域の種類の識別にブロック領域を使用することにより、各画素毎に種類を識別する場合に比べて、像構造的特徴をオブジェクト領域の種類の判断に加えることができるため、オブジェクトの種類を正確に識別することができる。 According to the object identification method, apparatus, and program of the present invention, by using the block area for identifying the type of the object area, the image structural features are compared with those in the object area as compared with the case of identifying the type for each pixel. Since it can be added to the type determination, the type of the object can be accurately identified.

また、各ブロック領域毎にそれぞれ種類を識別し、ブロック領域の種類を各オブジェクト領域毎に集計してオブジェクト領域の種類を識別することにより、オブジェクト領域の一部のブロック領域に本来の種類に識別されなかったものがあったとしても、その誤った認識を吸収してオブジェクトの種類を正確かつ自動的に識別することができる。 In addition, by identifying the type for each block area, the block area type is aggregated for each object area and the object area type is identified to identify the original type for some block areas of the object area Even if there is something that has not been done, the erroneous recognition can be absorbed and the type of object can be accurately and automatically identified.

なお、ブロック領域識別手段が、ブロック領域から複数のブロック特徴量を抽出する特徴量抽出手段と、抽出された複数の特徴量を２次元空間上に写像する写像手段と、２次元空間上の位置毎に種類を定義した種類頻度分布マップを有し、種類頻度分布マップを用いて複数の特徴量が写像された２次元空間上の位置からブロック領域の種類を出力する種類出力手段とを有する構成にすれば、ブロック領域の種類の識別を精度よく、かつ効率的に行うことができる。 The block area identifying means extracts a feature quantity extracting means for extracting a plurality of block feature quantities from the block area, a mapping means for mapping the extracted feature quantities on the two-dimensional space, and a position on the two-dimensional space. A type output unit that has a type frequency distribution map in which a type is defined for each type, and outputs a type of block area from a position in a two-dimensional space where a plurality of feature amounts are mapped using the type frequency distribution map By doing so, the type of the block area can be identified accurately and efficiently.

また、特徴量抽出手段が、ブロック領域の色成分と明度成分と像的特徴成分をブロック特徴量として抽出するようにすれば、ブロック領域の種類の識別をより正確に行うことができる。 Further, if the feature quantity extracting means extracts the color component, brightness component, and image feature component of the block area as the block feature quantity, the type of the block area can be identified more accurately.

さらに、種類出力手段が、自己組織化マップの座標毎に種類の頻度値を種類の指標として定めた種類頻度分布マップを種類毎に有し、種類出力手段が、写像手段により検出された座標が各種類頻度分布マップ上で示す複数の頻度値をベクトル成分とした種類ベクトルを出力するようにすれば、識別された１つの種類を出力するのではなく、ブロック領域の種類として可能性のある複数の種類の中からブロック領域の種類を識別できるようになるため、種類の識別精度を向上させることができる。 Further, the type output means has for each type a type frequency distribution map in which the type frequency value is determined as a type index for each coordinate of the self-organizing map, and the type output means has coordinates detected by the mapping means. If a type vector having a plurality of frequency values shown on each type frequency distribution map as a vector component is output, a plurality of possible types of block areas may be output instead of outputting one identified type. Since the type of the block area can be identified from among the types, the type identification accuracy can be improved.

また、種類出力手段が、種類ベクトルの成分のうち、最も大きい最大ベクトル成分となる種類をブロック領域の種類であると識別すれば、複数の種類の中から確率の高い種類をブロック領域の種類することができるため、識別精度を向上させることができる。 Further, if the type output means identifies the type that is the largest maximum vector component among the types vector components as the type of the block region, the type having a high probability among the plurality of types is selected as the type of the block region. Therefore, the identification accuracy can be improved.

さらに、種類出力手段が、最大ベクトル成分が所定の最大成分しきい値よりも小さいときには、画像の種類は不明である旨の出力を行うと、最大成分が低い種類の識別の信頼度が低いものは、種類の識別を行わずに不明とすることができるため、種類識別の信頼性を高めることができる。 Furthermore, when the type output means outputs that the type of the image is unknown when the maximum vector component is smaller than the predetermined maximum component threshold value, the identification reliability of the type having the low maximum component is low. Since it can be made unknown without identifying the type, the reliability of the type identification can be improved.

さらに、ブロック領域生成手段が、画像をメッシュ状に区切った複数の第１ブロック領域と、複数の第１ブロック領域とメッシュ状に区切る位相をずらした第２ブロック領域とを生成するようにすれば、オブジェクト領域の種類を識別するのに用いられるブロック領域の数を増やすことができるため、ブロック領域の種類の識別からオブジェクト領域の種類の識別を行う際の精度を向上させることができる。 Further, the block area generation means generates a plurality of first block areas obtained by dividing the image into a mesh shape, and a second block area having a phase shifted from the plurality of first block areas and the mesh shape. Since the number of block areas used to identify the type of object area can be increased, the accuracy in identifying the type of object area from the identification of the type of block area can be improved.

また、ブロック領域生成手段が、オブジェクト領域内に設定画素数からなる切取枠を走査させて、切取枠により囲まれた画像をブロック領域として生成するようにすると、オブジェクト領域の種類を識別するのに用いられるブロック領域の数を増やすことができるため、ブロック領域の種類の識別からオブジェクト領域の種類の識別を行う際の精度を向上させることができる。 In addition, when the block area generation unit scans a cut frame having a set number of pixels in the object area and generates an image surrounded by the cut frame as a block area, the type of the object area is identified. Since the number of block areas to be used can be increased, it is possible to improve accuracy when identifying the type of object area from identifying the type of block area.

さらに、ブロック領域生成手段が、画像から解像度の異なる複数の解像度変換画像を生成する機能を有し、生成した複数の解像度変換画像からそれぞれブロック領域を生成するようにすれば、被写体との距離によりオブジェクトの写り方が画像によって違う場合であっても、精度よくオブジェクトの種類を識別することができる。 Furthermore, if the block area generation unit has a function of generating a plurality of resolution conversion images having different resolutions from the image, and each block area is generated from the generated plurality of resolution conversion images, the block area generation unit depends on the distance from the subject. Even when the way the object is captured differs depending on the image, the type of the object can be accurately identified.

また、特徴量抽出手段が、画像の各画素に割り当てられた成分信号値の１方向に沿った変化の規則性の程度を示す相関特徴量を抽出する相関特徴量抽出手段を含む構成にすれば、相関特徴量により人工物に多く見られる規則的なパターンを有する画像と、自然物に多く見られるランダムなパターンを有する画像とを区別する指標となる特徴量を抽出することができるため、適切な種類の識別を行うことができる。 In addition, if the feature amount extraction unit includes a correlation feature amount extraction unit that extracts a correlation feature amount indicating the degree of regularity of change along one direction of the component signal value assigned to each pixel of the image. Since the feature quantity can be extracted as an index for distinguishing between an image having a regular pattern often found in artifacts and an image having a random pattern often found in natural objects, the correlation feature quantity is appropriate. Type identification can be performed.

さらに、相関特徴量抽出手段が、画像の縦方向に沿った相関特徴量と、画像の横方向に沿った相関特徴量とを抽出するようにすれば、縦方向および横方向に向かって規則的なパターンが形成されたものと、縦方向もしくは横方向のいずれか一方に向かって規則的なパターンが形成されたものとを区別することができる。 Further, if the correlation feature quantity extraction means extracts the correlation feature quantity along the vertical direction of the image and the correlation feature quantity along the horizontal direction of the image, the correlation feature quantity extraction means regularly in the vertical direction and the horizontal direction. Can be distinguished from those in which a regular pattern is formed and those in which a regular pattern is formed in either the vertical direction or the horizontal direction.

また、相関特徴量抽出手段が、２つの画素ラインのいずれか一方を１画素ずつ画素ラインの形成方向にずらしながら画素の成分信号値を所定の相互相関関数に入力することにより算出される複数の相関値のうち最も大きい最大相関値を用いて相関特徴量を算出するようにすれば、画像の縦方向もしくは横方向に向かって形成された規則的なパターンのみならず、画像の斜め方向に向かって形成されている規則的なパターンについても相関特徴量として抽出することができるため、画像の縦方向、横方向および斜め方向に向かって形成される規則的なパターンを相関特徴量として抽出することができる。 Further, the correlation feature amount extraction means calculates a plurality of values calculated by inputting the component signal value of the pixel to a predetermined cross-correlation function while shifting one of the two pixel lines one pixel at a time in the pixel line formation direction. If the correlation feature value is calculated using the largest correlation value among the correlation values, not only the regular pattern formed in the vertical or horizontal direction of the image but also the diagonal direction of the image. Therefore, regular patterns formed in the vertical, horizontal, and diagonal directions of images can be extracted as correlation features. Can do.

さらに、エッジ特徴量抽出手段が、画像の縦方向および横方向のエッジ成分の平均値および標準偏差をそれぞれ算出するようにすれば、たとえば「水（波）」のように縦方向と横方向によってエッジが異なるものと、「植物（花畑等）」の縦方向と横方向とで比較的均質なエッジのものとがエッジ特徴量によって区別することができる。 Furthermore, if the edge feature quantity extraction means calculates the average value and standard deviation of the edge components in the vertical and horizontal directions of the image, respectively, for example, “water (wave)” depending on the vertical and horizontal directions. Different edges can be distinguished from those having relatively uniform edges in the vertical and horizontal directions of “plants (flower garden, etc.)” by the edge feature amount.

また、本発明のオブジェクト識別装置によれば、オブジェクト領域生成手段により生成されたオブジェクト領域から複数のオブジェクト特徴量を抽出し、オブジェクト特徴量を用いて、前記オブジェクト領域の種類を識別することにより、オブジェクト領域の形状が複雑な場合やオブジェクト領域が小さいときであっても、確実にオブジェクトの種類の識別を行うことができる。 Further, according to the object identification device of the present invention, by extracting a plurality of object feature amounts from the object region generated by the object region generation means, by using the object feature amount, identifying the type of the object region, Even when the shape of the object area is complicated or the object area is small, the type of the object can be reliably identified.

なお、オブジェクト領域の外接矩形画像を規格化した規格化オブジェクト領域を生成する規格化手段をさらに備え、特徴量抽出手段が、規格化手段により生成された規格化オブジェクト領域から特徴量を抽出するようにすれば、被写体との距離により、画像内での大きさの異なるオブジェクト領域について同一の大きさに規格化された規格化オブジェクト領域から特徴量が抽出させることになるため、自己組織化マップによる種類の識別の精度を向上させることができる。 The image processing apparatus further includes a normalization unit that generates a standardized object region obtained by normalizing a circumscribed rectangular image of the object region, and the feature amount extraction unit extracts the feature amount from the standardized object region generated by the normalization unit. In this case, the feature amount is extracted from the standardized object region that is standardized to the same size for the object regions having different sizes in the image depending on the distance from the subject. The accuracy of type identification can be improved.

以下、本発明のオブジェクト識別装置について図面を参照しながら説明していく。図１は本発明のオブジェクト識別装置の第１の実施の形態を示すブロック図である。図１のオブジェクト識別装置１は全体画像Ｐに含まれる各オブジェクト毎の種類を識別するものであって、ブロック領域生成手段１０、オブジェクト領域生成手段２０、ブロック領域識別手段３０、オブジェクト識別手段７０等を有する。 The object identification device of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram showing a first embodiment of an object identification device of the present invention. The object identification device 1 in FIG. 1 identifies the type of each object included in the entire image P, and includes a block area generation means 10, an object area generation means 20, a block area identification means 30, an object identification means 70, etc. Have

図１のブロック領域生成手段１０は、図２（ａ）に示すように、全体画像Ｐを設定画素数毎に分割したブロック領域ＢＲを生成する機能を有する。そして、ブロック領域生成手段１０は生成したブロック領域ＢＲをブロック領域識別手段３０に送る。たとえば設定画素数が３２画素×３２画素である場合、全体画像Ｐが３２×３２画素からなる複数のブロック領域ＢＲに分割されることになる。 As shown in FIG. 2A, the block area generating unit 10 in FIG. 1 has a function of generating a block area BR in which the entire image P is divided for each set number of pixels. Then, the block area generation unit 10 sends the generated block area BR to the block area identification unit 30. For example, when the number of set pixels is 32 pixels × 32 pixels, the entire image P is divided into a plurality of block regions BR composed of 32 × 32 pixels.

オブジェクト領域生成手段２０は、図２（ｂ）に示すように、全体画像Ｐを各オブジェクト毎に領域分割してオブジェクト領域ＯＲを生成する機能を有する。そしてオブジェクト領域生成手段２０は生成した各オブジェクト領域ＯＲをオブジェクト識別手段７０に送る。 As shown in FIG. 2B, the object area generation unit 20 has a function of generating an object area OR by dividing the entire image P into areas for each object. Then, the object area generation unit 20 sends the generated object areas OR to the object identification unit 70.

ブロック領域識別手段３０は生成された各ブロック領域ＢＲ毎に種類を識別する機能を有する。すなわち、ブロック領域識別手段３０は、画像内のオブジェクトが「山」、「海」、「花」、「空」等の種類であることを特定するようになっている。ブロック領域識別手段３０は識別した種類をオブジェクト識別手段７０に送るようになっている。 The block area identifying means 30 has a function of identifying the type for each generated block area BR. In other words, the block area identifying means 30 identifies that the object in the image is of a type such as “mountain”, “sea”, “flower”, “sky”. The block area identifying unit 30 sends the identified type to the object identifying unit 70.

オブジェクト識別手段７０は、送られたブロック領域ＢＲ毎の種類を用いて、分割されたオブジェクト領域ＯＲ毎に種類情報を付与して、オブジェクト領域ＯＲの種類を識別可能にする機能を有する。具体的には、オブジェクト識別手段７０は、オブジェクト領域ＯＲ内の各ブロック領域ＢＲの種類を集計する。そして、オブジェクト識別手段７０は、あるオブジェクト領域ＯＲにおいて集計されたブロック領域ＢＲの種類のうち、最も多いブロック領域ＢＲの最大種類情報をオブジェクトの種類と識別する。なお、オブジェクト識別手段７０は、複数のオブジェクト領域ＯＲにまたがっているブロック領域ＢＲは、カウントしないようになっている。すると、図２（ｃ）に示すように、各オブジェクト領域ＯＲに種類が付された状態になり、オブジェクト領域ＯＲが種類情報によって識別可能となる。 The object identification unit 70 has a function of identifying the type of the object area OR by giving type information to each divided object area OR using the type of each sent block area BR. Specifically, the object identification unit 70 totals the types of the block areas BR in the object area OR. Then, the object identifying means 70 identifies the largest type information of the block region BR that is the largest among the types of block regions BR counted in a certain object region OR as the type of object. The object identifying means 70 does not count the block area BR that extends over the plurality of object areas OR. Then, as shown in FIG. 2 (c), each object area OR is in a type, and the object area OR can be identified by the type information.

なお、図１のオブジェクト識別手段７０において、オブジェクトの種類を多数決により決定するようにしているが、集計された種類情報のうち最も多い最大種類情報の割合（最大種類ｍａｘの数／オブジェクトを構成する全ブロック領域数）が種類情報しきい値より小さい場合、オブジェクト識別手段７０がオブジェクトの種類情報として「不明」を出力する機能を有していてもよい。あるいは、最大種類情報の割合と２番目に多い種類情報の割合との差が小さい場合、オブジェクト識別手段７０がオブジェクトの種類として「不明」を出力するようにしてもよい。これは、オブジェクトの種類情報を誤って識別するよりも、「不明」と判断された方がユーザーにとって好ましい場合があるためである。 In the object identifying means 70 of FIG. 1, the type of object is determined by majority vote. However, the ratio of the largest type information among the aggregated type information (the number of maximum types max / object is configured. If the total number of block areas) is smaller than the type information threshold, the object identification unit 70 may have a function of outputting “unknown” as the type information of the object. Alternatively, when the difference between the ratio of the maximum type information and the ratio of the second largest type information is small, the object identification unit 70 may output “unknown” as the object type. This is because it may be preferable for the user to determine “unknown” rather than erroneously identifying the object type information.

オブジェクト領域生成手段２０は、画像を構成する各画素から複数の特徴量を抽出し、類似した画素特徴量毎に画素を分類する画像の特徴量分類手段１００と、画素の分類毎に領域分割して複数のクラスタリング領域を生成する領域分割手段１０１と、生成されたクラスタリング領域のうち最も画素数の少ない最小クラスタリング領域を抽出する最小クラスタ領域抽出手段１１２と、抽出された最小クラスタリング領域と隣接する隣接クラスタリング領域を抽出する統合領域判断手段１１３と、生成されたクラスタリング領域を統合してオブジェクト領域を抽出する領域統合手段１１０とを有する。 The object region generation unit 20 extracts a plurality of feature amounts from each pixel constituting the image, classifies the pixels for each similar pixel feature amount, and divides the region for each pixel classification. Area dividing means 101 for generating a plurality of clustering areas, a minimum cluster area extracting means 112 for extracting a minimum clustering area having the smallest number of pixels among the generated clustering areas, and an adjacent adjacent to the extracted minimum clustering area The integrated region determination unit 113 extracts clustering regions and the region integration unit 110 extracts object regions by integrating the generated clustering regions.

ここで、図４と図５は画像を各オブジェクト領域毎に分割する過程を示す模式図であり、図４を参照してオブジェクト領域生成手段２０の動作例について説明する。まず、図４（ａ）に示すように、類似した特徴を有する画素が並んだ画像があると仮定する。このとき、特徴量分類手段１００において、各画素から複数の特徴量が抽出されて、各特徴量を要素とした複数の特徴ベクトルが生成される。その後、図４（ｂ）に示すように、複数の特徴ベクトルが類似する特徴ベクトル毎に分類される（クラスタリング）。 Here, FIGS. 4 and 5 are schematic diagrams showing a process of dividing an image for each object area, and an operation example of the object area generating means 20 will be described with reference to FIG. First, as shown in FIG. 4A, it is assumed that there is an image in which pixels having similar characteristics are arranged. At this time, the feature quantity classifying unit 100 extracts a plurality of feature quantities from each pixel and generates a plurality of feature vectors having each feature quantity as an element. Thereafter, as shown in FIG. 4B, a plurality of feature vectors are classified into similar feature vectors (clustering).

その後、領域分割手段１０１により、特徴量分類手段１００によりクラスタリングされた結果が実際の画像に写像される。すると、図５（ａ）に示すように、類似した画素からなる複数のクラスタリング領域が形成されて、ラベルを付したラベル画像としてデータベース１１１に記憶される。 Thereafter, the result of clustering by the feature amount classifying unit 100 is mapped to an actual image by the region dividing unit 101. Then, as shown in FIG. 5A, a plurality of clustering regions composed of similar pixels are formed and stored in the database 111 as label images with labels.

次に、領域統合の一例について説明する。まず、最小クラスタ領域抽出手段１１２により、データベースに記憶されたクラスタリング領域の中から最も小さい最小クラスタリング領域が抽出される。また、統合領域判断手段１１３において抽出された最小クラスタリング領域と隣接する隣接クラスタリング領域が抽出する。 Next, an example of region integration will be described. First, the smallest cluster area extraction unit 112 extracts the smallest minimum clustering area from the clustering areas stored in the database. In addition, adjacent clustering regions adjacent to the minimum clustering region extracted by the integrated region determination unit 113 are extracted.

ここで、最小クラスタリング領域が所定の微小画素しきい値以下の画素数（たとえば全画素数の１／１００）の場合、領域統合手段１１０において、最小クラスタリング領域が境界画素数（周囲長）の最も多い隣接クラスタリング領域と統合される。具体的には、図５（ａ）のクラスタリング領域Ａが所定の微小画素しきい値以下の画素数を有する最小クラスタリング領域であるとする。クラスタリング領域Ａは、クラスタリング領域Ｃ、Ｄと隣接しているため、クラスタリング領域Ｃ、Ｄが隣接クラスタリング領域となる。 Here, when the minimum clustering area has a number of pixels equal to or smaller than a predetermined minute pixel threshold (for example, 1/100 of the total number of pixels), in the area integration unit 110, the minimum clustering area has the largest number of boundary pixels (peripheral length). It is integrated with many adjacent clustering regions. Specifically, it is assumed that the clustering area A in FIG. 5A is the minimum clustering area having the number of pixels equal to or smaller than a predetermined minute pixel threshold. Since the clustering area A is adjacent to the clustering areas C and D, the clustering areas C and D are adjacent clustering areas.

そこで、領域統合手段１１０において、最小クラスタリング領域Ａとクラスタリング領域Ｃ、Ｄとが接している隣接画素数がそれぞれ算出される。図５（ａ）においては隣接クラスタリング領域Ｄとの境界画素数の方が隣接クラスタリング領域Ｃとの境界画素数よりも多い。このためクラスタリング領域Ａは図５（ｂ）のようにクラスタリング領域Ｄと統合する。 Therefore, the region integration unit 110 calculates the number of adjacent pixels where the minimum clustering region A and the clustering regions C and D are in contact with each other. In FIG. 5A, the number of boundary pixels with the adjacent clustering region D is larger than the number of boundary pixels with the adjacent clustering region C. Therefore, the clustering area A is integrated with the clustering area D as shown in FIG.

一方、最小クラスタリング領域が所定の小画素しきい値以下の画素数（たとえば全画素数の１／１０）の場合、領域統合手段１１０において、最小クラスタリング領域が特徴空間での距離が近い隣接クラスタリング領域と統合される。具体的には、図５（ｂ）において、クラスタリング領域Ｂが所定の小画素しきい値以下の最小クラスタリング領域であるとする。すると、クラスタリング領域Ｂの隣接クラスタリング領域はクラスタリング領域Ｃ、Ｄである。そこで、たとえばテクスチャ情報を距離を基準とした場合、どちらのクラスタリング領域Ｃ、Ｄのテクスチャがクラスタリング領域Ｂのテクスチャに近いかが判断される。そして、図５（ｃ）のように、クラスタリング領域Ｂが特徴空間での最も近い距離であるクラスタリング領域Ｄと統合される。 On the other hand, when the minimum clustering area has a number of pixels equal to or smaller than a predetermined small pixel threshold (for example, 1/10 of the total number of pixels), in the area integration unit 110, the adjacent clustering area where the minimum clustering area is close in the feature space Integrated with. Specifically, in FIG. 5B, it is assumed that the clustering region B is a minimum clustering region that is equal to or smaller than a predetermined small pixel threshold value. Then, the adjacent clustering regions of the clustering region B are the clustering regions C and D. Therefore, for example, when the texture information is based on the distance, it is determined which of the clustering regions C and D is close to the texture of the clustering region B. Then, as shown in FIG. 5C, the clustering region B is integrated with the clustering region D which is the closest distance in the feature space.

領域統合手段１１０において、上述した作業がたとえば最小クラスタ領域抽出手段１１２により抽出される最小クラスタリング領域が所定の小画素しきい値よりも大きい画素数になるまで行われて、画像が各オブジェクト領域ＯＲ毎に領域分割される（図２（ｃ）参照）。 In the region integration unit 110, the above-described operation is performed until, for example, the minimum clustering region extracted by the minimum cluster region extraction unit 112 has a number of pixels larger than a predetermined small pixel threshold, and the image is displayed in each object region OR. Each area is divided (see FIG. 2C).

次に、図１を参照してブロック領域識別手段３０について説明する。ブロック領域識別手段３０は、特徴量抽出手段４０、写像手段５０、種類出力手段６０等を有する。特徴量抽出手段４０は、ブロック領域ＢＲから複数のブロック特徴量を抽出する機能を有する。写像手段５０は、たとえば自己組織化マップからなる２次元空間ＳＯＭを有し、複数のブロック特徴量（多次元特徴量）を二次元空間ＳＯＭ上に写像するものである。種類出力手段６０は、２次元空間ＳＯＭ上の位置毎に種類を定義した種類頻度分布マップＫＤＭを有する。そして、種類出力手段６０は写像手段５０により写像された２次元空間ＳＯＭ上の座標ＣＩから種類頻度分布マップＫＤＭを用いてブロック領域ＢＲの種類を出力するものである。以下にブロック領域識別手段３０の各構成について具体的に説明していく。 Next, the block area identifying means 30 will be described with reference to FIG. The block area identification unit 30 includes a feature amount extraction unit 40, a mapping unit 50, a type output unit 60, and the like. The feature amount extraction unit 40 has a function of extracting a plurality of block feature amounts from the block region BR. The mapping means 50 has, for example, a two-dimensional space SOM made up of a self-organizing map, and maps a plurality of block feature values (multidimensional feature values) onto the two-dimensional space SOM. The type output means 60 has a type frequency distribution map KDM that defines types for each position on the two-dimensional space SOM. The type output means 60 outputs the type of the block area BR from the coordinates CI on the two-dimensional space SOM mapped by the mapping means 50 using the type frequency distribution map KDM. Below, each structure of the block area identification means 30 is demonstrated concretely.

図６は特徴量抽出手段４０の一例を示すブロック図であり、図６を参照して特徴量抽出手段４０について説明する。特徴量抽出手段４０は、色成分、明度成分および像的特徴成分からなる１５個のブロック特徴量ＢＣＱを出力するものであって、Ｌａｂ変換手段４１、第１平均値算出手段４２、第１ウェーブレット変換手段４３、距離画像生成手段４６、第２ウェーブレット変換手段４７等を有する。 FIG. 6 is a block diagram illustrating an example of the feature quantity extraction unit 40. The feature quantity extraction unit 40 will be described with reference to FIG. The feature quantity extraction means 40 outputs 15 block feature quantities BCQ composed of color components, lightness components, and image feature components, and includes a Lab conversion means 41, a first average value calculation means 42, a first wavelet. A conversion unit 43, a distance image generation unit 46, a second wavelet conversion unit 47, and the like are included.

Ｌａｂ変換手段４１は、ＲＧＢ画像からなるブロック領域ＢＲをＬａｂ画像に変換する機能を有する。平均値算出手段４２は、Ｌａｂ変換されたブロック領域ＢＲのＬ成分、ａ成分およびｂ成分の平均値Ｌ−ａｖｅ、ａ−ａｖｅ、ｂ−ａｖｅをそれぞれ算出する機能を有する。そして、算出された平均値Ｌ−ａｖｅ、ａ−ａｖｅ、ｂ−ａｖｅが色成分を抽出したブロック特徴量ＢＣＱとなる。 The Lab conversion means 41 has a function of converting a block area BR formed of RGB images into a Lab image. The average value calculating means 42 has a function of calculating average values L-ave, a-ave, and b-ave of the L component, a component, and b component of the block region BR subjected to Lab conversion. The calculated average values L-ave, a-ave, and b-ave are the block feature values BCQ from which the color components are extracted.

第１ウェーブレット変換手段４３は、Ｌａｂ変換されたブロック領域ＢＲをウェーブレット変換して明度成分の高周波成分Ｌ−ＬＨ、Ｌ−ＨＬ、Ｌ−ＨＨを算出するものである。また第１ウェーブレット変換手段４３に平均値算出手段４４と最大値算出手段４５とが接続されている。 The first wavelet transform unit 43 performs wavelet transform on the Lab-converted block region BR to calculate high-frequency components L-LH, L-HL, and L-HH of brightness components. In addition, an average value calculating means 44 and a maximum value calculating means 45 are connected to the first wavelet transform means 43.

平均値算出手段４４は、第１ウェーブレット変換手段４３により算出された高周波成分Ｌ−ＬＨ、Ｌ−ＨＬ、Ｌ−ＨＨの平均値Ｌ−ＬＨ−ａｖｅ、Ｌ−ＨＬ−ａｖｅ、Ｌ−ＨＨ−ａｖｅを算出するものである。そして、算出された平均値Ｌ−ＬＨ−ａｖｅ、Ｌ−ＨＬ−ａｖｅ、Ｌ−ＨＨ−ａｖｅが明度成分を抽出したブロック特徴量ＢＣＱとなる。 The average value calculating means 44 is the average values L-LH-ave, L-HL-ave, L-HH-ave of the high frequency components L-LH, L-HL, L-HH calculated by the first wavelet transform means 43. Is calculated. The calculated average values L-LH-ave, L-HL-ave, and L-HH-ave are the block feature values BCQ from which the brightness components are extracted.

また、最大値算出手段４５は、第１ウェーブレット変換手段４３により算出された高周波成分Ｌ−ＬＨ、Ｌ−ＨＬ、Ｌ−ＨＨの頻度分布において大きい方から５％の値を算出するものである。この最大値Ｌ−ＬＨ−ｍａｘ、Ｌ−ＨＬ−ｍａｘ、Ｌ−ＨＨ−ｍａｘが明度成分を抽出したブロック特徴量ＢＣＱとなる。 The maximum value calculating means 45 calculates a value of 5% from the largest in the frequency distribution of the high frequency components L-LH, L-HL, and L-HH calculated by the first wavelet transform means 43. The maximum values L-LH-max, L-HL-max, and L-HH-max become the block feature value BCQ from which the brightness component is extracted.

このように、Ｌ成分のブロック特徴量ＢＣＱとして平均値と最大値とを利用することにより、平均的に一定強度の高周波成分が分布してブロック領域ＢＲと、一部に強い高周波成分があるブロック領域ＢＲとを区別することができるようになり、ブロック領域ＢＲの種類の識別を正確に行うことができるようになる。 In this way, by using the average value and the maximum value as the block feature value BCQ of the L component, a high frequency component having a constant intensity is distributed on average, and the block region BR and a block having a strong high frequency component in part. The region BR can be distinguished from the region BR, and the type of the block region BR can be accurately identified.

距離画像生成手段４６は、Ｌａｂ変換手段４１によりＬａｂ変換されたブロック領域ＢＲから距離画像Ｄを生成する機能を有する。ここで、距離画像Ｄは、一般的な距離画像とは異なり、図７に示すように、Ｌａｂ変換した３変数のブロック領域ＢＲと、ウェーブレット変換した際に生成したブロック領域ＢＲの低周波成分からなるボケ画像とのユークリッド距離を画像化したものである。すなわち、Ｌａｂ空間における３次元距離画像は、均等色空間における信号変動の様子を１枚の画像にしたものであり、人が知覚する変動を表現したものとして説明することができる。３次元空間での変動を扱うことにより、明度画像から得られない像構造的特徴を引き出すことができるため、種類の識別をより正確に行うことができる。 The distance image generation unit 46 has a function of generating a distance image D from the block region BR subjected to Lab conversion by the Lab conversion unit 41. Here, the distance image D is different from a general distance image, as shown in FIG. 7, from the low-frequency components of the three-variable block region BR subjected to Lab transform and the block region BR generated upon wavelet transform. This is an image of the Euclidean distance from the blurred image. That is, the three-dimensional distance image in the Lab space is an image in which the signal variation in the uniform color space is made into one image, and can be described as representing the variation perceived by a person. By handling fluctuations in the three-dimensional space, image structural features that cannot be obtained from the brightness image can be extracted, so that the types can be identified more accurately.

つまり、各画素毎に抽出した画素特徴量に基づいて種類を識別した場合、像構造による種類の識別を行うことができないため、たとえば「空」と「海」のように像構造は異なるが明度や色が類似した種類の識別を精度よく行うことができない。一方、ブロック領域ＢＲ毎に距離画像Ｄを生成した像構造により種類の抽出を行うことにより、種類の識別をより正確に行うことができる。 In other words, when the type is identified based on the pixel feature value extracted for each pixel, the type cannot be identified by the image structure. For example, the image structure is different such as “sky” and “sea”, but the brightness is different. It is not possible to accurately identify types of similar colors. On the other hand, the type can be identified more accurately by extracting the type using the image structure in which the distance image D is generated for each block region BR.

第２ウェーブレット変換手段４７は生成された距離画像Ｄをウェーブレット変換して、その高周波成分Ｄ−ＬＨ、Ｄ−ＨＬ、Ｄ−ＨＨを出力する機能を有する。第２ウェーブレット変換手段４７に平均値算出手段４８と最大値算出手段４９とが接続されている。 The second wavelet transform unit 47 has a function of performing wavelet transform on the generated distance image D and outputting the high frequency components D-LH, D-HL, and D-HH. An average value calculating means 48 and a maximum value calculating means 49 are connected to the second wavelet transform means 47.

平均値算出手段４８は、第２ウェーブレット変換手段４７により算出された高周波成分Ｄ−ＬＨ、Ｄ−ＨＬ、Ｄ−ＨＨの平均値Ｄ−ＬＨ−ａｖｅ、Ｄ−ＨＬ−ａｖｅ、Ｄ−ＨＨ−ａｖｅを算出するものである。そして、算出された平均値Ｄ−ＬＨ−ａｖｅ、Ｄ−ＨＬ−ａｖｅ、Ｄ−ＨＨ−ａｖｅが像的特徴成分を抽出したブロック特徴量ＢＣＱとなる。 The average value calculating means 48 is the average values D-LH-ave, D-HL-ave, D-HH-ave of the high frequency components D-LH, D-HL, D-HH calculated by the second wavelet transform means 47. Is calculated. The calculated average values D-LH-ave, D-HL-ave, and D-HH-ave are the block feature values BCQ from which the image feature components are extracted.

また、最大値算出手段４９は、第１ウェーブレット変換手段４３により算出された高周波成分Ｄ−ＬＨ、Ｄ−ＨＬ、Ｄ−ＨＨの頻度分布において大きい方から５％の値を算出するものである。この最大値Ｄ−ＬＨ−ｍａｘ、Ｄ−ＨＬ−ｍａｘ、Ｄ−ＨＨ−ｍａｘが像的特徴成分を抽出したブロック特徴量ＢＣＱとなる。 The maximum value calculation means 49 calculates a value of 5% from the largest in the frequency distribution of the high frequency components D-LH, D-HL, and D-HH calculated by the first wavelet transform means 43. The maximum values D-LH-max, D-HL-max, and D-HH-max become block feature values BCQ from which image feature components are extracted.

このように、Ｄ（距離）成分のブロック特徴量ＢＣＱとして平均値と最大値とを利用することにより、平均的に一定強度の高周波成分が分布してブロック領域ＢＲと、一部に強い高周波成分があるブロック領域ＢＲとを区別することができるようになり、ブロック領域ＢＲの種類の判別を正確に行うことができるようになる。 In this way, by using the average value and the maximum value as the block feature value BCQ of the D (distance) component, the high-frequency component having a constant intensity is distributed on the average, and the block region BR and the high-frequency component strong in part. This makes it possible to distinguish a certain block area BR from a certain block area BR, and to accurately determine the type of the block area BR.

次に、図８は写像手段５０および種類出力手段６０の一例を示す模式図であり、図１と図８を参照して写像手段５０および種類出力手段６０について説明する。この写像手段５０および種類出力手段６０には自己組織化マップを用いた修正対向伝搬ネットワーク（参考文献：徳高、岸田、藤村「自己組織化マップの応用−多次元情報の２次元可視化」海文堂、１９９９）が用いられている。 Next, FIG. 8 is a schematic diagram showing an example of the mapping unit 50 and the type output unit 60. The mapping unit 50 and the type output unit 60 will be described with reference to FIGS. The mapping means 50 and the type output means 60 include a modified counter propagation network using a self-organizing map (reference: Tokutaka, Kishida, Fujimura, “Application of Self-Organizing Map-Two-dimensional Visualization of Multidimensional Information”, Kaibundo, 1999. ) Is used.

写像手段５０は、複数のニューロンＮをマトリックス状に配置した自己組織化マップからなる２次元空間ＳＯＭを有し、複数の特徴量（多次元特徴量）を２次元空間ＳＯＭ上に写像する機能を有する。各ニューロンＮはそれぞれブロック特徴量ＢＣＱと同一次元のベクトル座標を有する。本実施の形態においてはブロック特徴量ＢＣＱは１５個のブロック特徴量ＢＣＱからなっているため、各ニューロンは１５次元の結合荷重ベクトルからなっていることになる。 The mapping means 50 has a two-dimensional space SOM composed of a self-organizing map in which a plurality of neurons N are arranged in a matrix, and has a function of mapping a plurality of feature quantities (multidimensional feature quantities) onto the two-dimensional space SOM. Have. Each neuron N has a vector coordinate in the same dimension as the block feature BCQ. In the present embodiment, since the block feature value BCQ is composed of 15 block feature values BCQ, each neuron is composed of a 15-dimensional connection weight vector.

そして、写像手段５０は、１つのブロック領域ＢＲから抽出された１５個のブロック特徴量ＢＣＱを自己組織化マップＳＯＭ上のニューロンＮの中から、最も近似した（たとえば最もユークリッド距離等の近い）ニューロンＮｉ（発火要素）を選択する。これにより、複数のブロック特徴量ＢＣＱからなる多次元空間から２次元空間ＳＯＭ上に写像されたことになる。そして、写像手段５０は選択したニューロンＮｉの座標ＣＩを種類出力手段６０に送るようになっている。 The mapping unit 50 then approximates the 15 block feature values BCQ extracted from one block region BR among the neurons N on the self-organizing map SOM (for example, the neuron having the closest Euclidean distance or the like). Select Ni (ignition element). As a result, a multidimensional space composed of a plurality of block feature values BCQ is mapped onto the two-dimensional space SOM. Then, the mapping means 50 sends the coordinates CI of the selected neuron Ni to the type output means 60.

種類出力手段６０は、２次元空間ＳＯＭと同一の座標系を有する複数の種類頻度分布マップＫＤＭを有しており、写像手段５０により写像された２次元空間ＳＯＭ上の座標ＣＩから、種類頻度分布マップＫＤＭ上でその座標ＣＩの示す部位が示す種類を出力する機能を有する。この種類頻度分布マップＫＤＭは、図９に示すように、各種類毎に２次元空間上に様々な種類の分布が形成されており、各種類毎にそれぞれ種類頻度分布マップＫＤＭが用意されている。たとえば、図９（ａ）は種類が「空」の種類頻度分布マップＫＤＭ、図９（ｂ）は種類が「建物」の種類頻度分布マップＫＤＭ、図９（ｃ）は種類がＫＩの「木」の種類頻度分布マップＫＤＭ、図９（ｄ）は種類が「海」の種類頻度分布マップＫＤＭをそれぞれ示している。図９において、白の範囲が０．８〜１．０の頻度値（信頼度）、グレーの範囲が０．２〜０．８の頻度値（信頼度）、黒の範囲が０．０〜０．２の頻度値（信頼度）を示している。 The kind output means 60 has a plurality of kind frequency distribution maps KDM having the same coordinate system as the two-dimensional space SOM, and the kind frequency distribution is calculated from the coordinates CI on the two-dimensional space SOM mapped by the mapping means 50. The map KDM has a function of outputting the type indicated by the part indicated by the coordinates CI. As shown in FIG. 9, in this type frequency distribution map KDM, various types of distributions are formed in the two-dimensional space for each type, and a type frequency distribution map KDM is prepared for each type. . For example, FIG. 9A shows the type frequency distribution map KDM with the type “Empty”, FIG. 9B shows the type frequency distribution map KDM with the type “Building”, and FIG. 9C shows the “Tree” with the type KI. "Type frequency distribution map KDM", and FIG. 9D shows the type frequency distribution map KDM of the type "sea". In FIG. 9, the white range is a frequency value (reliability) of 0.8 to 1.0, the gray range is a frequency value (reliability) of 0.2 to 0.8, and the black range is 0.0 to 0.0. A frequency value (reliability) of 0.2 is shown.

なお、各種類毎に種類頻度分布マップＫＤＭが用意されている場合について例示しているが、１枚の種類頻度分布マップＫＤＭに複数の種類の分布が形成されていてもよい。 In addition, although the case where the type frequency distribution map KDM is prepared for each type is illustrated, a plurality of types of distributions may be formed in one type frequency distribution map KDM.

ここで、上述した種類を識別する際（認識モード）に使用される自己組織化マップＳＯＭおよび種類頻度分布マップＫＤＭは、予め学習されたものが使用される。すなわち、２次元空間ＳＯＭおよび種類頻度分布マップＫＤＭは学習機能を有しており、予め種類が判っているブロック領域ＢＲから抽出されたブロック特徴量ＢＣＱからなる学習用入力データを用いて各ニューロンＮおよび種類頻度分布マップＫＤＭが学習される。 Here, as the above-described self-organizing map SOM and type frequency distribution map KDM used for identifying types (recognition mode), those learned in advance are used. That is, the two-dimensional space SOM and the type frequency distribution map KDM have a learning function, and each neuron N uses the learning input data including the block feature amount BCQ extracted from the block region BR whose type is known in advance. And the type frequency distribution map KDM is learned.

具体的には、まず自己組織化マップＳＯＭの学習について説明する。自己組織化マップＳＯＭのニューロンは、初期状態においてランダムな結合荷重ベクトルを有している。そして、予め種類のわかっている学習用入力データが写像手段５０に入力される。すると、写像手段５０により学習用入力データと最も近似したニューロンＮｉ（発火要素）が選択される。同時に、選択されたニューロンＮｉ（発火要素）を取り囲むたとえば３×３個のニューロンが選択される。そして、ニューロンＮｉ（発火要素）およびその近傍にあるニューロンＮの結合荷重ベクトルが学習用入力データに近づく方向に更新されて、自己組織化マップＳＯＭのニューロンＮが学習される。 Specifically, learning of the self-organizing map SOM will be described first. The neurons of the self-organizing map SOM have random connection weight vectors in the initial state. Then, learning input data whose type is known in advance is input to the mapping means 50. Then, the neuron Ni (firing element) most similar to the learning input data is selected by the mapping means 50. At the same time, for example 3 × 3 neurons surrounding the selected neuron Ni (firing element) are selected. Then, the connection weight vector of the neuron Ni (firing element) and the neuron N in the vicinity thereof is updated in a direction approaching the learning input data, and the neuron N of the self-organizing map SOM is learned.

この作業が複数の学習用入力データを用いて行われる。さらに、この学習用入力データが複数回繰り返し自己組織化マップＳＯＭに入力される。ここで、複数の学習用入力データの入力が繰り返されるに連れて、結合荷重ベクトルが更新されるニューロンＮの近傍領域の範囲が狭くなっていき、最後には選択されたニューロンＮｉ（発火要素）のみの結合荷重ベクトルが更新される。 This operation is performed using a plurality of learning input data. Further, the learning input data is repeatedly input into the self-organizing map SOM a plurality of times. Here, as the input of a plurality of learning input data is repeated, the range of the neighborhood region of the neuron N in which the connection weight vector is updated becomes narrower, and finally the selected neuron Ni (firing element) is selected. Only the combined load vector is updated.

次に、種類頻度分布マップＫＤＭの学習について説明する。種類頻度分布マップＫＤＭにおいてすべての座標の初期値は０になっている。上述したように、自己組織化マップＳＯＭに学習用入力データが写像された際に、自己組織化マップＳＯＭ上の座標ＣＩが出力される。すると、学習用入力データの種類に対応する種類頻度分布マップＫＤＭ内の座標ＣＩに当たる部位およびそれを取り囲む領域（たとえば３×３個）に正の整数値（たとえば「１」）が加算される。 Next, learning of the type frequency distribution map KDM will be described. In the type frequency distribution map KDM, initial values of all coordinates are zero. As described above, when the learning input data is mapped to the self-organizing map SOM, the coordinates CI on the self-organizing map SOM are output. Then, a positive integer value (for example, “1”) is added to the portion corresponding to the coordinate CI in the type frequency distribution map KDM corresponding to the type of the input data for learning and the region (for example, 3 × 3) surrounding it.

そして、学習用入力データが入力されて行くにつれて、種類頻度分布マップＫＤＭ上の特定の領域ついて学習用入力データの入力により数値が加算されて大きくなっていく。つまり、同じ種類のブロック領域ＢＲであれば、ブロック特徴量ＢＣＱが類似していることになる。ブロック特徴量ＢＣＱが類似していれば、自己組織化マップＳＯＭ上の近くの座標に写像されることが多くなるため、種類頻度分布マップＫＤＭにおいても特定の座標の数値が大きくなっていく。 Then, as learning input data is input, numerical values are added to a specific area on the type frequency distribution map KDM to increase as learning input data is input. That is, if the same type of block region BR, the block feature amount BCQ is similar. If the block feature values BCQ are similar, they are often mapped to nearby coordinates on the self-organizing map SOM, so that the numerical values of specific coordinates also increase in the type frequency distribution map KDM.

最後に、種類頻度分布マップＫＤＭの各座標にある数値を全入力学習データ数×学習回数で割ると、各座標に０．０から１．０までの確率が入力された種類頻度分布マップＫＤＭが生成される。この確率が大きければ大きいほど、その種類である確率が大きくなることを意味する。図９の種類頻度分布マップＫＤＭにおいては、白の範囲が０．８〜１．０の信頼度（確率）、グレーの範囲が０．２〜０．８の信頼度（確率）、黒の範囲が０．０〜０．２の信頼度（確率）を示している。このように種類頻度分布マップＫＤＭがたとえば「空」、「建物」、「木」、「海」等の種類毎にそれぞれ形成されていく。 Finally, when the numerical value at each coordinate of the type frequency distribution map KDM is divided by the total number of input learning data times the number of learnings, the type frequency distribution map KDM in which a probability of 0.0 to 1.0 is input to each coordinate is obtained. Generated. This means that the greater the probability, the greater the probability of that type. In the type frequency distribution map KDM of FIG. 9, the reliability (probability) in the white range is 0.8 to 1.0, the reliability (probability) in the gray range is 0.2 to 0.8, and the black range. Indicates a reliability (probability) of 0.0 to 0.2. In this way, the type frequency distribution map KDM is formed for each type such as “sky”, “building”, “tree”, “sea”, and the like.

そして、実際のブロック領域ＢＲについて種類の識別をする際（認識モード）では、種類出力手段６０は、複数の種類頻度分布マップＫＤＭからそれぞれ座標ＣＩの部位が有する信頼度を抽出する。具体的には、写像手段５０から座標ＣＩが送られてきた場合、たとえば「空」、「建物」、「木」、「海」等のそれぞれの種類頻度分布マップＫＤＭ上の座標ＣＩに該当する部位の信頼度を抽出する。そして、種類出力手段６０は、各種類頻度分布マップＫＤＭから得られた確率をベクトル成分とする種類ベクトルを生成する。この場合、空の信頼度、建物の信頼度、木の信頼度および海の信頼度をベクトル成分とする種類ベクトルが生成される。その後、種類出力手段６０は最も大きい確率を有する種類をブロック領域ＢＲの種類情報であると識別して、種類をオブジェクト識別手段７０に送る。 When identifying the type of the actual block region BR (recognition mode), the type output unit 60 extracts the reliability of each part of the coordinate CI from the plurality of type frequency distribution maps KDM. Specifically, when the coordinate CI is sent from the mapping means 50, for example, it corresponds to the coordinate CI on each type frequency distribution map KDM such as “sky”, “building”, “tree”, “sea”, etc. Extract the reliability of the part. And the kind output means 60 produces | generates the kind vector which makes the probability obtained from each kind frequency distribution map KDM a vector component. In this case, a kind vector having the vector components of the reliability of the sky, the reliability of the building, the reliability of the tree, and the reliability of the sea is generated. Thereafter, the type output unit 60 identifies the type having the highest probability as the type information of the block area BR, and sends the type to the object identification unit 70.

なお、種類出力手段６０において、上述した種類ベクトルを構成するベクトル成分が、所定のベクトル成分しきい値より小さい場合、ブロック領域ＢＲの種類の識別の確信度が低いと判断して、「不明」とした種類をオブジェクト識別手段７０に送るようにしてもよい。もしくは最も大きいベクトル成分と２番目に大きいベクトル成分との差が小さい場合にも同様に、ブロック領域ＢＲの種類の識別の確信度が低いと判断して、種類を「不明」としてオブジェクト識別手段７０に送るようにしてもよい。これにより、種類の識別について信頼性の低いブロック領域ＢＲについてはオブジェクト領域ＯＲの種類の識別に与える影響を少なくすることができるため、オブジェクト領域ＯＲの識別の精度を向上させることができる。 In the type output means 60, when the vector component constituting the above-described type vector is smaller than the predetermined vector component threshold, it is determined that the certainty of identifying the type of the block region BR is low, and “unknown” The types may be sent to the object identification means 70. Alternatively, when the difference between the largest vector component and the second largest vector component is small, similarly, it is determined that the certainty of identifying the type of the block area BR is low, and the type is set to “unknown”, and the object identifying unit 70 You may make it send to. As a result, the block region BR with low reliability regarding the type identification can reduce the influence on the type identification of the object region OR, so that the accuracy of identification of the object region OR can be improved.

さらに、写像手段５０が送られた複数のブロック特徴量ＢＣＱを自己組織化マップＳＯＭに写像する際に、最も近似したニューロンＮｉ（発火要素）と複数のブロック特徴量ＢＣＱとの距離（たとえばユークリッド距離等）が所定の距離しきい値より大きい場合、写像手段５０は種類出力手段６０に対してマッチング処理を行わない旨の情報を送るようにしてもよい。その場合、種類出力手段６０においても、種類を「不明」とする種類をオブジェクト識別手段７０に送るようにしてもよい。この場合であっても、種類の識別について信頼性の低いブロック領域ＢＲについてはオブジェクト領域ＯＲの種類の識別に与える影響を少なくすることができるため、オブジェクト領域ＯＲの識別の精度を向上させることができる。 Further, when the plurality of block feature values BCQ sent by the mapping means 50 are mapped onto the self-organizing map SOM, the distance (for example, Euclidean distance) between the most approximate neuron Ni (firing element) and the plurality of block feature values BCQ. Or the like) is larger than a predetermined distance threshold value, the mapping unit 50 may send information indicating that the matching processing is not performed to the type output unit 60. In that case, the type output unit 60 may send a type of “unknown” to the object identification unit 70. Even in this case, since the influence on the type identification of the object area OR can be reduced for the block area BR having low reliability for the type identification, the accuracy of the identification of the object area OR can be improved. it can.

図１０は本発明のオブジェクト識別方法の好ましい実施の形態を示すフローチャートであり、図１から図１０を参照してオブジェクト識別方法について説明する。まず、オブジェクト領域生成手段２０により入力された画像をオブジェクト毎に領域分割したオブジェクト領域ＯＲが生成される。一方では、ブロック領域生成手段１０により入力された画像を設定画素数（たとえば３２×３２画素）からなる、オブジェクト領域ＯＲより小さい複数のブロック領域ＢＲが生成される。（ステップＳＴ１）。 FIG. 10 is a flowchart showing a preferred embodiment of the object identification method of the present invention. The object identification method will be described with reference to FIGS. First, an object area OR is generated by dividing the image input by the object area generation means 20 into areas for each object. On the other hand, a plurality of block areas BR smaller than the object area OR, which are composed of a set number of pixels (for example, 32 × 32 pixels), are generated from the image input by the block area generating means 10. (Step ST1).

次に、特徴量抽出手段４０により、ブロック領域ＢＲから１５個の特徴量ＢＣＱが抽出される（ステップＳＴ２）。その後、抽出した特徴量ＢＣＱが写像手段５０により自己組織化マップＳＯＭに写像されて、自己組織化マップＳＯＭの座標ＣＩが種類出力手段６０に送られる（ステップＳＴ３）。種類出力手段６０において、種類頻度分布マップＫＤＭにおいて座標ＣＩが示す種類を抽出して、オブジェクト識別手段７０に送る（ステップＳＴ４）。この作業がすべてのブロック領域ＢＲについて行われる（ステップＳＴ５）。 Next, the feature quantity extraction means 40 extracts 15 feature quantities BCQ from the block region BR (step ST2). Thereafter, the extracted feature value BCQ is mapped to the self-organizing map SOM by the mapping unit 50, and the coordinates CI of the self-organizing map SOM are sent to the type output unit 60 (step ST3). The type output means 60 extracts the type indicated by the coordinates CI in the type frequency distribution map KDM and sends it to the object identification means 70 (step ST4). This operation is performed for all the block areas BR (step ST5).

その後、オブジェクト識別手段７０において、各オブジェクト領域ＯＲ毎に付与された種類を集計する（ステップＳＴ６）。そして、最も多い種類がそのオブジェクト領域ＯＲの種類として出力される（ステップＳＴ７）。 Thereafter, in the object identification means 70, the types assigned to each object area OR are totaled (step ST6). The most common type is output as the type of the object area OR (step ST7).

上記実施の形態によれば、各ブロック領域ＢＲ毎にそれぞれ種類を識別し、ブロック領域ＢＲの種類を各オブジェクト領域ＯＲ毎に集計してオブジェクト領域ＯＲの種類を識別することにより、正確にオブジェクトの種類を自動的に識別することができる。すなわち、ブロック領域識別手段３０において、各ブロック領域ＢＲについて本来のオブジェクトの種類とは異なる種類であると識別される場合がある。たとえば、オブジェクトが「海」である場合、海のオブジェクト領域ＯＲ内のブロック領域ＢＲに「空」と判断されるものが存在することがある。このとき、オブジェクト領域ＯＲの種類は集計された種類のうち最も多い種類が付与されるようになっているため、一部にオブジェクトの真の種類情報と異なる種類情報が付されたブロック領域ＢＲが存在した場合であっても、本当のオブジェクトとは異なる種類がオブジェクトに付されるのを防止することができる。よって、自動的にかつ正確にオブジェクトの種類を識別することができる。 According to the above embodiment, the type of each block area BR is identified, the type of the block area BR is counted for each object area OR, and the type of the object area OR is identified, thereby accurately identifying the object. The type can be automatically identified. That is, the block area identifying means 30 may identify each block area BR as a type different from the original object type. For example, when the object is “sea”, there is a case where a block area BR in the sea object area OR is determined to be “sky”. At this time, since the type of the object region OR is the largest of the aggregated types, the block region BR partially attached with type information different from the true type information of the object is provided. Even if it exists, it is possible to prevent the object from being given a different type from the real object. Therefore, the object type can be automatically and accurately identified.

一方、上述したように、ブロック領域ＢＲの色成分、明度成分および像的特徴成分をブロック特徴量ＢＣＱとして抽出して、ブロック特徴量ＢＣＱを修正対向伝搬ネットワークに入力することにより、ブロック領域ＢＲ毎の種類を識別することができるようになる。つまり、画素毎に画素特徴量を抽出して種類を識別しようとした場合、種類の識別を正確に行うことができない。これは、画像から得られる画素特徴量には距離情報（像情報）が含まれておらず、明度情報もしくは色情報しか抽出することができない。よって、たとえば「海」と「空」は同一の色の場合もあるため、「海」のオブジェクトが「空」のオブジェクトと判断されてしまう場合がある。 On the other hand, as described above, the color component, the brightness component, and the image feature component of the block region BR are extracted as the block feature value BCQ, and the block feature value BCQ is input to the modified counter propagation network. The type of can be identified. That is, when trying to identify a type by extracting a pixel feature amount for each pixel, the type cannot be accurately identified. This is because pixel information obtained from an image does not include distance information (image information), and only lightness information or color information can be extracted. Therefore, for example, “sea” and “sky” may have the same color, and therefore the “sea” object may be determined to be the “sky” object.

一方、ブロック領域ＢＲ毎にブロック特徴量ＢＣＱを抽出して種類を識別するようにしているため、「海」と「空」等の色情報や明度情報が類似しているオブジェクトであっても識別することができるようになり、正確に種類の識別をすることができる。 On the other hand, since the block feature value BCQ is extracted for each block area BR and the type is identified, even objects having similar color information and brightness information such as “sea” and “sky” are identified. It is possible to identify the type accurately.

図１１は本発明のオブジェクト識別装置の第２の実施の形態を示すブロック図であり、図１１を参照してオブジェクト識別装置２００について説明する。なお、図のオブジェクト識別装置２００において、図１のオブジェクト識別装置１と同一の構成を有する部位には同一の符号を付してその説明を省略する。 FIG. 11 is a block diagram showing a second embodiment of the object identification device of the present invention. The object identification device 200 will be described with reference to FIG. In the object identification device 200 shown in the figure, parts having the same configuration as the object identification device 1 in FIG. 1 are denoted by the same reference numerals and description thereof is omitted.

図１１のオブジェクト識別装置２００が図１のオブジェクト識別装置１と異なる点は、オブジェクト領域ＯＲを抽出した後、そのオブジェクト領域ＯＲをブロック領域ＢＲに分割する点である。 The object identification device 200 of FIG. 11 is different from the object identification device 1 of FIG. 1 in that after the object region OR is extracted, the object region OR is divided into block regions BR.

すなわち、図１１のオブジェクト識別装置２００は、ブロック領域生成手段１０、オブジェクト領域生成手段２０、ブロック領域識別手段３０、オブジェクト識別手段７０等を有する。そして、オブジェクト領域生成手段２０により、画像をオブジェクト領域ＯＲ毎に領域した後、ブロック領域生成手段１０により、オブジェクト領域ＯＲを各ブロック領域ＢＲ毎に分割する。そして、ブロック領域識別手段３０により、各ブロック領域ＢＲ毎に種類を識別した後、オブジェクト識別手段７０において、オブジェクト領域ＯＲ内のブロック領域ＢＲを集計してオブジェクト領域ＯＲの種類を識別する。このオブジェクト識別装置２００であっても、図１のオブジェクト識別装置１と同様の効果を得ることができる。 That is, the object identification device 200 of FIG. 11 includes a block area generation unit 10, an object area generation unit 20, a block area identification unit 30, an object identification unit 70, and the like. Then, after the object area generating means 20 divides the image for each object area OR, the block area generating means 10 divides the object area OR for each block area BR. Then, after identifying the type for each block area BR by the block area identifying means 30, the object identifying means 70 totals the block areas BR in the object area OR to identify the type of the object area OR. Even with this object identification device 200, the same effect as the object identification device 1 of FIG. 1 can be obtained.

なお、上記各実施の形態において、図８の写像手段５０においては１つの自己組織化マップＳＯＭを有するものであるが、図１２に示すように、２つの自己組織化マップを有するようにしてもよい。具体的には、写像手段１５０は第１自己組織化マップＳＯＭ１と第２自己組織化マップＳＯＭ２を備え、第１自己組織化マップＳＯＭ１へ複数のブロック特徴量ＢＣＱを写像するための第１写像手段１５１と、第１写像手段１５１により各ブロック領域ＢＲ毎に取得された第１自己組織化マップＳＯＭ１における第１座標ＣＩ１を取得して、複数の第１座標ＣＩ１を第２自己組織化マップＳＯＭ２に写像する第２写像手段１５２とを備えている。 In each of the above embodiments, the mapping means 50 in FIG. 8 has one self-organizing map SOM. However, as shown in FIG. 12, it may have two self-organizing maps. Good. Specifically, the mapping unit 150 includes a first self-organizing map SOM1 and a second self-organizing map SOM2, and a first mapping unit for mapping a plurality of block feature values BCQ to the first self-organizing map SOM1. 151, the first coordinate CI1 in the first self-organizing map SOM1 acquired for each block region BR by the first mapping means 151 is acquired, and the plurality of first coordinates CI1 are converted into the second self-organizing map SOM2. And second mapping means 152 for mapping.

ここで、第１写像手段１５１および第１自己組織化マップＳＯＭ１は、図の写像手段５０および自己組織化マップＳＯＭと同一の構造を有している。一方、第２写像手段１５１は、たとえば互いに隣接する３×３個のブロック領域等の空間的に特定の位置関係にある複数のブロック領域ＢＲについて、第１写像手段１５１から出力された複数の第１座標ＣＩ１を第２自己組織化マップＳＯＭ２に写像するようになっている。これにより、ブロック領域ＢＲによる種類の識別をする際に、複数のブロック領域ＢＲからなる大域的な特徴（構造的な特徴）を利用した種類の識別を行うことができるため、ブロック領域ＢＲの種類の識別の精度を向上させることができる。さらに、上述した２段階の自己組織化マップＳＯＭ１、ＳＯＭ２だけでなく更に多段にすることにより、より大域的構造から種類を識別することができるようになる。 Here, the first mapping means 151 and the first self-organizing map SOM1 have the same structure as the mapping means 50 and the self-organizing map SOM shown in the figure. On the other hand, the second mapping unit 151 includes a plurality of second output units output from the first mapping unit 151 for a plurality of block regions BR having a specific spatial relationship such as 3 × 3 block regions adjacent to each other. One coordinate CI1 is mapped to the second self-organizing map SOM2. As a result, when identifying the type by the block area BR, it is possible to identify the type using a global feature (structural feature) made up of a plurality of block areas BR. The accuracy of identification can be improved. Furthermore, by using not only the above-described two-stage self-organizing maps SOM1 and SOM2 but also more stages, types can be identified from a more global structure.

また、上記各実施の形態において、ブロック領域生成手段１０は以下に示すような機能を有していてもよい。すなわち、ブロック領域生成手段１０は、図１３（ａ）に示すように、画像をメッシュ状に区切った複数の第１ブロック領域ＢＲ１と、図１３（ｂ）に示すように、複数の第１ブロック領域ＢＲ１とメッシュ状に区切る位相をずらした第２ブロック領域ＢＲ２とを生成するようになっている。つまり、ブロック領域生成手段１０はたとえば３２画素×３２画素からなる設定画素数のブロック領域をメッシュ状に機械的に区切って第１ブロック領域ＢＲ１を生成する（図１３（ａ）参照）他に、さらに、図１３（ｂ）に示すような横方向および横方向に対して半ブロック分（１６画素分）ずらしたメッシュ状の第２ブロック領域ＢＲ２を生成する。そして、生成された第１ブロック領域ＢＲ１および第２ブロック領域ＢＲ２を用いて種類の識別が行われることとなる。なお、この場合であっても、オブジェクト領域ＯＲの境界を含むブロック領域ＢＲ１、ＢＲ２は種類の識別に用いられないようになっている。 In each of the above embodiments, the block area generation unit 10 may have the following functions. That is, the block area generating means 10 includes a plurality of first block areas BR1 obtained by dividing an image into meshes as shown in FIG. 13A, and a plurality of first blocks as shown in FIG. 13B. The region BR1 and the second block region BR2 in which the phase of partitioning in a mesh shape is shifted are generated. That is, the block area generation means 10 generates the first block area BR1 by mechanically dividing the block area having a set number of pixels of 32 pixels × 32 pixels into a mesh shape (see FIG. 13A), Furthermore, a mesh-like second block region BR2 shifted by a half block (16 pixels) with respect to the horizontal direction and the horizontal direction as shown in FIG. 13B is generated. Then, the type identification is performed using the generated first block region BR1 and second block region BR2. Even in this case, the block areas BR1 and BR2 including the boundary of the object area OR are not used for type identification.

このように、オブジェクト領域ＯＲの種類の識別に用いられるブロック領域ＢＲの数を増やすことにより、識別の精度を向上させることができる。すなわち、上述したように、オブジェクト領域ＯＲの境界を含むブロック領域ＢＲは、複数の領域の特徴が混在しているとともに境界のエッジも含まれることによる識別精度の低下を防止するために、オブジェクト領域ＯＲの種類の集計に含まれていない。したがって、オブジェクト領域ＯＲが小さい場合には生成されるブロック領域ＢＲの数は少なくなり、複雑な形状のオブジェクト領域ＯＲの場合には、他のオブジェクト領域ＯＲとの境界が多くなるため、識別に用いられるブロック領域ＢＲの数は少なくなる。このため、識別された種類は精度が低くなってしまい、特に、少し複雑な画像になると識別ができず多くのオブジェクト領域ＯＲが不明であると判断されてしまう。 Thus, the accuracy of identification can be improved by increasing the number of block areas BR used for identifying the type of object area OR. In other words, as described above, the block region BR including the boundary of the object region OR is the object region in order to prevent a reduction in identification accuracy due to the presence of the boundary edges and the mixed features of the plurality of regions. Not included in OR type aggregation. Therefore, when the object area OR is small, the number of block areas BR to be generated is small, and in the case of an object area OR having a complicated shape, the boundary with other object areas OR is increased. The number of block areas BR to be reduced is reduced. For this reason, the accuracy of the identified type is lowered, and in particular, if the image is a little complicated, it cannot be identified and it is determined that many object regions OR are unknown.

このとき、図１３（ａ）、（ｂ）に示すようなそれぞれ位相のずれたブロック領域ＢＲ１、ＢＲ２を生成すれば、オブジェクト領域ＯＲの境界を含まないブロック領域ＢＲの数を増やして、より正確な種類の識別を行うことができる。 At this time, if block regions BR1 and BR2 having different phases as shown in FIGS. 13A and 13B are generated, the number of block regions BR that do not include the boundary of the object region OR is increased, and more accurate. Different types of identification can be made.

なお、図１３（ｂ）においては、横方向および縦方向に対して半ブロック分ずらした第２ブロック領域ＢＲ２を生成するようにしているが、図１３（ｃ）に示すような横方向にのみ半ブロック分（１６画素分）ずらした第２ブロック領域ＢＲ２を生成してもよいし、図１３（ｄ）に示すように縦方向にのみ半ブロック分ずらした第２ブロック領域ＢＲ２を生成するようにしてもよい。また、ブロック領域識別手段３０において、図１３（ａ）〜図１３（ｄ）の各ブロック領域ＢＲ１、ＢＲ２のすべてを用いてもよいし、ブロック領域ＢＲのいずれかを組み合わせて用いるようにしてもよい。さらに、図１３（ａ）〜図１３（ｄ）において、ブロック領域生成手段１０は、半ブロック分ずらした場合について例示しているが、半ブロック分ずらす場合に限定されず、たとえば１／４ブロック分（８画素分）ずらす等の設定画素数よりも小さいピッチだけずらしたものであればよい。 In FIG. 13B, the second block region BR2 shifted by a half block with respect to the horizontal and vertical directions is generated, but only in the horizontal direction as shown in FIG. 13C. The second block region BR2 shifted by a half block (16 pixels) may be generated, or the second block region BR2 shifted by a half block only in the vertical direction as shown in FIG. It may be. Further, in the block area identification means 30, all of the block areas BR1 and BR2 of FIGS. 13A to 13D may be used, or any one of the block areas BR may be used in combination. Good. Further, in FIG. 13A to FIG. 13D, the block region generation means 10 is illustrated with respect to a case where it is shifted by a half block, but is not limited to a case where it is shifted by a half block. What is necessary is just to shift by a smaller pitch than the set number of pixels, such as shifting by minutes (for 8 pixels).

さらに、上記各実施の形態において、ブロック領域生成手段１０が図１４に示すように、画像から解像度の異なる複数の解像度変換画像を生成する機能を有し、生成した複数の解像度変換画像から設定画素数からなるブロック領域を生成する機能を有していてもよい。具体的には、ブロック領域生成手段１０は、全体画像に対してたとえばガウシアンピラミッドもしくはウェーブレット変換等の公知の解像度変換技術を施し、複数の解像度変換画像を生成する。そして、ブロック領域生成手段１０は、生成した複数の解像度変換画像についてそれぞれ設定画素数毎にメッシュ状に区切ることにより、ブロック領域ＢＲを生成していく。そして、複数の解像度変換画像から生成されたブロック領域ＢＲ毎に種類の識別が行われるようになる。 Further, in each of the above embodiments, as shown in FIG. 14, the block area generation unit 10 has a function of generating a plurality of resolution conversion images having different resolutions from the image, and setting pixels are generated from the generated plurality of resolution conversion images. It may have a function of generating a block area consisting of numbers. Specifically, the block area generation unit 10 performs a known resolution conversion technique such as a Gaussian pyramid or a wavelet transform on the entire image to generate a plurality of resolution conversion images. Then, the block area generation unit 10 generates the block area BR by dividing the generated plurality of resolution conversion images into meshes for each set number of pixels. Then, type identification is performed for each block region BR generated from a plurality of resolution-converted images.

このとき、ブロック領域生成手段１０は、設定画素数（たとえば３２画素×３２画素）の変更は行わない。これは、ブロック領域識別手段３０において、特徴量に基づいて種類の識別を行う際に、学習した際のブロック領域ＢＲの大きさと、識別する際のブロック領域ＢＲの大きさが異なるのを防止して、自己組織化マップＳＯＭにおける識別精度の低下を防止するためである。 At this time, the block area generation unit 10 does not change the number of set pixels (for example, 32 pixels × 32 pixels). This prevents the size of the block area BR when learned from differentiating between the size of the block area BR when learning and the size of the block area BR when identifying when identifying the type based on the feature amount in the block area identifying means 30. This is to prevent a decrease in identification accuracy in the self-organizing map SOM.

このように、解像度の異なる解像度変換画像を用いてブロック領域ＢＲを生成することにより、ブロック領域ＢＲの種類の識別の精度を向上させることができる。すなわち、通常の全体画像において、同じ被写体を近くから撮影した画像と遠くから撮影した画像とでは被写体の写り方が異なる。近くから撮影した場合には被写体の種類が識別できなくても遠くから撮影した場合には被写体の種類が識別できる場合やその逆の場合がある。そこで、解像度変換画像を用いることにより、この写り方の違いによる精度の低下を防止してブロック領域ＢＲの種類識別の精度を向上させることができる。 Thus, by generating the block area BR using resolution-converted images having different resolutions, it is possible to improve the accuracy of identifying the type of the block area BR. That is, in a normal whole image, the way the subject is captured differs between an image obtained by photographing the same subject from near and an image obtained from far away. Even if the subject type cannot be identified when shooting from near, the subject type can be identified when shooting from a distance, or vice versa. Therefore, by using a resolution-converted image, it is possible to prevent a decrease in accuracy due to the difference in the way of capturing and improve the accuracy of identifying the type of the block region BR.

なお、図１４において、ブロック領域ＢＲは、全体画像を機械的にメッシュ状に区切ることにより生成しているが、図１３に示すように半ブロック分ずらして生成するようにしてもよい。 In FIG. 14, the block region BR is generated by mechanically dividing the entire image into a mesh shape, but may be generated by being shifted by a half block as shown in FIG. 13.

図１５は本発明のオブジェクト識別装置の第３の実施の形態を示すブロック図である。なお、図１５のオブジェクト識別装置３００において、図１のオブジェクト識別装置１と同一の構成を有する部位には同一の符号を付してその説明を省略する。図１５のオブジェクト識別装置３００が、図１のオブジェクト識別装置１と異なる点は、ブロック領域生成手段３１０におけるブロック領域ＢＲの生成方法である。 FIG. 15 is a block diagram showing a third embodiment of the object identification device of the present invention. In the object identification device 300 of FIG. 15, parts having the same configuration as the object identification device 1 of FIG. The object identification device 300 in FIG. 15 is different from the object identification device 1 in FIG. 1 in the block region BR generation method in the block region generation means 310.

具体的には、ブロック領域生成手段１０は、図１６に示すように、オブジェクト領域ＯＲ内に設定画素数からなる切取枠を走査させて、切取枠により囲まれた画像をブロック領域として生成するようになっている。図１７は図１５のブロック領域生成手段３１０の動作例を示すフローチャートであり、図１５から図１７を参照してブロック領域ＢＲの生成方法の一例について説明する。 Specifically, as shown in FIG. 16, the block area generation unit 10 scans a cut frame having a set number of pixels in the object area OR, and generates an image surrounded by the cut frame as a block area. It has become. FIG. 17 is a flowchart showing an example of the operation of the block area generating unit 310 of FIG. 15, and an example of a method of generating the block area BR will be described with reference to FIGS.

まず、オブジェクト領域生成手段２０により、全体画像から複数のオブジェクト領域ＯＲが生成される（ステップＳＴ１０）。その後、生成された各オブジェクト領域ＯＲに対して領域ＩＤが付与される（ステップＳＴ１１）。そして、生成された複数のオブジェクト領域ＯＲの中から、ブロック領域ＢＲを生成するオブジェクト領域ＯＲが決定され（ステップＳＴ１２）、ブロック領域ＢＲが生成されていく（ステップＳＴ１３）。このブロック領域生成工程（ステップＳＴ１３）が、全体画像に含まれるすべてのオブジェクト領域ＯＲについて行われる（ステップＳＴ１２〜ステップＳＴ１４）。その後、生成された複数のブロック領域ＢＲの種類がブロック領域識別手段３０により識別される。 First, the object area generation means 20 generates a plurality of object areas OR from the entire image (step ST10). Thereafter, an area ID is assigned to each generated object area OR (step ST11). Then, the object area OR for generating the block area BR is determined from the plurality of generated object areas OR (step ST12), and the block area BR is generated (step ST13). This block region generation step (step ST13) is performed for all object regions OR included in the entire image (steps ST12 to ST14). Thereafter, the types of the plurality of generated block areas BR are identified by the block area identifying means 30.

図１８はブロック領域生成工程（ステップＳＴ１３）の一例を示すフローチャートであり、図１８を参照してブロック領域ＢＲの生成工程について説明する。まず、オブジェクト領域ＯＲ内の始点に切取枠が設置される（ステップＳＴ１３−１）。具体的には、図１６に示すようにオブジェクト領域ＯＲの左上端に切取枠の左上角が位置するように切取枠が位置決めされる。そして、切取枠内のすべての領域ＩＤが一致するか否かが判断されて（ステップＳＴ１３−２）、切取枠内の領域ＩＤがすべて一致する場合には、切取枠に囲まれた領域がブロック領域ＢＲとして生成される（ステップＳＴ１３−３）。 FIG. 18 is a flowchart showing an example of the block area generation step (step ST13). The generation process of the block area BR will be described with reference to FIG. First, a cutting frame is set at the start point in the object area OR (step ST13-1). Specifically, as shown in FIG. 16, the cutting frame is positioned so that the upper left corner of the cutting frame is positioned at the upper left corner of the object area OR. Then, it is determined whether or not all the area IDs in the cut frame match (step ST13-2). If all the area IDs in the cut frame match, the area surrounded by the cut frame is blocked. The area BR is generated (step ST13-3).

その後、切取枠が水平方向（右方向）に向かってたとえば８画素だけずらされる（ステップＳＴ１３−４）。ここで、切取枠がオブジェクト領域ＯＲの最右端まで走査したか否かが判断され（ステップＳＴ１３−５）、走査していない場合には続けてブロック領域の生成が行われる（ステップＳＴ１３−２〜ステップＳＴ１３−５）。一方、切取枠が、オブジェクト領域ＯＲの最右端まで走査した場合には、切取枠が垂直方向（下方向）にたとえば８画素だけずらされるとともに、水平方向にも移動してオブジェクト領域ＯＲの左端に位置決めされる（ステップＳＴ１３−６）。その後、水平方向に対してブロック領域ＢＲが生成されていく（ステップＳＴ１３−２〜ステップＳＴ１３−６）。そして、切取枠がオブジェクト領域ＯＲの最下端まで走査した場合には（ステップＳＴ１３−７）、１つのオブジェクト領域ＯＲについてブロック領域ＢＲの生成が完了する。 Thereafter, the cut frame is shifted by, for example, 8 pixels in the horizontal direction (right direction) (step ST13-4). Here, it is determined whether or not the cutting frame has been scanned to the rightmost end of the object area OR (step ST13-5). If not, the block area is generated (step ST13-2 to ST13-2). Step ST13-5). On the other hand, when the cutting frame is scanned to the rightmost end of the object area OR, the cutting frame is shifted by, for example, 8 pixels in the vertical direction (downward), and also moved in the horizontal direction to the left end of the object area OR. Positioning is performed (step ST13-6). Thereafter, the block region BR is generated in the horizontal direction (step ST13-2 to step ST13-6). When the cut frame is scanned to the lowest end of the object area OR (step ST13-7), the generation of the block area BR for one object area OR is completed.

このように、切取枠をオブジェクト領域ＯＲ内において走査させながらブロック領域ＢＲを生成することにより、オブジェクト領域ＯＲの種類を識別するためのブロック領域ＢＲの数を増やすことができるため、オブジェクト領域ＯＲの識別の精度を向上させることができる。 In this way, by generating the block area BR while scanning the cut frame within the object area OR, the number of block areas BR for identifying the type of the object area OR can be increased. The accuracy of identification can be improved.

なお、切取枠は水平方向および垂直方向に対して８画素ずらす場合について例示しているが、２画素や４画素といったの切取枠よりも小さい画素に設定されていればよい。さらに、領域ＩＤを変更することにより切取枠により切り取られるブロック領域ＢＲを決定するようにしているが、機械的に切取枠をたとえば２画素等の切取枠よりも小さい画素ピッチで、縦方向および横方向に走査するようにしてもよい。このとき、切取枠内に２つの領域ＩＤを含まれているブロック領域ＢＲについては、種類の識別を行わないようにしてもよい。 Note that the cut frame is illustrated as being shifted by 8 pixels with respect to the horizontal direction and the vertical direction, but may be set to pixels smaller than the cut frame, such as 2 pixels or 4 pixels. Further, the block area BR to be cut out by the cut frame is determined by changing the area ID. However, the cut frame is mechanically arranged at a pixel pitch smaller than the cut frame, such as 2 pixels, in the vertical direction and the horizontal direction. You may make it scan in a direction. At this time, the type identification may not be performed for the block area BR including two area IDs in the cut frame.

図１９は本発明のオブジェクト識別装置における特徴量抽出手段の別の実施の形態を示すブロック図である。図１９の特徴量抽出手段１４０は、画像変換手段１４１、エッジ画像生成手段１４２、相関特徴量抽出手段１４３、エッジ特徴量抽出手段１４４、色特徴量抽出手段１４５等を有する。 FIG. 19 is a block diagram showing another embodiment of the feature quantity extraction means in the object identification device of the present invention. The feature quantity extraction unit 140 of FIG. 19 includes an image conversion unit 141, an edge image generation unit 142, a correlation feature quantity extraction unit 143, an edge feature quantity extraction unit 144, a color feature quantity extraction unit 145, and the like.

画像変換手段１４１は、ＲＧＢ表色系により表現されているブロック領域をＹＣＣ表色系に変換するものである。このとき、画像変換手段１４１は、画像を構成する複数のブロック領域ＢＲのうち、１つのオブジェクト領域ＯＲに含まれるブロック領域を識別するようになっている。これは、画像を構成する複数のブロック領域ＢＲのうち、オブジェクト領域ＯＲ間の境界にまたがるブロック領域ＢＲは、オブジェクト領域ＯＲの種類の判断には使用しないため、特徴量の抽出を行わないためである。 The image conversion unit 141 converts the block area expressed by the RGB color system to the YCC color system. At this time, the image conversion unit 141 identifies a block area included in one object area OR among a plurality of block areas BR constituting the image. This is because, among the plurality of block areas BR constituting the image, the block area BR that straddles the boundary between the object areas OR is not used for the determination of the type of the object area OR, and thus the feature amount is not extracted. is there.

エッジ画像生成手段１４２は、画像変換手段１４１により生成されたＹ成分を用いてエッジ画像を生成する機能を有する。ここで、エッジ画像生成手段１４２は、図２０（ａ）に示す縦エッジ検出用フィルターを用いて縦エッジ画像を生成するとともに、図２０（ｂ）に示す横エッジ検出用フィルターを用いて横エッジ画像を生成するようになっている。 The edge image generation unit 142 has a function of generating an edge image using the Y component generated by the image conversion unit 141. Here, the edge image generation unit 142 generates a vertical edge image using the vertical edge detection filter shown in FIG. 20A and uses the horizontal edge detection filter shown in FIG. An image is generated.

なお、エッジ画像生成手段１４２は、図２０に示すようなエッジ検出用フィルター（ｐｒｅｗｉｔｔフィルター）を用いているが、たとえば上下左右の画素には対角線上のものより大きな重みを与えたエッジ検出用フィルター（Ｓｏｂｅｌフィルター）を用いたエッジ検出方法やその他の公知のエッジ検出方法を用いることができる。 The edge image generation unit 142 uses an edge detection filter (prewitt filter) as shown in FIG. 20, for example, an edge detection filter in which higher and lower pixels are given higher weights than those on the diagonal line. An edge detection method using (Sobel filter) or other known edge detection methods can be used.

図１９の相関特徴量抽出手段１４３は、ブロック領域ＢＲの各画素に割り当てられた成分信号値の１方向に沿った変化の規則性の程度を示す相関特徴量を抽出するものである。ここで、図２１は相関特徴量抽出手段１４３における相関特徴量の算出方法の一例を示すフローチャートを示しており、図２１を参照して相関特徴量の算出方法について説明する。 The correlation feature amount extraction unit 143 in FIG. 19 extracts a correlation feature amount indicating the degree of regularity of change along one direction of the component signal value assigned to each pixel of the block region BR. Here, FIG. 21 is a flowchart showing an example of a calculation method of the correlation feature quantity in the correlation feature quantity extraction unit 143, and the calculation method of the correlation feature quantity will be described with reference to FIG.

なお、図２１において横方向に沿った変化に関する相関特徴量の抽出について説明するが、同様の手法により縦方向に沿った変化に対する相関特徴量も抽出される。また、以下に示すＦ_ｉ（ｘ）は、第ｉ行における第ｘ画素（ｉ＝０〜３１、ｘ＝０〜３１）の成分信号値を示し、Ｆ_ｊ（ｘ）は第ｊ行における第ｘ画素（ｉ＝０〜３１、ｘ＝０〜３１）の成分信号値を示すものとする。 In addition, although extraction of the correlation feature-value regarding the change along a horizontal direction is demonstrated in FIG. 21, the correlation feature-value with respect to the change along a vertical direction is also extracted by the same method. Further, F _i (x) shown below indicates the component signal value of the x-th pixel (i = 0 to 31, x = 0 to 31) in the i-th row, and F _j (x) indicates the component signal value in the j-th row. The component signal value of x pixel (i = 0-31, x = 0-31) shall be shown.

最初に、エッジ画像生成手段１４２において生成された縦エッジ画像を用いて、縦エッジ画像の各行に沿った成分信号値Ｆ_ｉ（ｘ）、Ｆ_ｊ（ｘ）の変化を規格化する（ステップＳＴ２１）。具体的には、成分信号値Ｆ_ｉ（ｘ）と平均値Ｆ_ｉとの差分を標準偏差δ_ｉで割り、規格化された成分信号値Ｆ_ｉ’（ｘ）が求められる。 First, using the vertical edge image generated by the edge image generation means 142, the change in the component signal values F _i (x) and F _j (x) along each row of the vertical edge image is normalized (step ST21). ). Specifically, the difference between the component signal value F _i (x) and the average value F _i is divided by the standard deviation δ _i to obtain a normalized component signal value F _i ′ (x).

Ｆ_ｉ’（ｘ）＝（Ｆ_ｉ（ｘ）−Ｆ_ｉ）／δ_ｉ
同様に、ｊ行の成分信号値Ｆ_ｊ（ｘ）と平均値Ｆ_ｊとの差分を標準偏差δ_ｉで割り、規格化された成分信号値Ｆ_ｉ’（ｘ）が求められる。 F _i ′ (x) = (F _i (x) −F _i ) / δ _i
Similarly, the difference between the component signal value F _j (x) of j rows and the average value F _j is divided by the standard deviation δ _i to obtain a normalized component signal value F _i ′ (x).

Ｆ_ｊ’（ｘ）＝（Ｆ_ｊ（ｘ）−Ｆ_ｊ）／δ_ｊ
このように、成分信号値を規格化して相関特徴量を求めるのは、各行間における変動幅や平均値の違いを排除して、変動パターン自体の相互相関性を示す相関特徴量を導出するためである。なお、Ｆ_ｉ（ｘ）、Ｆ_ｊ（ｘ）が一定値であり標準偏差が０の場合は、Ｆ_ｉ’（ｘ）＝０（一定）、Ｆ_ｊ’（ｘ）＝０（一定）とする。 F _j ′ (x) = (F _j (x) −F _j ) / δ _j
As described above, the correlation feature value is obtained by normalizing the component signal value in order to derive the correlation feature value indicating the cross-correlation of the variation pattern itself by eliminating the difference in the fluctuation range and the average value between the rows. It is. When F _i (x) and F _j (x) are constant values and the standard deviation is 0, F _i ′ (x) = 0 (constant) and F _j ′ (x) = 0 (constant). To do.

そして、異なる２行（第ｉ行と第ｊ行）の組合せについて、これら２行に関する規格化された成分信号値Ｆ_ｉ’（ｘ）およびＦ_ｊ’（ｘ）を用いて、相互相関関数

Then, with respect to the combination of two different rows (i-th row and j-th row), the cross-correlation function is obtained using the normalized component signal values F _i ′ (x) and F _j ′ (x) regarding these two rows.

が導出される（ステップＳＴ２２）。この相互相関関数は、概念的に言えば、図２２（ａ）に示すように、２行の規格化された成分信号値Ｆ_ｉ’（ｘ）およびＦ_ｊ’（ｘ）をｄ画素分だけずらして掛け合わせ、その総和を取るものである。すると、図２２（ｂ）に示すような、ｄの関数としての相互相関関数Ｇ_ｉｊ（ｄ）が得られる。 Is derived (step ST22). Conceptually, the cross-correlation function is obtained by converting the normalized component signal values F _i ′ (x) and F _j ′ (x) of two rows by d pixels as shown in FIG. Multiply by shifting and take the sum. Then, a cross-correlation function G _ij (d) as a function of d as shown in FIG. 22B is obtained.

次に、算出した相互相関関数Ｇ_ｉｊ（ｄ）にｄ＝０〜３１に代入したときの相関値の中から最大相関値が算出される（ステップＳＴ２３）。

Next, the maximum correlation value is calculated from the correlation values when d = 0 to 31 is substituted into the calculated cross-correlation function G _ij (d) (step ST23).

この作業をすべての２行の組み合わせについて最大相関値が算出される（ステップＳＴ２１〜ステップＳＴ２４）。ここでは、３２画素×３２画素のブロック領域においては、０行〜３１行のすべての組み合わせの最大相関値が算出される。そして、算出されたすべての最大相関値の平均値および標準偏差が算出されて、この平均値および標準偏差が相関特徴量とされる（ステップＳＴ２５）。同様に、縦方向に沿った変化に関する最大相関値の平均値および標準偏差が相関特徴量として算出される（ステップＳＴ２１〜ステップＳＴ２５）。 In this operation, the maximum correlation value is calculated for all combinations of two rows (steps ST21 to ST24). Here, in the block region of 32 pixels × 32 pixels, the maximum correlation values of all combinations of the 0th to 31st rows are calculated. Then, an average value and a standard deviation of all the calculated maximum correlation values are calculated, and the average value and the standard deviation are set as correlation feature amounts (step ST25). Similarly, the average value and the standard deviation of the maximum correlation values relating to changes along the vertical direction are calculated as correlation feature amounts (steps ST21 to ST25).

上述したように算出された相関特徴量は、オブジェクトを構成するブロック領域ＢＲに規則的なパターンがあるかどうかを示すものであり、最大相関値の平均値が大きく標準偏差の小さくなればなるほど、規則的なパターンが形成されていることを意味する。一般的に撮影された画像に含まれる自然物は規則的なパターン、連続的なパターン、周期的なパターンは少なく、ランダムなパターンにより構成されていることが多い。一方、ビルや石畳等の人工物は規則的なパターン等により構成されていることが多い。そこで、オブジェクトを構成するブロック領域ＢＲが規則的なパターンを構成しているか否かを示す相関特徴量を抽出することにより、ブロック領域ＢＲが人工的に作られた建造物等の画像の一部であるのか、自然物の画像の一部であるのかを判断することができる。 The correlation feature amount calculated as described above indicates whether or not there is a regular pattern in the block region BR constituting the object. The larger the average value of the maximum correlation values is and the smaller the standard deviation is, It means that a regular pattern is formed. In general, natural objects included in captured images have few regular patterns, continuous patterns, and periodic patterns, and are often composed of random patterns. On the other hand, artifacts such as buildings and cobblestones are often composed of regular patterns. Therefore, a part of an image of a building or the like in which the block region BR is artificially created by extracting a correlation feature amount indicating whether or not the block region BR constituting the object forms a regular pattern. Or a part of an image of a natural object.

なお、相関特徴量抽出手段１４３は、単に規格化された成分信号値Ｆ_ｉ’（ｘ）、Ｆ_ｊ’（ｘ）の積の総和の平均値および標準偏差を相関特徴量として抽出してもよいが、上述のように、相互相関関数の最大値の平均値および標準偏差を相関特徴量として用いれば、たとえば斜め方向に規則的な模様や波紋が撮影されたブロック領域ＢＲについても、そのパターンの規則性を示す適当な相関特徴量を導出できるようになり、ブロック領域の相関に関する特徴量を正確に表した相関特徴量の抽出を行うことができる。ここで、１画素ずつ画素ライン画素ラインをずらした場合（ｄ＝０，１，２，・・・３１）について言及しているが、２画素分ずらす等の複数画素ずらしながら最大相関値を算出するようにしてもよい。 Note that the correlation feature quantity extraction means 143 may simply extract the average value and standard deviation of the sum of the normalized component signal values F _i ′ (x) and F _j ′ (x) as the correlation feature quantity. However, as described above, if the average value and the standard deviation of the maximum value of the cross-correlation function are used as the correlation feature amount, for example, the pattern of the block region BR in which a regular pattern or ripple is photographed in an oblique direction. Accordingly, it is possible to derive an appropriate correlation feature amount indicating regularity of the block, and it is possible to extract a correlation feature amount that accurately represents the feature amount related to the correlation of the block region. Here, the case where the pixel line is shifted pixel by pixel (d = 0, 1, 2,... 31) is mentioned, but the maximum correlation value is calculated while shifting a plurality of pixels, such as shifting by two pixels. You may make it do.

エッジ特徴量抽出手段１４４は、ブロック領域ＢＲのエッジ成分の特徴量を抽出するものである。具体的には、エッジ特徴量抽出手段１４４は、エッジ検出フィルター（図２０参照）を用いて生成された縦エッジ画像および横エッジ画像について、それぞれの成分信号値の平均値および標準偏差を算出し、４個のエッジ特徴量を出力するものである。 The edge feature quantity extraction means 144 extracts the feature quantity of the edge component of the block area BR. Specifically, the edge feature quantity extraction unit 144 calculates an average value and a standard deviation of each component signal value for the vertical edge image and the horizontal edge image generated using the edge detection filter (see FIG. 20). Four edge feature values are output.

このように、エッジ成分の特徴量としてエッジ成分の平均値を用いることにより、自然物の中でもエッジの少ない「空」と自然物の中でもエッジの多い「水」や「植物」とを分類することができる。また、エッジ特徴量としてブロック領域ＢＲの縦方向のエッジ成分と横方向のエッジ成分とを抽出することにより、たとえば「水」のように方向によってエッジ成分の特徴が異なるオブジェクトと、「植物」「花畑」等の縦方向および横方向において比較的均一なエッジを形成するオブジェクトとを分類することができる。 In this way, by using the average value of the edge component as the feature value of the edge component, it is possible to classify “sky” with few edges among natural objects and “water” and “plants” with many edges among natural objects. . Further, by extracting the vertical edge component and the horizontal edge component of the block region BR as edge feature amounts, for example, “water”, an object having different edge component characteristics depending on the direction, “plant”, “ Objects that form relatively uniform edges in the vertical and horizontal directions such as “flower garden” can be classified.

色特徴量抽出手段１４５は、ブロック領域ＢＲの色特徴を示す色特徴量を抽出するものである。具体的には、色特徴量抽出手段１４５は、ＹＣＣ表色系で表されたブロック領域ＢＲを構成する３２×３２画素分の輝度成分（Ｙ成分）および２つの色差成分（Ｃｒ、Ｃｂ）の各成分信号値の平均値および標準偏差を算出し、１のブロック領域から６個の色特徴量を抽出するものである。 The color feature amount extraction unit 145 extracts a color feature amount indicating the color feature of the block region BR. Specifically, the color feature amount extraction unit 145 includes the luminance component (Y component) and the two color difference components (Cr, Cb) for 32 × 32 pixels constituting the block region BR expressed in the YCC color system. An average value and a standard deviation of each component signal value are calculated, and six color feature amounts are extracted from one block area.

なお、色特徴量抽出手段１４５は、ＲＧＢ表色系からＹＣＣ表色系に変換された後に色特徴量が抽出するようにしているが、たとえばＲＧＢ表色系のまま各成分（ＲＧＢ）について色特徴量を抽出するようにしてもよいし、画像変換手段１４１において、ＲＧＢ表色系のブロック領域ＢＲをＬａｂ表色系に変換して、Ｌａｂの各成分について色特徴量を抽出するようにしてもよい。また、色特徴量抽出手段１４５は、各成分信号値の平均値と標準偏差とを色特徴量として抽出しているが、たとえば最大値や最小値、分位点等その他の代表値を色特徴量として用いてもよい。 The color feature quantity extraction unit 145 extracts the color feature quantity after conversion from the RGB color system to the YCC color system. The feature amount may be extracted, or the image conversion unit 141 may convert the RGB color system block area BR to the Lab color system and extract the color feature amount for each component of Lab. Also good. The color feature quantity extraction unit 145 extracts the average value and standard deviation of each component signal value as color feature quantities. For example, the color feature quantity extraction unit 145 uses other representative values such as a maximum value, minimum value, and quantile as color features. It may be used as a quantity.

そして、４つの相関特徴量と４つのエッジ特徴量と６つの色特徴量とからなる１４次元のブロック特徴量が写像手段５０に入力されて、種類出力手段６０により種類の識別が行われるようになる。このとき、写像手段５０における自己組織化マップＳＯＭの結合荷重ベクトルは、１４次元のベクトルから構成されるようになり、自己組織化マップＳＯＭは１４次元の特徴ベクトルを用いて学習された状態になっている。 Then, a 14-dimensional block feature value composed of four correlation feature values, four edge feature values, and six color feature values is input to the mapping unit 50, and the type output unit 60 identifies the type. Become. At this time, the combined load vector of the self-organizing map SOM in the mapping means 50 is composed of 14-dimensional vectors, and the self-organizing map SOM is learned using the 14-dimensional feature vectors. ing.

なお、各特徴量は変動幅を調整した上で適当な重み付けをして使用するようにしてもよい。さらに、図１９の特徴量抽出手段１４０が、上述した相関特徴量、エッジ特徴量、色特徴量の他に、図６における距離画像から抽出した特徴量や高周波成分の特徴量を算出する機能を有するものであってもよい。 Each feature amount may be used with appropriate weighting after adjusting the fluctuation range. Further, the feature quantity extraction unit 140 in FIG. 19 has a function of calculating the feature quantity extracted from the distance image in FIG. 6 and the feature quantity of the high frequency component in addition to the above-described correlation feature quantity, edge feature quantity, and color feature quantity. You may have.

図２３は本発明のオブジェクト識別装置の第４の実施の形態を示すブロック図であり、図２３を参照してオブジェクト識別装置５００について説明する。なお、図２３のオブジェクト識別装置５００において、図１のオブジェクト識別装置１と同一の構成を有する部位には同一の符号を付してその説明を省略する。 FIG. 23 is a block diagram showing a fourth embodiment of the object identification device of the present invention. The object identification device 500 will be described with reference to FIG. In the object identification device 500 of FIG. 23, parts having the same configuration as the object identification device 1 of FIG.

オブジェクト識別装置５００において、最初にオブジェクト領域生成手段２０がオブジェクト領域ＯＲを生成するようになっている。そして、特徴量抽出手段５４０が、生成されたオブジェクト領域ＯＲからオブジェクト特徴量を抽出するようになっている。その後、抽出したオブジェクト特徴量を用いて写像手段５０および種類出力手段６０により、オブジェクト領域ＯＲの種類が識別されるようになっている。 In the object identification device 500, first, the object area generation means 20 generates an object area OR. Then, the feature quantity extraction means 540 extracts the object feature quantity from the generated object area OR. After that, the type of the object area OR is identified by the mapping unit 50 and the type output unit 60 using the extracted object feature amount.

さらに、この特徴量抽出手段５４０は、画像変換手段５１０により所定の画像変換処理が施された全体画像と生成されたオブジェクト領域ＯＲとを用いてオブジェクト特徴量を抽出するようになっている。具体的には、画像変換手段５１０は、ＲＧＢ表色系からなる全体画像をＹＣＣ表色系に変換し、ＹＣＣ表色系の各成分毎の３つの画像を生成する機能を有する。さらに、画像変換手段５１０は、Ｙ成分から生成した縦エッジ画像と横エッジ画像とを生成するようになっている。そして特徴量抽出手段４０は、ＹＣＣ各成分毎の３つの画像、縦エッジ画像、横エッジ画像の５つの画像からそれぞれオブジェクト特徴量を抽出するようになっている。 Further, the feature amount extraction unit 540 extracts an object feature amount using the entire image that has been subjected to the predetermined image conversion process by the image conversion unit 510 and the generated object region OR. Specifically, the image conversion unit 510 has a function of converting the entire image composed of the RGB color system into the YCC color system and generating three images for each component of the YCC color system. Furthermore, the image conversion means 510 generates a vertical edge image and a horizontal edge image generated from the Y component. The feature quantity extraction means 40 extracts object feature quantities from five images, that is, three images for each YCC component, a vertical edge image, and a horizontal edge image.

ここで、特徴量抽出手段５４０は以下に示す手法によりオブジェクト特徴量を抽出するようになっている。すなわち、特徴量抽出手段５４０は、上述した各画像に対して領域分割結果を組み合わせることにより、各オブジェクト領域ＯＲ毎の画素値の分布（ヒストグラム）を生成する。そして、特徴量抽出手段５４０は、ヒストグラムから平均値および標準偏差を算出し、オブジェクト特徴量を生成するようになっている。なお、特徴量としてヒストグラムの代表点（たとえば最大値、最小値、中央値、分位点等）を用いてもよい。また、自己組織化マップＳＯＭの学習用サンプルは、ブロック領域ＢＲに上述した画像変換を施し、上述したヒストグラムから抽出した特徴量を用いて行われることになる。 Here, the feature quantity extraction means 540 extracts object feature quantities by the following method. That is, the feature amount extraction unit 540 generates a distribution (histogram) of pixel values for each object region OR by combining the region division results for each image described above. The feature quantity extraction unit 540 calculates an average value and a standard deviation from the histogram, and generates an object feature quantity. Note that a representative point of the histogram (for example, maximum value, minimum value, median value, quantile, etc.) may be used as the feature quantity. In addition, the learning sample of the self-organizing map SOM is performed using the above-described image conversion on the block region BR and using the feature amount extracted from the above-described histogram.

なお、上述した特徴量抽出手段５４０において、図６や図１９に示すような特徴量をオブジェクト領域ＯＲから抽出し、オブジェクト特徴量としてもよい。さらに、上述した画像変換手段５１０において、全体画像に多重解像度変換を施し解像度の異なる複数の解像度変換画像、全体画像をＲＧＢ表色系からＬａｂ表色系に変換した画像、モフォロジーフィルタ等を用いて特定形状の構造を抽出したフィルタリング画像等を生成するようにし、特徴量抽出手段５４０は、各画像から特徴量を抽出するようにしてもよい。 Note that the feature quantity extraction means 540 described above may extract feature quantities as shown in FIGS. 6 and 19 from the object region OR and use them as object feature quantities. Further, the above-described image conversion unit 510 uses a plurality of resolution conversion images having different resolutions by performing multi-resolution conversion on the entire image, an image obtained by converting the entire image from the RGB color system to the Lab color system, a morphology filter, and the like. A filtering image or the like obtained by extracting a structure having a specific shape may be generated, and the feature amount extraction unit 540 may extract a feature amount from each image.

これにより、オブジェクト領域ＯＲの領域形状が複雑な場合や小さい場合においてもオブジェクト領域ＯＲの種類を確実に識別することができるようになる。すなわち、全体画像をブロック領域ＢＲに分けたときには、オブジェクト領域ＯＲが複雑な場合にはオブジェクト領域ＯＲが複数のブロック領域ＢＲに分かれてしまい、オブジェクト領域ＯＲが小さい場合には種類識別に用いるブロック領域ＢＲの数が少なくなってしまう。 As a result, even when the area shape of the object area OR is complex or small, the type of the object area OR can be reliably identified. That is, when the entire image is divided into block areas BR, the object area OR is divided into a plurality of block areas BR when the object area OR is complicated, and the block area used for type identification when the object area OR is small. The number of BR will decrease.

これに対し、ブロック領域識別手段３０による識別結果をブロック領域ＢＲに含まれるすべての画素に割り当てるようにし、オブジェクト領域ＯＲを構成する画素に割り当てられた種類のうち、最も画素の多い種類をオブジェクト領域ＯＲの種類であると識別することも考えられる。しかし、オブジェクト領域ＯＲの境界を含むブロック領域ＢＲについても種類の識別を行う必要があり、その結果、種類の識別の精度が低下してしまうという問題がある。そこで、オブジェクト領域ＯＲ自体から特徴量を抽出して種類の識別を行うことにより、複雑な形状のオブジェクト領域ＯＲや形状の小さいオブジェクト領域ＯＲについても精度よく種類の識別を行うことができる。 On the other hand, the result of identification by the block area identifying means 30 is assigned to all the pixels included in the block area BR, and the type having the largest number of pixels among the types assigned to the pixels constituting the object area OR is assigned to the object area. It may be possible to identify the type of OR. However, it is necessary to identify the type of the block region BR including the boundary of the object region OR. As a result, there is a problem that the accuracy of identifying the type is lowered. Therefore, by extracting the feature amount from the object area OR itself and identifying the type, it is possible to accurately identify the type of the object area OR having a complicated shape or the object area OR having a small shape.

図２４は本発明のオブジェクト識別装置の第５の実施の形態を示すブロック図であり、図２４を参照してオブジェクト識別装置６００について説明する。なお、図２４のオブジェクト識別装置６００において図１のオブジェクト識別装置１および図２３のオブジェクト識別装置５００と同一の構成を有する部位には同一の符号を付してその説明を省略する。図２４のオブジェクト識別装置６００が、図２３のオブジェクト識別装置５００と異なる点は、オブジェクト領域ＯＲの外接矩形画像を規格化した規格化オブジェクト領域を生成する規格化手段６３０をさらに備えることである。したがって、オブジェクト特徴量を抽出する際のオブジェクト領域ＯＲの大きさは、いずれの画像のいずれのオブジェクト領域ＯＲであっても同一の大きさとなる。 FIG. 24 is a block diagram showing a fifth embodiment of the object identification device of the present invention. The object identification device 600 will be described with reference to FIG. In the object identification device 600 of FIG. 24, parts having the same configurations as those of the object identification device 1 of FIG. 1 and the object identification device 500 of FIG. The object identification device 600 of FIG. 24 is different from the object identification device 500 of FIG. 23 in that the object identification device 600 further includes a normalization unit 630 that generates a normalized object region obtained by normalizing a circumscribed rectangular image of the object region OR. Accordingly, the size of the object area OR when extracting the object feature amount is the same regardless of the object area OR of any image.

このように、オブジェクト領域ＯＲを規格化してからオブジェクト特徴量を抽出することにより、全体画像に含まれるオブジェクト領域の大きさに種類の識別精度が依存されることなく、正確な識別を行うことができる。つまり、全体画像に含まれるオブジェクトの大きさは、撮影時の状況により多種多様なものとなる。そこで、各オブジェクト領域ＯＲを規格化した後にオブジェクト特徴量を抽出し種類の識別を行うことにより、サイズの変動に対してロバスト性を持たせ、精度の高い種類の識別を行うことが可能となる。 As described above, by extracting the object feature amount after normalizing the object area OR, accurate identification can be performed without depending on the size of the object area included in the entire image and the type identification accuracy. it can. That is, the size of the object included in the entire image varies depending on the situation at the time of shooting. Therefore, by extracting the object feature amount after standardizing each object region OR and identifying the type, it is possible to make the type robust with respect to the size variation and to identify the type with high accuracy. .

なお、図１のオブジェクト識別装置１と図２４のオブジェクト識別装置６００とを組み合わせて使用するようにしてもよい。すると、オブジェクトの一部が遮蔽物によって隠れている場合、オブジェクト領域ＯＲからは種類の識別精度が低くなってしまうが、ブロック領域ＢＲによる識別の集計を用いれば、オブジェクトの種類の識別精度が低下するのを防止することができる。 Note that the object identification device 1 of FIG. 1 and the object identification device 600 of FIG. 24 may be used in combination. Then, when a part of the object is hidden by the shielding object, the type identification accuracy is lowered from the object area OR, but if the identification summation by the block area BR is used, the object type identification accuracy is lowered. Can be prevented.

なお、本発明の実施の形態は上記各実施の形態に限定されない。たとえば、図１〜図２２のオブジェクト識別装置１、３００については、ブロック領域ＢＲの種類を識別し、その識別結果を集計してオブジェクト領域ＯＲの種類を識別し、図２３および図２４のオブジェクト識別装置５００、６００については、オブジェクト領域ＯＲ自体から種類を識別するようにしているが、両者を組み合わせるようにしてもよい。すなわち、ブロック領域識別手段３０によるブロック領域ＢＲの種類の識別と、種類出力手段によるオブジェクト領域ＯＲ自体の種類の識別とを用いて、オブジェクト識別手段において最終的なオブジェクト領域ＯＲの種類を識別するようにしてもよい。 The embodiments of the present invention are not limited to the above embodiments. For example, for the object identification devices 1 and 300 in FIGS. 1 to 22, the types of the block areas BR are identified, the identification results are aggregated to identify the types of the object areas OR, and the object identifications in FIGS. As for the devices 500 and 600, the type is identified from the object region OR itself, but they may be combined. That is, the object identification means identifies the final type of the object area OR using the block area BR type identification by the block area identification means 30 and the type identification means by the type output means. It may be.

また、図１のブロック領域識別手段３０は、種類として「空」や「海」等といった情報をオブジェクト識別手段７０に送るようにしているが、上述した種類ベクトル自体を種類としてオブジェクト識別手段７０に送るようにしてもよい。この場合、オブジェクト識別手段７０は、オブジェクト領域ＯＲに含まれる各ブロック領域ＢＲの種類ベクトルを単純加算することにより、種類ベクトルのうち最大のベクトル成分となっている種類をオブジェクト領域ＯＲの種類として識別するようにしてもよい。あるいは、最大のベクトル成分が最大しきい値よりも小さい等の場合、オブジェクト識別手段７０がオブジェクト領域ＯＲの種類を「不明」となるようにしてもよい。 1 sends information such as “sky” and “sea” as types to the object identification unit 70, but the above-described type vector itself is used as a type to the object identification unit 70. You may make it send. In this case, the object identifying means 70 identifies the type that is the largest vector component of the type vectors as the type of the object region OR by simply adding the type vector of each block region BR included in the object region OR. You may make it do. Alternatively, when the maximum vector component is smaller than the maximum threshold value, the object identification unit 70 may set the type of the object area OR to “unknown”.

また、オブジェクト領域ＯＲの生成およびブロック領域ＢＲの生成は、送られる全体画像Ｐの有する解像度をそのまま使用している場合について例示しているが、オブジェクト領域生成手段２０およびブロック領域生成手段１０に入力する前に解像度を落としてから入力するようにしてもよい。解像度を落とすことにより、処理するデータ量を少なくすることができるため、処理速度の向上および処理の効率化を図ることができる。 In addition, the generation of the object area OR and the generation of the block area BR are exemplified for the case where the resolution of the whole image P to be sent is used as it is, but the input to the object area generation means 20 and the block area generation means 10 You may make it input after reducing resolution before doing. Since the amount of data to be processed can be reduced by reducing the resolution, the processing speed can be improved and the processing efficiency can be improved.

さらに、オブジェクト領域ＯＲを生成する際の解像度と、ブロック領域ＢＲを生成する際の解像度が同一である必要はない。たとえば、ブロック領域ＢＲが、オブジェクト領域ＯＲの画像よりも解像度を高くするようにしてもよい。これは、ブロック領域ＢＲは上述したようにそれぞれ種類を識別する必要があるが、オブジェクト領域ＯＲに分割する際には大雑把に類似した領域に分けることを目的とするため、比較的低解像度の画像を利用しても目的は達成することができるためである。 Furthermore, the resolution for generating the object area OR and the resolution for generating the block area BR do not have to be the same. For example, the block area BR may have a higher resolution than the image of the object area OR. This is because it is necessary to identify the type of each of the block regions BR as described above. However, since the purpose is to roughly divide the block region BR into regions similar to each other, the relatively low-resolution image This is because the purpose can be achieved even if is used.

また、図１において、ブロック領域生成手段１０により生成されたブロック領域ＢＲをそのままブロック領域識別手段３０に送るようにしているが、ブロック領域ＢＲ毎の判定結果に対してたとえばモフォロジー処理やＣｌｏｓｉｎｇ演算等の平滑化処理を行った後にブロック領域識別手段３０に送るようにしてもよい。これにより、ブロック領域ＢＲ内に含まれる孤立したノイズ的な要素が切り捨てられて、種類識別の精度の向上を図ることができる。 In FIG. 1, the block area BR generated by the block area generation means 10 is sent as it is to the block area identification means 30. For example, morphology processing, closing operation, etc. are performed on the determination result for each block area BR. After performing the smoothing process, it may be sent to the block area identifying means 30. As a result, the isolated noisy elements included in the block region BR are discarded, and the accuracy of type identification can be improved.

本発明のオブジェクト識別装置の第１の実施の形態を示すブロック図The block diagram which shows 1st Embodiment of the object identification device of this invention 本発明のオブジェクト識別装置において、画像に含まれるオブジェクト毎に種類が識別される様子を示す図The figure which shows a mode that a kind is identified for every object contained in an image in the object identification device of this invention. 本発明のオブジェクト識別装置におけるオブジェクト領域生成手段の一例を示すブロック図The block diagram which shows an example of the object area | region production | generation means in the object identification apparatus of this invention 図２のオブジェクト領域生成手段により画像が領域分割される様子を示す図The figure which shows a mode that an image is divided | segmented into an area | region by the object area | region production | generation means of FIG. 図２のオブジェクト領域生成手段によりクラスタリング領域が統合されてオブジェクト領域が形成される様子を示す図The figure which shows a mode that a clustering area | region is integrated by the object area | region production | generation means of FIG. 2, and an object area | region is formed. 本発明のオブジェクト識別装置における特徴量抽出手段の一例を示すブロック図The block diagram which shows an example of the feature-value extraction means in the object identification device of this invention 本発明のオブジェクト識別装置における距離画像生成手段における距離画像の生成の様子を示すブロック図The block diagram which shows the mode of the production | generation of the distance image in the distance image generation means in the object identification device of this invention 本発明のオブジェクト識別装置における写像手段および種類出力手段の一例を示すブロック図The block diagram which shows an example of the mapping means and kind output means in the object identification apparatus of this invention 本発明のオブジェクト識別装置における種類頻度分布マップの一例を示すブロック図The block diagram which shows an example of the kind frequency distribution map in the object identification apparatus of this invention 本発明のオブジェクト識別方法の好ましい実施の形態を示すフローチャートThe flowchart which shows preferable embodiment of the object identification method of this invention 本発明のオブジェクト識別装置の第２の実施の形態を示すブロック図The block diagram which shows 2nd Embodiment of the object identification device of this invention 本発明のオブジェクト識別装置における写像手段の別の一例を示すブロック図The block diagram which shows another example of the mapping means in the object identification apparatus of this invention 図１のブロック領域生成手段の別の生成方法の一例を示す模式図The schematic diagram which shows an example of another production | generation method of the block area production | generation means of FIG. 図１のブロック領域生成手段の別の生成方法の一例を示す模式図The schematic diagram which shows an example of another production | generation method of the block area production | generation means of FIG. 本発明のオブジェクト識別装置の第３の実施の形態を示すブロック図The block diagram which shows 3rd Embodiment of the object identification device of this invention 図１５のオブジェクト識別装置におけるブロック領域の生成方法の一例を示す模式図FIG. 15 is a schematic diagram showing an example of a block area generation method in the object identification device of FIG. 図１５のオブジェクト識別装置におけるブロック領域の生成方法の一例を示すフローチャートThe flowchart which shows an example of the production | generation method of the block area | region in the object identification apparatus of FIG. 図１５のオブジェクト識別装置におけるブロック領域の生成方法の一例を示すフローチャートThe flowchart which shows an example of the production | generation method of the block area | region in the object identification apparatus of FIG. 本発明のオブジェクト識別装置における特徴量抽出手段の別の実施の形態を示すブロック図The block diagram which shows another embodiment of the feature-value extraction means in the object identification device of this invention 図１９の画像生成手段において使用されるエッジフィルターの一例を示す図The figure which shows an example of the edge filter used in the image generation means of FIG. 図１９の相関特徴量抽出手段の動作例を示すフローチャートThe flowchart which shows the operation example of the correlation feature-value extraction means of FIG. 図１９の相関特徴量抽出手段における相互相関関数の一例を示すグラフ図FIG. 19 is a graph showing an example of a cross-correlation function in the correlation feature quantity extraction unit of FIG. 本発明のオブジェクト識別装置の第４の実施の形態を示すブロック図The block diagram which shows 4th Embodiment of the object identification device of this invention 本発明のオブジェクト識別装置の第５の実施の形態を示すブロック図The block diagram which shows 5th Embodiment of the object identification device of this invention

Explanation of symbols

１、３００、５００、６００オブジェクト識別装置
１０ブロック領域生成手段
２０オブジェクト領域生成手段
３０ブロック領域識別手段
３０ブロック領域識別手段
３０種類識別手段
４０、１４０特徴量抽出手段
４１変換手段
４２平均値算出手段
４３ウェーブレット変換手段
４４平均値算出手段
４５最大値算出手段
４６距離画像生成手段
４７ウェーブレット変換手段
４８平均値算出手段
４９最大値算出手段
５０写像手段
６０種類出力手段
７０オブジェクト識別手段
１００特徴量分類手段
１０１領域分割手段
１１０領域統合手段
１１１データベース
１１２最小クラスタ領域抽出手段
１１３統合領域判断手段
１３０ブロック領域生成手段
１４０特徴量抽出手段
１４１画像変換手段
１４２エッジ画像生成手段
１４３相関特徴量抽出手段
１４４エッジ特徴量抽出手段
１４５色特徴量抽出手段
１５０写像手段
２００オブジェクト識別装置
２０１ブロック領域生成手段
ＢＲブロック領域
ＢＲ１第１ブロック領域
ＢＲ２第２ブロック領域
ＫＤＭ種類頻度分布マップ
ＫＩ種類ベクトル
ＯＲオブジェクト領域
Ｐ画像
ＳＯＭ自己組織化マップ（２次元空間） 1, 300, 500, 600 Object identification device 10 Block area generation means 20 Object area generation means 30 Block area identification means 30 Block area identification means 30 Type identification means 40, 140 Feature quantity extraction means 41 Conversion means 42 Average value calculation means 43 Wavelet transform means 44 Average value calculation means 45 Maximum value calculation means 46 Distance image generation means 47 Wavelet transform means 48 Average value calculation means 49 Maximum value calculation means 50 Mapping means 60 Type output means 70 Object identification means 100 Feature quantity classification means 101 Region Dividing unit 110 Region integrating unit 111 Database 112 Minimum cluster region extracting unit 113 Integrated region determining unit 130 Block region generating unit 140 Feature amount extracting unit 141 Image converting unit 142 Edge image generating unit 143 Correlation feature amount extraction Means 144 Edge feature amount extraction means 145 Color feature amount extraction means 150 Mapping means 200 Object identification device 201 Block area generation means BR Block area BR1 First block area BR2 Second block area KDM Kind frequency distribution map KI Kind vector OR Object area P Image SOM Self-organizing map (2D space)

Claims

In an object identification method for identifying the type of object included in an image,
Generating an object area obtained by dividing the image for each object, and a plurality of block areas obtained by dividing the image into a plurality of areas smaller than the object area, each having a set number of pixels;
Identify each type for each of the plurality of generated block areas,
Totalize the types of the identified block areas for each object area,
An object identification method comprising: identifying a type of the object region using a totaled result.

In an object identification device for identifying the type of object included in an image,
Object region generation means for dividing the image into regions for each object to generate a plurality of object regions;
A block area generating unit configured to divide the image into a plurality of areas smaller than the object area, each having a set number of pixels, and generating a plurality of block areas;
A block area identifying means for identifying a type for each of the plurality of block areas generated by the block area generating means;
Object identification apparatus comprising: object identification means that aggregates the types of the block areas identified for each of the block areas for each object area, and identifies the types of the objects using the aggregated results .

The block area identification means;
Feature quantity extraction means for extracting a plurality of block feature quantities from the block region;
Mapping means for mapping a plurality of the block feature values extracted by the feature value extraction means on a two-dimensional space;
A type frequency distribution map in which a type is defined for each coordinate in the two-dimensional space, and the type in which the coordinates in the two-dimensional space mapped by the mapping unit indicate on the type frequency distribution map The object identification device according to claim 2, further comprising: a type output unit that outputs a type.

The object identification apparatus according to claim 3, wherein the two-dimensional space is a self-organizing map in which a plurality of neurons having a learning function are arranged in a matrix.

5. The object identification device according to claim 3, wherein the feature amount extraction unit extracts a color component, a brightness component, and an image feature component of the block area as the block feature amount. .

The type output means has, for each type, the type frequency distribution map in which the type frequency value is defined as the type index for each coordinate of the self-organizing map, and the coordinates detected by the mapping means are 6. The object identification device according to claim 3, wherein a type vector having a plurality of frequency values shown on each type frequency distribution map as a vector component is output.

7. The object identifying apparatus according to claim 6, wherein the type output means outputs a type that is the largest maximum vector component among vector components of the type vector.

8. The type output unit outputs the fact that the type of the block area is unknown when the maximum vector component is smaller than a predetermined maximum component threshold value. Object identification device.

The block area generating means generates a plurality of first block areas obtained by dividing the image into a mesh shape, and a second block area having a phase shifted from the plurality of first block areas in a mesh shape. 9. The object identification device according to claim 2, wherein the object identification device includes:

The block area generation unit has a function of causing the object area to scan a cutting frame having the set number of pixels and generating the block area from an image surrounded by the cutting frame. The object identification device according to any one of claims 2 to 9.

The block area generating means has a function of generating a plurality of resolution conversion images having different resolutions from the image, and has a function of generating the block area from the generated plurality of resolution conversion images, respectively. The object identification device according to any one of claims 2 to 10, wherein the object identification device is characterized in that:

A correlation feature amount, wherein the feature amount extraction unit extracts, as a correlation feature amount, whether or not a variation pattern of a vertical or horizontal signal value of the pixel has a correlation between pixel lines in the block region. 12. The object identification device according to claim 3, further comprising an extraction unit.

The correlation feature quantity extraction unit is configured to extract the correlation feature quantity along a vertical direction of the block area and the correlation feature quantity along a horizontal direction of the block area. 13. The object identification device according to 12.

The correlation feature quantity extraction unit outputs a correlation value indicating a correlation between the two pixel lines from component signal values of a plurality of pixels constituting the two pixel lines formed in the same direction in the block region. Having a cross-correlation function of
The plurality of correlation values are acquired by inputting the component signal value of the pixel to the cross-correlation function while shifting one of the two pixel lines pixel by pixel in the formation direction of the pixel line. The largest maximum correlation value is calculated from a plurality of correlation values,
The maximum correlation value is calculated for all combinations of the pixel lines formed in the same direction of the block region, and the average value and standard deviation of all the calculated maximum correlation values are extracted as correlation feature amounts. The object identification device according to claim 12, wherein the object identification device is provided.

15. The feature amount extraction unit includes an edge feature amount extraction unit that extracts an edge feature amount indicating a feature of an edge component in a vertical direction and a horizontal direction of the block region. The object identification device according to any one of the above.

16. The feature amount extraction unit according to claim 3, wherein the feature amount extraction unit includes a color feature amount extraction unit that extracts a color feature amount indicating a feature of a color component of the block region. Object identification device.

On the computer,
A procedure for generating an object area obtained by dividing an image for each object, and a plurality of block areas obtained by dividing the image into a plurality of areas smaller than the object area, each having a set number of pixels
A procedure for identifying the type for each of the plurality of generated block areas,
A procedure for totalizing the types of the identified block areas for each object area;
An object identification program for executing a procedure for identifying the type of the object area using the totaled result.