JP2020087037A

JP2020087037A - Identification apparatus and identification method

Info

Publication number: JP2020087037A
Application number: JP2018221561A
Authority: JP
Inventors: 正雄山中; Masao Yamanaka
Original assignee: Toyota Motor Corp
Current assignee: Toyota Motor Corp
Priority date: 2018-11-27
Filing date: 2018-11-27
Publication date: 2020-06-04

Abstract

To determine existence of an abnormal object based on an image with satisfactory accuracy.SOLUTION: The present invention is directed to an identification apparatus for determining existence of an abnormal object included in an image. The identification apparatus has a first type identifier for estimating a label of each of a plurality of images included in an input image based on a learning result of characteristics of pixels included in a first image which is an image having no abnormal object and for outputting a first probability distribution representing probability distribution of the plurality of labels for every pixel, a second type identifier for generating a second probability distribution from the first probability distribution based on the learning results of relationship of the labels at the surrounding of a plurality of pixels included in the first image, and a determining means for determining existence of the abnormal object in the second image based on discrepancy between the first probability distribution obtained by inputting the second image into the first type identifier and the second probability distribution obtained by inputting the first probability distribution into the second type identifier.SELECTED DRAWING: Figure 1A

Description

本発明は、画像に基づいた異常判定技術に関する。 The present invention relates to an image-based abnormality determination technique.

路上障害物（車両走行の妨げとなる、路上に放置された未知の物体）を検出する方法として、たとえば、非特許文献１に記載されたような、ミリ波レーダーを用いた方法が提案されている。非特許文献1に記載の技術によれば、時間帯や天候を問わず、かつ遠距離で
あっても路上障害物を安定的に検出することができる。 As a method of detecting an obstacle on the road (an unknown object left on the road that hinders vehicle traveling), for example, a method using a millimeter wave radar as described in Non-Patent Document 1 has been proposed. There is. According to the technique described in Non-Patent Document 1, it is possible to stably detect an obstacle on the road regardless of the time zone and the weather and even at a long distance.

また、路上障害物を検出する方法として、たとえば、非特許文献２および３に記載されたような、カメラを用いた方法が提案されている。非特許文献２に記載の技術では、画像に基づいて学習させた識別器によって路上障害物を検出している。また、非特許文献３に記載の技術では、過去と現在の画像とで、時間・空間方向の位相を合わせこみ、その差分から路上障害物を検出している。 Further, as a method for detecting an obstacle on the road, for example, a method using a camera as described in Non-Patent Documents 2 and 3 has been proposed. In the technique described in Non-Patent Document 2, a road obstacle is detected by a discriminator learned based on an image. Further, in the technique described in Non-Patent Document 3, the phases in the time and space directions are matched between the past and present images, and a road obstacle is detected from the difference between them.

また、非特許文献４に記載されたような、カメラとミリ波レーダーを組み合わせ、路上障害物を検出する試みがある。 There is also an attempt to detect an obstacle on the road by combining a camera and a millimeter wave radar as described in Non-Patent Document 4.

稲葉敬之，桐本哲郎，"車載用ミリ波レーダー，自動車技術vol.64,no.2，pp.74-79,Feb.2010.89.Takayuki Inaba, Tetsuro Kirimoto, "In-vehicle millimeter-wave radar, Automotive Technology vol.64, no.2, pp.74-79, Feb.2010.89. Joint HOG特徴量を用いた2段階AdaBoostによる車両検出, 2009, 電子情報通信学会論文誌, 尾崎・藤吉他（中部大学）Vehicle detection by 2-step AdaBoost using Joint HOG feature, 2009, Transactions of the Institute of Electronics, Information and Communication Engineers, Ozaki and Fujiyoshi et al. (Chubu University) 過去の車載カメラ映像との時空間差分による不特定障害物検知に関する検討, 2012,画像の認識・理解シンポジウム(MIRU2012)論文集,久徳・村瀬他（名古屋大学）A Study on Unspecified Obstacle Detection by Spatio-Temporal Difference from Past Vehicle Images, 2012, Symposium on Image Recognition and Understanding (MIRU2012), Kutoku and Murase et al. (Nagoya University) Integrating Milliｍeter Wave Radar with a Monocular Vision Sensor for On-Road Obstacle Detection Applications(西安交通大学), 2011Integrating Millimeter Wave Radar with a Monocular Vision Sensor for On-Road Obstacle Detection Applications (Xi'an Jiaotong University), 2011

非特許文献１に記載された、ミリ波レーダーを用いた方法では、検出精度および検出範囲において課題がある。ミリ波レーダーを用いた方法の場合、一般的に解像度が低く、レーダーの照射範囲も狭いため、サイズが小さい、もしくは、道路の端にある障害物の検出が困難なためである。 The method using the millimeter wave radar described in Non-Patent Document 1 has problems in detection accuracy and detection range. This is because the method using a millimeter wave radar generally has a low resolution and a narrow irradiation range of the radar, so that the size is small or it is difficult to detect an obstacle at the end of the road.

また、非特許文献２に記載されたような、機械学習による路上障害物の検出においては、予め学習していない物体を識別できないという問題がある。
また、非特許文献３に記載されたような、カメラ画像による手法では、先行車両や対向車両、その他の移動物体を路上障害物と誤判定してしまうという問題がある。
非特許文献４に記載された手法では、高精度に路上障害物を検出することができるが、ミリ波レーダーとカメラを併用するため、装置の設置および運用にかかるコストが高くなってしまう。 Further, in the detection of a road obstacle by machine learning as described in Non-Patent Document 2, there is a problem that an object that has not been learned in advance cannot be identified.
Moreover, the method using a camera image as described in Non-Patent Document 3 has a problem that a preceding vehicle, an oncoming vehicle, and other moving objects are erroneously determined as road obstacles.
The method described in Non-Patent Document 4 can detect an obstacle on the road with high accuracy, but since the millimeter-wave radar and the camera are used together, the cost for installing and operating the device increases.

本発明は上記の問題点を考慮してなされたものであり、画像に基づいて異常物体の存在を精度よく判定できる識別装置を提供することを目的とする。 The present invention has been made in consideration of the above problems, and an object of the present invention is to provide an identification device that can accurately determine the presence of an abnormal object based on an image.

本発明の一様態に係る識別装置は、
画像に含まれる異常物体の存在を判定する識別装置であって、前記異常物体を含まない画像である第一の画像に含まれる画素の特徴を学習した結果に基づいて、入力画像に含まれる複数の画素のそれぞれに対してラベルを推定し、複数のラベルの確率分布を画素ごとに表した第一の確率分布を出力する第一タイプ識別器と、前記第一の画像に含まれる複数の画素の周辺におけるラベルの関係性を学習した結果に基づいて、前記第一の確率分布から、第二の確率分布を生成する第二タイプ識別器と、第二の画像を前記第一タイプ識別器に入力して得られた前記第一の確率分布と、前記第一の確率分布を前記第二タイプ識別器に入力して得られた前記第二の確率分布と、の解離度に基づいて、前記第二の画像中に前記異常物体が存在することを判定する判定手段と、を有する。 An identification device according to one aspect of the present invention,
An identification device for determining the presence of an abnormal object included in an image, wherein a plurality of images included in an input image are obtained based on a result of learning characteristics of pixels included in a first image that does not include the abnormal object. A first type classifier that estimates a label for each of the pixels and outputs a first probability distribution that represents the probability distribution of a plurality of labels for each pixel, and a plurality of pixels included in the first image Based on the result of learning the relationship of labels in the vicinity of, from the first probability distribution, a second type discriminator that generates a second probability distribution, and a second image to the first type discriminator. Based on the dissociation degree of the first probability distribution obtained by input, and the second probability distribution obtained by inputting the first probability distribution to the second type discriminator, the Determination means for determining the presence of the abnormal object in the second image.

また、本発明に係る識別方法は、
画像に含まれる異常物体の存在を判定する識別方法であって、前記異常物体を含まない画像である第一の画像に含まれる画素の特徴を学習した結果に基づいて、入力画像に含まれる複数の画素のそれぞれに対してラベルを推定し、複数のラベルの確率分布を画素ごとに表した第一の確率分布を出力する第一の識別ステップと、前記第一の画像に含まれる複数の画素の周辺におけるラベルの関係性を学習した結果に基づいて、前記第一の確率分布から、第二の確率分布を生成する第二の識別ステップと、第二の画像を前記第一の識別ステップで処理して得られた前記第一の確率分布と、前記第一の確率分布を前記第二の識別ステップで処理して得られた前記第二の確率分布と、の解離度に基づいて、前記第二の画像中に前記異常物体が存在することを判定する判定ステップと、を含む。 Further, the identification method according to the present invention,
An identification method for determining the presence of an abnormal object included in an image, wherein a plurality of images included in an input image are obtained based on a result of learning characteristics of pixels included in a first image that does not include the abnormal object. A first identification step of estimating a label for each of the pixels, and outputting a first probability distribution representing the probability distribution of a plurality of labels for each pixel; and a plurality of pixels included in the first image. Based on the result of learning the relationship of labels in the vicinity of, from the first probability distribution, a second identification step of generating a second probability distribution, and a second image in the first identification step. Based on the dissociation degree of the first probability distribution obtained by processing, and the second probability distribution obtained by processing the first probability distribution in the second identification step, the A determination step of determining that the abnormal object is present in the second image.

また、本発明の別様態に係る識別装置は、
画像に含まれる異常物体の存在を判定する識別装置であって、前記異常物体を含まない画像である第一の画像に含まれる画素の特徴を学習した結果に基づいて、入力画像に含まれる複数の画素のそれぞれに対してラベルを推定し、複数のラベルの確率分布を画素ごとに表した第一の確率分布を出力する第一タイプ識別器と、前記第一の画像を前記第一タイプ識別器に入力した場合に前記第一タイプ識別器が出力する前記第一の確率分布の傾向を学習した結果に基づいて、前記第一の確率分布から、第二の確率分布を生成する第二タイプ識別器と、第二の画像を前記第一タイプ識別器に入力して得られた前記第一の確率分布と、前記第一の確率分布を前記第二タイプ識別器に入力して得られた前記第二の確率分布と、の解離度に基づいて、前記第二の画像中に前記異常物体が存在することを判定する判定手段と、を有する。 Further, an identification device according to another aspect of the present invention,
An identification device for determining the presence of an abnormal object included in an image, wherein a plurality of images included in an input image are obtained based on a result of learning characteristics of pixels included in a first image that does not include the abnormal object. A first type discriminator that estimates a label for each of the pixels and outputs a first probability distribution that represents the probability distribution of a plurality of labels for each pixel, and identifies the first image by the first type discriminator. Type that generates a second probability distribution from the first probability distribution based on the result of learning the tendency of the first probability distribution output by the first type discriminator when input to the device Discriminator, the first probability distribution obtained by inputting a second image to the first type discriminator, and obtained by inputting the first probability distribution to the second type discriminator Determination means for determining the presence of the abnormal object in the second image based on the dissociation degree of the second probability distribution.

本発明によれば、画像に基づいて異常物体の存在を精度よく判定できる識別装置を提供することができる。 According to the present invention, it is possible to provide an identification device that can accurately determine the presence of an abnormal object based on an image.

第一の実施形態に係る識別装置の機能構成図。The functional block diagram of the identification device which concerns on 1st embodiment. 第一の実施形態に係る識別装置の機能構成図。The functional block diagram of the identification device which concerns on 1st embodiment. 画像取得部によって取得される画像の例。An example of the image acquired by the image acquisition unit. 入力された画像にラベルを付与した画像の例。An example of an image in which a label is added to the input image. 第一識別器が出力するデータを説明する図。The figure explaining the data which a 1st discriminator outputs. 第二識別器が出力するデータを説明する図The figure explaining the data which the second discriminator outputs. ランダムフォレストによる決定木を説明する図。The figure explaining the decision tree by a random forest. 第二識別器が学習するラベルの共起性を説明する図。The figure explaining the co-occurrence of the label which a 2nd discriminator learns. 第一の実施形態（学習モード）において行う処理のフローチャート。The flowchart of the process performed in 1st embodiment (learning mode). 第一の実施形態（評価モード）において行う処理のフローチャート。The flowchart of the process performed in 1st embodiment (evaluation mode). 路上障害物を可視化した画像の例。An example of a visual image of a road obstacle. 第二の実施形態に係る識別装置の機能構成図。The functional block diagram of the identification device which concerns on 2nd embodiment. 第二の実施形態に係る識別装置の機能構成図。The functional block diagram of the identification device which concerns on 2nd embodiment. 第三識別器が学習するラベルの類似性を説明する図。The figure explaining the similarity of the label which a 3rd discriminator learns. 第二の実施形態（学習モード）において行う処理のフローチャート。The flowchart of the process performed in 2nd embodiment (learning mode). 第二の実施形態（評価モード）において行う処理のフローチャート。The flowchart of the process performed in 2nd embodiment (evaluation mode).

本発明に係る識別装置は、車両の前方を撮像して得られた画像に基づいて、当該車両の前方にある路上障害物（車両の走行に支障を与えうる未知の物体。本明細書では異常物体ともいう）の有無を識別する装置である。具体的には、路上障害物が無い状況を撮像して得られた画像を用いて機械学習を行い、学習の結果を用いて、路上障害物の有無が未知である画像を評価する。 An identification device according to the present invention is based on an image obtained by capturing an image of the front of a vehicle, and a road obstacle (an unknown object that may hinder the running of the vehicle in front of the vehicle. It is also a device for identifying the presence or absence of an object. Specifically, machine learning is performed using an image obtained by capturing a situation where there is no road obstacle, and an image in which the presence or absence of a road obstacle is unknown is evaluated using the learning result.

本発明に係る識別装置は、異常物体を含まない画像である第一の画像に含まれる画素の特徴を学習した結果に基づいて、入力画像に含まれる複数の画素のそれぞれに対してラベルを推定し、複数のラベルの確率分布を画素ごとに表した第一の確率分布を出力する第一タイプ識別器を用いる。 An identification device according to the present invention estimates a label for each of a plurality of pixels included in an input image, based on a result of learning characteristics of pixels included in a first image that is an image that does not include an abnormal object. Then, a first type discriminator that outputs a first probability distribution that represents the probability distributions of a plurality of labels for each pixel is used.

一方、このように、画素の特徴に基づいてラベルの推定を行う識別器は、学習していない物体を検出できないという問題がある。
この問題を解決するため、本発明に係る識別装置は、（１）第一の画像に含まれる複数の画素の周辺におけるラベルの関係性を学習した結果に基づいて、前記第一の確率分布から、第二の確率分布を生成する第二タイプ識別器を併用し、（２）第二の画像を前記第一タイプ識別器に入力して得られた前記第一の確率分布と、前記第一の確率分布を前記第二タイプ識別器に入力して得られた前記第二の確率分布と、の解離度に基づいて、前記第二の画像中に前記異常物体が存在することを判定する。 On the other hand, as described above, there is a problem that the classifier that estimates the label based on the feature of the pixel cannot detect the unlearned object.
In order to solve this problem, the identification device according to the present invention uses (1) the first probability distribution based on the result of learning the relationship of labels around a plurality of pixels included in the first image. , A second type discriminator that generates a second probability distribution is also used, and (2) the first probability distribution obtained by inputting a second image to the first type discriminator, and the first It is determined that the abnormal object is present in the second image based on the dissociation degree of the second probability distribution obtained by inputting the probability distribution of the second type discriminator to the second type discriminator.

なお、前記ラベルの関係性は、前記ラベルの共起性または類似性であることを特徴としてもよい。 The relationship between the labels may be co-occurrence or similarity of the labels.

第二タイプ識別器は、異常物体を含まない画像（第一の画像）における、ある画素の周辺におけるラベルの関係性（例えば、ラベルの共起性や類似度など）を学習させた識別器である。第二タイプ識別器は、第一の確率分布を入力として、上記の学習結果を適用した第二の確率分布を出力可能に構成される。
第二タイプ識別器によって評価された第二の確率分布が、第一の確率分布から乖離しているということは、第一タイプ識別器が、異常物体を含まない画像（第一の画像）に対して出力した確率分布とは異なる傾向を持つ確率分布を出力していることを意味する。従って、第一の確率分布と第二の確率分布との解離度に基づいて、第二の画像中における異常物体の存在尤度が推定できる。 The second type discriminator is a discriminator that has learned label relationships (eg, label co-occurrence and similarity) around a pixel in an image that does not include an abnormal object (first image). is there. The second type classifier is configured to be able to output the second probability distribution to which the above learning result is applied, with the first probability distribution as an input.
The second probability distribution evaluated by the second type discriminator deviates from the first probability distribution, which means that the first type discriminator does not include an abnormal object in the image (first image). This means that a probability distribution that has a tendency different from the probability distribution that is output is output. Therefore, the existence likelihood of an abnormal object in the second image can be estimated based on the dissociation degree between the first probability distribution and the second probability distribution.

また、前記第二タイプ識別器は、前記第一の画像を前記第一タイプ識別器に入力して得られる前記第一の確率分布を学習データとして学習した識別器であることを特徴としてもよい。
また、前記第二タイプ識別器は、前記第一の画像を前記第一タイプ識別器に入力して得られる前記第一の確率分布を学習データとして、前記第一の画像に含まれる複数の画素の周辺における前記ラベルの類似性を学習した識別器であることを特徴としてもよい。 Further, the second type classifier may be a classifier that has learned the first probability distribution obtained by inputting the first image to the first type classifier as learning data. ..
Further, the second type discriminator, the first probability distribution obtained by inputting the first image to the first type discriminator as learning data, a plurality of pixels included in the first image. The discriminator may be a classifier that learns the similarity of the labels around the.

このような学習データを用いて、前述したラベルの関係性を学習させることで、第一タ
イプ識別器が第一の画像に対して出力する確率分布の傾向を学習させることができる。 By learning the above-described label relationships using such learning data, the tendency of the probability distribution output by the first type classifier for the first image can be learned.

また、前記第二タイプ識別器は、前記第一の画像に正解ラベルを付与した第一のラベル画像を学習データとして、前記第一の画像に含まれる複数の画素の周辺における前記ラベルの共起性を学習した識別器であることを特徴としてもよい。
第二タイプ識別器は、確率分布ではなく、正解ラベルが付与された画像によって学習を行わせてもよい。このような構成によっても、第一タイプ識別器が第一の画像に対して出力する確率分布の傾向を学習させることができる。 The second type discriminator uses the first label image in which the correct label is given to the first image as learning data, and the label co-occurrence around a plurality of pixels included in the first image. It may be characterized by being a classifier that has learned the sex.
The second type classifier may perform learning by using an image to which a correct answer label is added instead of the probability distribution. Even with such a configuration, the tendency of the probability distribution output by the first type classifier for the first image can be learned.

以下、本発明の具体的な実施形態について図面に基づいて説明する。各実施形態に記載されているハードウェア構成、モジュール構成、機能構成等は、特に記載がない限りは発明の技術的範囲をそれらのみに限定する趣旨のものではない。 Hereinafter, specific embodiments of the present invention will be described with reference to the drawings. The hardware configuration, module configuration, functional configuration, etc. described in each embodiment are not intended to limit the technical scope of the invention to these unless otherwise specified.

（第一の実施形態）
第一の実施形態に関して、図面を参照して説明する。図１Ａおよび図１Ｂは、第一の実施形態に係る識別装置１０の機能構成を示すブロック図である。
本実施形態に係る識別装置１０は、入力された画像に基づいて、当該画像に含まれる路上障害物の有無を判定し、判定結果を出力する装置である。 (First embodiment)
The first embodiment will be described with reference to the drawings. 1A and 1B are block diagrams showing a functional configuration of the identification device 10 according to the first embodiment.
The identification device 10 according to the present embodiment is a device that determines the presence or absence of a road obstacle included in the image based on the input image and outputs the determination result.

識別装置１０は、複数の識別器を有しており、当該識別器を学習させる学習モードと、学習の結果に基づいて画像の評価を行う評価モードに切り替え可能に構成される。以降の説明において、識別器を学習させる処理を学習処理と称し、学習した識別器を用いて画像を評価する処理を評価処理と称する。また、学習に用いる画像を学習用画像と称し、評価に用いる画像を評価用画像と称する。学習用画像は、路上障害物が存在しない状況において車載カメラから取得した画像であり、評価用画像は、路上障害物の存在が不明である状況において車載カメラから取得した画像である。
図１Ａは、学習モードにおけるデータの流れを示した図であり、図１Ｂは、評価モードにおけるデータの流れを示した図である。 The classification device 10 has a plurality of classifiers and is configured to be switchable between a learning mode for learning the classifiers and an evaluation mode for evaluating an image based on the learning result. In the following description, the process of learning the classifier is referred to as a learning process, and the process of evaluating an image using the learned classifier is referred to as an evaluation process. An image used for learning is referred to as a learning image, and an image used for evaluation is referred to as an evaluation image. The learning image is an image acquired from the vehicle-mounted camera in the situation where there is no road obstacle, and the evaluation image is an image acquired from the vehicle-mounted camera in the situation where the existence of the road obstacle is unknown.
FIG. 1A is a diagram showing a data flow in the learning mode, and FIG. 1B is a diagram showing a data flow in the evaluation mode.

本実施形態に係る識別装置１０は、半導体集積回路（ＬＳＩ）を用いて実現できる。識別装置１０は、画像取得部１１、ラベル付与部１２、識別部１３、異常判定部１４を有して構成される。また、識別部１３は、第一識別器１３１および第二識別器１３２を有して構成される。これらの構成要素はソフトウェアモジュールによって実現され、識別装置１０が果たす機能にそれぞれ対応している。 The identification device 10 according to the present embodiment can be realized by using a semiconductor integrated circuit (LSI). The identification device 10 includes an image acquisition unit 11, a labeling unit 12, an identification unit 13, and an abnormality determination unit 14. Further, the identification unit 13 is configured to include a first identification device 131 and a second identification device 132. These components are realized by software modules and correspond to the functions performed by the identification device 10, respectively.

画像取得部１１は、車両前方を撮像して得られた画像を取得する手段である。画像取得部１１は、例えば、車両の前方に向けて設置されたカメラ（車載カメラ）から画像を取得する。図２は、画像取得部１１によって取得される画像の例である。本実施形態では、ある時刻ｔにおいて取得した画像を、Ｉ（ｔ）と表現する。 The image acquisition unit 11 is a unit that acquires an image obtained by imaging the front of the vehicle. The image acquisition unit 11 acquires an image from, for example, a camera (vehicle-mounted camera) installed toward the front of the vehicle. FIG. 2 is an example of an image acquired by the image acquisition unit 11. In the present embodiment, the image acquired at a certain time t is expressed as I(t).

ラベル付与部１２は、画像取得部１１が取得した画像に対してラベルを付与する手段である。
ここで、ラベルについて説明する。本実施形態では、「画像に含まれる複数の画素のそれぞれが何に対応しているか」をラベルによって表す。図３は、例示した画像にラベルを付与した例である。ラベル付与部１２は、所定の意味を持つラベル（意味的ラベルとも言う）を予めＭ種類定義し、当該ラベルｍを、対象の画像に対して画素単位で付与する。
例えば、Ｍが７である場合、ｍは以下のように定義することができる。
１：空，２：自然，３：他車両，４：建築物，５：白線，６：道路，７：自車両
本実施形態では、ラベル付与部１２は、入力された画像をユーザに提示し、当該ユーザからラベルの付与結果を取得する。以降、画像に対してラベルが付与された結果をラベル
画像と称する。図３がラベル画像の例である。以降、画像Ｉ（ｔ）に対応するラベル画像をＬ（ｔ）と表現する。 The label attaching unit 12 is a unit that attaches a label to the image acquired by the image acquiring unit 11.
Here, the label will be described. In the present embodiment, “what each of the plurality of pixels included in the image corresponds to” is represented by a label. FIG. 3 is an example in which a label is added to the illustrated image. The label assigning unit 12 defines M types of labels (also referred to as semantic labels) having a predetermined meaning in advance, and assigns the label m to the target image in pixel units.
For example, if M is 7, then m can be defined as:
1: Sky, 2: Nature, 3: Other vehicle, 4: Building, 5: White line, 6: Road, 7: Own vehicle In the present embodiment, the labeling unit 12 presents the input image to the user. , The label assignment result is obtained from the user. Hereinafter, the result of attaching a label to an image will be referred to as a label image. FIG. 3 is an example of the label image. Hereinafter, the label image corresponding to the image I(t) will be expressed as L(t).

次に、識別部１３について説明する。識別部１３は、第一識別器１３１と第二識別器１３２を有して構成される。 Next, the identification unit 13 will be described. The discriminating unit 13 is configured to include a first discriminator 131 and a second discriminator 132.

第一識別器１３１は、本発明における第一タイプ識別器であり、画像に含まれる画素の特徴と、前述した複数のラベルとの関係性を学習する識別器である。第一識別器１３１は、複数の学習用画像（路上障害物を含まない画像。以下、正常画像とも言う）と、当該学習用画像に対してラベル（正解ラベル）を付与した結果得られる複数のラベル画像と、を用いて学習される。 The first discriminator 131 is the first type discriminator in the present invention, and is a discriminator that learns the relationship between the characteristics of the pixels included in the image and the plurality of labels described above. The first discriminator 131 includes a plurality of learning images (images that do not include road obstacles; hereinafter, also referred to as normal images) and a plurality of learning images that are obtained by assigning labels (correct labels) to the learning images. Label images are used for learning.

第一識別器１３１は、画像Ｉ（ｔ）の座標（ｘ，ｙ）周辺における特徴（色，エッジ強度，テクスチャなど）の統計的分布と、当該座標における意味的ラベルｍ（ｍ＝１，…，Ｍ）との関係性を学習することで得られる。
第一識別器１３１は、評価用画像を入力すると、当該画像Ｉ（ｔ）の座標（ｘ，ｙ）における意味的ラベルの確率分布を出力する。図４は、第一識別器１３１が出力するデータを説明する図である。図４は、画像Ｉ（ｔ）の、座標（ｘ，ｙ）における意味的ラベルの確率分布Ｐ_1,ｍを表している。本例では、座標（ｘ，ｙ）にある画素について、意味的ラベルが１、すなわち、空である可能性が最も高いことを表している。
第一識別器１３１は、ディープラーニングといった既知の学習手法を用いて学習させることができる。 The first classifier 131 has a statistical distribution of features (color, edge strength, texture, etc.) around the coordinates (x, y) of the image I(t), and the semantic label m (m=1,...) At the coordinates. , M) by learning the relationship with them.
When the evaluation image is input, the first discriminator 131 outputs the probability distribution of semantic labels at the coordinates (x, y) of the image I(t). FIG. 4 is a diagram illustrating the data output by the first discriminator 131. FIG. 4 shows the probability distribution P _1,m of the semantic label of the image I(t) at the coordinates (x,y). In this example, the pixel at the coordinate (x, y) has the highest semantic label of 1, that is, is most likely to be empty.
The first discriminator 131 can be trained using a known learning method such as deep learning.

なお、第一識別器１３１が出力した確率分布のうち、最も確率が高いラベルを各座標について選択することで、評価用画像に対応するラベル画像を得ることもできる。 Note that the label image corresponding to the evaluation image can be obtained by selecting the label with the highest probability among the probability distributions output by the first discriminator 131 for each coordinate.

第二識別器１３２は、本発明における第二タイプ識別器であり、正常画像中におけるラベルの共起性を学習する識別器である。第二識別器１３２は、複数のラベル画像を用いて学習される。 The second classifier 132 is the second type classifier in the present invention, and is a classifier that learns the co-occurrence of labels in a normal image. The second discriminator 132 is learned using a plurality of label images.

第二識別器１３２は、ラベル画像Ｌ（ｔ）の座標（ｘ，ｙ）における意味的ラベルｍ（ｍ＝１，…，Ｍ）と、当該座標の周辺における意味的ラベルｍの統計的分布との関係性（すなわち共起性）を学習することで得られる。
第二識別器１３２は、確率分布を入力すると、当該確率分布をラベル画像Ｌ（ｔ）に変換し、学習したラベルの共起性に基づいて、当該画像Ｌ（ｔ）の座標（ｘ，ｙ）における意味的ラベルの確率分布を出力する。図５は、第二識別器１３２が出力するデータを説明する図である。図５は、ラベル画像Ｌ（ｔ）の、座標（ｘ，ｙ）における意味的ラベルの確率分布Ｐ_2,ｍを表している。本例では、座標（ｘ，ｙ）にある画素について、周辺の画素との共起性（例えば、「"空"のラベルが付与された画素に囲まれた画素は"空"である可能性が高い」，「"空"のラベルが付与された画素の近傍にある画素は"空"か"自然"である可能性が高い」等）に基づいて、空である可能性が最も高いことを表している。
第二識別器１３２は、例えば、ランダムフォレストといった既知の学習手法を用いて学習させることができる。具体的な学習方法については後述する。 The second discriminator 132 calculates the semantic label m (m=1,..., M) at the coordinate (x, y) of the label image L(t) and the statistical distribution of the semantic label m around the coordinate. It is obtained by learning the relationship (that is, co-occurrence) of.
Upon inputting the probability distribution, the second classifier 132 converts the probability distribution into a label image L(t), and based on the learned co-occurrence of the label, coordinates (x, y) of the image L(t). ) Output the probability distribution of the semantic labels in. FIG. 5 is a diagram illustrating data output by the second discriminator 132. FIG. 5 shows the probability distribution P _2,m of the semantic label at the coordinates (x, y) of the label image L(t). In this example, with respect to the pixel at the coordinates (x, y), co-occurrence with surrounding pixels (for example, a pixel surrounded by pixels labeled with "empty" may be "empty") Is most likely to be empty", "Pixels in the vicinity of pixels labeled with "Sky" are likely to be "Sky" or "Natural"", etc.) Is represented.
The second classifier 132 can be trained using a known learning method such as random forest. A specific learning method will be described later.

第一識別器１３１が、画素の特徴を学習した結果によってラベルの確率分布を生成するのに対し、第二識別器１３２は、ある座標の周辺にある複数のラベルの共起性を学習した結果によってラベルの確率分布を生成する。
第二識別器１３２を学習させることで、確率分布を入力すると、当該確率分布をラベル画像に変換し、当該ラベル画像に含まれる画素のそれぞれについて、図５のような意味的ラベルの確率分布を出力する識別器を得ることができる。 The first discriminator 131 generates a probability distribution of labels according to the result of learning the characteristics of pixels, while the second discriminator 132 learns the co-occurrence of a plurality of labels around a certain coordinate. Generates a label probability distribution.
When the probability distribution is input by learning the second classifier 132, the probability distribution is converted into a label image, and the probability distribution of the semantic label as shown in FIG. 5 is calculated for each pixel included in the label image. An output discriminator can be obtained.

異常判定部１４は、第一識別器１３１が出力した確率分布と、第二識別器が出力した確率分布との解離度を算出し、当該解離度に基づいて、評価用画像に路上障害物が含まれている度合い（異常度）を算出する手段である。
異常判定部１４は、入力された評価用画像Ｉ（ｔ）の任意の点ｐ（ｘ，ｙ）における障害物らしさを示す尤度Ｓ（ｘ，ｙ）を算出し、算出結果に基づいて、評価用画像Ｉ（ｔ）に含まれる路上障害物を検出する。具体的な処理内容については後述する。 The abnormality determination unit 14 calculates the dissociation degree between the probability distribution output by the first discriminator 131 and the probability distribution output by the second discriminator, and based on the dissociation degree, road obstacles appear in the evaluation image. It is a means for calculating the degree of inclusion (abnormality).
The abnormality determination unit 14 calculates the likelihood S(x, y) indicating the likelihood of an obstacle at an arbitrary point p(x, y) of the input evaluation image I(t), and based on the calculation result, A road obstacle included in the evaluation image I(t) is detected. Specific processing contents will be described later.

次に、第二識別器１３２が、ラベル画像に含まれる複数のラベルの共起性を学習させる処理について、ランダムフォレストを例示して詳細に説明する。
ランダムフォレストとは、図６のような、決定木を弱識別器とする集団学習アルゴリズムの一種であり、複数のノードｒ（１，…，Ｒ）と、ノード間を接続するリンクとから構成される。最上位層にあるノードをルートノード、最下位層にあるノードをリーフノード、それ以外を単にノードと呼ぶ。 Next, the process in which the second classifier 132 learns the co-occurrence of a plurality of labels included in the label image will be described in detail by exemplifying a random forest.
Random forest is a kind of group learning algorithm using a decision tree as a weak discriminator as shown in FIG. 6, and is composed of a plurality of nodes r(1,..., R) and links connecting the nodes. It The node in the highest layer is called a root node, the node in the lowest layer is a leaf node, and the other nodes are simply called nodes.

各ノードには学習により、ノード内のサンプル（意味的ラベル画像Ｌ（ｔ）から無作為に抽出した画素ｐ（ｘ，ｙ））を左右のノードに振り分けるための判定条件Φ_r(ｒ＝１，…，Ｒ）と、識別対象のカテゴリｍに対する確率分布Ｐ_2,ｍ（ｒ）（ｒ＝１，…，Ｒ，ｍ＝１，…，Ｍ）とが格納される。
なお、識別対象のカテゴリｍは、前述したように、所定の範囲内（例えば、１，…，Ｍ（Ｍ＝７））で事前に設定すればよい。 A determination condition Φ _r (r=1) for allocating a sample in the node (pixel p(x, y) randomly extracted from the semantic label image L(t)) to the left and right nodes by learning in each node. ,..., R) and the probability distribution P _2,m (r) (r=1,..., R, m=1,..., M) for the category m to be identified are stored.
The category m to be identified may be set in advance within a predetermined range (for example, 1,..., M (M=7)) as described above.

ランダムフォレストの学習に必要な判定条件の候補φ_c（ｃ＝１，…，Ｃ）は、座標（
ｘ，ｙ）を始点とする２つのオフセットベクトル（ｒ_a，ｒ_b）を無作為に設定し、当該ベクトルの終点（ａ，ｂ）における意味的ラベル（ｍ_ａ，ｍ_ｂ）（∈１，…，Ｍ）の共起性に基づいて定義すればよい。 Candidates φ _c (c=1,..., C) for the judgment conditions necessary for learning the random forest are coordinates (
x, y) is set at random as two offset vectors (r _a , r _b ), and the semantic labels (m _a , m _b ) at the end points (a, b) of the vectors (ε 1, ε 1, , M) may be defined based on the co-occurrence.

具体的な例を挙げて説明する。
例えば、図７（Ａ）に示したように、座標ｐ（ｘ，ｙ）において、判定条件（ｒ_a，ｒ_b，１，２）を設定したとする。この場合、オフセットベクトルｒ_aにおける意味的ラベル
が“空”（ｍ＝１）で、かつ、オフセットベクトルｒ_bにおける意味的ラベルが“自然”
（ｍ＝２）であることが成立するため、右側のノードに遷移する。
一方、座標ｐ（ｘ，ｙ）において判定条件（ｒ_a，ｒ_b，１，１）を設定した場合、オフセットベクトルｒ_aにおける意味的ラベルが“空”（ｍ＝１）で、かつ、オフセットベク
トルｒ_bにおける意味的ラベルが“空”（ｍ＝１）であることが成立しないため、左側の
ノードに遷移する。 A specific example will be described.
For example, as shown in FIG. _7A , it is assumed that the determination condition (r _a , r _b , 1, 2) is set at the coordinate p(x, y). In this case, the semantic label in the offset vector r _a is “empty” (m=1) and the semantic label in the offset vector r _b is “natural”.
Since it is established that (m=2), it transits to the node on the right side.
On the other hand, when the determination condition (r _a , r _b , 1, 1) is set at the coordinate p(x, y), the semantic label in the offset vector r _a is “empty” (m=1) and the offset is Since it is not established that the semantic label in the vector r _b is “empty” (m=1), the node transits to the node on the left side.

同様に、図７（Ｂ）に示したように、座標ｐ（ｘ，ｙ）において、判定条件（ｒ_a，ｒ_b，２，２）を設定したとする。この場合、オフセットベクトルｒ_aにおける意味的ラベル
が“自然”（ｍ＝２）で、かつ、オフセットベクトルｒ_bにおける意味的ラベルが“自然
”（ｍ＝２）であることが成立するため、右側のノードに遷移する。
一方、座標ｐ（ｘ，ｙ）において判定条件（ｒ_a，ｒ_b，１，２）を設定した場合、オフセットベクトルｒ_aにおける意味的ラベルが“空”（ｍ＝１）で、かつ、オフセットベク
トルｒ_bにおける意味的ラベルが“自然”（ｍ＝２）であることが成立しないため、左側
のノードに遷移する。 Similarly, as shown in FIG. 7 (B), the coordinates p (x, y) in the determination condition _{_{(r a, r b, 2,2}} ) and setting the. In this case, it is established that the semantic label in the offset vector r _a is “natural” (m=2) and the semantic label in the offset vector r _b is “natural” (m=2). Transition to the node.
On the other hand, when the determination condition (r _a , r _b , 1, 2) is set at the coordinate p(x, y), the semantic label in the offset vector r _a is “empty” (m=1) and the offset is Since it is not established that the semantic label in the vector r _b is “natural” (m=2), the node transits to the left node.

このように、判定条件の候補φ_c（ｃ＝１，…，Ｃ）が得られたら、後は公知の手順に
従って学習をすればよい。ここでいう学習とは、各ノードｒ（１，…，Ｒ）に適切な判定条件Φ_r（ｒ＝１，…，Ｒ）と、識別対象カテゴリｍ（＝１，…，Ｍ）に対する確率分布
Ｐ_2,ｍ（ｒ）（ｒ＝１，…，Ｒ，ｍ＝１，…，Ｍ）を設定することをいう。
具体的には、ｒ番目のノードにおける判定条件Φ_r（ｒ∈１，…，Ｒ）は、判定条件の
候補φ_c（ｃ＝１，…，Ｃ）のうち、式（１）で定義される信頼度Ｇ（φ）を最大にする
ものを設定すればよい。

Thus, candidate _{φ c (c = 1, ...} , C) of the determination condition Once obtained, after may be learned in accordance with known procedures. Learning here means the determination condition Φ _r (r=1,..., R) appropriate for each node r(1,..., R) and the probability distribution for the identification target category m (=1,..., M). P2 _,m (r) (r=1,..., R, m=1,..., M) is set.
Specifically, the determination condition Φ _r (rε1,..., R) at the r-th node is defined by the equation (1) among the determination condition candidates φ _c (c=1,..., C). The maximum reliability G(φ) may be set.

ここで、Ｑ_l（φ）は、判定条件φで左側のノードに遷移するｐ（ｘ，ｙ）の個数、Ｑ_r（φ）は、判定条件φで右側のノードに遷移するｐ（ｘ，ｙ）の個数、Ｈ（Ｑ（φ））は、所定ノードにおける識別対象カテゴリに対するエントロピーを表す。また、Ｈ（Ｑ_l（
φ））は、判定条件φで左側のノードに遷移したサンプルの識別対象カテゴリに対するエントロピーを表す。また、Ｈ（Ｑ_r（φ））は、判定条件φで右側のノードに遷移したサ
ンプルの識別対象カテゴリに対するエントロピーを表す。
最終的に、ランダムフォレストの各ノードｒ（１，…，Ｒ）における判定条件Φ_r（ｒ
＝１，…，Ｒ）と、識別対象カテゴリｍに対する確率分布Ｐ_2,ｍ（ｒ）が得られる。 Here, Q _l (φ) is the number of p(x, y) that transits to the left side node under the determination condition φ, and Q _r (φ) is p(x, y) that transits to the right side node under the determination condition φ. The number of y), H(Q(φ)), represents the entropy for the identification target category at the predetermined node. In addition, H(Q _l (
φ)) represents the entropy for the identification target category of the sample that has transited to the node on the left side under the determination condition φ. Further, H(Q _r (φ)) represents the entropy for the identification target category of the sample that transits to the node on the right side under the determination condition φ.
Finally, the determination condition Φ _r (r at each node r(1,..., R) in the random forest
=1,..., R) and the probability distribution P _2,m (r) for the classification target category m are obtained.

第二識別器１３２は、路上障害物が含まれない画像におけるラベルの共起性を学習することで、「正常画像を入力した第一識別器１３１が出力する確率分布の傾向」を学習する識別器であると言える。 The second discriminator 132 learns the “probability of the probability distribution output by the first discriminator 131 that has input a normal image” by learning the co-occurrence of labels in an image that does not include road obstacles. It can be said to be a vessel.

次に、第一の実施形態に係る識別装置１０が行う学習処理について説明する。
図８は、複数の画像を用いて第一識別器１３１および第二識別器１３２を学習させる処理の流れを示すフローチャートである。学習は、一枚以上の学習用画像を用いて行われる。学習用画像は、路上障害物を含まない画像である。図８に示した処理は、複数の学習用画像のそれぞれについて繰り返し実行される。複数の学習用画像は、例えば、複数のタイムステップにおいて取得された画像Ｉ（ｔ）とすることができる。 Next, the learning process performed by the identification device 10 according to the first embodiment will be described.
FIG. 8 is a flowchart showing the flow of processing for learning the first classifier 131 and the second classifier 132 using a plurality of images. Learning is performed using one or more learning images. The learning image is an image that does not include road obstacles. The process shown in FIG. 8 is repeatedly executed for each of the plurality of learning images. The plurality of learning images can be, for example, images I(t) acquired at a plurality of time steps.

まず、ステップＳ１１で、画像取得部１１が、学習用画像Ｉ（ｔ）を取得する。
次に、ステップＳ１２で、ラベル付与部１２が、学習用画像Ｉ（ｔ）に対応するラベル画像Ｌ（ｔ）を取得する。前述したように、ラベル画像は、装置のユーザが手動で生成してもよい。取得されたラベル画像Ｌ（ｔ）は、第一識別器１３１および第二識別器１３２に出力される。
次に、ステップＳ１３で、第一識別器１３１が、取得した学習用画像Ｉ（ｔ）およびラベル画像Ｌ（ｔ）に基づいて、前述した方法によって学習を行う。
次に、ステップＳ１４で、第二識別器１３２が、取得したラベル画像Ｌ（ｔ）に基づいて、前述した方法によって学習を行う。 First, in step S11, the image acquisition unit 11 acquires the learning image I(t).
Next, in step S12, the label assigning unit 12 acquires the label image L(t) corresponding to the learning image I(t). As mentioned above, the label image may be manually generated by the user of the device. The acquired label image L(t) is output to the first discriminator 131 and the second discriminator 132.
Next, in step S13, the first discriminator 131 performs learning by the above-described method based on the acquired learning image I(t) and label image L(t).
Next, in step S14, the second discriminator 132 performs learning by the method described above based on the acquired label image L(t).

次に、学習済みの識別部１３を用いて、路上障害物の有無を判定する方法について説明する。
図９は、学習済みの識別部１３を用いて、評価用画像を評価し、路上障害物の存在を判定する処理の流れを示すフローチャートである。評価用画像は、路上障害物の存在が不明な画像である。評価用画像は、例えば、車両に搭載されたカメラからリアルタイムで供給されてもよい。 Next, a method for determining the presence/absence of a road obstacle by using the learned identification unit 13 will be described.
FIG. 9 is a flowchart showing a flow of processing of evaluating the evaluation image and determining the presence of a road obstacle by using the learned identification unit 13. The evaluation image is an image in which the presence of an obstacle on the road is unknown. The evaluation image may be supplied in real time from, for example, a camera mounted on the vehicle.

まず、ステップＳ２１で、画像取得部１１が、評価用画像Ｉ’（ｔ）を取得する。
次に、ステップＳ２２で、第一識別器１３１が、入力された評価用画像Ｉ’（ｔ）に基づいて、確率分布Ｐ_1,ｍを生成する。 First, in step S21, the image acquisition unit 11 acquires the evaluation image I′(t).
Next, in step S22, the first discriminator 131 generates a probability distribution P _1,m based on the input evaluation image I′(t).

第一識別器１３１が生成した確率分布Ｐ_1,ｍは、異常判定部１４および第二識別器１３
２へ出力される。
ステップＳ２３では、まず、第二識別器１３２が、確率分布Ｐ_1,ｍからラベル画像Ｌ’（ｔ）を生成する。ラベル画像Ｌ’（ｔ）中の任意の座標におけるラベルｍ’は、確率Ｐ_1,ｍにおける尤度が最大となるラベルである。すなわち、式（２）によって算出することができる。

The probability distribution P _1,m generated by the first discriminator 131 is the abnormality determination unit 14 and the second discriminator 13.
2 is output.
In step S23, first, the second discriminator 132 generates a label image L′(t) from the probability distribution P _1,m . A label m′ at an arbitrary coordinate in the label image L′(t) is a label having the maximum likelihood at the probability P _1,m . That is, it can be calculated by the equation (2).

次に、第二識別器１３２が、生成したラベル画像Ｌ’（ｔ）に基づいてラベルの確率分布Ｐ_2,ｍを生成する。第二識別器１３２が生成した確率分布Ｐ_2,ｍは、異常判定部１４へ出力される。 Next, the second classifier 132 generates a label probability distribution P _2,m based on the generated label image L′(t). The probability distribution P _2,m generated by the second classifier 132 is output to the abnormality determination unit 14.

次に、ステップＳ２４で、異常判定部１４が、ステップＳ２２で得られた確率分布Ｐ_1,ｍと、ステップＳ２３で得られた確率分布Ｐ_2,ｍに基づいて、第一の尤度Ｓ₁を算出する
。第一の尤度Ｓ₁は、式（３）のように、差の二乗和によって求めることができる。

Next, in step S24, the abnormality determination unit 14 determines the first likelihood S ₁ based on the probability distribution P _1,m obtained in step S22 and the probability distribution P _2,m obtained in step S23. To calculate. The first likelihood S ₁ can be obtained by the sum of squared differences, as in Expression (3).

第一の尤度Ｓ₁（ｘ，ｙ）は、評価用画像Ｉ’（ｔ）中の座標（ｘ，ｙ）における、第
一識別器１３１と第二識別器１３２の識別結果の解離度を示す。第一識別器１３１が出力した確率分布と、第二識別器１３２が出力した確率分布が乖離しているこということは、画素の特徴に基づいて識別を行った結果と、当該結果にラベルの共起性を加味した結果とが異なることを意味する。
正常画像におけるラベルの共起性を加味した識別結果が、元の識別結果から乖離していることは、第一識別器１３１が出力した確率分布が、正常画像を入力した場合に得られる確率分布とは異なる傾向を有していることを意味する。従って、第一の尤度Ｓ₁の大きさ
に基づいて、評価用画像中に未知の物体が存在する尤度を推定することができる。 The first likelihood S ₁ (x, y) is the dissociation degree of the discrimination results of the first discriminator 131 and the second discriminator 132 at the coordinates (x, y) in the evaluation image I′(t). Show. The difference between the probability distribution output by the first discriminator 131 and the probability distribution output by the second discriminator 132 means that the result of discrimination based on the characteristics of the pixel and the label of the result. It means that the result of considering co-occurrence is different.
The fact that the identification result considering the label co-occurrence in the normal image deviates from the original identification result means that the probability distribution output by the first discriminator 131 is a probability distribution obtained when the normal image is input. Means having different tendencies. Therefore, the likelihood that an unknown object exists in the evaluation image can be estimated based on the magnitude of the first likelihood S ₁ .

このようにして得られた結果は、路上障害物の検出結果を活用するシステム、たとえば、車両運転支援システム内のプログラム等に伝達される。 The result obtained in this way is transmitted to a system that utilizes the detection result of a road obstacle, for example, a program in a vehicle driving support system.

路上障害物の検出結果を活用するシステムでは、例えば、第一の尤度Ｓ₁が所定の閾値
よりも大きい領域を特定し、評価用画像と重畳させることで、路上障害物を明示することができる。所定の閾値は、例えば、第一の尤度Ｓ₁におけるクラス内分散を最小化し、ク
ラス間分散を最大化する値に設定してもよい（判別分析法）。 In a system that utilizes the detection result of a road obstacle, for example, a region in which the first likelihood S ₁ is larger than a predetermined threshold value is specified and superimposed on the evaluation image, so that the road obstacle can be clearly indicated. it can. The predetermined threshold may be set, for example, to a value that minimizes the intra-class variance and maximizes the inter-class variance in the first likelihood S ₁ (discriminant analysis method).

かかる構成によると、図１０（Ａ）に示したように、路上障害物の存在が疑われる領域（白色領域）と、それ以外の領域（黒色領域）を分割できるようになる。また、この結果を用いることで、図１０（Ｂ）に示したように、路上障害物の候補領域を強調する画像処理を行うことができる。
さらに、路上障害物として検出した領域をネットワーク経由で他の装置（例えば、サーバ装置や後方車両）に伝達し、注意喚起をしてもよい。 With this configuration, as shown in FIG. 10A, it is possible to divide the area (white area) where the presence of a road obstacle is suspected and the other area (black area). Further, by using this result, as shown in FIG. 10B, it is possible to perform image processing that emphasizes the candidate area of the road obstacle.
Furthermore, the area detected as a road obstacle may be transmitted to another device (for example, a server device or a rear vehicle) via a network to call attention.

（第二の実施形態）
第一の実施形態では、画素の特徴を学習した第一識別器と、対象座標の周辺におけるラベルの共起性を学習した第二識別器を併用して路上障害物の判定を行った。これに対し、第二の実施形態は、対象座標の周辺における特徴ベクトルの類似度を学習した第三識別器（本発明における第二タイプ識別器）をさらに併用して路上障害物の判定を行う実施形態
である。 (Second embodiment)
In the first embodiment, the first discriminator that has learned the characteristics of pixels and the second discriminator that has learned the co-occurrence of labels around the target coordinates are used together to make a road obstacle determination. On the other hand, in the second embodiment, a third discriminator (second type discriminator in the present invention) that has learned the similarity of the feature vectors around the target coordinates is further used to determine the road obstacle. It is an embodiment.

図１１Ａおよび図１１Ｂは、第二の実施形態に係る識別装置１０の機能構成を示すブロック図である。図１１Ａは、学習モードにおけるデータの流れを示した図であり、図１１Ｂは、評価モードにおけるデータの流れを示した図である。 11A and 11B are block diagrams showing the functional configuration of the identification device 10 according to the second embodiment. FIG. 11A is a diagram showing a data flow in the learning mode, and FIG. 11B is a diagram showing a data flow in the evaluation mode.

第一の実施形態では、第二識別器１３２が、正常画像に対して正解ラベルを付与して得られるラベル画像に基づいて、正常画像における意味的ラベルの共起性を学習した。これに対し、第三識別器１３３は、正常画像を第一識別器１３１に入力して得られる確率分布に基づいて、正常画像における意味的ラベルの類似度を学習する。 In the first embodiment, the second classifier 132 learned the co-occurrence of the semantic label in the normal image based on the label image obtained by assigning the correct label to the normal image. On the other hand, the third classifier 133 learns the degree of similarity of the semantic labels in the normal image based on the probability distribution obtained by inputting the normal image to the first classifier 131.

第三識別器１３３は、座標（ｘ，ｙ）周辺における確率分布Ｐ_1,mの統計的分布と、当
該座標における意味的ラベルｍとの関係性を学習することで得られる。すなわち、第三識別器１３３は、「正常画像を第一識別器１３１に入力した場合において、ある座標の周辺においてどのような意味的ラベルが出現しやすいか」を学習する識別器であると言える。換言すると、第三識別器１３３は、正常画像を入力した第一識別器１３１が出力する確率分布の傾向を学習する識別器であると言える。 The third discriminator 133 is obtained by learning the relationship between the statistical distribution of the probability distribution P _1,m around the coordinates (x, y) and the semantic label m at the coordinates. That is, the third discriminator 133 can be said to be a discriminator that learns "what kind of semantic label is likely to appear around certain coordinates when a normal image is input to the first discriminator 131". .. In other words, the third classifier 133 can be said to be a classifier that learns the tendency of the probability distribution output by the first classifier 131 that has input a normal image.

第三識別器１３３は、第二識別器１３２と同様に、ランダムフォレストを用いて学習させることができる。 Like the second classifier 132, the third classifier 133 can perform learning using a random forest.

図１２を参照しながら説明する。
ランダムフォレストの学習に必要な判定条件の候補φ_c（ｃ＝１，…，Ｃ）は、座標（
ｘ，ｙ）を始点とする２つのオフセットベクトル（ｒ_a，ｒ_b）を無作為に設定し、当該ベクトルの終点（ａ，ｂ）における確率分布（Ｐ_1,m（ａ），Ｐ_1,m（ｂ））に基づいて、確率分布ベクトルｖ_aおよびｖ_bを算出し、ｖ_aおよびｖ_bのなす角度θに基づいて設定する。
ここで、ｖ_a＝（Ｐ_1，1（ａ），…，Ｐ_1，M（ａ））とし、ｖ_b＝（Ｐ_1，1（ｂ），…，Ｐ_1，M（ｂ））とする。また、角度θの閾値をτとする。 This will be described with reference to FIG.
Candidate conditions φ _c (c=1,..., C) necessary for learning the random forest are coordinates (
Two offset vectors (r _a , r _b ) starting from (x, y) are randomly set, and probability distributions (P _1,m (a), P _1, ) at the end points (a, b) of the vectors are set _{. The} probability distribution vectors v _a and v _b are calculated based on _m (b)), and set based on the angle θ formed by v _a and v _b .
Here, v _a =(P _1,1 (a),..., P _1,M (a)) and v _b =(P _1,1 (b),..., P _1,M (b)) To do. Further, the threshold value of the angle θ is τ.

ここで、判定条件（ｒ_a，ｒ_b，τ）を設定したとする。
図１２（Ａ）に示したように、点ａにおける確率分布ベクトルｖ_aと、点ｂにおける確
率分布ベクトルｖ_bとのなす角度θ＞τが成立する場合、右側のノードに遷移する。
一方、図１２（Ｂ）に示したように、点ｂにおける確率分布ベクトルｖ_aと、点ｂにお
ける確率分布ベクトルｖ_bとのなす角度θ＞τが成立しない場合、左側のノードに遷移す
る。
なお、確率分布ベクトルｖ_aと、確率分布ベクトルｖ_bとのなす角度θは、式（４）によって算出できる。

Here, it is assumed that the determination condition (r _a , r _b , τ) is set.
As shown in FIG. 12A, when the angle θ>τ formed by the probability distribution vector v _a at the point _a and the probability distribution vector v _b at the point b is satisfied, the node transits to the right node.
On the other hand, as shown in FIG. 12B, when the angle θ>τ formed by the probability distribution vector v _a at the point b and the probability distribution vector v _b at the point b is not satisfied, the node transits to the left node.
The angle θ formed by the probability distribution vector v _a and the probability distribution vector v _b can be calculated by the equation (4).

ここで、θが０に近い場合、点ａにおける確率分布ベクトルと、点ｂにおける確率分布ベクトルとの間に類似性があることが推定される。一方、θがπ／２に近い場合、点ａにおける確率分布ベクトルと、点ｂにおける確率分布ベクトルとの間に類似性がないことが推定される。すなわち、適切な閾値τを設定することにより、点ａにおける確率分布ベクトルと、点ｂにおける確率分布ベクトルとの間の類似性を判定することができる。 Here, when θ is close to 0, it is estimated that there is similarity between the probability distribution vector at the point a and the probability distribution vector at the point b. On the other hand, when θ is close to π/2, it is estimated that there is no similarity between the probability distribution vector at the point a and the probability distribution vector at the point b. That is, by setting an appropriate threshold value τ, the similarity between the probability distribution vector at the point a and the probability distribution vector at the point b can be determined.

第三識別器１３３を学習させることで、第一識別器１３１が出力した確率分布Ｐ_1,mを
入力すると、画像に含まれる画素のそれぞれについて、意味的ラベルの確率分布Ｐ_3,mを
出力する識別器を得ることができる。 When the probability distribution P _1,m output from the first classifier 131 is input by learning the third classifier 133, the probability distribution P _3,m of the semantic label is output for each pixel included in the image. It is possible to obtain a discriminator that

図１３は、第二の実施形態において、複数の画像を用いて第一識別器１３１，第二識別器１３２，第三識別器１３３を学習させる処理の流れを示すフローチャートである。なお、第一の実施形態と同様の処理については、点線で図示し、説明を省略する。 FIG. 13 is a flowchart showing a flow of processing for learning the first discriminator 131, the second discriminator 132, and the third discriminator 133 using a plurality of images in the second embodiment. Note that the same processing as that of the first embodiment is illustrated by the dotted line, and the description thereof will be omitted.

第二の実施形態では、ステップＳ１４が完了した後で、第一識別器１３１に再度学習用画像を入力し、評価を行う。この結果得られた確率分布Ｐ_1,mを、第三識別器１３３に入
力し、前述した方法によって学習を行う。 In the second embodiment, after step S14 is completed, the learning image is input again to the first discriminator 131, and evaluation is performed. The probability distribution P _1,m obtained as a result is input to the third discriminator 133, and learning is performed by the method described above.

図１４は、学習済みの識別部１３を用いて、評価用画像を評価し、路上障害物の存在を判定する処理の流れを示すフローチャートである。
第二の実施形態では、ステップＳ２３が完了した後で、第一識別器１３１によって生成された確率分布Ｐ_1,mを第三識別器１３３に入力し、確率分布Ｐ_3,mを得る（ステップＳ２３Ａ）。 FIG. 14 is a flowchart showing a flow of processing for evaluating the evaluation image and determining the presence of a road obstacle by using the learned identification unit 13.
In the second embodiment, after the step S23 is completed, the probability distribution P _1,m generated by the first classifier 131 is input to the third classifier 133 to obtain the probability distribution P _3,m (step S23A).

第一の実施形態では、ステップＳ２４で、確率分布Ｐ_1,mと確率分布Ｐ_2,mの解離度に基づいて、第一の尤度Ｓ₁を算出した。
これに対し、第二の実施形態では、確率分布Ｐ_1,mと確率分布Ｐ_2,mの解離度に基づいて、第一の尤度Ｓ₁を算出し、かつ、確率分布Ｐ_1,mと確率分布Ｐ_3,mの解離度に基づいて、
第二の尤度Ｓ₂を算出する。また、第一の尤度Ｓ₁と第二の尤度Ｓ₂を統合し、最終的な尤
度（統合尤度Ｓ）を算出する（ステップＳ２４Ａ）。
第二の尤度Ｓ₂は、式（５）のように、差の二乗和によって求めることができる。

In the first embodiment, in step S24, the first likelihood S ₁ is calculated based on the dissociation degree of the probability distribution P _1,m and the probability distribution P _2,m .
On the other hand, in the second embodiment, the first likelihood S ₁ is calculated based on the dissociation degree of the probability distribution P _1,m and the probability distribution P _2,m , and the probability distribution P _1,m And the dissociation degree of the probability distribution P _3,m ,
The second likelihood S ₂ is calculated. Further, the first likelihood S ₁ and the second likelihood S ₂ are integrated to calculate the final likelihood (integrated likelihood S) (step S24A).
The second likelihood S ₂ can be obtained by the sum of squares of the differences, as in Expression (5).

第二の実施形態では、異常判定部１４が、第一の尤度Ｓ₁と第二の尤度Ｓ₂に基づいて、評価用画像Ｉ’（ｔ）の座標（ｘ，ｙ）における統合尤度Ｓ（ｘ，ｙ）を算出する。統合尤度Ｓは、例えば、式（６）のように、第一の尤度Ｓ₁と第二の尤度Ｓ₂の和であってもよいし、式（７）のように、第一の尤度Ｓ₁と第二の尤度Ｓ₂の積であってもよい。
Ｓ（ｘ，ｙ）＝Ｓ₁（ｘ，ｙ）＋Ｓ₂（ｘ，ｙ）・・・式（６）
Ｓ（ｘ，ｙ）＝Ｓ₁（ｘ，ｙ）・Ｓ₂（ｘ，ｙ）・・・式（７） In the second embodiment, the abnormality determination unit 14 calculates the integrated likelihood at the coordinates (x, y) of the evaluation image I′(t) based on the first likelihood S ₁ and the second likelihood S _2. The degree S(x,y) is calculated. The integrated likelihood S may be the sum of the first likelihood S ₁ and the second likelihood S ₂ as in Equation (6), or may be the first likelihood S ₁ as in Equation (7). It may be a product of the likelihood S ₁ and the second likelihood S ₂ .
S(x,y)=S ₁ (x,y)+S ₂ (x,y) (6)
S(x, y)=S ₁ (x, y)·S ₂ (x, y) (7)

（有利な効果）
第二識別器１３２および第三識別器１３３は、共に、「正常画像（路上障害物を含まない画像）を第一識別器１３１に入力して得られる確率分布の傾向」を学習する識別器であると言える。このような識別器に、第一識別器１３１が出力した確率分布を入力し、得られた確率分布を、元の確率分布と比較することで、「第一識別器が、学習時の傾向から外れたものを出力している」ことを検知することができる。すなわち、第一識別器のみでは路上障害物を正しく検知できない場合であっても、「未知の物体に起因して、第一識別器が誤判定を起こしている」ことを推定することができる。
また、路上障害物に対する尤度を画素ごとに算出できるため、路上障害物の存在が疑われる領域をユーザに明示することができる。 (Advantageous effect)
The second discriminator 132 and the third discriminator 133 are both discriminators that learn "a tendency of a probability distribution obtained by inputting a normal image (an image that does not include road obstacles) to the first discriminator 131". It can be said that there is. By inputting the probability distribution output from the first classifier 131 to such a classifier and comparing the obtained probability distribution with the original probability distribution, “the first classifier is determined from the tendency at the time of learning. It is possible to detect that "the output is out of line". That is, even when the road obstacle cannot be correctly detected only by the first discriminator, it can be estimated that "the first discriminator is erroneously determined due to an unknown object".
Further, since the likelihood for a road obstacle can be calculated for each pixel, it is possible to clearly indicate to the user the area where the existence of a road obstacle is suspected.

（変形例）
上記の実施形態はあくまでも一例であって、本発明はその要旨を逸脱しない範囲内で適宜変更して実施しうる。
例えば、第二の実施形態では、第一識別器１３１と第二識別器１３２、第三識別器１３３を併用したが、第一識別器１３１と第三識別器１３３のみを利用してもよい。この場合
、第二の尤度Ｓ₂のみを用いて異常判定を行うようにしてもよい。 (Modification)
The above-described embodiment is merely an example, and the present invention can be appropriately modified and implemented without departing from the scope of the invention.
For example, in the second embodiment, the first discriminator 131, the second discriminator 132, and the third discriminator 133 are used together, but only the first discriminator 131 and the third discriminator 133 may be used. In this case, the abnormality determination may be performed using only the second likelihood S ₂ .

本発明は、前述した手段の少なくとも一部を備える識別装置として捉えることができる。また、本発明は、上記処理の少なくとも一部を実行する識別方法として捉えることもできる。 The present invention can be understood as an identification device including at least a part of the above-mentioned means. Further, the present invention can be understood as an identification method that executes at least a part of the above processing.

本発明における識別装置１０は、半導体集積回路（ＬＳＩ）による実装に限定されず、汎用的なマイクロプロセッサやメモリを有するコンピュータがプログラムを実行することによって実現されても構わない。 The identification device 10 in the present invention is not limited to mounting by a semiconductor integrated circuit (LSI), and may be realized by a computer having a general-purpose microprocessor or memory executing a program.

また、実施形態の説明では、路上障害物を検出する構成を挙げたが、正常画像中に存在しない物体を検出するものであれば、路上障害物以外を対象としてもよい。例えば、画像に何らかの異常物体が含まれていることを検知する識別装置として発明を実施することもできる。 Further, in the description of the embodiment, the configuration in which the obstacle on the road is detected has been described, but any object other than the obstacle on the road may be used as long as it detects an object that does not exist in the normal image. For example, the invention can be implemented as an identification device that detects that an image contains some abnormal object.

また、実施形態の説明では、同一の装置が学習処理と評価処理の双方を行ったが、学習処理を行う装置と、評価処理を行う装置が分かれていてもよい。さらに、識別装置１０から、学習処理に関連した手段を省いてもよい。例えば、例示した識別装置１０から、学習に関する手段を省略し、評価処理のみを行う装置（識別装置）として実施してもよい。
この場合、学習済みの識別部１３（または、識別部１３が有する識別器）を装置から論理的に切り離し、別の装置に組み込み可能に構成してもよい。例えば、実施形態に係る識別装置１０によって学習された識別部１３を、評価処理のみを行う装置に組み込めるようにしてもよい。
また、反対に、学習処理のみを行う装置として発明を実施してもよい。 Further, in the description of the embodiment, the same device performs both the learning process and the evaluation process, but the device that performs the learning process and the device that performs the evaluation process may be separated. Further, means related to the learning process may be omitted from the identification device 10. For example, the learning device may be omitted from the illustrated identification device 10 and may be implemented as a device (identification device) that performs only the evaluation process.
In this case, the learned discriminator 13 (or the discriminator included in the discriminator 13) may be logically separated from the device so that it can be incorporated into another device. For example, the identification unit 13 learned by the identification device 10 according to the embodiment may be incorporated into a device that performs only the evaluation process.
On the contrary, the invention may be implemented as an apparatus that performs only the learning process.

１０・・・識別装置
１１・・・画像取得部
１２・・・ラベル付与部
１３・・・識別部
１３１・・・第一識別器
１３２・・・第二識別器
１４・・・異常判定部 10... Identification device 11... Image acquisition unit 12... Labeling unit 13... Identification unit 131... First discriminator 132... Second discriminator 14... Abnormality determination unit

Claims

An identification device for determining the presence of an abnormal object included in an image,
Based on the result of learning the characteristics of the pixels included in the first image that is an image that does not include the abnormal object, the label is estimated for each of the plurality of pixels included in the input image, and the probability of the plurality of labels is estimated. A first type discriminator which outputs a first probability distribution representing a distribution for each pixel,
Based on the result of learning the relationship of labels in the vicinity of a plurality of pixels included in the first image, from the first probability distribution, a second type classifier that generates a second probability distribution,
The first probability distribution obtained by inputting a second image to the first type discriminator, and the second probability obtained by inputting the first probability distribution to the second type discriminator. Probability distribution, based on the dissociation degree of, the determination means for determining that the abnormal object is present in the second image,
And an identification device.

The second type discriminator is a discriminator that has learned the co-occurrence or similarity of the labels as the relationship of the labels,
The identification device according to claim 1.

The second type classifier is a classifier that has learned the first probability distribution obtained by inputting the first image to the first type classifier as learning data.
The identification device according to claim 1 or 2.

The second type discriminator uses, as learning data, the first probability distribution obtained by inputting the first image to the first type discriminator, and around a plurality of pixels included in the first image. Is a classifier that has learned the similarity of the labels in
The identification device according to claim 1.

The second type discriminator uses the first label image in which the correct label is given to the first image as learning data, and determines the co-occurrence of the label around a plurality of pixels included in the first image. Is a learned discriminator,
The identification device according to claim 1.

An identification method for determining the presence of an abnormal object included in an image,
Based on the result of learning the characteristics of the pixels included in the first image that is an image that does not include the abnormal object, the label is estimated for each of the plurality of pixels included in the input image, and the probability of the plurality of labels is estimated. A first identification step of outputting a first probability distribution expressing a distribution for each pixel;
Based on the result of learning the relationship of labels in the vicinity of a plurality of pixels included in the first image, from the first probability distribution, a second identification step of generating a second probability distribution,
The first probability distribution obtained by processing the second image in the first identification step, and the second probability distribution obtained by processing the first probability distribution in the second identification step. Probability distribution, based on the dissociation degree of, the determination step of determining that the abnormal object is present in the second image,
Identification method, including.

A program for causing a computer to execute the identification method according to claim 6.

An identification device for determining the presence of an abnormal object included in an image,
Based on the result of learning the characteristics of the pixels included in the first image that is an image that does not include the abnormal object, the label is estimated for each of the plurality of pixels included in the input image, and the probability of the plurality of labels is estimated. A first type discriminator which outputs a first probability distribution representing a distribution for each pixel,
Based on the result of learning the tendency of the first probability distribution output by the first type discriminator when the first image is input to the first type discriminator, from the first probability distribution, A second type discriminator that generates a second probability distribution,
The first probability distribution obtained by inputting a second image to the first type discriminator, and the second probability obtained by inputting the first probability distribution to the second type discriminator. Probability distribution, based on the dissociation degree of, the determination means for determining that the abnormal object is present in the second image,
And an identification device.