JP2007122362A - State estimation method using neural network and state estimation apparatus using neural network

Info

Publication number: JP2007122362A
Application number: JP2005313017A
Authority: JP
Inventors: Masaaki Uechi; 正昭上地; Kazuya Sasaki; 和也佐々木; Fumiaki Takeda; 史章竹田
Original assignee: Toyota Motor Corp
Current assignee: Toyota Motor Corp
Priority date: 2005-10-27
Filing date: 2005-10-27
Publication date: 2007-05-17

Abstract

PROBLEM TO BE SOLVED: To provide a state estimation method using a neural network, and a corresponding state estimation apparatus, capable of highly precise estimation.
SOLUTION: The state estimation method using the neural network includes: an initial learning step of training the neural network on input data and teacher data corresponding to the class label of the input data; a virtual teacher data assignment step of assigning virtual teacher data corresponding to a virtual class label to evaluation data whose class label is unknown; an evaluation step of training the neural network on the evaluation data and its virtual teacher data and evaluating the convergence characteristic of the resulting learning curve; and a class label estimation step of estimating, as the class label of the evaluation data, the virtual class label corresponding to the virtual teacher data whose learning curve has the highest convergence characteristic among the learning curves obtained for the different virtual teacher data assigned to the evaluation data.
COPYRIGHT: (C) 2007, JPO & INPIT

Description

The present invention relates to a state estimation method using a neural network, and a state estimation apparatus using a neural network, for estimating a driver state and the like with high accuracy.

Various methods have been proposed for estimating a driver's psychological state, physical state, and driving skill in order to provide driving assistance suited to the driver state. One such method uses a neural network (see Patent Document 1). In this method, teacher data corresponding to a class label that indicates a driver state such as impatience or fatigue is set for input data consisting of a combination of various kinds of information such as vehicle information. The network is trained on learning data consisting of this input data and teacher data, and a neural network in which the error between the teacher data and the output data has converged sufficiently is constructed in advance. Evaluation data consisting of a combination of vehicle information and the like obtained while the vehicle is traveling is then input to the neural network, and the driver state for that evaluation data is estimated by a feedforward calculation of the network.
Patent Document 1: Japanese Patent Laid-Open No. 10-324175

The driver state varies greatly with personal characteristics and with the driver's condition while driving. Complete learning that covers every pattern is therefore impossible, and a neural network that handles the diverse conditions of many different drivers cannot be constructed in advance. Consequently, when the learning data used for training diverges from the evaluation data actually obtained (that is, when evaluation data for which learning was insufficient is input), the estimation accuracy of the driver state decreases. Moreover, if the neural network is to be tuned to the personal characteristics of a driver in order to improve estimation accuracy, re-learning must be performed with correct learning data. The driver would then have to set the current driver state as teacher data every time evaluation data is input, which is very laborious and places a burden on the driver.

An object of the present invention is therefore to provide a state estimation method using a neural network, and a corresponding state estimation apparatus, capable of highly accurate estimation.

The state estimation method using a neural network according to the present invention is characterized by including: an initial learning step of training the neural network on input data and teacher data corresponding to the class label of the input data; a virtual teacher data assignment step of assigning virtual teacher data corresponding to a virtual class label to evaluation data whose class label is unknown; an evaluation step of training the neural network on the evaluation data and its virtual teacher data and evaluating the convergence characteristic of the learning curve of that training; and a class label estimation step of estimating, as the class label of the evaluation data, the virtual class label corresponding to the virtual teacher data whose learning curve has a high convergence characteristic among the learning curves obtained for the different virtual teacher data assigned to the evaluation data.

In this state estimation method, the initial learning step trains on learning data consisting of input data and the teacher data corresponding to the class label of that input data, and thereby constructs the initially trained neural network. When evaluation data with an unknown class label is input after the initially trained network has been constructed, the virtual teacher data assignment step assigns to that evaluation data virtual teacher data corresponding to a virtual class label. Several different virtual teacher data (virtual class labels) are assigned to the same evaluation data.

In the evaluation step, for each combination of the evaluation data and one of its virtual teacher data, the method trains with that pair as learning data, generates a learning curve for evaluating the convergence of the training (for example, a curve that associates each learning iteration count with the squared error between the virtual teacher data and the output data at that iteration), and evaluates the convergence characteristic of the curve. As many learning curves are therefore generated for the evaluation data as there are virtual teacher data. The more appropriate the assigned virtual teacher data is as teacher data for the evaluation data (that is, the more appropriate the virtual class label is for the evaluation data), the more smoothly and quickly the learning curve converges.

In the class label estimation step, the method extracts the learning curve with a high convergence characteristic from the learning curves generated for the evaluation data, and takes the virtual class label corresponding to the virtual teacher data of that curve as the class label of the evaluation data. In other words, the class label with the highest validity for the evaluation data is estimated. Rather than evaluating the evaluation data once by a feedforward calculation of the trained network, this method sets several virtual teacher data (virtual class labels) for the evaluation data and evaluates it several times on the basis of the temporal course of learning (the convergence characteristic of the learning curve), so the class label of the evaluation data is estimated with high accuracy. Furthermore, because the judgment index is the global convergence of learning in each learning process for the several virtual class labels, estimation remains accurate and robust even when the evaluation data contains noise or is partly missing. Finally, because the method assigns the virtual teacher data to the evaluation data automatically, the user does not need to set teacher data each time evaluation data is input, and no burden is placed on the user.
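The four steps described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the `ThreeLayerNet` class, the toy two-feature data, the learning rate, the epoch counts, and the use of the summed squared error over the trial as the convergence measure are all assumptions introduced here. The source says only that the learning curve with the highest convergence characteristic is selected.

```python
import copy
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def one_hot(label, n_classes):
    # Teacher data for a class label (one-hot coding is an assumption).
    return [1.0 if i == label else 0.0 for i in range(n_classes)]


class ThreeLayerNet:
    """Input -> hidden -> output network trained by error backpropagation."""

    def __init__(self, n_in, n_hidden, n_out, seed=0):
        rng = random.Random(seed)
        # The last weight of each row acts as the bias.
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in + 1)]
                   for _ in range(n_hidden)]
        self.w2 = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden + 1)]
                   for _ in range(n_out)]

    def forward(self, x):
        xb = list(x) + [1.0]
        h = [sigmoid(sum(w * v for w, v in zip(row, xb))) for row in self.w1]
        hb = h + [1.0]
        y = [sigmoid(sum(w * v for w, v in zip(row, hb))) for row in self.w2]
        return xb, hb, y

    def train_step(self, x, t, lr=0.5):
        """One backpropagation update; returns the squared error before it."""
        xb, hb, y = self.forward(x)
        err = sum((ti - yi) ** 2 for ti, yi in zip(t, y))
        d_out = [(yi - ti) * yi * (1.0 - yi) for yi, ti in zip(y, t)]
        # Hidden deltas are computed before the output weights change.
        d_hid = [hb[j] * (1.0 - hb[j])
                 * sum(d_out[k] * self.w2[k][j] for k in range(len(d_out)))
                 for j in range(len(self.w1))]
        for k, row in enumerate(self.w2):
            for j in range(len(row)):
                row[j] -= lr * d_out[k] * hb[j]
        for j, row in enumerate(self.w1):
            for i in range(len(row)):
                row[i] -= lr * d_hid[j] * xb[i]
        return err


def estimate_label(net, x_eval, n_classes, epochs=30):
    """Trial-train a copy of the network per virtual label; compare curves."""
    curves = {}
    for virtual_label in range(n_classes):
        trial = copy.deepcopy(net)  # the initially trained net stays untouched
        teacher = one_hot(virtual_label, n_classes)
        curves[virtual_label] = [trial.train_step(x_eval, teacher)
                                 for _ in range(epochs)]
    # Assumed convergence measure: smallest summed error over the trial.
    best = min(curves, key=lambda c: sum(curves[c]))
    return best, curves


# Initial learning step on toy labelled data (hypothetical values).
net = ThreeLayerNet(n_in=2, n_hidden=4, n_out=2, seed=1)
labelled = [([0.1, 0.2], 0), ([0.2, 0.1], 0), ([0.9, 0.8], 1), ([0.8, 0.9], 1)]
for _ in range(500):
    for x, c in labelled:
        net.train_step(x, one_hot(c, 2))

# Assignment, evaluation, and estimation for one unlabelled sample.
label, curves = estimate_label(net, [0.85, 0.85], n_classes=2)
print("estimated class label:", label)
```

Copying the network for each trial keeps the trials independent of one another, which is one reasonable reading of "learning for each virtual teacher data"; the source does not state how the network state is handled between trials.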

The state estimation method using a neural network of the present invention may include a class label output step of outputting the class label of the evaluation data estimated in the class label estimation step.

In this configuration, the class label output step outputs the class label estimated for the evaluation data as the estimation result. The estimation result can then be put to use, for example by notifying the user of the output class label or by performing re-learning with the evaluation data and its class label as learning data.

The state estimation method using a neural network of the present invention may include an additional learning step of training the neural network with the virtual teacher data corresponding to the class label estimated in the class label estimation step used as the teacher data of the evaluation data.

In this configuration, the additional learning step takes the virtual teacher data corresponding to the estimated class label as the teacher data of the evaluation data, performs additional learning on learning data consisting of the evaluation data and that teacher data, and updates the neural network. Additional learning thus becomes possible with evaluation data that arrives after the initially trained network has been constructed and that carries no teacher data, so the network can be tuned into one with high estimation accuracy that matches the personal characteristics of the user.

In the state estimation method using a neural network of the present invention, the evaluation step may train the neural network on the evaluation data and its virtual teacher data together with the input data and the teacher data of the input data.

In this configuration, when training to generate a learning curve, the evaluation step trains on the learning data consisting of the input data and its teacher data in addition to the virtual learning data consisting of the evaluation data and its virtual teacher data. Because correctly labeled learning data also takes part in the training whose learning curve is evaluated, the class label of the evaluation data is estimated with still higher accuracy.
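One way to picture this variant is as the construction of one trial training set per virtual class label, each containing the correctly labeled data plus the evaluation sample paired with that virtual teacher data. The sketch below only builds those sets and does no training. The function names, the one-hot teacher coding, and the sample values are assumptions made for illustration.

```python
def one_hot(label, n_classes):
    # Teacher data for a class label (one-hot coding is an assumption).
    return [1.0 if i == label else 0.0 for i in range(n_classes)]


def build_trial_sets(labelled, x_eval, n_classes):
    """Return {virtual_label: training set} for the evaluation step.

    labelled: list of (input_vector, class_label) pairs with known labels.
    Each training set is a list of (input_vector, teacher_vector) pairs.
    """
    known = [(list(x), one_hot(c, n_classes)) for x, c in labelled]
    return {
        virtual_label: known + [(list(x_eval), one_hot(virtual_label, n_classes))]
        for virtual_label in range(n_classes)
    }


# Two labelled samples and one unlabelled evaluation sample (toy values).
trial_sets = build_trial_sets(
    labelled=[([0.1, 0.2], 0), ([0.9, 0.8], 1)],
    x_eval=[0.5, 0.6],
    n_classes=2,
)
for virtual_label, samples in trial_sets.items():
    print(virtual_label, samples[-1])
```

Each trial set would then be trained separately and its learning curve recorded, so that a wrong virtual label has to compete against the correctly labeled samples during training.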

In the state estimation method using a neural network of the present invention, the input data and the evaluation data may consist of values detected by driving state detection means, and the class label may be a driver state.

In this configuration, the input data and the evaluation data consist of various kinds of information obtained by the driving state detection means while the vehicle is being driven, and the class labels are the various states of the driver driving the vehicle, so the driver state can be estimated with high accuracy from evaluation data obtained during driving. Examples of the driving state detection means are: means for detecting driver information such as the driver's face orientation, line of sight, motion, and physiological information (for example, heart rate); means for detecting environment information around the vehicle such as inter-vehicle distance, lanes, pedestrians, and traffic congestion information; and means for detecting vehicle information such as vehicle speed, steering angle, accelerator opening, degree of brake depression, longitudinal G, and lateral G. Examples of the driver state are: the driver's psychological state, such as impatience or haste; the driver's physical state, such as fatigue or arousal level; and the driver's driving skill, such as novice driving or skilled driving.

In the state estimation method using a neural network of the present invention, the neural network may be a three-layer hierarchical neural network that learns, by error backpropagation, the correspondence between the input data and the teacher data of the input data, the correspondence between the evaluation data and the virtual teacher data of the evaluation data, or the correspondence between the evaluation data and the teacher data estimated for the evaluation data, and the learning curve may be generated from the error of the error backpropagation.
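As a worked form of the error that the learning curve tracks, the following uses the squared error between teacher data and output data mentioned earlier in the description, together with the standard gradient-descent update of error backpropagation. The symbols $t_k$, $y_k$, $n$, $\eta$, and $w_{ij}$ are notation assumed here; the source names the quantities but gives no formula.

```latex
E(n) = \sum_{k} \bigl( t_k - y_k(n) \bigr)^2 ,
\qquad
w_{ij} \leftarrow w_{ij} - \eta \, \frac{\partial E(n)}{\partial w_{ij}}
```

Here $t_k$ is the $k$-th component of the teacher data (or virtual teacher data), $y_k(n)$ is the $k$-th network output at the $n$-th learning iteration, $w_{ij}$ is a connection weight, and $\eta$ is the learning rate. Under this notation the learning curve is the sequence of pairs $(n, E(n))$.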

The state estimation apparatus using a neural network according to the present invention is characterized by including: initial learning means for training the neural network on input data and teacher data corresponding to the class label of the input data; virtual teacher data assignment means for assigning virtual teacher data corresponding to a virtual class label to evaluation data whose class label is unknown; evaluation means for training the neural network on the evaluation data and its virtual teacher data and evaluating the convergence characteristic of the learning curve of that training; and class label estimation means for estimating, as the class label of the evaluation data, the virtual class label corresponding to the virtual teacher data whose learning curve has a high convergence characteristic among the learning curves obtained for the different virtual teacher data assigned to the evaluation data.

The state estimation apparatus using a neural network of the present invention may include class label output means for outputting the class label of the evaluation data estimated by the class label estimation means.

The state estimation apparatus using a neural network of the present invention may include additional learning means for training the neural network with the virtual teacher data corresponding to the class label estimated by the class label estimation means used as the teacher data of the evaluation data.

In the state estimation apparatus using a neural network of the present invention, the evaluation means may train the neural network on the evaluation data and its virtual teacher data together with the input data and the teacher data of the input data.

In the state estimation apparatus using a neural network of the present invention, the input data and the evaluation data may consist of values detected by driving state detection means, and the class label may be a driver state.

In the state estimation apparatus using a neural network of the present invention, the neural network may be a three-layer hierarchical neural network that learns, by error backpropagation, the correspondence between the input data and the teacher data of the input data, the correspondence between the evaluation data and the virtual teacher data of the evaluation data, or the correspondence between the evaluation data and the teacher data estimated for the evaluation data, and the learning curve may be generated from the error of the error backpropagation.

Each of the state estimation apparatuses described above has the same operation and effects as the corresponding state estimation method described above.

According to the present invention, the class label of evaluation data can be estimated with high accuracy using a neural network.

Embodiments of the state estimation method using a neural network and the state estimation apparatus using a neural network according to the present invention are described below with reference to the drawings.

In this embodiment, the present invention is applied to a driver state identification device that is mounted on a vehicle and identifies the driver state. The driver state identification device according to this embodiment acquires, by sensors and the like, the various kinds of information needed to identify the driver state, and uses a neural network classifier to identify the driver state (class label) for evaluation data consisting of the acquired information. The device then judges driving safety according to the driver state and provides various kinds of driving assistance to improve driving safety.

The neural network classifier used in this embodiment is a three-layer hierarchical neural network constructed by training with error backpropagation. The information needed to identify the driver state consists of various kinds of information obtained from the driver, from the environment around the vehicle, and from the vehicle itself. A combination of these kinds of information is input to the neural network classifier as evaluation data. The driver states to be identified include psychological states such as impatience and haste, physical states such as arousal level and fatigue, and driving skills such as novice driving and skilled driving.

The driver state identification device 1 is described with reference to FIGS. 1 to 5. FIG. 1 is a configuration diagram of the driver state identification device according to this embodiment. FIG. 2 is a conceptual diagram of the identification method in the driver state identification device of FIG. 1. FIG. 3 is an example of evaluation data. FIG. 4 is an example of provisional class labels and the provisional teacher data corresponding to them. FIG. 5 is an example in which provisional teacher data (provisional class labels) have been assigned to evaluation data.

The driver state identification device 1 identifies the driver state with a neural network classifier and provides various kinds of driving assistance according to the driver state. In particular, to improve the accuracy of driver state identification, the device sets several provisional class labels for evaluation data obtained while the vehicle is being driven, performs provisional learning for each provisional class label, and, on the basis of all the resulting learning curves, identifies the provisional class label that is most valid for the evaluation data as the class label of that data. The device further performs re-learning with learning data consisting of the evaluation data and the identified class label, and thereby tunes the neural network classifier to the characteristics of the driver. For this purpose, the driver state identification device 1 includes driver state recognition means, environment information recognition means, vehicle information recognition means, a display 40, a speaker 41, a driving support system 42, a neural network identification ECU [Electronic Control Unit] 50 (hereinafter "identification ECU 50"), and a driver state adaptive driving support ECU 60 (hereinafter "driving support ECU 60").
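The comparison of learning curves across provisional class labels can be sketched without any network at all, given the recorded error values. The scoring rule below is an assumption made for illustration: it adds the area under the curve (to reward fast convergence) to a penalty for every iteration at which the error rises (to reward smoothness). The source states only that a more valid label yields a curve that converges more smoothly and quickly; the penalty weight and the sample curves are hypothetical.

```python
def convergence_score(curve, uptick_penalty=0.5, tol=1e-9):
    """Lower is better: area under the curve plus a penalty per error increase."""
    area = sum(curve)
    upticks = sum(1 for prev, cur in zip(curve, curve[1:]) if cur > prev + tol)
    return area + uptick_penalty * upticks


def pick_label(curves):
    """Return the provisional class label whose learning curve scores best."""
    return min(curves, key=lambda label: convergence_score(curves[label]))


# Hand-made learning curves (squared error per learning iteration).
curves = {
    "normal": [0.9, 0.4, 0.1, 0.05],      # falls quickly and monotonically
    "impatient": [0.9, 0.8, 0.85, 0.7],   # falls slowly, with one increase
}
print(pick_label(curves))  # prints "normal"
```

In this toy case the curve for "normal" scores 1.45 and the curve for "impatient" scores 3.75, so "normal" is selected.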

In this embodiment, the processes performed by the identification ECU 50 correspond to the initial learning means, virtual teacher data assignment means, evaluation means, class label estimation means, class label output means, and additional learning means recited in the claims, and the driver state recognition means, environment information recognition means, and vehicle information recognition means correspond to the driving state detection means recited in the claims.

The driver state recognition means include a face orientation and line-of-sight recognition sensor 10, a motion and posture recognition sensor 11, a face and eyeball recognition sensor 12, a foot recognition sensor 13, and a heart rate sensor 14. The face orientation and line-of-sight recognition sensor 10 recognizes the driver's face orientation and line of sight and transmits the recognition information to the identification ECU 50. The motion and posture recognition sensor 11 recognizes the driver's motion and posture and transmits the recognition information to the identification ECU 50. The face and eyeball recognition sensor 12 recognizes the driver's face and eyeballs and transmits the recognition information to the identification ECU 50. The foot recognition sensor 13 recognizes the motion and posture of the driver's feet and transmits the recognition information to the identification ECU 50. The heart rate sensor 14 detects the heart rate as physiological information of the driver and transmits the detection information to the identification ECU 50.

The face orientation and line-of-sight recognition sensor 10, the motion and posture recognition sensor 11, the face and eyeball recognition sensor 12, and the foot recognition sensor 13 may each be a standalone sensor, or may be implemented with cameras that image the driver's face, whole body, and feet together with an ECU or the like that performs image processing. Only the heart rate is given here as an example of the driver's physiological information, but brain waves, respiratory rate, body temperature, blink count, and the like may also be acquired.

The environment information recognition means include an inter-vehicle distance sensor 20, a lane recognition sensor 21, a traffic light recognition sensor 22, a sign recognition sensor 23, a stop line recognition sensor 24, a pedestrian recognition sensor 25, a communication device 26 for acquiring traffic environment information, and a car navigation system 27. The inter-vehicle distance sensor 20 detects the distance to the vehicle ahead and transmits the detection information to the identification ECU 50. The lane recognition sensor 21 recognizes the lane in which the vehicle is traveling and transmits the recognition information to the identification ECU 50. The traffic light recognition sensor 22 recognizes traffic lights ahead of the vehicle and transmits the recognition information to the identification ECU 50. The sign recognition sensor 23 recognizes signs ahead of the vehicle and transmits the recognition information to the identification ECU 50. The stop line recognition sensor 24 recognizes stop lines ahead of the vehicle and transmits the recognition information to the identification ECU 50. The pedestrian recognition sensor 25 recognizes pedestrians ahead of the vehicle and transmits the recognition information to the identification ECU 50. The communication device 26 for acquiring traffic environment information communicates with roadside stations and with communication devices mounted on other vehicles to acquire traffic congestion information, information on other vehicles, and the like, and transmits the acquired traffic environment information to the identification ECU 50. The car navigation system 27 detects the current position and traveling direction of the vehicle, provides route guidance to the destination, and transmits this information to the identification ECU 50.

The inter-vehicle distance sensor 20, the lane recognition sensor 21, the traffic light recognition sensor 22, the sign recognition sensor 23, the stop line recognition sensor 24, and the pedestrian recognition sensor 25 may each be a standalone sensor, or may be implemented with a camera or radar that images or scans the area ahead of the vehicle together with an ECU or the like that performs image processing and related processing.

The vehicle information recognition means include a vehicle speed sensor 30, a steering angle sensor 31, an accelerator opening sensor 32, a brake depression sensor 33, a longitudinal G sensor 34, a lateral G sensor 35, a vertical G sensor 36, and a turn signal operation sensor 37. The vehicle speed sensor 30 detects the speed of the vehicle and transmits the detected value to the identification ECU 50. The steering angle sensor 31 detects the steering angle of the steering wheel and transmits the detected value to the identification ECU 50. The accelerator opening sensor 32 detects the opening of the accelerator pedal and transmits the detected value to the identification ECU 50. The brake depression sensor 33 detects the degree of depression of the brake pedal and transmits the detected value to the identification ECU 50. The longitudinal G sensor 34 detects the longitudinal G acting on the vehicle and transmits the detected value to the identification ECU 50. The lateral G sensor 35 detects the lateral G acting on the vehicle and transmits the detected value to the identification ECU 50. The vertical G sensor 36 detects the vertical G acting on the vehicle and transmits the detected value to the identification ECU 50. The turn signal operation sensor 37 detects the operation state of the turn signals and transmits the detected value to the identification ECU 50.

The display 40 and the speaker 41 are shared with the other systems in the vehicle; the driver state identification device 1 uses them to alert the driver and to present advice. Examples of the driving support system 42 include a pre-crash safety system, an adaptive cruise control system, and a lane keeping system.

The identification ECU 50 includes a CPU [Central Processing Unit], a ROM [Read Only Memory], a RAM [Random Access Memory], and the like, and implements a neural network classifier. The identification ECU 50 performs three kinds of processing: a preliminary learning process that builds the neural network classifier from general learning data before the user starts using the vehicle (before the vehicle is sold); an identification learning process that identifies an appropriate class label for evaluation data that has no class label, after the user has started using the vehicle (after the vehicle is sold); and a relearning process that tunes the neural network classifier with new learning data obtained after the user has started using the vehicle.

The neural network classifier has a large number of neurons, which are connected to one another with weights to form a network structure (a neural network). The data of each piece of information contained in the evaluation data is input to the corresponding neuron of the input layer, and each neuron of the output layer outputs data indicating a class label. Depending on the class labels, the neural network classifier may be an arousal level model, a hurrying model, a novice driving model, or the like, and the class labels that can be identified are determined by the combination of information that is input. For example, when the detected values of vehicle speed, lateral G, heart rate, and inter-vehicle distance are input, five states can be identified as class labels: polite, normal, impatient, fatigued, and abnormal.

The preliminary learning process will now be described. Before preliminary learning, for example, a large number of drivers (test subjects) drive the vehicle along a model course. At fixed time intervals, the sensors of the driver information recognition means, the environment information recognition means, and the vehicle information recognition means acquire the information described above, and input data consisting of combinations of several of these pieces of information is collected. Each driver then sets, as the class label, his or her psychological state, physical state, driving skill, and the like at the time the information was acquired. The data is acquired under varied conditions, such as the time of day (and therefore the traffic volume), the driver's hours of sleep, and the amount of work (physical activity) before driving. The input data and class labels used for preliminary learning may be obtained through test drives and driver-supplied labels as described here, or by other methods.

When each item of input data and its corresponding class label are input, the identification ECU 50 sets the teacher data predetermined for that class label and generates learning data consisting of the input data and the teacher data. The identification ECU 50 then inputs the input data to the neural network classifier and performs learning by error backpropagation, updating the weight of each neuron so that the error between the teacher data and the output data from each neuron in the output layer becomes sufficiently small (that is, so that learning reaches a converged state). In this way, the identification ECU 50 learns from all of the correctly labeled learning data and builds the base neural network classifier. Learning ends when it has been performed a predetermined number of times or when the squared error has fallen to or below a predetermined value. As the update rule, for example, equation (1) of an improved error backpropagation method is used. This improved method adds an inertia term and an oscillation term to the delta term of ordinary error backpropagation. The inertia term accelerates learning, and the oscillation term causes the weights to oscillate when learning falls into a local minimum so that it can escape probabilistically.

In equation (1), Δω() is the weight correction amount, c is the number of learning iterations, ε is a positive learning constant, d is the generalized error, o is the output data, α is the proportionality constant of the inertia term, and β is the proportionality constant of the oscillation term.
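Equation (1) itself is not reproduced in this text. As a rough sketch only, one form that is consistent with the symbol definitions above (a delta term plus an inertia term and an oscillation term) would be the following. The indices i and j, and the choice of the two previous corrections as the inertia and oscillation terms, are assumptions made for illustration and are not confirmed by the source.

```latex
% Assumed form, not taken from the source:
% i indexes the sending neuron, j the receiving neuron.
\Delta\omega_{ij}(c) = -\,\varepsilon\, d_j\, o_i
  + \alpha\, \Delta\omega_{ij}(c-1)
  + \beta\, \Delta\omega_{ij}(c-2)
```

Under this reading, the first term is the ordinary backpropagation correction, the α term reuses the previous correction to speed up learning, and the β term reuses the correction before that to produce the oscillation described above.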

The identification learning process will now be described. First, an overview of the identification learning process and the relearning process is given with reference to FIG. 2. When information is collected by the sensors and the like while the vehicle is being driven and new evaluation data for identifying the driver state is obtained, every class label that might be identified for that evaluation data is assigned to it as a temporary class label (temporary teacher data). A plurality of temporary class labels is therefore assigned to a given set of evaluation data, and a plurality of sets of temporary learning data, each consisting of the evaluation data and teacher data, is formed. For each temporary class label, learning is performed by error backpropagation, and a learning curve showing how the squared error changes with the number of learning iterations is generated. To improve identification accuracy, the correctly labeled learning data used in preliminary learning and the like is also used in this learning. Using the temporal course of learning (how well the learning curve converges) as the decision criterion, the learning curve with the best convergence characteristic is selected from the learning curves for the temporary class labels, and the temporary class label corresponding to that curve is identified as the class label of the evaluation data. This also means that the evaluation data and the identified class label have been judged appropriate as learning data. Accordingly, learning by error backpropagation is performed using new learning data consisting of the evaluation data and the teacher data of the identified class label, and the weight of each neuron of the neural network classifier is updated.

At fixed time intervals, the identification ECU 50 takes in information from the sensors of each recognition means and combines the pieces of information into evaluation data. Because the evaluation data is used to identify the driver's psychological state, physical state, and driving skill, a large number of samples collected over a predetermined period is needed to assess each state. The collection period and the sampling cycle are determined by the driver state to be identified. Psychological and physical states need to be identified over a short span, so a short collection period and sampling cycle are set; driving skill needs to be identified over a long span, so a long collection period and sampling cycle are set. FIG. 3 shows an example of evaluation data, in which the combination of information consists of vehicle speed, lateral G, heart rate, and inter-vehicle distance, arranged as a vector of the sensor detection values X_1, X_2, X_3, X_4. Evaluation data is collected at every sampling cycle during the predetermined period, yielding m samples, evaluation data 1 through evaluation data m.

When the identification ECU 50 obtains evaluation data, it assigns to that data, as temporary class labels, the class labels that can be identified from the information contained in the data, and associates temporary teacher data with each temporary class label. Class labels and teacher data correspond one to one, and the teacher data is a vector whose elements are each 0 or 1. FIG. 4 shows an example of temporary class labels and the corresponding temporary teacher data. The temporary class labels assigned are polite, normal, impatient, fatigued, and abnormal. The temporary teacher data in this example assumes five neurons in the output layer of the neural network classifier and is expressed as a vector of five 0/1 values. FIG. 5 shows an example in which temporary teacher data (that is, temporary class labels) is assigned to the evaluation data of FIG. 3. As FIG. 5 shows, five sets of temporary teacher data are assigned to each of evaluation data 1, evaluation data 2, and so on for every sampling cycle. In this example there are m × 5 combinations.
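The assignment of temporary teacher data can be sketched as follows. This is a minimal illustration, assuming one output neuron per class label; the label order, the function names, and the sample values are hypothetical and are not taken from FIG. 3 to FIG. 5.

```python
# Class labels from the example in the text; their order here is an assumption.
CLASS_LABELS = ["polite", "normal", "impatient", "fatigued", "abnormal"]


def teacher_vector(label_index, num_labels):
    """Return the 0/1 teacher vector for one class label (one output neuron per label)."""
    return [1 if i == label_index else 0 for i in range(num_labels)]


def assign_temporary_teacher_data(evaluation_data, labels=CLASS_LABELS):
    """Pair every evaluation sample with the teacher vector of every temporary label.

    Returns a dict mapping each label to a list of (sample, teacher_vector) pairs.
    """
    trials = {}
    for idx, label in enumerate(labels):
        t = teacher_vector(idx, len(labels))
        trials[label] = [(x, t) for x in evaluation_data]
    return trials


# Hypothetical samples: (vehicle speed, lateral G, heart rate, inter-vehicle distance)
samples = [(60.0, 0.10, 72.0, 35.0), (62.0, 0.12, 74.0, 33.0), (58.0, 0.08, 71.0, 36.0)]
trials = assign_temporary_teacher_data(samples)
total_pairs = sum(len(pairs) for pairs in trials.values())
print(total_pairs)  # m x 5 combinations; 15 for m = 3
```

Each entry of `trials` corresponds to one set of temporary learning data, which is then used for one learning trial.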

For each temporary class label, the identification ECU 50 performs learning in the same way as in preliminary learning and updates the weight of each neuron of the neural network classifier. The learning uses both the temporary learning data, consisting of the evaluation data and the temporary teacher data, and the correctly labeled learning data (the learning data used in the preliminary learning process, and learning data consisting of evaluation data already processed in the identification learning process together with the teacher data of the class label identified for it). During this learning, each time one learning iteration finishes, the identification ECU 50 computes the squared error between the temporary teacher data and the output data using equation (2) and stores it in association with the iteration count, so that a learning curve can be generated. When the learning end condition described above is satisfied, the identification ECU 50 restores the updated weights of the neurons of the neural network classifier to their values before learning and proceeds to learning for the next temporary class label. This processing is performed for every temporary class label assigned to the evaluation data.

In equation (2), m is the number of samples of evaluation data (input data), n is the number of neurons in the output layer of the neural network classifier, p is the number of class labels (teacher data), t is the teacher data, and o is the output data. This squared error formula is also used for learning in the preliminary learning process and the relearning process.
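Equation (2) is not reproduced in this text. The sketch below shows one common way to compute a squared error from the quantities defined above, summing (t - o)^2 over the m samples and the n output neurons. Any scaling factor (such as 1/2 or 1/m) and the exact role of p in the original formula are not recoverable from the text, so they are left out here as an assumption; the function name is hypothetical.

```python
def squared_error(teacher, output):
    """Sum of squared differences over all samples and all output neurons.

    teacher, output: lists of m vectors, each with n elements.
    """
    total = 0.0
    for t_vec, o_vec in zip(teacher, output):
        for t, o in zip(t_vec, o_vec):
            total += (t - o) ** 2
    return total


# Two samples, two output neurons: the first sample is undecided, the second is exact.
example_error = squared_error([[0, 1], [1, 0]], [[0.5, 0.5], [1.0, 0.0]])
print(example_error)  # 0.5
```

Recording this value after every learning iteration gives the sequence of points that makes up a learning curve.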

When learning has finished for all temporary class labels, the identification ECU 50 generates a learning curve for each temporary class label by associating each learning iteration count with the squared error at that count. FIG. 2 plots the learning curves for the evaluation data of FIG. 5 and the five temporary class labels assigned to it; the horizontal axis is the number of learning iterations and the vertical axis is the squared error. Because this processing is performed inside the identification ECU 50, the learning curve is not actually plotted as in FIG. 2 but is held as numerical pairs of iteration count and squared error.

The identification ECU 50 determines the convergence characteristic (whether learning has converged globally) of each learning curve generated for each temporary class label, and compares and evaluates all of the learning curves. It then selects the learning curve with the best convergence characteristic and identifies the temporary class label of that curve as the appropriate class label for the evaluation data. In the example of FIG. 2, the learning curve of temporary label 2 has the best convergence characteristic, so the "normal" class label of label 2 is identified for the evaluation data. For the learning curve of temporary label 2, learning has converged globally, and it can be judged that the evaluation data was collected within a range comparable to the variation of the correctly labeled learning data and is correctly labeled as data of the target state. For the other learning curves, learning has not converged globally, and it can be judged that the evaluation data lies outside the range of variation of the correctly labeled learning data or was collected in a state that differs substantially in part.

The convergence characteristic of a learning curve is judged from three viewpoints: first, whether learning converges (the squared error at the end of learning); second, if it converges, whether it converges quickly (the convergence speed); and third, if it converges, whether it converges smoothly. Concretely, the identification ECU 50 evaluates these as follows. First, it checks whether learning converges within a fixed number of iterations (a number that can be set by learning in advance with correctly labeled data). Second, when learning converges within that number, it checks how many iterations were needed. Third, when learning converges within that number, it checks whether the gradient of the learning curve oscillates strongly (an oscillation index can be obtained, for example, by taking the maximum gradient of the learning curve or the average of about the three largest gradient peaks). By creating an evaluation function that quantifies and combines these three checks (taking the number of iterations to convergence and the oscillation index as inputs), the convergence characteristic of a learning curve can be computed quantitatively.
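The three checks can be combined into a single score along the following lines. This is a sketch under stated assumptions: the text does not give the evaluation function, so the threshold, the weights, the way the terms are combined, and the use of upward jumps in the error as the "gradient peaks" are all choices made here for illustration.

```python
def convergence_score(curve, threshold=0.05, w_speed=1.0, w_osc=1.0):
    """Score a learning curve (squared error per iteration); higher is better.

    Returns 0.0 if the error never drops to `threshold` within the curve.
    """
    # Check 1: does learning converge within the allotted number of iterations?
    converged_at = next((i for i, e in enumerate(curve) if e <= threshold), None)
    if converged_at is None:
        return 0.0
    # Check 2: how quickly? (fraction of the iteration budget left unused)
    speed = 1.0 - converged_at / len(curve)
    # Check 3: how smoothly? mean of the three largest upward jumps in the error
    rises = sorted((max(b - a, 0.0) for a, b in zip(curve, curve[1:])), reverse=True)
    top = rises[:3]
    oscillation = sum(top) / len(top) if top else 0.0
    return w_speed * speed / (1.0 + w_osc * oscillation)


smooth = convergence_score([1.0, 0.5, 0.2, 0.04, 0.03])
noisy = convergence_score([1.0, 0.3, 0.9, 0.2, 0.8, 0.04])
stuck = convergence_score([1.0, 0.9, 0.9, 0.9])
print(smooth > noisy > stuck)  # True
```

With a score of this kind, the temporary class label whose curve scores highest would be the one identified for the evaluation data.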

When the identification ECU 50 has identified the class label for the evaluation data, it outputs the identification result (that is, the driver state) to the driving support ECU 60. The evaluation data and the identified class label (teacher data) are also used in the relearning process.

The relearning process will now be described. After the evaluation data has been identified in the identification learning process, the identification ECU 50 labels the evaluation data according to the identification result and generates learning data consisting of the evaluation data and the teacher data. The identification ECU 50 then performs learning with that learning data in the same way as in preliminary learning and updates the weight of each neuron of the neural network classifier. In this way, the identification ECU 50 relearns from new learning data and tunes the neural network classifier.

The driving support ECU 60 includes a CPU, a ROM, a RAM, and the like. When the driver state identified by the identification ECU 50 is input, the driving support ECU 60 judges the safety of driving according to that driver state. When driving safety has fallen below the normal level, the driving support ECU 60 generates, according to the level, an image or message for alerting or advising the driver, and displays the image on the display 40 or outputs the message as speech from the speaker 41. Also, when driving safety has fallen below the normal level, the driving support ECU 60 sends the driving support system 42 an instruction signal, according to the level, for example to change its control timing.

Once the identification learning process and the relearning process, carried out over a reasonably long period, have tuned the neural network classifier to the driver's individual characteristics with high identification accuracy, both processes may be stopped, and the class label (driver state) may be identified by a single evaluation using the feedforward computation of the tuned neural network classifier.

The operation of the driver state identification device 1 will be described with reference to FIGS. 1 to 5. In particular, the preliminary learning process in the identification ECU 50 is described along the flowchart of FIG. 6, the identification learning process along the flowchart of FIG. 7, and the relearning process along the flowchart of FIG. 8. FIG. 6 is a flowchart showing the flow of the preliminary learning process in the neural network identification ECU of FIG. 1. FIG. 7 is a flowchart showing the flow of the identification learning process in the neural network identification ECU of FIG. 1. FIG. 8 is a flowchart showing the flow of the relearning process in the neural network identification ECU of FIG. 1. Note that the processing shown in FIG. 8 is intended to improve the performance of the neural network classifier, and its purpose differs from that of the processing shown in FIG. 7, which performs state identification with the neural network classifier.

Before the user starts using the vehicle, a large amount of input data, each item combining several pieces of information for identifying the driver state, is collected, and a class label is set for each item of input data. For each item of input data and its correctly assigned class label, the identification ECU 50 generates learning data consisting of the input data and the teacher data of the class label (S10). The identification ECU 50 then inputs the input data to the neural network classifier, performs learning by error backpropagation, and updates the weight of each neuron of the neural network classifier (S11). In this way, the base neural network classifier is built from general learning data before the user starts using the vehicle.

After the user has started using the vehicle, while the driver is driving, the sensors of the driver information recognition means, the environment information recognition means, and the vehicle information recognition means detect or recognize the information needed to identify the driver state and transmit it to the identification ECU 50. The identification ECU 50 receives the information from each sensor at fixed time intervals (S20, S30). At every sampling cycle, the identification ECU 50 combines several pieces of information into evaluation data and holds the evaluation data for each sampling cycle within the predetermined period (S20, S30). This evaluation data has no class label, so an appropriate class label must be identified for it. As described above, the predetermined period and the sampling cycle for collecting evaluation data are set according to the driver state to be identified.

The identification ECU 50 prepares the correctly labeled learning data used in past learning (S21). It also assigns a temporary class label to the evaluation data, generates learning data consisting of the evaluation data and the temporary teacher data corresponding to that temporary class label, and adds it to the learning data (S22). Whether the temporary class label assigned to the evaluation data is appropriate as its class label still has to be judged. The identification ECU 50 performs learning by error backpropagation using the correctly labeled learning data and the temporarily labeled learning data, and updates the weight of each neuron of the neural network classifier (S23). Until the learning end condition is satisfied, the identification ECU 50 monitors the temporal course of learning by computing, after every learning iteration, the squared error between the temporary teacher data and the output data, and storing it in association with the iteration count (S24). When the learning end condition is satisfied, the identification ECU 50 ends learning for that temporary class label, discards all of the weight data updated by this learning for each neuron of the neural network, and restores the weights to their values before learning (S25). The identification ECU 50 then determines whether every possible temporary labeling has been tried for the evaluation data (S26). If it determines in S26 that not every temporary labeling has been tried, the identification ECU 50 returns to S21 and processes the next temporary class label. In this way, learning is performed for each temporary class label assigned to the evaluation data, and the temporal course of that learning is monitored.
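The loop from S21 to S26 can be sketched as below. To stay short and self-contained, the sketch swaps in a single-layer sigmoid network trained by plain gradient descent in place of the multilayer classifier and the improved backpropagation rule of the text, and it uses the final squared error as a stand-in for the convergence evaluation of S27. All function names and data values are hypothetical.

```python
import math


def forward(weights, x):
    """One sigmoid output neuron per class; weights[k] = [bias, w1, w2, ...]."""
    return [1.0 / (1.0 + math.exp(-(w[0] + sum(wi * xi for wi, xi in zip(w[1:], x)))))
            for w in weights]


def train_epoch(weights, data, lr=0.5):
    """One pass of plain gradient descent on squared error (updates weights in place)."""
    for x, t in data:
        o = forward(weights, x)
        for k, w in enumerate(weights):
            delta = (o[k] - t[k]) * o[k] * (1.0 - o[k])
            w[0] -= lr * delta
            for i, xi in enumerate(x):
                w[i + 1] -= lr * delta * xi


def trial_error(weights, data):
    """Squared error between the (temporary) teacher data and the outputs."""
    return sum((tk - ok) ** 2 for x, t in data for tk, ok in zip(t, forward(weights, x)))


def identify_label(weights, labeled, evaluation, num_labels, epochs=200):
    """Try every temporary label and return (best_label, curves)."""
    curves = {}
    for label in range(num_labels):
        snapshot = [w[:] for w in weights]             # remember pre-learning weights
        teacher = [1 if k == label else 0 for k in range(num_labels)]
        trial = [(x, teacher) for x in evaluation]     # S22: temporary learning data
        curve = []
        for _ in range(epochs):                        # S23: learn on both data sets
            train_epoch(weights, labeled + trial)
            curve.append(trial_error(weights, trial))  # S24: record squared error
        weights[:] = snapshot                          # S25: discard the trial update
        curves[label] = curve
    # Stand-in for S27: pick the curve with the lowest final error.
    best = min(curves, key=lambda k: curves[k][-1])
    return best, curves


# Hypothetical two-class data: class 0 near the origin, class 1 near (1, 1).
labeled = [((0.0, 0.1), [1, 0]), ((0.1, 0.0), [1, 0]),
           ((1.0, 0.9), [0, 1]), ((0.9, 1.0), [0, 1])]
evaluation = [(0.95, 1.0), (1.0, 0.95)]
weights = [[0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
best, curves = identify_label(weights, labeled, evaluation, num_labels=2)
print(best)  # 1: the evaluation samples agree with the labeled class-1 data
```

The point of the sketch is the control flow: each trial trains on the labeled data plus the temporarily labeled evaluation data, records the error after each pass, and then restores the weights so that the next trial starts from the same classifier.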

If it determines in S26 that every temporary labeling has been tried, the identification ECU 50 generates, for each temporary class label, a learning curve from the iteration counts and the corresponding squared errors, and quantitatively computes the convergence characteristic of each learning curve (S27). The identification ECU 50 then compares and evaluates the convergence characteristics of all the learning curves and selects the learning curve with the best convergence characteristic (S27). Further, the identification ECU 50 identifies the temporary class label corresponding to the selected learning curve as the appropriate class label for the evaluation data and outputs the identification result to the driving support ECU 60 (S28). In this way, of the temporary class labels assigned to the evaluation data, the one with the best learning convergence is identified as the class label (the driver state) of the evaluation data.

After the evaluation data has been obtained (S20, S30) and the class label identification processing has been performed on it by the processing of S21 to S27 above (S31), the identification ECU 50 determines whether the evaluation data is appropriate as learning data; specifically, it determines whether the processing of S31 has identified one appropriate class label for the evaluation data (S32). If it determines in S32 that the data is not appropriate as learning data, the identification ECU 50 does not perform relearning.

If it determines in S32 that the data is appropriate as learning data, the identification ECU 50 labels the evaluation data according to the identification result (S33) and generates correctly labeled learning data consisting of the evaluation data and the identified teacher data (S34). The identification ECU 50 performs learning by error backpropagation using this learning data and updates the weight of each neuron of the neural network classifier (S35). The neural network classifier is thereby tuned with learning data that reflects the driver's individual characteristics.

When the identification result is input from the identification ECU 50, the driving support ECU 60 determines the level of driving safety based on the identified driver state. When driving safety has fallen below the normal level, the driving support ECU 60 alerts or advises the driver through the display 40 and the speaker 41 according to the level, and sends the driving support system 42 an instruction signal, according to the level, for example to change its control timing. As a result, when the driver's safety level drops because of fatigue, impatience, drowsiness, or the like, appropriate driving support is provided to improve safety.

According to the driver state identification device 1, the evaluation data is not evaluated only once by the feedforward computation of a neural network as in the conventional approach. Instead, a plurality of temporary class labels (temporary teacher data) is automatically assigned to the evaluation data, and multiple evaluations are performed based on the temporal course of learning (the convergence characteristic of the learning curve), so the class label (and therefore the driver state) of the evaluation data is identified with high accuracy. In particular, because the decision criterion is whether learning converges globally, identification accuracy and robustness against noise and partially missing data are higher than in the conventional approach. In addition, ordinary information obtained during driving is used as the evaluation data, and the temporary (virtual) teacher data is assigned to it automatically for identification, so no burden is placed on the driver.

In other words, because a plurality of different temporary class labels are set for the evaluation data, identification draws on far more information than before. Specifically, in the identification learning process, the evaluation of one temporary class label corresponds to a single evaluation by the conventional feedforward calculation, so evaluating each of a plurality of temporary class labels bases the identification on far more information than the conventional approach. Moreover, learning is attempted with every possible temporary class label set for the evaluation data, and the degree of convergence of the learning process is monitored for each label to obtain information on whether that label is appropriate for the evaluation data. Because identification is based on this information, the device can deal flexibly even with evaluation data containing noise or partially missing data, which is difficult to handle when a trained neural network evaluates the evaluation data only once as in the prior art.
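
The label-by-label evaluation described above can be sketched roughly as follows. This is a minimal illustration, not the embodiment itself: the class `ThreeLayerNet`, the function `identify`, the toy two-class data, the learning rate, the epoch count, and the convergence score (sum of the squared errors along the curve) are all assumptions made for the example. For brevity, the identification learning here uses only the evaluation data and its temporary teacher data.

```python
import copy
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def one_hot(label, n_classes):
    return [1.0 if k == label else 0.0 for k in range(n_classes)]


class ThreeLayerNet:
    """Input -> hidden -> output network trained by error backpropagation."""

    def __init__(self, n_in, n_hidden, n_out, rng):
        # The extra column in each weight row is the bias weight.
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in + 1)]
                   for _ in range(n_hidden)]
        self.w2 = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden + 1)]
                   for _ in range(n_out)]

    def forward(self, x):
        xb = list(x) + [1.0]
        h = [sigmoid(sum(w * v for w, v in zip(row, xb))) for row in self.w1]
        hb = h + [1.0]
        y = [sigmoid(sum(w * v for w, v in zip(row, hb))) for row in self.w2]
        return xb, hb, y

    def train_step(self, x, t, lr=0.5):
        """One backpropagation update; returns the squared error before it."""
        xb, hb, y = self.forward(x)
        err = sum((yi - ti) ** 2 for yi, ti in zip(y, t))
        # Deltas for the half squared error with sigmoid units.
        d_out = [(yi - ti) * yi * (1.0 - yi) for yi, ti in zip(y, t)]
        d_hid = [hb[j] * (1.0 - hb[j])
                 * sum(d_out[k] * self.w2[k][j] for k in range(len(d_out)))
                 for j in range(len(self.w1))]
        for k, row in enumerate(self.w2):
            for j in range(len(row)):
                row[j] -= lr * d_out[k] * hb[j]
        for j, row in enumerate(self.w1):
            for i in range(len(row)):
                row[i] -= lr * d_hid[j] * xb[i]
        return err


def identify(pretrained, x_eval, n_classes, epochs=30):
    """Try every temporary class label on x_eval and compare learning curves."""
    curves = {}
    for label in range(n_classes):
        net = copy.deepcopy(pretrained)  # every trial starts from the same weights
        teacher = one_hot(label, n_classes)
        curves[label] = [net.train_step(x_eval, teacher) for _ in range(epochs)]
    # Assumed score: a smaller summed error means the curve converged better.
    best = min(curves, key=lambda lab: sum(curves[lab]))
    return best, curves


rng = random.Random(0)
# Toy labeled learning data: class 0 near (0, 0), class 1 near (1, 1).
train = [([0.1, 0.0], 0), ([0.0, 0.2], 0), ([0.9, 1.0], 1), ([1.0, 0.8], 1)]
net = ThreeLayerNet(2, 3, 2, rng)
for _ in range(500):  # preliminary learning on correctly labeled data
    for x, label in train:
        net.train_step(x, one_hot(label, 2))

best_label, curves = identify(net, [0.95, 0.9], n_classes=2)
print(best_label, {lab: round(sum(c), 3) for lab, c in curves.items()})
```

Each trial works on a copy of the pretrained network, so the curves for different temporary labels are comparable and the pretrained weights are left untouched.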

Furthermore, according to this driver state identification device 1, relearning is performed using the evaluation data and the identified class label (teacher data) as learning data, so the neural network classifier can be tuned to the individual characteristics of the driver. Because the neural network classifier thus adjusts dynamically to the evaluation data, it is more flexible and more accurate than identification with a conventional neural network whose weights are fixed. In addition, the driver does not need to input his or her state for the tuning, so no burden is placed on the driver.
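
A rough sketch of this relearning step is shown below, assuming one-hot teacher data. The function `relearn` and the training callback `train_fn` are hypothetical names introduced for illustration; the callback stands in for whatever routine trains the classifier on a list of (input, teacher) pairs.

```python
def relearn(learning_data, x_eval, estimated_label, n_classes, train_fn):
    """Add the identified (evaluation data, teacher data) pair and retrain.

    learning_data: list of (input vector, teacher vector) pairs.
    train_fn: hypothetical callback that trains the classifier on such a list.
    """
    teacher = [1.0 if k == estimated_label else 0.0 for k in range(n_classes)]
    # Build a new list so the caller's original learning data is not modified.
    updated = learning_data + [(list(x_eval), teacher)]
    train_fn(updated)
    return updated


# Usage with a stub training routine that only reports the data size.
labeled = [([0.1, 0.0], [1.0, 0.0]), ([0.9, 1.0], [0.0, 1.0])]
labeled = relearn(labeled, [0.95, 0.9], estimated_label=1, n_classes=2,
                  train_fn=lambda data: print("retraining on", len(data), "samples"))
```

Because the teacher data comes from the identification result rather than from the driver, no manual labeling is involved at this stage.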

In addition, when this driver state identification device 1 performs identification learning, it uses correctly labeled learning data in addition to the temporary learning data consisting of the evaluation data and the temporary teacher data, which further improves the accuracy with which the class label of the evaluation data is identified.

In addition, this driver state identification device 1 provides driving support appropriate to the identified driver state, so it can effectively prompt the driver to improve according to that state and can improve driving safety. The driving support can be provided in just the right amount, without annoying the driver, and appropriate driving support can begin at an earlier stage.

An embodiment of the present invention has been described above, but the present invention is not limited to this embodiment and can be implemented in various forms.

For example, the present embodiment is applied to a driver state identification device that detects driver information, environment information, and vehicle information as evaluation data and identifies the driver state from the detected information, but the invention can also be applied to identifying various things other than the driver state. Information other than that exemplified in the embodiment may be used as the information needed to identify the driver state, and driver states other than those exemplified in the embodiment may be identified.

In the present embodiment, the driver state for the evaluation data is identified and the identification result is used to provide driving support. However, the device may simply output the identification result of the driver state for the evaluation data, or may provide that result to another system.

In the present embodiment, relearning (additional learning) is also performed with new learning data based on the identification result for the evaluation data. However, the relearning may be omitted after the evaluation data has been identified.

In the present embodiment, identification learning uses correctly labeled learning data in addition to the learning data consisting of the evaluation data and its temporary teacher data. However, identification learning may be performed without the correctly labeled learning data, using only the learning data consisting of the evaluation data and its temporary teacher data.

In the present embodiment, a three-layer hierarchical neural network trained by the error backpropagation method is used, but the structure and learning method of the neural network are not particularly limited.

In the present embodiment, the learning curve is generated from the squared error given by Expression (2), but the learning curve is not particularly limited as long as it allows the convergence characteristics of learning to be evaluated.
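
As an illustration of how a learning curve and one possible convergence measure might be computed, the following sketch uses a plain sum of squared differences between the network outputs and the teacher data; its exact form may differ from Expression (2). The function names, the sample outputs, and the threshold-based measure are assumptions made for the example.

```python
def squared_error_curve(outputs_per_epoch, teacher):
    """One squared-error value per epoch, given the outputs recorded at each epoch."""
    return [sum((o - t) ** 2 for o, t in zip(outputs, teacher))
            for outputs in outputs_per_epoch]


def epochs_to_converge(curve, threshold):
    """Index of the first epoch whose error is below the threshold, else None."""
    for epoch, err in enumerate(curve):
        if err < threshold:
            return epoch
    return None


# Outputs recorded over three epochs while learning toward the teacher data [0, 1].
recorded = [[0.5, 0.5], [0.2, 0.8], [0.1, 0.9]]
curve = squared_error_curve(recorded, teacher=[0.0, 1.0])
print(curve, epochs_to_converge(curve, threshold=0.05))
```

Other measures, such as the final error or the area under the curve, could be substituted for the threshold test in the same way.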

A block diagram of the driver state identification device according to the present embodiment.
A conceptual diagram of the identification method in the driver state identification device of FIG. 1.
An example of evaluation data.
An example of temporary class labels and the temporary teacher data corresponding to them.
An example in which temporary teacher data (temporary class labels) are assigned to evaluation data.
A flowchart showing the flow of the preliminary learning process in the neural network identification ECU of FIG. 1.
A flowchart showing the flow of the identification learning process in the neural network identification ECU of FIG. 1.
A flowchart showing the flow of the relearning process in the neural network identification ECU of FIG. 1.

Explanation of symbols

1 ... Driver state identification device, 10 ... Face direction/line-of-sight recognition sensor, 11 ... Motion/posture recognition sensor, 12 ... Face/eyeball recognition sensor, 13 ... Foot area recognition sensor, 14 ... Heart rate sensor, 20 ... Inter-vehicle distance sensor, 21 ... Lane recognition sensor, 22 ... Traffic signal recognition sensor, 23 ... Sign recognition sensor, 24 ... Stop line recognition sensor, 25 ... Pedestrian recognition sensor, 26 ... Communication device for acquiring traffic environment information, 27 ... Car navigation system, 30 ... Vehicle speed sensor, 31 ... Steering angle sensor, 32 ... Accelerator opening sensor, 33 ... Brake depression amount sensor, 34 ... Longitudinal G sensor, 35 ... Lateral G sensor, 36 ... Vertical G sensor, 37 ... Turn signal operation sensor, 40 ... Display, 41 ... Speaker, 42 ... Driving support system, 50 ... Neural network identification ECU, 60 ... Driver state adaptive driving support ECU

Claims

1. A state estimation method using a neural network, comprising:
an initial learning step of training the neural network based on input data and teacher data corresponding to the class label of the input data;
a virtual teacher data assignment step of assigning virtual teacher data corresponding to a virtual class label to evaluation data whose class label is unknown;
an evaluation step of training the neural network based on the evaluation data and the virtual teacher data of the evaluation data, and evaluating the convergence characteristic of the learning curve of that training; and
a class label estimation step of estimating, as the class label of the evaluation data, the virtual class label corresponding to the virtual teacher data whose learning curve has a high convergence characteristic among the plurality of learning curves for the plurality of different virtual teacher data assigned to the evaluation data.

2. The state estimation method using a neural network according to claim 1, further comprising a class label output step of outputting the class label of the evaluation data estimated in the class label estimation step.

3. The state estimation method using the neural network described in 1, further comprising an additional learning step of training the neural network using, as teacher data of the evaluation data, the virtual teacher data corresponding to the class label of the evaluation data estimated in the class label estimation step.

4. The state estimation method using a neural network according to claim 1, wherein the evaluation step trains the neural network based on the evaluation data, the virtual teacher data of the evaluation data, the input data, and the teacher data of the input data.

5. The state estimation method using a neural network according to any one of claims 1 to 4, wherein the input data and the evaluation data are values detected by driving state detection means, and the class label is a driver state.

6. The state estimation method using a neural network according to any one of claims 1 to 5, wherein the neural network is a three-layer hierarchical neural network; the correspondence between the input data and the teacher data of the input data, the correspondence between the evaluation data and the virtual teacher data of the evaluation data, or the correspondence between the evaluation data and the teacher data estimated for the evaluation data is learned by the error backpropagation method; and the learning curve is generated based on the error of the error backpropagation method.

7. A state estimation apparatus using a neural network, comprising:
initial learning means for training the neural network based on input data and teacher data corresponding to the class label of the input data;
virtual teacher data assignment means for assigning virtual teacher data corresponding to a virtual class label to evaluation data whose class label is unknown;
evaluation means for training the neural network based on the evaluation data and the virtual teacher data of the evaluation data, and evaluating the convergence characteristic of the learning curve of that training; and
class label estimation means for estimating, as the class label of the evaluation data, the virtual class label corresponding to the virtual teacher data whose learning curve has a high convergence characteristic among the plurality of learning curves for the plurality of different virtual teacher data assigned to the evaluation data.

8. The state estimation apparatus using a neural network according to claim 7, further comprising class label output means for outputting a class label of evaluation data estimated by the class label estimation means.

9. The state estimation apparatus using the neural network described in 1, further comprising additional learning means for training the neural network using, as teacher data of the evaluation data, the virtual teacher data corresponding to the class label of the evaluation data estimated by the class label estimation means.

10. The state estimation apparatus using the neural network described in item 1, wherein the evaluation means trains the neural network based on the evaluation data, the virtual teacher data of the evaluation data, the input data, and the teacher data of the input data.

11. The state estimation apparatus using a neural network according to any one of claims 7 to 10, wherein the input data and the evaluation data are values detected by driving state detection means, and the class label is a driver state.

12. The state estimation apparatus using a neural network according to any one of claims 7 to 11, wherein the neural network is a three-layer hierarchical neural network; the correspondence between the input data and the teacher data of the input data, the correspondence between the evaluation data and the virtual teacher data of the evaluation data, or the correspondence between the evaluation data and the teacher data estimated for the evaluation data is learned by the error backpropagation method; and the learning curve is generated based on the error of the error backpropagation method.