JP2022038241A

JP2022038241A - Vehicle allocation system

Info

Publication number: JP2022038241A
Application number: JP2020142631A
Authority: JP
Inventors: 大樹横山; Daiki Yokoyama; 智洋金子; Tomohiro Kaneko
Original assignee: Toyota Motor Corp
Current assignee: Toyota Motor Corp
Priority date: 2020-08-26
Filing date: 2020-08-26
Publication date: 2022-03-10

Abstract

To suppress an inequality among passengers of vehicle allocation services.SOLUTION: In a vehicle allocation system 100 comprising a server 1 and a plurality of autonomous driving vehicles 2, and providing a vehicle allocation service using the autonomous driving vehicle 2, the server 1 is configured that the server selects a learning vehicle which needs to learn a learning model from the autonomous driving vehicles 2 which are waiting for a vehicle dispatch service based on a learning progress of the learning model used in the autonomous driving vehicle 2 received from the autonomous driving vehicle 2, and transmits a learning instruction to the learning vehicle; and upon receiving the learning instruction, the autonomous driving vehicle 2 is configured to perform traveling for learning the learning model while waiting for dispatch based on the learning instruction.SELECTED DRAWING: Figure 1

Description

本発明は、配車システムに関する。 The present invention relates to a vehicle allocation system.

近年、自律走行が可能な自動運転車両を用いたタクシーやバス、ライドシェア等のモビリティサービスの実現に向けて、自動運転技術の開発が行われている。特許文献１には、このようなモビリティサービスの一つとして、利用客の利用要求に応じて利用客の希望する乗車地点に自動運転車両を移動させ、自動運転車両に利用客を乗車させて目的地まで送り届けることが可能な配車サービスに関する技術が開示されている。 In recent years, autonomous driving technology has been developed for the realization of mobility services such as taxis, buses, and ride sharing using autonomous vehicles capable of autonomous driving. As one of such mobility services, Patent Document 1 aims to move an autonomous driving vehicle to a boarding point desired by the user in response to a user's request for use, and to allow the user to ride on the autonomous driving vehicle. The technology related to the vehicle dispatch service that can be delivered to the ground is disclosed.

特許文献２には、サーバや車両で学習を行った学習済みのニューラルネットワークを用いて、内燃機関の排気浄化触媒の温度を推定するものが開示されている。 Patent Document 2 discloses that the temperature of the exhaust gas purification catalyst of an internal combustion engine is estimated by using a learned neural network trained by a server or a vehicle.

特開２０１７－１８２１３７号公報Japanese Unexamined Patent Publication No. 2017-182137 特開２０１９－１８３６９８号公報Japanese Unexamined Patent Publication No. 2019-183698

将来的には、配車サービスに使用される各自動運転車両上で学習モデルを使用した各種の制御が実施され、また当該学習モデルの学習が行われることが想定される。各自動運転車両で使用される学習モデルは、当然、その学習が進んでいることが望ましい。しかしながら、配車サービスに使用される各自動運転車両の配車状況は車両毎に様々である。したがって、配車状況によっては、学習モデルの学習が十分に進んでいない車両、すなわち他の自動運転車両よりも学習モデルの学習進捗度が低い自動運転車両が生じることが考えられる。 In the future, it is expected that various controls using the learning model will be carried out on each autonomous driving vehicle used for the vehicle allocation service, and that the learning model will be learned. Naturally, it is desirable that the learning model used in each autonomous driving vehicle is well-learned. However, the vehicle allocation status of each autonomous vehicle used for the vehicle allocation service varies from vehicle to vehicle. Therefore, depending on the vehicle allocation situation, it is conceivable that there will be a vehicle in which the learning of the learning model has not sufficiently progressed, that is, an autonomous driving vehicle in which the learning progress of the learning model is lower than that of other autonomous driving vehicles.

そうすると、例えば、学習モデルの学習が十分に進んでいない自動運転車両が配車された配車サービスの利用客に対する配車サービスの質が、学習モデルの学習が十分に進んでいる自動運転車両が配車された利用客に対する配車サービスの質よりも低下するなどして、配車サービスの利用客の中で不平等が生じるおそれがある。 Then, for example, the quality of the vehicle dispatch service for the passengers of the vehicle dispatch service to which the self-driving vehicle in which the learning model has not been sufficiently learned is dispatched, and the self-driving vehicle in which the learning model has been sufficiently learned are dispatched. There is a risk of inequality among the passengers of the vehicle dispatch service, such as lowering the quality of the vehicle dispatch service to the passengers.

本発明はこのような問題点に着目してなされたものであり、配車サービスの利用客の中で不平等が生じるのを抑制することを目的とする。 The present invention has been made by paying attention to such a problem, and an object of the present invention is to suppress the occurrence of inequality among the users of the vehicle dispatch service.

上記課題を解決するために、本発明のある態様による配車システムは、サーバと複数の自動運転車両とを備え、自動運転車両を用いた配車サービスを提供する。サーバは、自動運転車両から受信した、自動運転車両において使用される学習モデルの学習進捗度に基づいて、配車サービスのために待機している自動運転車両の中から学習モデルの学習が必要な学習車両を選択し、学習車両に対して学習指示を送信するように構成される。自動運転車両は、学習指示を受信すると、学習指示に基づいて、学習モデルの学習を行うための走行を配車待機中に実施するように構成される。 In order to solve the above problems, the vehicle allocation system according to a certain aspect of the present invention includes a server and a plurality of autonomous driving vehicles, and provides a vehicle allocation service using the autonomous driving vehicle. Based on the learning progress of the learning model used in the autonomous driving vehicle received from the autonomous driving vehicle, the server needs to learn the learning model from the autonomous driving vehicle waiting for the vehicle allocation service. It is configured to select a vehicle and send learning instructions to the learning vehicle. Upon receiving the learning instruction, the autonomous driving vehicle is configured to perform traveling for learning the learning model while waiting for dispatch based on the learning instruction.

本発明のこの態様によれば、各自動運転車両の学習モデルの学習進捗度に基づいて配車サービスのために待機している自動運転車両の中から学習モデルの学習が必要な学習車両を選択し、当該学習車両に対して、学習モデルの学習を行うための走行を配車待機中に実施させることができる。そのため、各自動運転車両の配車状況が車両毎に様々であったとしても、各自動運転車両の学習モデルの学習進捗度に差が生じるのを抑制できる。したがって、配車サービスの利用客の中で不平等が生じるのを抑制することができる。 According to this aspect of the present invention, a learning vehicle that requires learning of a learning model is selected from among the automatic driving vehicles waiting for a vehicle allocation service based on the learning progress of the learning model of each automatic driving vehicle. , The learning vehicle can be made to run for learning the learning model while waiting for the vehicle to be dispatched. Therefore, even if the vehicle allocation status of each autonomous driving vehicle varies from vehicle to vehicle, it is possible to suppress a difference in the learning progress of the learning model of each autonomous driving vehicle. Therefore, it is possible to suppress the occurrence of inequality among the users of the vehicle dispatch service.

図１は、本発明の一実施形態による配車システムの概略構成図である。FIG. 1 is a schematic configuration diagram of a vehicle allocation system according to an embodiment of the present invention. 図２は、本発明の一実施形態による自動運転車両のハードウェア構成を示す概略図である。FIG. 2 is a schematic diagram showing a hardware configuration of an autonomous driving vehicle according to an embodiment of the present invention. 図３は、ニューラルネットワークモデルの一例を示す図である。FIG. 3 is a diagram showing an example of a neural network model. 図４は、サーバ、自動運転車両及び携帯端末の間で実行される配車処理について説明するフローチャートである。FIG. 4 is a flowchart illustrating a vehicle allocation process executed between a server, an autonomous vehicle, and a mobile terminal. 図５は、サーバ及び自動運転車両の間で実行される学習進捗度更新処理について説明するフローチャートである。FIG. 5 is a flowchart illustrating a learning progress update process executed between the server and the autonomous driving vehicle. 図６は、サーバ及び自動運転車両の間で実行される学習走行処理について説明するフローチャートである。FIG. 6 is a flowchart illustrating a learning driving process executed between the server and the autonomous driving vehicle.

以下、図面を参照して本発明の一実施形態について詳細に説明する。なお、以下の説明では、同様な構成要素には同一の参照番号を付す。 Hereinafter, an embodiment of the present invention will be described in detail with reference to the drawings. In the following description, similar components are given the same reference numbers.

図１は、本発明の一実施形態による配車システム１００の概略構成図である。 FIG. 1 is a schematic configuration diagram of a vehicle allocation system 100 according to an embodiment of the present invention.

本実施形態による配車システム１００は、サーバ１と、配車サービスに使用される複数の自動運転車両２と、配車サービスの各利用客４が所持する携帯電話機やタブレットコンピュータなどの携帯端末３と、を備える。配車システム１００は、利用客４の利用要求に応じて利用客４の希望する乗車地点に自動運転車両２を移動させ、自動運転車両２に利用客４を乗車させて利用客４の目的地まで送り届ける配車サービスを実現するためのシステムである。 The vehicle allocation system 100 according to the present embodiment includes a server 1, a plurality of autonomous driving vehicles 2 used for the vehicle allocation service, and a mobile terminal 3 such as a mobile phone or a tablet computer possessed by each user 4 of the vehicle allocation service. Be prepared. The vehicle allocation system 100 moves the autonomous driving vehicle 2 to the desired boarding point of the passenger 4 in response to the usage request of the passenger 4, and causes the autonomous driving vehicle 2 to board the passenger 4 to the destination of the passenger 4. It is a system to realize the delivery service.

サーバ１、自動運転車両２、及び携帯端末３は、光通信回線等で構成されるネットワーク５を介して互いに通信可能となっている。サーバ１は、例えば不図示のゲートウェイ等を介してネットワーク５と接続される。自動運転車両２、及び携帯端末３は、例えば無線基地局６等を介してネットワーク５と接続される。 The server 1, the self-driving vehicle 2, and the mobile terminal 3 can communicate with each other via a network 5 composed of an optical communication line or the like. The server 1 is connected to the network 5 via, for example, a gateway (not shown). The self-driving vehicle 2 and the mobile terminal 3 are connected to the network 5 via, for example, a radio base station 6.

図１に示すように、サーバ１は、サーバ通信部１１と、サーバ記憶部１２と、サーバ処理部１３と、を備える。 As shown in FIG. 1, the server 1 includes a server communication unit 11, a server storage unit 12, and a server processing unit 13.

サーバ通信部１１は、サーバ１を例えばゲートウェイ等を介してネットワーク５と接続するための通信インターフェース回路を有し、各自動運転車両２及び各携帯端末３との間で相互に通信することができるように構成される。 The server communication unit 11 has a communication interface circuit for connecting the server 1 to the network 5 via, for example, a gateway, and can communicate with each other with each autonomous driving vehicle 2 and each mobile terminal 3. It is configured as follows.

サーバ記憶部１２は、ＨＤＤ（Hard Disk Drive）や光記録媒体、半導体メモリ等の記憶媒体を有し、サーバ処理部１３での処理に用いられる各種のコンピュータプログラムやデータ等を記憶する。 The server storage unit 12 has a storage medium such as an HDD (Hard Disk Drive), an optical recording medium, and a semiconductor memory, and stores various computer programs, data, and the like used for processing in the server processing unit 13.

サーバ処理部１３は、一又は複数個のプロセッサ及びその周辺回路を有する。サーバ処理部１３は、サーバ記憶部１２に格納された各種のコンピュータプログラムを実行し、サーバ１の全体的な動作を統括的に制御するものであり、例えばＣＰＵ（Central Processing Unit）である。例えばサーバ処理部１３は、携帯端末３から配車依頼を受信すると、複数の自動運転車両２の中から配車車両を決定し、当該配車車両に対して配車指示を送信する。 The server processing unit 13 has one or more processors and peripheral circuits thereof. The server processing unit 13 executes various computer programs stored in the server storage unit 12 and comprehensively controls the overall operation of the server 1, for example, a CPU (Central Processing Unit). For example, when the server processing unit 13 receives a vehicle allocation request from the mobile terminal 3, it determines a vehicle allocation vehicle from among a plurality of autonomous driving vehicles 2 and transmits a vehicle allocation instruction to the vehicle allocation vehicle.

図２は、自動運転車両２のハードウェア構成を示す概略図である。 FIG. 2 is a schematic diagram showing a hardware configuration of the autonomous driving vehicle 2.

図２に示すように、自動運転車両２は、周辺情報検出装置２１、ＧＰＳ受信機２２、地図データベース２３、アクチュエータ２４、状態量検出装置２５、通信モジュール２６及び電子制御ユニット２０を備え、加速、操舵、及び制動に関する運転操作を自動的に行うことができるように構成される。周辺情報検出装置２１、ＧＰＳ受信機２２、地図データベース２３、アクチュエータ２４、状態量検出装置２５及び通信モジュール２６は、ＣＡＮ（Controller Area Network）等の規格に準拠した車内ネットワーク２７を介して電子制御ユニット２０に通信可能に接続される。 As shown in FIG. 2, the autonomous driving vehicle 2 includes a peripheral information detection device 21, a GPS receiver 22, a map database 23, an actuator 24, a state quantity detection device 25, a communication module 26, and an electronic control unit 20, and accelerates. It is configured to be able to automatically perform driving operations related to steering and braking. The peripheral information detection device 21, the GPS receiver 22, the map database 23, the actuator 24, the state quantity detection device 25, and the communication module 26 are electronically controlled units via an in-vehicle network 27 compliant with standards such as CAN (Controller Area Network). Connected to 20 communicably.

周辺情報検出装置２１は、自動運転車両２の周辺情報を検出する。周辺情報には、道路の白線、他車両、歩行者、自転車、建物、標識、信号機、障害物等の情報が含まれる。周辺情報検出装置２１は、例えば、車外カメラやミリ波レーダ、ライダ（LiDAR；Light Detection and Ranging）、超音波センサなどを含む。周辺情報検出装置２１の出力は、電子制御ユニット２０に送信される。 The peripheral information detection device 21 detects peripheral information of the autonomous driving vehicle 2. Peripheral information includes information such as white lines on roads, other vehicles, pedestrians, bicycles, buildings, signs, traffic lights, obstacles, and the like. The peripheral information detection device 21 includes, for example, an out-of-vehicle camera, a millimeter-wave radar, a lidar (LiDAR; Light Detection and Ranging), an ultrasonic sensor, and the like. The output of the peripheral information detection device 21 is transmitted to the electronic control unit 20.

ＧＰＳ受信機２２は、３個以上のＧＰＳ衛星から信号を受信し、自動運転車両２の現在位置（例えば自動運転車両２の緯度及び経度）を検出する。ＧＰＳ受信機２２の出力は、電子制御ユニット２０に送信される。 The GPS receiver 22 receives signals from three or more GPS satellites and detects the current position of the autonomous driving vehicle 2 (for example, the latitude and longitude of the autonomous driving vehicle 2). The output of the GPS receiver 22 is transmitted to the electronic control unit 20.

地図データベース２３は、地図情報を記憶している。電子制御ユニット２０は、地図データベース２３から地図情報を取得する。 The map database 23 stores map information. The electronic control unit 20 acquires map information from the map database 23.

アクチュエータ２４は、自動運転を行うために電子制御ユニット２０によって駆動される各種の制御部品であって、例えば、自動運転車両２の加速に必要な駆動装置（例えば、内燃機関及びモータの少なくとも一方）や、自動運転車両２の制動に必要な制動装置（例えば、ブレーキアクチュエータ）、自動運転車両２の操舵に必要な操舵装置（例えば、ステアリングモータ）などを含む。 The actuator 24 is various control components driven by the electronic control unit 20 for autonomous driving, and is, for example, a drive device (for example, at least one of an internal combustion engine and a motor) necessary for accelerating the autonomous driving vehicle 2. It also includes a braking device (for example, a brake actuator) necessary for braking the autonomous driving vehicle 2, a steering device (for example, a steering motor) necessary for steering the autonomous driving vehicle 2.

状態量検出装置２５は、自動運転車両２、内燃機関、モータ、バッテリ、周辺環境等の状態量を検出する。例えば、状態量検出装置２５は、車速センサやヨーレートセンサ、アクセルペダルストロークセンサ、エアフローメータ、空燃比センサ、クランク角センサ、トルクセンサ、電圧センサ、外気温センサ、ＮＯｘセンサなどを含む。状態量検出装置２５の出力は、電子制御ユニット２０に送信される。 The state quantity detecting device 25 detects the state quantity of the self-driving vehicle 2, the internal combustion engine, the motor, the battery, the surrounding environment, and the like. For example, the state amount detecting device 25 includes a vehicle speed sensor, a yaw rate sensor, an accelerator pedal stroke sensor, an air flow meter, an air fuel ratio sensor, a crank angle sensor, a torque sensor, a voltage sensor, an outside temperature sensor, a NOx sensor and the like. The output of the state quantity detecting device 25 is transmitted to the electronic control unit 20.

通信モジュール２６は、無線通信機能を有する車載の端末である。通信モジュール２６は、ネットワーク５（図１参照）と不図示のゲートウェイ等を介して接続される無線基地局６（図１参照）にアクセスすることで、無線基地局６を介してネットワーク５と接続される。これにより、サーバ１との間で相互に通信が行われる。 The communication module 26 is an in-vehicle terminal having a wireless communication function. The communication module 26 connects to the network 5 via the radio base station 6 by accessing the radio base station 6 (see FIG. 1) connected to the network 5 (see FIG. 1) via a gateway (not shown) or the like. Will be done. As a result, mutual communication is performed with the server 1.

電子制御ユニット２０は、信号線を介して互いに接続された、車内通信インターフェース２０１、車両記憶部２０２及び車両処理部２０３を備える。 The electronic control unit 20 includes an in-vehicle communication interface 201, a vehicle storage unit 202, and a vehicle processing unit 203 connected to each other via a signal line.

車内通信インターフェース２０１は、車内ネットワーク２７に電子制御ユニット２０を接続するための通信インターフェース回路である。 The in-vehicle communication interface 201 is a communication interface circuit for connecting the electronic control unit 20 to the in-vehicle network 27.

車両記憶部２０２は、ＨＤＤ（Hard Disk Drive）や光記録媒体、半導体メモリ等の記憶媒体を有し、車両処理部２０３での処理に用いられる各種のコンピュータプログラムやデータ等を記憶する。 The vehicle storage unit 202 has a storage medium such as an HDD (Hard Disk Drive), an optical recording medium, and a semiconductor memory, and stores various computer programs, data, and the like used for processing in the vehicle processing unit 203.

車両処理部２０３は、一又は複数個のプロセッサ及びその周辺回路を有する。車両処理部２０３は、車両記憶部２０２に格納された各種のコンピュータプログラムを実行し、自動運転車両２に搭載された各種の制御部品を統括的に制御するものであり、例えばＣＰＵである。例えば車両処理部２０３は、サーバ１から配車指示を受信すると、配車依頼を行った利用客４の希望する乗車地点に向けて自車両を移動させ、自車両に利用客４を乗車させて利用客４の目的地まで送り届ける。 The vehicle processing unit 203 includes one or more processors and peripheral circuits thereof. The vehicle processing unit 203 executes various computer programs stored in the vehicle storage unit 202 and comprehensively controls various control components mounted on the autonomous driving vehicle 2, for example, a CPU. For example, when the vehicle processing unit 203 receives a vehicle allocation instruction from the server 1, the vehicle processing unit 203 moves the own vehicle toward the desired boarding point of the user 4 who has requested the vehicle allocation, and causes the user 4 to board the own vehicle. Deliver to 4 destinations.

各自動運転車両２では、各自動運転車両２を制御するにあたり、例えば機械学習などの学習を実施した学習モデル（人工知能モデル）が必要に応じて使用されている。本実施形態では、学習モデルとして、ディープニューラルネットワーク（ＤＮＮ；Deep Neural Network）や畳み込みニューラルネットワーク（ＣＮＮ；Convolutional Neural Network）などを用いたニューラルネットワークモデル（以下「ＮＮモデル」という。）を使用し、当該ＮＮモデルに対して深層学習を実施している。したがって、本実施形態による学習モデルは、深層学習を実施した学習済みのＮＮモデルということもできる。深層学習は、人工知能（ＡＩ；Artificial Intelligence）を代表する機械学習手法の一つである。 In each autonomous driving vehicle 2, in controlling each autonomous driving vehicle 2, a learning model (artificial intelligence model) in which learning such as machine learning is performed is used as needed. In this embodiment, as a learning model, a neural network model (hereinafter referred to as “NN model”) using a deep neural network (DNN; Deep Neural Network), a convolutional neural network (CNN; Convolutional Neural Network), or the like is used. Deep learning is carried out for the NN model. Therefore, the learning model according to this embodiment can be said to be a trained NN model in which deep learning is performed. Deep learning is one of the representative machine learning methods of artificial intelligence (AI).

図３は、ＮＮモデルの一例を示す図である。 FIG. 3 is a diagram showing an example of the NN model.

図３における丸印は人工ニューロンを表す。人工ニューロンは、通常、ノード又はユニットと称される（本明細書では、「ノード」と称す）。図３において、Ｌ＝１は入力層を示し、Ｌ＝２及びＬ＝３は隠れ層を示し、Ｌ＝４は出力層を示している。隠れ層は、中間層とも称される。なお、図３には、隠れ層が２層のＮＮモデルを例示しているが、隠れ層の層数は特に限られるものではなく、また、入力層、隠れ層及び出力層の各層のノードの数も特に限られるものではない。 The circles in FIG. 3 represent artificial neurons. Artificial neurons are commonly referred to as nodes or units (referred to herein as "nodes"). In FIG. 3, L = 1 indicates an input layer, L = 2 and L = 3 indicate a hidden layer, and L = 4 indicates an output layer. The hidden layer is also called the middle layer. Although FIG. 3 illustrates an NN model having two hidden layers, the number of hidden layers is not particularly limited, and the nodes of the input layer, the hidden layer, and the output layer are not particularly limited. The number is not particularly limited.

図３において、ｘ_１及びｘ_２は入力層（Ｌ＝１）の各ノード及びそのノードからの出力値を示しており、ｙは出力層（Ｌ＝４）のノード及びその出力値を示している。同様に、ｚ_１ ^{（Ｌ＝２）} _、ｚ_２ ^{（Ｌ＝２）}及びｚ_３ ^{（Ｌ＝２）}は隠れ層（Ｌ＝２）の各ノード及びそのノードからの出力値を示しており、ｚ_１ ^{（Ｌ＝３）}及びｚ_２ ^{（Ｌ＝３）}は隠れ層（Ｌ＝３）の各ノード及びそのノードからの出力値を示している。 In FIG. 3, x ₁ and x ₂ indicate the node of the input layer (L = 1) and the output value from the node, and y indicates the node of the output layer (L = 4) and its output value. There is. Similarly, z ₁ ^{(L = 2)} _, z ₂ ^{(L = 2)} and z ₃ ^{(L = 2)} indicate each node of the hidden layer (L = 2) and the output value from that node, and z ₁ ^{(L = 3)} and z ₂ ^{(L = 3)} indicate each node of the hidden layer (L = 3) and the output value from that node.

入力層の各ノードでは入力がそのまま出力される。一方、隠れ層（Ｌ＝２）の各ノードには、入力層の各ノードの出力値ｘ_１及びｘ_２が入力され、隠れ層（Ｌ＝２）の各ノードでは、それぞれ対応する重みｗ及びバイアスｂを用いて総入力値ｕが算出される。例えば、図３において隠れ層（Ｌ＝２）のｚ_ｋ ^{（Ｌ＝２）}（ｋ＝１、２、３）で示される各ノードにおいて算出される総入力値ｕ_ｋ ^{（Ｌ＝２）}は、次式のようになる（Ｍは入力層のノードの数）。

The input is output as it is at each node of the input layer. On the other hand, the output values x1 and x2 of _each node of the input layer are input to each node of the hidden layer (L = ₂ ), and the corresponding weights w and each node of the hidden layer (L = 2) are input. The total input value u is calculated using the bias b. For example, in FIG. 3, the total input value _uk ^{(L = 2)} calculated at each node represented by z _k ^{(L = 2)} (k = 1, 2, 3) of the hidden layer (L = 2) is It becomes as follows (M is the number of nodes in the input layer).

次いで、この総入力値ｕ_ｋ ^{（Ｌ＝２）}は活性化関数ｆにより変換され、隠れ層（Ｌ＝２）のｚ_ｋ ^{（Ｌ＝２）}で示されるノードから、出力値ｚ_ｋ ^{（Ｌ＝２）}（＝ｆ（ｕ_ｋ ^{（Ｌ＝２）}））として出力される。一方、隠れ層（Ｌ＝３）の各ノードには、隠れ層（Ｌ＝２）の各ノードの出力値ｚ_１ ^{（Ｌ＝２）} _、ｚ_２ ^{（Ｌ＝２）}及びｚ_３ ^{（Ｌ＝２）}が入力され、隠れ層（Ｌ＝３）の各ノードでは、それぞれ対応する重みｗ及びバイアスｂを用いて総入力値ｕ（＝Σｚ・ｗ＋ｂ）が算出される。この総入力値ｕは同様に活性化関数により変換され、隠れ層（Ｌ＝３）の各ノードから、出力値ｚ_１ ^{（Ｌ＝３）}、ｚ_２ ^{（Ｌ＝３）}として出力される。活性化関数は例えばシグモイド関数σである。 Next, this total input value uk ^{(L = 2)} is converted by the activation function f, and the output value z _k ⁽ L = 2) is output from the node _represented by z _k ^{(L = 2)} of the hidden layer (L = 2). ²⁾ It is output as (= f ( _uk ^{(L = 2)} )). On the other hand, for each node of the hidden layer (L = 3), the output values of each node of the hidden layer (L = 2) z ₁ ^{(L = 2)} _, z ₂ ^{(L = 2)} and z ₃ ^{(L = 2} ). ⁾ Is input, and at each node of the hidden layer (L = 3), the total input value u (= Σz · w + b) is calculated using the corresponding weight w and bias b, respectively. This total input value u is similarly converted by the activation function, and is output as output values z ₁ ^{(L = 3)} and z ₂ ^{(L = 3)} from each node of the hidden layer (L = 3). The activation function is, for example, a sigmoid function σ.

また、出力層（Ｌ＝４）のノードには、隠れ層（Ｌ＝３）の各ノードの出力値ｚ_１ ^{（Ｌ＝３）}及びｚ_２ ^{（Ｌ＝３）}が入力され、出力層のノードでは、それぞれ対応する重みｗ及びバイアスｂを用いて総入力値ｕ（Σｚ・ｗ＋ｂ）が算出されるか、又は、それぞれ対応する重みｗのみを用いて総入力値ｕ（Σｚ・ｗ）が算出される。例えば、出力層のノードでは活性化関数として恒等関数が用いられる。この場合、出力層のノードにおいて算出された総入力値ｕが、そのまま出力値ｙとして出力層のノードから出力される。 Further, the output values z ₁ ^{(L = 3)} and z ₂ ^{(L = 3) of each node of the hidden layer (L = 3)} are input to the node of the output layer (L = 4), and the node of the output layer is input. Then, the total input value u (Σz · w + b) is calculated using the corresponding weights w and the bias b, respectively, or the total input value u (Σz · w) is calculated using only the corresponding weights w. Will be done. For example, the node of the output layer uses an identity function as the activation function. In this case, the total input value u calculated in the node of the output layer is directly output from the node of the output layer as the output value y.

このようにＮＮモデルは、入力層と、隠れ層と、出力層と、を備え、一又は複数の入力パラメータが入力層から入力されると、入力パラメータに対応する一又は複数の出力パラメータを出力層から出力する。 As described above, the NN model includes an input layer, a hidden layer, and an output layer, and when one or more input parameters are input from the input layer, one or a plurality of output parameters corresponding to the input parameters are output. Output from the layer.

入力パラメータの例としては、例えばＮＮモデルを用いて自動運転車両２に搭載された内燃機関を制御する場合であれば、機関回転速度や機関冷却水温度、燃料噴射量、燃料噴射時期、燃圧、吸入空気量、吸気温度、ＥＧＲ率、過給圧といった、内燃機関の運転状態を示す各種パラメータの現在値が挙げられる。そして、このような入力パラメータに対応する出力パラメータの例としては、排気中のＮＯｘ濃度やその他の物質の濃度、機関出力トルクといった内燃機関の性能を表す各種パラメータの推定値が挙げられる。これにより、ＮＮモデルに内燃機関の運転状態を示す各種パラメータの現在値を入力パラメータとして入力することで、内燃機関の性能を表す各種パラメータの推定値（現在の推定値又は将来の推定値）を出力パラメータとして取得することができるので、例えば出力パラメータに基づいて、内燃機関の性能が所望の性能に近づくように内燃機関を制御することができる。これにより、例えば機関出力トルクを適切に制御して車両の前後方向や横方向の加速度を制御し、車両挙動を安定させたり乗り心地を向上させたりすることができる。また、出力パラメータを実測するためのセンサ等を備える場合には、実測値と推定値との差に応じて、センサ等の故障を判断したりすることもできる。 As an example of the input parameters, for example, when controlling the internal combustion engine mounted on the automatic driving vehicle 2 using the NN model, the engine rotation speed, the engine cooling water temperature, the fuel injection amount, the fuel injection timing, the fuel pressure, etc. The current values of various parameters indicating the operating state of the internal combustion engine, such as the amount of intake air, the intake temperature, the EGR rate, and the boost pressure, can be mentioned. Examples of output parameters corresponding to such input parameters include estimated values of various parameters representing the performance of the internal combustion engine such as NOx concentration in exhaust gas, concentration of other substances, and engine output torque. As a result, by inputting the current values of various parameters indicating the operating state of the internal combustion engine into the NN model as input parameters, the estimated values (current estimated value or future estimated value) of various parameters indicating the performance of the internal combustion engine can be input. Since it can be acquired as an output parameter, the internal combustion engine can be controlled so that the performance of the internal combustion engine approaches a desired performance, for example, based on the output parameter. Thereby, for example, the engine output torque can be appropriately controlled to control the acceleration in the front-rear direction and the lateral direction of the vehicle, and the vehicle behavior can be stabilized and the riding comfort can be improved. Further, when a sensor or the like for actually measuring the output parameter is provided, it is possible to determine the failure of the sensor or the like according to the difference between the measured value and the estimated value.

ＮＮモデルの精度を向上させるためには、ＮＮモデルを学習させる必要がある。ＮＮモデルの学習には、入力パラメータの実測値と、この入力パラメータの実測値に対応した出力パラメータの実測値（正解データ）と、を含む多数の訓練データセットが用いられる。多数の訓練データセットを用いて、公知の誤差逆伝搬法によってニューラルネットワーク内の重みｗ及びバイアスｂの値を繰り返し更新することで、重みｗ及びバイアスｂの値が学習され、学習モデルが生成される。 In order to improve the accuracy of the NN model, it is necessary to train the NN model. For learning the NN model, a large number of training data sets including the measured values of the input parameters and the measured values (correct answer data) of the output parameters corresponding to the measured values of the input parameters are used. By repeatedly updating the values of the weight w and the bias b in the neural network by a known error back propagation method using a large number of training data sets, the values of the weight w and the bias b are learned and a learning model is generated. To.

ここで、配車サービスに使用される各自動運転車両２の配車状況は、車両毎に様々である。そのため、自動運転車両２の中には、配車サービスのサービス提供地域内に満遍なく配車される車両もあれば、サービス提供地域内の一部の偏った地域にばかり配車される車両もある。なお配車サービスのサービス提供地域は、特に限られるものではなく、都道府県内の全域や都道府県を跨いだ広範囲の地域でもよいし、都道府県内の一部の地域やスマートシティ（コネクティッドシティ）内などの比較的限られた地域でもよい。 Here, the vehicle allocation status of each autonomous driving vehicle 2 used for the vehicle allocation service varies from vehicle to vehicle. Therefore, among the autonomous driving vehicles 2, some vehicles are evenly distributed within the service providing area of the vehicle allocation service, and some vehicles are distributed only to a part of the biased area within the service providing area. The service area of the vehicle dispatch service is not particularly limited, and may be the entire area within a prefecture or a wide area across prefectures, a part of the prefecture, or a smart city (connected city). It may be a relatively limited area such as inside.

サービス提供地域内に満遍なく配車された自動運転車両２の場合、サービス提供地域の全域を満遍なく走行して得られた訓練データセットに基づいて、ＮＮモデルの学習を行うことができるので、サービス提供地域に最適化された学習モデルを作成することができる。 In the case of the self-driving vehicle 2 evenly distributed in the service providing area, the NN model can be learned based on the training data set obtained by traveling the entire service providing area evenly, so that the service providing area can be learned. It is possible to create a learning model optimized for.

これに対して、サービス提供地域内の一部の偏った地域にばかり配車された自動運転車両２の場合、サービス提供地域内の未走行地域や走行回数の少ない地域（以下「未走行地域等」という。）において十分な量の訓練データセットを取得することができないので、サービス提供地域内において、ＮＮモデルの学習が十分でない地域（以下「学習未習熟地域」という。）が生じることになる。そのため、サービス提供地域に最適化された学習済みモデルを作成することができず、サービス提供地域内の一部の偏った地域にばかり配車された自動運転車両２のＮＮモデルは、未走行地域等における学習が不十分なＮＮモデルとなる。 On the other hand, in the case of the self-driving vehicle 2 that is distributed only to a part of the service providing area in a biased area, the non-driving area or the area where the number of times of driving is small in the service providing area (hereinafter, "non-driving area, etc."" Since it is not possible to acquire a sufficient amount of training data set in (), there will be areas in the service providing area where the learning of the NN model is not sufficient (hereinafter referred to as “learning unlearned area”). Therefore, it is not possible to create a trained model optimized for the service providing area, and the NN model of the autonomous driving vehicle 2 distributed only to a part of the biased area in the service providing area is a non-driving area, etc. It becomes an NN model with insufficient learning in.

したがって、サービス提供地域内の一部の偏った地域にばかり配車された自動運転車両２が、配車サービスの提供中に未走行地域等を走行することになった場合には、当該未走行地域等での学習が不十分なＮＮモデルを用いて自動運転車両２の制御が行われることなる。その結果、未走行地域等の道路交通環境や地形などに十分に適合していないＮＮモデルを用いて自動運転車両２の制御が行われることになり、例えば乗り心地が悪化するなどして、配車サービスの利用客に対する配車サービスの質が低下するおそれがある。 Therefore, if the autonomous driving vehicle 2 that is dispatched only to a part of the biased area in the service providing area travels in the non-driving area or the like while the vehicle dispatching service is being provided, the non-driving area or the like. The self-driving vehicle 2 will be controlled using the NN model for which learning is insufficient. As a result, the autonomous driving vehicle 2 will be controlled using an NN model that is not sufficiently suitable for the road traffic environment such as a non-driving area and the terrain. There is a risk that the quality of vehicle dispatch services for service users will deteriorate.

そこで本実施形態では、サービス提供地域内における各自動運転車両２のＮＮモデルの学習進捗度を算出することとした。なお本実施形態において、ＮＮモデルの学習進捗度とは、配車サービスのサービス提供地域内において、ＮＮモデルの学習がどの程度進んでいるかを表す指標であって、基本的に、サービス提供地域内において学習が不十分な地域が多く存在する車両ほど低い値を示すようになっている。したがって、ＮＮモデルの学習進捗度が低い自動運転車両２は、サービス提供地域内の特定の地域での学習が不十分、すなわち当該特定の地域での走行が不十分で、ＮＮモデルの学習のために必要な訓練データセットを十分に取得できていないことになる。 Therefore, in the present embodiment, it is decided to calculate the learning progress of the NN model of each autonomous driving vehicle 2 in the service providing area. In the present embodiment, the learning progress of the NN model is an index showing how much the learning of the NN model has progressed in the service providing area of the vehicle dispatch service, and is basically in the service providing area. Vehicles with many areas where learning is inadequate show lower values. Therefore, the self-driving vehicle 2 having a low learning progress of the NN model has insufficient learning in a specific area within the service providing area, that is, insufficient traveling in the specific area, and is for learning the NN model. It means that the training data set required for the training has not been sufficiently acquired.

そして、各自動運転車両２のＮＮモデルの学習進捗度に基づいて、配車サービスのために待機している自動運転車両２の中から学習進捗度が相対的に低く、かつ次回の配車サービスの提供までに時間に余裕のある自動運転車両２を「学習車両」として選択し、学習車両に対して自車両の学習進捗度を高めるために、配車サービスの待機中に学習未習熟地域を走行させることとした。 Then, based on the learning progress of the NN model of each autonomous driving vehicle 2, the learning progress is relatively low among the autonomous driving vehicles 2 waiting for the vehicle allocation service, and the next vehicle allocation service is provided. Select the self-driving vehicle 2 that has enough time as the "learning vehicle", and drive the learning inexperienced area while waiting for the vehicle dispatch service in order to improve the learning progress of the own vehicle for the learning vehicle. And said.

これにより、学習車両は、配車サービスの待機中に学習未習熟地域を走行して、当該地域における訓練データセットを取得すると共に、取得した訓練データセットに基づいてＮＮモデルの学習を行うことができる。したがって、配車サービスの待機中に、各自動運転車両２の学習進捗度を向上させることができる。 As a result, the learning vehicle can travel in an unlearned area while waiting for the vehicle allocation service, acquire a training data set in the area, and learn an NN model based on the acquired training data set. .. Therefore, it is possible to improve the learning progress of each autonomous driving vehicle 2 while the vehicle dispatch service is on standby.

以下、配車システム１００で実行される各種の処理について説明する。 Hereinafter, various processes executed by the vehicle allocation system 100 will be described.

図４は、サーバ１、自動運転車両２及び携帯端末３との間で実行される配車処理について説明するフローチャートである。 FIG. 4 is a flowchart illustrating a vehicle allocation process executed between the server 1, the autonomous driving vehicle 2, and the mobile terminal 3.

ステップＳ１において、配車サービスの利用客４は、自身が所持している携帯端末３を操作して配車依頼をサーバ１に送信する。利用客４は、例えば配車サービスを利用するためのアプリを携帯端末３上で起動することにより、配車依頼をサーバ１に送信することができる。配車依頼には、例えば、利用客４の識別情報（個人情報）や乗車希望時刻、乗車希望位置、降車希望位置、走行予定エリアなどの配車のために必要な情報が含まれる。 In step S1, the user 4 of the vehicle allocation service operates the mobile terminal 3 owned by the user 4 to transmit the vehicle allocation request to the server 1. The user 4 can send a vehicle allocation request to the server 1 by, for example, activating an application for using the vehicle allocation service on the mobile terminal 3. The vehicle allocation request includes, for example, identification information (personal information) of the passenger 4, information necessary for vehicle allocation such as desired boarding time, desired boarding position, desired disembarkation position, and planned travel area.

ステップＳ２において、配車依頼を受信したサーバ１は、各自動運転車両２の配車状況や配車予約情報などを参照し、各自動運転車両２の中から配車車両を選択すると共に、配車車両に関する情報（例えば車種や車両識別番号など）を配車依頼の送信元の携帯端末３に送信する。 In step S2, the server 1 that has received the vehicle allocation request refers to the vehicle allocation status of each autonomous driving vehicle 2, the vehicle allocation reservation information, etc., selects a vehicle allocation vehicle from each autonomous driving vehicle 2, and also provides information on the vehicle allocation vehicle ( For example, the vehicle type, vehicle identification number, etc.) are transmitted to the mobile terminal 3 from which the vehicle allocation request is transmitted.

ステップＳ３において、配車依頼の送信元となる携帯端末３は、配車車両に関する情報を受信したか否かを判定し、配車車両に関する情報を受信していればステップＳ４の処理に進み、配車車両に関する情報を受信していなければ所定時間を空けた後、再度、配車車両に関する情報を受信したか否かを判定する。 In step S3, the mobile terminal 3 that is the transmission source of the vehicle allocation request determines whether or not the information regarding the vehicle allocation vehicle has been received, and if the information regarding the vehicle allocation vehicle is received, proceeds to the process of step S4 and proceeds to the process of the vehicle allocation vehicle. If the information has not been received, after a predetermined time has passed, it is determined whether or not the information regarding the dispatched vehicle has been received again.

ステップＳ４において、配車車両に関する情報を受信した携帯端末３は、当該情報を例えばアプリを介して携帯端末３の画面に表示するなどして配車サービスの利用客４に提供する。 In step S4, the mobile terminal 3 that has received the information about the vehicle dispatching vehicle provides the information to the user 4 of the vehicle dispatching service, for example, by displaying the information on the screen of the mobile terminal 3 via an application.

ステップＳ５において、サーバ１は、配車車両に対して配車指示を送信する。配車指示には、配車依頼に含まれていた各種の情報、すなわち、利用客４の識別情報（個人情報）や乗車希望時刻、乗車希望位置、降車希望位置、走行予定エリアなどが含まれる。 In step S5, the server 1 transmits a vehicle allocation instruction to the vehicle dispatch vehicle. The vehicle allocation instruction includes various information included in the vehicle allocation request, that is, identification information (personal information) of the passenger 4, desired boarding time, desired boarding position, desired disembarkation position, planned travel area, and the like.

ステップＳ６において、配車指示を受信した自動運転車両２の電子制御ユニット２０は、配車指示に従って自車両の自動運転を行って、配車依頼を行った利用客４の希望乗車時刻までに、自車両を希望乗車位置に到着させる。 In step S6, the electronic control unit 20 of the autonomous driving vehicle 2 that has received the vehicle allocation instruction automatically drives the own vehicle according to the vehicle allocation instruction, and by the desired boarding time of the user 4 who has requested the vehicle allocation, the own vehicle is delivered. Arrive at the desired boarding position.

図５は、サーバ１及び自動運転車両２の間で実行される学習進捗度更新処理について説明するフローチャートである。 FIG. 5 is a flowchart illustrating a learning progress update process executed between the server 1 and the autonomous driving vehicle 2.

ステップＳ１１において、自動運転車両２の電子制御ユニット２０は、自車両のＮＮモデルの学習進捗度を前回サーバ１に送信してから所定期間が経過したか否かを判定する。電子制御ユニット２０は、自車両のＮＮモデルの学習進捗度を前回サーバ１に送信してから所定期間が経過していれば、ステップＳ１２の処理に進む。一方で電子制御ユニット２０は、自車両のＮＮモデルの学習進捗度を前回サーバ１に送信してから所定期間が経過していなければ、今回の処理を終了する。 In step S11, the electronic control unit 20 of the self-driving vehicle 2 determines whether or not a predetermined period has elapsed since the learning progress of the NN model of the own vehicle was transmitted to the server 1 last time. The electronic control unit 20 proceeds to the process of step S12 if a predetermined period has elapsed since the learning progress of the NN model of the own vehicle was transmitted to the server 1 last time. On the other hand, the electronic control unit 20 ends this process if a predetermined period has not elapsed since the learning progress of the NN model of the own vehicle was transmitted to the server 1 last time.

ステップＳ１２において、自動運転車両２の電子制御ユニット２０は、自車両のＮＮモデルの学習進捗度を算出する。 In step S12, the electronic control unit 20 of the autonomous driving vehicle 2 calculates the learning progress of the NN model of the own vehicle.

本実施形態では、サービス提供地域内を複数の小地域に区分けしており、各小地域で取得された訓練データセットの数とその取得時期とに基づいて、小地域毎にＮＮモデルの学習進捗度を算出している。具体的には本実施形態では、下記の（１）式に基づいて、各小地域の学習進捗度を算出している。
学習進捗度＝Ａ×（訓練データセットの数）＋Ｆ（訓練データセット取得時期）…（１） In this embodiment, the service providing area is divided into a plurality of subregions, and the learning progress of the NN model for each subregion based on the number of training data sets acquired in each subregion and the acquisition time thereof. The degree is calculated. Specifically, in this embodiment, the learning progress of each subregion is calculated based on the following equation (1).
Learning progress = A x (number of training data sets) + F (training data set acquisition time) ... (1)

なお、（１）式において、Ａは所定値であり、基本的に訓練データセットの数が多くなるほど、学習進捗度が大きくなるような値に設定される。またＦは所定の変換関数であり、学習進捗度を算出する時期に対して、訓練データセット取得時期が古くなるほど、学習進捗度が小さくなるような値が算出される関数とされる。これは、或る小地域において取得された訓練データセットの取得時期が古くなるほど、交通環境等が変化するため、当該小地域を新たに走行して学習させる必要性が生じるためである。 In the equation (1), A is a predetermined value, and is basically set to a value such that the learning progress increases as the number of training data sets increases. Further, F is a predetermined conversion function, and is a function that calculates a value such that the learning progress becomes smaller as the training data set acquisition time becomes older than the time when the learning progress is calculated. This is because the older the acquisition time of the training data set acquired in a certain subregion, the more the traffic environment and the like change, so that it becomes necessary to newly drive and learn in the subregion.

なお本実施形態では、電子制御ユニット２０は、車両走行中に、訓練データセット（ＮＮモデルの入力パラメータの実測値と出力パラメータの実測値）を随時取得している。 In this embodiment, the electronic control unit 20 acquires training data sets (measured values of input parameters and output parameters of the NN model) at any time while the vehicle is running.

ステップＳ１３において、自動運転車両２の電子制御ユニット２０は、算出した自車両のＮＮモデルの学習進捗度を、自車両の識別情報と共にサーバ１に送信する。 In step S13, the electronic control unit 20 of the autonomous driving vehicle 2 transmits the calculated learning progress of the NN model of the own vehicle to the server 1 together with the identification information of the own vehicle.

ステップＳ１４において、ＮＮモデルの学習進捗度を受信したサーバ１は、各自動運転車両２の小地域毎のＮＮモデルの学習進捗度が記憶された学習進捗度データベースを更新する。具体的にはサーバ１は、車両識別情報に基づいてＮＮモデルの学習進捗度を送信してきた自動運転車両２の特定し、その自動運転車両２の各小地域のＮＮモデルの学習進捗度を更新する。 In step S14, the server 1 that has received the learning progress of the NN model updates the learning progress database in which the learning progress of the NN model for each subregion of each autonomous driving vehicle 2 is stored. Specifically, the server 1 identifies the self-driving vehicle 2 that has transmitted the learning progress of the NN model based on the vehicle identification information, and updates the learning progress of the NN model in each subregion of the self-driving vehicle 2. do.

図６は、サーバ１及び自動運転車両２の間で実行される学習走行処理について説明するフローチャートである。 FIG. 6 is a flowchart illustrating a learning traveling process executed between the server 1 and the autonomous driving vehicle 2.

ステップＳ２１において、サーバ１は、学習進捗度データベースを定期的に参照し、各自動運転車両２の中から学習車両候補を抽出する。例えばサーバ１は、各自動運転車両２の中から、学習進捗度が所定値以下となっている小地域がある自動運転車両２を、学習車両候補として抽出する。 In step S21, the server 1 periodically refers to the learning progress database and extracts learning vehicle candidates from each autonomous driving vehicle 2. For example, the server 1 extracts from each autonomous driving vehicle 2 an autonomous driving vehicle 2 having a small area where the learning progress is equal to or less than a predetermined value as a learning vehicle candidate.

ステップＳ２２において、サーバ１は、学習車両候補の中から、配車待ち時間中に、車両基地（車庫）から学習進捗度が所定値以下となっている小地域まで行って、当該小地域内を訓練データセット取得のために走行した後、再び車両基地まで戻ってこられる自動運転車両２を、学習車両として選択する。すなわちサーバ１は、配車サービスのために待機している自動運転車両２の中から学習進捗度が相対的に低い小地域を有し、かつ次回の配車サービスの提供までに時間に余裕のある自動運転車両２を、学習車両として選択する。 In step S22, the server 1 goes from the learning vehicle candidates to a small area where the learning progress is equal to or less than a predetermined value from the vehicle base (garage) during the waiting time for vehicle allocation, and trains in the small area. The self-driving vehicle 2 that can return to the depot after traveling for data set acquisition is selected as the learning vehicle. That is, the server 1 has a small area where the learning progress is relatively low among the autonomous driving vehicles 2 waiting for the vehicle allocation service, and the automatic operation has enough time until the next vehicle allocation service is provided. The driving vehicle 2 is selected as the learning vehicle.

ステップＳ２３において、サーバ１は、学習車両に対して、学習指示を送信する。学習指示には、訓練データセットを取得する小地域に関する情報、すなわち、ＮＮモデルの学習進捗度を向上させるために学習が必要な小地域（以下「学習対象小地域」という。）に関する情報と、車両基地への戻り時刻と、が含まれている。 In step S23, the server 1 transmits a learning instruction to the learning vehicle. The learning instructions include information about the subregions from which the training data set is acquired, that is, information about the subregions that need learning to improve the learning progress of the NN model (hereinafter referred to as "learning target subregions"). The return time to the vehicle base and is included.

ステップＳ２４において、自動運転車両２の電子制御ユニット２０は、学習指示を受信したか否かを判定する。電子制御ユニット２０は、学習指示を受信していれば、ステップＳ２５の処理に進む。一方で電子制御ユニット２０は、学習指示を受信していなければ、今回の処理を終了する。 In step S24, the electronic control unit 20 of the autonomous driving vehicle 2 determines whether or not the learning instruction has been received. If the electronic control unit 20 has received the learning instruction, the electronic control unit 20 proceeds to the process of step S25. On the other hand, if the electronic control unit 20 has not received the learning instruction, the electronic control unit 20 ends the current process.

ステップＳ２５において、自動運転車両２の電子制御ユニット２０は、学習指示に基づいて、学習対象小地域に向けて自車両を自動運転により出発させて学習対象小地域内を訓練データセット取得のために走行させた後、戻り時刻までに自車両を車両基地に帰還させる。そして電子制御ユニット２０は、学習対象小地域内の走行中に取得した訓練データセットに基づいて、ＮＮモデルの学習を実行する。なお、本実施形態では、このＮＮモデルの学習を学習対象小地域以内の走行中に随時実行しているが、車両基地に戻ってから行うようにしてもよい。 In step S25, the electronic control unit 20 of the autonomous driving vehicle 2 automatically departs its own vehicle toward the learning target small area based on the learning instruction to acquire the training data set in the learning target small area. After driving, the vehicle is returned to the vehicle base by the return time. Then, the electronic control unit 20 executes learning of the NN model based on the training data set acquired while traveling in the training target subregion. In the present embodiment, the learning of this NN model is executed at any time while traveling within the learning target subregion, but it may be performed after returning to the depot.

以上説明した本実施形態による配車システム１００は、サーバ１と複数の自動運転車両２とを備え、自動運転車両２を用いた配車サービスを提供する。サーバ１は、自動運転車両２から受信した、自動運転車両２において使用される学習モデルの学習進捗度に基づいて、配車サービスのために待機している自動運転車両２の中から学習モデルの学習が必要な学習車両を選択し、学習車両に対して学習指示を送信するように構成される。自動運転車両２は、学習指示を受信すると、学習指示に基づいて、学習モデルの学習を行うための走行を配車待機中に実施するように構成される。 The vehicle allocation system 100 according to the present embodiment described above includes a server 1 and a plurality of autonomous driving vehicles 2, and provides a vehicle allocation service using the autonomous driving vehicle 2. The server 1 learns a learning model from among the autonomous driving vehicles 2 waiting for the vehicle allocation service based on the learning progress of the learning model used in the autonomous driving vehicle 2 received from the autonomous driving vehicle 2. Is configured to select the required learning vehicle and send learning instructions to the learning vehicle. Upon receiving the learning instruction, the autonomous driving vehicle 2 is configured to perform traveling for learning the learning model while waiting for vehicle allocation based on the learning instruction.

これにより、配車サービスのために待機している自動運転車両２の中から学習モデルの学習が必要な学習車両（例えば学習モデルの学習進捗度が低い車両）を選択し、当該学習車両に対して、学習モデルの学習を行うための走行を配車待機中に実施させることができる。そのため、各自動運転車両２において使用される学習モデルの学習進捗度に差が生じるのを抑制できる。 As a result, a learning vehicle that requires learning of the learning model (for example, a vehicle with a low learning progress of the learning model) is selected from the automatic driving vehicles 2 waiting for the vehicle allocation service, and the learning vehicle is selected. , It is possible to carry out the running for learning the learning model while waiting for the vehicle to be dispatched. Therefore, it is possible to suppress a difference in the learning progress of the learning model used in each autonomous driving vehicle 2.

各自動運転車両２において使用される学習モデルの学習進捗度に差があると、例えば、学習モデルの学習が十分に進んでいない車両が配車された配車サービスの利用客に対する配車サービスの質が、学習モデルの学習が十分に進んでいる車両が配車された利用客に対する配車サービスの質よりも低下するなどして、配車サービスの利用客の中で不平等が生じるおそれがある。 If there is a difference in the learning progress of the learning model used in each autonomous driving vehicle 2, for example, the quality of the vehicle allocation service for the passengers of the vehicle allocation service to which the vehicle for which the learning of the learning model has not been sufficiently advanced is sufficiently determined. There is a risk of inequality among the passengers of the vehicle dispatch service, for example, because the quality of the vehicle dispatch service for the passengers to whom the vehicle has been sufficiently trained in the learning model is lower than the quality of the vehicle dispatch service.

したがって、本実施形態のように各自動運転車両２において使用される学習モデルの学習進捗度に差が生じるのを抑制することで、配車サービスの利用客の中で不平等が生じるのを抑制することができる。 Therefore, by suppressing the difference in the learning progress of the learning model used in each autonomous driving vehicle 2 as in the present embodiment, it is possible to suppress the occurrence of inequality among the users of the vehicle allocation service. be able to.

以上、本発明の実施形態について説明したが、上記実施形態は本発明の適用例の一部を示したに過ぎず、本発明の技術的範囲を上記実施形態の具体的構成に限定する趣旨ではない。 Although the embodiments of the present invention have been described above, the above embodiments are only a part of the application examples of the present invention, and the technical scope of the present invention is limited to the specific configuration of the above embodiments. do not have.

例えば上記の実施形態では、配車依頼を受信したサーバ１は、各自動運転車両２の配車状況や配車予約情報を参照し、各自動運転車両２の中から配車車両を選択していたが、さらに各自動運転車両２のＮＮモデルの学習進捗度を参照し、学習進捗度の低い車両は、高い車両と比較して配車車両として選択する際の優先度を下げるようにしてもよい。これにより、学習進捗度の低い車両は、配車車両として選択されにくくなるので、配車サービスの待機中にＮＮモデルの学習のための走行を行うための時間的余裕が生じやすくなる。その結果として、配車サービスの待機中にＮＮモデルの学習のための走行を行う機会や頻度を増やして学習進捗度を向上させることができる。 For example, in the above embodiment, the server 1 that has received the vehicle allocation request refers to the vehicle allocation status of each autonomous driving vehicle 2 and the vehicle allocation reservation information, and selects a vehicle allocation vehicle from each autonomous driving vehicle 2. With reference to the learning progress of the NN model of each autonomous driving vehicle 2, a vehicle having a low learning progress may have a lower priority when selected as a vehicle to be dispatched as compared with a vehicle having a high learning progress. As a result, it becomes difficult for a vehicle having a low learning progress to be selected as a vehicle dispatch vehicle, so that there is a tendency to have time to run for learning the NN model while waiting for the vehicle dispatch service. As a result, it is possible to improve the learning progress by increasing the opportunity and frequency of running for learning the NN model while waiting for the vehicle dispatch service.

１サーバ
２自動運転車両
１００配車システム 1 Server 2 Self-driving vehicle 100 Vehicle dispatch system

Claims

It is a vehicle allocation system that includes a server and a plurality of autonomous driving vehicles and provides a vehicle allocation service using the autonomous driving vehicle.
Based on the learning progress of the learning model used in the autonomous driving vehicle received from the autonomous driving vehicle, the server selects the learning model from among the autonomous driving vehicles waiting for the vehicle allocation service. It is configured to select a learning vehicle that needs learning and send a learning instruction to the learning vehicle.
Upon receiving the learning instruction, the autonomous driving vehicle is configured to perform traveling for learning the learning model while waiting for vehicle allocation based on the learning instruction.
Vehicle dispatch system.