WO2021205616A1 - Moving body control device, moving body control method, and learning device - Google Patents

Moving body control device, moving body control method, and learning device

Info

Publication number
WO2021205616A1
Authority
WO
WIPO (PCT)
Prior art keywords
feature amount
calculation unit
moving body
sensor data
unit
Prior art date
Application number
PCT/JP2020/016019
Other languages
French (fr)
Japanese (ja)
Inventor
貴之 井對
Original Assignee
三菱電機株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 三菱電機株式会社 filed Critical 三菱電機株式会社
Priority to PCT/JP2020/016019 priority Critical patent/WO2021205616A1/en
Publication of WO2021205616A1 publication Critical patent/WO2021205616A1/en

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W60/00Drive control systems specially adapted for autonomous road vehicles

Definitions

  • the present disclosure relates to a mobile body control device for calculating a control amount of a mobile body, a mobile body control method, and a learning device.
  • conventionally, in the field of automatic driving of a moving body, a technique for learning the control amount of a vehicle for each driving situation is known (for example, Patent Document 1).
  • Hyperparameters are weights of evaluation functions and the like.
  • the present disclosure has been made to solve the above-mentioned problem, and its purpose is to provide a moving body control device that can set hyperparameters according to the driving situation, used in moving body control technology, without human intervention.
  • the moving body control device according to the present disclosure is a moving body control device that calculates a control amount of a moving body from sensor data indicating the surrounding environment of the moving body, and includes an actual sensor data acquisition unit that acquires the sensor data, a feature amount calculation unit that calculates a feature amount from the sensor data acquired by the actual sensor data acquisition unit, and a hyperparameter calculation unit that calculates hyperparameters used when calculating the control amount of the moving body by inputting the feature amount calculated by the feature amount calculation unit into a machine learning model.
  • hyperparameters according to the driving situation used in the mobile control technology can be set without human intervention.
  • FIG. 1 is a diagram showing a configuration example of the moving body control device according to Embodiment 1.
  • FIG. 2 is a flowchart for explaining the operation of the moving body control device according to Embodiment 1.
  • FIG. 3 is a diagram showing a configuration example of a moving body control system according to Embodiment 1 in which the learning device is provided outside the moving body control device and the moving body control device and the learning device constitute the moving body control system.
  • FIGS. 4A and 4B are diagrams showing an example of the hardware configuration of the moving body control device according to Embodiment 1.
  • FIG. 1 is a diagram showing a configuration example of the mobile control device 1 according to the first embodiment.
  • the moving body is assumed to be a vehicle. Further, it is assumed that the mobile control device 1 according to the first embodiment is mounted on a vehicle (not shown).
  • the moving body control device 1 calculates the control amount of the moving body according to the traveling situation in which the moving body is traveling.
  • the traveling condition means various shapes of a road on which a moving body travels, such as a straight road, a curve, an uphill, a downhill, or an intersection.
  • the traveling speed when the moving body travels on the road of various shapes is also included in the traveling situation.
  • the control amount of the moving body is a control amount for controlling the operation of the moving body.
  • the mobile body control device 1 calculates the control amount of the mobile body by using a known mobile body control technique.
  • the mobile body control device 1 calculates the control amount of the mobile body by using a known model predictive control technique. At that time, the moving body control device 1 calculates the hyperparameters used when calculating the control amount of the moving body according to the traveling situation in which the moving body is traveling. In model predictive control, a model that predicts future behavior based on vehicle dynamics is generated in advance, and the optimal input to be given is calculated based on that model under an evaluation function and constraint conditions.
  • the hyperparameter is the weight of the evaluation function in the model prediction control or the threshold value in the constraint condition.
  • the mobile body control device 1 calculates the optimum control amount of the moving body to be given to the moving body based on the model prediction control.
  • in PID control, the hyperparameters are the proportional gain, the integral gain, and the differential gain.
  • the moving body control device 1 calculates hyperparameters based on a trained model in machine learning (hereinafter referred to as "machine learning model").
  • the machine learning model is generated by the mobile control device 1.
  • the control amount calculated by the moving body control device 1 is used for automatic driving control in a moving body, in other words, a vehicle.
  • in the first embodiment, it is assumed that the vehicle has an automatic driving function. Even when the vehicle has an automatic driving function, the driver can drive the vehicle himself or herself without executing the automatic driving function.
  • the moving body control device 1 includes a learning device 2, an actual sensor data acquisition unit 14, a data conversion unit 15, a feature amount calculation unit 16, a hyperparameter calculation unit 17, and a control amount calculation unit 18.
  • the learning device 2 generates a machine learning model 13 by learning using the teacher data.
  • the learning device 2 includes a teacher data acquisition unit 11, a learning unit 12, and a machine learning model 13.
  • the teacher data acquisition unit 11 acquires teacher data.
  • the teacher data is data in which hyperparameters are added to the feature amount calculated from the sensor data indicating the surrounding environment of the moving body.
  • the feature amount calculated from the sensor data indicating the surrounding environment of the moving body is the feature amount according to the traveling condition of the moving body.
  • the teacher data acquisition unit 11 outputs the acquired teacher data to the learning unit 12.
  • the learning unit 12 generates a machine learning model 13 by learning using the teacher data acquired by the teacher data acquisition unit 11.
  • the machine learning model 13 is a machine learning model that inputs a feature amount calculated from sensor data indicating the surrounding environment of a moving body and outputs hyperparameters.
  • the machine learning model 13 is provided in the moving body control device 1, but this is only an example; the machine learning model 13 may be provided outside the moving body control device 1, in a location that the moving body control device 1 can refer to.
  • the actual sensor data acquisition unit 14 acquires sensor data (hereinafter referred to as “actual sensor data”) indicating the surrounding environment of the moving body when the moving body actually travels.
  • the actual sensor data is, for example, an image.
  • the actual sensor data acquisition unit 14 acquires, for example, an image captured by a camera (not shown) that captures the front of the moving body.
  • the camera is provided on the moving body.
  • the actual sensor data will be described below assuming that the actual sensor data is an image captured by the camera (hereinafter referred to as “camera image”).
  • the actual sensor data acquisition unit 14 outputs the camera image to the data conversion unit 15.
  • the actual sensor data may be numerical data such as LiDAR data.
  • the data conversion unit 15 performs data conversion on the data elements included in the actual sensor data acquired by the actual sensor data acquisition unit 14. For example, the data conversion unit 15 performs this data conversion using a known semantic segmentation technique. As a specific example, the data conversion unit 15 performs data conversion that color-codes the pixels of the camera image, for example coloring pixels indicating a car blue, pixels indicating a road pink, and pixels indicating a street tree green. When the actual sensor data is numerical data, the data conversion unit 15 performs data conversion that, for example, removes noise so that the numerical data approaches simulation data indicating the surrounding environment of the moving body.
  • the features included in the teacher data are the features calculated from the simulation data.
  • the feature amount included in the teacher data is a feature amount calculated from an image (hereinafter referred to as “simulation image”) reproduced by an automatic driving simulator (not shown).
  • the teacher data is, for example, data in which a feature amount calculated from a simulation image is associated with the hyperparameters for which an ideal control amount is obtained when the control amount of the moving body is calculated based on that feature amount in an automatic driving simulator.
  • the automatic driving simulator is a so-called automatic driving simulator using a general simulation technique. In the automatic driving simulator, a simulation image showing the surrounding environment of the moving body is reproduced.
  • the simulation image is, for example, a CG (Computer Graphics) image.
  • a feature amount that should be calculated as the same feature amount may not be calculated identically when it is calculated from the camera image and when it is calculated from the simulation image.
  • the feature amount calculated from the camera image is required when the control amount calculation unit 18 calculates the control amount of the moving body in the moving body control device 1.
  • the control amount calculation unit 18 uses the hyperparameters calculated by the hyperparameter calculation unit 17 based on the machine learning model 13.
  • the feature amount calculation unit 16 calculates the feature amount based on the camera image. Details of the control amount calculation unit 18, the feature amount calculation unit 16, and the hyperparameter calculation unit 17 will be described later.
  • if hyperparameters based on the machine learning model 13, which was generated from teacher data containing feature amounts calculated from simulation images, are used, an appropriate control amount may not be calculated.
  • the data conversion unit 15 performs data conversion on the camera image to absorb the difference that arises, for a feature amount that should be calculated identically, between calculation from the camera image and calculation from the simulation image.
  • this reduces the possibility that the control amount is not calculated appropriately because the hyperparameters used when calculating the control amount of the moving body are based on a feature amount different from the feature amount used when calculating that control amount.
  • the same data conversion as the data conversion performed by the data conversion unit 15 on the camera image needs to be performed on the simulation image before the feature amount is calculated.
  • the data conversion unit 15 outputs the data-converted camera image (hereinafter referred to as “converted camera image”) to the feature amount calculation unit 16.
  • the feature amount calculation unit 16 calculates the feature amount according to the traveling state of the moving body from the converted camera image after the data conversion unit 15 has converted.
  • the feature amount calculation unit 16 calculates the feature amount by using a known technique such as image processing or machine learning.
  • the feature amount calculation unit 16 outputs the calculated feature amount to the hyperparameter calculation unit 17.
  • the feature amount calculation unit 16 outputs the calculated feature amount in association with, for example, the converted camera image.
  • the hyperparameter calculation unit 17 calculates the hyperparameters used when calculating the control amount of the moving body by inputting the feature amount calculated by the feature amount calculation unit 16 into the machine learning model 13.
  • the hyperparameter calculation unit 17 outputs the calculated hyperparameters to the control amount calculation unit 18.
  • the control amount calculation unit 18 calculates the control amount of the moving body based on the feature amount calculated by the feature amount calculation unit 16 and the hyperparameters calculated by the hyperparameter calculation unit 17.
  • the control amount calculation unit 18 calculates the control amount of the moving body by the known model prediction control.
  • the control amount calculation unit 18 outputs the calculated control amount of the moving body to an external device (not shown).
  • the external device is, for example, an automatic driving control device (not shown) that controls the automatic driving of the vehicle.
  • the automatic driving control device automatically drives the vehicle based on the control amount output from the control amount calculation unit 18.
  • the hyperparameter calculation unit 17 may output the calculated hyperparameters to the learning unit 12. At this time, the hyperparameter calculation unit 17 also outputs the feature amount when the hyperparameter is calculated. Then, the learning unit 12 may cause the machine learning model 13 to learn based on the hyperparameters and the feature amount output from the hyperparameter calculation unit 17.
  • FIG. 2 is a flowchart for explaining the operation of the mobile control device 1 according to the first embodiment.
  • the teacher data acquisition unit 11 acquires teacher data (step ST201).
  • the teacher data acquisition unit 11 outputs the acquired teacher data to the learning unit 12.
  • the learning unit 12 generates a machine learning model 13 by learning using the teacher data acquired by the teacher data acquisition unit 11 in step ST201 (step ST202).
  • the actual sensor data acquisition unit 14 acquires the actual sensor data (step ST203). Specifically, the actual sensor data acquisition unit 14 acquires, for example, a camera image. The actual sensor data acquisition unit 14 outputs the camera image to the data conversion unit 15.
  • the data conversion unit 15 performs data conversion on the data elements included in the actual sensor data acquired by the actual sensor data acquisition unit 14 in step ST203, for each set of data elements forming a characteristic category (step ST204).
  • the data conversion unit 15 outputs the converted camera image to the feature amount calculation unit 16.
  • the feature amount calculation unit 16 calculates the feature amount according to the traveling state of the moving body from the converted camera image after the data conversion unit 15 has converted in step ST204 (step ST205).
  • the feature amount calculation unit 16 outputs the calculated feature amount to the hyperparameter calculation unit 17.
  • the feature amount calculation unit 16 outputs the calculated feature amount in association with, for example, the converted camera image.
  • the hyperparameter calculation unit 17 calculates the hyperparameters used when calculating the control amount of the moving body by inputting the feature amount calculated by the feature amount calculation unit 16 in step ST205 into the machine learning model 13 generated by the learning unit 12 in step ST202 (step ST206). The hyperparameter calculation unit 17 outputs the calculated hyperparameters to the control amount calculation unit 18.
  • the control amount calculation unit 18 calculates the control amount of the moving body based on the feature amount calculated by the feature amount calculation unit 16 in step ST205 and the hyperparameters calculated by the hyperparameter calculation unit 17 in step ST206 (step ST207).
  • step ST201 and step ST202 may be performed before the operation of step ST206 is performed.
  • the hyperparameter calculation unit 17 may output the calculated hyperparameters to the learning unit 12. At this time, the hyperparameter calculation unit 17 also outputs the feature amount when the hyperparameter is calculated. After that, the learning unit 12 may cause the machine learning model 13 to learn based on the hyperparameters and the feature amount output from the hyperparameter calculation unit 17.
  • the mobile control device 1 calculates hyperparameters using machine learning. Specifically, the mobile control device 1 calculates hyperparameters by inputting the feature amount calculated from the actual sensor data into the machine learning model 13. Therefore, the mobile body control device 1 can set hyperparameters according to the traveling situation, which are used in the mobile body control technology, without human intervention.
  • the moving body control device 1 calculates the control amount of the moving body using the hyperparameters calculated based on the machine learning model 13. As a result, the moving body control device 1 can obtain a controlled amount of the moving body according to the traveling state of the moving body.
  • the moving body control device 1 generates a machine learning model 13 by learning using teacher data in which hyperparameters are added to feature quantities. Therefore, the mobile body control device 1 can set hyperparameters according to the traveling situation, which are used in the mobile body control technology, without human intervention.
  • the mobile control device 1 is provided with the data conversion unit 15, but the mobile control device 1 is not required to include the data conversion unit 15.
  • the feature amount calculation unit 16 may calculate the feature amount according to the traveling situation from the actual sensor data acquired by the actual sensor data acquisition unit 14.
  • the learning device 2 is provided in the mobile control device 1, but this is only an example.
  • the learning device 2 may be provided outside the mobile control device 1.
  • FIG. 3 is a diagram showing a configuration example of a moving body control system in the first embodiment, in which the learning device 2 is provided outside the moving body control device 1 and the moving body control device 1a and the learning device 2 constitute the moving body control system. In FIG. 3, the same reference numerals are given to the same configurations as those described with reference to FIG. 1, and duplicate description is omitted.
  • the moving body control device 1a includes an actual sensor data acquisition unit 14, a data conversion unit 15, a feature amount calculation unit 16, a hyperparameter calculation unit 17, and a control amount calculation unit 18.
  • the mobile control device 1a shown in FIG. 3 is not required to include the data conversion unit 15.
  • the learning device 2 is provided in a server, for example, and is connected to the mobile control device 1a via a network.
  • the learning device 2 may be, for example, an in-vehicle device.
  • in the first embodiment, the moving body control devices 1 and 1a are in-vehicle devices mounted on the vehicle, and the actual sensor data acquisition unit 14, the data conversion unit 15, the feature amount calculation unit 16, the hyperparameter calculation unit 17, and the control amount calculation unit 18 are provided in the moving body control devices 1 and 1a. The configuration is not limited to this: a moving body control system may be configured by an in-vehicle device and a server, with a part of the actual sensor data acquisition unit 14, the data conversion unit 15, the feature amount calculation unit 16, the hyperparameter calculation unit 17, and the control amount calculation unit 18 mounted on the in-vehicle device of the vehicle and the rest provided in a server connected to the in-vehicle device via a network.
  • for example, the feature amount calculation unit 16, the hyperparameter calculation unit 17, and the control amount calculation unit 18 may be provided in the server, and the actual sensor data acquisition unit 14 and the data conversion unit 15 may be provided in the in-vehicle device.
  • the feature amount calculation unit 16 acquires the actual sensor data after data conversion from the in-vehicle device.
  • the control amount calculation unit 18 outputs the calculated control amount to the in-vehicle device.
  • FIGS. 4A and 4B are diagrams showing an example of the hardware configuration of the moving body control devices 1 and 1a according to the first embodiment.
  • the functions of the teacher data acquisition unit 11, the learning unit 12, the actual sensor data acquisition unit 14, the data conversion unit 15, the feature amount calculation unit 16, the hyperparameter calculation unit 17, and the control amount calculation unit 18 are realized by a processing circuit 401. That is, the moving body control device 1 includes the processing circuit 401 for performing control for calculating the control amount of the moving body using machine learning.
  • the processing circuit 401 may be dedicated hardware as shown in FIG. 4A, or may be a CPU (Central Processing Unit) 405 that executes a program stored in the memory 406 as shown in FIG. 4B.
  • the processing circuit 401 may be, for example, a single circuit, a composite circuit, a programmed processor, a parallel-programmed processor, an ASIC (Application Specific Integrated Circuit), an FPGA (Field-Programmable Gate Array), or a combination of these.
  • when the processing circuit 401 is the CPU 405, the functions of the teacher data acquisition unit 11, the learning unit 12, the actual sensor data acquisition unit 14, the data conversion unit 15, the feature amount calculation unit 16, the hyperparameter calculation unit 17, and the control amount calculation unit 18 are realized by software, firmware, or a combination of software and firmware. That is, these units are realized by a processing circuit such as the CPU 405 that executes programs stored in an HDD (Hard Disk Drive) 402, a memory 406, or the like, or by a system LSI (Large-Scale Integration).
  • it can also be said that the programs stored in the HDD 402, the memory 406, and the like cause a computer to execute the procedures or methods of the teacher data acquisition unit 11, the learning unit 12, the actual sensor data acquisition unit 14, the data conversion unit 15, the feature amount calculation unit 16, the hyperparameter calculation unit 17, and the control amount calculation unit 18.
  • the memory 406 is, for example, a non-volatile or volatile semiconductor memory such as a RAM (Random Access Memory), a ROM (Read Only Memory), a flash memory, an EPROM (Erasable Programmable Read Only Memory), or an EEPROM (Electrically Erasable Programmable Read-Only Memory), or a magnetic disk, a flexible disk, an optical disc, a compact disc, a mini disc, a DVD (Digital Versatile Disc), or the like.
  • for example, the functions of the teacher data acquisition unit 11 and the actual sensor data acquisition unit 14 may be realized by the processing circuit 401 as dedicated hardware, while the functions of the learning unit 12, the data conversion unit 15, the feature amount calculation unit 16, the hyperparameter calculation unit 17, and the control amount calculation unit 18 are realized by the processing circuit 401 reading and executing a program stored in the memory 406.
  • the mobile body control device 1 also includes an input interface device 403 and an output interface device 404 that perform wired or wireless communication with devices such as the automatic driving control device.
  • as described above, the moving body control devices 1 and 1a, which calculate the control amount of the moving body from the sensor data (actual sensor data) indicating the surrounding environment of the moving body, are configured to include the actual sensor data acquisition unit 14 that acquires the sensor data (actual sensor data), the feature amount calculation unit 16 that calculates the feature amount from the sensor data (actual sensor data) acquired by the actual sensor data acquisition unit 14, and the hyperparameter calculation unit 17 that calculates the hyperparameters used when calculating the control amount of the moving body by inputting the feature amount calculated by the feature amount calculation unit 16 into the machine learning model 13. Therefore, the moving body control devices 1 and 1a can set hyperparameters according to the traveling situation, used in moving body control technology, without human intervention.
  • the teacher data is data in which hyperparameters used when calculating the control amount of the moving body are added to the feature amount calculated from the sensor data indicating the surrounding environment of the moving body.
  • the moving body is a vehicle, but this is only an example.
  • the mobile body control devices 1 and 1a according to the first embodiment can be used as a device for calculating the control amount of the mobile body in various mobile bodies capable of automatic control.
  • since the moving body control device according to the present disclosure is configured so that hyperparameters according to the traveling situation, used in moving body control technology, can be set without human intervention, it can be applied to moving body control devices that calculate the control amount of a moving body.
  • 1, 1a moving body control device, 2 learning device, 11 teacher data acquisition unit, 12 learning unit, 13 machine learning model, 14 actual sensor data acquisition unit, 15 data conversion unit, 16 feature amount calculation unit, 17 hyperparameter calculation unit, 18 control amount calculation unit, 401 processing circuit, 402 HDD, 403 input interface device, 404 output interface device, 405 CPU, 406 memory.

Landscapes

  • Engineering & Computer Science (AREA)
  • Automation & Control Theory (AREA)
  • Human Computer Interaction (AREA)
  • Transportation (AREA)
  • Mechanical Engineering (AREA)
  • Traffic Control Systems (AREA)
  • Control Of Driving Devices And Active Controlling Of Vehicle (AREA)

Abstract

A moving body control device comprises an actual sensor data acquisition unit (14) that acquires sensor data, a feature amount calculation unit (16) that calculates a feature amount from the sensor data acquired by the actual sensor data acquisition unit (14), and a hyperparameter calculation unit (17) that inputs the feature amount calculated by the feature amount calculation unit (16) to a machine learning model (13) to thereby calculate a hyperparameter used when calculating a control amount of a moving body.

Description

Moving body control device, moving body control method, and learning device
The present disclosure relates to a mobile body control device for calculating a control amount of a mobile body, a mobile body control method, and a learning device.
Conventionally, in the field of automatic driving of a moving body, a technique for learning the control amount of a vehicle for each driving situation is known (for example, Patent Document 1).
JP-A-2019-10967
In mobile control technology such as model prediction control or PID control, there is a problem that hyperparameters according to the driving situation must be manually set in order to obtain a control amount according to the driving situation. Hyperparameters are weights of evaluation functions and the like.
The present disclosure has been made to solve the above-mentioned problem, and its purpose is to provide a mobile body control device that can set hyperparameters according to the driving situation, used in mobile body control technology, without human intervention.
The moving body control device according to the present disclosure is a moving body control device that calculates a control amount of a moving body from sensor data indicating the surrounding environment of the moving body, and includes an actual sensor data acquisition unit that acquires the sensor data, a feature amount calculation unit that calculates a feature amount from the sensor data acquired by the actual sensor data acquisition unit, and a hyperparameter calculation unit that calculates hyperparameters used when calculating the control amount of the moving body by inputting the feature amount calculated by the feature amount calculation unit into a machine learning model.
According to the present disclosure, hyperparameters according to the driving situation used in the mobile control technology can be set without human intervention.
FIG. 1 is a diagram showing a configuration example of the moving body control device according to Embodiment 1.
FIG. 2 is a flowchart for explaining the operation of the moving body control device according to Embodiment 1.
FIG. 3 is a diagram showing a configuration example of a moving body control system according to Embodiment 1 in which the learning device is provided outside the moving body control device and the moving body control device and the learning device constitute the moving body control system.
FIGS. 4A and 4B are diagrams showing an example of the hardware configuration of the moving body control device according to Embodiment 1.
Hereinafter, embodiments of the present disclosure will be described in detail with reference to the drawings.
Embodiment 1.
FIG. 1 is a diagram showing a configuration example of the mobile control device 1 according to the first embodiment.
In the first embodiment, the moving body is assumed to be a vehicle. Further, it is assumed that the mobile control device 1 according to the first embodiment is mounted on a vehicle (not shown).
The moving body control device 1 calculates the control amount of the moving body according to the traveling situation in which the moving body is traveling. In the first embodiment, the traveling condition means various shapes of a road on which a moving body travels, such as a straight road, a curve, an uphill, a downhill, or an intersection. In the first embodiment, the traveling speed when the moving body travels on roads of these various shapes is also included in the traveling situation. Further, in the first embodiment, the control amount of the moving body is a control amount for controlling the operation of the moving body. The mobile body control device 1 calculates the control amount of the mobile body by using a known mobile body control technique.
In the first embodiment, the mobile body control device 1 calculates the control amount of the mobile body by using a known model predictive control technique. At that time, the moving body control device 1 calculates the hyperparameters used when calculating the control amount of the moving body according to the traveling situation in which the moving body is traveling.
In model predictive control, a model that predicts future behavior based on vehicle dynamics is generated in advance, and the optimal input to be given is calculated based on that model under an evaluation function and constraint conditions. The hyperparameters are the weights of the evaluation function in model predictive control or the threshold values in the constraint conditions. The mobile body control device 1 calculates the optimal control amount to be given to the moving body based on the model predictive control. In PID control, the hyperparameters are the proportional gain, the integral gain, and the differential gain.
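The role of these hyperparameters can be illustrated with a minimal sketch, assuming a quadratic evaluation function and a simple box constraint; the function names and values below are hypothetical and are not taken from the present disclosure.

```python
import numpy as np

def evaluation_function(predicted_states, reference, inputs, w_track, w_effort):
    """Quadratic evaluation function; w_track and w_effort are hyperparameters (weights)."""
    tracking_cost = np.sum(w_track * (predicted_states - reference) ** 2)
    effort_cost = np.sum(w_effort * inputs ** 2)
    return tracking_cost + effort_cost

def satisfies_constraints(inputs, input_limit):
    """Constraint check; input_limit is a hyperparameter (threshold of a constraint condition)."""
    return np.all(np.abs(inputs) <= input_limit)

def pid_step(error, error_sum, prev_error, kp, ki, kd, dt=0.1):
    """One PID step; kp, ki and kd are the hyperparameters in PID control."""
    error_sum += error * dt
    derivative = (error - prev_error) / dt
    return kp * error + ki * error_sum + kd * derivative, error_sum
```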
The moving body control device 1 calculates hyperparameters based on a trained model in machine learning (hereinafter referred to as "machine learning model"). The machine learning model is generated by the mobile control device 1.
The control amount calculated by the moving body control device 1 is used for automatic driving control in a moving body, in other words, a vehicle. In the first embodiment, it is assumed that the vehicle has an automatic driving function. Even when the vehicle has an automatic driving function, the driver can drive the vehicle by himself / herself without executing the automatic driving function.
As shown in FIG. 1, the moving body control device 1 includes a learning device 2, an actual sensor data acquisition unit 14, a data conversion unit 15, a feature amount calculation unit 16, a hyperparameter calculation unit 17, and a control amount calculation unit 18.
The learning device 2 generates a machine learning model 13 by learning using the teacher data. The learning device 2 includes a teacher data acquisition unit 11, a learning unit 12, and the machine learning model 13.
The teacher data acquisition unit 11 acquires teacher data.
In the first embodiment, the teacher data is data in which hyperparameters are added to the feature amount calculated from the sensor data indicating the surrounding environment of the moving body. The feature amount calculated from the sensor data indicating the surrounding environment of the moving body is the feature amount according to the traveling condition of the moving body.
The teacher data acquisition unit 11 outputs the acquired teacher data to the learning unit 12.
The learning unit 12 generates the machine learning model 13 by learning using the teacher data acquired by the teacher data acquisition unit 11.
The machine learning model 13 is a machine learning model that receives, as input, a feature amount calculated from sensor data indicating the surrounding environment of the moving body and outputs hyperparameters.
In the first embodiment, as shown in FIG. 1, the machine learning model 13 is provided in the moving body control device 1, but this is only an example; the machine learning model 13 may be provided outside the moving body control device 1, in a location that the moving body control device 1 can refer to.
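A rough sketch of this learning step, assuming a generic multi-output regressor from scikit-learn as the machine learning model 13 (the disclosure does not specify the model type) and hypothetical feature and hyperparameter layouts:

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

# Hypothetical teacher data: each row pairs a feature amount (here road curvature,
# road gradient and target speed) with the hyperparameters for which the automatic
# driving simulator obtained an ideal control amount (here two evaluation-function
# weights and one constraint threshold).
features = np.array([[0.00, 0.0, 60.0],   # straight, flat, 60 km/h
                     [0.08, 0.0, 40.0],   # curve, flat, 40 km/h
                     [0.00, 5.0, 50.0]])  # straight, uphill, 50 km/h
hyperparameters = np.array([[1.0, 0.1, 3.0],
                            [2.5, 0.3, 2.0],
                            [1.2, 0.2, 2.5]])

# Learning unit 12: generate the machine learning model 13 from the teacher data.
machine_learning_model = RandomForestRegressor(n_estimators=50, random_state=0)
machine_learning_model.fit(features, hyperparameters)

# Hyperparameter calculation unit 17: input a feature amount, obtain hyperparameters.
predicted = machine_learning_model.predict(np.array([[0.05, 2.0, 45.0]]))
```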
The actual sensor data acquisition unit 14 acquires sensor data (hereinafter referred to as "actual sensor data") indicating the surrounding environment of the moving body when the moving body actually travels.
In the first embodiment, the actual sensor data is, for example, an image. The actual sensor data acquisition unit 14 acquires, for example, an image captured by a camera (not shown) that captures the front of the moving body. The camera is provided on the moving body. In the first embodiment, the actual sensor data will be described below assuming that the actual sensor data is an image captured by the camera (hereinafter referred to as "camera image"). The actual sensor data acquisition unit 14 outputs the camera image to the data conversion unit 15.
The actual sensor data may also be numerical data such as LiDAR data.
The data conversion unit 15 performs data conversion on the data elements included in the actual sensor data acquired by the actual sensor data acquisition unit 14. For example, the data conversion unit 15 performs this data conversion using a known semantic segmentation technique. As a specific example, the data conversion unit 15 performs data conversion that color-codes the pixels of the camera image, for example coloring pixels indicating a car blue, pixels indicating a road pink, and pixels indicating a street tree green. When the actual sensor data is numerical data, the data conversion unit 15 performs data conversion that, for example, removes noise so that the numerical data approaches simulation data indicating the surrounding environment of the moving body.
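A minimal sketch of the color-coding conversion, assuming a per-pixel class label map is already available from a semantic segmentation model; the class IDs and colors are hypothetical:

```python
import numpy as np

# Hypothetical class IDs from a semantic segmentation model and the colors (RGB)
# used for the data conversion described above.
CLASS_COLORS = {
    0: (0, 0, 255),      # pixels indicating a car         -> blue
    1: (255, 105, 180),  # pixels indicating a road        -> pink
    2: (0, 128, 0),      # pixels indicating a street tree -> green
}

def color_code(label_map: np.ndarray) -> np.ndarray:
    """Convert a per-pixel class label map (H x W) into a color-coded image (H x W x 3)."""
    converted = np.zeros((*label_map.shape, 3), dtype=np.uint8)
    for class_id, color in CLASS_COLORS.items():
        converted[label_map == class_id] = color
    return converted
```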
The feature amounts included in the teacher data are feature amounts calculated from simulation data. For example, a feature amount included in the teacher data is a feature amount calculated from an image reproduced by an automatic driving simulator (not shown) (hereinafter referred to as "simulation image"). Specifically, the teacher data is, for example, data in which a feature amount calculated from a simulation image is associated with the hyperparameters for which an ideal control amount is obtained when the control amount of the moving body is calculated based on that feature amount in the automatic driving simulator.
The automatic driving simulator is a so-called automatic driving simulator using a general simulation technique. In the automatic driving simulator, a simulation image showing the surrounding environment of the moving body is reproduced. The simulation image is, for example, a CG (Computer Graphics) image.
Here, a feature amount that should be calculated as the same feature amount may not be calculated identically when it is calculated from the camera image and when it is calculated from the simulation image. The feature amount calculated from the camera image is required when the control amount calculation unit 18 calculates the control amount of the moving body in the moving body control device 1. At that time, the control amount calculation unit 18 uses the hyperparameters calculated by the hyperparameter calculation unit 17 based on the machine learning model 13. The feature amount calculation unit 16 calculates the feature amount based on the camera image. Details of the control amount calculation unit 18, the feature amount calculation unit 16, and the hyperparameter calculation unit 17 will be described later.
Consequently, if hyperparameters based on the machine learning model 13, which was generated from teacher data containing feature amounts calculated from simulation images, are used when calculating the control amount of the moving body based on feature amounts calculated from the camera image, an appropriate control amount may not be calculated.
Therefore, the data conversion unit 15 performs data conversion on the camera image to absorb the difference that arises, for a feature amount that should be calculated identically, between calculation from the camera image and calculation from the simulation image. This allows the moving body control device 1 to reduce the possibility that the control amount is not calculated appropriately because the hyperparameters used when calculating the control amount of the moving body are based on a feature amount different from the feature amount used when calculating that control amount.
Note that the same data conversion as the data conversion performed by the data conversion unit 15 on the camera image needs to be performed on the simulation image before its feature amount is calculated.
The data conversion unit 15 outputs the data-converted camera image (hereinafter referred to as "converted camera image") to the feature amount calculation unit 16.
The feature amount calculation unit 16 calculates, from the converted camera image output by the data conversion unit 15, a feature amount according to the traveling situation of the moving body.
The feature amount calculation unit 16 calculates the feature amount by using a known technique such as image processing or machine learning.
The feature amount calculation unit 16 outputs the calculated feature amount to the hyperparameter calculation unit 17. The feature amount calculation unit 16 outputs the calculated feature amount in association with, for example, the converted camera image.
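The disclosure leaves the concrete feature amounts open; as one assumed possibility, the following sketch derives a small feature vector (road-pixel ratio, a left/right balance as a crude curvature proxy, and the current speed) from the color-coded converted camera image:

```python
import numpy as np

ROAD_COLOR = (255, 105, 180)  # pink road pixels produced by the data conversion (assumed)

def calculate_features(converted_image: np.ndarray, current_speed: float) -> np.ndarray:
    """Illustrative feature amount: road-pixel ratio, left/right road balance
    (a crude proxy for curvature), and the current traveling speed."""
    road_mask = np.all(converted_image == ROAD_COLOR, axis=-1)
    road_ratio = road_mask.mean()
    w = road_mask.shape[1]
    left = road_mask[:, : w // 2].sum()
    right = road_mask[:, w // 2 :].sum()
    balance = (right - left) / max(left + right, 1)  # > 0 suggests a right-hand curve
    return np.array([road_ratio, balance, current_speed])
```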
The hyperparameter calculation unit 17 calculates the hyperparameters used when calculating the control amount of the moving body by inputting the feature amount calculated by the feature amount calculation unit 16 into the machine learning model 13.
The hyperparameter calculation unit 17 outputs the calculated hyperparameters to the control amount calculation unit 18.
The control amount calculation unit 18 calculates the control amount of the moving body based on the feature amount calculated by the feature amount calculation unit 16 and the hyperparameters calculated by the hyperparameter calculation unit 17. The control amount calculation unit 18 calculates the control amount of the moving body by the known model prediction control.
The control amount calculation unit 18 outputs the calculated control amount of the moving body to an external device (not shown). The external device is, for example, an automatic driving control device (not shown) that controls the automatic driving of the vehicle. The automatic driving control device automatically drives the vehicle based on the control amount output from the control amount calculation unit 18.
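Assuming the toy helpers sketched above, the control amount calculation could be illustrated as a single-step search over candidate inputs, with the predicted hyperparameters supplying the evaluation-function weights and the constraint threshold; the prediction model and search are deliberately simplistic and hypothetical:

```python
import numpy as np

def calculate_control_amount(feature, machine_learning_model, reference_speed=50.0):
    """Illustrative control amount calculation unit 18: obtain hyperparameters from the
    machine learning model 13, then pick the input that minimizes the evaluation function."""
    w_track, w_effort, input_limit = machine_learning_model.predict(
        feature.reshape(1, -1))[0]

    current_speed = feature[-1]                # assumed: last feature element is the speed
    candidates = np.linspace(-3.0, 3.0, 61)    # candidate speed changes per step (toy units)
    best_input, best_cost = 0.0, np.inf
    for u in candidates:
        if abs(u) > input_limit:               # constraint threshold (hyperparameter)
            continue
        predicted_speed = current_speed + u    # toy one-step prediction model
        cost = (w_track * (predicted_speed - reference_speed) ** 2
                + w_effort * u ** 2)           # evaluation-function weights (hyperparameters)
        if cost < best_cost:
            best_input, best_cost = u, cost
    return best_input
```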
In the first embodiment, the hyperparameter calculation unit 17 may output the calculated hyperparameters to the learning unit 12. At this time, the hyperparameter calculation unit 17 also outputs the feature amount when the hyperparameter is calculated. Then, the learning unit 12 may cause the machine learning model 13 to learn based on the hyperparameters and the feature amount output from the hyperparameter calculation unit 17.
The operation of the mobile control device 1 according to the first embodiment will be described.
FIG. 2 is a flowchart for explaining the operation of the mobile control device 1 according to the first embodiment.
The teacher data acquisition unit 11 acquires teacher data (step ST201).
The teacher data acquisition unit 11 outputs the acquired teacher data to the learning unit 12.
The learning unit 12 generates the machine learning model 13 by learning using the teacher data acquired by the teacher data acquisition unit 11 in step ST201 (step ST202).
The actual sensor data acquisition unit 14 acquires the actual sensor data (step ST203). Specifically, the actual sensor data acquisition unit 14 acquires, for example, a camera image.
The actual sensor data acquisition unit 14 outputs the camera image to the data conversion unit 15.
The data conversion unit 15 performs data conversion on the data elements included in the actual sensor data acquired by the actual sensor data acquisition unit 14 in step ST203, for each set of data elements forming a characteristic category (step ST204).
The data conversion unit 15 outputs the converted camera image to the feature amount calculation unit 16.
The feature amount calculation unit 16 calculates the feature amount according to the traveling situation of the moving body from the converted camera image output by the data conversion unit 15 in step ST204 (step ST205).
The feature amount calculation unit 16 outputs the calculated feature amount to the hyperparameter calculation unit 17. The feature amount calculation unit 16 outputs the calculated feature amount in association with, for example, the converted camera image.
The hyperparameter calculation unit 17 calculates the hyperparameters used when calculating the control amount of the moving body by inputting the feature amount calculated by the feature amount calculation unit 16 in step ST205 into the machine learning model 13 generated by the learning unit 12 in step ST202 (step ST206).
The hyperparameter calculation unit 17 outputs the calculated hyperparameters to the control amount calculation unit 18.
The control amount calculation unit 18 calculates the control amount of the moving body based on the feature amount calculated by the feature amount calculation unit 16 in step ST205 and the hyperparameters calculated by the hyperparameter calculation unit 17 in step ST206 (step ST207).
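The online portion of the flowchart (steps ST203 to ST207) can be summarized, again only as an assumed sketch that chains the hypothetical helpers defined above:

```python
def control_cycle(camera_label_map, current_speed, machine_learning_model):
    """One pass through steps ST203 to ST207 (illustrative only)."""
    # ST203: actual sensor data (here a per-pixel label map stands in for the camera image).
    # ST204: data conversion.
    converted_image = color_code(camera_label_map)
    # ST205: feature amount calculation.
    feature = calculate_features(converted_image, current_speed)
    # ST206 and ST207: hyperparameter calculation (inside the helper, via the machine
    # learning model 13) followed by control amount calculation.
    return calculate_control_amount(feature, machine_learning_model)
```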
Regarding the operation of the mobile control device 1 described with reference to FIG. 2, the operations of step ST201 and step ST202 only need to be performed at some point before the operation of step ST206 is performed.
Further, in step ST206, the hyperparameter calculation unit 17 may output the calculated hyperparameters to the learning unit 12. At this time, the hyperparameter calculation unit 17 also outputs the feature amount used when the hyperparameters were calculated. After that, the learning unit 12 may cause the machine learning model 13 to learn based on the hyperparameters and the feature amount output from the hyperparameter calculation unit 17.
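One assumed way to realize this optional additional learning is to accumulate the newly used (feature amount, hyperparameter) pairs into the teacher data and refit the model; the disclosure does not specify the update strategy, so the sketch below is hypothetical:

```python
import numpy as np

def additional_learning(machine_learning_model, features, hyperparameters,
                        new_feature, new_hyperparameters):
    """Illustrative learning unit 12 update: append the pair output by the hyperparameter
    calculation unit 17 to the teacher data and retrain the machine learning model 13."""
    features = np.vstack([features, new_feature])
    hyperparameters = np.vstack([hyperparameters, new_hyperparameters])
    machine_learning_model.fit(features, hyperparameters)
    return machine_learning_model, features, hyperparameters
```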
In this way, the mobile control device 1 calculates hyperparameters using machine learning. Specifically, the mobile control device 1 calculates hyperparameters by inputting the feature amount calculated from the actual sensor data into the machine learning model 13. Therefore, the mobile body control device 1 can set hyperparameters according to the traveling situation, which are used in the mobile body control technology, without human intervention.
Further, the moving body control device 1 calculates the control amount of the moving body using the hyperparameters calculated based on the machine learning model 13. As a result, the moving body control device 1 can obtain a control amount of the moving body according to the traveling situation of the moving body.
Further, the moving body control device 1 generates the machine learning model 13 by learning using teacher data in which hyperparameters are added to feature amounts. Therefore, the mobile body control device 1 can set hyperparameters according to the traveling situation, which are used in the mobile body control technology, without human intervention.
In the above-described first embodiment, the mobile control device 1 is provided with the data conversion unit 15, but the mobile control device 1 is not required to include the data conversion unit 15. The feature amount calculation unit 16 may calculate the feature amount according to the traveling situation from the actual sensor data acquired by the actual sensor data acquisition unit 14.
Further, in the above-described first embodiment, the learning device 2 is provided in the mobile control device 1, but this is only an example. For example, the learning device 2 may be provided outside the mobile control device 1.
FIG. 3 is a diagram showing a configuration example of a moving body control system in the first embodiment, in which the learning device 2 is provided outside the moving body control device 1 and the moving body control device 1a and the learning device 2 constitute the moving body control system.
In FIG. 3, the same reference numerals are given to the same configurations as those described with reference to FIG. 1, and duplicate description is omitted.
The moving body control device 1a includes an actual sensor data acquisition unit 14, a data conversion unit 15, a feature amount calculation unit 16, a hyperparameter calculation unit 17, and a control amount calculation unit 18. The mobile control device 1a shown in FIG. 3 is not required to include the data conversion unit 15.
In FIG. 3, the learning device 2 is provided in a server, for example, and is connected to the mobile control device 1a via a network. The learning device 2 may instead be, for example, an in-vehicle device.
Further, in the above-described first embodiment, the moving body control devices 1 and 1a are in-vehicle devices mounted on the vehicle, and the actual sensor data acquisition unit 14, the data conversion unit 15, the feature amount calculation unit 16, the hyperparameter calculation unit 17, and the control amount calculation unit 18 are provided in the moving body control devices 1 and 1a. The configuration is not limited to this: a moving body control system may be configured by an in-vehicle device and a server, with a part of the actual sensor data acquisition unit 14, the data conversion unit 15, the feature amount calculation unit 16, the hyperparameter calculation unit 17, and the control amount calculation unit 18 mounted on the in-vehicle device of the vehicle and the rest provided in a server connected to the in-vehicle device via a network.
For example, the feature amount calculation unit 16, the hyperparameter calculation unit 17, and the control amount calculation unit 18 may be provided in the server, and the actual sensor data acquisition unit 14 and the data conversion unit 15 may be provided in the in-vehicle device. In that case, the feature amount calculation unit 16 acquires the actual sensor data after data conversion from the in-vehicle device, and the control amount calculation unit 18 outputs the calculated control amount to the in-vehicle device.
 FIGS. 4A and 4B are diagrams showing an example of the hardware configuration of the moving body control devices 1 and 1a according to the first embodiment.
 In the first embodiment, the functions of the teacher data acquisition unit 11, the learning unit 12, the actual sensor data acquisition unit 14, the data conversion unit 15, the feature amount calculation unit 16, the hyperparameter calculation unit 17, and the control amount calculation unit 18 are realized by a processing circuit 401. That is, the moving body control device 1 includes the processing circuit 401 for performing control to calculate the control amount of the moving body using machine learning.
 The processing circuit 401 may be dedicated hardware as shown in FIG. 4A, or may be a CPU (Central Processing Unit) 405 that executes a program stored in a memory 406 as shown in FIG. 4B.
 When the processing circuit 401 is dedicated hardware, the processing circuit 401 corresponds to, for example, a single circuit, a composite circuit, a programmed processor, a parallel-programmed processor, an ASIC (Application Specific Integrated Circuit), an FPGA (Field-Programmable Gate Array), or a combination thereof.
 When the processing circuit 401 is the CPU 405, the functions of the teacher data acquisition unit 11, the learning unit 12, the actual sensor data acquisition unit 14, the data conversion unit 15, the feature amount calculation unit 16, the hyperparameter calculation unit 17, and the control amount calculation unit 18 are realized by software, firmware, or a combination of software and firmware. That is, the teacher data acquisition unit 11, the learning unit 12, the actual sensor data acquisition unit 14, the data conversion unit 15, the feature amount calculation unit 16, the hyperparameter calculation unit 17, and the control amount calculation unit 18 are realized by a processing circuit such as the CPU 405, which executes programs stored in an HDD (Hard Disk Drive) 402, the memory 406, or the like, or a system LSI (Large-Scale Integration). The programs stored in the HDD 402, the memory 406, and the like can also be said to cause a computer to execute the procedures or methods of the teacher data acquisition unit 11, the learning unit 12, the actual sensor data acquisition unit 14, the data conversion unit 15, the feature amount calculation unit 16, the hyperparameter calculation unit 17, and the control amount calculation unit 18. Here, the memory 406 corresponds to, for example, a nonvolatile or volatile semiconductor memory such as a RAM (Random Access Memory), a ROM (Read Only Memory), a flash memory, an EPROM (Erasable Programmable Read Only Memory), or an EEPROM (Electrically Erasable Programmable Read-Only Memory), or a magnetic disk, a flexible disk, an optical disc, a compact disc, a mini disc, a DVD (Digital Versatile Disc), or the like.
 Note that some of the functions of the teacher data acquisition unit 11, the learning unit 12, the actual sensor data acquisition unit 14, the data conversion unit 15, the feature amount calculation unit 16, the hyperparameter calculation unit 17, and the control amount calculation unit 18 may be realized by dedicated hardware, and the rest by software or firmware. For example, the functions of the teacher data acquisition unit 11 and the actual sensor data acquisition unit 14 can be realized by the processing circuit 401 as dedicated hardware, while the functions of the learning unit 12, the data conversion unit 15, the feature amount calculation unit 16, the hyperparameter calculation unit 17, and the control amount calculation unit 18 can be realized by the processing circuit 401 reading and executing a program stored in the memory 406.
 The moving body control device 1 also includes an input interface device 403 and an output interface device 404 that perform wired or wireless communication with devices such as an automated driving control device.
 As described above, according to the first embodiment, the moving body control devices 1 and 1a, which calculate the control amount of a moving body from sensor data (actual sensor data) indicating the surrounding environment of the moving body, are configured to include the actual sensor data acquisition unit 14 that acquires the sensor data (actual sensor data), the feature amount calculation unit 16 that calculates a feature amount from the sensor data (actual sensor data) acquired by the actual sensor data acquisition unit 14, and the hyperparameter calculation unit 17 that calculates the hyperparameters used when calculating the control amount of the moving body by inputting the feature amount calculated by the feature amount calculation unit 16 into the machine learning model 13. Therefore, the moving body control devices 1 and 1a can set the hyperparameters corresponding to the driving situation, which are used in the moving body control technology, without human intervention.
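 To make the role of the calculated hyperparameters concrete, the sketch below assumes, purely for illustration, that they are weights in an evaluation function used to score candidate trajectories; the evaluation terms, the trajectory representation, and the example weight values are invented for this sketch and are not specified by the disclosure.

```python
from typing import Dict, List, Tuple

def evaluate_trajectory(traj: Dict[str, float], weights: Dict[str, float]) -> float:
    """Weighted evaluation function; the weights are the calculated hyperparameters."""
    return (weights["w_safety"] * traj["min_obstacle_distance"]
            - weights["w_comfort"] * traj["max_lateral_accel"])

def calculate_control_amount(candidates: List[Dict[str, float]],
                             weights: Dict[str, float]) -> Tuple[int, float]:
    """Pick the best-scoring candidate and return its index and steering command."""
    scores = [evaluate_trajectory(t, weights) for t in candidates]
    best = max(range(len(candidates)), key=lambda i: scores[i])
    return best, candidates[best]["steering_angle"]

if __name__ == "__main__":
    # Hyperparameters as the model might output them for a congested urban scene.
    weights = {"w_safety": 2.0, "w_comfort": 0.5}
    candidates = [
        {"min_obstacle_distance": 4.0, "max_lateral_accel": 1.2, "steering_angle": 0.05},
        {"min_obstacle_distance": 6.5, "max_lateral_accel": 2.8, "steering_angle": 0.12},
    ]
    print(calculate_control_amount(candidates, weights))
```

 Under this reading, changing the driving situation changes the weights, which in turn changes which candidate trajectory is selected, even when the candidates themselves are the same.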
 Further, according to the first embodiment, the learning device 2 is configured to include the teacher data acquisition unit 11 that acquires teacher data in which the hyperparameters used when calculating the control amount of a moving body are attached to feature amounts calculated from sensor data indicating the surrounding environment of the moving body, and the learning unit 12 that generates, through learning using the teacher data acquired by the teacher data acquisition unit 11, the machine learning model 13 that receives the feature amounts as input and outputs the hyperparameters. Therefore, the moving body control devices 1 and 1a can set the hyperparameters corresponding to the driving situation, which are used in the moving body control technology, without human intervention.
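 A minimal sketch of what the learning side could look like follows, assuming that each teacher data sample pairs a feature vector with hyperparameters judged suitable for that driving situation, and assuming a random-forest regressor as the machine learning model; neither the model type, the specific features, nor the example values are specified by the disclosure.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

# Hypothetical teacher data: feature vectors (e.g. ego speed, object count,
# nearest distance) each labeled with hyperparameters that suit that situation.
X = np.array([
    [10.0, 2, 12.0],   # open road
    [ 4.0, 9,  3.5],   # congested urban street
    [25.0, 1, 40.0],   # highway
])
y = np.array([
    [1.0, 1.0],        # [w_safety, w_comfort] chosen for each situation
    [2.5, 0.3],
    [0.8, 1.5],
])

# Learning unit: fit a model that maps feature amounts to hyperparameters.
model = RandomForestRegressor(n_estimators=50, random_state=0)
model.fit(X, y)

# Hyperparameter calculation unit at run time: new features in, hyperparameters out.
new_features = np.array([[6.0, 7, 5.0]])
print(model.predict(new_features))   # weights close to the congested-street label
```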
 In the first embodiment described above, the moving body is a vehicle, but this is only an example. The moving body control devices 1 and 1a according to the first embodiment can be used as devices for calculating the control amount of various moving bodies capable of automatic control.
 Within the scope of the present disclosure, any component of the embodiment may be modified or omitted.
 Since the moving body control device according to the present disclosure is configured so that the hyperparameters corresponding to the driving situation, which are used in the moving body control technology, can be set without human intervention, it can be applied to a moving body control device that calculates the control amount of a moving body.
 1, 1a: moving body control device, 2: learning device, 11: teacher data acquisition unit, 12: learning unit, 13: machine learning model, 14: actual sensor data acquisition unit, 15: data conversion unit, 16: feature amount calculation unit, 17: hyperparameter calculation unit, 18: control amount calculation unit, 401: processing circuit, 402: HDD, 403: input interface device, 404: output interface device, 405: CPU, 406: memory.

Claims (6)

  1.  A moving body control device that calculates a control amount of a moving body from sensor data indicating a surrounding environment of the moving body, the moving body control device comprising:
      an actual sensor data acquisition unit to acquire the sensor data;
      a feature amount calculation unit to calculate a feature amount from the sensor data acquired by the actual sensor data acquisition unit; and
      a hyperparameter calculation unit to calculate a hyperparameter used when calculating the control amount of the moving body, by inputting the feature amount calculated by the feature amount calculation unit into a machine learning model.
  2.  The moving body control device according to claim 1, further comprising:
      a control amount calculation unit to calculate the control amount of the moving body on a basis of the feature amount calculated by the feature amount calculation unit and the hyperparameter calculated by the hyperparameter calculation unit.
  3.  The moving body control device according to claim 1, further comprising:
      a data conversion unit to perform data conversion on a data element included in the sensor data acquired by the actual sensor data acquisition unit,
      wherein the feature amount calculation unit calculates the feature amount from the sensor data converted by the data conversion unit.
  4.  The moving body control device according to claim 1, further comprising:
      a teacher data acquisition unit to acquire teacher data in which the hyperparameter is attached to a feature amount calculated from the sensor data indicating the surrounding environment of the moving body; and
      a learning unit to generate, by learning using the teacher data acquired by the teacher data acquisition unit, the machine learning model that receives the feature amount as input and outputs the hyperparameter,
      wherein the hyperparameter calculation unit calculates the hyperparameter by inputting the feature amount calculated by the feature amount calculation unit into the machine learning model generated by the learning unit.
  5.  A moving body control method for calculating a control amount of a moving body from sensor data indicating a surrounding environment of the moving body, the moving body control method comprising:
      acquiring, by an actual sensor data acquisition unit, the sensor data;
      calculating, by a feature amount calculation unit, a feature amount from the sensor data acquired by the actual sensor data acquisition unit; and
      calculating, by a hyperparameter calculation unit, a hyperparameter used when calculating the control amount of the moving body, by inputting the feature amount calculated by the feature amount calculation unit into a machine learning model.
  6.  A learning device comprising:
      a teacher data acquisition unit to acquire teacher data in which a hyperparameter used when calculating a control amount of a moving body is attached to a feature amount calculated from sensor data indicating a surrounding environment of the moving body; and
      a learning unit to generate, by learning using the teacher data acquired by the teacher data acquisition unit, a machine learning model that receives the feature amount as input and outputs the hyperparameter.
PCT/JP2020/016019 2020-04-09 2020-04-09 Moving body control device, moving body control method, and learning device WO2021205616A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/JP2020/016019 WO2021205616A1 (en) 2020-04-09 2020-04-09 Moving body control device, moving body control method, and learning device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2020/016019 WO2021205616A1 (en) 2020-04-09 2020-04-09 Moving body control device, moving body control method, and learning device

Publications (1)

Publication Number Publication Date
WO2021205616A1 true WO2021205616A1 (en) 2021-10-14

Family

ID=78023265

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2020/016019 WO2021205616A1 (en) 2020-04-09 2020-04-09 Moving body control device, moving body control method, and learning device

Country Status (1)

Country Link
WO (1) WO2021205616A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2014106685A (en) * 2012-11-27 2014-06-09 Osaka Univ Vehicle periphery monitoring device
JP2019008460A (en) * 2017-06-22 2019-01-17 株式会社東芝 Object detection device and object detection method and program
JP2019028939A (en) * 2017-08-03 2019-02-21 日本電信電話株式会社 Estimation method and estimation device


Similar Documents

Publication Publication Date Title
JP7262503B2 (en) Method and apparatus, electronic device, computer readable storage medium and computer program for detecting small targets
US20210142146A1 (en) Intelligent image sensor stack
US11769059B2 (en) Systems and methods for distributed training of deep learning models
JP6605642B2 (en) Vehicle and system for managing and controlling vehicle
US20210129852A1 (en) Configuration of a Vehicle Based on Collected User Data
JP2020083308A (en) Real time prediction of object behavior
CN113033029A (en) Automatic driving simulation method and device, electronic equipment and storage medium
US11459028B2 (en) Adjusting vehicle sensitivity
WO2018168539A1 (en) Learning method and program
CN112784885B (en) Automatic driving method, device, equipment, medium and vehicle based on artificial intelligence
AU2018202380A1 (en) System and method for predictive condition modeling of asset fleets under partial information
US20220032932A1 (en) Image sensor for processing sensor data to reduce data traffic to host system
WO2021205616A1 (en) Moving body control device, moving body control method, and learning device
CN110574041A (en) Collaborative activation for deep learning domains
US20230196619A1 (en) Validation of virtual camera models
WO2021205615A1 (en) Teacher data generation apparatus and teacher data generation method
US20210374598A1 (en) A System and Method for Using Knowledge Gathered by a Vehicle
CN108960160B (en) Method and device for predicting structured state quantity based on unstructured prediction model
CN110719487B (en) Video prediction method and device, electronic equipment and vehicle
WO2021171398A1 (en) Inference device, driving assistance device, inference method, and server
JP6947091B2 (en) Driving support device, driving support method, driving support program, motion control device, motion control method, and motion control program
KR20220143326A (en) APPARATUS AND SYSTE FOR Real-time Pixel-wise Semantic Segmentation
CN114581876A (en) Method for constructing lane detection model under complex scene and method for detecting lane line
US11636339B2 (en) In-memory content classification and control
KR102631367B1 (en) Enforcement apparatus automatic registration of enforcement areas, initialization of zoom magnification data and real-time update for crackdown on illegal parking and thereof operation method

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20929995

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20929995

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: JP