JP2019215755A

JP2019215755A - Image recognition device, image recognition method, machine learning model providing device, machine learning model providing method, machine learning model generating method, and machine learning model device

Info

Publication number: JP2019215755A
Application number: JP2018113129A
Authority: JP
Inventors: ともえ大築; Tomoe Otsuki; 尚哉杉本; Naoya Sugimoto
Original assignee: Denso Ten Ltd
Current assignee: Denso Ten Ltd
Priority date: 2018-06-13
Filing date: 2018-06-13
Publication date: 2019-12-19
Anticipated expiration: 2038-06-13
Also published as: JP7177609B2

Abstract

To provide an image recognition device and an image recognition method which enable accurate image recognition of a recognition object irrespective of a relative speed between an imaging apparatus that images the recognition object and the recognition object.SOLUTION: An image recognition device includes an acquisition part, an identification part, a storage part and a determination part. The acquisition part acquires from an imaging apparatus, an image including an imaged recognition object. The identification part identifies a relative speed between the imaging apparatus and the recognition object. The storage part stores a machine learning model that derives from the image including the imaged recognition object, recognition result that a subject in the image is the recognition object, for each of speed ranges. The determination part adopts a machine learning model in a speed range according to the relative speed identified by the identification part, out of the plurality of machine learning models to determine the subject in the image.SELECTED DRAWING: Figure 1

Description

開示の実施形態は、画像認識装置、画像認識方法、機械学習モデル提供装置、機械学習モデル提供方法、機械学習モデル生成方法、および機械学習モデル装置に関する。 The disclosed embodiments relate to an image recognition device, an image recognition method, a machine learning model providing device, a machine learning model providing method, a machine learning model generation method, and a machine learning model device.

従来、複数の画像認識装置から画像認識処理に用いられた画像データを収集し、蓄積した画像データを学習データとして使用して画像認識処理についての機械学習を行い、画像認識処理に用いるパラメータを更新して画像認識装置へ提供するシステムがある。 Conventionally, image data used for image recognition processing is collected from a plurality of image recognition devices, machine learning about image recognition processing is performed using the stored image data as learning data, and parameters used for image recognition processing are updated. There is a system for providing the information to an image recognition device.

特開２０１５−１３５５５２号公報JP-A-2013-135552

しかしながら、従来の画像認識装置は、認識対象物を撮像する撮像装置と認識対象物との相対速度によっては、認識対象物を正確に画像認識することができないことがある。 However, the conventional image recognition device may not be able to accurately recognize the recognition target object depending on the relative speed between the imaging device that captures the recognition target object and the recognition target object.

実施形態の一態様は、上記に鑑みてなされたものであって、認識対象物を撮像する撮像装置と認識対象物との相対速度によらず、正確に認識対象物を画像認識することができる画像認識装置、画像認識方法、機械学習モデル提供装置、機械学習モデル提供方法、機械学習モデル生成方法、および機械学習モデル装置を提供することを目的とする。 An aspect of the embodiment is made in view of the above, and can accurately recognize an image of a recognition target object regardless of the relative speed between the imaging device that captures the recognition target object and the recognition target object. It is an object to provide an image recognition device, an image recognition method, a machine learning model providing device, a machine learning model providing method, a machine learning model generating method, and a machine learning model device.

実施形態の一態様に係る画像認識装置は、取得部と、判別部と、記憶部と、判定部とを備える。取得部は、撮像装置から認識対象物が撮像された画像を取得する。判別部は、前記撮像装置および前記認識対象物間の相対速度を判別する。記憶部は、前記認識対象物が撮像された画像から当該画像内の被写体が前記認識対象物であるという認識結果を導出する機械学習モデルを速度範囲毎に記憶する。判定部は、複数の前記機械学習モデルから前記判別部によって判別される前記相対速度に応じた前記速度範囲の前記機械学習モデルを採用して前記画像内の被写体を判定する。 An image recognition device according to an aspect of an embodiment includes an acquisition unit, a determination unit, a storage unit, and a determination unit. The acquisition unit acquires an image of the recognition target object captured from the imaging device. The determining unit determines a relative speed between the imaging device and the recognition target. The storage unit stores, for each speed range, a machine learning model that derives a recognition result indicating that a subject in the image is the recognition target object from an image of the recognition target object. The determining unit determines a subject in the image by using the machine learning model in the speed range corresponding to the relative speed determined by the determining unit from the plurality of machine learning models.

実施形態の一態様に係る画像認識装置、画像認識方法、機械学習モデル提供装置、機械学習モデル提供方法、機械学習モデル生成方法、および機械学習モデル装置は、認識対象物を撮像する撮像装置と認識対象物との相対速度によらず、正確に認識対象物を画像認識することができる。 An image recognition device, an image recognition method, a machine learning model providing device, a machine learning model providing method, a machine learning model generation method, and a machine learning model device according to one embodiment of the present invention are provided with an image capturing device that captures an object to be recognized and a recognition device. The image of the recognition target can be accurately recognized regardless of the relative speed with respect to the target.

図１は、実施形態に係る画像認識方法の概要を示す説明図である。FIG. 1 is an explanatory diagram illustrating an outline of an image recognition method according to the embodiment. 図２は、実施形態に係る画像認識装置の構成の一例を示すブロック図である。FIG. 2 is a block diagram illustrating an example of a configuration of the image recognition device according to the embodiment. 図３Ａは、実施形態に係る機械学習モデルの生成手順を示す説明図である。FIG. 3A is an explanatory diagram illustrating a procedure for generating a machine learning model according to the embodiment. 図３Ｂは、実施形態に係る機械学習モデルの生成手順を示す説明図である。FIG. 3B is an explanatory diagram illustrating a procedure for generating a machine learning model according to the embodiment. 図３Ｃは、実施形態に係る機械学習モデルの生成手順を示す説明図である。FIG. 3C is an explanatory diagram illustrating a procedure for generating a machine learning model according to the embodiment. 図４は、実施形態に係る画像認識処理の一例を示す説明図である。FIG. 4 is an explanatory diagram illustrating an example of an image recognition process according to the embodiment. 図５は、実施形態に係る画像認識装置の制御部が実行する処理の一例を示すフローチャートである。FIG. 5 is a flowchart illustrating an example of a process executed by the control unit of the image recognition device according to the embodiment.

以下、添付図面を参照して、画像認識装置、画像認識方法、機械学習モデル提供装置、機械学習モデル提供方法、機械学習モデル生成方法、および機械学習モデル装置の実施形態を詳細に説明する。なお、以下に示す実施形態によりこの発明が限定されるものではない。以下では、車両に搭載される撮像装置から取得する車両の周囲が撮像された画像に写る被写体を画像認識する画像認識装置および画像認識方法を例に挙げて説明する。 Hereinafter, embodiments of an image recognition device, an image recognition method, a machine learning model providing device, a machine learning model providing method, a machine learning model generating method, and a machine learning model device will be described in detail with reference to the accompanying drawings. In addition, this invention is not limited by embodiment shown below. Hereinafter, an image recognition device and an image recognition method for recognizing a subject appearing in a captured image of the periphery of the vehicle, which is acquired from an imaging device mounted on the vehicle, will be described as an example.

図１は、実施形態に係る画像認識方法の概要を示す説明図である。実施形態に係る画像認識装置１は、車両に搭載される撮像装置から車両の周囲が撮像された画像を取得する。ここでは、例えば、図１（ａ）に示すように、画像認識装置１が、撮像装置および画像認識装置１を搭載した車両（以下、「自車両」と記載する）の周囲を走行する他の車両１００が写った画像１０１を取得した場合を例に挙げて説明する。 FIG. 1 is an explanatory diagram illustrating an outline of an image recognition method according to the embodiment. The image recognition device 1 according to the embodiment acquires an image in which the periphery of a vehicle is imaged from an imaging device mounted on the vehicle. Here, for example, as shown in FIG. 1A, the image recognition device 1 is configured to travel around a vehicle equipped with the imaging device and the image recognition device 1 (hereinafter referred to as “own vehicle”). The case where the image 101 of the vehicle 100 is obtained will be described as an example.

他の車両１００のように移動する認識対象物が撮像される場合、画像１０１には、人間が目視した他の車両１００の形状とは若干異なる形状の他の車両１００が写る場合がある。例えば、シャッタースピードが比較的遅い撮像装置によって高速で移動する他の車両１００を撮像した場合、図１（ａ）に示すように、画像１０１中の他の車両１００の像にブレが発生することがある。 When an image of a recognition target that moves like another vehicle 100 is captured, the image 101 may show another vehicle 100 having a shape slightly different from the shape of the other vehicle 100 as seen by a human. For example, when an image of another vehicle 100 moving at a high speed is captured by an imaging device having a relatively slow shutter speed, blurring occurs in the image of the other vehicle 100 in the image 101 as shown in FIG. There is.

また、シャッタースピードが比較的遅い撮像装置によって横方向に高速で移動する被写体を撮像した場合、被写体が実際の形状よりも横方向に伸びた形状となって画像に写ることがある。 Further, when an image capturing apparatus having a relatively slow shutter speed captures an image of a subject moving at a high speed in the lateral direction, the subject may appear in the image as a shape extending in the horizontal direction more than the actual shape.

具体的には、撮像装置は、複数の撮像素子が行列状に配置される撮像領域の上の行に配置される撮像素子から順に駆動（露光）して１フレームの画像を撮像する。このため、撮像領域における上部分で撮像された被写体は、撮像領域における下部分で撮像されるときには既に横方向へ移動している。 Specifically, the imaging device sequentially drives (exposes) the imaging elements arranged in a row above the imaging region in which a plurality of imaging elements are arranged in a matrix to capture an image of one frame. For this reason, the subject imaged in the upper part of the imaging region has already moved in the horizontal direction when the image is captured in the lower part of the imaging region.

その結果、画像には、実際の形状よりも横方向に伸びた形状の被写体が写る所謂ローリングシャッター現象（以下、ローリング現象と記載する）が発生する。かかるローリング現象の程度やブレの大きさは、撮像装置と被写体との相対速度の違いによって変動する。 As a result, a so-called rolling shutter phenomenon (hereinafter, referred to as a rolling phenomenon) occurs in the image, in which a subject having a shape extending in a lateral direction more than the actual shape is photographed. The degree of the rolling phenomenon and the magnitude of the shake vary depending on the difference in the relative speed between the imaging device and the subject.

このため、例えば、他の車両１００が写った画像１０１から被写体は他の車両１００であるという画像認識結果を導出する機械学習モデルを生成するためには、ローリング現象の程度やブレの大きさが異なる膨大な枚数の画像を教材として使用した機械学習が必要となる。 For this reason, for example, in order to generate a machine learning model that derives an image recognition result that the subject is another vehicle 100 from an image 101 in which another vehicle 100 is captured, the degree of the rolling phenomenon and the magnitude of blurring must be reduced. Machine learning using a huge number of different images as teaching materials is required.

また、仮に膨大な枚数の画像を教材として使用した機械学習によって機械学習モデルを生成したとしても、他の車両１００が写った画像１０１から被写体が他の車両１００であるという正確な画像認識結果を導出できない場合もある。 Even if a machine learning model is generated by machine learning using an enormous number of images as teaching materials, an accurate image recognition result that the subject is another vehicle 100 is obtained from the image 101 in which the other vehicle 100 is captured. In some cases, it cannot be derived.

例えば、ローリング現象やブレが発生した画像中の車両を機械学習モデルによって画像認識する場合、その画像と類似した画像が機械学習の教材に含まれていたとしても、教材の中にはローリング現象やブレの程度が異なる画像も多く含まれている。 For example, when recognizing a vehicle in an image in which a rolling phenomenon or blurring has occurred using a machine learning model, even if an image similar to the image is included in the learning material for the machine learning, the learning material includes the rolling phenomenon or the like. Many images with different degrees of blur are also included.

このため、機械学習モデルによる判定では、画像中の被写体が車両であるという判定結果の尤度が低くなり、画像中の被写体が車両と判定されず、正確な画像認識結果を導出することができない場合がある。 For this reason, in the determination using the machine learning model, the likelihood of the determination result that the subject in the image is a vehicle is low, the subject in the image is not determined to be a vehicle, and an accurate image recognition result cannot be derived. There are cases.

そこで、実施形態に係る画像認識装置１は、画像１０１を取得すると、まず、撮像装置と認識対象物（ここでは、他の車両１００）との相対速度を判別する。そして、画像認識装置１は、例えば、相対速度が第１速度範囲、第２速度範囲、および第３速度範囲のどの範囲に含まれるかを判定する。 Therefore, when acquiring the image 101, the image recognition device 1 according to the embodiment first determines the relative speed between the imaging device and the recognition target (the other vehicle 100 in this case). Then, for example, the image recognition device 1 determines which of the first speed range, the second speed range, and the third speed range the relative speed is included in.

ここでは、第１速度範囲が最も遅い速度範囲であり、第２速度範囲が第１速度範囲よりも速く、第３速度範囲よりも遅い速度範囲であるものとする。画像認識装置１は、例えば、画像中における被写体の移動速度を相対速度として判別する。なお、移動速度の判別方法は、これに限定されるものではない。 Here, it is assumed that the first speed range is the slowest speed range, the second speed range is faster than the first speed range, and is slower than the third speed range. The image recognition device 1 determines, for example, a moving speed of a subject in an image as a relative speed. Note that the method of determining the moving speed is not limited to this.

このとき、画像認識装置１は、例えば、図１（ｂ）に示すように、相対速度が第２速度範囲に含まれると判別した場合、図１（ｃ）に示す画像認識処理を行う。具体的には、画像認識装置１は、一例として第１速度範囲用機械学習モデル１−１、第２速度範囲用機械学習モデル１−２、および第３速度範囲用機械学習モデル１−３という３つの機械学習モデルを備える。 At this time, when the image recognition device 1 determines that the relative speed is included in the second speed range, for example, as illustrated in FIG. 1B, the image recognition device 1 performs the image recognition process illustrated in FIG. Specifically, the image recognition device 1 includes, for example, a first speed range machine learning model 1-1, a second speed range machine learning model 1-2, and a third speed range machine learning model 1-3. It has three machine learning models.

第１速度範囲用機械学習モデル１−１は、例えば、被写体との相対速度が第１速度範囲に含まれる車両の画像認識に特化された機械学習モデルである。第２速度範囲用機械学習モデル１−２は、例えば、被写体との相対速度が第２速度範囲に含まれる車両の画像認識に特化された機械学習モデルである。 The first speed range machine learning model 1-1 is, for example, a machine learning model specialized for image recognition of a vehicle whose relative speed to a subject is included in the first speed range. The second speed range machine learning model 1-2 is, for example, a machine learning model specialized for image recognition of a vehicle whose relative speed to the subject is included in the second speed range.

第３速度範囲用機械学習モデル１−３は、例えば、被写体との相対速度が第３速度範囲に含まれる車両の画像認識に特化された機械学習モデルである。なお、画像認識装置１が備える機械学習モデルの数は、複数であれば３つに限定されるものではない。 The third speed range machine learning model 1-3 is, for example, a machine learning model specialized for image recognition of a vehicle whose relative speed to the subject is included in the third speed range. Note that the number of machine learning models provided in the image recognition device 1 is not limited to three as long as it is plural.

また、以下では、３つの機械学習モデルのうち、不特定の機械学習モデルを指す場合には、単に機械学習モデルと記載する。これら３つの機械学習モデルの生成手順の一例については、図３Ａ，図３Ｂ，図３Ｃを参照して後述する。 Further, hereinafter, when an unspecified machine learning model among three machine learning models is indicated, it is simply described as a machine learning model. An example of a procedure for generating these three machine learning models will be described later with reference to FIGS. 3A, 3B, and 3C.

画像認識装置１は、上記のように、被写体との相対速度が第２速度範囲に含まれると判別した場合、３つの機械学習モデルのうち、第２速度範囲用機械学習モデル１−２へ画像１０１を入力する。これにより、画像認識装置１は、被写体が他の車両１００であるという正確な画像認識結果１０２を導出することができる。 As described above, when the image recognition device 1 determines that the relative speed with respect to the subject is included in the second speed range, the image recognition device 1 sends the image to the second speed range machine learning model 1-2 among the three machine learning models. Enter 101. Thus, the image recognition device 1 can derive an accurate image recognition result 102 indicating that the subject is another vehicle 100.

次に、図２を参照し、実施形態に係る画像認識装置１の構成の一例について説明する。図２は、実施形態に係る画像認識装置１の構成の一例を示すブロック図である。図２に示すように、画像認識装置１は、撮像装置４１、レーダ４２、舵角センサ４３、および車両制御装置４４と接続される。 Next, an example of a configuration of the image recognition device 1 according to the embodiment will be described with reference to FIG. FIG. 2 is a block diagram illustrating an example of a configuration of the image recognition device 1 according to the embodiment. As shown in FIG. 2, the image recognition device 1 is connected to an imaging device 41, a radar 42, a steering angle sensor 43, and a vehicle control device 44.

なお、相対速度別画像集１１０は、認識対象物が写った複数枚の画像を含む画像集である。相対速度別画像集１１０に含まれる各画像は、画像に写っている被写体の種類、撮像時における被写体と撮像装置との相対速度を示す情報が付与されている。 Note that the image collection by relative speed 110 is an image collection including a plurality of images in which the recognition target object is captured. Each image included in the relative speed-specific image collection 110 is provided with information indicating the type of the subject shown in the image and the relative speed between the subject and the imaging device at the time of imaging.

かかる相対速度別画像集１１０は、画像認識処理の教師あり機械学習の教材として画像認識装置１によって使用される。相対速度別画像集１１０の一例については、図３Ａ、図３Ｂおよび図３Ｃを参照し、機械学習モデルの生成手順の説明と合わせて後述する。 The image collection 110 for each relative speed is used by the image recognition apparatus 1 as a teaching material of supervised machine learning of the image recognition processing. An example of the relative speed-specific image collection 110 will be described later with reference to FIGS. 3A, 3B, and 3C, together with a description of a procedure for generating a machine learning model.

撮像装置４１は、例えば、画像認識装置１が搭載される車両の前部に設置されて車両の前方を撮像する車載カメラである。なお、撮像装置４１は、車両の側部や後部に設けられ、車両の周囲を撮像する車載カメラであってもよい。撮像装置４１は、撮像した画像を画像認識装置１へ出力する。 The imaging device 41 is, for example, a vehicle-mounted camera that is installed at the front of a vehicle on which the image recognition device 1 is mounted and captures an image in front of the vehicle. Note that the imaging device 41 may be a vehicle-mounted camera that is provided on a side portion or a rear portion of the vehicle and captures an image around the vehicle. The imaging device 41 outputs the captured image to the image recognition device 1.

レーダ４２は、例えば、ミリ波の送信波を車両の周囲へ放射して物標に反射した送信波の反射波を受信し、送信波と受信波とに基づいて車両から物標までの距離、物標が存在する角度、車両に対する物標の相対速度を検知するミリ波レーダである。レーダ４２は、検知した相対速度を画像認識装置１へ出力する。舵角センサ４３は、車両の操舵角度を検知するセンサである。舵角センサ４３は、検知した操舵角度を画像認識装置１へ出力する。 The radar 42 receives, for example, a reflected wave of the transmission wave reflected on the target by radiating a millimeter wave transmission wave to the surroundings of the vehicle, and based on the transmission wave and the reception wave, a distance from the vehicle to the target, This is a millimeter wave radar that detects the angle at which the target exists and the relative speed of the target with respect to the vehicle. The radar 42 outputs the detected relative speed to the image recognition device 1. The steering angle sensor 43 is a sensor that detects a steering angle of the vehicle. The steering angle sensor 43 outputs the detected steering angle to the image recognition device 1.

車両制御装置４４は、例えば、車両全体を統括制御するＥＣＵ（Electronic Control Unit）である。車両制御装置４４は、例えば、画像認識装置１によって自車両の走行に支障をきたすような被写体が画像認識された場合に、車両の速度制御、操舵制御、および制動制御を行うことによって車両に障害物を回避させる。 The vehicle control device 44 is, for example, an ECU (Electronic Control Unit) that performs overall control of the entire vehicle. The vehicle control device 44 performs speed control, steering control, and braking control of the vehicle, for example, when the image recognition device 1 recognizes an image of a subject that interferes with the traveling of the vehicle, thereby causing an obstacle to the vehicle. Avoid objects.

また、画像認識装置１は、制御部２と記憶部３とを備える。記憶部３は、例えば、データフラッシュ等の情報記憶デバイスである。記憶部３は、第１速度範囲用機械学習モデル１−１、第２速度範囲用機械学習モデル１−２・・・第ｎ速度範囲用機械学習モデル１−ｎ（ｎは、３以上の自然数）というｎ個の機械学習モデルを記憶する。 Further, the image recognition device 1 includes a control unit 2 and a storage unit 3. The storage unit 3 is, for example, an information storage device such as a data flash. The storage unit 3 stores the first speed range machine learning model 1-1, the second speed range machine learning model 1-2,... The nth speed range machine learning model 1-n (n is a natural number of 3 or more) ) Are stored.

制御部２は、ＣＰＵ（Central Processing Unit）、ＲＯＭ（Read Only Memory）、ＲＡＭ（Random Access Memory）などを有するマイクロコンピュータや各種の回路を含む。 The control unit 2 includes a microcomputer having a CPU (Central Processing Unit), a ROM (Read Only Memory), a RAM (Random Access Memory), and various circuits.

制御部２は、ＣＰＵがＲＯＭに記憶されたプログラムを、ＲＡＭを作業領域として使用して実行することにより機能する学習部２１、取得部２２、判別部２３、および判定部２４を備える。なお、制御部２は、ＡＳＩＣ（Application Specific Integrated Circuit）やＦＰＧＡ（Field Programmable Gate Array）等のハードウェアで構成されてもよい。 The control unit 2 includes a learning unit 21, an acquisition unit 22, a determination unit 23, and a determination unit 24 that function by executing a program stored in the ROM by the CPU using the RAM as a work area. The control unit 2 may be configured by hardware such as an ASIC (Application Specific Integrated Circuit) or an FPGA (Field Programmable Gate Array).

制御部２が備える学習部２１、取得部２２、判別部２３、および判定部２４は、それぞれ以下に説明する情報処理の作用を実現または実行する。なお、制御部２の内部構成は、図２に示した構成に限られず、後述する情報処理を行う構成であれば他の構成であってもよい。 The learning unit 21, the acquisition unit 22, the determination unit 23, and the determination unit 24 included in the control unit 2 each realize or execute an operation of information processing described below. The internal configuration of the control unit 2 is not limited to the configuration illustrated in FIG. 2 and may be another configuration as long as the configuration performs information processing described later.

学習部２１は、相対速度別画像集１１０から各相対速度範囲および被写体の種類別に複数枚（例えば、数千枚）の画像を取得する。そして、学習部２１は、取得した画像を教師あり機械学習の教材として使用して各相対速度範囲および被写体の種類別に、それぞれ機械学習モデル（例えば、車両を画像認識するための第１速度範囲用機械学習モデル１−１等）を生成して記憶部３に記憶させる。 The learning unit 21 acquires a plurality (for example, thousands) of images for each relative speed range and for each type of subject from the relative speed-specific image collection 110. Then, the learning unit 21 uses the acquired image as a teaching material for supervised machine learning, and uses a machine learning model (for example, a first speed range for image recognition of a vehicle) for each relative speed range and for each type of subject. A machine learning model 1-1) is generated and stored in the storage unit 3.

ここで、図３Ａ、図３Ｂ、および図３Ｃを参照し、実施形態に係る機械学習モデルの生成手順の一例について説明する。図３Ａ、図３Ｂ、および図３Ｃは、実施形態に係る機械学習モデルの生成手順を示す説明図である。 Here, an example of a procedure for generating a machine learning model according to the embodiment will be described with reference to FIGS. 3A, 3B, and 3C. FIG. 3A, FIG. 3B, and FIG. 3C are explanatory diagrams illustrating a procedure for generating a machine learning model according to the embodiment.

ここでは、画像に含まれる車両を画像認識するための機械学習モデルを生成する場合について説明する。なお、学習部２１は、車両以外にも、オートバイ、自転車、歩行者等を画像認識するための機械学習モデルも生成することができる。 Here, a case will be described in which a machine learning model for recognizing a vehicle included in an image is generated. The learning unit 21 can also generate a machine learning model for image recognition of a motorcycle, a bicycle, a pedestrian, and the like, in addition to the vehicle.

図３Ａに示すように、学習部２１は、被写体と撮像装置４１との相対速度が最も遅い第１速度範囲用機械学習モデル１−１を生成する場合、相対速度別画像集１１０から第１速度範囲画像集１０-１の画像１１−１，１１−２等を取得する。 As illustrated in FIG. 3A, when the learning unit 21 generates the first speed range machine learning model 1-1 in which the relative speed between the subject and the imaging device 41 is the slowest, the learning unit 21 determines the first speed from the image collection 110 for each relative speed. The images 11-1 and 11-2 of the range image collection 10-1 are acquired.

第１速度範囲画像集１０-１に含まれる画像は、被写体と撮像装置４１との相対速度が最も遅いので、画像１１−１，１１−２等のように、ローリング現象やブレが発生していない。 Since the images included in the first speed range image collection 10-1 have the slowest relative speed between the subject and the imaging device 41, the rolling phenomenon and blurring occur as in the images 11-1 and 11-2. Absent.

学習部２１は、これらの画像の特徴を機械学習することにより、撮像装置に対する相対速度が第１速度範囲に含まれる車両が撮像された画像が入力された場合に、被写体は車両であるという画像認識結果を出力する第１速度範囲用機械学習モデル１−１を生成する。 The learning unit 21 machine-learns the features of these images, so that when an image of a vehicle whose relative speed with respect to the imaging device falls within the first speed range is input, the image that the subject is a vehicle is input. A first speed range machine learning model 1-1 for outputting a recognition result is generated.

また、図３Ｂに示すように、学習部２１は、被写体と撮像装置４１との相対速度が第１区度範囲の次に速い第２速度範囲用機械学習モデル１−２を生成する場合、相対速度別画像集１１０から第２速度範囲画像集１０−２の画像１２−１，１２−２等を取得する。 Further, as illustrated in FIG. 3B, the learning unit 21 generates the second speed range machine learning model 1-2 in which the relative speed between the subject and the imaging device 41 is the next highest after the first interval range. The images 12-1 and 12-2 of the second speed range image collection 10-2 are acquired from the speed-specific image collection 110.

第２速度範囲画像集１０−２に含まれる画像は、被写体と撮像装置４１との相対速度が第１速度範囲よりも速いので、画像１２−１，１２−２等のように、若干ではあるがローリング現象やブレが発生している。 The images included in the second speed range image collection 10-2 are slightly different, such as the images 12-1, 12-2, etc., because the relative speed between the subject and the imaging device 41 is faster than the first speed range. However, rolling phenomenon and blurring have occurred.

学習部２１は、これらの画像の特徴を機械学習することにより、撮像装置に対する相対速度が第２速度範囲に含まれる車両が撮像された画像が入力された場合に、被写体は車両であるという画像認識結果を出力する第２速度範囲用機械学習モデル１−２を生成する。 The learning unit 21 machine-learns the features of these images, so that when an image of a vehicle whose relative speed to the imaging device is included in the second speed range is input, the image that the subject is a vehicle is input. A second speed range machine learning model 1-2 that outputs a recognition result is generated.

また、図３Ｃに示すように、学習部２１は、被写体と撮像装置４１との相対速度が最も速い第ｎ速度範囲用機械学習モデル１−ｎを生成する場合、相対速度別画像集１１０から第ｎ速度範囲画像集１０−ｎの画像１３−１，１３−２等を取得する。 Further, as illustrated in FIG. 3C, when the learning unit 21 generates the n-th speed range machine learning model 1-n in which the relative speed between the subject and the imaging device 41 is the fastest, the learning unit 21 uses the The images 13-1 and 13-2 of the n-speed range image collection 10-n are acquired.

第ｎ速度範囲画像集１０−ｎに含まれる画像は、被写体と撮像装置４１との相対速度が最も速いので、画像１３−１，１３−２等のように、大きなブレや顕著なローリング現象が発生している。 Since the images included in the n-th speed range image collection 10-n have the highest relative speed between the subject and the imaging device 41, large blurring and a remarkable rolling phenomenon as in the images 13-1 and 13-2 and the like occur. It has occurred.

学習部２１は、これらの画像の特徴を機械学習することにより、撮像装置に対する相対速度が第ｎ速度範囲に含まれる車両が撮像された画像が入力された場合に、被写体は車両であるという画像認識結果を出力する第ｎ速度範囲用機械学習モデル１−ｎを生成する。 The learning unit 21 machine-learns the features of these images, so that when an image of a vehicle whose relative speed with respect to the imaging device is included in the n-th speed range is input, the image that the subject is a vehicle is obtained. An n-th speed range machine learning model 1-n for outputting a recognition result is generated.

このように、学習部２１は、相対速度の速度範囲毎に入力される各速度範囲内の相対速度で移動する種類が既知の認証対象が撮像された複数の画像に基づいて、速度範囲毎の機械学習モデルを生成する。これにより、各速度範囲用機械学習モデルは、各速度範囲に含まれない相対速度で移動する認証対象が撮像された画像の影響が反映されることがない。 As described above, the learning unit 21 performs, for each of the speed ranges, an authentication target of a known type that moves at a relative speed within each of the speed ranges input for each of the speed ranges. Generate a machine learning model. As a result, each speed range machine learning model does not reflect the influence of the image of the authentication target moving at a relative speed that is not included in each speed range.

また、学習部２１は、速度範囲毎に画像を機械学習することにより、教材となる画像から被写体の速度を識別学習する必要がなくため、教材として使用する画像の枚数を大幅に低減することができる。 The learning unit 21 does not need to identify and learn the speed of the subject from the image serving as the teaching material by machine learning the image for each speed range, so that the number of images used as the teaching material can be significantly reduced. it can.

なお、上記した機械学習モデルの生成手順は、フレームレート（シャッタースピード）が固定であるものとして説明したが、シャッタースピードが可変の撮像装置４１の場合、速度範囲毎に機械学習する場合の各速度範囲をシャッタースピードに応じて正規化する。 The above-described procedure for generating the machine learning model has been described assuming that the frame rate (shutter speed) is fixed. However, in the case of the imaging device 41 having a variable shutter speed, each speed in the case of performing machine learning for each speed range is described. Normalize the range according to the shutter speed.

例えば、あるフレームレートを基準フレームレートとして設定しておき、撮像装置４１のフレームレートが基準フレームレートの場合には、基準フレームレートに対応する基準速度範囲に係数として１を乗算した速度範囲とする。 For example, a certain frame rate is set as a reference frame rate, and when the frame rate of the imaging device 41 is the reference frame rate, a speed range obtained by multiplying a reference speed range corresponding to the reference frame rate by 1 as a coefficient is set. .

フレームレートが基準フレームレート以外の場合には、基準速度範囲に係数（基準フレームレート／撮像フレームレート）を乗算した速度範囲とする。これにより、学習部２１は、撮像装置４１に設定されるフレームレートに応じた適切な速度範囲毎に画像の機械学習モデルを生成することができる。 If the frame rate is other than the reference frame rate, the speed range is obtained by multiplying the reference speed range by a coefficient (reference frame rate / imaging frame rate). Accordingly, the learning unit 21 can generate a machine learning model of an image for each appropriate speed range according to the frame rate set in the imaging device 41.

また、機械学習の教材として使用される画像は、ズームの倍率によって画像内を移動する認識対象の移動速度が変動する。このため、教材に使用される画像は、ズームの倍率を自車両から画像中の認識対象までの距離が基準距離となるように正規化しておくことが望ましい。 In an image used as a learning material for machine learning, the moving speed of a recognition target that moves in the image varies depending on the zoom magnification. For this reason, it is desirable to normalize the zoom magnification of the image used for the teaching material so that the distance from the host vehicle to the recognition target in the image becomes the reference distance.

また、画像認識装置１（後述の判定部２４）が実際に画像認識する画像についても、同様にズームの倍率を自車両から画像中の認識対象までの距離が基準距離となるように正規化しておくことが望ましい。なお、画像のズームを正規化する場合に使用する認識対象までの距離は、レーダ４２から取得することが可能である。 Similarly, for an image actually recognized by the image recognition device 1 (the determination unit 24 described later), the zoom magnification is normalized such that the distance from the vehicle to the recognition target in the image becomes the reference distance. It is desirable to keep. The distance to the recognition target used when normalizing the zoom of the image can be acquired from the radar 42.

図２へ戻り、制御部２が備える学習部２１以外の処理部の説明を進める。取得部２２は、撮像装置４１から車両周辺が撮像された画像を順次取得して判別部２３へ出力する。判別部２３は、取得部２２から入力される画像の被写体と撮像装置４１との相対速度を判別する。 Returning to FIG. 2, the description of the processing units other than the learning unit 21 included in the control unit 2 proceeds. The acquisition unit 22 sequentially acquires images of the surroundings of the vehicle from the imaging device 41 and outputs the images to the determination unit 23. The determination unit 23 determines the relative speed between the subject of the image input from the acquisition unit 22 and the imaging device 41.

判別部２３は、画像中における被写体の移動速度を被写体と撮像装置４１との相対速度として判別する。これにより、判別部２３は、被写体と撮像装置４１との物理的な位置関係による相対速度よりも、被写体のブレやローリング現象に直接的に影響する相対速度を判別することができる。 The determination unit 23 determines the moving speed of the subject in the image as the relative speed between the subject and the imaging device 41. Accordingly, the determination unit 23 can determine the relative speed that directly affects the blurring and the rolling phenomenon of the subject, rather than the relative speed based on the physical positional relationship between the subject and the imaging device 41.

判別部２３は、例えば、撮像装置４１によって連続して撮像される複数フレームの画像間における被写体の位置の移動速度を被写体と撮像装置４１との相対速度として判別する。これにより、判別部２３は、画像中における被写体と撮像装置４１との正確な相対速度を判別することができる。 The determination unit 23 determines, for example, the moving speed of the position of the subject between the images of a plurality of frames continuously captured by the imaging device 41 as the relative speed between the subject and the imaging device 41. Thereby, the determination unit 23 can determine an accurate relative speed between the subject and the imaging device 41 in the image.

このとき、撮像装置４１から入力される画像は、ズームの倍率によって画像内を移動する認識対象の移動速度が変動する。このため、判別部２３は、撮像装置４１から入力される画像のズームの倍率を自車両から画像中の認識対象までの距離が基準距離となるように正規化しておくことが望ましい。判別部２３は、画像のズームを正規化する場合に使用する認識対象までの距離をレーダ４２から取得することが可能である。 At this time, in the image input from the imaging device 41, the moving speed of the recognition target moving in the image varies depending on the zoom magnification. For this reason, it is desirable that the determination unit 23 normalizes the zoom magnification of the image input from the imaging device 41 so that the distance from the host vehicle to the recognition target in the image becomes the reference distance. The determination unit 23 can acquire the distance to the recognition target used when normalizing the zoom of the image from the radar 42.

また、連続するフレーム間で認識対象が写る位置は、撮像装置４１のフレームレート（シャッタースピード）によっても変わる。このため、判別部２３は、撮像装置４１のフレームレートが可変の場合には、撮像装置４１に設定されているフレームレートに応じて認証対象の移動速度を正規化することが望ましい。 Further, the position where the recognition target appears between consecutive frames also changes depending on the frame rate (shutter speed) of the imaging device 41. Therefore, when the frame rate of the imaging device 41 is variable, it is preferable that the determination unit 23 normalizes the moving speed of the authentication target according to the frame rate set in the imaging device 41.

また、判別部２３は、例えば、レーダ４２から入力される車両と被写体との相対速度を被写体と撮像装置４１との相対速度として判別することもできる。これにより、判別部２３は、相対速度を判別する処理を行うことなく、被写体と撮像装置４１との相対速度を判別することができる。 The determination unit 23 can also determine, for example, the relative speed between the vehicle and the subject input from the radar 42 as the relative speed between the subject and the imaging device 41. Accordingly, the determination unit 23 can determine the relative speed between the subject and the imaging device 41 without performing the process of determining the relative speed.

ただし、判別部２３は、レーダ４２から入力される相対速度を被写体と撮像装置４１との相対速度として判別すると、画像中における被写体の正確な移動速度を判別できない場合がある。例えば、車両が交差点を右左折する場合、撮像装置４１の撮像方向が急に変わるので、被写体と撮像装置４１との相対速度よりも速い移動速度で被写体が画像中を移動することがある。 However, if the determination unit 23 determines the relative speed input from the radar 42 as the relative speed between the subject and the imaging device 41, the determination unit 23 may not be able to determine the accurate moving speed of the subject in the image. For example, when the vehicle makes a right or left turn at an intersection, the imaging direction of the imaging device 41 changes suddenly, so that the subject may move in the image at a moving speed higher than the relative speed between the subject and the imaging device 41.

このため、判別部２３は、レーダ４２から入力される相対速度を被写体と撮像装置４１との相対速度として判別する場合、撮像装置４１による撮像方向の変化速度に基づいて、レーダ４２から入力される相対速度を補正する。 Therefore, when determining the relative speed input from the radar 42 as the relative speed between the subject and the imaging device 41, the determination unit 23 receives the input from the radar 42 based on the speed of change in the imaging direction by the imaging device 41. Correct the relative speed.

例えば、判別部２３は、舵角センサ４３から入力される車両の操舵角に基づいて、車両の進行方向の変化速度、つまり、撮像方向の変化速度を推定し、推定した撮像方向の変化速度に基づいて、レーダ４２から入力される相対速度を補正する。 For example, the determination unit 23 estimates a change speed in the traveling direction of the vehicle, that is, a change speed in the imaging direction based on the steering angle of the vehicle input from the steering angle sensor 43, and determines the change speed in the imaging direction. Based on this, the relative speed input from the radar 42 is corrected.

これにより、判別部２３は、画像中における被写体の正確な移動速度に近い移動速度を被写体と撮像装置４１との相対速度として判別することができる。そして、判別部２３は、判別した被写体と撮像装置４１との相対速度と、撮像装置４１から入力された画像とを判定部２４へ出力する。 Accordingly, the determination unit 23 can determine the moving speed close to the accurate moving speed of the subject in the image as the relative speed between the subject and the imaging device 41. Then, the determination unit 23 outputs the determined relative speed between the subject and the imaging device 41 and the image input from the imaging device 41 to the determination unit 24.

判定部２４は、判別部２３から入力される画像中の被写体を画像認識する場合、記憶部３に記憶された複数の機械学習モデルのうち、判別部２３から入力される相対速度を含む速度範囲用の機械学習モデルを採用して被写体を判定する。 When recognizing the subject in the image input from the determination unit 23, the determination unit 24 includes a speed range including the relative speed input from the determination unit 23 among the plurality of machine learning models stored in the storage unit 3. The subject is determined by employing a machine learning model for use.

ここで、図４を合わせて参照しながら、判定部２４による画像認識処理の一例について説明する。図４は、実施形態に係る画像認識処理の一例を示す説明図である。図４（ａ）に示すように、判定部２４は、例えば、ブレが発生した車両の像１０３が写った画像１０４が判別部２３から入力される場合、画像１０４を複数の分割領域に分割する。 Here, an example of the image recognition processing by the determination unit 24 will be described with reference to FIG. FIG. 4 is an explanatory diagram illustrating an example of an image recognition process according to the embodiment. As illustrated in FIG. 4A, for example, when the image 104 including the image 103 of the blurred vehicle is input from the determination unit 23, the determination unit 24 divides the image 104 into a plurality of divided regions. .

そして、判定部２４は、判別部２３から入力される相対速度が第２速度範囲に含まれていた場合、図４（ｂ）に示すように、第２速度範囲用機械学習モデル１−２を採用する。第２速度範囲用機械学習モデル１−２は、概念的には、図４（ｂ）に示すように、入力層２ａ、中間層２ｂ、および出力層２ｃという３層構造の処理層を備える。なお、図４（ａ）では、中間層２ｂが１層である場合を示しているが、中間層２ｂは複数層設けられてもよい。 Then, when the relative speed input from the determination unit 23 is included in the second speed range, the determination unit 24 generates the second speed range machine learning model 1-2 as shown in FIG. adopt. The second speed range machine learning model 1-2 conceptually includes a processing layer having a three-layer structure of an input layer 2a, an intermediate layer 2b, and an output layer 2c, as shown in FIG. 4B. Although FIG. 4A shows a case where the intermediate layer 2b is a single layer, a plurality of intermediate layers 2b may be provided.

入力層２ａ、中間層２ｂ、および出力層２ｃは、各分割領域の画像が車両の一部か否かの判定を並行して行う複数のノード２ｚを備える。入力層２ａの各ノード２ｚは、それぞれ中間層２ｂの全ノード２ｚと接続される。中間層２ｂの各ノード２ｚは、それぞれ出力層２ｃの全ノード２ｚと接続される。このように、第２速度範囲用機械学習モデル１−２は、ノード２ｚ同士が接続されたニューラルネットワーク構造となっている。 The input layer 2a, the intermediate layer 2b, and the output layer 2c include a plurality of nodes 2z that determine in parallel whether or not the image of each divided region is a part of the vehicle. Each node 2z of the input layer 2a is connected to all nodes 2z of the intermediate layer 2b. Each node 2z of the intermediate layer 2b is connected to all nodes 2z of the output layer 2c. Thus, the second speed range machine learning model 1-2 has a neural network structure in which the nodes 2z are connected to each other.

判定部２４は、画像１０４の各分割領域の画像を入力層２ａの各ノード２ｚへ入力する。入力層２ａの各ノード２ｚは、順次入力される各分割領域の画像が車両の一部か否かの判定を行い、車両の一部である可能性が高い程高い重み付けをした判定結果を中間層２ｂのノード２ｚへ出力する。 The determination unit 24 inputs the image of each divided region of the image 104 to each node 2z of the input layer 2a. Each node 2z of the input layer 2a determines whether or not the sequentially input image of each divided region is a part of the vehicle, and intermediately determines the weighted determination result as the possibility that the image is a part of the vehicle is higher. Output to the node 2z of the layer 2b.

中間層２ｂの各ノード２ｚも同様に、各分割領域の画像が車両の一部か否かの判定を行い、車両の一部である可能性が高い程高い重み付けをした判定結果を出力層２ｃのノード２ｚへ出力する。 Similarly, each node 2z of the intermediate layer 2b also determines whether or not the image of each divided region is a part of the vehicle, and outputs a higher weighted determination result as the possibility of being a part of the vehicle is higher than the output layer 2c. To the node 2z.

出力層２ｃの各ノード２ｚは、各分割領域の画像が車両の一部か否かの判定を行い、分割領域毎の判定結果２ｄを出力する。なお、図４（ｂ）には、画像１０４における下から４行目の分割領域の判定結果２ｄを示している。 Each node 2z of the output layer 2c determines whether or not the image of each divided region is a part of the vehicle, and outputs a determination result 2d for each divided region. FIG. 4B shows a determination result 2d of the divided region in the fourth row from the bottom in the image 104.

図４（ｂ）に示す例では、第２速度範囲用機械学習モデル１−２は、画像１０４における下から４行目の分割領域について、左右両端の分割領域の画像以外を車両（車両の一部）と判定している。 In the example illustrated in FIG. 4B, the second speed range machine learning model 1-2 uses the vehicle (one of the vehicles) for the divided region on the fourth line from the bottom in the image 104 other than the images of the divided regions on the left and right ends. Part).

第２速度範囲用機械学習モデル１−２は、画像１０４における全分割領域の画像について車両の一部か否かの判定を行う。判定部２４は、かかる分割領域毎の判定結果２ｄに基づいて、図４（ｃ）に示すように、被写体は車両であるという画像認識結果１０２を導出する。 The second speed range machine learning model 1-2 determines whether or not the image of all the divided regions in the image 104 is a part of the vehicle. The determination unit 24 derives an image recognition result 102 indicating that the subject is a vehicle, as shown in FIG. 4C, based on the determination result 2d for each divided region.

このとき、第２速度範囲用機械学習モデル１−２は、撮像装置４１に対する相対速度が第２速度範囲内の被写体が撮像された画像だけを教材とした機械学習によって生成されているため、被写体は車両であるという正確な画像認識結果を導出することができる。 At this time, since the second speed range machine learning model 1-2 is generated by machine learning using only an image of a subject whose relative speed with respect to the imaging device 41 is within the second speed range as a learning material, Can derive an accurate image recognition result indicating that the vehicle is a vehicle.

その後、判定部２４は、画像認識した車両が自車両の走行に支障をきたすか否かを判定する。判定部２４は、画像認識した車両の自車両に対する位置に基づいて自車両の走行に支障をきたすか否かを判定する。 Thereafter, the determination unit 24 determines whether the vehicle whose image has been recognized interferes with the traveling of the own vehicle. The determining unit 24 determines whether or not the traveling of the own vehicle is hindered based on the position of the vehicle with respect to the own vehicle whose image has been recognized.

なお、判定部２４は、車両以外にも、歩行者や路面上の落下物等の被写体を画像認識した場合にも、被写体が自車両の走行に支障をきたすか否かを判定する。そして、判定部２４は、画像認識したものが自車両の走行に支障をきたすと判定した場合には、画像認識したものの位置を示す情報を車両制御装置４４へ出力する。 The determination unit 24 determines whether or not the subject interferes with the traveling of the own vehicle when the subject such as a pedestrian or a falling object on a road surface is image-recognized in addition to the vehicle. Then, when the determination unit 24 determines that the image-recognized object interferes with the traveling of the own vehicle, the determination unit 24 outputs information indicating the position of the image-recognized object to the vehicle control device 44.

車両制御装置４４は、判定部２４から画像認識されたものの位置を示す情報が入力される場合に、車両の速度制御、操舵制御、および制動制御を行うことによって自車両に危険を回避させる。 When the information indicating the position of the image-recognized one is input from the determination unit 24, the vehicle control device 44 performs the speed control, the steering control, and the braking control of the vehicle to make the own vehicle avoid danger.

次に、図５を参照し、実施形態に係る画像認識装置１の制御部２が実行する処理の一例について説明する。図５は、実施形態に係る画像認識装置１の制御部２が実行する処理の一例を示すフローチャートである。 Next, an example of a process performed by the control unit 2 of the image recognition device 1 according to the embodiment will be described with reference to FIG. FIG. 5 is a flowchart illustrating an example of a process executed by the control unit 2 of the image recognition device 1 according to the embodiment.

制御部２は、撮像装置４１によって車両の周囲の画像が撮像される毎に、図５に示す処理を実行する。具体的には、制御部２は、撮像装置４１によって車両の周囲の画像が撮像されると、まず、撮像装置４１から画像を取得する（ステップＳ１０１）。 The control unit 2 executes the process shown in FIG. 5 every time an image around the vehicle is captured by the imaging device 41. Specifically, when an image around the vehicle is captured by the imaging device 41, the control unit 2 first acquires an image from the imaging device 41 (step S101).

続いて、制御部２は、撮像装置４１と認識対象物との相対速度を判定する（ステップＳ１０２）。その後、制御部２は、判別した相対速度を含む速度範囲の機械学習モデルを選択する（ステップＳ１０３）。 Subsequently, the control unit 2 determines a relative speed between the imaging device 41 and the recognition target (Step S102). Thereafter, the control unit 2 selects a machine learning model in a speed range including the determined relative speed (Step S103).

そして、制御部２は、選択した機械学習モデルへ画像を入力し（ステップＳ１０４）、機械学習モデルの出力に基づいて被写体を判定する（ステップＳ１０５）。続いて、制御部２は、車両制御装置４４への通知が必要か否かを判定する（ステップＳ１０６）。 Then, the control unit 2 inputs an image to the selected machine learning model (step S104), and determines a subject based on the output of the machine learning model (step S105). Subsequently, the control unit 2 determines whether or not the notification to the vehicle control device 44 is necessary (Step S106).

このとき、制御部２は、被写体の車両に対する位置および被写体の種類等に基づいて、被写体が自車両の走行に支障をきたすと判定する場合には通知が必要と判定し、支障をきたさないと判定する場合には、通知が必要でないと判定する。 At this time, based on the position of the subject with respect to the vehicle, the type of the subject, and the like, the control unit 2 determines that the notification is necessary when it is determined that the subject interferes with the traveling of the own vehicle, and determines that there is no obstacle. When determining, it is determined that notification is not necessary.

そして、制御部２は、通知が必要でないと判定した場合（ステップＳ１０６，Ｎｏ）、処理を終了する。また、制御部２は、通知が必要と判定した場合には（ステップＳ１０６，Ｙｅｓ）、被写体の車両に対する位置を車両制御装置４４へ通知して（ステップＳ１０７）、処理を終了する。 Then, when the control unit 2 determines that the notification is not necessary (Step S106, No), the process ends. When it is determined that the notification is necessary (Step S106, Yes), the control unit 2 notifies the vehicle control device 44 of the position of the subject with respect to the vehicle (Step S107), and ends the process.

なお、上述した実施形態では、車両に搭載される撮像装置から取得する車両の周囲が撮像された画像に写る被写体を画像認識する画像認識装置および画像認識方法を例に挙げて説明したが、これは一例である。 In the above-described embodiment, the image recognition apparatus and the image recognition method for recognizing a subject appearing in a captured image of the periphery of the vehicle acquired from the imaging device mounted on the vehicle have been described as examples. Is an example.

例えば、上述した画像認識装置１が備える制御部２および記憶部３の機能を複数の車両と無線通信可能なセンタに設けられる情報提供装置に持たせてもよい。かかる場合、情報提供装置は、複数の車両から相対速度別の画像を取得して機械学習し、第１〜第ｎ速度範囲用機械学習モデル１−１〜１―ｎを作成して記憶する。 For example, the functions of the control unit 2 and the storage unit 3 included in the image recognition device 1 described above may be provided in an information providing device provided in a center that can wirelessly communicate with a plurality of vehicles. In such a case, the information providing apparatus acquires images for each relative speed from a plurality of vehicles and performs machine learning, and creates and stores the first to n-th speed range machine learning models 1-1 to 1-n.

尚、第１〜第ｎ速度範囲用機械学習モデル１−１〜１―ｎには、例えばデータヘッダ部に付加する等の方法により各速度範囲用機械学習モデルの識別データが付加され、画像認識処理装置等による画像認識処理における各速度範囲用機械学習モデルの選択処理に用いられることになる。 Note that the identification data of each speed range machine learning model is added to the first to n-th speed range machine learning models 1-1 to 1-n by, for example, a method of adding the data learning section to a data header section, and image recognition is performed. It is used for selecting a machine learning model for each speed range in image recognition processing by a processing device or the like.

情報提供装置は、各車両から撮像された判定用の画像を受信する場合に、上述した画像認識装置１と同様の手法によって、第１〜第ｎ速度範囲用機械学習モデル１−１〜１―ｎから選択した機械学習モデルを採用して認識対象を判定する。そして、情報提供装置は、認識対象の判定結果を画像の送信元となる車両へ返信する。 The information providing apparatus, when receiving an image for determination captured from each vehicle, by the same method as the image recognition apparatus 1 described above, uses the first to n-th speed range machine learning models 1-1 to 1-1- The recognition target is determined using the machine learning model selected from n. Then, the information providing device returns the determination result of the recognition target to the vehicle that is the transmission source of the image.

また、情報提供装置は、上述した画像認識装置１が備える学習部２１および記憶部３の機能を備える構成であってもよい。かかる構成の場合、情報提供装置は、複数の車両から相対速度別の画像を取得して機械学習し、第１〜第ｎ速度範囲用機械学習モデル１−１〜１−ｎを作成して記憶する。 Further, the information providing device may be configured to include the functions of the learning unit 21 and the storage unit 3 included in the image recognition device 1 described above. In the case of such a configuration, the information providing apparatus obtains images according to relative speeds from a plurality of vehicles and performs machine learning, creates and stores machine learning models 1-1 to 1-n for the first to n-th speed ranges. I do.

そして、情報提供装置は、車両に設けられて画像内の被写体を判定する画像認識装置から指定された速度情報に応じた速度範囲の機械学習モデルを第１〜第ｎ速度範囲用機械学習モデル１−１〜１−ｎから選択して車両の画像認識装置へ提供する。かかる情報提供装置によれば、車両側の処理負荷を大幅に低減することができる。 The information providing apparatus is configured to provide a machine learning model having a speed range corresponding to the speed information designated by the image recognition device provided in the vehicle and determining a subject in the image to the first to n-th speed range machine learning models 1. -1 to 1-n to provide to the vehicle image recognition device. According to such an information providing device, the processing load on the vehicle can be significantly reduced.

また、実施形態に係る画像認識装置および画像認識方法は、例えば、ベルトコンベアによって搬送される製品の画像を撮像するカメラや防犯カメラ等、任意の撮像装置によって任意の場所で撮像される画像に写る被写体の画像認識に適用することができる。 In addition, the image recognition device and the image recognition method according to the embodiment are, for example, captured in an image captured at an arbitrary location by an arbitrary imaging device, such as a camera or a security camera that captures an image of a product conveyed by a belt conveyor. It can be applied to image recognition of a subject.

また、上述した実施形態では、画像認識装置１が学習部２１を備える場合について説明したが、実施形態に係る画像認識装置は、必ずしも学習部２１を備えていなくてもよい。画像認識装置は、他の機械学習装置によって生成された第１速度範囲用機械学習モデル１−１、第２速度範囲用機械学習モデル１−２・・・第ｎ速度範囲用機械学習モデル１−ｎを記憶しておくことで上述した画像認識処理を行うことができる。 Further, in the above-described embodiment, the case where the image recognition device 1 includes the learning unit 21 has been described. However, the image recognition device according to the embodiment does not necessarily need to include the learning unit 21. The image recognition device includes a first speed range machine learning model 1-1, a second speed range machine learning model 1-2,... An nth speed range machine learning model 1 generated by another machine learning device. By storing n, the above-described image recognition processing can be performed.

さらなる効果や変形例は、当業者によって容易に導き出すことができる。このため、本発明のより広範な態様は、以上のように表しかつ記述した特定の詳細および代表的な実施形態に限定されるものではない。したがって、添付の特許請求の範囲およびその均等物によって定義される総括的な発明の概念の精神または範囲から逸脱することなく、様々な変更が可能である。 Further effects and modifications can be easily derived by those skilled in the art. Thus, the broader aspects of the present invention are not limited to the specific details and representative embodiments shown and described above. Accordingly, various modifications can be made without departing from the spirit or scope of the general inventive concept as defined by the appended claims and their equivalents.

１画像認識装置
２制御部
２１学習部
２２取得部
２３判別部
２４判定部
３記憶部
１−１第１速度範囲用機械学習モデル
１−２第２速度範囲用機械学習モデル
１−３第３速度範囲用機械学習モデル
４１撮像装置
４２レーダ
４３舵角センサ
４４車両制御装置
１１０相対速度別画像集 DESCRIPTION OF SYMBOLS 1 Image recognition apparatus 2 Control part 21 Learning part 22 Acquisition part 23 Judgment part 24 Judgment part 3 Storage part 1-1 Machine learning model for 1st speed range 1-2 Machine learning model for 2nd speed range 1-3 Third speed Range machine learning model 41 Imaging device 42 Radar 43 Steering angle sensor 44 Vehicle control device 110 Image collection by relative speed

Claims

An acquisition unit configured to acquire an image in which the recognition target is imaged from the imaging device;
A determining unit that determines a relative speed between the imaging device and the recognition target object,
A storage unit that stores, for each speed range, a machine learning model that derives a recognition result that a subject in the image is the recognition target object from an image in which the recognition target object is captured,
A determination unit that determines a subject in the image by using the machine learning model in the speed range according to the relative speed determined by the determination unit from the plurality of machine learning models. Image recognition device.

The determination unit includes:
The image recognition device according to claim 1, wherein the relative speed is determined based on a moving speed of the subject in the image.

The determination unit includes:
The image recognition device according to claim 2, wherein the relative speed is determined based on a moving speed of the position of the subject between a plurality of frames of images continuously captured by the imaging device.

The determination unit includes:
The image recognition device according to claim 1, wherein the relative speed is corrected based on a change speed of an imaging direction by the imaging device.

The machine learning model for each speed range based on a plurality of images in which the type of recognition target moving at the relative speed within each speed range input for each relative speed range is imaged. The image recognition device according to claim 1, further comprising: a learning unit configured to generate the image.

An acquisition step of acquiring an image in which the recognition target is imaged from the imaging device,
A determining step of determining a relative speed between the imaging device and the recognition target;
A storage step of storing, for each speed range, a machine learning model that derives a recognition result that a subject in the image is the recognition target object from an image in which the recognition target object is captured,
Determining a subject in the image by employing the machine learning model in the speed range according to the relative speed determined in the determining step from the plurality of machine learning models. Image recognition method.

A storage unit that stores, for each speed range, a machine learning model that derives a recognition result that a subject in the image is the recognition target object from an image in which the recognition target object is captured by the imaging device.
A providing unit that selects the machine learning model in the speed range according to the speed information designated by the image recognition device that determines a subject in an image from a plurality of the machine learning models and provides the selected machine learning model to the image recognition device. An apparatus for providing a machine learning model, comprising:

A storage step of storing, for each speed range, a machine learning model that derives a recognition result that a subject in the image is the recognition target from an image of the recognition target captured by the imaging device.
Providing the selected machine learning model in the speed range according to the speed information specified by the image recognition device that determines the subject in the image from a plurality of the machine learning models and providing the selected machine learning model to the image recognition device. A method for providing a machine learning model, the method comprising:

A method for generating a machine learning model used by an image recognition device for image recognition processing,
A model generation step of generating a machine learning model for each speed range by learning an image of a moving object moving at a speed of each speed range in a plurality of speed ranges,
An identification data adding step of adding identification data for selecting and using the machine learning model for each speed range,
A machine learning model generation method, comprising:

An image recognition device is a machine learning model device used for image recognition processing,
Machine learning model unit for each of a plurality of speed ranges generated by learning the image of the moving body moving at the speed of each speed range in the plurality of speed ranges,
An identification data unit added to the speed range machine learning model unit to select and use the speed range machine learning model unit;
A machine learning model device comprising: