JP6858101B2

JP6858101B2 - Coordinate detector and trained model

Info

Publication number: JP6858101B2
Application number: JP2017166420A
Authority: JP
Inventors: 正義林; 貴彦深澤
Original assignee: PFU Ltd
Current assignee: PFU Ltd
Priority date: 2017-08-31
Filing date: 2017-08-31
Publication date: 2021-04-14
Anticipated expiration: 2037-08-31
Also published as: JP2019046007A

Description

本発明は、座標検出装置及び学習済みモデルに関する。 The present invention relates to a coordinate detector and a trained model.

マシンビジョン技術の発展や、スマートデバイス等のカメラ搭載デバイスの普及に伴って、撮影された物体の形状を精度良く検出することが求められている。 With the development of machine vision technology and the widespread use of camera-mounted devices such as smart devices, it is required to accurately detect the shape of a photographed object.

撮影された物体の形状を検出する方法として、特徴点抽出を用いた検出方法がある。しかし、特徴点抽出を用いた検出方法では、検出の対象物毎に、抽出する特徴点の見直しや、抽出の際に使用する閾値の調節等が必要になるため、オペレータの作業負荷が大きくなる。 As a method of detecting the shape of a photographed object, there is a detection method using feature point extraction. However, in the detection method using feature point extraction, it is necessary to review the feature points to be extracted and adjust the threshold value used for extraction for each object to be detected, which increases the workload of the operator. ..

特開２００７−０８５９３７号公報Japanese Unexamined Patent Publication No. 2007-085937

そこで、近年、撮影された物体の形状を機械学習を用いて検出する技術についての検討が行われている。しかし、機械学習を用いた従来の検出方法では、検出精度が低かった。 Therefore, in recent years, a technique for detecting the shape of a photographed object by using machine learning has been studied. However, the conventional detection method using machine learning has low detection accuracy.

開示の技術は、上記に鑑みてなされたものであって、撮影された物体の形状を精度良く検出することを目的とする。 The disclosed technique has been made in view of the above, and an object thereof is to accurately detect the shape of a photographed object.

開示の態様では、座標検出装置は、記憶部と、検出部とを有する。前記記憶部は、物体の規定点が中心に位置する画像を用いて機械学習により生成された、前記物体が撮影された入力画像において前記物体の規定点が中心に位置する領域の情報を出力する学習済みモデルを記憶する。前記検出部は、前記物体が撮影された前記入力画像に対して、前記学習済みモデルを用いて前記物体の規定点が中心に位置する領域を検出し、前記入力画像における前記物体の規定点の座標を検出する。 In the disclosed aspect, the coordinate detection device has a storage unit and a detection unit. The storage unit outputs information on a region in which the specified point of the object is located in the center in an input image in which the object is captured, which is generated by machine learning using an image in which the specified point of the object is located in the center. Memorize the trained model. The detection unit detects a region in which the defined point of the object is located at the center of the input image in which the object is photographed by using the learned model, and determines the defined point of the object in the input image. Detect coordinates.

開示の態様によれば、物体の形状を精度良く検出することができる。 According to the disclosed aspect, the shape of the object can be detected with high accuracy.

図１は、実施例１の物体形状検出システムの構成例を示す図である。FIG. 1 is a diagram showing a configuration example of the object shape detection system of the first embodiment. 図２は、実施例１の学習モデル生成装置の構成例を示す図である。FIG. 2 is a diagram showing a configuration example of the learning model generator of the first embodiment. 図３は、実施例１の座標検出装置の構成例を示す図である。FIG. 3 is a diagram showing a configuration example of the coordinate detection device of the first embodiment. 図４は、実施例１の学習モデル生成装置の処理の説明に供するフローチャートである。FIG. 4 is a flowchart provided for explaining the processing of the learning model generator of the first embodiment. 図５は、実施例１の学習モデル生成装置の動作の説明に供する図である。FIG. 5 is a diagram for explaining the operation of the learning model generator of the first embodiment. 図６は、実施例１の学習モデル生成装置の動作の説明に供する図である。FIG. 6 is a diagram for explaining the operation of the learning model generator of the first embodiment. 図７は、実施例１の座標検出装置の動作の説明に供する図である。FIG. 7 is a diagram for explaining the operation of the coordinate detection device of the first embodiment. 図８は、実施例１の分類モデルの動作の説明に供する図である。FIG. 8 is a diagram for explaining the operation of the classification model of the first embodiment. 図９は、実施例２の学習モデル生成装置の処理の説明に供するフローチャートである。FIG. 9 is a flowchart provided for explaining the processing of the learning model generator of the second embodiment. 図１０は、実施例２の学習モデル生成装置の動作の説明に供する図である。FIG. 10 is a diagram for explaining the operation of the learning model generator of the second embodiment. 図１１は、実施例２の座標検出装置の動作の説明に供する図である。FIG. 11 is a diagram for explaining the operation of the coordinate detection device according to the second embodiment. 図１２は、実施例２の検出分類モデルの動作の説明に供する図である。FIG. 12 is a diagram for explaining the operation of the detection classification model of the second embodiment. 図１３は、実施例２の検出分類モデルの動作の説明に供する図である。FIG. 13 is a diagram for explaining the operation of the detection classification model of the second embodiment. 図１４は、実施例２の座標変換の一例を示す図である。FIG. 14 is a diagram showing an example of the coordinate transformation of the second embodiment. 図１５は、実施例３の文字認識装置の構成例を示す図である。FIG. 15 is a diagram showing a configuration example of the character recognition device of the third embodiment. 図１６は、実施例３の補正部及び認識部の動作の説明に供する図である。FIG. 16 is a diagram for explaining the operation of the correction unit and the recognition unit according to the third embodiment. 図１７は、実施例３の透視投影変換の一例を示す図である。FIG. 17 is a diagram showing an example of the perspective projection conversion of the third embodiment. 図１８は、実施例４の画像処理装置の構成例を示す図である。FIG. 18 is a diagram showing a configuration example of the image processing device of the fourth embodiment. 図１９は、実施例４の画像処理装置の動作の説明に供する図である。FIG. 19 is a diagram for explaining the operation of the image processing apparatus of the fourth embodiment. 図２０は、実施例５の検出対象物体の一例を示す図である。FIG. 20 is a diagram showing an example of the object to be detected in the fifth embodiment. 図２１は、実施例５の検出対象物体の一例を示す図である。FIG. 21 is a diagram showing an example of the object to be detected in the fifth embodiment.

以下に、本願の開示する座標検出装置及び学習済みモデルの実施例を図面に基づいて説明する。なお、この実施例により本願の開示する座標検出装置及び学習済みモデルが限定されるものではない。また、実施例において同一の機能を有する構成、及び、同一の処理を行うステップには同一の符号を付す。 Hereinafter, examples of the coordinate detection device and the trained model disclosed in the present application will be described with reference to the drawings. It should be noted that this embodiment does not limit the coordinate detection device and the trained model disclosed in the present application. Further, in the embodiment, the same reference numerals are given to the configurations having the same functions and the steps for performing the same processing.

［実施例１］
＜物体形状検出システムの構成＞
図１は、実施例１の物体形状検出システムの構成例を示す図である。図１において、物体形状検出システム１は、学習モデル生成装置１０と、座標検出装置２０とを有する。 [Example 1]
<Configuration of object shape detection system>
FIG. 1 is a diagram showing a configuration example of the object shape detection system of the first embodiment. In FIG. 1, the object shape detection system 1 includes a learning model generation device 10 and a coordinate detection device 20.

学習モデル生成装置１０には元画像が入力され、学習モデル生成装置１０は、入力された元画像を用いて「学習済みモデル」を生成し、生成した学習済みモデルを座標検出装置２０へ出力する。 The original image is input to the learning model generation device 10, the learning model generation device 10 generates a "learned model" using the input original image, and outputs the generated learned model to the coordinate detection device 20. ..

座標検出装置２０には形状検出の対象となる物体（以下では「検出対象物体」と呼ぶことがある）が撮影された画像（以下では「検出対象画像」と呼ぶことがある）が入力される。座標検出装置２０は、学習モデル生成装置１０で生成された学習済みモデルを用いて、検出対象画像に撮影されている検出対象物体の形状を検出し、検出結果を出力する。検出対象画像は、座標検出装置２０への「入力画像」に相当する。 An image (hereinafter sometimes referred to as "detection target image") obtained by capturing an object (hereinafter sometimes referred to as "detection target object") to be shape-detected is input to the coordinate detection device 20. .. The coordinate detection device 20 detects the shape of the detection target object captured in the detection target image by using the trained model generated by the learning model generation device 10, and outputs the detection result. The detection target image corresponds to the "input image" to the coordinate detection device 20.

＜学習モデル生成装置の構成＞
図２は、実施例１の学習モデル生成装置の構成例を示す図である。図２において、学習モデル生成装置１０は、データセット生成部１１と、学習モデル生成部１２と、記憶部１３と、出力部１４とを有する。 <Configuration of learning model generator>
FIG. 2 is a diagram showing a configuration example of the learning model generator of the first embodiment. In FIG. 2, the learning model generation device 10 includes a data set generation unit 11, a learning model generation unit 12, a storage unit 13, and an output unit 14.

データセット生成部１１には元画像が入力される。データセット生成部１１は、元画像から、学習済みモデルの生成に使用される「データセット」を生成し、生成したデータセットを学習モデル生成部１２へ出力する。 The original image is input to the data set generation unit 11. The data set generation unit 11 generates a "data set" used for generating a trained model from the original image, and outputs the generated data set to the training model generation unit 12.

学習モデル生成部１２は、データセット生成部１１で生成されたデータセットを用いて学習済みモデルを生成し、生成した学習済みモデルを記憶部１３へ出力する。つまり、データセット生成部１１で生成されたデータセットが、学習済みモデルを生成する際の教師データとなる。 The learning model generation unit 12 generates a trained model using the data set generated by the data set generation unit 11, and outputs the generated trained model to the storage unit 13. That is, the data set generated by the data set generation unit 11 becomes the teacher data when the trained model is generated.

記憶部１３は、学習モデル生成部１２で生成された学習済みモデルを記憶する。 The storage unit 13 stores the learned model generated by the learning model generation unit 12.

出力部１４は、記憶部１３に記憶されている学習済みモデルを取得し、取得した学習済みモデルを座標検出装置２０へ出力する。学習モデル生成装置１０から座標検出装置２０への学習済みモデルの出力は、例えば、学習モデル生成装置１０に対するオペレータの指示に従って行われる。 The output unit 14 acquires the trained model stored in the storage unit 13 and outputs the acquired trained model to the coordinate detection device 20. The output of the trained model from the learning model generation device 10 to the coordinate detection device 20 is performed according to, for example, an operator's instruction to the training model generation device 10.

＜座標検出装置の構成＞
図３は、実施例１の座標検出装置の構成例を示す図である。図３において、座標検出装置２０は、取得部２１と、記憶部２２と、検出部２３とを有する。 <Configuration of coordinate detection device>
FIG. 3 is a diagram showing a configuration example of the coordinate detection device of the first embodiment. In FIG. 3, the coordinate detection device 20 includes an acquisition unit 21, a storage unit 22, and a detection unit 23.

取得部２１は、学習モデル生成装置１０から出力された学習済みモデルを取得し、取得した学習済みモデルを記憶部２２へ出力する。 The acquisition unit 21 acquires the learned model output from the learning model generation device 10, and outputs the acquired learned model to the storage unit 22.

記憶部２２は、取得部２１で取得された学習済みモデルを記憶する。 The storage unit 22 stores the learned model acquired by the acquisition unit 21.

検出部２３には検出対象画像が入力され、検出部２３は、記憶部２２に記憶されている学習済みモデルを用いて、検出対象物体の形状を検出し、検出結果を出力する。 The detection target image is input to the detection unit 23, and the detection unit 23 detects the shape of the detection target object using the learned model stored in the storage unit 22 and outputs the detection result.

＜学習モデル生成装置の処理＞
図４は、実施例１の学習モデル生成装置の処理の説明に供するフローチャートである。 <Processing of learning model generator>
FIG. 4 is a flowchart provided for explaining the processing of the learning model generator of the first embodiment.

図４において、ステップＳ１１では、データセット生成部１１が、元画像から、教師データとしての学習用データセットＡを生成する。 In FIG. 4, in step S11, the data set generation unit 11 generates the learning data set A as teacher data from the original image.

ステップＳ１３では、学習モデル生成部１２が、第一学習済みモデルとしての「検出モデル」の学習を行う。 In step S13, the learning model generation unit 12 learns the “detection model” as the first trained model.

また、ステップＳ１５では、学習モデル生成部１２が、第二学習済みモデルとしての「分類モデル」の学習を行う。 Further, in step S15, the learning model generation unit 12 learns the “classification model” as the second learned model.

学習モデル生成部１２は、ステップＳ１３の処理とステップＳ１５の処理とを、同時に並行して行っても良いし、また、一方の処理が済んだ後に他方の処理を行っても良い。 The learning model generation unit 12 may simultaneously perform the processing of step S13 and the processing of step S15 in parallel, or may perform the other processing after one processing is completed.

＜学習モデル生成装置の動作＞
図５及び図６は、実施例１の学習モデル生成装置の動作の説明に供する図である。図５には、検出モデルの学習の動作例を図示し、図６には、分類モデルの学習の動作例を図示する。以下、検出モデルの学習の動作例と、分類モデルの学習の動作例とに分けて説明する。また以下では、検出対象画像に撮影されている検出対象物体の一例として、自動車の矩形のナンバープレートを挙げて説明する。また以下では、検出対象物体上に存在する「規定点」の一例として、ナンバープレートの四隅に存在する「コーナー点」を挙げて説明する。コーナー点は、「頂点」と呼ばれることもある。 <Operation of learning model generator>
5 and 6 are diagrams for explaining the operation of the learning model generator of the first embodiment. FIG. 5 illustrates an operation example of learning of the detection model, and FIG. 6 illustrates an operation example of learning of the classification model. Hereinafter, an operation example of learning the detection model and an operation example of learning the classification model will be described separately. Further, in the following, as an example of the detection target object captured in the detection target image, a rectangular license plate of an automobile will be described. Further, in the following, as an example of the "specified points" existing on the object to be detected, the "corner points" existing at the four corners of the license plate will be described. Corner points are sometimes called "vertices".

＜検出モデルの学習の動作例：図５＞
図５に示すように、データセット生成部１１には、ナンバープレートＮＰを有する自動車の画像が元画像として複数入力され、データセット生成部１１は、これら複数の元画像から、第一教師データとしてのデータセットＡ１と、第二教師データとしてのデータセットＡ２とを生成する。 <Example of learning operation of detection model: Fig. 5>
As shown in FIG. 5, a plurality of images of a vehicle having a number plate NP are input to the data set generation unit 11 as original images, and the data set generation unit 11 uses these plurality of original images as first teacher data. Data set A1 and data set A2 as the second teacher data are generated.

データセットＡ１は、図５に示すように、検出対象画像においてナンバープレートＮＰの４個のコーナー点が不鮮明な複数の「ネガティブ画像」により形成される。 As shown in FIG. 5, the data set A1 is formed by a plurality of "negative images" in which the four corner points of the license plate NP are unclear in the image to be detected.

一方で、データセットＡ２は、図５に示すように、ナンバープレートＮＰの４個のコーナー点の何れか一つのコーナー点だけを含む複数の「ポジティブ画像」により形成される。 On the other hand, as shown in FIG. 5, the data set A2 is formed by a plurality of "positive images" including only one of the four corner points of the license plate NP.

ここで、各ポジティブ画像ＰＩにおいて、コーナー点ＣＰは、ポジティブ画像の中心に位置する。すなわち、例えばポジティブ画像ＰＩのアスペクト比が「ｘ：ｙ＝１：１」の場合、コーナー点ＣＰがｘ＝１／２，ｙ＝１／２の位置に配置されるようにポジティブ画像ＰＩが生成させる。換言すれば、ナンバープレートＮＰの四辺で形成される境界線のうちの互いに接する二辺の境界線がｘ＝１／２でのｙ方向に平行な直線及びｙ＝１／２でのｘ方向に平行な直線にほぼ重なるようにポジティブ画像ＰＩが生成される。図５に示す例では、ナンバープレートＮＰの４個のコーナー点のうちの左下のコーナー点ＣＰがポジティブ画像ＰＩの中心に位置している。すなわち、図５に示す例では、ポジティブ画像ＰＩにおいて、ナンバープレートＮＰの四辺のうち、左辺がｘ＝１／２でのｙ方向に平行な直線にほぼ重なり、かつ、左辺と互いに接する下辺がｙ＝１／２でのｘ方向に平行な直線にほぼ重なっている。 Here, in each positive image PI, the corner point CP is located at the center of the positive image. That is, for example, when the aspect ratio of the positive image PI is "x: y = 1: 1", the positive image PI is generated so that the corner points CP are arranged at the positions of x = 1/2 and y = 1/2. Let me. In other words, of the boundary lines formed by the four sides of the license plate NP, the boundary lines of the two tangent sides are a straight line parallel to the y direction at x = 1/2 and the x direction at y = 1/2. The positive image PI is generated so that it almost overlaps the parallel straight lines. In the example shown in FIG. 5, the lower left corner point CP of the four corner points of the license plate NP is located at the center of the positive image PI. That is, in the example shown in FIG. 5, in the positive image PI, of the four sides of the license plate NP, the left side substantially overlaps the straight line parallel to the y direction at x = 1/2, and the lower side in contact with the left side is y. It almost overlaps with a straight line parallel to the x direction at = 1/2.

学習モデル生成部１２は、データセット生成部１１で生成されたデータセットＡ１，Ａ２を教師データとして用いて機械学習を行って、第一学習済みモデルとしての検出モデルを生成する。検出モデルを生成する際の機械学習は、例えば、ＬＢＰ（Local Binary Pattern）特徴を用いたBoostingにより行う。 The learning model generation unit 12 performs machine learning using the data sets A1 and A2 generated by the data set generation unit 11 as teacher data, and generates a detection model as the first trained model. Machine learning when generating a detection model is performed by, for example, Boosting using LBP (Local Binary Pattern) features.

＜分類モデルの学習の動作例：図６＞
図６に示すように、データセット生成部１１には、ナンバープレートＮＰを有する自動車の画像が元画像として複数入力され、データセット生成部１１は、これら複数の元画像から、第三教師データとしてのデータセットＡ３を生成する。なお、データセットＡ１、データセットＡ２及びデータセットＡ３により、図４のステップＳＴ１１における学習用データセットＡが形成される。 <Example of learning operation of classification model: Fig. 6>
As shown in FIG. 6, a plurality of images of a vehicle having a number plate NP are input to the data set generation unit 11 as original images, and the data set generation unit 11 uses these plurality of original images as third teacher data. Data set A3 is generated. The data set A1, the data set A2, and the data set A3 form the learning data set A in step ST11 of FIG.

データセットＡ３は、図６に示すように、ナンバープレートＮＰの４個のコーナー点のうち、左上のコーナー点だけを含む複数の画像（以下では「左上コーナー点画像」と呼ぶことがある）と、右上のコーナー点だけを含む複数の画像（以下では「右上コーナー点画像」と呼ぶことがある）と、右下のコーナー点だけを含む複数の画像（以下では「右下コーナー点画像」と呼ぶことがある）と、左下のコーナー点だけを含む複数の画像（以下では「左下コーナー点画像」と呼ぶことがある）とにより形成される。左上コーナー点画像には左上のコーナー点に対応する「コーナー１」というラベルが付され、右上コーナー点画像には右上のコーナー点に対応する「コーナー２」というラベルが付され、右下コーナー点画像には右下のコーナー点に対応する「コーナー３」というラベルが付され、左下コーナー点画像には左下のコーナー点に対応する「コーナー４」というラベルが付される。なお、データセットＡ３を形成する各画像において、コーナー点は画像の中心に位置しなくても良い。 As shown in FIG. 6, the data set A3 includes a plurality of images including only the upper left corner point among the four corner points of the license plate NP (hereinafter, may be referred to as “upper left corner point image”). , Multiple images containing only the upper right corner point (hereinafter sometimes referred to as "upper right corner point image") and multiple images containing only the lower right corner point (hereinafter referred to as "lower right corner point image"). It is formed by (sometimes called) and a plurality of images (hereinafter sometimes referred to as "lower left corner point image") including only the lower left corner point. The upper left corner point image is labeled "Corner 1" corresponding to the upper left corner point, the upper right corner point image is labeled "Corner 2" corresponding to the upper right corner point, and the lower right corner point is labeled. The image is labeled "Corner 3" corresponding to the lower right corner point, and the lower left corner point image is labeled "Corner 4" corresponding to the lower left corner point. In each image forming the data set A3, the corner point does not have to be located at the center of the image.

学習モデル生成部１２は、データセット生成部１１で生成されたデータセットＡ３を教師データとして用いて機械学習を行って、第二学習済みモデルとしての分類モデルを生成する。分類モデルを生成する際の機械学習として深層学習を用いる。分類モデルの生成は、例えば、４層のＣＮＮ（Convolutional Neural Network）により行う。 The learning model generation unit 12 performs machine learning using the data set A3 generated by the data set generation unit 11 as teacher data, and generates a classification model as a second trained model. Deep learning is used as machine learning when generating a classification model. The classification model is generated by, for example, a four-layer CNN (Convolutional Neural Network).

＜座標検出装置の動作＞
図７は、実施例１の座標検出装置の動作の説明に供する図である。 <Operation of coordinate detection device>
FIG. 7 is a diagram for explaining the operation of the coordinate detection device of the first embodiment.

図５に示すようにして生成された検出モデル、及び、図６に示すようにして生成された分類モデルは、座標検出装置２０の取得部２１によって学習モデル生成装置１０から取得されて記憶部２２に記憶される。 The detection model generated as shown in FIG. 5 and the classification model generated as shown in FIG. 6 are acquired from the learning model generation device 10 by the acquisition unit 21 of the coordinate detection device 20 and stored in the storage unit 22. Is remembered in.

図７に示すように、ナンバープレートが撮影された検出対象画像が座標検出装置２０に入力されると、検出部２３は、まず、検出対象画像に対して検出モデルを用いて、検出対象画像においてナンバープレートのコーナー点が中心に位置する領域の候補（以下では「コーナー点存在領域候補」と呼ぶことがある）を検出する。すなわち、学習モデル生成装置１０により生成された検出モデルは、検出対象画像が入力されて、検出対象画像においてコーナー点存在領域候補を示す情報（以下では「候補情報」と呼ぶことがある）を検出部２３へ出力する学習済みモデルである。また、検出モデルは、検出対象画像が入力される入力層と、出力層と、入力層から出力層までの何れかの層であって出力層以外の層に属する第一要素（第一ノード）と、第一要素と第一要素の重みとに基づいて値が算出される第二要素（第二ノード）とを有する。そして、検出モデルは、入力層に入力された検出対象画像に対し、出力層以外の各層に属する各要素を第一要素として、第一要素と第一要素の重みとに基づく演算を行うことにより候補情報を出力する。図７に示す例では、検出部２３が検出対象画像に対して検出モデルを用いることにより、例えば、ＣＡ１〜ＣＡ７の７個のコーナー点存在領域候補が検出される。ここで、コーナー点存在領域候補の領域の大きさ及びアスペクト比は、図５におけるポジティブ画像ＰＩと同一である。 As shown in FIG. 7, when the detection target image on which the license plate is photographed is input to the coordinate detection device 20, the detection unit 23 first uses the detection model for the detection target image in the detection target image. A candidate for a region in which the corner point of the license plate is located at the center (hereinafter, may be referred to as a "corner point existence region candidate") is detected. That is, in the detection model generated by the learning model generation device 10, the detection target image is input, and information indicating a corner point existence region candidate (hereinafter, may be referred to as "candidate information") is detected in the detection target image. This is a trained model to be output to unit 23. Further, the detection model is a first element (first node) that is any layer from the input layer, the output layer, and the input layer to the output layer to which the image to be detected is input and belongs to a layer other than the output layer. And a second element (second node) whose value is calculated based on the first element and the weight of the first element. Then, the detection model performs an operation based on the first element and the weight of the first element with each element belonging to each layer other than the output layer as the first element for the detection target image input to the input layer. Output candidate information. In the example shown in FIG. 7, when the detection unit 23 uses the detection model for the image to be detected, for example, seven corner point existence region candidates of CA1 to CA7 are detected. Here, the size and aspect ratio of the area of the corner point existence area candidate are the same as those of the positive image PI in FIG.

検出部２３は、次いで、候補情報に対して分類モデルを用いて、候補情報により示されたコーナー点存在領域候補がナンバープレートの４個のコーナー点のうちの何れのコーナー点を含む領域であるかを特定する。すなわち、学習モデル生成装置１０により生成された分類モデルは、候補情報が入力されて、候補情報が示すコーナー点存在領域候補がナンバープレートの４個のコーナー点のうちの何れのコーナー点を含む領域であるかを示す情報（以下では「コーナー点第一特定情報」と呼ぶことがある）を検出部２３へ出力する学習済みモデルである。また、分類モデルは、候補情報が入力される入力層と、出力層と、入力層から出力層までの何れかの層であって出力層以外の層に属する第一要素と、第一要素と第一要素の重みとに基づいて値が算出される第二要素とを有する。そして、分類モデルは、入力層に入力された候補情報に対し、出力層以外の各層に属する各要素を第一要素として、第一要素と第一要素の重みとに基づく演算を行うことによりコーナー点第一特定情報を出力する。 Next, the detection unit 23 uses a classification model for the candidate information, and the corner point existence region candidate indicated by the candidate information is an region including any of the four corner points of the license plate. To identify. That is, in the classification model generated by the learning model generation device 10, candidate information is input, and the corner point existence area candidate indicated by the candidate information is an area including any of the four corner points of the license plate. This is a trained model that outputs information indicating whether or not the information (hereinafter, may be referred to as “corner point first specific information”) to the detection unit 23. Further, the classification model includes an input layer into which candidate information is input, an output layer, a first element which is any layer from the input layer to the output layer and belongs to a layer other than the output layer, and a first element. It has a second element whose value is calculated based on the weight of the first element. Then, the classification model corners the candidate information input to the input layer by performing an operation based on the first element and the weight of the first element, with each element belonging to each layer other than the output layer as the first element. The point first specific information is output.

ここで、分類モデルの動作例について説明する。図８は、実施例１の分類モデルの動作の説明に供する図である。図８に示すように、分類モデルは、まず、コーナー点存在領域候補ＣＡ１〜ＣＡ７の各々について、コーナー１〜４の何れに対応するコーナー点を含む領域であるかの確度を算出する。図８に示す例では、分類モデルにより、コーナー点存在領域候補ＣＡ１は確度２０％のコーナー４（つまり左下のコーナー）を含む領域であると分類される。同様に、コーナー点存在領域候補ＣＡ２は確度５％のコーナー１（つまり左上のコーナー）を含む領域、コーナー点存在領域候補ＣＡ３は確度１５％のコーナー４（つまり左下のコーナー）を含む領域、コーナー点存在領域候補ＣＡ４は確度９２％のコーナー１（つまり左上のコーナー）を含む領域、コーナー点存在領域候補ＣＡ５は確度８６％のコーナー２（つまり右上のコーナー）を含む領域、コーナー点存在領域候補ＣＡ６は確度８８％のコーナー４（つまり左下のコーナー）を含む領域、コーナー点存在領域候補ＣＡ７は確度９１％のコーナー３（つまり右下のコーナー）を含む領域であると分類モデルによって分類される。 Here, an operation example of the classification model will be described. FIG. 8 is a diagram for explaining the operation of the classification model of the first embodiment. As shown in FIG. 8, the classification model first calculates the certainty of each of the corner point existence region candidates CA1 to CA7 as to which of the corners 1 to 4 corresponds to the region including the corner point. In the example shown in FIG. 8, the corner point existence region candidate CA1 is classified as a region including a corner 4 (that is, a lower left corner) having a probability of 20% according to the classification model. Similarly, the corner point existence area candidate CA2 is an area including a corner 1 with a probability of 5% (that is, the upper left corner), and the corner point existence area candidate CA3 is an area including a corner 4 (that is, a lower left corner) with a probability of 15%. The point existence area candidate CA4 is an area including a corner 1 (that is, the upper left corner) with a probability of 92%, and the corner point existence area candidate CA5 is an area including a corner 2 (that is, the upper right corner) with an accuracy of 86%. CA6 is classified as an area including a corner 4 with an accuracy of 88% (that is, a lower left corner), and a corner point existence area candidate CA7 is classified as an area including a corner 3 with an accuracy of 91% (that is, a lower right corner) by a classification model. ..

そして、分類モデルは、コーナー１〜４の各コーナー毎に、確度が最も高いコーナーを選択してコーナー点第一特定情報を出力する。すなわち、図７及び図８に示す例では、分類モデルは、左上のコーナー点を含む領域としてコーナー点存在領域候補ＣＡ４を選択し、選択したコーナー点存在領域候補ＣＡ４に対応付けて「コーナー１」というコーナー点第一特定情報を出力する。また、分類モデルは、右上のコーナー点を含む領域としてコーナー点存在領域候補ＣＡ５を選択し、選択したコーナー点存在領域候補ＣＡ５に対応付けて「コーナー２」というコーナー点第一特定情報を出力する。また、分類モデルは、右下のコーナー点を含む領域としてコーナー点存在領域候補ＣＡ７を選択し、選択したコーナー点存在領域候補ＣＡ７に対応付けて「コーナー３」というコーナー点第一特定情報を出力する。また、分類モデルは、左下のコーナー点を含む領域としてコーナー点存在領域候補ＣＡ６を選択し、選択したコーナー点存在領域候補ＣＡ６に対応付けて「コーナー４」というコーナー点第一特定情報を出力する。これにより、検出部２３は、検出対象画像において、ナンバープレートの４個のコーナー点が存在する領域（以下では「コーナー点存在領域」と呼ぶことがある）を検出することができる。 Then, the classification model selects the corner with the highest accuracy for each of the corners 1 to 4 and outputs the corner point first specific information. That is, in the examples shown in FIGS. 7 and 8, the classification model selects the corner point existence area candidate CA4 as the area including the upper left corner point, and associates the selected corner point existence area candidate CA4 with the “corner 1”. The first specific information of the corner point is output. Further, the classification model selects the corner point existence area candidate CA5 as the area including the upper right corner point, and outputs the corner point first specific information called "corner 2" in association with the selected corner point existence area candidate CA5. .. Further, the classification model selects the corner point existence area candidate CA7 as the area including the lower right corner point, and outputs the corner point first specific information called "corner 3" in association with the selected corner point existence area candidate CA7. To do. Further, the classification model selects the corner point existence area candidate CA6 as the area including the lower left corner point, and outputs the corner point first specific information called "corner 4" in association with the selected corner point existence area candidate CA6. .. As a result, the detection unit 23 can detect a region in which the four corner points of the license plate exist (hereinafter, may be referred to as a "corner point existence region") in the detection target image.

次いで、検出部２３は、図７に示すように、４個のコーナー点存在領域の各々の中心座標を抽出する。すなわち、検出部２３は、「コーナー１」というコーナー点第一特定情報に対応付けられたコーナー点存在領域の中心座標をナンバープレートの左上のコーナー点の座標ＣＯ１として抽出する。また、検出部２３は、「コーナー２」というコーナー点第一特定情報に対応付けられたコーナー点存在領域の中心座標をナンバープレートの右上のコーナー点の座標ＣＯ２として抽出する。また、検出部２３は、「コーナー３」というコーナー点第一特定情報に対応付けられたコーナー点存在領域の中心座標をナンバープレートの右下のコーナー点の座標ＣＯ３として抽出する。また、検出部２３は、「コーナー４」というコーナー点第一特定情報に対応付けられたコーナー点存在領域の中心座標をナンバープレートの左下のコーナー点の座標ＣＯ４として抽出する。このようにして、検出部２３は、分類モデルを用いて検出した４個のコーナー点存在領域の各々の中心座標を、検出対象画像におけるナンバープレートの４個のコーナー点の各々の座標として検出し、検出結果として出力する。このようにして検出された４個のコーナー点の座標の外縁を直線で結ぶことにより、検出対象画像において撮影されているナンバープレートの矩形の形状を検出することができる。 Next, as shown in FIG. 7, the detection unit 23 extracts the center coordinates of each of the four corner point existing regions. That is, the detection unit 23 extracts the center coordinates of the corner point existence region associated with the corner point first specific information called "corner 1" as the coordinates CO1 of the upper left corner point of the license plate. Further, the detection unit 23 extracts the center coordinates of the corner point existence region associated with the corner point first specific information called "corner 2" as the coordinates CO2 of the upper right corner point of the license plate. Further, the detection unit 23 extracts the center coordinates of the corner point existence region associated with the corner point first specific information called "corner 3" as the coordinates CO3 of the lower right corner point of the license plate. Further, the detection unit 23 extracts the center coordinates of the corner point existence region associated with the corner point first specific information called "corner 4" as the coordinates CO4 of the lower left corner point of the license plate. In this way, the detection unit 23 detects the center coordinates of each of the four corner point existence regions detected using the classification model as the coordinates of each of the four corner points of the license plate in the detection target image. , Output as detection result. By connecting the outer edges of the coordinates of the four corner points detected in this way with a straight line, the rectangular shape of the license plate captured in the image to be detected can be detected.

以上のように、実施例１では、座標検出装置２０は、記憶部２２と検出部２３とを有する。記憶部２２は、第一学習済みモデルとしての検出モデルと、第二学習済みモデルとしての分類モデルとを記憶する。検出モデルは、ナンバープレートのコーナー点が中心に位置するポジティブ画像ＰＩを用いて機械学習により生成された学習済みモデルである。また、検出モデル及び分類モデルは、検出対象画像においてナンバープレートのコーナー点が中心に位置する領域の情報（候補情報、コーナー点第一特定情報）を出力する。検出部２３は、ナンバープレートが撮影された検出対象画像に対して、検出モデル及び分類モデルを用いてナンバープレートのコーナー点が中心に位置する領域を検出し、検出対象画像におけるナンバープレートのコーナー点の座標を検出する。つまり、検出モデル及び分類モデルは、検出対象画像においてナンバープレートのコーナー点が中心に位置する領域の情報（候補情報、コーナー点第一特定情報）を検出部２３へ出力することにより、検出対象画像におけるナンバープレートのコーナー点の座標を検出する検出部２３に、ナンバープレートのコーナー点が中心に位置する領域を検出させる。 As described above, in the first embodiment, the coordinate detection device 20 has a storage unit 22 and a detection unit 23. The storage unit 22 stores the detection model as the first trained model and the classification model as the second trained model. The detection model is a trained model generated by machine learning using a positive image PI in which the corner points of the license plate are located in the center. Further, the detection model and the classification model output information (candidate information, corner point first specific information) of the region where the corner point of the license plate is located at the center in the detection target image. The detection unit 23 uses a detection model and a classification model to detect a region in which the corner point of the license plate is located at the center of the detection target image captured by the license plate, and the detection unit 23 detects the corner point of the license plate in the detection target image. Detect the coordinates of. That is, the detection model and the classification model output the information (candidate information, corner point first specific information) of the region where the corner point of the license plate is located at the center in the detection target image to the detection unit 23, thereby detecting the detection target image. The detection unit 23 that detects the coordinates of the corner points of the license plate in the above is made to detect the region where the corner points of the license plate are located at the center.

より詳細には、検出モデルは、検出対象画像が入力されて、検出対象画像においてナンバープレートのコーナー点が中心に位置する領域の候補を示す情報（候補情報）を出力する。分類モデルは、候補情報が入力されて、候補情報が示すコーナー点存在領域候補がナンバープレートの４個のコーナー点のうちの何れのコーナー点を含む領域であるかを示す情報（コーナー点第一特定情報）を出力する。検出部２３は、検出対象画像に対して検出モデルを用いてコーナー点存在領域候補を検出し、検出したコーナー点存在領域候補に対して分類モデルを用いてナンバープレートのコーナー点を含む領域を検出し、分類モデルを用いて検出した領域の中心座標を検出対象画像におけるナンバープレートのコーナー点の座標として検出する。 More specifically, the detection model inputs the detection target image and outputs information (candidate information) indicating a candidate of a region in which the corner point of the license plate is located at the center of the detection target image. In the classification model, candidate information is input, and information indicating which corner point of the four corner points of the license plate is included in the corner point existence area candidate indicated by the candidate information (corner point first). Specific information) is output. The detection unit 23 detects the corner point existence region candidate using the detection model for the detection target image, and detects the region including the corner point of the license plate using the classification model for the detected corner point existence region candidate. Then, the center coordinates of the region detected by using the classification model are detected as the coordinates of the corner points of the license plate in the detection target image.

このように、ナンバープレートのコーナー点が中心に位置するポジティブ画像ＰＩを用いて機械学習により生成された学習済みモデルを用いて検出対象画像におけるナンバープレートのコーナー点の座標を検出することにより、検出対象画像において撮影されているナンバープレートの矩形の形状を精度良く検出することができる。 In this way, detection is performed by detecting the coordinates of the license plate corner points in the detection target image using the trained model generated by machine learning using the positive image PI in which the license plate corner points are located in the center. The rectangular shape of the license plate captured in the target image can be detected with high accuracy.

［実施例２］
＜物体形状検出システムの構成、学習モデル生成装置の構成、座標検出装置の構成＞
実施例２の物体形状検出システム、学習モデル生成装置及び座標検出装置の各構成は、実施例１と同一であるため（図１〜３）、説明を省略する。 [Example 2]
<Configuration of object shape detection system, configuration of learning model generator, configuration of coordinate detection device>
Since each configuration of the object shape detection system, the learning model generation device, and the coordinate detection device of the second embodiment is the same as that of the first embodiment (FIGS. 1 to 3), the description thereof will be omitted.

＜学習モデル生成装置の処理＞
図９は、実施例２の学習モデル生成装置の処理の説明に供するフローチャートである。 <Processing of learning model generator>
FIG. 9 is a flowchart provided for explaining the processing of the learning model generator of the second embodiment.

図９において、ステップＳ２１では、データセット生成部１１が、元画像から、教師データとしての学習用データセットＢを生成する。 In FIG. 9, in step S21, the data set generation unit 11 generates the learning data set B as teacher data from the original image.

ステップＳ２３では、学習モデル生成部１２が、学習済みモデルとしての「検出分類モデル」の学習を行う。 In step S23, the learning model generation unit 12 learns the “detection classification model” as the learned model.

＜学習モデル生成装置の動作＞
図１０〜１２は、実施例２の学習モデル生成装置の動作の説明に供する図である。 <Operation of learning model generator>
10 to 12 are diagrams provided for explaining the operation of the learning model generator of the second embodiment.

図１０に示すように、データセット生成部１１には、ナンバープレートＮＰを有する自動車の画像が元画像として複数入力され、データセット生成部１１は、これら複数の元画像から、第一教師データとしてのデータセットＢ１と、第二教師データとしてのデータセットＢ２とを生成する。データセットＢ１及びデータセットＢ２により、図９のステップＳＴ２１における学習用データセットＢが形成される。 As shown in FIG. 10, a plurality of images of a vehicle having a number plate NP are input to the data set generation unit 11 as original images, and the data set generation unit 11 uses these plurality of original images as first teacher data. Data set B1 and data set B2 as the second teacher data are generated. The data set B1 and the data set B2 form the training data set B in step ST21 of FIG.

データセットＢ１は、図１０に示すように、左上コーナー点画像と、右上コーナー点画像と、右下コーナー点画像と、左下コーナー点画像とにより形成される。つまり、データセットＢ１は、ナンバープレートＮＰの４個のコーナー点の何れか一つのコーナー点だけを含む複数の「ポジティブ画像」により形成される。左上コーナー点画像には左上のコーナー点に対応する「コーナー１」というラベルが付され、右上コーナー点画像には右上のコーナー点に対応する「コーナー２」というラベルが付され、右下コーナー点画像には右下のコーナー点に対応する「コーナー３」というラベルが付され、左下コーナー点画像には左下のコーナー点に対応する「コーナー４」というラベルが付される。 As shown in FIG. 10, the data set B1 is formed by an upper left corner point image, an upper right corner point image, a lower right corner point image, and a lower left corner point image. That is, the data set B1 is formed by a plurality of "positive images" including only one of the four corner points of the license plate NP. The upper left corner point image is labeled "Corner 1" corresponding to the upper left corner point, the upper right corner point image is labeled "Corner 2" corresponding to the upper right corner point, and the lower right corner point is labeled. The image is labeled "Corner 3" corresponding to the lower right corner point, and the lower left corner point image is labeled "Corner 4" corresponding to the lower left corner point.

ここで、コーナー１〜４の各ラベルを付された各ポジティブ画像ＰＩにおいて、コーナー点ＣＰは、ポジティブ画像の中心に位置する。すなわち、例えばポジティブ画像ＰＩのアスペクト比が「ｘ：ｙ＝１：１」の場合、コーナー点ＣＰがｘ＝１／２，ｙ＝１／２の位置に配置されるようにポジティブ画像ＰＩが生成させる。換言すれば、ナンバープレートＮＰの四辺で形成される境界線のうちの互いに接する二辺の境界線がｘ＝１／２でのｙ方向に平行な直線及びｙ＝１／２でのｘ方向に平行な直線にほぼ重なるようにポジティブ画像ＰＩが生成される。図１０に示す例では、ナンバープレートＮＰの４個のコーナー点のうちの左下のコーナー点ＣＰがポジティブ画像ＰＩの中心に位置している。すなわち、図１０に示す例では、ポジティブ画像ＰＩにおいて、ナンバープレートＮＰの四辺のうち、左辺がｘ＝１／２でのｙ方向に平行な直線にほぼ重なり、かつ、左辺と互いに接する下辺がｙ＝１／２でのｘ方向に平行な直線にほぼ重なっている。 Here, in each of the positive image PIs labeled with the corners 1 to 4, the corner point CP is located at the center of the positive image. That is, for example, when the aspect ratio of the positive image PI is "x: y = 1: 1", the positive image PI is generated so that the corner points CP are arranged at the positions of x = 1/2 and y = 1/2. Let me. In other words, of the boundary lines formed by the four sides of the license plate NP, the boundary lines of the two tangent sides are a straight line parallel to the y direction at x = 1/2 and the x direction at y = 1/2. The positive image PI is generated so that it almost overlaps the parallel straight lines. In the example shown in FIG. 10, the lower left corner point CP of the four corner points of the license plate NP is located at the center of the positive image PI. That is, in the example shown in FIG. 10, in the positive image PI, of the four sides of the license plate NP, the left side substantially overlaps the straight line parallel to the y direction at x = 1/2, and the lower side in contact with the left side is y. It almost overlaps with a straight line parallel to the x direction at = 1/2.

また、データセットＢ２では、各元画像に対して、各元画像において撮影されているナンバープレートＮＰのコーナー１〜４の４個のコーナー点の座標コーナー１（ｘ,ｙ）、コーナー２（ｘ,ｙ）、コーナー３（ｘ,ｙ）、コーナー４（ｘ,ｙ）が対応付けられる。 Further, in the data set B2, for each original image, the coordinate corners 1 (x, y) and the corners 2 (x) of the four corner points of the license plate NP corners 1 to 4 captured in each original image are taken. , Y), corner 3 (x, y), and corner 4 (x, y) are associated.

学習モデル生成部１２は、データセット生成部１１で生成されたデータセットＢ１，Ｂ２を教師データとして用いて機械学習を行って、検出分類モデルを生成する。検出分類モデルを生成する際の機械学習として深層学習を用いる。検出分類モデルの生成は、例えば、YOLOv2をベースとした１１層のＦＣＮ（Fully Convolutional Networks）により行う。 The learning model generation unit 12 generates a detection classification model by performing machine learning using the data sets B1 and B2 generated by the data set generation unit 11 as teacher data. Deep learning is used as machine learning when generating a detection classification model. The detection classification model is generated by, for example, an 11-layer FCN (Fully Convolutional Networks) based on YOLOv2.

検出分類モデルの機械学習は、以下に示すように、第一段階と第二段階との二段階に分けて行われる。 Machine learning of the detection classification model is performed in two stages, the first stage and the second stage, as shown below.

すなわち、まず第一段階で、学習モデル生成部１２は、データセット生成部１１で生成されたデータセットＢ１を第一教師データとして用いて機械学習を行って、重みの初期値の学習を行う。この初期値の学習は、実施例１の分類モデルの学習に相当するものである。 That is, in the first step, the learning model generation unit 12 performs machine learning using the data set B1 generated by the data set generation unit 11 as the first teacher data, and learns the initial value of the weight. The learning of the initial value corresponds to the learning of the classification model of the first embodiment.

次いで第二段階で、学習モデル生成部１２は、第一段階で学習した初期値をフィルタ群Ｆ１〜Ｆ８の各要素の重みとしてセットした後、データセット生成部１１で生成されたデータセットＢ２を第二教師データとして用いて機械学習を行って、検出分類モデルを生成する。 Next, in the second stage, the learning model generation unit 12 sets the initial values learned in the first stage as the weights of each element of the filter groups F1 to F8, and then sets the data set B2 generated by the data set generation unit 11. Machine learning is performed using it as the second teacher data to generate a detection classification model.

＜座標検出装置の動作＞
図１１は、実施例２の座標検出装置の動作の説明に供する図である。 <Operation of coordinate detection device>
FIG. 11 is a diagram for explaining the operation of the coordinate detection device according to the second embodiment.

図１０〜１２に示すようにして生成された検出分類モデルは、座標検出装置２０の取得部２１によって学習モデル生成装置１０から取得されて記憶部２２に記憶される。 The detection classification model generated as shown in FIGS. 10 to 12 is acquired from the learning model generation device 10 by the acquisition unit 21 of the coordinate detection device 20 and stored in the storage unit 22.

図１１に示すように、ナンバープレートが撮影された検出対象画像が座標検出装置２０に入力されると、検出部２３は、まず、検出対象画像に対して検出分類モデルを用いて、ナンバープレートにおける４個のコーナー点存在領域ＣＢ１，ＣＢ２，ＣＢ３，ＣＢ４を検出するとともに、検出した各コーナー点存在領域の中心座標であるコーナー１Ｓ（ｘ,ｙ）、コーナー２Ｓ（ｘ,ｙ）、コーナー３Ｓ（ｘ,ｙ）、コーナー４Ｓ（ｘ,ｙ）を検出する。ここで、コーナー１Ｓ（ｘ,ｙ）はナンバープレートの左上のコーナー点に対応する座標であり、コーナー２Ｓ（ｘ,ｙ）はナンバープレートの右上のコーナー点に対応する座標であり、コーナー３Ｓ（ｘ,ｙ）はナンバープレートの右下のコーナー点に対応する座標であり、コーナー４Ｓ（ｘ,ｙ）はナンバープレートの左下のコーナー点に対応する座標である。但し、コーナー１Ｓ（ｘ,ｙ）、コーナー２Ｓ（ｘ,ｙ）、コーナー３Ｓ（ｘ,ｙ）、コーナー４Ｓ（ｘ,ｙ）の各座標は、相対座標のローカル座標である。 As shown in FIG. 11, when the detection target image in which the license plate is captured is input to the coordinate detection device 20, the detection unit 23 first uses the detection classification model for the detection target image in the license plate. Four corner point existence areas CB1, CB2, CB3, CB4 are detected, and corners 1S (x, y), corners 2S (x, y), and corners 3S (corners 3S), which are the center coordinates of the detected corner point existence areas, are detected. x, y) and corner 4S (x, y) are detected. Here, the corner 1S (x, y) is the coordinates corresponding to the upper left corner point of the license plate, the corner 2S (x, y) is the coordinates corresponding to the upper right corner point of the license plate, and the corner 3S ( x, y) is the coordinates corresponding to the lower right corner point of the license plate, and corner 4S (x, y) is the coordinates corresponding to the lower left corner point of the license plate. However, the coordinates of the corner 1S (x, y), the corner 2S (x, y), the corner 3S (x, y), and the corner 4S (x, y) are local coordinates of relative coordinates.

そこで、検出部２３は、次いで、ローカル座標であるコーナー１Ｓ（ｘ,ｙ）、コーナー２Ｓ（ｘ,ｙ）、コーナー３Ｓ（ｘ,ｙ）、コーナー４Ｓ（ｘ,ｙ）を、検出対象画像における絶対座標（以下では「画像座標」と呼ぶことがある）に変換する。そして、検出部２３は、コーナー１Ｓ（ｘ,ｙ）を座標変換した後の画像座標をナンバープレートの左上のコーナー点の座標ＣＯ１として検出する。また、検出部２３は、コーナー２Ｓ（ｘ,ｙ）を座標変換した後の画像座標をナンバープレートの右上のコーナー点の座標ＣＯ２として検出する。また、検出部２３は、コーナー３Ｓ（ｘ,ｙ）を座標変換した後の画像座標をナンバープレートの右下のコーナー点の座標ＣＯ３として検出する。また、検出部２３は、コーナー４Ｓ（ｘ,ｙ）を座標変換した後の画像座標をナンバープレートの左下のコーナー点の座標ＣＯ４として検出する。このようにして、検出部２３は、検出分類モデルを用いて検出した４個のコーナー点存在領域の各々の中心座標コーナー１Ｓ（ｘ,ｙ）、コーナー２Ｓ（ｘ,ｙ）、コーナー３Ｓ（ｘ,ｙ）、コーナー４Ｓ（ｘ,ｙ）を座標変換した上で検出対象画像におけるナンバープレートの４個のコーナー点の座標として検出する。このようにして検出された４個のコーナー点の画像座標の外縁を直線で結ぶことにより、検出対象画像において撮影されているナンバープレートの矩形の形状を検出することができる。 Therefore, the detection unit 23 then detects the local coordinates of the corner 1S (x, y), the corner 2S (x, y), the corner 3S (x, y), and the corner 4S (x, y) in the image to be detected. Convert to absolute coordinates (hereinafter sometimes referred to as "image coordinates"). Then, the detection unit 23 detects the image coordinates after the corner 1S (x, y) is coordinate-converted as the coordinates CO1 of the upper left corner point of the license plate. Further, the detection unit 23 detects the image coordinates after the corner 2S (x, y) is coordinate-converted as the coordinates CO2 of the upper right corner point of the license plate. Further, the detection unit 23 detects the image coordinates after the corner 3S (x, y) are coordinate-converted as the coordinates CO3 of the lower right corner point of the license plate. Further, the detection unit 23 detects the image coordinates after the corner 4S (x, y) are coordinate-converted as the coordinates CO4 of the lower left corner point of the license plate. In this way, the detection unit 23 has the center coordinate corner 1S (x, y), corner 2S (x, y), and corner 3S (x) of each of the four corner point existence regions detected using the detection classification model. , Y) and corner 4S (x, y) are coordinate-converted and then detected as the coordinates of the four corner points of the license plate in the detection target image. By connecting the outer edges of the image coordinates of the four corner points detected in this way with a straight line, the rectangular shape of the license plate captured in the image to be detected can be detected.

ここで、学習モデル生成装置１０により生成された検出分類モデルは、検出対象画像が入力されて、検出対象画像においてコーナー点存在領域を特定し、特定したコーナー点存在領域がナンバープレートの４個のコーナー点のうちの何れのコーナー点を含む領域であるかを示す情報（以下では「コーナー点第二特定情報」と呼ぶことがある）と、コーナー点第二特定情報によって示されるコーナー点存在領域の中心座標を示す情報（以下では「中心座標情報」と呼ぶことがある）とを検出部２３へ出力する学習済みモデルである。そして、検出部２３は、コーナー点第二特定情報と中心座標情報とに基づいて、ナンバープレートにおける４個のコーナー点存在領域を検出するとともに、検出した各コーナー点存在領域の中心座標であるコーナー１Ｓ（ｘ,ｙ）、コーナー２Ｓ（ｘ,ｙ）、コーナー３Ｓ（ｘ,ｙ）、コーナー４Ｓ（ｘ,ｙ）を検出する。また、検出分類モデルは、検出対象画像が入力される入力層と、出力層と、入力層から出力層までの何れかの層であって出力層以外の層に属する第一要素と、第一要素と第一要素の重みとに基づいて値が算出される第二要素とを有する。そして、検出分類モデルは、入力層に入力された検出対象画像に対し、出力層以外の各層に属する各要素を第一要素として、第一要素と第一要素の重みとに基づく演算を行うことにより中心座標情報を出力する。 Here, in the detection classification model generated by the learning model generation device 10, the detection target image is input, the corner point existence region is specified in the detection target image, and the specified corner point existence region is four number plates. Information indicating which of the corner points includes the corner point (hereinafter, may be referred to as "corner point second specific information") and the corner point existence area indicated by the corner point second specific information. This is a trained model that outputs information indicating the center coordinates of the above (hereinafter, may be referred to as “center coordinate information”) to the detection unit 23. Then, the detection unit 23 detects the four corner point existence areas on the license plate based on the corner point second specific information and the center coordinate information, and also detects the corners which are the center coordinates of each of the detected corner point existence areas. 1S (x, y), corner 2S (x, y), corner 3S (x, y), and corner 4S (x, y) are detected. Further, the detection classification model includes an input layer into which an image to be detected is input, an output layer, a first element which is any layer from the input layer to the output layer and belongs to a layer other than the output layer, and the first element. It has a second element whose value is calculated based on the element and the weight of the first element. Then, the detection classification model performs an operation based on the first element and the weight of the first element with each element belonging to each layer other than the output layer as the first element for the detection target image input to the input layer. Outputs the center coordinate information by.

＜検出分類モデルの動作＞
図１２及び図１３は、実施例２の検出分類モデルの動作の説明に供する図である。 <Operation of detection classification model>
12 and 13 are diagrams for explaining the operation of the detection classification model of the second embodiment.

図１２に示すように、検出分類モデルは、検出分類モデルに入力された検出対象画像を「Ｍ×Ｎ個」の複数のグリッドに分割し、各グリッドを順次「注目グリッドＧＲ」に設定しながら、以下のように動作する。すなわち、検出分類モデルは、図１２に示すように、矩形領域ａ，ｂ，ｃの各領域の中心座標（以下では「矩形領域中心座標」と呼ぶことがある）が注目グリッドＧＲ内に含まれるという条件の下で、矩形領域ａ，ｂ，ｃの中にナンバープレートのコーナー点が発見されるように、矩形領域ａ，ｂ，ｃを検出する。一つの注目グリッドＧＲに対して検出される矩形領域の数は「アンカー数」として予め設定されている。ここでは、一例として、「アンカー数＝３」として、一つの注目グリッドＧＲに対して矩形領域ａ，ｂ，ｃの３個の矩形領域が検出される場合について説明する。検出された矩形領域（以下では「検出矩形領域」と呼ぶことがある）は、以下の式（１）のように表される。 As shown in FIG. 12, in the detection classification model, the detection target image input to the detection classification model is divided into a plurality of "M × N" grids, and each grid is sequentially set to the "attention grid GR". , Works as follows. That is, as shown in FIG. 12, the detection classification model includes the center coordinates of each of the rectangular regions a, b, and c (hereinafter, may be referred to as "rectangular region center coordinates") in the attention grid GR. Under the condition, the rectangular regions a, b, and c are detected so that the corner points of the license plate are found in the rectangular regions a, b, and c. The number of rectangular regions detected for one attention grid GR is preset as the “number of anchors”. Here, as an example, a case where three rectangular regions a, b, and c are detected for one attention grid GR will be described with “the number of anchors = 3”. The detected rectangular area (hereinafter, may be referred to as “detected rectangular area”) is expressed by the following equation (1).

検出矩形領域＝
（ｘ座標，ｙ座標，幅ｗ，高さｈ，確度，
クラス確率（コーナー１，コーナー２，コーナー３，コーナー４））…式（１） Detection rectangular area =
(X coordinate, y coordinate, width w, height h, accuracy,
Class probability (corner 1, corner 2, corner 3, corner 4)) ... Equation (1)

式（１）において、コーナー１〜４はそれぞれ、上記のように、ナンバープレートの左上のコーナー点、右上のコーナー点、右下のコーナー点、左下のコーナー点に対応する。また、式（１）において、「幅ｗ」及び「高さｈ」は、予め決められた値である。また、式（１）における「ｘ座標」及び「ｙ座標」は、図１３に示すように、注目グリッドＧＲの原点を基準にした相対座標（つまり、ローカル座標）である。また、式（１）における「確度」は、検出矩形領域内に何れかのコーナー点が含まれる確度を表す。また、式（１）における「クラス確率」は、４個のコーナー点それぞれの、検出矩形領域内における存在確率を表す。 In the formula (1), the corners 1 to 4 correspond to the upper left corner point, the upper right corner point, the lower right corner point, and the lower left corner point of the license plate, respectively, as described above. Further, in the equation (1), the "width w" and the "height h" are predetermined values. Further, the "x coordinate" and the "y coordinate" in the equation (1) are relative coordinates (that is, local coordinates) with respect to the origin of the grid GR of interest, as shown in FIG. Further, the "probability" in the equation (1) represents the probability that any corner point is included in the detection rectangular area. Further, the "class probability" in the equation (1) represents the existence probability of each of the four corner points in the detection rectangular area.

例えば、図１２に示す注目グリッドＧＲに対する検出矩形領域ａ，ｂ，ｃは、以下の式（２）〜（４）のように表される。 For example, the detection rectangular regions a, b, and c for the attention grid GR shown in FIG. 12 are expressed by the following equations (2) to (4).

検出矩形領域ａ
＝（0.12,0.87,0.49,0.64,0.589,(0,0,0.9999,0)）…式（２）
検出矩形領域ｂ
＝（0.16,0.77,1.77,2.11,0.010,(0,0.0001,0.9998,0.0001)）…式（３）
検出矩形領域ｃ
＝（0.33,0.73,5.45,6.56,0.016,(0.54,0.23,0.05,0.19)）…式（４） Detection rectangular area a
= (0.12,0.87,0.49,0.64,0.589, (0,0,0.9999,0))… Equation (2)
Detection rectangular area b
= (0.16,0.77,1.77,2.11,0.010, (0,0.0001,0.9998,0.0001))… Equation (3)
Detection rectangular area c
= (0.33,0.73,5.45,6.56,0.016, (0.54,0.23,0.05,0.19))… Equation (4)

よって、例えば「確度」の閾値を「0.5」とすると、確度が0.5以上であるのは検出矩形領域ａであり、検出矩形領域ｂ，ｃについては確度が0.5未満であるため、図１２に示す注目グリッドＧＲに対しては、検出矩形領域ｂ，ｃは除外され、検出矩形領域ａが、右下のコーナー点に対するコーナー点存在領域候補として選択される。 Therefore, for example, assuming that the threshold value of "accuracy" is "0.5", it is the detection rectangular area a that has an accuracy of 0.5 or more, and the accuracy of the detection rectangular areas b and c is less than 0.5, which is shown in FIG. For the grid GR of interest, the detection rectangle areas b and c are excluded, and the detection rectangle area a is selected as a corner point existence area candidate for the lower right corner point.

検出分類モデルは、すべてのグリッドを注目グリッドＧＲに順次設定しながら以上の動作を行う。そして、検出分類モデルは、各コーナー点について、確度が最も高い検出矩形領域を最終的なコーナー点存在領域に決定する。そして、検出分類モデルは、各コーナー点について、確度が最も高い検出矩形領域を示す情報をコーナー点第二特定情報として出力し、矩形領域中心座標を示す情報を中心座標情報として出力する。 The detection classification model performs the above operation while sequentially setting all the grids to the attention grid GR. Then, the detection classification model determines the detection rectangular region having the highest accuracy as the final corner point existence region for each corner point. Then, the detection classification model outputs the information indicating the detection rectangular area having the highest accuracy as the second specific information of the corner points for each corner point, and outputs the information indicating the center coordinates of the rectangular area as the center coordinate information.

＜ローカル座標から画像座標への変換＞
検出部２３は、例えば、以下の式（５）に従って検出矩形領域のローカル座標であるコーナー１Ｓ（ｘ,ｙ）、コーナー２Ｓ（ｘ,ｙ）、コーナー３Ｓ（ｘ,ｙ）、コーナー４Ｓ（ｘ,ｙ）を、画像座標であるコーナー１Ｓ（ｘ',ｙ'）、コーナー２Ｓ（ｘ',ｙ'）、コーナー３Ｓ（ｘ',ｙ'）、コーナー４Ｓ（ｘ',ｙ'）に変換する。よって、画像座標であるコーナー１Ｓ（ｘ',ｙ'）、コーナー２Ｓ（ｘ',ｙ'）、コーナー３Ｓ（ｘ',ｙ'）、コーナー４Ｓ（ｘ',ｙ'）がそれぞれ、検出部２３から出力される座標ＣＯ１，ＣＯ２，ＣＯ３，ＣＯ４（図１１）に相当する。式（５）において、「Ｗ」は検出対象画像の幅を表し、「Ｈ」は検出対象画像の高さを表す。また、式（５）において、「ｕ」は注目グリッドＧＲのｘ方向の位置を表し、「ｖ」は注目グリッドＧＲのｙ方向の位置を表す。 <Conversion from local coordinates to image coordinates>
For example, the detection unit 23 has corners 1S (x, y), corners 2S (x, y), corners 3S (x, y), and corners 4S (x), which are local coordinates of the detection rectangular region, according to the following equation (5). , Y) is converted into corner 1S (x', y'), corner 2S (x', y'), corner 3S (x', y'), and corner 4S (x', y'), which are image coordinates. To do. Therefore, the corner 1S (x', y'), the corner 2S (x', y'), the corner 3S (x', y'), and the corner 4S (x', y'), which are the image coordinates, are the detection units, respectively. It corresponds to the coordinates CO1, CO2, CO3, and CO4 (FIG. 11) output from 23. In the formula (5), "W" represents the width of the detection target image, and "H" represents the height of the detection target image. Further, in the equation (5), "u" represents the position of the attention grid GR in the x direction, and "v" represents the position of the attention grid GR in the y direction.

ｘ'＝Ｗ（ｕ＋ｘ）／Ｍ
ｙ'＝Ｈ（ｖ＋ｙ）／Ｎ …式（５） x'= W (u + x) / M
y'= H (v + y) / N ... Equation (5)

例えば、検出対象画像のサイズが（Ｗ,Ｈ）＝（1322,902）であり、グリッドの総数が（Ｍ×Ｎ）＝（13×13）個であった場合、座標変換前後の座標値は図１４に示すようになる。図１４は、実施例２の座標変換の一例を示す図である。 For example, if the size of the image to be detected is (W, H) = (1322,902) and the total number of grids is (M × N) = (13 × 13), the coordinate values before and after the coordinate conversion are As shown in FIG. FIG. 14 is a diagram showing an example of the coordinate transformation of the second embodiment.

以上のように、実施例２では、記憶部２２は、学習済みモデルとしての検出分類モデルを記憶する。検出分類モデルは、ナンバープレートのコーナー点が中心に位置するポジティブ画像ＰＩを用いて機械学習により生成された学習済みモデルである。また、検出分類モデルは、検出対象画像においてナンバープレートのコーナー点が中心に位置する領域の情報（コーナー点第二特定情報、中心座標情報）を出力する。検出部２３は、ナンバープレートが撮影された検出対象画像に対して、検出分類モデルを用いてナンバープレートのコーナー点が中心に位置する領域を検出し、検出対象画像におけるナンバープレートのコーナー点の座標を検出する。つまり、検出分類モデルは、検出対象画像においてナンバープレートのコーナー点が中心に位置する領域の情報（コーナー点第二特定情報、中心座標情報）を出力することにより、検出対象画像におけるナンバープレートのコーナー点の座標を検出する検出部２３に、ナンバープレートのコーナー点が中心に位置する領域を検出させる。 As described above, in the second embodiment, the storage unit 22 stores the detection classification model as the learned model. The detection classification model is a trained model generated by machine learning using a positive image PI in which the corner points of the license plate are located in the center. In addition, the detection classification model outputs information on a region in which the corner point of the license plate is located at the center in the image to be detected (corner point second specific information, center coordinate information). The detection unit 23 detects the region where the corner point of the license plate is located at the center of the detection target image on which the license plate is captured by using the detection classification model, and the coordinates of the corner point of the license plate in the detection target image. Is detected. That is, the detection classification model outputs the information of the region where the license plate corner point is located at the center in the detection target image (corner point second specific information, center coordinate information), so that the license plate corner in the detection target image is output. The detection unit 23 that detects the coordinates of the points is made to detect the region where the corner point of the license plate is located at the center.

より詳細には、検出分類モデルは、検出対象画像が入力されて、検出対象画像においてナンバープレートのコーナー点が中心に位置する領域を特定し、特定した領域がナンバープレートの複数の４個のコーナー点のうちの何れのコーナー点を含む領域であるかを示す情報である情報（コーナー点第二特定情報）と、コーナー点第二特定情報によって示される領域の中心座標を示す情報（中心座標情報）とを出力する。検出部２３は、検出対象画像に対して検出分類モデルを用いて、コーナー点存在領域を検出するとともに、コーナー点存在領域の中心座標を検出し、検出した中心座標を座標変換した上で検出対象画像におけるナンバープレートのコーナー点の座標として検出する。 More specifically, in the detection classification model, the detection target image is input, the region where the corner point of the license plate is located at the center in the detection target image is specified, and the specified region is a plurality of four corners of the license plate. Information indicating which of the points the area includes the corner point (corner point second specific information) and information indicating the center coordinates of the area indicated by the corner point second specific information (center coordinate information). ) And is output. The detection unit 23 detects the corner point existence region using the detection classification model for the detection target image, detects the center coordinates of the corner point existence region, converts the detected center coordinates into coordinates, and then detects the detection target. It is detected as the coordinates of the corner points of the license plate in the image.

また、実施例２の検出分類モデルの機能は、実施例１の検出モデルと分類モデルの両者の機能を一体化したものに相当するため、コーナー点の座標を検出する際の演算規模は、実施例２の方が実施例１よりも小さくなる。 Further, since the function of the detection classification model of the second embodiment corresponds to the one in which the functions of both the detection model and the classification model of the first embodiment are integrated, the calculation scale when detecting the coordinates of the corner points is carried out. Example 2 is smaller than Example 1.

［実施例３］
＜文字認識装置の構成＞
図１５は、実施例３の文字認識装置の構成例を示す図である。図１５において、文字認識装置３０は、座標検出装置２０と、補正部３１と、認識部３２とを有する。 [Example 3]
<Configuration of character recognition device>
FIG. 15 is a diagram showing a configuration example of the character recognition device of the third embodiment. In FIG. 15, the character recognition device 30 includes a coordinate detection device 20, a correction unit 31, and a recognition unit 32.

文字認識装置３０に入力された検出対象画像は、座標検出装置２０及び補正部３１に入力される。例えば、検出対象画像には、実施例１，２と同様に、自動車の矩形のナンバープレートが撮影されている。 The detection target image input to the character recognition device 30 is input to the coordinate detection device 20 and the correction unit 31. For example, in the image to be detected, a rectangular license plate of an automobile is photographed as in the first and second embodiments.

座標検出装置２０は、実施例１または実施例２で説明したようにして、検出対象画像において、ナンバープレートの４個のコーナー点の座標ＣＯ１，ＣＯ２，ＣＯ３，ＣＯ４を検出し、検出結果として補正部３１へ出力する。 As described in the first or second embodiment, the coordinate detection device 20 detects the coordinates CO1, CO2, CO3, and CO4 of the four corner points of the license plate in the image to be detected, and corrects them as the detection result. Output to unit 31.

補正部３１は、コーナー点の座標ＣＯ１，ＣＯ２，ＣＯ３，ＣＯ４に基づいて、検出対象画像に撮影されているナンバープレートの矩形の歪みをナンバープレートの文字も一体として補正し、歪みが補正された後の矩形を有するナンバープレートの画像を認識部３２へ出力する。すなわち、補正部３１は、４個のコーナー点の座標ＣＯ１，ＣＯ２，ＣＯ３，ＣＯ４の外縁を直線で結ぶことにより形成される矩形の歪みを補正する。 The correction unit 31 corrects the rectangular distortion of the license plate captured in the detection target image together with the characters of the license plate based on the coordinates CO1, CO2, CO3, and CO4 of the corner points, and the distortion is corrected. The image of the license plate having the latter rectangle is output to the recognition unit 32. That is, the correction unit 31 corrects the distortion of the rectangle formed by connecting the outer edges of the coordinates CO1, CO2, CO3, and CO4 of the four corner points with a straight line.

認識部３２は、歪みが補正された後の矩形の中に存在する文字を認識し、認識結果を出力する。文字の認識は、例えばＯＣＲ（Optical Character Recognition）により行われる。 The recognition unit 32 recognizes the characters existing in the rectangle after the distortion is corrected, and outputs the recognition result. Character recognition is performed by, for example, OCR (Optical Character Recognition).

＜補正部及び認識部の動作＞
図１６は、実施例３の補正部及び認識部の動作の説明に供する図である。 <Operation of correction unit and recognition unit>
FIG. 16 is a diagram for explaining the operation of the correction unit and the recognition unit according to the third embodiment.

図１６に示すように、検出対象画像と、ナンバープレートの４個のコーナー点の座標ＣＯ１，ＣＯ２，ＣＯ３，ＣＯ４とを入力された補正部３１は、座標ＣＯ１，ＣＯ２，ＣＯ３，ＣＯ４に基づいて、周知の技術である「透視投影変換」を利用して、検出対象画像に撮影されているナンバープレートの矩形の歪みを補正する。すなわち、補正部３１は、座標ＣＯ１，ＣＯ２，ＣＯ３，ＣＯ４にそれぞれ対応する複数のコーナー点で囲まれる領域の形状の歪みを、透視投影変換を用いて補正する。この補正により、検出対象画像に撮影されているナンバープレートの形状が例えば台形であった場合でも、ナンバープレートの形状は長方形に補正される。また、ナンバープレートの矩形の歪みの補正に伴って、同時に、ナンバープレート内の文字の歪みも補正される。 As shown in FIG. 16, the correction unit 31 in which the detection target image and the coordinates CO1, CO2, CO3, CO4 of the four corner points of the license plate are input is based on the coordinates CO1, CO2, CO3, CO4. , The distortion of the rectangular shape of the license plate captured in the image to be detected is corrected by using the well-known technique "perspective projection conversion". That is, the correction unit 31 corrects the distortion of the shape of the region surrounded by the plurality of corner points corresponding to the coordinates CO1, CO2, CO3, and CO4 by using the perspective projection transformation. By this correction, even if the shape of the license plate captured in the detection target image is, for example, a trapezoid, the shape of the license plate is corrected to a rectangle. In addition, along with the correction of the distortion of the rectangle of the license plate, the distortion of the characters in the license plate is also corrected at the same time.

図１７は、実施例３の透視投影変換の一例を示す図である。図１７は、ナンバープレートの矩形の実寸を330×165［mm］とした場合の座標変換例である。透視投影変換により、座標ＣＯ１（ｘ',ｙ'），ＣＯ２（ｘ',ｙ'），ＣＯ３（ｘ',ｙ'），ＣＯ４（ｘ',ｙ'）が、座標ＣＯ１（ｘ'',ｙ''），ＣＯ２（ｘ'',ｙ''），ＣＯ３（ｘ'',ｙ''），ＣＯ４（ｘ'',ｙ''）に補正される。 FIG. 17 is a diagram showing an example of the perspective projection conversion of the third embodiment. FIG. 17 is an example of coordinate conversion when the actual size of the rectangle of the license plate is 330 × 165 [mm]. By fluoroscopic projection conversion, the coordinates CO1 (x', y'), CO2 (x', y'), CO3 (x', y'), and CO4 (x', y') are converted to the coordinates CO1 (x'', y'). It is corrected to y''), CO2 (x'', y''), CO3 (x'', y''), CO4 (x'', y'').

以上のように、実施例３では、文字認識装置３０は、座標検出装置２０と、補正部３１と、認識部３２とを有する。座標検出装置２０は、検出対象画像におけるナンバープレートの４個のコーナー点の座標ＣＯ１，ＣＯ２，ＣＯ３，ＣＯ４を検出する。補正部３１は、座標ＣＯ１，ＣＯ２，ＣＯ３，ＣＯ４にそれぞれ対応する複数のコーナー点で囲まれる領域の形状の歪みを補正する。認識部３２は、形状の歪みが補正された後の領域の中に存在する文字を認識する。 As described above, in the third embodiment, the character recognition device 30 includes the coordinate detection device 20, the correction unit 31, and the recognition unit 32. The coordinate detection device 20 detects the coordinates CO1, CO2, CO3, and CO4 of the four corner points of the license plate in the image to be detected. The correction unit 31 corrects the distortion of the shape of the region surrounded by the plurality of corner points corresponding to the coordinates CO1, CO2, CO3, and CO4, respectively. The recognition unit 32 recognizes the characters existing in the region after the distortion of the shape is corrected.

こうすることで、ナンバープレート内の文字の認識精度を高めることができる。また、実施例３では、検出対象画像におけるナンバープレートの形状の検出を機械学習を用いて行う一方で、機械学習による形状の検出と分離して、形状の補正及び文字認識を行っている。このため、ナンバープレート内の歪んだ文字をそのまま機械学習を用いて認識する場合よりも、用意する教師データの数が少なくて済むとともに、機械学習の学習時間を短縮することができ、また、演算量を減少させることができるので、より低スペックなプロセッサを用いて機械学習を行うことができる。 By doing so, the recognition accuracy of the characters in the license plate can be improved. Further, in the third embodiment, the shape of the license plate in the image to be detected is detected by using machine learning, while the shape is corrected and the character is recognized separately from the shape detection by machine learning. Therefore, the number of teacher data to be prepared can be reduced, the learning time of machine learning can be shortened, and the calculation can be performed, as compared with the case where the distorted characters in the number plate are recognized as they are by using machine learning. Since the amount can be reduced, machine learning can be performed using a processor with lower specifications.

［実施例４］
＜画像処理装置の構成＞
図１８は、実施例４の画像処理装置の構成例を示す図である。図１８において、画像処理装置４０は、座標検出装置２０と、補正部４１と、記憶部４２と、重畳部４３と、逆変換部４４とを有する。 [Example 4]
<Configuration of image processing device>
FIG. 18 is a diagram showing a configuration example of the image processing device of the fourth embodiment. In FIG. 18, the image processing device 40 includes a coordinate detection device 20, a correction unit 41, a storage unit 42, a superposition unit 43, and an inverse conversion unit 44.

画像処理装置４０に入力された検出対象画像は、座標検出装置２０及び補正部４１に入力される。 The detection target image input to the image processing device 40 is input to the coordinate detection device 20 and the correction unit 41.

座標検出装置２０は、実施例１または実施例２で説明したようにして、検出対象画像において、検出対象物体のコーナー点の座標を検出し、検出結果として補正部４１へ出力する。 As described in the first or second embodiment, the coordinate detection device 20 detects the coordinates of the corner points of the detection target object in the detection target image, and outputs the detection result to the correction unit 41.

補正部４１は、座標検出装置２０で検出されたコーナー点の座標に基づいて、周知の技術である「透視投影変換」を利用して、検出対象画像に撮影されている検出対象物体の形状の歪みを補正し、歪みが補正された後の形状を有する検出対象物体の画像（以下では「形状補正後画像」と呼ぶことがある）を重畳部４３へ出力する。すなわち、補正部４１は、複数のコーナー点の座標の外縁を直線で結ぶことにより形成される形状の歪みを補正する。 The correction unit 41 uses the well-known technique "perspective projection conversion" based on the coordinates of the corner points detected by the coordinate detection device 20 to form the shape of the detection target object captured in the detection target image. The distortion is corrected, and an image of the object to be detected having the shape after the distortion is corrected (hereinafter, may be referred to as a “shape-corrected image”) is output to the superimposing unit 43. That is, the correction unit 41 corrects the distortion of the shape formed by connecting the outer edges of the coordinates of the plurality of corner points with a straight line.

記憶部４２には、検出対象物体の画像と重畳されるコンテンツ（以下では「重畳コンテンツ」と呼ぶことがある）が予め記憶されている。記憶部４２に記憶されている重畳コンテンツは歪みの無いコンテンツである。 The storage unit 42 stores in advance content that is superimposed on the image of the object to be detected (hereinafter, may be referred to as “superimposed content”). The superimposed content stored in the storage unit 42 is content without distortion.

重畳部４３は、記憶部４２から重畳コンテンツを取得し、取得した重畳コンテンツを形状補正後画像に重畳し、重畳コンテンツが形状補正後画像に重畳された後の画像（以下では「重畳後画像」と呼ぶことがある）を逆変換部４４へ出力する。 The superimposition unit 43 acquires the superimposition content from the storage unit 42, superimposes the acquired superimposition content on the shape-corrected image, and superimposes the superimposition content on the shape-corrected image (hereinafter, “superimposition post-image”). Is sometimes called) is output to the inverse conversion unit 44.

逆変換部４４は、補正部４１で行われた透視投影変換と逆の変換を重畳後画像に対して行って重畳後画像の形状を歪ませ、形状を歪ませた後の重畳後画像を重畳結果として出力する。 The inverse conversion unit 44 performs a transformation opposite to the perspective projection conversion performed by the correction unit 41 on the superimposed image to distort the shape of the superimposed image, and superimposes the superimposed image after the shape is distorted. Output as a result.

＜画像処理装置の動作＞
図１９は、実施例４の画像処理装置の動作の説明に供する図である。 <Operation of image processing device>
FIG. 19 is a diagram for explaining the operation of the image processing apparatus of the fourth embodiment.

図１９に示すように、座標検出装置２０は、検出対象画像において、検出対象物体のコーナー点の座標を検出する。図１９では、一例として、検出対象物体の形状が「星形」であるとする。 As shown in FIG. 19, the coordinate detection device 20 detects the coordinates of the corner points of the detection target object in the detection target image. In FIG. 19, as an example, it is assumed that the shape of the object to be detected is a “star”.

補正部４１は、座標検出装置２０で検出されたコーナー点の座標に基づいて透視投影変換を行って星形の画像の歪みを補正する。すなわち、補正部４１は、座標検出装置２０で検出された複数の座標にそれぞれ対応する複数のコーナー点で囲まれる領域の形状の歪みを、透視投影変換を用いて補正する。この補正により、星形の画像内に設定されたマーカーの歪みも補正される。 The correction unit 41 corrects the distortion of the star-shaped image by performing perspective projection conversion based on the coordinates of the corner points detected by the coordinate detection device 20. That is, the correction unit 41 corrects the distortion of the shape of the region surrounded by the plurality of corner points corresponding to the plurality of coordinates detected by the coordinate detection device 20 by using the perspective projection conversion. This correction also corrects the distortion of the markers set in the star-shaped image.

重畳部４３は、記憶部４２から取得した重畳コンテンツを、星形の画像内に設定されたマーカーを基準にして、歪み補正後の星形の画像に重畳する。 The superimposing unit 43 superimposes the superimposing content acquired from the storage unit 42 on the distortion-corrected star-shaped image with reference to the marker set in the star-shaped image.

逆変換部４４は、補正部４１で行われた透視投影変換と逆の変換を重畳後画像に対して行って重畳後画像の形状を歪ませる。 The inverse conversion unit 44 performs a transformation opposite to the perspective projection conversion performed by the correction unit 41 on the superimposed image to distort the shape of the superimposed image.

以上のように、実施例４では、画像処理装置４０は、座標検出装置２０と、補正部４１と、重畳部４３とを有する。座標検出装置２０は、検出対象画像における検出対象物体の複数のコーナー点の座標を検出する。補正部４１は、座標検出装置２０で検出された複数の座標にそれぞれ対応する複数のコーナー点で囲まれる領域の形状の歪みを補正する。重畳部４３は、歪みが補正された後の形状に所定のコンテンツを重畳する。 As described above, in the fourth embodiment, the image processing device 40 includes the coordinate detection device 20, the correction unit 41, and the superimposition unit 43. The coordinate detection device 20 detects the coordinates of a plurality of corner points of the detection target object in the detection target image. The correction unit 41 corrects the distortion of the shape of the region surrounded by the plurality of corner points corresponding to the plurality of coordinates detected by the coordinate detection device 20. The superimposing unit 43 superimposes a predetermined content on the shape after the distortion is corrected.

こうすることで、歪みを補正した後の検出対象物体の画像にコンテンツを重畳することができるため、例えば、ＡＲ（Augmented Reality）における正確な表現が可能となる。 By doing so, the content can be superimposed on the image of the object to be detected after the distortion is corrected, so that accurate representation in AR (Augmented Reality) is possible, for example.

［実施例５］
物体形状検出システム１が対象とする検出対象物体は、ナンバープレートに限定されない。例えば、検出対象物体は、道路標記等であっても良い。図２０及び図２１は、実施例５の検出対象物体の一例を示す図である。図２０に示す道路標識には、３個のコーナー点が存在する。また、図２１に示す道路標識には、８個のコーナー点が存在する。 [Example 5]
The object to be detected by the object shape detection system 1 is not limited to the license plate. For example, the object to be detected may be a road marking or the like. 20 and 21 are diagrams showing an example of the object to be detected in the fifth embodiment. The road sign shown in FIG. 20 has three corner points. Further, the road sign shown in FIG. 21 has eight corner points.

また、コーナー点は、検出対象物体上に存在する「規定点」の一例であり、物体形状検出システム１が対象とする規定点は、コーナー点に限定されない。例えば、円形の道路標識の円周上に任意の規定点が設定されても良い。すなわち、物体形状検出システム１が対象とする規定点は、検出対象画像における検出対象物体の外縁上に存在する何れかの点であれば良い。 Further, the corner point is an example of a "specified point" existing on the object to be detected, and the specified point targeted by the object shape detection system 1 is not limited to the corner point. For example, an arbitrary defining point may be set on the circumference of a circular road sign. That is, the specified point targeted by the object shape detection system 1 may be any point existing on the outer edge of the object to be detected in the image to be detected.

さらに、規定点として、検出対象画像におけるナンバープレートのネジ、ナンバープレートの封印の中心、ナンバープレート内の“・”や“−”の文字等を採用しても良い。 Further, as the specified points, the screw of the license plate in the image to be detected, the center of the seal of the license plate, the characters "・" and "-" in the license plate, and the like may be adopted.

［他の実施例］
［１］記憶部１３，２２，４２は、ハードウェアとして、例えば、メモリ、ＨＤＤ（Hard Disk Drive）、ＳＳＤ（Solid State Drive）等により実現される。記憶部１３，２２，４２を実現するメモリの一例として、ＳＤＲＡＭ（Synchronous Dynamic Random Access Memory）等のＲＡＭ（Random Access Memory）、ＲＯＭ（Read Only Memory）、フラッシュメモリ等が挙げられる。データセット生成部１１、学習モデル生成部１２、検出部２３、補正部３１，４１、認識部３２、重畳部４３及び逆変換部４４は、ハードウェアとして、例えばプロセッサにより実現することができる。データセット生成部１１、学習モデル生成部１２、検出部２３、補正部３１，４１、認識部３２、重畳部４３及び逆変換部４４を実現するプロセッサの一例として、ＣＰＵ（Central Processing Unit）、ＤＳＰ（Digital Signal Processor）、ＦＰＧＡ（Field Programmable Gate Array）、ＡＳＩＣ（Application Specific Integrated Circuit）等が挙げられる。また、データセット生成部１１、学習モデル生成部１２、検出部２３、補正部３１，４１、認識部３２、重畳部４３及び逆変換部４４は、プロセッサと周辺回路とを含むＬＳＩ（Large Scale Integrated circuit）によって実現されても良い。出力部１４及び取得部２１は、ハードウェアとして、例えば、無線通信モジュールまたはネットワークインタフェースモジュールにより実現される。よって例えば、学習モデル生成装置１０は、パーソナルコンピュータやサーバ等のコンピュータ装置として実現される。また例えば、座標検出装置２０、文字認識装置３０または画像処理装置４０は、スマートフォンやタブレット端末等のスマートデバイスとして実現される。 [Other Examples]
[1] The storage units 13, 22, and 42 are realized as hardware by, for example, a memory, an HDD (Hard Disk Drive), an SSD (Solid State Drive), or the like. Examples of the memory that realizes the storage units 13, 22, and 42 include RAM (Random Access Memory) such as SDRAM (Synchronous Dynamic Random Access Memory), ROM (Read Only Memory), and flash memory. The data set generation unit 11, the learning model generation unit 12, the detection unit 23, the correction units 31, 41, the recognition unit 32, the superimposition unit 43, and the inverse conversion unit 44 can be realized as hardware, for example, by a processor. As an example of a processor that realizes a data set generation unit 11, a learning model generation unit 12, a detection unit 23, a correction unit 31, 41, a recognition unit 32, a superimposition unit 43, and an inverse conversion unit 44, a CPU (Central Processing Unit) and a DSP (Digital Signal Processor), FPGA (Field Programmable Gate Array), ASIC (Application Specific Integrated Circuit) and the like. Further, the data set generation unit 11, the learning model generation unit 12, the detection unit 23, the correction units 31, 41, the recognition unit 32, the superimposition unit 43, and the inverse conversion unit 44 are LSIs (Large Scale Integrated) including a processor and peripheral circuits. It may be realized by circuit). The output unit 14 and the acquisition unit 21 are realized as hardware by, for example, a wireless communication module or a network interface module. Therefore, for example, the learning model generation device 10 is realized as a computer device such as a personal computer or a server. Further, for example, the coordinate detection device 20, the character recognition device 30, or the image processing device 40 is realized as a smart device such as a smartphone or a tablet terminal.

［２］物体形状検出システム１での上記説明における各処理の全部または一部は、各処理に対応するプログラムを物体形状検出システム１が有するプロセッサに実行させることによって実現してもよい。例えば、上記説明における各処理に対応するプログラムがメモリに記憶され、プログラムがプロセッサによってメモリから読み出されて実行されても良い。また、プログラムは、任意のネットワークを介して物体形状検出システム１に接続されたプログラムサーバに記憶され、そのプログラムサーバから物体形状検出システム１にダウンロードされて実行されたり、物体形状検出システム１が読み取り可能な記録媒体に記憶され、その記録媒体から読み出されて実行されても良い。物体形状検出システム１が読み取り可能な記録媒体には、例えば、メモリーカード、ＵＳＢメモリ、ＳＤカード、フレキシブルディスク、光磁気ディスク、ＣＤ−ＲＯＭ、ＤＶＤ、及び、Ｂｌｕ−ｒａｙ（登録商標）ディスク等の可搬の記憶媒体が含まれる。また、プログラムは、任意の言語や任意の記述方法にて記述されたデータ処理方法であり、ソースコードやバイナリコード等の形式を問わない。また、プログラムは必ずしも単一的に構成されるものに限られず、複数のモジュールや複数のライブラリとして分散構成されるものや、ＯＳに代表される別個のプログラムと協働してその機能を達成するものも含む。 [2] All or part of each process in the above description in the object shape detection system 1 may be realized by causing a processor included in the object shape detection system 1 to execute a program corresponding to each process. For example, the program corresponding to each process in the above description may be stored in the memory, and the program may be read from the memory by the processor and executed. Further, the program is stored in a program server connected to the object shape detection system 1 via an arbitrary network, downloaded from the program server to the object shape detection system 1 and executed, or read by the object shape detection system 1. It may be stored in a possible recording medium, read from the recording medium, and executed. Recording media that can be read by the object shape detection system 1 include, for example, memory cards, USB memory, SD cards, flexible discs, magneto-optical discs, CD-ROMs, DVDs, Blu-ray (registered trademark) discs, and the like. Includes portable storage media. The program is a data processing method described in an arbitrary language or an arbitrary description method, and may be in any format such as source code or binary code. In addition, the program is not necessarily limited to a single program, but is distributed as multiple modules or multiple libraries, or cooperates with a separate program represented by the OS to achieve its function. Including things.

［３］物体形状検出システム１の分散・統合の具体的形態は図示するものに限られず、物体形状検出システム１の全部または一部を、各種の付加等に応じて、または、機能負荷に応じて、任意の単位で機能的または物理的に分散・統合して構成することができる。 [3] The specific form of dispersion / integration of the object shape detection system 1 is not limited to the one shown in the drawing, and all or a part of the object shape detection system 1 can be added according to various additions or functional loads. Therefore, it can be functionally or physically distributed / integrated in any unit.

［４］実施例３，４における補正部３１，４１は、透視投影変換以外の技術を用いて形状の歪みを補正しても良い。 [4] The correction units 31 and 41 in Examples 3 and 4 may correct the distortion of the shape by using a technique other than the perspective projection conversion.

１物体形状検出システム
１０学習モデル生成装置
２０座標検出装置
１１データセット生成部
１２学習モデル生成部
２３検出部
３１，４１補正部
３２認識部
４３重畳部 1 Object shape detection system 10 Learning model generation device 20 Coordinate detection device 11 Data set generation unit 12 Learning model generation unit 23 Detection unit 31, 41 Correction unit 32 Recognition unit 43 Superimposition unit

Claims

A first trained model and a second trained model generated by machine learning using an image including a specified point of an object, and an input image in which the object is photographed is input, and the input image is described as described above. The first trained model that outputs the first information that is information indicating the candidate of the region including the specified point of the object and the first information are input, and the candidate indicated by the first information is the object. A storage unit that stores a second trained model that outputs second information, which is information indicating which of the plurality of specified points is included in the specified point.
Using a pre-Symbol first learned model to the input image in which the object was photographed to detect the candidate, a defined point of the object using the relative detected the candidate second trained model A detection unit that detects a region including the region and detects the coordinates of a specified point of the object in the input image based on the region detected by using the second trained model.
A coordinate detection device comprising.

The coordinate detection device according to claim 1 and
A correction unit that corrects the distortion of the shape of the region surrounded by the plurality of specified points corresponding to the plurality of coordinates detected by the coordinate detection device, and the correction unit.
A recognition unit that recognizes characters existing in the region after the distortion of the shape is corrected, and
A character recognition device comprising.

The coordinate detection device according to claim 1 and
A correction unit that corrects the distortion of the shape of the region surrounded by the plurality of specified points corresponding to the plurality of coordinates detected by the coordinate detection device, and the correction unit.
A superimposing portion that superimposes a predetermined content on the shape after the distortion is corrected, and
An image processing device comprising.

A trained model formed from a first trained model and a second trained model generated by machine learning using an image containing a defined point of an object.
The first trained model is input with an input image in which the object is photographed, and outputs first information which is information indicating a candidate for a region including a specified point of the object in the input image.
The second trained model is information indicating which of the plurality of specified points of the object is included in the candidate indicated by the first information when the first information is input. Output some second information,
Trained model.