JP7359380B2

JP7359380B2 - Detection parameter generation device, detection parameter generation method, detection parameter generation program, object detection device, object detection method, and object detection program

Info

Publication number: JP7359380B2
Application number: JP2019143964A
Authority: JP
Inventors: 匠市村; 真鎌田; 直多羅尾; 亮山口
Original assignee: Chudenko Corp
Current assignee: Chudenko Corp
Priority date: 2019-08-05
Filing date: 2019-08-05
Publication date: 2023-10-11
Anticipated expiration: 2039-08-05
Also published as: JP2021026506A

Description

本発明は、検出パラメタ生成装置、検出パラメタ生成方法、検出パラメタ生成プログラム、オブジェクト検出装置、オブジェクト検出方法、およびオブジェクト検出プログラムに関し、より詳しくは、構造適応型ＤＢＮを用いて画像からオブジェクトを検出するための検出パラメタを生成する検出パラメタ生成装置、検出方法および検出プログラム、ならびに、生成された検出パラメタを用いてオブジェクトを検出するオブジェクト検出装置、検出方法および検出プログラムに関する。 The present invention relates to a detection parameter generation device, a detection parameter generation method, a detection parameter generation program, an object detection device, an object detection method, and an object detection program, and more specifically, to a detection parameter generation device, a detection parameter generation method, a detection parameter generation program, and an object detection program. The present invention relates to a detection parameter generation device, a detection method, and a detection program that generate detection parameters for an object, and an object detection device, a detection method, and a detection program that detect an object using the generated detection parameters.

従来、構造適応型深層学習法が知られている（非特許文献１～３、特許文献１）。この手法によれば、制限付きボルツマンマシン（ＲＢＭ）を多段に重ねてディープ・ビリーフ・ネットワーク（ＤＢＮ）を構築する際に最適な隠れニューロン数および隠れ層数が自動的に求められる。これにより、学習用データに基づいて最適な構造のニューラルネットワークを構築することが可能である。この構造適応型深層学習法により構築された構造適応型ＤＢＮは、複数のベンチマークテストにおいて、既存の畳み込みニューラルネットワーク（ＣＮＮ）よりも高い分類精度を示している（非特許文献４）。 Conventionally, structure adaptive deep learning methods have been known (Non-Patent Documents 1 to 3, Patent Document 1). According to this method, when constructing a deep belief network (DBN) by stacking restricted Boltzmann machines (RBM) in multiple stages, the optimal number of hidden neurons and hidden layers are automatically determined. Thereby, it is possible to construct a neural network with an optimal structure based on the learning data. A structure-adaptive DBN constructed using this structure-adaptive deep learning method has shown higher classification accuracy than existing convolutional neural networks (CNNs) in multiple benchmark tests (Non-Patent Document 4).

なお、分類精度の改善手法としてファインチューニング（ＦｉｎｅＴｕｎｉｎｇ）法が知られている（例えば、非特許文献５）。この手法では、学習後のネットワークの各層の入出力パターンの頻度に応じて、誤って分類された事例が正しく分類されるようにネットワークの重みを修正する。 Note that a fine tuning method is known as a method for improving classification accuracy (for example, Non-Patent Document 5). In this method, the weights of the network are modified according to the frequency of input/output patterns of each layer of the network after learning so that incorrectly classified cases are classified correctly.

本発明者らは、胸部Ｘ線画像に係る公開データベース（ＣｈｅｓｔＸ－Ｒａｙ８：ＣＸＲ８）を用いて学習された構造適応型ＤＢＮを用いて、癌の有無を高い精度で検出可能なアルゴリズムを提案している（非特許文献６）。 The present inventors have proposed an algorithm that can detect the presence or absence of cancer with high accuracy using a structure-adaptive DBN trained using a public database of chest X-ray images (ChestX-Ray8: CXR8). (Non-patent Document 6).

このアルゴリズムでは、まず、検出対象のオブジェクトが含まれる画像を入力し、当該画像を複数の領域に沿って分割して複数の分割画像を得る。そして、分割画像を学習済みの構造適応型ＤＢＮに与えて、出力層から出力される値を取得する。各ニューロンから出力される値は、ソフトマックス関数により正規化されており、オブジェクトの検出確率を示す。 In this algorithm, first, an image containing an object to be detected is input, and the image is divided along a plurality of regions to obtain a plurality of divided images. Then, the divided images are given to the learned structure-adaptive DBN to obtain the values output from the output layer. The value output from each neuron is normalized by a softmax function and indicates the probability of object detection.

次に、あるオブジェクトに対応するニューロンから出力された値（確率）が当該オブジェクトに対して設定された閾値（第１の検出パラメタ）よりも大きいかどうかを判定する。そして、出力された値が第１の検出パラメタよりも大きい場合、分割画像に基づいて変形画像を作成し、変形画像を学習済みの構造適応型ＤＢＮに与えて、出力層から出力される値を取得する。出力された値（確率）が当該オブジェクトに対して設定された別の閾値（第２の検出パラメタ）よりも大きいかどうかを判定する。そして、出力された値が第２の検出パラメタより大きい場合に、当該オブジェクトを検出したと最終的に判定する。なお、第１および第２の検出パラメタは、０より大きく、１より小さい値であり、値が小さいほどオブジェクトは検出され易くなる。 Next, it is determined whether a value (probability) output from a neuron corresponding to a certain object is larger than a threshold (first detection parameter) set for the object. Then, if the output value is larger than the first detection parameter, a deformed image is created based on the divided image, the deformed image is given to the learned structure adaptive DBN, and the value output from the output layer is get. It is determined whether the output value (probability) is larger than another threshold (second detection parameter) set for the object. Then, if the output value is larger than the second detection parameter, it is finally determined that the object has been detected. Note that the first and second detection parameters have values greater than 0 and less than 1, and the smaller the value, the easier the object is to be detected.

特開２０１９－７４９４６号公報JP2019-74946A

S.Kamada and T.Ichimura, An Adaptive Learning Method of Restricted Boltzmann Machine by Neuron Generation and Annihilation Algorithm, Proc. of 2016 IEEE International Conference on Systems, Man, and Cybernetics (IEEE SMC 2016), pp.1273-1278 (2016)S. Kamada and T. Ichimura, An Adaptive Learning Method of Restricted Boltzmann Machine by Neuron Generation and Annihilation Algorithm, Proc. of 2016 IEEE International Conference on Systems, Man, and Cybernetics (IEEE SMC 2016), pp.1273-1278 (2016 ) S.Kamada and T.Ichimura, A Structural Learning Method of Restricted Boltzmann Machine by Neuron Generation and Annihilation Algorithm, Neural Information Processing, Lecture Notes in Computer Science (LNCS, vol.9950), pp.372-380 (2016)S.Kamada and T.Ichimura, A Structural Learning Method of Restricted Boltzmann Machine by Neuron Generation and Annihilation Algorithm, Neural Information Processing, Lecture Notes in Computer Science (LNCS, vol.9950), pp.372-380 (2016) S.Kamada and T.Ichimura, An Adaptive Learning Method of Deep Belief Network by Layer Generation Algorithm, Proc. of 2016 IEEE Region 10 Conference (TENCON), pp.2971-2974 (2016)S.Kamada and T.Ichimura, An Adaptive Learning Method of Deep Belief Network by Layer Generation Algorithm, Proc. of 2016 IEEE Region 10 Conference (TENCON), pp.2971-2974 (2016) S.Kamada, T.Ichimura, Akira Hara, and Kenneth J. Mackin, Adaptive Structure Learning Method of Deep Belief Network using Neuron Generation-Annihilation and Layer Generation, Neural Computing and Applications, doi.org/10.1007/s00521-018-3622-y, pp.1-15 (2018)S. Kamada, T. Ichimura, Akira Hara, and Kenneth J. Mackin, Adaptive Structure Learning Method of Deep Belief Network using Neuron Generation-Annihilation and Layer Generation, Neural Computing and Applications, doi.org/10.1007/s00521-018-3622 -y, pp.1-15 (2018) S.Kamada and T.Ichimura, Fine Tuning of Adaptive Learning of Deep Belief Network for Misclassification and its Knowledge Acquisition, International Journal Computational Intelligence Studies, Vol.6, No.4, pp.333-348 (2017)S.Kamada and T.Ichimura, Fine Tuning of Adaptive Learning of Deep Belief Network for Misclassification and its Knowledge Acquisition, International Journal Computational Intelligence Studies, Vol.6, No.4, pp.333-348 (2017) 市村匠，鎌田真，“ChestX-ray8 を用いた構造適応型Deep Belief Network による胸部疾患位置検出システム”，計測自動制御学会システム・情報部門学術講演会2018 講演論文集(SSI2018) (2018)Takumi Ichimura, Makoto Kamata, “Chest disease location detection system using structure-adaptive deep belief network using ChestX-ray8”, Proceedings of the Society of Instrument and Control Engineers, Systems and Information Division Academic Conference 2018 (SSI2018) (2018)

前述のオブジェクト検出アルゴリズムを用いることにより、胸部Ｘ線画像については高い検出精度が得られることが確認されている。 It has been confirmed that high detection accuracy can be obtained for chest X-ray images by using the object detection algorithm described above.

しかしながら、本発明者らによるさらなる研究によれば、画像の種類によっては、高い検出精度を得ることが難しい場合があることがわかってきた。例えば建築設備図面に係る画像の場合、高い検出精度が得られない場合がある。 However, further research by the present inventors has revealed that it may be difficult to obtain high detection accuracy depending on the type of image. For example, in the case of images related to architectural equipment drawings, high detection accuracy may not be obtained.

建築設備図面には、照明、コンセント、トイレ等を示す様々な種類の記号が含まれる。また、記号ごとに頻出度や希少度といった特徴が異なる。ここで、頻出度とは、ある画像（図面）におけるオブジェクト（記号）の出現頻度に関する度合を示し、希少度とは、複数枚の画像におけるオブジェクトの希少価値に関する度合を示す。希少度は、オブジェクトが特定の図面にしか出現しない場合に高くなる。 Building equipment drawings include various types of symbols that indicate lighting, outlets, toilets, etc. Furthermore, each symbol has different characteristics such as frequency and rarity. Here, the frequency refers to the degree to which an object (symbol) appears in a certain image (drawing), and the rarity refers to the degree to which the object is rare in a plurality of images. Rarity is high when an object only appears in a specific drawing.

従来のオブジェクト検出アルゴリズムでは、第１および第２の検出パラメタは各オブジェクトに共通の値として与えられていた。このため、従来のアルゴリズムでは、図面に含まれる各種記号を高い精度で検出することは困難であった。試行錯誤的に検出パラメタを探索することで高い検出精度が得られることもあるが、非常に手間がかかり効率的でなかった。 In conventional object detection algorithms, the first and second detection parameters are given as common values to each object. Therefore, with conventional algorithms, it has been difficult to detect various symbols included in drawings with high accuracy. High detection accuracy can sometimes be obtained by searching for detection parameters through trial and error, but this is extremely time-consuming and inefficient.

本発明は、上記のような技術的認識に基づいてなされたものであり、その目的は、特徴の異なる複数種類のオブジェクトを高い精度で検出することを可能とする検出パラメタを容易に生成できる検出パラメタ生成装置、検出パラメタ生成方法および検出パラメタ生成プログラムを提供すること、ならびに、生成された検出パラメタを用いて特徴の異なる複数種類のオブジェクトを高い精度で検出できるオブジェクト検出装置、オブジェクト検出方法およびオブジェクト検出プログラムを提供することである。 The present invention has been made based on the above-mentioned technical recognition, and its purpose is to easily generate detection parameters that make it possible to detect multiple types of objects with different characteristics with high accuracy. To provide a parameter generation device, a detection parameter generation method, and a detection parameter generation program, and an object detection device, an object detection method, and an object that can detect multiple types of objects with different characteristics with high accuracy using the generated detection parameters. The purpose is to provide a detection program.

本発明に係る検出パラメタ生成装置は、
構造適応型ＤＢＮを用いて画像からオブジェクトを検出するための検出パラメタを生成する検出パラメタ生成装置であって、
複数の画像を有するデータセットのうちの一つの画像における、あるオブジェクトの頻出度を示す第１の指標を求める頻出度指標算出部と、
前記データセットにおける前記オブジェクトの希少度を示す第２の指標を求める希少度指標算出部と、
前記第１の指標と前記第２の指標との積に基づいて、検出用画像を分割して得られた分割画像に前記オブジェクトが存在するかどうかを判定するための第１の検出パラメタと、前記分割画像に基づいて作成された変形画像に前記オブジェクトが存在するかどうかを判定するための第２の検出パラメタとを更新する検出パラメタ更新部と、
を備えることを特徴とする。 The detection parameter generation device according to the present invention includes:
A detection parameter generation device that generates detection parameters for detecting an object from an image using a structure adaptive DBN, comprising:
a frequency index calculation unit that calculates a first index indicating the frequency of a certain object in one image of a dataset having a plurality of images;
a rarity index calculation unit that calculates a second index indicating the rarity of the object in the data set;
a first detection parameter for determining whether the object is present in a divided image obtained by dividing the detection image based on the product of the first index and the second index; a detection parameter updating unit that updates a second detection parameter for determining whether the object is present in the deformed image created based on the divided image;
It is characterized by having the following.

また、前記検出パラメタ生成装置において、
前記検出パラメタ更新部は、前記第１の指標と前記第２の指標との積が大きくなるにつれて前記第１の検出パラメタおよび前記第２の検出パラメタを小さくするようにしてもよい。 Further, in the detection parameter generation device,
The detection parameter updating unit may decrease the first detection parameter and the second detection parameter as the product of the first index and the second index increases.

また、前記検出パラメタ生成装置において、
前記画像は建築設備図面に係る画像であり、前記オブジェクトは、建築設備図面の記号であるようにしてもよい。 Further, in the detection parameter generation device,
The image may be an image related to a building equipment drawing, and the object may be a symbol of the building equipment drawing.

本発明に係るオブジェクト検出装置は、
検出用画像を入力する画像入力部と、
前記検出用画像を複数の分割画像に分割する画像分割部と、
前記分割画像を構造適応型ＤＢＮの入力層に与えて、前記構造適応型ＤＢＮの出力層に出力された、あるオブジェクトに対する第１の確率を取得する第１の確率取得部と、
前記第１の確率が前記検出パラメタ生成装置で生成された第１の検出パラメタよりも大きければ、前記分割画像に基づいて変形画像を作成する変形画像作成部と、
前記変形画像を前記構造適応型ＤＢＮの入力層に与えて、前記構造適応型ＤＢＮの出力層に出力された前記オブジェクトに対する第２の確率を取得する第２の確率取得部と、
前記第２の確率が前記検出パラメタ生成装置で生成された第２の検出パラメタよりも大きければ、前記オブジェクトを検出したと判定する判定部と、
を備えることを特徴とする。 The object detection device according to the present invention includes:
an image input section for inputting a detection image;
an image dividing unit that divides the detection image into a plurality of divided images;
a first probability acquisition unit that applies the divided image to an input layer of a structure adaptive DBN and obtains a first probability for a certain object output to an output layer of the structure adaptive DBN;
a deformed image creation unit that creates a deformed image based on the divided image if the first probability is greater than the first detection parameter generated by the detection parameter generation device;
a second probability acquisition unit that applies the deformed image to the input layer of the structure-adaptive DBN to obtain a second probability for the object output to the output layer of the structure-adaptive DBN;
a determination unit that determines that the object has been detected if the second probability is greater than the second detection parameter generated by the detection parameter generation device;
It is characterized by having the following.

本発明によれば、特徴の異なる複数種類のオブジェクトを高い精度で検出することを可能とする検出パラメタを容易に生成することができる。 According to the present invention, it is possible to easily generate detection parameters that enable highly accurate detection of multiple types of objects with different characteristics.

また、本発明によれば、前記検出パラメタを用いて、特徴の異なる複数種類のオブジェクトを高い精度で検出することができる。 Further, according to the present invention, multiple types of objects with different characteristics can be detected with high accuracy using the detection parameters.

実施形態に係るオブジェクト検出に係る全体処理フローの概略を示す図である。FIG. 2 is a diagram schematically showing an overall processing flow related to object detection according to the embodiment. 構造適応型ＤＢＮの構築に用いた訓練データおよびテストデータの概要を示す図である。FIG. 2 is a diagram showing an overview of training data and test data used to construct a structure-adaptive DBN. 構造適応型ＤＢＮの構築に用いたカテゴリ別の記号の例を示す図である。FIG. 3 is a diagram showing an example of symbols classified by category used in constructing a structure-adaptive DBN. 構築された構造適応型ＤＢＮの例を模式的に示す図である。FIG. 2 is a diagram schematically showing an example of a constructed structure-adaptive DBN. 構築された構造適応型ＤＢＮの分類精度を示す図である。FIG. 3 is a diagram showing the classification accuracy of the constructed structure-adaptive DBN. 実施形態に係る検出パラメタ生成装置の概略的構成を示す図である。1 is a diagram showing a schematic configuration of a detection parameter generation device according to an embodiment. 実施形態に係る図面データベースの一例を示す図である。It is a figure showing an example of a drawing database concerning an embodiment. 実施形態に係る検出パラメタ生成方法を示すフローチャートである。3 is a flowchart illustrating a detection parameter generation method according to an embodiment. 実施形態に係るオブジェクト検出装置の概略的構成を示す図である。1 is a diagram showing a schematic configuration of an object detection device according to an embodiment. 建築設備図面の画像の一例を示す図である。It is a figure which shows an example of the image of a building equipment drawing. 変形画像の作成方法について説明するための図である。FIG. 3 is a diagram for explaining a method of creating a deformed image. 実施形態に係るオブジェクト検出方法を示すフローチャートである。3 is a flowchart illustrating an object detection method according to an embodiment. 従来（手動調整）と実施形態（自動調整）による記号の検出精度を示す図である。It is a figure which shows the detection accuracy of the symbol by the conventional (manual adjustment) and embodiment (automatic adjustment). 記号を正しく検出できなかった場合の例を示す図である。FIG. 6 is a diagram illustrating an example of a case where a symbol cannot be detected correctly.

以下、本発明に係る実施形態について図面を参照しながら説明する。 Embodiments according to the present invention will be described below with reference to the drawings.

＜全体処理フロー＞
図１を参照して、本実施形態に係る、構造適応型ＤＢＮを用いたオブジェクト検出に係る全体処理フローについて説明する。 <Overall processing flow>
With reference to FIG. 1, an overall processing flow related to object detection using a structure-adaptive DBN according to this embodiment will be described.

なお、本実施形態では、オブジェクトを含む画像は、各種記号を含む建築設備図面に係る画像である。ただし、本発明はこれに限られるものでなく、建築設備図面以外の線図のほか、その他検出対象のオブジェクトを含む各種画像に適用することが可能である。 Note that in this embodiment, the image including the object is an image related to a building equipment drawing including various symbols. However, the present invention is not limited to this, and can be applied to line drawings other than architectural equipment drawings, as well as various other images containing objects to be detected.

ステップＳ１では、構造適応型ＤＢＮを構築する。本ステップは、本発明者らが開発した公知の方法（非特許文献１～３等）により、構造適応型ＤＢＮの構築が行われる。訓練データ（学習用データ）およびテストデータ（評価用データ）として、複数の分類対象に係る記号の画像が用いられる。ニューラルネットワークの入力層の各ニューロンには、記号画像の各ピクセルのデータが入力される。なお、分類精度をさらに向上させるためにファインチューニングを行ってもよい。 In step S1, a structure adaptive DBN is constructed. In this step, a structure-adaptive DBN is constructed by a known method developed by the present inventors (non-patent documents 1 to 3, etc.). Images of symbols related to a plurality of classification targets are used as training data (learning data) and test data (evaluation data). Data for each pixel of the symbol image is input to each neuron in the input layer of the neural network. Note that fine tuning may be performed to further improve classification accuracy.

図２は、ステップＳ１で用いられる訓練データの数、および構造適応型ＤＢＮを評価するためのテストデータの数をカテゴリ別に示している。なお、学習データを増やすため、データ増強（ＤａｔａＡｕｇｍｅｎｔ）を行ってデータ数を増やした。具体的には、元の記号画像に対して、左右反転、上下反転、９０度回転、１９０度回転、上下移動、拡大および縮小の操作を行うことでデータ数を約１２倍に増やした。図２に示す数は、データ増強後のデータ数である。 FIG. 2 shows, by category, the number of training data used in step S1 and the number of test data for evaluating the structure-adaptive DBN. In addition, in order to increase the learning data, data augmentation was performed to increase the number of data. Specifically, the number of data was increased approximately 12 times by performing operations such as horizontal flip, vertical flip, 90 degree rotation, 190 degree rotation, vertical movement, enlargement, and reduction on the original symbol image. The numbers shown in FIG. 2 are the numbers of data after data augmentation.

図３は、カテゴリ別の記号の例を示している。記号Ｍ１は、カテゴリ「衛生器具」に属する記号画像の一例である。記号Ｍ２は、カテゴリ「電灯」に属する記号画像の一例である。記号Ｍ３は、カテゴリ「コンセント」に属する記号画像の一例である。記号Ｍ４は、カテゴリ「非常灯・誘導灯」に属する記号画像の一例である。記号Ｍ５は、カテゴリ「インターホン」に属する記号画像の一例である。 FIG. 3 shows examples of symbols for each category. The symbol M1 is an example of a symbol image belonging to the category "sanitary utensils." Symbol M2 is an example of a symbol image belonging to the category "electric light." Symbol M3 is an example of a symbol image belonging to the category "outlet". Symbol M4 is an example of a symbol image belonging to the category "emergency lights/guidance lights." Symbol M5 is an example of a symbol image belonging to the category "intercom."

図４は、構築された構造適応型ＤＢＮの例を模式的に示している。出力層は、カテゴリ数に対応してニューロンＮ１，Ｎ２，Ｎ３，Ｎ４，Ｎ５を有している。ニューロンＮ１～Ｎ５の出力値は、総和が１となるようにソフトマックス関数等により正規化される。ニューロンＮ１，Ｎ２，Ｎ３，Ｎ４，Ｎ５の出力値は、対応付けられたオブジェクトの確率を示す。例えば、ニューロンＮ１の出力値が０．８の場合、入力層に入力された記号画像がカテゴリ「衛生器具」である確率は８０％である。 FIG. 4 schematically shows an example of the constructed structure-adaptive DBN. The output layer has neurons N1, N2, N3, N4, and N5 corresponding to the number of categories. The output values of the neurons N1 to N5 are normalized by a softmax function or the like so that the sum becomes 1. The output values of neurons N1, N2, N3, N4, and N5 indicate the probability of the associated object. For example, when the output value of neuron N1 is 0.8, the probability that the symbol image input to the input layer is in the category "sanitary utensils" is 80%.

訓練データを用いて構造適応型ＤＢＮの構築後、分類精度の評価を行う。図５は、構造適応型ＤＢＮの評価結果を示している。ここでは、訓練データ、テストデータ、およびファインチューニング後におけるテストデータに対する分類精度をそれぞれカテゴリ別に示している。なお、テストデータの分類精度における括弧の数値は、分類できなかったデータの数を示している。 After constructing a structure-adaptive DBN using training data, the classification accuracy is evaluated. FIG. 5 shows the evaluation results of the structure-adaptive DBN. Here, the classification accuracy for training data, test data, and test data after fine-tuning is shown for each category. Note that the numerical value in parentheses in the classification accuracy of test data indicates the number of data that could not be classified.

図５に示すように、訓練データに対する正答率は、すべてのカテゴリで１００．０％となった。一方、テストデータについては、９５．６％以上となった。また、ファインチューニング法により分類精度の改善を試みたところ、すべてのカテゴリに対して１００．０％の分類精度が得られた。 As shown in FIG. 5, the correct answer rate for the training data was 100.0% in all categories. On the other hand, the test data was over 95.6%. Furthermore, when an attempt was made to improve the classification accuracy using a fine tuning method, a classification accuracy of 100.0% was obtained for all categories.

ステップＳ２では、建築設備図面の記号を検出するための検出パラメタ（第１の検出パラメタおよび第２の検出パラメタ）の生成を行う。ここで、第１の検出パラメタ（Ｔ１_ｉ）は、オブジェクト検出用の画像（検出用画像）を分割して得られた分割画像にｉ番目のオブジェクト（オブジェクトｉ）が存在するかどうかを判定するための閾値である。第２の検出パラメタ（Ｔ２_ｉ）は、分割画像に基づいて作成された変形画像にオブジェクトｉが存在するかどうかを判定するための閾値である。検出パラメタの生成については、後ほど図６～図８を参照して詳しく説明する。 In step S2, detection parameters (a first detection parameter and a second detection parameter) for detecting symbols on building equipment drawings are generated. Here, the first detection parameter (T1 _i ) determines whether the i-th object (object i) exists in the divided image obtained by dividing the image for object detection (detection image). This is the threshold for The second detection parameter (T2 _i ) is a threshold value for determining whether object i exists in the deformed image created based on the divided images. Generation of detection parameters will be explained in detail later with reference to FIGS. 6 to 8.

ステップＳ３では、ステップＳ２で生成された検出パラメタを用いて、検出用画像からオブジェクトを検出する。本ステップＳ３では、ステップＳ２で生成された第１および第２の検出パラメタを、画像中にオブジェクトが存在するかどうかを判定する際の閾値として用いる。オブジェクト検出については、後ほど図９～図１４を参照して詳しく説明する。 In step S3, an object is detected from the detection image using the detection parameters generated in step S2. In step S3, the first and second detection parameters generated in step S2 are used as thresholds for determining whether an object exists in the image. Object detection will be described in detail later with reference to FIGS. 9 to 14.

＜検出パラメタ生成装置＞
本実施形態に係る検出パラメタ生成装置１について、図６を参照して説明する。以下に説明するように、検出パラメタ生成装置１は、ステップＳ１で構築された構造適応型ＤＢＮを用いて画像からオブジェクトを検出するための検出パラメタを生成（調整）するように構成されている。 <Detection parameter generation device>
The detection parameter generation device 1 according to this embodiment will be described with reference to FIG. 6. As described below, the detection parameter generation device 1 is configured to generate (adjust) detection parameters for detecting objects from images using the structure-adaptive DBN constructed in step S1.

検出パラメタ生成装置１は、図６に示すように、制御部１０と、通信部２０と、操作入力部３０と、表示部４０と、記憶部５０とを備えている。 As shown in FIG. 6, the detection parameter generation device 1 includes a control section 10, a communication section 20, an operation input section 30, a display section 40, and a storage section 50.

制御部１０は、検出パラメタ生成装置１の動作を制御するものであり、ハードウェアとしてはＣＰＵ（中央処理装置）等のプロセッサにより構成される。本実施形態では、制御部１０は、検出パラメタ生成装置１内のプロセッサが所定のプログラムを実行することにより実現される。なお、制御部１０の少なくとも一部がＡＳＩＣ等のハードウェアにより構成されてもよい。制御部１０の詳細については後ほど詳しく説明する。 The control unit 10 controls the operation of the detection parameter generation device 1, and is constituted by a processor such as a CPU (central processing unit) as hardware. In this embodiment, the control unit 10 is realized by a processor in the detection parameter generation device 1 executing a predetermined program. Note that at least a portion of the control unit 10 may be configured by hardware such as an ASIC. Details of the control unit 10 will be explained in detail later.

通信部２０は、検出パラメタ生成装置１と外部の情報処理装置（図示せず）との間で情報を送受信する。なお、通信部２０による通信は、有線・無線の別を問わず、また通信プロトコルも限定されない。 The communication unit 20 transmits and receives information between the detection parameter generation device 1 and an external information processing device (not shown). Note that the communication by the communication unit 20 may be wired or wireless, and the communication protocol is not limited.

操作入力部３０は、ユーザが検出パラメタ生成装置１に情報を入力するためのインターフェースであり、例えば、キーボード、マウス、タッチパネル、ボタン等である。操作入力部３０は、ユーザから検出パラメタの初期値や検出パラメタの生成指示などを受け付ける。 The operation input unit 30 is an interface for a user to input information to the detection parameter generation device 1, and is, for example, a keyboard, a mouse, a touch panel, a button, or the like. The operation input unit 30 receives initial values of detection parameters, detection parameter generation instructions, etc. from the user.

表示部４０は、ユーザへ各種情報（生成された検出パラメタの値など）を出力するインターフェースである。この表示部４０は、例えば、映像を表示するディスプレイ（液晶ディスプレイ、有機ＥＬディスプレイ等）である。 The display unit 40 is an interface that outputs various information (such as generated detection parameter values) to the user. The display unit 40 is, for example, a display (liquid crystal display, organic EL display, etc.) that displays images.

記憶部５０は、ハードディスクまたは半導体メモリなどから構成される記憶装置である。この記憶部５０には、図面データベース５１が記憶されている。なお、記憶部５０には、制御部１０による情報処理に必要なプログラムや各種データが記憶されてもよい。 The storage unit 50 is a storage device composed of a hard disk, a semiconductor memory, or the like. This storage unit 50 stores a drawing database 51. Note that the storage unit 50 may store programs and various data necessary for information processing by the control unit 10.

図面データベース５１は、複数の建築設備図面に係る画像を有するデータセットから抽出された、各図面における記号ごとの出現数（描画数）を格納したデータベースである。図面データベース５１は、人が図面画像を見て作成される。なお、部品ごとの頻出度や希少度の傾向が誤っていなければ、図面データベース５１の部品数は必ずしも厳密な値でなくてもよい。 The drawing database 51 is a database that stores the number of appearances (number of drawings) of each symbol in each drawing, which is extracted from a data set having images related to a plurality of building equipment drawings. The drawing database 51 is created by a person viewing a drawing image. Note that the number of parts in the drawing database 51 does not necessarily have to be a precise value, as long as the frequency and rarity trends of each part are correct.

図７は、図面データベース５１の一例を示している。図面データベース５１には、Ｊ枚の建築設備図面の各々について、部品（建築設備図面の記号）ごとの数が格納されている。なお、Ｉは部品の種類の総数である。すなわち、図面データベース５１には、図面ｊ（１≦ｊ≦Ｊ）に描画されている部品ｉ（１≦ｉ≦Ｉ）の数が格納されている。右端のカラムには、各図面について部品の総数が格納されている。なお、このカラムは必須ではなく、制御部１０が演算により算出してもよい。
次に、制御部１０について詳しく説明する。 FIG. 7 shows an example of the drawing database 51. The drawing database 51 stores the number of each part (symbol of the building equipment drawing) for each of the J building equipment drawings. Note that I is the total number of types of parts. That is, the drawing database 51 stores the number of parts i (1≦i≦I) drawn in drawing j (1≦j≦J). The rightmost column stores the total number of parts for each drawing. Note that this column is not essential, and may be calculated by the control unit 10.
Next, the control section 10 will be explained in detail.

制御部１０は、図６に示すように、データ入力部１１と、頻出度指標算出部１２と、希少度指標算出部１３と、検出パラメタ更新部１４とを有する。なお、制御部１０が有する各機能部は、通信接続された複数の情報処理装置に分散して設けられ、これら複数の情報処理装置が協働することによって制御部１０の機能が実現されてもよい。 As shown in FIG. 6, the control unit 10 includes a data input unit 11, a frequency index calculation unit 12, a rarity index calculation unit 13, and a detection parameter update unit 14. Note that each functional unit included in the control unit 10 is provided in a distributed manner in a plurality of information processing devices that are communicatively connected, and the functions of the control unit 10 are realized by cooperation of these plurality of information processing devices. good.

データ入力部１１は、第１の検出パラメタの初期値および第２の検出パラメタの初期値を取得する。第１の検出パラメタの初期値（T1_i,1）と第２の検出パラメタの初期値（T2_i,1）は、T1_i,1≦T2_i,1の関係であることが好ましい。なお、初期値としては、オブジェクトの種類に依存しない共通の値が取得されてもよいし、オブジェクト種類ごとの値が取得されてもよい。 The data input unit 11 acquires the initial value of the first detection parameter and the initial value of the second detection parameter. It is preferable that the initial value of the first detection parameter (T1 _i,1 ) and the initial value of the second detection parameter (T2 _i,1 ) have a relationship of T1 _i,1 ≦T2 _i,1 . Note that as the initial value, a common value that does not depend on the type of object may be acquired, or a value for each type of object may be acquired.

頻出度指標算出部１２は、前記データセットのうちの一つの画像中における、あるオブジェクト（オブジェクトｉ）の頻出度を示す第１の指標を求める。本実施形態では、頻出度指標算出部１２は、図面データベース５１を参照し、式（１）により第１の指標x_i,jを算出する。第１の指標x_i,jは、画像ｊの全オブジェクトに対するオブジェクトｉの割合を示す。

The frequency index calculation unit 12 calculates a first index indicating the frequency of a certain object (object i) in one image of the data set. In this embodiment, the frequency index calculation unit 12 refers to the drawing database 51 and calculates the first index x _i,j using equation (1). The first index x _i,j indicates the proportion of object i to all objects in image j.

ここで、x_i,j：第１の指標、n_i,j:画像ｊにおけるオブジェクトiの数、I：前記データセットにおけるオブジェクトの種類の総数である。 Here, x _i,j is the first index, n _i,j is the number of objects i in image j, and I is the total number of object types in the data set.

例えば、ある建築設備図面に係る画像ｊに１００個の部品が含まれ、部品ｉの数が０個の場合、第１の指標x_i,jは０となる。一方、画像ｊに１００個の部品が含まれ、そのすべてが部品ｉの場合、第１の指標x_i,jは１となる。このように第１の指標x_i,jは、画像ｊにおけるオブジェクトｉの頻出度に応じて、０以上１以下の値をとる。 For example, if an image j related to a certain building equipment drawing includes 100 parts and the number of parts i is 0, the first index x _i,j is 0. On the other hand, if image j includes 100 parts, all of which are part i, the first index x _i,j is 1. In this way, the first index x _i,j takes a value of 0 or more and 1 or less depending on the frequency of object i in image j.

なお、頻出度指標算出部１２は、式（１）の右辺の分母として、図７の図面データベース５１の右欄のカラムに格納された値を使用する。右欄のカラムがない場合、頻出度指標算出部１２が部品１、部品２、・・・、部品Ｉの総和を算出してもよい。 Note that the frequency index calculation unit 12 uses the value stored in the right column of the drawing database 51 in FIG. 7 as the denominator on the right side of equation (1). If there is no column in the right column, the frequency index calculation unit 12 may calculate the sum of parts 1, 2, . . . , part I.

希少度指標算出部１３は、前記データセット（Ｊ枚の画像）におけるオブジェクトｉの希少度を示す第２の指標を求める。第２の指標y_iは、前記データセットにおいてオブジェクトｉを含む画像の数が少ないほど大きい値を返す関数により求められる。本実施形態では、希少度指標算出部１３は、図面データベース５１を参照し、式（２）により第２の指標を算出する。

The rarity index calculation unit 13 calculates a second index indicating the rarity of object i in the data set (J images). The second index y _i is determined by a function that returns a larger value as the number of images containing object i in the data set is smaller. In this embodiment, the rarity index calculation unit 13 refers to the drawing database 51 and calculates the second index using equation (2).

ここで、y_i：第２の指標、J:前記データセットにおける画像の総数、A_i：オブジェクトｉを含む画像の数である。 Here, y _i is the second index, J is the total number of images in the data set, and A _i is the number of images containing object i.

例えば、建築設備図面に係る画像の総数が１０枚で、そのうちの１枚の画像にのみ部品ｉが含まれる場合、第２の指標y_iは１となる。一方、図面の総数が１０枚で、すべての図面に部品ｉが含まれる場合、第２の指標y_iは０となる。このように第２の指標y_iは、データセットに含まれる複数の画像におけるオブジェクトｉの希少度に応じて、０以上ｌｏｇ｜Ｊ｜以下の値をとる。 For example, if the total number of images related to architectural equipment drawings is 10, and only one of the images includes part i, the second index y _i is 1. On the other hand, if the total number of drawings is 10 and all drawings include part i, the second index y _i will be 0. In this way, the second index y _i takes a value greater than or equal to 0 and less than or equal to log|J|, depending on the rarity of the object i in the plurality of images included in the data set.

検出パラメタ更新部１４は、第１の指標と第２の指標との積に基づいて、第１の検出パラメタ（Ｔ１_ｉ）と、第２の検出パラメタ（Ｔ２_ｉ）とを更新する。 The detection parameter updating unit 14 updates the first detection parameter (T1 _i ) and the second detection parameter (T2 _i ) based on the product of the first index and the second index.

より詳しくは、検出パラメタ更新部１４は、第１の指標と第２の指標との積が大きくなるにつれて、第１の検出パラメタおよび第２の検出パラメタを小さくする。本実施形態では、検出パラメタ更新部１４は、式（３）により第１の検出パラメタを更新する。

More specifically, the detection parameter updating unit 14 decreases the first detection parameter and the second detection parameter as the product of the first index and the second index increases. In this embodiment, the detection parameter update unit 14 updates the first detection parameter using equation (3).

ここで、T1_i,j+1：オブジェクトｉの更新後の第１の検出パラメタ、T1_i,j：オブジェクトｉの更新前の第１の検出パラメタ、z_i,jは第１の指標と第２の指標との積（＝x_i,j×y_i）である。 Here, T1 _i,j+1 : the first detection parameter of object i after update, T1 _i,j : the first detection parameter of object i before update, z _i,j are the first index and 2 with the index (=x _i,j ×y _i ).

同様に、検出パラメタ更新部１４は、式（４）により第２の検出パラメタを更新する。

Similarly, the detection parameter updating unit 14 updates the second detection parameter using equation (4).

ここで、T2_i,j+1：オブジェクトｉの更新後の第２の検出パラメタ、T2_i,j：オブジェクトｉの更新前の第２の検出パラメタ、z_i,jは第１の指標と第２の指標との積（＝x_i,j×y_i）である。 Here, T2 _i,j+1 : second detection parameter after update of object i, T2 _i,j : second detection parameter before update of object i, z _i,j are the first index and 2 with the index (=x _i,j ×y _i ).

式（３）および式（４）によれば、オブジェクトｉが頻出かつ希少であるほど（すなわち、z_i,jが大きいほど）、更新後の第１および第２の検出パラメタの値は小さくなるため、当該オブジェクトを検出し易くなる。一般的には検出パラメタを小さくすると誤検出のおそれがあるが、本実施形態では、第１の指標と第２の指標の積に基づいて検出パラメタを調整することで、検出精度の向上を実現している。実際の検出結果については、後ほど図１３を参照して説明する。 According to equations (3) and (4), the more frequent and rare object i is (that is, the larger z _i,j is), the smaller the updated values of the first and second detection parameters will be. Therefore, it becomes easier to detect the object. Generally, if the detection parameter is made small, there is a risk of false detection, but in this embodiment, detection accuracy is improved by adjusting the detection parameter based on the product of the first index and the second index. are doing. The actual detection results will be explained later with reference to FIG. 13.

なお、式（３）、式（４）は一例に過ぎない。例えば、式（５）に示すように、関数ｆを用いてz_i,jを計算してもよい。関数ｆは、増加関数であり、例えば指数関数、対数関数、多項式関数である。

Note that equations (3) and (4) are just examples. For example, as shown in equation (5), z _i,j may be calculated using the function f. The function f is an increasing function, such as an exponential function, a logarithmic function, or a polynomial function.

また、z_i,jは、式（６）に示すように、関数ｆおよび関数ｇを用いて第１の指標x_i,jおよび第２の指標y_iをそれぞれ変換した値の積として求められてもよい。関数ｆ，関数ｇは、増加関数であり、例えば指数関数、対数関数、多項式関数である。本願において「第１の指標と第２の指標との積（に基づいて）」との文言は、第１の指標に基づく値と第２の指標に基づく値との積も含む。

In addition, z _i,j is obtained as the product of the values obtained by converting the first index x _i,j and the second index y _i using the function f and the function g, respectively, as shown in equation (6). It's okay. The functions f and g are increasing functions, such as exponential functions, logarithmic functions, and polynomial functions. In this application, the expression "(based on) the product of the first index and the second index" includes the product of the value based on the first index and the value based on the second index.

以上説明したように、本実施形態に係る検出パラメタ生成装置１では、頻出度を示す第１の指標と、希少度を示す第２の指標とを、オブジェクトごとに算出する。そして、第１の指標と第２の指標との積に基づいて、各オブジェクトの第１および第２の検出パラメタを生成する。これにより、オブジェクトの頻出度および希少度に応じて検出パラメタがオブジェクトごとに調整され、特徴の異なる複数種類のオブジェクトを高い精度で検出することを可能とする検出パラメタを試行錯誤することなく容易に生成することができる。 As described above, the detection parameter generation device 1 according to the present embodiment calculates a first index indicating frequency and a second index indicating rarity for each object. Then, first and second detection parameters for each object are generated based on the product of the first index and the second index. This allows the detection parameters to be adjusted for each object according to the frequency and rarity of the object, making it possible to detect multiple types of objects with different characteristics with high accuracy without the need for trial and error. can be generated.

さらに、本実施形態によれば、オブジェクトごとの検出パラメタが自動的に調整されるため、従来の手動による場合と比較して大幅な効率化を実現することができる。 Further, according to the present embodiment, since detection parameters for each object are automatically adjusted, it is possible to achieve a significant increase in efficiency compared to the conventional manual method.

なお、検出パラメタの更新にあたっては、手動で設定可能な調整パラメタを盛り込んでもよい。これにより、人の感覚に基づいて検出パラメタを調整することができる。例えば、第１の検出パラメタおよび第２の検出パラメタは、式（７）および式（８）よりそれぞれ更新される。これらの式において調整パラメタα_１，α_２は通常１であるが、それまでと傾向の異なるデータである等の理由で検出精度が思うように向上しない場合には、これらの手動調整パラメタを１から増減させることにより、検出精度の向上を図ることができる。

Note that when updating the detection parameters, adjustment parameters that can be manually set may be included. This allows the detection parameters to be adjusted based on human senses. For example, the first detection parameter and the second detection parameter are updated using equation (7) and equation (8), respectively. In these formulas, the adjustment parameters α ₁ and α ₂ are usually 1, but if the detection accuracy does not improve as expected due to the data having a different trend from before, these manual adjustment parameters can be set to 1. Detection accuracy can be improved by increasing or decreasing from .

また、データセットの更新に合わせて第１および第２の検出パラメタを更新してもよい。すなわち、Ｊ枚の建築設備図面に対する検出パラメタが生成された後、新たな図面が取得された場合、図面データベース５１を更新し、第１および第２の検出パラメタを更新してもよい。 Furthermore, the first and second detection parameters may be updated in accordance with the update of the data set. That is, if a new drawing is acquired after detection parameters for J building equipment drawings have been generated, the drawing database 51 may be updated and the first and second detection parameters may be updated.

＜検出パラメタの生成方法＞
次に、図８を参照して、本実施形態に係る検出パラメタの生成方法について説明する。 <How to generate detection parameters>
Next, with reference to FIG. 8, a method for generating detection parameters according to this embodiment will be described.

ステップＳ１１では、データ入力部１１が、オブジェクトごとの第１の検出パラメタおよび第２の検出パラメタの初期値（T1_i,1、T2_i,1）を取得する。なお、本ステップで取得される初期値は、オブジェクトの種類に依存しない共通の値でもよい。また、初期値は、ユーザが操作入力部３０を介して入力してもよいし、記憶部５０に予め格納されていてもよいし、通信部２０を介して外部の装置から取得されてもよい。 In step S11, the data input unit 11 obtains initial values (T1 _i,1 , T2 _i,1 ) of the first detection parameter and the second detection parameter for each object. Note that the initial value acquired in this step may be a common value that does not depend on the type of object. Further, the initial value may be input by the user via the operation input unit 30, may be stored in advance in the storage unit 50, or may be acquired from an external device via the communication unit 20. .

ステップＳ１２では、制御部１０が、２つの変数ｉ，ｊを初期化する。本実施形態では、変数ｉおよび変数ｊを１にする。変数ｉはオブジェクトの番号を示し、変数ｊは画像の番号を示す。なお、本ステップＳ１２はステップＳ１１の前に行ってもよい。 In step S12, the control unit 10 initializes two variables i and j. In this embodiment, variable i and variable j are set to 1. The variable i indicates the object number, and the variable j indicates the image number. Note that this step S12 may be performed before step S11.

ステップＳ１３では、制御部１０が、変数ｊがデータセットの画像数Ｊ以下であるかどうかを判定する。変数ｊが画像数Ｊ以下であれば（Ｓ１３：Ｙｅｓ）、ステップＳ１４に進む。 In step S13, the control unit 10 determines whether the variable j is less than or equal to the number J of images in the data set. If the variable j is less than or equal to the number of images J (S13: Yes), the process advances to step S14.

一方、変数ｊが画像数Ｊより大きければ（Ｓ１３：Ｎｏ）、検出パラメタ生成処理を終了する。この際、第１の検出パラメタが０未満である場合は、最終的に生成する第１の検出パラメタの値を所定の最小値（例えば０）とする。また、第１の検出パラメタが１より大きい場合は、最終的に生成する第１の検出パラメタの値は所定の最大値（例えば１）とする。第２の検出パラメタについても同様である。 On the other hand, if the variable j is larger than the number of images J (S13: No), the detection parameter generation process is ended. At this time, if the first detection parameter is less than 0, the finally generated value of the first detection parameter is set to a predetermined minimum value (for example, 0). Further, when the first detection parameter is larger than 1, the value of the first detection parameter finally generated is a predetermined maximum value (for example, 1). The same applies to the second detection parameter.

ステップＳ１４では、制御部１０が、変数ｉがオブジェクトの種類数Ｉ以下であるかどうかを判定する。変数ｉが種類数Ｉ以下であれば（Ｓ１４：Ｙｅｓ）、ステップＳ１５に進む。一方、変数ｉが種類数Ｉより大きければ（Ｓ１４：Ｎｏ）、変数ｊを１つ増加させた後（ステップＳ１９）、ステップＳ１３に戻る。 In step S14, the control unit 10 determines whether the variable i is less than or equal to the number of object types I. If the variable i is less than or equal to the number of types I (S14: Yes), the process advances to step S15. On the other hand, if the variable i is larger than the number of types I (S14: No), the variable j is increased by one (step S19), and then the process returns to step S13.

ステップＳ１５では、頻出度指標算出部１２が、画像ｊにおけるオブジェクトｉの頻出度を示す第１の指標x_i,jを求める。本実施形態では、頻出度指標算出部１２は、図７の図面データベースを参照し、前述の式（１）を用いて第１の指標x_i,jを算出する。 In step S15, the frequency index calculation unit 12 calculates a first index x _i,j indicating the frequency of object i in image j. In this embodiment, the frequency index calculation unit 12 refers to the drawing database of FIG. 7 and calculates the first index x _i,j using the above-mentioned equation (1).

ステップＳ１６では、希少度指標算出部１３が、データセット（Ｊ枚の画像）におけるオブジェクトｉの希少度を示す第２の指標y_iを求める。本実施形態では、希少度指標算出部１３は、図７に示す図面データベースを参照し、前述の式（２）を用いて第２の指標y_iを算出する。なお、ステップＳ１５とステップＳ１６の実行順序は逆であってもよい。 In step S16, the rarity index calculation unit 13 calculates a second index y _i indicating the rarity of the object i in the data set (J images). In this embodiment, the rarity index calculation unit 13 refers to the drawing database shown in FIG. 7 and calculates the second index y _i using the above-mentioned formula (2). Note that the order of execution of step S15 and step S16 may be reversed.

ステップＳ１７では、検出パラメタ更新部１４が、第１の指標x_i,jと第２の指標y_iとの積に基づいて、第１の検出パラメタＴ１_ｉおよび第２の検出パラメタＴ２_ｉを更新する。本実施形態では、検出パラメタ更新部１４は、前述の式（３）を用いて第１の検出パラメタＴ１_ｉを更新し、前述の式（４）を用いて第２の検出パラメタＴ２_ｉを更新する。 In step S17, the detection parameter updating unit 14 updates the first detection parameter T1 i and the second detection parameter _T2 _i based on the product of the first index x _i,j and the second index y _i . do. In this embodiment, the detection parameter update unit 14 updates the first detection parameter T1 _i using the above-mentioned equation (3), and updates the second detection parameter T2 _i using the above-mentioned equation (4). do.

ステップＳ１７の後、制御部１０が、変数ｉを１つ増加させた後（ステップＳ１８）、ステップＳ１４に戻る。 After step S17, the control unit 10 increases the variable i by one (step S18), and then returns to step S14.

上記の検出パラメタの生成方法によれば、オブジェクトの頻出度および希少度に応じて検出パラメタがオブジェクトごとに調整され、特徴の異なる複数種類のオブジェクトを高い精度で検出することを可能とする検出パラメタを容易に生成することができる。 According to the above detection parameter generation method, the detection parameters are adjusted for each object according to the frequency and rarity of the object, and the detection parameters make it possible to detect multiple types of objects with different characteristics with high accuracy. can be easily generated.

＜オブジェクト検出装置＞
次に、図９を参照して、検出パラメタ生成装置１により生成された第１および第２の検出パラメタを用いて、画像（検出用画像）に含まれるオブジェクトを検出するオブジェクト検出装置１００について説明する。 <Object detection device>
Next, with reference to FIG. 9, an object detection device 100 that detects an object included in an image (detection image) using the first and second detection parameters generated by the detection parameter generation device 1 will be described. do.

本実施形態に係るオブジェクト検出装置１００は、図９に示すように、制御部１１０と、通信部１２０と、操作入力部１３０と、表示部１４０と、記憶部１５０とを備えている。 The object detection device 100 according to this embodiment includes a control section 110, a communication section 120, an operation input section 130, a display section 140, and a storage section 150, as shown in FIG.

制御部１１０は、オブジェクト検出装置１００の動作を制御するものであり、ハードウェアとしてはＣＰＵ（中央処理装置）等のプロセッサにより構成される。本実施形態では、制御部１１０は、オブジェクト検出装置１００内のプロセッサが所定のプログラムを実行することにより実現される。なお、制御部１１０の少なくとも一部がＡＳＩＣ等のハードウェアにより構成されてもよい。制御部１１０の詳細については後ほど詳しく説明する。 The control unit 110 controls the operation of the object detection device 100, and is constituted by a processor such as a CPU (central processing unit) as hardware. In this embodiment, the control unit 110 is realized by a processor within the object detection device 100 executing a predetermined program. Note that at least a portion of the control unit 110 may be configured by hardware such as an ASIC. Details of the control unit 110 will be explained in detail later.

通信部１２０は、オブジェクト検出装置１００と外部の情報処理装置（図示せず）との間で情報を送受信する。例えば、通信部１２０は検出パラメタ生成装置１から検出パラメタを受信する。なお、通信部１２０による通信は、有線・無線の別を問わず、また通信プロトコルも限定されない。 The communication unit 120 transmits and receives information between the object detection device 100 and an external information processing device (not shown). For example, the communication unit 120 receives detection parameters from the detection parameter generation device 1 . Note that the communication by the communication unit 120 may be wired or wireless, and the communication protocol is not limited.

操作入力部１３０は、ユーザがオブジェクト検出装置１００に情報を入力するためのインターフェースであり、例えば、キーボード、マウス、タッチパネル、ボタン等である。操作入力部１３０は、ユーザから、検出用画像の指定やオブジェクト検出指示などを受け付ける。 The operation input unit 130 is an interface for a user to input information to the object detection device 100, and is, for example, a keyboard, a mouse, a touch panel, a button, or the like. The operation input unit 130 receives designations of detection images, object detection instructions, and the like from the user.

表示部１４０は、ユーザへ各種情報（例えばオブジェクトの検出結果など）を出力するインターフェースである。この表示部１４０は、例えば、映像を表示するディスプレイ（液晶ディスプレイ、有機ＥＬディスプレイ等）である。 The display unit 140 is an interface that outputs various information (for example, object detection results, etc.) to the user. The display unit 140 is, for example, a display (liquid crystal display, organic EL display, etc.) that displays images.

記憶部１５０は、ハードディスクまたは半導体メモリなどから構成される記憶装置である。この記憶部１５０には、ＤＢＮデータ１５１および検出パラメタデータベース１５２が記憶されている。なお、記憶部１５０には、制御部１１０による情報処理に必要なプログラムやデータが記憶されてもよい。 The storage unit 150 is a storage device composed of a hard disk, a semiconductor memory, or the like. This storage unit 150 stores DBN data 151 and a detection parameter database 152. Note that the storage unit 150 may store programs and data necessary for information processing by the control unit 110.

ＤＢＮデータ１５１は、ステップＳ１で構築された構造適応型ＤＢＮに係るデータ（ニューラルネットワークの形状（層数、各相のニューロン数）、ニューロン間の重み、ニューロンのバイアス等のパラメタ値）である。 The DBN data 151 is data related to the structure-adaptive DBN constructed in step S1 (parameter values such as the shape of the neural network (number of layers, number of neurons in each phase), weights between neurons, bias of neurons, etc.).

検出パラメタデータベース１５２は、検出パラメタ生成装置１によって生成された、オブジェクトｉ（１≦ｉ≦Ｉ）の第１の検出パラメタＴ１_ｉおよび第２の検出パラメタＴ２_ｉが格納されている。 The detection parameter database 152 stores a first detection parameter T1 _i and a second detection parameter T2 _i of an object i (1≦i≦I), which are generated by the detection parameter generation device 1.

次に、制御部１１０について詳しく説明する。 Next, the control unit 110 will be explained in detail.

制御部１１０は、図９に示すように、画像入力部１１１と、画像分割部１１２と、確率取得部（第１の確率取得部）１１３と、変形画像作成部１１４と、確率取得部（第２の確率取得部）１１５と、判定部１１６とを有する。なお、制御部１１０が有する各機能部は、通信接続された複数の情報処理装置に分散して設けられ、これら複数の情報処理装置が協働することによって制御部１１０の機能が実現されてもよい。 As shown in FIG. 9, the control unit 110 includes an image input unit 111, an image division unit 112, a probability acquisition unit (first probability acquisition unit) 113, a deformed image creation unit 114, and a probability acquisition unit (first probability acquisition unit). 2 (probability acquisition unit) 115, and a determination unit 116. Note that each functional unit included in the control unit 110 is provided in a distributed manner in a plurality of information processing devices that are communicatively connected, and the functions of the control unit 110 are realized by cooperation of these plurality of information processing devices. good.

画像入力部１１１は、検出用画像を入力する。図１０は検出用画像の一例（マンションの建築設備図面）を示している。なお、検出用画像は、予め記憶部１５０に記憶されていてもよいし、通信部１２０を介して外部装置から取得されてもよいし、ＣＤ－ＲＯＭやＳＤカード等の情報記録媒体から取得されてもよい。 The image input unit 111 inputs a detection image. FIG. 10 shows an example of a detection image (a building equipment drawing of an apartment building). Note that the detection image may be stored in advance in the storage unit 150, may be obtained from an external device via the communication unit 120, or may be obtained from an information recording medium such as a CD-ROM or an SD card. You can.

画像分割部１１２は、検出用画像を複数の分割画像に分割する。例えば、画像分割部１１２は、入力された検出用画像を複数の矩形領域に沿って分割する。 The image dividing unit 112 divides the detection image into a plurality of divided images. For example, the image dividing unit 112 divides the input detection image along a plurality of rectangular regions.

なお、検出用画像の分割は矩形に限られず、例えばボロノイ領域を用いて矩形の場合よりもオブジェクト検出に適した分割画像を生成してもよい。この場合、画像分割部１１２は、離散ボロノイ図に基づいて複数のボロノイ領域を生成し、生成された複数のボロノイ領域を用いて検出用画像を複数の分割画像に分割する。 Note that the division of the detection image is not limited to rectangular shapes, and for example, a Voronoi region may be used to generate divided images that are more suitable for object detection than a rectangular shape. In this case, the image dividing unit 112 generates a plurality of Voronoi regions based on the discrete Voronoi diagram, and divides the detection image into a plurality of divided images using the plurality of generated Voronoi regions.

確率取得部１１３は、分割画像を構造適応型ＤＢＮの入力層に与えて、構造適応型ＤＢＮの出力層に出力された、あるオブジェクトに対する確率（第１の確率）を取得する。本実施形態では、確率取得部１１３は、画像分割部１１２により作成された複数の分割画像を順次、構造適応型ＤＢＮの入力層に与えて、各分割画像に対する出力層の値（すなわち、オブジェクトごとの確率）を取得する。 The probability acquisition unit 113 provides the divided images to the input layer of the structure-adaptive DBN, and obtains the probability (first probability) for a certain object output to the output layer of the structure-adaptive DBN. In this embodiment, the probability acquisition unit 113 sequentially supplies the plurality of divided images created by the image division unit 112 to the input layer of the structure adaptive DBN, and outputs the value of the output layer for each divided image (i.e., for each object). probability).

変形画像作成部１１４は、確率取得部１１３により取得された確率（オブジェクトｉの確率）が第１の検出パラメタ（Ｔ１_ｉ）よりも大きければ、分割画像に基づいて複数の変形画像を作成する。 If the probability acquired by the probability acquisition unit 113 (probability of object i) is larger than the first detection parameter (T1 _i ), the deformed image creation unit 114 creates a plurality of deformed images based on the divided images.

ここで、図１１を参照して変形画像の作成方法について説明する。なお、図１１では変形画像の輪郭のみを示している。 Here, a method for creating a deformed image will be described with reference to FIG. Note that FIG. 11 shows only the outline of the deformed image.

まず、変形画像作成部１１４は、分割画像の中心点から所定のサイズを抽出する。図１１の例では、分割画像の中心点から縦５０ピクセル、横５０ピクセルの画像Ｔを抽出する。そして、変形画像作成部１１４は、画像Ｔのサイズを変えて複数の画像を作成する。図１１の例では、画像Ｔの縦横のサイズを１００ピクセルまで１０ピクセルずつ増加させて合計３５個の画像を作成している。変形画像作成部１１４が作成する複数の変形画像は、画像Ｔと、画像Ｔのサイズを変更した画像（この例では３６枚の画像）を含む。 First, the deformed image creation unit 114 extracts a predetermined size from the center point of the divided image. In the example of FIG. 11, an image T of 50 pixels vertically and 50 pixels horizontally is extracted from the center point of the divided image. Then, the modified image creation unit 114 creates a plurality of images by changing the size of the image T. In the example of FIG. 11, the vertical and horizontal sizes of the image T are increased by 10 pixels up to 100 pixels to create a total of 35 images. The plurality of deformed images created by the deformed image creation unit 114 include image T and images obtained by changing the size of image T (36 images in this example).

確率取得部１１５は、変形画像作成部１１４により作成された変形画像を、ステップＳ１で構築された構造適応型ＤＢＮの入力層に与えて、構造適応型ＤＢＮの出力層に出力された、あるオブジェクトに対する確率（第２の確率）を取得する。本実施形態では、確率取得部１１５は、変形画像作成部１１４により作成された複数の変形画像を順次、構造適応型ＤＢＮの入力層に与えて、各変形画像についての出力層の値（すなわち、オブジェクトごとの確率）を取得する。 The probability acquisition unit 115 supplies the deformed image created by the deformed image creation unit 114 to the input layer of the structurally adaptive DBN constructed in step S1, and obtains a certain object outputted to the output layer of the structurally adaptive DBN. Obtain the probability (second probability) for . In this embodiment, the probability acquisition unit 115 sequentially supplies the plurality of deformed images created by the deformed image creation unit 114 to the input layer of the structure adaptive DBN, and outputs the value of the output layer for each deformed image (i.e., Obtain the probability for each object.

判定部１１６は、確率取得部１１５により取得された、オブジェクトｉに対する確率が、当該オブジェクトに対する第２の検出パラメタＴ２_ｉよりも大きければ、オブジェクトｉを検出したと判定する。本実施形態では、変形画像作成部１１４により作成された複数の変形画像のうち少なくとも１つの変形画像について、オブジェクトｉに対する確率が第２の検出パラメタＴ２_ｉより大きいならば、オブジェクトｉを検出したと判定する。 The determination unit 116 determines that the object i has been detected if the probability for the object i acquired by the probability acquisition unit 115 is greater than the second detection parameter T2 _i for the object. In this embodiment, if the probability of at least one deformed image among the plurality of deformed images created by the deformed image creation unit 114 for object i is greater than the second detection parameter T2 _i , it is determined that object i has been detected. judge.

上記のように、本実施形態に係るオブジェクト検出装置１００では、検出用画像を分割した分割画像について第１の検出パラメタで１段階目の判定を行い、その後、分割画像から作成した変形画像について第２の検出パラメタで２段階目の判定を行う。これにより、本実施形態によれば、検出用画像から、特徴の異なる複数種類のオブジェクトを高精度で検出することができる。 As described above, the object detection device 100 according to the present embodiment performs the first-stage determination using the first detection parameter on the divided images obtained by dividing the detection image, and then performs the first-stage determination on the deformed images created from the divided images. A second-stage determination is performed using the detection parameter No. 2. As a result, according to the present embodiment, it is possible to detect a plurality of types of objects with different characteristics from the detection image with high accuracy.

＜オブジェクトの検出方法＞
次に、図１２を参照して、本実施形態に係るオブジェクトの検出方法について説明する。 <How to detect objects>
Next, an object detection method according to this embodiment will be described with reference to FIG. 12.

ステップＳ２１では、画像入力部１１１が検出用画像を入力する。 In step S21, the image input unit 111 inputs a detection image.

ステップＳ２２では、画像分割部１１２が、ステップＳ２１で入力された検出用画像を複数の分割画像に分割する。本ステップにおいて、検出用画像はＮ個の分割画像に分割される。 In step S22, the image dividing unit 112 divides the detection image input in step S21 into a plurality of divided images. In this step, the detection image is divided into N divided images.

ステップＳ２３では、制御部１１０が、２つの変数ｉ，ｎを初期化する。変数ｉはオブジェクトの番号を示し、変数ｎは分割画像の番号を示す。本実施形態では、変数ｉおよび変数ｎを１にする。なお、本ステップＳ２３はステップＳ２１またはステップＳ２２の前に行ってもよい。 In step S23, the control unit 110 initializes two variables i and n. The variable i indicates the object number, and the variable n indicates the divided image number. In this embodiment, variable i and variable n are set to 1. Note that this step S23 may be performed before step S21 or step S22.

ステップＳ２４では、制御部１１０が、変数ｎが分割画像の総数Ｎ以下であるかどうかを判定する。変数ｎが総数Ｎ以下であれば（Ｓ２４：Ｙｅｓ）、ステップＳ２５に進む。一方、変数ｎが総数Ｎより大きければ（Ｓ２４：Ｎｏ）、オブジェクト検出処理を終了する。 In step S24, the control unit 110 determines whether the variable n is less than or equal to the total number N of divided images. If the variable n is less than or equal to the total number N (S24: Yes), the process advances to step S25. On the other hand, if the variable n is larger than the total number N (S24: No), the object detection process ends.

ステップＳ２５では、確率取得部１１３が、ステップＳ１で構築された構造適応型ＤＢＮの入力層にｎ番目の分割画像を与えて、出力層に出力された各オブジェクトの確率を取得する。 In step S25, the probability acquisition unit 113 provides the n-th divided image to the input layer of the structure-adaptive DBN constructed in step S1, and acquires the probability of each object output to the output layer.

ステップＳ２６では、制御部１１０が、変数ｉがオブジェクトの種類数Ｉ以下であるかどうかを判定する。変数ｉが種類数Ｉ以下であれば（Ｓ２６：Ｙｅｓ）、ステップＳ２７に進む。一方、変数ｉが種類数Ｉより大きければ（Ｓ２６：Ｎｏ）、変数ｎを１つ増加させて（ステップＳ３３）、ステップＳ２４に戻る。 In step S26, the control unit 110 determines whether the variable i is less than or equal to the number of object types I. If the variable i is less than or equal to the number of types I (S26: Yes), the process advances to step S27. On the other hand, if the variable i is larger than the number of types I (S26: No), the variable n is increased by one (step S33), and the process returns to step S24.

ステップＳ２７では、制御部１１０が、オブジェクトｉの確率が第１の検出パラメタＴ１_ｉよりも大きいかどうかを判定する。オブジェクトｉの確率が第１の検出パラメタＴ１_ｉよりも大きい場合（Ｓ２７：Ｙｅｓ）、ステップＳ２８に進む。一方、オブジェクトｉの確率が第１の検出パラメタＴ１_ｉ以下の場合（Ｓ２７：Ｎｏ）、変数ｉを１つ増加させた後（ステップＳ３２）、ステップＳ２６に戻る。 In step S27, the control unit 110 determines whether the probability of object i is greater than the first detection parameter T1 _i . If the probability of object i is greater than the first detection parameter T1 _i (S27: Yes), the process advances to step S28. On the other hand, if the probability of object i is less than or equal to the first detection parameter T1 _i (S27: No), the variable i is increased by one (step S32), and then the process returns to step S26.

ステップＳ２８では、変形画像作成部１１４が、ｎ番目の分割画像に基づいて複数の変形画像を作成する。本ステップでは、図１１を参照して説明したように分割画像から複数の変形画像が作成される。 In step S28, the deformed image creation unit 114 creates a plurality of deformed images based on the n-th divided image. In this step, a plurality of deformed images are created from the divided images as described with reference to FIG.

ステップＳ２９では、確率取得部１１５が、ステップＳ２８で作成された複数の変形画像をステップＳ１で構築された構造適応型ＤＢＮの入力層に順次与えて、各変形画像についての出力層の値（すなわち、オブジェクトごとの確率）を取得する。 In step S29, the probability acquisition unit 115 sequentially supplies the plurality of deformed images created in step S28 to the input layer of the structure-adaptive DBN constructed in step S1, and the value of the output layer for each deformed image (i.e. , probability for each object).

ステップＳ３０では、判定部１１６が、ステップＳ２８で作成された複数の変形画像ごとに、ステップＳ２９において取得されたオブジェクトｉに対する確率が第２の検出パラメタＴ２_ｉより大きいかどうかを判定する。そして、複数の変形画像のうち少なくとも１つの変形画像について、オブジェクトｉに対する確率が第２の検出パラメタＴ２_ｉより大きいならば（Ｓ３０：Ｙｅｓ）、オブジェクトｉを検出したと判定する（ステップＳ３１）。その後、変数ｉを１つ増加させた後（ステップＳ３２）、ステップＳ２６に戻る。 In step S30, the determination unit 116 determines whether the probability for the object i acquired in step S29 is greater than the second detection parameter T2 _i for each of the plurality of deformed images created in step S28. Then, if the probability of at least one of the plurality of deformed images for object i is greater than the second detection parameter T2 _i (S30: Yes), it is determined that object i has been detected (step S31). Thereafter, after incrementing the variable i by one (step S32), the process returns to step S26.

一方、ステップＳ２８で作成された複数の変形画像のすべてについて、オブジェクトｉに対する確率が第２の検出パラメタＴ２_ｉ以下であるならば（Ｓ３０：Ｎｏ）、判定部１１６が、オブジェクトｉを検出しないと判定し、変数ｉを１つ増加させた後（ステップＳ３２）、ステップＳ２６に戻る。 On the other hand, if the probability for object i is less than or equal to the second detection parameter T2 _i for all of the plurality of deformed images created in step S28 (S30: No), the determination unit 116 determines that object i is not detected. After making a determination and incrementing the variable i by one (step S32), the process returns to step S26.

上記のように本実施形態に係るオブジェクトの検出方法では、分割画像について第１の検出パラメタを用いて１段階目の判定を行い（ステップＳ２７）、その後、分割画像から作成された変形画像について第２の検出パラメタを用いて２段階目の判定を行う（ステップＳ３０）。検出パラメタ生成装置１により生成された第１および第２の検出パラメタを用いて２段階の判定処理を行う。これにより、本実施形態によれば、検出用画像から、特徴の異なる複数種類のオブジェクトを高精度で検出することができる。 As described above, in the object detection method according to the present embodiment, the first stage determination is performed using the first detection parameter for the divided image (step S27), and then the first stage determination is performed for the deformed image created from the divided image. A second-stage determination is performed using the second detection parameter (step S30). A two-stage determination process is performed using the first and second detection parameters generated by the detection parameter generation device 1. As a result, according to the present embodiment, it is possible to detect a plurality of types of objects with different characteristics from the detection image with high accuracy.

図１３は、本実施形態に係るオブジェクト検出装置・方法によるオブジェクトの検出精度を、従来のオブジェクト検出アルゴリズム（手動調整）による検出精度と比較したものである。ここでいう検出精度は、データセット（Ｊ枚の建築設備図面）に含まれる部品総数のうち正しく検出された部品の割合である。なお、図１３中のＴ１，Ｔ２の値は検出パラメタの初期値を示している。ここでは、自動調整の場合も初期値は各部品に共通の値とした。 FIG. 13 compares the object detection accuracy by the object detection apparatus and method according to this embodiment with the detection accuracy by a conventional object detection algorithm (manual adjustment). The detection accuracy here is the proportion of correctly detected parts out of the total number of parts included in the data set (J building equipment drawings). Note that the values of T1 and T2 in FIG. 13 indicate the initial values of the detection parameters. Here, even in the case of automatic adjustment, the initial value is a common value for each component.

図１３に示すように、検出パラメタを手動で調整した場合では、最高で９８．５％の検出精度が得られたものの、検出精度にばらつきが見られた。これは、各部品の特徴を考慮せずに一律の検出パラメタを使用していることに起因すると考えられる。一方、本実施形態による場合（自動調整）、手動調整のような試行錯誤を経なくとも、部品ごとに検出パラメタが最適値に自動調整されるため、初期値の異なるケース（自動調整１、自動調整２）のいずれにおいても最高の検出精度が得られた。 As shown in FIG. 13, when the detection parameters were manually adjusted, a maximum detection accuracy of 98.5% was obtained, but there were variations in the detection accuracy. This is considered to be due to the fact that uniform detection parameters are used without considering the characteristics of each component. On the other hand, in the case of this embodiment (automatic adjustment), the detection parameters are automatically adjusted to the optimal values for each component without going through trial and error like manual adjustment. The highest detection accuracy was obtained in all cases of adjustment 2).

ここで、記号を正しく検出できなかった場合について説明する。図１４は、記号を検出できなかった場合の建築設備図面の一部を示している。図１４には、コンセントを示す記号Ｓ３１，Ｓ３２，Ｓ３３，Ｓ３４が含まれている。記号Ｓ３２と記号Ｓ３４については、コンセント記号の上に他の器具等の画像が重なっているため、検出できなかった。一方、記号Ｓ３１と記号Ｓ３４については、コンセント記号の近傍に他の線が存在するものの正しく検出された。 Here, a case will be described in which a symbol cannot be detected correctly. FIG. 14 shows a part of the building equipment drawing when the symbol cannot be detected. FIG. 14 includes symbols S31, S32, S33, and S34 indicating electrical outlets. Symbols S32 and S34 could not be detected because images of other appliances were superimposed on the outlet symbols. On the other hand, symbols S31 and S34 were correctly detected although there were other lines near the outlet symbol.

このように本実施形態によれば、建築設備図面に含まれる特徴の異なる様々な種類の記号を高い精度で効率良く検出することができる。これにより、建築設備図面を正確に理解する作業を大幅に効率化することができる。その結果、建築物の多様化、働き方改革および人手不足問題に対して貢献することが可能となる。 As described above, according to the present embodiment, various types of symbols with different characteristics included in building equipment drawings can be detected efficiently with high accuracy. This makes it possible to greatly improve the efficiency of accurately understanding architectural equipment drawings. As a result, it will be possible to contribute to the diversification of buildings, work style reform, and labor shortage issues.

以上、本発明の実施形態について説明した。なお、上記の説明では検出パラメタ生成装置１とオブジェクト検出装置１００を別個の装置として説明したが、一体の装置として構成してもよい。 The embodiments of the present invention have been described above. Although the detection parameter generation device 1 and the object detection device 100 are described as separate devices in the above description, they may be configured as an integrated device.

上記の記載に基づいて、当業者であれば、本発明の追加の効果や種々の変形を想到できるかもしれないが、本発明の態様は、上述した実施形態に限定されるものではない。特許請求の範囲に規定された内容およびその均等物から導き出される本発明の概念的な思想と趣旨を逸脱しない範囲で種々の追加、変更および部分的削除が可能である。 Based on the above description, those skilled in the art may be able to envision additional effects and various modifications of the present invention, but aspects of the present invention are not limited to the embodiments described above. Various additions, changes, and partial deletions are possible without departing from the conceptual idea and gist of the present invention derived from the content defined in the claims and equivalents thereof.

１検出パラメタ生成装置
１０制御部
１１データ入力部
１２頻出度指標算出部
１３希少度指標算出部
１４検出パラメタ更新部
２０通信部
３０操作入力部
４０表示部
５０記憶部
５１図面データベース
１００オブジェクト検出装置
１１０制御部
１１１画像入力部
１１２画像分割部
１１３確率取得部
１１４変形画像作成部
１１５確率取得部
１１６判定部
１２０通信部
１３０操作入力部
１４０表示部
１５０記憶部
１５１ＤＢＮデータ
１５２検出パラメタデータベース
Ｍ１，Ｍ２，Ｍ３，Ｍ４，Ｍ５記号
Ｎ１，Ｎ２，Ｎ３，Ｎ４，Ｎ５（出力層の）ニューロン 1 Detection parameter generation device 10 Control unit 11 Data input unit 12 Frequency index calculation unit 13 Rarity index calculation unit 14 Detection parameter update unit 20 Communication unit 30 Operation input unit 40 Display unit 50 Storage unit 51 Drawing database 100 Object detection device 110 Control section 111 Image input section 112 Image division section 113 Probability acquisition section 114 Deformed image creation section 115 Probability acquisition section 116 Judgment section 120 Communication section 130 Operation input section 140 Display section 150 Storage section 151 DBN data 152 Detection parameter database M1, M2, M3, M4, M5 symbol N1, N2, N3, N4, N5 (output layer) neuron

Claims

A detection parameter generation device that generates detection parameters for detecting an object from an image using a structure adaptive DBN, comprising:
a frequency index calculation unit that calculates a first index indicating the frequency of a certain object in one image of a dataset having a plurality of images;
a rarity index calculation unit that calculates a second index indicating the rarity of the object in the data set;
a first detection parameter for determining whether the object is present in a divided image obtained by dividing the detection image based on the product of the first index and the second index; a detection parameter updating unit that updates a second detection parameter for determining whether the object is present in the deformed image created based on the divided image;
A detection parameter generation device comprising:

2. The detection parameter updating unit reduces the first detection parameter and the second detection parameter as the product of the first index and the second index increases. The detection parameter generation device described.

3. The detection parameter generation device according to claim 1, wherein the frequency index calculation unit calculates the first index using equation (1).

Here, x _i,j is the first index, n _i,j is the number of objects in image j, and I is the total number of object types in the data set.

4. The detection parameter generation device according to claim 1, wherein the rarity index calculation unit calculates the second index using equation (2).

Here, y _i is the second index, J is the total number of images in the data set, and A _i is the number of images including the object.

5. The detection parameter generation device according to claim 1, wherein the detection parameter updating unit updates the first detection parameter using equation (3).

Here, T1 _i,j+1 : the first detection parameter after the update of the object, T1 _i,j : the first detection parameter before the update of the object, α ₁ : adjustment parameter, z _i,j : It is the product of the first index and the second index.

6. The detection parameter generation device according to claim 1, wherein the detection parameter updating unit updates the second detection parameter using equation (4).

Here, T2 _i,j+1 : second detection parameter after update of the object, T2 _i,j : second detection parameter before update of the object, α ₂ : adjustment parameter, z _i,j : It is the product of the first index and the second index.

7. The detection parameter generation device according to claim 1, wherein the image is an image related to a building equipment drawing, and the object is a symbol of the building equipment drawing.

an image input section for inputting a detection image;
an image dividing unit that divides the detection image into a plurality of divided images;
a first probability acquisition unit that applies the divided image to an input layer of a structure adaptive DBN and obtains a first probability for a certain object output to an output layer of the structure adaptive DBN;
If the first probability is larger than the first detection parameter generated by the detection parameter generation device according to claim 1, a deformed image creation unit that creates a deformed image based on the divided image;
a second probability acquisition unit that applies the deformed image to the input layer of the structure-adaptive DBN to obtain a second probability for the object output to the output layer of the structure-adaptive DBN;
a determination unit that determines that the object has been detected if the second probability is greater than the second detection parameter generated by the detection parameter generation device according to claim 1;
An object detection device comprising:

A detection parameter generation method for generating detection parameters for detecting an object from an image using a structure adaptive DBN, the method comprising:
determining a first indicator of the frequency of an object in one of the images in the dataset;
determining a second indicator of the rarity of the object in the dataset;
a first detection parameter for determining whether the object is present in a divided image obtained by dividing the detection image based on the product of the first index and the second index; updating a second detection parameter for determining whether the object is present in the deformed image created based on the divided image;
A detection parameter generation method comprising:

inputting a detection image;
dividing the detection image into a plurality of divided images;
providing the divided image to an input layer of a structure-adaptive DBN to obtain a first probability for a certain object output to an output layer of the structure-adaptive DBN;
If the first probability is greater than the first detection parameter generated by the detection parameter generation method according to claim 9, creating a deformed image based on the divided image;
providing the deformed image to an input layer of the structure-adaptive DBN to obtain a second probability for the object output to an output layer of the structure-adaptive DBN;
If the second probability is greater than the second detection parameter generated by the detection parameter generation method according to claim 9, determining that the object has been detected;
An object detection method comprising:

A detection parameter generation program that generates detection parameters for detecting an object from an image using a structure adaptive DBN, comprising:
determining a first indicator of the frequency of an object in one of the images in the dataset;
determining a second indicator of the rarity of the object in the dataset;
a first detection parameter for determining whether the object is present in a divided image obtained by dividing the detection image based on the product of the first index and the second index; updating a second detection parameter for determining whether the object is present in the deformed image created based on the divided image;
A detection parameter generation program that causes a computer to execute.

inputting a detection image;
dividing the detection image into a plurality of divided images;
providing the divided image to an input layer of a structure-adaptive DBN to obtain a first probability for a certain object output to an output layer of the structure-adaptive DBN;
If the first probability is greater than the first detection parameter generated by the detection parameter generation program according to claim 11, creating a deformed image based on the divided image;
providing the deformed image to an input layer of the structure-adaptive DBN to obtain a second probability for the object output to an output layer of the structure-adaptive DBN;
If the second probability is greater than the second detection parameter generated by the detection parameter generation program according to claim 11, determining that the object has been detected;
An object detection program that causes a computer to run