JP6825299B2

JP6825299B2 - Information processing equipment, information processing methods and programs

Info

Publication number: JP6825299B2
Application number: JP2016207914A
Authority: JP
Inventors: 関　海克; 海克関
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2016-10-24
Filing date: 2016-10-24
Publication date: 2021-02-03
Anticipated expiration: 2036-10-24
Also published as: JP2018072884A

Description

本発明は、情報処理装置、情報処理方法およびプログラムに関する。 The present invention relates to information processing devices, information processing methods and programs.

車載カメラなどの撮影装置で撮影された撮影画像を解析し、撮影画像に含まれる信号機の信号（典型的には点灯している信号）を検出することで、運転者を支援するシステムが知られている。また、信号機の信号を認識するために、信号機の信号色を検出する装置が知られている。 A system that assists the driver by analyzing a captured image taken by a photographing device such as an in-vehicle camera and detecting a signal of a traffic light (typically a lit signal) included in the captured image is known. ing. Further, in order to recognize the signal of the traffic light, a device for detecting the signal color of the traffic light is known.

例えば、特許文献１には、カメラのゲインを２回調整して２枚の撮影画像を取得し、これらの２枚の撮影画像を用いて信号色を検知する方法が開示されている。特許文献１では、１回目の撮影で取得した撮影画像を用いて、２回目の撮影用のゲインを調整する。 For example, Patent Document 1 discloses a method in which the gain of a camera is adjusted twice to acquire two captured images, and the signal color is detected using these two captured images. In Patent Document 1, the gain for the second shooting is adjusted by using the shot image obtained in the first shooting.

しかしながら従来の技術では、ゲインの異なる複数枚の撮影画像が必要であった。また、ゲイン調整の誤差により、検出精度が大幅に低下する場合があった。さらに、撮影画像のうち信号機が映り込んだ領域を検出する前の状態では、撮影画像における信号の位置が不明であることから、信号色を正確に検出可能なゲインの設定は困難であった。 However, in the conventional technique, a plurality of captured images having different gains are required. In addition, the detection accuracy may be significantly reduced due to an error in gain adjustment. Further, in the state before detecting the region of the captured image in which the traffic light is reflected, the position of the signal in the captured image is unknown, so that it is difficult to set the gain that can accurately detect the signal color.

また、カメラゲインを最適に設定しても、信号機の信号と同じ色、同じ形状の物体が現れると、その物体を信号機の信号として誤認識する問題がある。例えば、赤い看板の一部が木に隠れ、見える部分の形状が、赤信号の色を示す円の形になった場合、その赤い看板を、赤信号を示す領域として認識してしまう。従来の技術では、誤認識を削除不可能である。 Further, even if the camera gain is optimally set, if an object having the same color and shape as the signal of the traffic light appears, there is a problem that the object is erroneously recognized as a signal of the traffic light. For example, if a part of the red signboard is hidden by a tree and the visible part is shaped like a circle indicating the color of the red traffic light, the red signboard is recognized as an area indicating the red traffic light. With conventional technology, false recognition cannot be deleted.

すなわち、従来の技術では、撮影画像に映り込んだ信号機の信号を高精度に認識することは困難であるという問題があった。 That is, with the conventional technique, there is a problem that it is difficult to recognize the signal of the traffic light reflected in the captured image with high accuracy.

本発明は、上記に鑑みてなされたものであって、撮影画像に映り込んだ信号機の信号を高精度に認識することが可能な情報処理装置、情報処理方法およびプログラムを提供することを目的とする。 The present invention has been made in view of the above, and an object of the present invention is to provide an information processing device, an information processing method, and a program capable of recognizing a signal of a traffic light reflected in a captured image with high accuracy. To do.

上述した課題を解決し、目的を達成するために、本発明は、撮影画像を取得する取得部と、前記取得部により取得された前記撮影画像のうち、信号機の信号を示す信号領域を認識する信号領域認識部と、検出対象となる信号機の信号を示す対象信号を識別するために予め定められた対象信号識別情報と、前記信号領域認識部により認識された前記信号領域と、に基づいて、前記信号領域が、前記対象信号を示す領域であるか否かを判断する判断部と、を備え、前記対象信号識別情報は、前記撮影画像のうち対象信号の領域と、前記対象信号とは異なる信号の領域を示す誤認識領域との分離面を、少なくとも複数の色信号、撮影画像の端部からの距離と信号領域の幅からなる特徴量ベクトルより算出する評価関数である情報処理装置である。 In order to solve the above-mentioned problems and achieve the object, the present invention recognizes a signal region indicating a signal of a signal among an acquisition unit that acquires a captured image and the captured image acquired by the acquisition unit. Based on the signal area recognition unit, the target signal identification information predetermined for identifying the target signal indicating the signal of the signal to be detected, and the signal area recognized by the signal area recognition unit. The target signal identification information includes a determination unit for determining whether or not the signal region is a region indicating the target signal, and the target signal identification information is different from the target signal region in the captured image and the target signal. It is an information processing device that is an evaluation function that calculates the separation surface from the false recognition area indicating the signal area from a feature quantity vector consisting of at least a plurality of color signals, the distance from the end of the captured image, and the width of the signal area. ..

本発明によれば、撮影画像に映り込んだ信号機の信号を高精度に認識することができる。 According to the present invention, the signal of the traffic light reflected in the captured image can be recognized with high accuracy.

図１は、情報処理装置の一例の説明図である。FIG. 1 is an explanatory diagram of an example of an information processing device. 図２は、実施形態の情報処理装置が有する機能の一例を示す図である。FIG. 2 is a diagram showing an example of a function of the information processing apparatus of the embodiment. 図３は、横方向に配置される信号機の青信号を示す信号領域を示す図である。FIG. 3 is a diagram showing a signal region showing a green light of a traffic light arranged in the horizontal direction. 図４は、横方向に配置される信号機の赤信号を示す信号領域を示す図である。FIG. 4 is a diagram showing a signal region showing a red light of a traffic light arranged in the horizontal direction. 図５は、信号領域認識部の認識結果の一例を示す図である。FIG. 5 is a diagram showing an example of the recognition result of the signal area recognition unit. 図６は、複数の信号機を認識した結果の例を示す図である。FIG. 6 is a diagram showing an example of the result of recognizing a plurality of traffic lights. 図７は、任意の１つの時間帯における２次元の特徴量ベクトルの分布の例を示す図である。FIG. 7 is a diagram showing an example of the distribution of the two-dimensional feature amount vector in any one time zone. 図８は、時間帯判別部の詳細な構成例を示す図である。FIG. 8 is a diagram showing a detailed configuration example of the time zone determination unit. 図９は、基準撮影画像における、平均輝度（Ｉａｖ）と低輝度ブロック数（Ｎｂｌｋ）によって示されるサンプル点の一例を示す図である。FIG. 9 is a diagram showing an example of sample points indicated by the average luminance (Iav) and the number of low luminance blocks (Nblk) in the reference captured image. 図１０は、画素値の（Ｕ、Ｖ）空間での閾値処理で抽出した赤信号の画素例を示す図である。FIG. 10 is a diagram showing a pixel example of a red signal extracted by threshold processing in the (U, V) space of pixel values. 図１１は、信号画素領域を膨張処理した後の画像例を示す図である。FIG. 11 is a diagram showing an example of an image after the signal pixel region is expanded. 図１２は、Ｈｏｕｇｈ変換により抽出した赤信号の円領域を示す図である。FIG. 12 is a diagram showing a circular region of a red light extracted by the Hough transform. 図１３は、信号領域決定部によって決定された信号領域を示す図である。FIG. 13 is a diagram showing a signal region determined by the signal region determination unit. 図１４は、認識処理部の動作例を示すフローチャートである。FIG. 14 is a flowchart showing an operation example of the recognition processing unit. 図１５は、撮影装置のハードウェア構成の一例を示す図である。FIG. 15 is a diagram showing an example of the hardware configuration of the photographing apparatus. 図１６は、情報処理装置のハードウェア構成例を示すブロック図である。FIG. 16 is a block diagram showing a hardware configuration example of the information processing device.

以下、添付図面を参照しながら、本発明に係る情報処理装置、情報処理方法およびプログラムの実施形態を詳細に説明する。 Hereinafter, embodiments of the information processing apparatus, information processing method, and program according to the present invention will be described in detail with reference to the accompanying drawings.

図１は、本実施の形態の情報処理装置１０の一例の説明図である。情報処理装置１０は、例えば、移動体に搭載されている。移動体は、自立走行または牽引などにより、実空間において位置移動の可能な物体である。移動体は、例えば、車両、飛行機、電車、台車、などである。本実施の形態では、移動体が車両２０である場合を一例として説明する。すなわち、本実施の形態では、情報処理装置１０を車両２０に搭載した形態を一例として説明する。 FIG. 1 is an explanatory diagram of an example of the information processing device 10 of the present embodiment. The information processing device 10 is mounted on a mobile body, for example. A moving body is an object that can move its position in real space by running independently or towing. The moving body is, for example, a vehicle, an airplane, a train, a bogie, or the like. In the present embodiment, the case where the moving body is the vehicle 20 will be described as an example. That is, in the present embodiment, a mode in which the information processing device 10 is mounted on the vehicle 20 will be described as an example.

車両２０には、撮影装置１２が搭載されている。撮影装置１２は、車両２０の周辺を撮影した撮影画像を得る。撮影装置１２は、例えば、公知のビデオカメラや、デジタルカメラなどである。本実施の形態では、撮影装置１２は、車両２０の周辺を連続して撮影することで、複数の撮影画像（すなわち、複数のフレーム）を撮影する。なお、撮影装置１２は、情報処理装置１０と一体的に構成してもよいし、情報処理装置１０と別体として構成してもよい。 The vehicle 20 is equipped with a photographing device 12. The photographing device 12 obtains a photographed image of the periphery of the vehicle 20. The photographing device 12 is, for example, a known video camera, a digital camera, or the like. In the present embodiment, the photographing device 12 continuously photographs the periphery of the vehicle 20 to capture a plurality of captured images (that is, a plurality of frames). The photographing device 12 may be configured integrally with the information processing device 10 or may be configured as a separate body from the information processing device 10.

また、撮影装置１２は、車両２０に搭載した形態に限定されない。撮影装置１２は、信号機３０を撮影可能な位置に設置されていればよく、地面に対して固定されていてもよい。なお、本実施の形態の情報処理装置１０で検知した検知結果を、車両２０の運転者の運転支援などに用いる場合には、撮影装置１２は、車両２０に搭載した形態であることが好ましい。 Further, the photographing device 12 is not limited to the form mounted on the vehicle 20. The photographing device 12 may be installed at a position where the traffic light 30 can be photographed, and may be fixed to the ground. When the detection result detected by the information processing device 10 of the present embodiment is used for driving support of the driver of the vehicle 20, the photographing device 12 is preferably mounted on the vehicle 20.

本実施形態では、撮影装置１２は、オートゲインコントロール（ＡＧＣ）機能を搭載している。このため、撮影装置１２は、感度を自動調整し、撮影画像の全画面の明るさが最適となるように自動調整した撮影画像を得る。 In the present embodiment, the photographing device 12 is equipped with an automatic gain control (AGC) function. Therefore, the photographing device 12 automatically adjusts the sensitivity and obtains a photographed image automatically adjusted so that the brightness of the entire screen of the photographed image is optimized.

情報処理装置１０は、撮影画像を解析する。そして、情報処理装置１０は、撮影画像に含まれる信号機３０を検出する。 The information processing device 10 analyzes the captured image. Then, the information processing device 10 detects the traffic light 30 included in the captured image.

次に、情報処理装置１０が有する機能の一例を説明する。図２は、情報処理装置１０が有する機能の一例を示すブロック図である。説明の便宜上、ここでは、本実施形態に関する機能を主に例示しているが、情報処理装置１０が有する機能はこれらに限られるものではない。 Next, an example of the function of the information processing apparatus 10 will be described. FIG. 2 is a block diagram showing an example of the functions of the information processing device 10. For convenience of explanation, the functions related to the present embodiment are mainly illustrated here, but the functions of the information processing apparatus 10 are not limited to these.

情報処理装置１０は、インタフェース部１４と、認識処理部１６と、記憶部１８と、を備える。インタフェース部１４および記憶部１８は、認識処理部１６に電気的に接続されている。 The information processing device 10 includes an interface unit 14, a recognition processing unit 16, and a storage unit 18. The interface unit 14 and the storage unit 18 are electrically connected to the recognition processing unit 16.

インタフェース部１４は、撮影装置１２から撮影画像を受付ける。撮影装置１２は、車両２０の周辺を経時的に連続して撮影し、撮影によって得られた撮影画像（この例ではカラー画像）の各々を、撮影順に順次、インタフェース部１４へ出力する。インタフェース部１４は、撮影装置１２から撮影画像を順次受け付け、受け付けた順に、認識処理部１６へ順次出力する。 The interface unit 14 receives a photographed image from the photographing device 12. The photographing device 12 continuously photographs the periphery of the vehicle 20 over time, and outputs each of the captured images (color images in this example) obtained by the photographing to the interface unit 14 in the order of photographing. The interface unit 14 sequentially receives captured images from the photographing device 12, and sequentially outputs them to the recognition processing unit 16 in the order of acceptance.

記憶部１８は、各種データを記憶する。本実施の形態では、記憶部１８は、時間帯認識辞書ＤＢ１８Ａと、対象信号識別辞書ＤＢ１８Ｂを有し、時間帯認識辞書ＤＢ１８Ａは時間帯認識辞書を保持し、対象信号識別辞書ＤＢ１８Ｂは対象信号識別辞書を保持する。これらの詳細については後述する。 The storage unit 18 stores various data. In the present embodiment, the storage unit 18 has a time zone recognition dictionary DB 18A and a target signal identification dictionary DB 18B, the time zone recognition dictionary DB 18A holds a time zone recognition dictionary, and the target signal identification dictionary DB 18B identifies the target signal. Keep a dictionary. Details of these will be described later.

認識処理部１６は、撮影画像を解析し、撮影画像に映り込んだ信号機の信号を認識（検出）する。図２に示すように、認識処理部１６は、画像取得部１０１と、信号領域認識部１０２と、判断部１０３と、時間帯判別部１０４と、位置情報取得部１０５と、選択部１０６と、出力部１０７と、を含む。信号領域認識部１０２は、候補領域認識部１１１と、信号形状認識部１１２と、信号領域決定部１１３とを含む。 The recognition processing unit 16 analyzes the captured image and recognizes (detects) the signal of the traffic light reflected in the captured image. As shown in FIG. 2, the recognition processing unit 16 includes an image acquisition unit 101, a signal area recognition unit 102, a determination unit 103, a time zone determination unit 104, a position information acquisition unit 105, and a selection unit 106. The output unit 107 and the like are included. The signal area recognition unit 102 includes a candidate area recognition unit 111, a signal shape recognition unit 112, and a signal area determination unit 113.

画像取得部１０１、信号領域認識部１０２、判断部１０３、時間帯判別部１０４、位置情報取得部１０５、選択部１０６、出力部１０７の一部またはすべては、例えば、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）などの処理装置にプログラムを実行させること、すなわち、ソフトウェアにより実現してもよいし、ＩＣ（ＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ）などのハードウェアにより実現してもよいし、ソフトウェアおよびハードウェアを併用して実現してもよい。 Some or all of the image acquisition unit 101, signal area recognition unit 102, determination unit 103, time zone determination unit 104, position information acquisition unit 105, selection unit 106, and output unit 107 may be, for example, a CPU (Central Processing Unit) or the like. The program may be executed by the processing unit of the above, that is, it may be realized by software, it may be realized by hardware such as an IC (Integrated Circuit), or it may be realized by using software and hardware together. Good.

画像取得部１０１は、「取得部」の一例であり、撮影画像を取得する。この例では、画像取得部１０１は、インタフェース部１４から順次に入力される撮影画像（動画像）を取得する。つまり、画像取得部１０１は、車両に搭載された撮影装置１２によって撮影された撮影画像を取得する。 The image acquisition unit 101 is an example of the “acquisition unit” and acquires a captured image. In this example, the image acquisition unit 101 acquires captured images (moving images) sequentially input from the interface unit 14. That is, the image acquisition unit 101 acquires a photographed image taken by the photographing device 12 mounted on the vehicle.

信号領域認識部１０２は、画像取得部１０１により取得された撮影画像のうち、信号機の信号を示す信号領域を認識する。この例では、信号領域認識部１０２は、候補領域認識部１１１と、信号形状認識部１１２と、信号領域決定部１１３と、を含む。信号領域の具体的な認識方法については後述するが、信号領域決定部１１３は、候補領域認識部１１１の認識結果と、信号形状認識部１１２の認識結果とを受けて、信号領域を決定する。図３は、横方向に配置される信号機の青信号を示す信号領域を示す図であり、図４は、横方向に配置される信号機の赤信号を示す信号領域を示す図である。国や地域によって信号機の形は異なる。図３および図４は、日本における信号機の例を示す図である。 The signal area recognition unit 102 recognizes a signal area indicating a signal of a traffic light among the captured images acquired by the image acquisition unit 101. In this example, the signal area recognition unit 102 includes a candidate area recognition unit 111, a signal shape recognition unit 112, and a signal area determination unit 113. Although a specific recognition method of the signal region will be described later, the signal region determination unit 113 determines the signal region based on the recognition result of the candidate region recognition unit 111 and the recognition result of the signal shape recognition unit 112. FIG. 3 is a diagram showing a signal region showing a green signal of a traffic light arranged in the horizontal direction, and FIG. 4 is a diagram showing a signal region showing a red signal of a traffic light arranged in the horizontal direction. The shape of the traffic light differs depending on the country or region. 3 and 4 are diagrams showing an example of a traffic light in Japan.

図５は、信号領域認識部１０２の認識結果の一例を示す図である。画像の原点は撮影画像（入力画像）の左上の点Ｏとする。左右方向の横軸はｘで、下向きの縦軸はｙとする。ここでは、信号領域の認識結果は、点灯する信号を囲む矩形の領域で表される。信号領域の具体的な認識方法については後述する。また、ここでは、認識された矩形領域の左上の座標Ｐ（Ｘ，Ｙ）が信号領域の認識結果として出力される。Ｙは、画面の一番上から、認識された矩形領域の左上の点Ｐまでの距離を示す。このＹは、検出対象となる信号機の信号を判別するための一つの特徴量となる。Ｘは、画面の一番左から、認識された矩形領域の左上の点Ｐまでの距離を示す。このＸも、検出対象となる信号機の信号を判別するための一つの特徴量となる。また、ここでは、認識された矩形領域の幅をＷ、高さをＨとする。説明の便宜上、矩形領域はＷ＝Ｈの正方形とする。 FIG. 5 is a diagram showing an example of the recognition result of the signal area recognition unit 102. The origin of the image is the point O on the upper left of the captured image (input image). The horizontal axis in the left-right direction is x, and the vertical axis in the downward direction is y. Here, the recognition result of the signal area is represented by a rectangular area surrounding the lit signal. The specific recognition method of the signal region will be described later. Further, here, the coordinates P (X, Y) on the upper left of the recognized rectangular area are output as the recognition result of the signal area. Y indicates the distance from the top of the screen to the point P on the upper left of the recognized rectangular area. This Y is one feature amount for discriminating the signal of the traffic signal to be detected. X indicates the distance from the leftmost of the screen to the point P on the upper left of the recognized rectangular area. This X is also a feature amount for discriminating the signal of the traffic signal to be detected. Further, here, the width of the recognized rectangular area is W, and the height is H. For convenience of explanation, the rectangular area is a square with W = H.

図６は、複数の信号機を認識した結果の例を示す図である。認識結果Ａは、対象信号機の赤信号を示す信号領域である。認識結果Ｂは、遠方に存在する信号機の青信号を示す信号領域であり、対象信号（検出対象となる信号機の信号）の領域ではない。認識結果Ｃは、垂直方向の道路の青信号の信号を示す信号領域であり、対象信号の領域ではない。認識結果Ｄ〜Ｆは誤認識の結果を示す（対象信号の領域ではない）。信号機の認識結果は、赤信号、黄信号および青信号の各々を表す（Ｃｒ，Ｃｙ，Ｃｇ）の形で記述する。（Ｃｒ，Ｃｙ，Ｃｇ）のそれぞれの要素は１か０の値である。数値「１」は該当する色の信号が検知されたことを意味する。（１，０，０）は赤信号が検知されたことを表し、（０，１，０）は黄信号が検知されたことを表し、（０，０，１）は青信号が検知されたことを表す。 FIG. 6 is a diagram showing an example of the result of recognizing a plurality of traffic lights. The recognition result A is a signal region indicating a red light of the target traffic light. The recognition result B is a signal region indicating a green signal of a traffic light existing in a distant place, and is not a region of a target signal (a signal of a traffic light to be detected). The recognition result C is a signal region indicating a green signal of a road in the vertical direction, and is not a region of a target signal. The recognition results D to F indicate the result of erroneous recognition (not in the region of the target signal). The recognition result of the traffic light is described in the form of (Cr, Cy, Cg) representing each of the red signal, the yellow signal, and the green signal. Each element of (Cr, Cy, Cg) is a value of 1 or 0. The numerical value "1" means that a signal of the corresponding color has been detected. (1,0,0) means that a red light was detected, (0,1,0) means that a yellow light was detected, and (0,0,1) means that a green light was detected. Represents.

図２に戻って説明を続ける。判断部１０３は、検出対象となる信号機（対象信号機）の信号を示す対象信号を識別するために予め定められた対象信号識別情報と、信号領域認識部１０２により認識された信号領域と、に基づいて、該信号領域が、対象信号を示す領域であるか否かを判断する。対象信号識別情報は、ＳＶＭ（ＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅ）を用いた機械学習方法を利用して予め作成される。より具体的には、対象信号識別情報は、撮影画像のうち対象信号の領域と、対象信号とは異なる信号の領域を示す誤認識領域との分離面を示す評価関数である。 The explanation will be continued by returning to FIG. The determination unit 103 is based on predetermined target signal identification information for identifying a target signal indicating a signal of a signal to be detected (target signal) and a signal area recognized by the signal area recognition unit 102. Therefore, it is determined whether or not the signal region is a region indicating a target signal. The target signal identification information is created in advance by using a machine learning method using SVM (Support Vector Machine). More specifically, the target signal identification information is an evaluation function indicating a separation surface between a region of the target signal in the captured image and a false recognition region indicating a region of a signal different from the target signal.

信号領域認識部１０２による認識結果の特徴量（Ｃｒ，Ｃｙ，Ｃｇ，Ｗ，Ｘ，Ｙ）を６次元の特徴量ベクトルとする。実際には６次元の特徴量ベクトル（ベクトルデータ）により対象信号を判別するが、説明の便宜上、２次元の特徴量ベクトルの例を説明する。信号領域認識部１０２による認識結果を表す２次元の特徴量ベクトルが、対象信号（以下では「赤信号」を例に挙げて説明するが、他の色の信号も同様に考えることができる）を示すか否かを識別するための対象信号識別情報を作成する場合、対象信号の検出に用いられる撮影画像とは別に、この検出処理より前に、対象地域の複数の時間帯ごとに、予め撮影装置１２によって対象信号機を撮影した画像（信号画像と称する）を集める。そして、複数の時間帯ごとに、信号画像に映り込んだ赤信号の信号領域の幅Ｗと、画面上端からの距離Ｙとを含む特徴量ベクトル（２次元の特徴量ベクトル）を求める。図７は、任意の１つの時間帯における２次元の特徴量ベクトルの分布の例を示す図である。図７の例では、信号領域の幅（サイズ）Ｗが比較的に大きく、かつ、画面上端からの距離Ｙが比較的に小さい信号領域の特徴量ベクトルは、対象信号の領域を表す特徴量データであり、左上の白い矩形の集合である。一方、信号領域の幅Ｗが比較的に小さく、かつ、画面上端からの距離Ｙが比較的に大きい信号領域の特徴量ベクトルは、対象信号とは異なる信号（例えば遠方に存在する信号）の領域（誤認識領域）を表す特徴量データであり、右下の黒円の集合である。 Let the feature amount (Cr, Cy, Cg, W, X, Y) of the recognition result by the signal area recognition unit 102 be a six-dimensional feature amount vector. Actually, the target signal is discriminated by the 6-dimensional feature vector (vector data), but for convenience of explanation, an example of the 2-dimensional feature vector will be described. The two-dimensional feature amount vector representing the recognition result by the signal area recognition unit 102 describes the target signal (hereinafter, “red signal” is taken as an example, but signals of other colors can be considered in the same manner). When creating the target signal identification information for identifying whether to show or not, separately from the captured image used for detecting the target signal, the target area is photographed in advance for each of a plurality of time zones before the detection process. An image (referred to as a signal image) obtained by photographing the target traffic light by the device 12 is collected. Then, a feature amount vector (two-dimensional feature amount vector) including the width W of the signal region of the red signal reflected in the signal image and the distance Y from the upper end of the screen is obtained for each of a plurality of time zones. FIG. 7 is a diagram showing an example of the distribution of the two-dimensional feature amount vector in any one time zone. In the example of FIG. 7, the feature amount vector of the signal area in which the width (size) W of the signal area is relatively large and the distance Y from the upper edge of the screen is relatively small is the feature amount data representing the area of the target signal. Is a set of white rectangles on the upper left. On the other hand, the feature amount vector of the signal region in which the width W of the signal region is relatively small and the distance Y from the upper edge of the screen is relatively large is a region of a signal different from the target signal (for example, a signal existing in a distant place). It is feature data representing (misrecognition area), and is a set of black circles in the lower right.

ここでは、判断部１０３は、対象信号の領域の特徴量データ（正解データ）の集合と、誤認識領域の特徴量データの集合とを分離する分離面を、マージンｄが最大となるように配置する。そして、判断部１０３は、この分離面（図７の例では直線Ｌ）を示す評価関数を、対象信号識別辞書１８Ｂとして予め算出（学習により算出）しておくことができる。なお、この例では、評価関数を算出する主体は判断部１０３であるとして説明したが、これに限らず、評価関数を算出する主体は判断部１０３以外の他の機能であってもよい。 Here, the determination unit 103 arranges a separation surface that separates a set of feature amount data (correct answer data) in the target signal area and a set of feature amount data in the erroneous recognition area so that the margin d is maximized. To do. Then, the determination unit 103 can pre-calculate (calculate by learning) the evaluation function indicating this separation surface (straight line L in the example of FIG. 7) as the target signal identification dictionary 18B. In this example, the subject that calculates the evaluation function is the determination unit 103, but the present invention is not limited to this, and the subject that calculates the evaluation function may have a function other than the determination unit 103.

上記評価関数は、例えば以下の式（１）のような線形識別関数で表されるが、これに限らず、他の非線形識別関数であってもよい。以下の式（１）におけるＡ，Ｂ，Ｃは評価関数の係数である。ここでは、この評価関数が「対象信号識別情報」に対応する。

The evaluation function is represented by a linear discriminant function such as the following equation (1), but is not limited to this and may be another non-linear discriminant function. A, B, and C in the following equation (1) are the coefficients of the evaluation function. Here, this evaluation function corresponds to the "target signal identification information".

判断部１０３は、信号領域認識部１０２による認識結果を示す特徴量データ（信号領域認識部１０２により認識された信号領域に相当）を上記評価関数に代入して、評価関数ｆ（Ｗ，Ｙ）の値を算出する。そして、算出した評価関数ｆ（Ｗ，Ｙ）の値が予め定めた閾値以上であれば、判断部１０３は、信号領域認識部１０２による認識結果を示す特徴量データは、対象信号の領域を表すと判断する。一方、算出した評価関数ｆ（Ｗ，Ｙ）の値が閾値未満であれば、判断部１０３は、信号領域認識部１０２による認識結果を示す特徴量データは、対象信号の領域を表すものではないと判断する。 The determination unit 103 substitutes the feature amount data (corresponding to the signal area recognized by the signal area recognition unit 102) indicating the recognition result by the signal area recognition unit 102 into the evaluation function, and evaluates the evaluation function f (W, Y). Calculate the value of. If the calculated value of the evaluation function f (W, Y) is equal to or greater than a predetermined threshold value, the determination unit 103 indicates the recognition result by the signal area recognition unit 102, and the feature amount data indicating the recognition result represents the area of the target signal. Judge. On the other hand, if the calculated value of the evaluation function f (W, Y) is less than the threshold value, the determination unit 103 does not represent the area of the target signal in the feature amount data indicating the recognition result by the signal area recognition unit 102. Judge.

以上においては、正解であるか否かを判断する対象となる認識結果を示す特徴量データは２次元の特徴量ベクトルである場合を例に挙げて説明したが、次元数は任意に変更可能である。例えば６次元の場合、正解データの集合と、誤認識領域の特徴量データの集合とを分離する分離面を示す評価関数ｆ（Ｃｒ，Ｃｙ，Ｃｇ，Ｗ，Ｘ，Ｙ）は、以下の式（２）で表される。

In the above, the feature amount data indicating the recognition result to be judged whether or not the answer is correct has been described by taking the case of a two-dimensional feature amount vector as an example, but the number of dimensions can be changed arbitrarily. is there. For example, in the case of 6 dimensions, the evaluation function f (Cr, Cy, Cg, W, X, Y) indicating the separation surface that separates the set of correct answer data and the set of feature data of the misrecognition region is the following equation. It is represented by (2).

この場合、信号領域認識部１０２による認識結果を示す特徴量データ（Ｃｒ，Ｃｙ，Ｃｇ，Ｗ，Ｘ，Ｙ）は６次元のベクトルデータであり、判断部１０３は、この特徴量データ（Ｃｒ，Ｃｙ，Ｃｇ，Ｗ，Ｘ，Ｙ）を上記評価関数に代入して、評価関数ｆ（Ｃｒ，Ｃｙ，Ｃｇ，Ｗ，Ｘ，Ｙ）の値を算出する。そして、算出した評価関数ｆ（Ｃｒ，Ｃｙ，Ｃｇ，Ｗ，Ｘ，Ｙ）の値が予め定めた閾値以上であれば、判断部１０３は、信号領域認識部１０２による認識結果を示す特徴量データは、対象信号の領域を表すと判断する。一方、算出した評価関数ｆ（Ｃｒ，Ｃｙ，Ｃｇ，Ｗ，Ｘ，Ｙ）の値が閾値未満であれば、判断部１０３は、信号領域認識部１０２による認識結果を示す特徴量データは、対象信号の領域を表すものではないと判断する。 In this case, the feature amount data (Cr, Cy, Cg, W, X, Y) showing the recognition result by the signal area recognition unit 102 is 6-dimensional vector data, and the determination unit 103 uses this feature amount data (Cr, By substituting Cy, Cg, W, X, Y) into the above evaluation function, the value of the evaluation function f (Cr, Cy, Cg, W, X, Y) is calculated. Then, if the value of the calculated evaluation function f (Cr, Cy, Cg, W, X, Y) is equal to or higher than a predetermined threshold value, the determination unit 103 determines the feature amount data indicating the recognition result by the signal area recognition unit 102. Is determined to represent the region of the target signal. On the other hand, if the value of the calculated evaluation function f (Cr, Cy, Cg, W, X, Y) is less than the threshold value, the determination unit 103 targets the feature amount data indicating the recognition result by the signal area recognition unit 102. It is judged that it does not represent the area of the signal.

ここでは、時間帯と、国または地域との組み合わせごとに、事前学習により評価関数を作成しておき、対象信号識別辞書ＤＢ１８Ｂに格納しておく。判断部１０３は、後述の選択部１０６により選択された評価関数と、信号領域認識部１０２による認識結果を示す特徴量データと、に基づいて、信号領域認識部１０２による認識結果を示す特徴量データが、対象信号の領域を示すか否かを判断する。 Here, an evaluation function is created by pre-learning for each combination of the time zone and the country or region, and stored in the target signal identification dictionary DB18B. The determination unit 103 is based on the evaluation function selected by the selection unit 106, which will be described later, and the feature amount data indicating the recognition result by the signal area recognition unit 102, and the feature amount data showing the recognition result by the signal area recognition unit 102. Determines whether or not indicates the region of the target signal.

出力部１０７は、対象信号を示す領域であると判断された特徴量データ（信号領域認識部１０２による認識結果を示す特徴量データ）に対応する信号領域を、認識結果として出力する。 The output unit 107 outputs a signal area corresponding to the feature amount data (feature amount data indicating the recognition result by the signal area recognition unit 102) determined to be the area indicating the target signal as the recognition result.

時間帯判別部１０４は、画像取得部１０１により取得された撮影画像が撮影された時間帯を判別する。時間帯判別部１０４は、撮影画像を解析することによって、該撮影画像の撮影時間帯を判別する。 The time zone determination unit 104 determines the time zone in which the captured image acquired by the image acquisition unit 101 was captured. The time zone determination unit 104 determines the shooting time zone of the captured image by analyzing the captured image.

撮影時間帯とは、１日（２４時間）を撮影環境の互いに異なる複数の時間帯に分割したものである。撮影環境が異なる、とは、光強度が異なることを意味する。例えば、撮影時間帯は、昼、または、夜、である。なお、撮影時間帯は、昼、夜、夕方、であってもよい。また、撮影時間帯は、１日を撮影環境の異なる複数の時間帯に分割した、各時間帯であればよく、昼、夜、夕方、などに限定されない。また撮影時間帯は、２種類や３種類に限定されず、４種類以上であってもよい。また、撮影時間帯は、撮影画像の撮影環境（例えば、季節、国、地域、北半球・南半球）に応じて、適宜、設定すればよい。 The shooting time zone is a day (24 hours) divided into a plurality of time zones different from each other in the shooting environment. Different shooting environments mean different light intensities. For example, the shooting time zone is daytime or nighttime. The shooting time zone may be daytime, nighttime, or evening. Further, the shooting time zone may be any time zone in which the day is divided into a plurality of time zones having different shooting environments, and is not limited to day, night, evening, and the like. The shooting time zone is not limited to two or three types, and may be four or more types. In addition, the shooting time zone may be appropriately set according to the shooting environment of the shot image (for example, season, country, region, northern hemisphere / southern hemisphere).

本実施形態では、一例として、撮影時間帯が、昼または夜を示す場合を説明する。撮影時間帯が昼を示す、とは、撮影環境の光強度が閾値以上の時間帯である。撮影時間帯が夜である、とは、撮影環境の光強度が該閾値未満の時間帯である。この光強度の閾値には、予め任意の値を定めればよい。例えば、この光強度の閾値には、処理対象の撮影画像Ｐを撮影した撮影装置１２のオートゲインコントロール機能によって、撮影画像Ｐに含まれる信号機３０の信号３０Ａを示す領域の光量が飽和し始める（または飽和が解除される）光強度を定めればよい。 In the present embodiment, as an example, a case where the shooting time zone indicates daytime or nighttime will be described. The shooting time zone indicates daytime means that the light intensity of the shooting environment is equal to or higher than the threshold value. The shooting time zone is night is a time zone in which the light intensity of the shooting environment is less than the threshold value. An arbitrary value may be set in advance for the threshold value of this light intensity. For example, the light intensity in the region indicating the signal 30A of the traffic light 30 included in the captured image P begins to saturate at the threshold value of the light intensity by the auto gain control function of the photographing device 12 that captured the captured image P to be processed ( Alternatively, the light intensity (which is desaturated) may be determined.

時間帯判別部１０４は、画像取得部１０１により取得された撮影画像の輝度を用いて、撮影画像の撮影時間帯を判別する。詳細には、図８に示すように、時間帯判別部１０４は、第１の算出部１１６Ｈと、第２の算出部１１６Ｉと、第３の算出部１１６Ｊと、を含む。 The time zone determination unit 104 determines the shooting time zone of the captured image by using the brightness of the captured image acquired by the image acquisition unit 101. More specifically, as shown in FIG. 8, the time zone determination unit 104 includes a first calculation unit 116H, a second calculation unit 116I, and a third calculation unit 116J.

第１の算出部１１６Ｈは、撮影画像の平均輝度を算出する。第１の算出部１１６Ｈは、撮影画像に含まれる画素の各々の輝度値の平均値を算出する。これにより、第１の算出部１１６Ｈは、撮影画像の平均輝度を算出する。撮影時間帯が昼の場合、撮影画像の平均輝度は高い。一方、撮影時間帯が夜の場合、撮影画像の平均輝度は、昼より低い。 The first calculation unit 116H calculates the average brightness of the captured image. The first calculation unit 116H calculates the average value of the brightness values of each of the pixels included in the captured image. As a result, the first calculation unit 116H calculates the average brightness of the captured image. When the shooting time is daytime, the average brightness of the shot image is high. On the other hand, when the shooting time zone is night, the average brightness of the shot image is lower than that of daytime.

第２の算出部１１６Ｉは、撮影画像を複数のブロックに分割する。例えば、第２の算出部１１６Ｉは、撮影画像をｍ×ｎ個のブロックに分割する。なお、ｍおよびｎは、１以上の整数であり、且つ、ｍおよびｎの少なくとも一方は、２以上の整数である。 The second calculation unit 116I divides the captured image into a plurality of blocks. For example, the second calculation unit 116I divides the captured image into m × n blocks. Note that m and n are integers of 1 or more, and at least one of m and n is an integer of 2 or more.

そして、第２の算出部１１６Ｉは、ブロックの各々ごとに、平均輝度を算出する。例えば、第２の算出部１１６Ｉは、ブロックの各々ごとに、ブロックに含まれる画素の輝度値の平均値を算出する。これにより、第２の算出部１１６Ｉは、撮影画像Ｐに含まれるブロックの各々ごとに、平均輝度を算出する。 Then, the second calculation unit 116I calculates the average brightness for each of the blocks. For example, the second calculation unit 116I calculates the average value of the brightness values of the pixels included in the block for each block. As a result, the second calculation unit 116I calculates the average brightness for each of the blocks included in the captured image P.

さらに、第２の算出部１１６Ｉは、撮影画像における平均輝度が閾値以下のブロックの数を、該撮影画像における低輝度ブロック数として算出する。低輝度ブロック数の算出に用いる平均輝度の閾値には、任意の値を予め定めればよい。 Further, the second calculation unit 116I calculates the number of blocks whose average brightness in the captured image is equal to or less than the threshold value as the number of low-luminance blocks in the captured image. An arbitrary value may be predetermined for the threshold value of the average luminance used for calculating the number of low-luminance blocks.

そして、時間帯判別部１０４は、撮影画像の平均輝度と、該撮影画像の低輝度ブロック数と、を含む特徴量に基づいて、撮影時間帯を判別する。 Then, the time zone determination unit 104 determines the photographing time zone based on the feature amount including the average brightness of the photographed image and the number of low-luminance blocks of the photographed image.

具体的には、時間帯判別部１０４は、例えば、撮影画像の平均輝度と撮影画像の低輝度ブロック数を、時間帯判別時の特徴量として用いる。 Specifically, the time zone determination unit 104 uses, for example, the average brightness of the captured image and the number of low-luminance blocks of the captured image as feature quantities at the time of time zone determination.

本実施形態では、時間帯判別部１０４は、予め、撮影時間帯の判別に用いる時間帯認識辞書１８Ａを作成する。すなわち、時間帯判別部１０４は、撮影画像の時間帯判別処理より以前に、予め、時間帯認識辞書１８Ａを作成する。 In the present embodiment, the time zone determination unit 104 creates in advance the time zone recognition dictionary 18A used for determining the shooting time zone. That is, the time zone determination unit 104 creates the time zone recognition dictionary 18A in advance before the time zone determination process of the captured image.

本実施形態では、時間帯判別部１０４は、ＳＶＭ（ＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅ）を用いた機械学習方法を用いて、時間帯認識辞書１８Ａを予め作成する。 In the present embodiment, the time zone determination unit 104 creates a time zone recognition dictionary 18A in advance by using a machine learning method using an SVM (Support Vector Machine).

詳細には、時間帯判別部１０４は、撮影時間帯ごとに予め撮影された複数の撮影画像を基準撮影画像として用いる。 Specifically, the time zone determination unit 104 uses a plurality of captured images captured in advance for each imaging time zone as reference captured images.

基準撮影画像とは、信号機の信号を示す領域の検出に用いられる撮影画像とは別に、この検出処理より前に、撮影装置１２によって予め撮影された撮影画像である。まず、時間帯判別部１０４は、基準撮影画像における、平均輝度と低輝度ブロック数によって示されるサンプル点の群を、平均輝度と低輝度ブロック数によって定まる二次元空間に登録する。図９は、基準撮影画像における、平均輝度（Ｉａｖ）と低輝度ブロック数（Ｎｂｌｋ）によって示されるサンプル点の一例を示す図である。 The reference captured image is a captured image previously captured by the imaging device 12 prior to the detection process, in addition to the captured image used for detecting the region showing the signal of the traffic light. First, the time zone determination unit 104 registers a group of sample points indicated by the average luminance and the number of low-luminance blocks in the reference captured image in a two-dimensional space determined by the average luminance and the number of low-luminance blocks. FIG. 9 is a diagram showing an example of sample points indicated by the average luminance (Iav) and the number of low luminance blocks (Nblk) in the reference captured image.

図９中、サンプル点群４０Ｂは、夜を示す撮影時間帯に撮影された基準撮影画像における、平均輝度と低輝度ブロック数によって示されるサンプル点の群である。また、サンプル点群４０Ａは、昼を示す撮影時間帯に撮影された基準撮影画像における、平均輝度と低輝度ブロック数によって示されるサンプル点の群である。 In FIG. 9, the sample point group 40B is a group of sample points indicated by the average brightness and the number of low-luminance blocks in the reference shooting image taken during the shooting time zone indicating night. Further, the sample point group 40A is a group of sample points indicated by the average luminance and the number of low-luminance blocks in the reference photographing image photographed in the photographing time zone indicating daytime.

時間帯判別部１０４は、昼のサンプル点群４０Ａと夜のサンプル点群４０Ｂとを分離する分離面（ここでは、直線Ｌａ）を、昼のサンプル点群４０Ａと夜のサンプル点群４０Ｂの各々の境界線（直線Ｌａ１および直線Ｌａ２）間の距離ｄ（マージンと称される場合がある）が最大となるように配置する。そして、時間帯判別部１０４は、この分離面（図９では、直線Ｌａ）を示す評価関数を算出する。下記式（３）は、この直線Ｌａを示す評価関数を示す式である。 The time zone determination unit 104 sets a separation surface (here, a straight line La) that separates the daytime sample point group 40A and the nighttime sample point group 40B, and each of the daytime sample point group 40A and the nighttime sample point group 40B. The distance d (sometimes referred to as a margin) between the boundary lines (straight line La1 and straight line La2) is maximized. Then, the time zone determination unit 104 calculates an evaluation function indicating this separation surface (straight line La in FIG. 9). The following equation (3) is an equation showing an evaluation function showing this straight line La.

上記式（３）中、ｆ（Ｉａｖ，Ｎｂｌｋ）は、撮影画像の平均輝度と撮影画像の低輝度ブロック数を時間帯判別時の特徴量として用いた場合の、評価関数である。式（３）中、Ａ、Ｂ、およびＣは、該評価関数の係数である。ここでは、評価関数ｆ（Ｉａｖ，Ｎｂｌｋ）または係数Ａ、Ｂ、およびＣが時間帯認識辞書１８Ａに対応している。 In the above equation (3), f (Iav, Nblk) is an evaluation function when the average brightness of the captured image and the number of low-luminance blocks of the captured image are used as feature quantities at the time of time zone determination. In formula (3), A, B, and C are coefficients of the evaluation function. Here, the evaluation function f (Iav, Nblk) or the coefficients A, B, and C correspond to the time zone recognition dictionary 18A.

時間帯判別部１０４は、式（３）に示す評価関数を予め作成し、時間帯認識辞書１８Ａとして予め記憶部１８に記憶する。 The time zone determination unit 104 creates an evaluation function represented by the equation (3) in advance and stores it in the storage unit 18 in advance as the time zone recognition dictionary 18A.

時間帯判別部１０４は、撮影画像の撮影時間帯の判別時には、記憶部１８に記憶されている時間帯認識辞書１８Ａ（式（３）に示す評価関数）を用いて、該撮影画像の撮影時間帯を判別する。 When determining the shooting time zone of the captured image, the time zone determining unit 104 uses the time zone recognition dictionary 18A (evaluation function shown in the equation (3)) stored in the storage unit 18 to capture the captured image. Determine the band.

詳細には、時間帯判別部１０４は、第１の算出部１１６Ｈおよび第２の算出部１１６Ｉによって算出した、撮影画像の平均輝度（Ｉａｖ）と低輝度ブロック数（Ｎｂｌｋ）を、式（３）に代入することで、評価関数ｆ（Ｉａｖ，Ｎｂｌｋ）の値を算出する。 Specifically, the time zone determination unit 104 calculates the average brightness (Iav) and the number of low-luminance blocks (Nblk) of the captured image calculated by the first calculation unit 116H and the second calculation unit 116I in the equation (3). By substituting into, the value of the evaluation function f (Iav, Nblk) is calculated.

そして、時間帯判別部１０４は、算出した評価関数ｆ（Ｉａｖ，Ｎｂｌｋ）の値が、予め定めた閾値以上である場合、該撮影画像の撮影時間帯が昼を示すと判別する。一方、時間帯判別部１０４は、算出した評価関数ｆ（Ｉａｖ，Ｎｂｌｋ）の値が、該閾値未満である場合、該撮影画像の撮影時間帯が夜を示すと判別する。この閾値は、判別対象の撮影時間帯の撮影環境の光強度に応じて、予め定めればよい。 Then, when the value of the calculated evaluation function f (Iav, Nblk) is equal to or greater than a predetermined threshold value, the time zone determination unit 104 determines that the imaging time zone of the captured image indicates daytime. On the other hand, when the value of the calculated evaluation function f (Iav, Nblk) is less than the threshold value, the time zone determination unit 104 determines that the imaging time zone of the captured image indicates night. This threshold value may be set in advance according to the light intensity of the shooting environment in the shooting time zone to be discriminated.

なお、時間帯判別部１０４は、撮影画像の平均輝度と、該撮影画像の低輝度ブロック数と、該撮影画像におけるブロック毎の平均輝度の分散値と、を含む特徴量に基づいて、撮影時間帯を判別してもよい。 The time zone determination unit 104 is based on a feature amount including the average brightness of the captured image, the number of low-luminance blocks of the captured image, and the dispersion value of the average brightness of each block in the captured image. The band may be discriminated.

この場合、時間帯判別部１０４の第３の算出部１１６Ｊが、撮影画像におけるブロック毎の平均輝度の分散値を算出する。第３の算出部１１６Ｊは、第２の算出部１１６Ｉが分割したブロック毎の、平均輝度の分散値を算出する。例えば、時間帯判別部１０４は、下記式（４）を用いて、分散値（σ）を算出する。 In this case, the third calculation unit 116J of the time zone determination unit 104 calculates the dispersion value of the average brightness for each block in the captured image. The third calculation unit 116J calculates the dispersion value of the average brightness for each block divided by the second calculation unit 116I. For example, the time zone determination unit 104 calculates the variance value (σ) using the following equation (4).

式（４）中、σは、撮影画像におけるブロック毎の平均輝度の分散値を示す。式（４）中、Ｎは、撮影画像に含まれるブロック数（すなわち、ｍ×ｎの値）を示す。Ｉｉは、ｉ番目のブロックの平均輝度を示す。Ｉａｖは、撮影画像の全体の平均輝度を示す。 In the formula (4), σ indicates the dispersion value of the average brightness for each block in the captured image. In the formula (4), N indicates the number of blocks (that is, the value of m × n) included in the captured image. Ii indicates the average brightness of the i-th block. Iav indicates the overall average brightness of the captured image.

この場合、時間帯判別部１０４は、撮影画像の平均輝度と、該撮影画像の低輝度ブロック数と、該撮影画像におけるブロック毎の平均輝度の分散値と、を特徴量として用いる時間帯認識辞書１８Ａを、予め作成すればよい。 In this case, the time zone determination unit 104 uses a time zone recognition dictionary that uses the average brightness of the captured image, the number of low-luminance blocks of the captured image, and the dispersion value of the average brightness of each block in the captured image as feature quantities. 18A may be created in advance.

すなわち、時間帯判別部１０４は、撮影画像の平均輝度と、該撮影画像の低輝度ブロック数と、を特徴量として用いる場合と同様に、ＳＶＭ（ＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅ）を用いた機械学習方法を用いて、時間帯認識辞書１８Ａを予め作成すればよい。この場合、時間帯判別部１０４は、基準撮影画像における、平均輝度と低輝度ブロック数と分散値とによって示されるサンプル点の群を、平均輝度と低輝度ブロック数と分散値によって定まる三次元空間に登録すればよい。 That is, the time zone determination unit 104 uses a machine learning method using SVM (Support Vector Machine), as in the case of using the average brightness of the captured image and the number of low-luminance blocks of the captured image as feature quantities. Therefore, the time zone recognition dictionary 18A may be created in advance. In this case, the time zone determination unit 104 determines a group of sample points indicated by the average brightness, the number of low-luminance blocks, and the variance value in the reference captured image in a three-dimensional space determined by the average brightness, the number of low-luminance blocks, and the variance value. You can register at.

そして、時間帯判別部１０４は、上記と同様にして、昼のサンプル点群と夜のサンプル点群とを分離する分離面を、マージンが最大となるように配置する。そして、時間帯判別部１０４は、この分離面を示す評価関数を、時間帯認識辞書１８Ａとして算出すればよい。下記式（５）は、撮影画像Ｐの平均輝度、撮影画像Ｐの低輝度ブロック数、および分散値を、時間帯判別時の特徴量として用いる場合の、評価関数（時間帯認識辞書１８Ａ）である。 Then, the time zone determination unit 104 arranges the separation surface for separating the daytime sample point cloud and the nighttime sample point cloud so as to maximize the margin in the same manner as described above. Then, the time zone determination unit 104 may calculate the evaluation function indicating this separation surface as the time zone recognition dictionary 18A. The following equation (5) is an evaluation function (time zone recognition dictionary 18A) when the average brightness of the captured image P, the number of low-luminance blocks of the captured image P, and the variance value are used as feature quantities at the time of time zone discrimination. is there.

式（５）中、ｆ（Ｉａｖ，Ｎｂｌｋ，σ）は、撮影画像の平均輝度と撮影画像の低輝度ブロック数と分散値とを時間帯判別時の特徴量として用いる場合の、評価関数（時間帯認識辞書１８Ａ）である。式（５）中、Ａ、Ｂ、Ｃ、およびＤは、該評価関数の係数である。 In equation (5), f (Iav, Nblk, σ) is an evaluation function (time) when the average brightness of the captured image, the number of low-luminance blocks of the captured image, and the variance value are used as feature quantities at the time of time zone discrimination. The band recognition dictionary 18A). In formula (5), A, B, C, and D are coefficients of the evaluation function.

このように、時間帯判別部１０４は、例えば、式（５）に示す評価関数を予め作成し、時間帯認識辞書１８Ａとして予め記憶部１８に記憶してもよい。 As described above, the time zone determination unit 104 may, for example, create the evaluation function represented by the equation (5) in advance and store it in the storage unit 18 in advance as the time zone recognition dictionary 18A.

この場合、時間帯判別部１０４は、撮影画像の撮影時間帯の判別時には、記憶部１８に記憶されている時間帯認識辞書１８Ａ（式（５）に示す評価関数）を用いて、該撮影画像Ｐの撮影時間帯を判別する。 In this case, the time zone determination unit 104 uses the time zone recognition dictionary 18A (evaluation function shown in the equation (5)) stored in the storage unit 18 to determine the shooting time zone of the captured image. The shooting time zone of P is determined.

詳細には、時間帯判別部１０４は、第１の算出部１１６Ｈ、第２の算出部１１６Ｉ、および第３の算出部１１６Ｊの各々によって算出された、撮影画像の平均輝度（Ｉａｖ）、低輝度ブロック数（Ｎｂｌｋ）、および分散値（σ）を得る。そして、時間帯判別部１０４は、これらの平均輝度（Ｉａｖ）、低輝度ブロック数（Ｎｂｌｋ）、および分散値（σ）を、式（５）に代入することで、評価関数ｆ（Ｉａｖ，Ｎｂｌｋ，σ）の値を算出する。 Specifically, the time zone determination unit 104 has an average brightness (Iav) and low brightness of the captured image calculated by each of the first calculation unit 116H, the second calculation unit 116I, and the third calculation unit 116J. The number of blocks (Nblk) and the variance value (σ) are obtained. Then, the time zone determination unit 104 substitutes the average luminance (Iav), the number of low-luminance blocks (Nblk), and the variance value (σ) into the equation (5), thereby substituting the evaluation function f (Iav, Nblk). , Σ) is calculated.

そして、時間帯判別部１０４は、算出した評価関数ｆ（Ｉａｖ，Ｎｂｌｋ，σ）の値が、予め定めた閾値以上である場合、該撮影画像の撮影時間帯が昼を示すと判別する。一方、時間帯判別部１０４は、算出した評価関数ｆ（Ｉａｖ，Ｎｂｌｋ，σ）の値が、該閾値未満である場合、該撮影画像の撮影時間帯が夜を示すと判別する。この閾値は、予め定めればよい。 Then, when the value of the calculated evaluation function f (Iav, Nblk, σ) is equal to or greater than a predetermined threshold value, the time zone determination unit 104 determines that the imaging time zone of the captured image indicates daytime. On the other hand, when the value of the calculated evaluation function f (Iav, Nblk, σ) is less than the threshold value, the time zone determination unit 104 determines that the imaging time zone of the captured image indicates night. This threshold value may be set in advance.

以上のようにして、時間帯判別部１０４は、画像取得部１０１により取得された撮影画像が撮影された時間帯を判別し、判別した撮影時間帯を選択部１０６へ出力する。 As described above, the time zone determination unit 104 determines the time zone in which the captured image acquired by the image acquisition unit 101 was captured, and outputs the determined shooting time zone to the selection unit 106.

図２に戻り、認識処理部１６が有する機能の説明を続ける。位置情報取得部１０５は、自装置の位置情報を取得する。例えば位置情報取得部１０５は、ＧＰＳ装置などの外部装置から位置情報を取得することができる。位置情報取得部１０５は、取得した位置情報を選択部１０６へ出力する。 Returning to FIG. 2, the description of the function of the recognition processing unit 16 will be continued. The position information acquisition unit 105 acquires the position information of the own device. For example, the position information acquisition unit 105 can acquire position information from an external device such as a GPS device. The position information acquisition unit 105 outputs the acquired position information to the selection unit 106.

選択部１０６は、時間帯判別部１０４により判別された時間帯に対応する対象信号識別情報を選択する。また、選択部１０６は、位置情報取得部１０５により取得された位置情報（国または地域を特定可能）に対応する対象信号識別情報を選択する。より具体的には、選択部１０６は、時間帯判別部１０４により判別された時間帯と、位置情報取得部１０５により取得された位置情報（国または地域を特定可能）との組み合わせに対応する対象信号識別情報を選択する。 The selection unit 106 selects the target signal identification information corresponding to the time zone determined by the time zone determination unit 104. Further, the selection unit 106 selects the target signal identification information corresponding to the position information (country or region can be specified) acquired by the position information acquisition unit 105. More specifically, the selection unit 106 corresponds to a combination of the time zone determined by the time zone determination unit 104 and the position information (country or region can be specified) acquired by the position information acquisition unit 105. Select signal identification information.

次に、信号領域認識部１０２による信号領域の認識方法について説明する。信号領域認識部１０２は、画像取得部１０１により取得された撮影画像のうち、信号機の信号色を示す信号画素を含む信号画素領域を抽出し、その抽出した信号画素領域を膨張した膨張領域に予め定めた形状の定型領域が含まれる場合、該定型領域を含む領域を信号領域として認識する。より具体的には、信号領域認識部１０２は、膨張領域に円形の定型領域が含まれる場合、該定型領域を含む領域を信号領域として認識する。 Next, a method of recognizing the signal region by the signal region recognition unit 102 will be described. The signal region recognition unit 102 extracts a signal pixel region including a signal pixel indicating the signal color of the traffic light from the captured image acquired by the image acquisition unit 101, and previously expands the extracted signal pixel region into an expanded expansion region. When a standard region having a predetermined shape is included, the region including the standard region is recognized as a signal region. More specifically, when the expansion region includes a circular standard region, the signal region recognition unit 102 recognizes the region including the standard region as a signal region.

信号領域認識部１０２は、候補領域認識部１１１と、信号形状認識部１１２と、信号領域決定部１１３とを含む。候補領域認識部１１１は、画像取得部１０１により取得された撮影像画像のうち、信号機の信号色を示す信号画素を含む信号画素領域を候補領域として抽出（認識）する。信号形状認識部１１２は、候補領域認識部１１１により認識された候補領域を膨張した膨張領域に予め定めた形状の定型領域が含まれるか否かを認識する。信号領域決定部１１３は、候補領域認識部１１１による認識結果と、信号形状認識部１１２による認識結果とに基づいて、信号領域を決定する。以下、具体的な内容を説明する。 The signal area recognition unit 102 includes a candidate area recognition unit 111, a signal shape recognition unit 112, and a signal area determination unit 113. The candidate area recognition unit 111 extracts (recognizes) a signal pixel area including a signal pixel indicating the signal color of the traffic light from the captured image image acquired by the image acquisition unit 101 as a candidate area. The signal shape recognition unit 112 recognizes whether or not the expansion region in which the candidate region recognized by the candidate region recognition unit 111 is expanded includes a fixed region having a predetermined shape. The signal area determination unit 113 determines the signal area based on the recognition result by the candidate area recognition unit 111 and the recognition result by the signal shape recognition unit 112. The specific contents will be described below.

画像取得部１０１により取得された撮影画像が（Ｒ，Ｇ，Ｂ）色空間の画像データである場合、候補領域認識部１１１は、画像取得部１０１により取得されたＲＧＢ色空間の撮影画像（入力画像）を、（Ｙ，Ｕ，Ｖ）色空間の画像データに変更し、信号画素を抽出する。ここで、（Ｒ，Ｇ，Ｂ）の色空間と（Ｙ，Ｕ，Ｖ）の色空間との変換は以下の式（６）により行われる。

When the captured image acquired by the image acquisition unit 101 is image data in the (R, G, B) color space, the candidate area recognition unit 111 uses the captured image (input) in the RGB color space acquired by the image acquisition unit 101. Image) is changed to image data in the (Y, U, V) color space, and signal pixels are extracted. Here, the conversion between the color space of (R, G, B) and the color space of (Y, U, V) is performed by the following equation (6).

この例では、撮影装置１２によって予め信号機の信号を撮影して得られる撮影画像（「信号画像サンプル」と称する）を集めた学習により、信号機の信号の色値の範囲を示す信号認識辞書を予め作成しておく。例えば赤信号を示す（Ｕ，Ｖ）値分布と、赤信号以外の（Ｕ，Ｖ）値分布とを分離する分離面を、両者の間のマージン（距離）が最大になるように配置する。そして、その分離面を示す評価関数を、赤信号を認識するための赤信号認識辞書として算出することができる。以下の式（７）は、評価関数を表す式である。式（７）において、ｆ（Ｕ，Ｖ）は、赤信号認識辞書を示す評価関数であり、ａ，ｂ，ｃは、この評価関数の係数である。青信号認識辞書および黄信号認識辞書も同様の方法で作成することができる。

In this example, a signal recognition dictionary indicating the range of color values of the signal of the traffic light is obtained in advance by learning by collecting captured images (referred to as "signal image samples") obtained by photographing the signal of the traffic light in advance by the photographing device 12. Create it. For example, a separation surface that separates the (U, V) value distribution showing a red signal and the (U, V) value distribution other than the red signal is arranged so that the margin (distance) between the two is maximized. Then, the evaluation function indicating the separation surface can be calculated as a red signal recognition dictionary for recognizing the red signal. The following equation (7) is an equation representing the evaluation function. In the equation (7), f (U, V) is an evaluation function indicating a red light recognition dictionary, and a, b, and c are coefficients of this evaluation function. A green signal recognition dictionary and a yellow signal recognition dictionary can be created in the same manner.

候補領域認識部１１１は、（Ｙ，Ｕ，Ｖ）色空間の撮影画像を構成する画素ごとに、該画素の（Ｕ，Ｖ）値を、各信号色（赤、黄および青の何れか）に対応する評価関数ｆ（Ｕ，Ｖ）に代入して、該評価関数ｆ（Ｕ，Ｖ）の値を求める。算出した評価関数ｆ（Ｕ，Ｖ）の値が、予め定めた閾値以上であれば、その画素は、対応する信号色の信号画素であると判断する。例えば最小値の閾値と最大値の閾値とを設けて信号画素を抽出してもよい。赤、青、黄色のそれぞれのＵの最小値および最大値、Ｖの最小値および最大値の閾値を設定してもよい。また、信号画素以外の画素の（Ｕ、Ｖ）値と重ならないように閾値を設定する。 The candidate area recognition unit 111 sets the (U, V) value of each pixel constituting the captured image in the (Y, U, V) color space to each signal color (any of red, yellow, and blue). Is assigned to the evaluation function f (U, V) corresponding to, and the value of the evaluation function f (U, V) is obtained. If the value of the calculated evaluation function f (U, V) is equal to or greater than a predetermined threshold value, it is determined that the pixel is a signal pixel of the corresponding signal color. For example, a signal pixel may be extracted by setting a minimum value threshold value and a maximum value threshold value. Thresholds for the minimum and maximum values of U and the minimum and maximum values of V, respectively, of red, blue, and yellow may be set. Further, the threshold value is set so as not to overlap with the (U, V) values of pixels other than the signal pixel.

図１０は、画素値の（Ｕ、Ｖ）空間での閾値処理で抽出した赤信号の画素例を示す図である。図１０の画像中の信号画素（赤信号を示す画素）を含む信号画素領域では、飽和した画素、ノイズ画素が含まれるので、抽出した信号画素領域は、赤信号を示す実際の領域の一部しかない。そこで、候補領域認識部１１１は、抽出した信号画素に対して、膨張処理を行う。膨張処理は、１画素に対して、オリジナルの信号画像からＮ×Ｎの画素ブロックを加える。例えばＮ＝７のとき、膨張対象の１画素に対して、オリジナルの信号画像から、７×７の画素ブロックを加える。図１１は、図１０の信号画素領域を膨張処理した後の画像例を示す図である。以下では、膨張処理した後の信号画素領域を「膨張領域」と称する。 FIG. 10 is a diagram showing a pixel example of a red signal extracted by threshold processing in the (U, V) space of pixel values. Since the signal pixel region including the signal pixels (pixels showing the red signal) in the image of FIG. 10 includes saturated pixels and noise pixels, the extracted signal pixel region is a part of the actual region showing the red signal. There is only. Therefore, the candidate area recognition unit 111 performs expansion processing on the extracted signal pixels. In the expansion process, an N × N pixel block is added to one pixel from the original signal image. For example, when N = 7, a 7 × 7 pixel block is added to one pixel to be expanded from the original signal image. FIG. 11 is a diagram showing an example of an image after the signal pixel region of FIG. 10 has been expanded. Hereinafter, the signal pixel region after the expansion processing is referred to as an “expansion region”.

信号形状認識部１１２は、膨張領域に対して、信号形状認識処理を行う。赤、青、黄信号に対して、円形抽出により、形状認識を行う。ここで、膨張領域に対して、円抽出により、形状認識を行う。円抽出処理はＨｏｕｇｈ変換により円抽出を行うことができる。信号形状認識部１１２により円が抽出された場合、信号領域決定部１１３は、その抽出された円領域に外接する矩形を求める。そして、信号領域決定部１１３は、この矩形領域を信号領域として決定し、該信号領域に対応する特徴量（認識結果を示す特徴量データ）を算出する。図１２に示す太線で描かれた円は、Ｈｏｕｇｈ変換により抽出した赤信号の円領域を示す。図１３に示す矩形領域は、信号領域決定部１１３によって決定された信号領域を示す。図５に示す矩形領域は、図４の撮影画像から認識された候補領域（赤信号を示す信号領域）を示す図である。 The signal shape recognition unit 112 performs signal shape recognition processing on the expansion region. Shape recognition is performed for red, blue, and yellow signals by circular extraction. Here, the shape of the expanded region is recognized by extracting a circle. The circle extraction process can perform circle extraction by Hough transform. When a circle is extracted by the signal shape recognition unit 112, the signal area determination unit 113 obtains a rectangle circumscribing the extracted circle area. Then, the signal area determination unit 113 determines this rectangular area as a signal area, and calculates the feature amount (feature amount data indicating the recognition result) corresponding to the signal area. The circle drawn by the thick line shown in FIG. 12 indicates the circle region of the red signal extracted by the Hough transform. The rectangular region shown in FIG. 13 indicates a signal region determined by the signal region determination unit 113. The rectangular region shown in FIG. 5 is a diagram showing a candidate region (signal region showing a red light) recognized from the captured image of FIG.

図１４は、本実施形態の認識処理部１６の動作例を示すフローチャートである。各ステップの具体的な内容は上述したとおりであるので、詳細な説明は適宜に省略する。図１４に示すように、まず画像取得部１０１は、撮影画像を取得する（ステップＳ１）。次に、信号領域認識部１０２は、ステップＳ１で取得された撮影画像のうち、信号機の信号を示す信号領域を認識する（ステップＳ２）。また、以上の処理と並行して、時間帯判別部１０４は、ステップＳ１により取得された撮影画像が撮影された時間帯を判別する（ステップＳ３）。 FIG. 14 is a flowchart showing an operation example of the recognition processing unit 16 of the present embodiment. Since the specific contents of each step are as described above, detailed description thereof will be omitted as appropriate. As shown in FIG. 14, first, the image acquisition unit 101 acquires a captured image (step S1). Next, the signal area recognition unit 102 recognizes the signal area indicating the signal of the traffic light among the captured images acquired in step S1 (step S2). Further, in parallel with the above processing, the time zone determination unit 104 determines the time zone in which the captured image acquired in step S1 was captured (step S3).

次に、選択部１０６は、ステップＳ３で判別された時間帯と、位置情報取得部１０５により取得された位置情報により特定される国または地域との組み合わせに対応する評価関数を選択する（ステップＳ４）。次に、判断部１０３は、ステップＳ２で認識された信号領域（特徴量データ）と、ステップＳ４で選択された評価関数と、に基づいて、該信号領域が対象信号を示す領域であるか否かを判断する判断処理を行う（ステップＳ５）。出力部１０７は、ステップＳ５の判断処理により、対象信号を示す領域であると判断された信号領域を、認識結果として出力する（ステップＳ６）。 Next, the selection unit 106 selects an evaluation function corresponding to the combination of the time zone determined in step S3 and the country or region specified by the position information acquired by the position information acquisition unit 105 (step S4). ). Next, the determination unit 103 determines whether or not the signal region is a region indicating the target signal based on the signal region (feature amount data) recognized in step S2 and the evaluation function selected in step S4. Performs a determination process to determine whether or not (step S5). The output unit 107 outputs the signal area determined to be the area indicating the target signal by the determination process in step S5 as the recognition result (step S6).

以上に説明したように、本実施形態では、検出対象となる信号機の信号を示す対象信号を識別するために予め定められた対象信号識別情報と、信号領域認識部１０２により認識された信号領域と、に基づいて、該信号領域が、対象信号を示す領域であるか否かを判断する。本実施形態によれば、撮影画像に映り込んだ信号機の信号を高精度に認識することができる。 As described above, in the present embodiment, the target signal identification information predetermined for identifying the target signal indicating the signal of the signal to be detected and the signal region recognized by the signal region recognition unit 102. Based on, it is determined whether or not the signal region is a region indicating a target signal. According to this embodiment, the signal of the traffic light reflected in the captured image can be recognized with high accuracy.

次に、撮影装置１２のハードウェア構成の一例を説明する。図１５は、撮影装置１２のハードウェア構成の一例を示す図である。 Next, an example of the hardware configuration of the photographing device 12 will be described. FIG. 15 is a diagram showing an example of the hardware configuration of the photographing device 12.

撮影装置１２は、撮影光学系２０１０、メカシャッタ２０２０、モータドライバ２０３０、ＣＣＤ（ＣｈａｒｇｅＣｏｕｐｌｅｄＤｅｖｉｃｅ）２０４０、ＣＤＳ（ＣｏｒｒｅｌａｔｅｄＤｏｕｂｌｅＳａｍｐｌｉｎｇ：相関２重サンプリング）回路２０５０、Ａ／Ｄ変換器２０６０、タイミング信号発生器２０７０、画像処理回路２０８０、ＬＣＤ（ＬｉｑｕｉｄＣｒｙｓｔａｌＤｉｓｐｌａｙ）２０９０、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）２１００、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）２１１０、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）２１２０、ＳＤＲＡＭ（ＳｙｎｃｈｒｏｎｏｕｓＤｙｎａｍｉｃＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）２１３０、圧縮伸張回路２１４０、メモリ２１５０、操作部２１６０、および、出力Ｉ／Ｆ２１７０を備える。 The photographing device 12 includes a photographing optical system 2010, a mechanical shutter 2020, a motor driver 2030, a CCD (Charge Coupled Device) 2040, a CDS (Correlated Double Sample) circuit 2050, an A / D converter 2060, and a timing signal generator. 2070, image processing circuit 2080, LCD (Liquid Crystal Device) 2090, CPU (Central Processing Unit) 2100, RAM (Random Access Memory) 2110, ROM (Read Only Memory) 2120, SRAM (Syndy) It includes an extension circuit 2140, a memory 2150, an operation unit 2160, and an output I / F 2170.

画像処理回路２０８０、ＣＰＵ２１００、ＲＡＭ２１１０、ＲＯＭ２１２０、ＳＤＲＡＭ２１３０、圧縮伸張回路２１４０、メモリ２１５０、操作部２１６０、および、出力Ｉ／Ｆ２１７０は、バス２２００を介して接続されている。 The image processing circuit 2080, CPU 2100, RAM 2110, ROM 2120, SDRAM 2130, compression / decompression circuit 2140, memory 2150, operation unit 2160, and output I / F 2170 are connected via a bus 2200.

撮影光学系２０１０は、被写体の反射光を集光する。メカシャッタ２０２０は、所定の時間、開くことにより、撮影光学系２０１０により集光された光をＣＣＤ２０４０に入射させる。モータドライバ２０３０は、撮影光学系２０１０およびメカシャッタ２０２０を駆動する。 The photographing optical system 2010 collects the reflected light of the subject. By opening the mechanical shutter 2020 for a predetermined time, the light focused by the photographing optical system 2010 is incident on the CCD 2040. The motor driver 2030 drives the photographing optical system 2010 and the mechanical shutter 2020.

ＣＣＤ２０４０は、メカシャッタ２０２０を介して入射した光を被写体の像として結像し、当該被写体の像を示すアナログの画像データをＣＤＳ回路２０５０に入力する。 The CCD 2040 forms an image of the light incident through the mechanical shutter 2020 as an image of the subject, and inputs analog image data showing the image of the subject to the CDS circuit 2050.

ＣＤＳ回路２０５０は、ＣＣＤ２０４０からアナログの画像データを受け付けると、当該画像データのノイズ成分を除去する。また、ＣＤＳ回路２０５０は、相関二重サンプリングやゲインコントロールなどのアナログ処理を行う。そして、ＣＤＳ回路２０５０は、処理後のアナログの画像データを、Ａ／Ｄ変換器２０６０へ出力する。 When the CDS circuit 2050 receives analog image data from the CCD 2040, the CDS circuit 2050 removes the noise component of the image data. Further, the CDS circuit 2050 performs analog processing such as correlated double sampling and gain control. Then, the CDS circuit 2050 outputs the processed analog image data to the A / D converter 2060.

Ａ／Ｄ変換器２０６０は、ＣＤＳ回路２０５０からアナログの画像データを受け付けると、当該アナログの画像データをデジタルの画像データに変換する。Ａ／Ｄ変換器２０６０はデジタルの画像データを画像処理回路２０８０に入力する。タイミング信号発生器２０７０はＣＰＵ２１００からの制御信号に応じて、ＣＣＤ２０４０、ＣＤＳ回路２０５０、およびＡ／Ｄ変換器２０６０にタイミング信号を送信することにより、ＣＣＤ２０４０、ＣＤＳ回路２０５０、およびＡ／Ｄ変換器２０６０の動作タイミングを制御する。 When the A / D converter 2060 receives the analog image data from the CDS circuit 2050, the A / D converter 2060 converts the analog image data into digital image data. The A / D converter 2060 inputs digital image data to the image processing circuit 2080. The timing signal generator 2070 transmits a timing signal to the CCD 2040, the CDS circuit 2050, and the A / D converter 2060 in response to the control signal from the CPU 2100, thereby transmitting the timing signal to the CCD 2040, the CDS circuit 2050, and the A / D converter 2060. Controls the operation timing of.

画像処理回路２０８０は、Ａ／Ｄ変換器２０６０からデジタルの画像データを受け付けると、ＳＤＲＡＭ２１３０を使用して、当該デジタルの画像データの画像処理を行う。画像処理は、例えばＣｒＣｂ変換処理、ホワイトバランス制御処理、コントラスト補正処理、エッジ強調処理、および、色変換処理等である。 When the image processing circuit 2080 receives the digital image data from the A / D converter 2060, the image processing circuit 2080 uses the SDRAM 2130 to perform image processing of the digital image data. The image processing includes, for example, CrCb conversion processing, white balance control processing, contrast correction processing, edge enhancement processing, color conversion processing, and the like.

画像処理回路２０８０は上述の画像処理が行われた画像データをＬＣＤ２０９０、または、圧縮伸張回路２１４０に出力する。ＬＣＤ２０９０は画像処理回路２０８０から受け付けた画像データを表示する液晶ディスプレイである。 The image processing circuit 2080 outputs the image data subjected to the above-mentioned image processing to the LCD 2090 or the compression / decompression circuit 2140. The LCD 2090 is a liquid crystal display that displays image data received from the image processing circuit 2080.

圧縮伸張回路２１４０は、画像処理回路２０８０から画像データを受け付けると、当該画像データを圧縮する。圧縮伸張回路２１４０は圧縮された画像データを、メモリ２１５０に記憶する。また、圧縮伸張回路２１４０はメモリ２１５０から画像データを受け付けると、当該画像データを伸張する。圧縮伸張回路２１４０は伸張された画像データをＳＤＲＡＭ２１３０に一時的に記憶する。メモリ２１５０は圧縮された画像データを記憶する。 When the compression / decompression circuit 2140 receives the image data from the image processing circuit 2080, the compression / decompression circuit 2140 compresses the image data. The compression / decompression circuit 2140 stores the compressed image data in the memory 2150. Further, when the compression / decompression circuit 2140 receives the image data from the memory 2150, the compression / decompression circuit 2140 decompresses the image data. The compression / decompression circuit 2140 temporarily stores the decompressed image data in the SDRAM 2130. Memory 2150 stores compressed image data.

出力Ｉ／Ｆ２１７０は、画像処理回路２０８０で処理された画像データを、撮影画像Ｐとして、情報処理装置１０へ出力する。 The output I / F 2170 outputs the image data processed by the image processing circuit 2080 to the information processing apparatus 10 as a captured image P.

なお上述の図２で説明したインタフェース部１４、および認識処理部１６に含まれる機能部の少なくとも一部を、信号処理ボード（信号処理回路）として撮影装置１２に実装してもよい。 At least a part of the functional unit included in the interface unit 14 and the recognition processing unit 16 described with reference to FIG. 2 may be mounted on the photographing device 12 as a signal processing board (signal processing circuit).

次に、上記実施の形態の、情報処理装置１０のハードウェア構成を説明する。図１６は、上記実施形態の情報処理装置１０のハードウェア構成例を示すブロック図である。 Next, the hardware configuration of the information processing apparatus 10 according to the above embodiment will be described. FIG. 16 is a block diagram showing a hardware configuration example of the information processing device 10 of the above embodiment.

上記実施形態の情報処理装置１０は、出力部８０、Ｉ／Ｆ部８２、入力部９４、ＣＰＵ８６、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）８８、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）９０、およびＨＤＤ９２等がバス９６により相互に接続されており、通常のコンピュータを利用したハードウェア構成となっている。 In the information processing device 10 of the above embodiment, the output unit 80, the I / F unit 82, the input unit 94, the CPU 86, the ROM (Read Only Memory) 88, the RAM (Random Access Memory) 90, the HDD 92, and the like are mutually connected by the bus 96. It is connected to and has a hardware configuration using a normal computer.

ＣＰＵ８６は、上記実施形態の情報処理装置１０で実行する処理を制御する演算装置である。ＲＡＭ９０は、ＣＰＵ８６による各種処理に必要なデータを記憶する。ＲＯＭ８８は、ＣＰＵ８６による各種処理を実現するプログラム等を記憶する。ＨＤＤ９２は、上述した記憶部１８に格納されるデータを記憶する。Ｉ／Ｆ部８２は、他の装置との間でデータを送受信するためのインタフェースである。 The CPU 86 is an arithmetic unit that controls the processing executed by the information processing device 10 of the above embodiment. The RAM 90 stores data required for various processes by the CPU 86. The ROM 88 stores a program or the like that realizes various processes by the CPU 86. The HDD 92 stores the data stored in the storage unit 18 described above. The I / F unit 82 is an interface for transmitting and receiving data to and from other devices.

上記実施形態の情報処理装置１０で実行される上記各種処理を実行するためのプログラムは、ＲＯＭ８８等に予め組み込んで提供される。 A program for executing the various processes executed by the information processing apparatus 10 of the above embodiment is provided by being incorporated in ROM 88 or the like in advance.

なお、上記実施形態の情報処理装置１０で実行されるプログラムは、これらの装置にインストール可能な形式又は実行可能な形式のファイルでＣＤ−ＲＯＭ、フレキシブルディスク（ＦＤ）、ＣＤ−Ｒ、ＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｋ）等のコンピュータで読み取り可能な記録媒体に記録されて提供するように構成してもよい。 The program executed by the information processing device 10 of the above-described embodiment is a file in a format that can be installed or executed in these devices, and is a CD-ROM, a flexible disk (FD), a CD-R, or a DVD (Digital). It may be configured to be recorded and provided on a computer-readable recording medium such as a Versail Disc).

また、上記実施形態の情報処理装置１０で実行されるプログラムを、インターネット等のネットワークに接続されたコンピュータ上に格納し、ネットワーク経由でダウンロードさせることにより提供するように構成してもよい。また、上記実施形態の情報処理装置１０における上記各処理を実行するためのプログラムを、インターネット等のネットワーク経由で提供または配布するように構成してもよい。 Further, the program executed by the information processing apparatus 10 of the above embodiment may be stored on a computer connected to a network such as the Internet and provided by downloading the program via the network. Further, the program for executing each of the above processes in the information processing apparatus 10 of the above embodiment may be configured to be provided or distributed via a network such as the Internet.

上記実施形態の情報処理装置１０で実行される上記各種処理を実行するためのプログラムは、上述した各部が主記憶装置上に生成されるようになっている。 Each of the above-described parts of the program for executing the various processes executed by the information processing device 10 of the above-described embodiment is generated on the main storage device.

なお、上記ＨＤＤ９２に格納されている各種情報は、外部装置に格納してもよい。この場合には、該外部装置とＣＰＵ８６と、を、ネットワーク等を介して接続した構成とすればよい。 The various information stored in the HDD 92 may be stored in an external device. In this case, the external device and the CPU 86 may be connected to each other via a network or the like.

１０情報処理装置
１４インタフェース部
１６認識処理部
１８記憶部
１０１画像取得部
１０２信号領域認識部
１０３判断部
１０４時間帯判別部
１０５位置情報取得部
１０６選択部
１０７出力部
１１１候補領域認識部
１１２信号形状認識部
１１３信号領域決定部 10 Information processing device 14 Interface unit 16 Recognition processing unit 18 Storage unit 101 Image acquisition unit 102 Signal area recognition unit 103 Judgment unit 104 Time zone discrimination unit 105 Position information acquisition unit 106 Selection unit 107 Output unit 111 Candidate area recognition unit 112 Signal shape Recognition unit 113 Signal area determination unit

特開２００９−２４４９４６号公報JP-A-2009-2449446

Claims

The acquisition unit that acquires the captured image and
Among the captured images acquired by the acquisition unit, a signal area recognition unit that recognizes a signal area indicating a signal of a traffic light and a signal area recognition unit.
Based on the target signal identification information predetermined for identifying the target signal indicating the signal of the signal to be detected and the signal region recognized by the signal region recognition unit, the signal region is the said. It is equipped with a judgment unit that determines whether or not the area indicates the target signal.
The target signal identification information is obtained from at least a plurality of color signals and edges of the captured image on a separation surface between a region of the target signal in the captured image and a false recognition region indicating a region of a signal different from the target signal. An information processing device that is an evaluation function calculated from a feature quantity vector consisting of the distance of and the width of the signal area .

The target signal identification information is created in advance by using a machine learning method using SVM (Support Vector Machine).
The information processing device according to claim 1.

The target signal identification information is predetermined for each of a plurality of time zones.
A time zone determination unit that determines the time zone in which the captured image was taken, and
A selection unit for selecting the target signal identification information corresponding to the time zone determined by the time zone determination unit is further provided.
The information processing device according to any one of claims 1 and 2 .

The target signal identification information is predetermined for each country or region.
The location information acquisition unit that acquires location information and
A selection unit for selecting the target signal identification information corresponding to the position information acquired by the position information acquisition unit is further provided.
The information processing device according to any one of claims 1 to 3 .

The signal area recognition unit
When a signal pixel region including a signal pixel indicating the signal color of a traffic light is extracted from the captured image, and the expanded region obtained by expanding the extracted signal pixel region includes a fixed region having a predetermined shape, the fixed shape is used. Recognizing a region including a region as the signal region,
The information processing device according to any one of claims 1 to 4 .

When the expansion region includes the circular standard region, the signal region recognition unit recognizes the region including the standard region as the signal region.
The information processing device according to claim 5 .

The acquisition step to acquire the captured image and
Among the captured images acquired by the acquisition step, a signal region recognition step for recognizing a signal region indicating a signal of a traffic light and a signal region recognition step.
A separation surface is defined in advance to identify a target signal indicating a signal of a signal to be detected, and a separation surface between a target signal region in the captured image and a false recognition region indicating a signal region different from the target signal. , At least a plurality of color signals, target signal identification information which is an evaluation function calculated from a feature quantity vector consisting of a distance from an edge of a captured image and a width of a signal region, and the signal region recognized by the signal region recognition step. A determination step for determining whether or not the signal region is a region indicating the target signal based on the above.
Information processing method.

On the computer
The acquisition step to acquire the captured image and
Among the captured images acquired by the acquisition step, a signal region recognition step for recognizing a signal region indicating a signal of a traffic light and a signal region recognition step.
A separation surface is defined in advance to identify a target signal indicating a signal of a signal to be detected, and a separation surface between a target signal region in the captured image and a false recognition region indicating a signal region different from the target signal. , At least a plurality of color signals, target signal identification information which is an evaluation function calculated from a feature quantity vector consisting of a distance from an edge of a captured image and a width of a signal region, and the signal region recognized by the signal region recognition step. A program for executing a determination step of determining whether or not the signal region is a region indicating the target signal based on the above.