JP2012234432A

JP2012234432A - Vanishing point calculation device, vanishing point calculation method and program

Info

Publication number: JP2012234432A
Application number: JP2011103569A
Authority: JP
Inventors: Kaori Kataoka; 香織片岡; Akira Suzuki; 章鈴木; Masashi Morimoto; 正志森本; Hideki Koike; 秀樹小池
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2011-05-06
Filing date: 2011-05-06
Publication date: 2012-11-29

Abstract

PROBLEM TO BE SOLVED: To provide a vanishing point calculation device capable of stably calculating a vanishing point even when many straight lines are included in an image in addition to straight lines passing through the vanishing point.SOLUTION: Images are prepared as plural learning images, Global feature amounts are calculated from the learning images based on each category, and a discriminator is previously created by leaning the feature amounts. Scene constitution estimation means 21 calculates feature amounts from a taken input image and inputs the resultant in a previously created scene constitution discriminator, and estimates a vanishing point existing candidate area which is a candidate area where a vanishing point may exist based on the category output from the scene constitution discriminator. Vanishing point calculation means 22 detects only straight lines that pass through the vanishing point existing candidate area as a straight line group from an image which has been subjected to a straight line detection on the input image, and determines an intersection point of two major straight lines from the straight line group as the vanishing point.

Description

本発明は、建物や、建物内など面で構成されるシーンを撮影した奥行き情報を有さない１枚の撮影画像から奥行き推定データを生成するための消失点を算出する消失点算出装置、消失点算出方法、及びプログラムに関する。 The present invention relates to a vanishing point calculation device for calculating a vanishing point for generating depth estimation data from a single photographed image that does not have depth information obtained by photographing a building or a scene composed of planes such as in a building, The present invention relates to a point calculation method and a program.

従来、図７（ａ）に示すような一枚の写真の画像データから建物を直方体と仮定し、図７（ｂ）に示すようにエッジ抽出を行い、消失点の位置から３次元復元を行う手法がある（例えば非特許文献１参照）。 Conventionally, a building is assumed to be a rectangular parallelepiped from image data of a single photograph as shown in FIG. 7A, edge extraction is performed as shown in FIG. 7B, and three-dimensional reconstruction is performed from the position of the vanishing point. There is a method (see, for example, Non-Patent Document 1).

情報処理学会研究報告. CVIM, "１枚の画像からの３次元復元の統計的最適化", 2004(113), 117-124, 2004-11-11IPSJ Research Report. CVIM, "Statistical optimization of 3D reconstruction from a single image", 2004 (113), 117-124, 2004-11-11

上述のように、図７（ａ）に示すように、屋内を撮影した画像１００には、天井や、壁の空調、壁、あるいは店舗内の商品類といった、建物を構成している平面を隠ぺいする物体が数多く存在し、図７（ｂ）に示すように、エッジ強調した画像１０１において、複数のエッジの中から消失点を形成するエッジを見分けるのが難しく、正しい消失点を出すのは困難であるという問題がある。 As described above, as shown in FIG. 7A, the image 100 taken indoors conceals a plane that forms a building such as a ceiling, air conditioning of a wall, a wall, or goods in a store. As shown in FIG. 7B, it is difficult to distinguish an edge that forms a vanishing point from a plurality of edges, and it is difficult to obtain a correct vanishing point, as shown in FIG. 7B. There is a problem that.

本発明は、このような事情を考慮してなされたものであり、その目的は、消失点を通る直線以外の直線が多数画像中に含まれていても、安定して消失点を算出することができる消失点算出装置、消失点算出方法、及びプログラムを提供することにある。 The present invention has been made in consideration of such circumstances, and its purpose is to stably calculate the vanishing point even if a number of straight lines other than the straight line passing through the vanishing point are included in the image. A vanishing point calculating device, a vanishing point calculating method, and a program.

上述した課題を解決するために、本発明は、建物などを撮影した一枚の画像から消失点を算出する消失点算出装置であって、消失点の位置毎にカテゴリを作成し、学習用に取得した複数の撮影画像をカテゴリ毎に分類する学習用画像カテゴリ分類手段と、前記学習用画像カテゴリ分類手段によってカテゴリに分類された複数の学習画像から特徴量を抽出し、該特徴量の特性を学習することによりシーン構成判別器を作成するシーン構成判別器作成手段と、面で構成される入力画像を撮影する画像撮影手段と、前記画像撮影手段によって撮影された入力画像から特徴量を抽出し、該特徴量を前記シーン構成判別器に入力し、前記シーン構成判別器から出力されるカテゴリに基づいて、消失点が存在する消失点存在候補領域を推定するシーン構成推定手段と、前記シーン推定手段により推定された消失点存在候補領域に基づいて、消失点を算出する消失点算出手段とを備えることを特徴とする消失点算出装置である。 In order to solve the above-described problem, the present invention is a vanishing point calculation device that calculates a vanishing point from a single image of a building or the like, and creates a category for each vanishing point position for learning. A learning image category classification means for classifying the acquired plurality of captured images for each category, and feature quantities are extracted from the plurality of learning images classified into categories by the learning image category classification means, and characteristics of the feature quantities are obtained. A scene composition discriminator creating means for creating a scene composition discriminator by learning, an image photographing means for photographing an input image composed of planes, and extracting feature quantities from the input image photographed by the image photographing means. , A scene configuration in which the feature amount is input to the scene configuration discriminator, and a vanishing point existence candidate region in which a vanishing point exists is estimated based on a category output from the scene configuration discriminator A constant section, on the basis of the vanishing point existing candidate region estimated by the scene estimating means, a vanishing point computing device, characterized in that it comprises a vanishing point calculation means for calculating a vanishing point.

本発明は、上記の発明において、前記シーン構成推定手段は、複数の解像度の画像から、それぞれのカテゴリ毎に特徴量を算出する特徴量算出手段と、前記特徴量算出手段により算出された特徴量を学習して識別器を作成する学習手段とを備えることを特徴とする。 According to the present invention, in the above invention, the scene configuration estimation unit includes a feature amount calculation unit that calculates a feature amount for each category from an image having a plurality of resolutions, and a feature amount calculated by the feature amount calculation unit. Learning means for creating a discriminator by learning.

本発明は、上記の発明において、前記消失点算出手段は、前記消失点存在候補領域を通過する直線のみを、前記入力画像に対して所定の画像処理を施した画像から直線群として検出し、それらの直線群からランダムに２つの直線を取り出し、該２つの直線の交点を求め、前記直線群の中から、該交点を通過する直線を算出し、その数を交点の得票数とするという作業を、ランダムに２つの直線を選びながら繰り返し、得票数の一番多い交点を消失点とする、ことを特徴とする。 The present invention is the above invention, wherein the vanishing point calculation means detects only a straight line passing through the vanishing point existence candidate region as a straight line group from an image obtained by performing predetermined image processing on the input image, Work to take two straight lines from the straight line group, find the intersection of the two straight lines, calculate a straight line passing through the cross point from the straight line group, and use the number as the number of votes of the intersection Is repeated while randomly selecting two straight lines, and the intersection with the largest number of votes is the vanishing point.

また、上述した課題を解決するために、本発明は、建物などを撮影した一枚の画像から消失点を算出する消失点算出方法であって、消失点の位置毎にカテゴリを作成し、学習用に取得した複数の撮影画像をカテゴリ毎に分類する学習用画像カテゴリ分類ステップと、前記学習用画像カテゴリ分類ステップでカテゴリに分類された複数の学習画像から特徴量を抽出し、該特徴量の特性を学習することによりシーン構成判別器を作成するシーン構成判別器作成ステップと、面で構成される入力画像を撮影する画像撮影ステップと、前記画像撮影ステップで撮影された入力画像から特徴量を抽出し、該特徴量を前記シーン構成判別器に入力し、前記シーン構成判別器から出力されるカテゴリに基づいて、消失点が存在する消失点存在候補領域を推定するシーン構成推定ステップと、前記シーン推定ステップで推定された消失点存在候補領域に基づいて、消失点を算出する消失点算出ステップとを含むことを特徴とする消失点算出方法である。 Further, in order to solve the above-described problem, the present invention is a vanishing point calculation method for calculating a vanishing point from a single image obtained by photographing a building or the like. A learning image category classification step for classifying a plurality of captured images acquired for each category, and extracting feature amounts from the plurality of learning images classified into categories in the learning image category classification step, A scene composition discriminator creating step for creating a scene composition discriminator by learning characteristics, an image photographing step for photographing an input image composed of faces, and a feature amount from the input image photographed in the image photographing step Extract and input the feature quantity to the scene configuration discriminator, and estimate the vanishing point existence candidate region where the vanishing point exists based on the category output from the scene configuration discriminator That the scene structure estimation step, on the basis of the vanishing point existing candidate region estimated by the scene estimation step, a vanishing point calculation method which comprises a vanishing point calculation step of calculating a vanishing point.

本発明は、上記の発明において、前記シーン構成推定ステップは、複数の解像度の画像から、それぞれのカテゴリ毎に特徴量を算出する特徴量算出ステップと、前記特徴量算出ステップで算出された特徴量を学習して識別器を作成する学習ステップとからなることを特徴とする。 According to the present invention, in the above invention, the scene configuration estimation step includes a feature amount calculation step for calculating a feature amount for each category from an image having a plurality of resolutions, and a feature amount calculated by the feature amount calculation step. And a learning step for creating a discriminator.

本発明は、上記の発明において、前記消失点算出ステップは、前記消失点存在候補領域を通過する直線のみを、前記入力画像に対して所定の画像処理を施した画像から直線群として検出し、それらの直線群からランダムに２つの直線を取り出し、該２つの直線の交点を求め、前記直線群の中から、該交点を通過する直線を算出し、その数を交点の得票数とするという作業を、ランダムに２つの直線を選びながら繰り返し、得票数の一番多い交点を消失点とする、ことを特徴とする。 According to the present invention, in the above invention, the vanishing point calculating step detects only a straight line passing through the vanishing point existence candidate region as a straight line group from an image obtained by performing predetermined image processing on the input image, Work to take two straight lines from the straight line group, find the intersection of the two straight lines, calculate a straight line passing through the cross point from the straight line group, and use the number as the number of votes of the intersection Is repeated while randomly selecting two straight lines, and the intersection with the largest number of votes is the vanishing point.

また、上述した課題を解決するために、本発明は、建物などを撮影した一枚の画像から消失点を算出する消失点算出装置のコンピュータに、消失点の位置毎にカテゴリを作成し、学習用に取得した複数の撮影画像をカテゴリ毎に分類する学習用画像カテゴリ分類機能、前記学習用画像カテゴリ分類機能でカテゴリに分類された複数の学習画像から特徴量を抽出し、該特徴量の特性を学習することによりシーン構成判別機能を作成するシーン構成判別器作成機能、面で構成される入力画像を撮影する画像撮影機能、前記画像撮影機能で撮影された入力画像から特徴量を抽出し、該特徴量を前記シーン構成判別機能に入力し、前記シーン構成判別機能から出力されるカテゴリに基づいて、消失点が存在する消失点存在候補領域を推定するシーン構成推定機能、前記シーン推定機能で推定された消失点存在候補領域に基づいて、消失点を算出する消失点算出機能を実行させることを特徴とするプログラムである。 In order to solve the above-described problem, the present invention creates a category for each vanishing point position in a computer of a vanishing point calculation device that calculates a vanishing point from a single image of a building or the like, and learns it. A learning image category classification function for classifying a plurality of captured images acquired for each category, a feature amount is extracted from a plurality of learning images classified into categories by the learning image category classification function, and characteristics of the feature amount are extracted A scene composition discriminator creating function for creating a scene composition discriminating function by learning the above, an image photographing function for photographing an input image composed of faces, and extracting feature quantities from the input image photographed by the image photographing function, The feature amount is input to the scene configuration determination function, and a scene configuration for estimating a vanishing point existence candidate region where a vanishing point exists is based on the category output from the scene configuration determination function. Estimating function, on the basis of the vanishing point existing candidate region estimated by the scene estimation function, a program characterized by executing the vanishing point calculation function of calculating a vanishing point.

この発明によれば、消失点を通る直線以外の直線が多数画像中に含まれていても、安定して消失点を算出することができる。 According to the present invention, the vanishing point can be calculated stably even if a large number of straight lines other than the straight line passing through the vanishing point are included in the image.

本実施形態によるカテゴリ例を説明するための概念図である。It is a conceptual diagram for demonstrating the example of a category by this embodiment. 本実施形態によるシーン構成判別器の構成を示すブロック図である。It is a block diagram which shows the structure of the scene structure discrimination device by this embodiment. 本実施形態による、入力画像から消失点を算出するための消失点算出装置の構成を示すブロック図である。It is a block diagram which shows the structure of the vanishing point calculation apparatus for calculating a vanishing point from an input image by this embodiment. 消失点２点が画像内に含まれている場合のカテゴリの例を示す模式図である。It is a schematic diagram which shows the example of a category when two vanishing points are contained in the image. 本実施形態によるシーン構成判別器を作成するための特徴量を説明するための概念図である。It is a conceptual diagram for demonstrating the feature-value for creating the scene structure discriminator by this embodiment. 本実施形態において、消失点算出方法を説明するための概念図である。In this embodiment, it is a conceptual diagram for demonstrating the vanishing point calculation method. 従来技術において消失点を算出するのを困難にする要因を説明する概念図である。It is a conceptual diagram explaining the factor which makes it difficult to calculate a vanishing point in a prior art.

消失点の位置が分かっている学習画像がある。消失点の位置によって画像をカテゴリに分ける。画像の特徴量を利用してカテゴリを識別する識別器を作る。実際の入力画像に対しては、識別器を利用することでカテゴリが定まる。カテゴリが定まれば、消失点の存在候補領域が分かる。その候補領域に限定して直線の交点が多い点（投票処理）を消失点とする。 There is a learning image in which the position of the vanishing point is known. Divide images into categories according to vanishing point positions. A discriminator for identifying a category is created using the feature amount of the image. For an actual input image, a category is determined by using a discriminator. If the category is determined, the vanishing point existence candidate region is known. A point (voting process) with many straight line intersections limited to the candidate area is defined as a vanishing point.

より具体的には、予め対象としている平面構造の場所を、さまざまな方向から複数枚撮影し、その撮影方向、または消失点の画像上での場所などを元にカテゴリ分けを行う。例えば、図１（ａ）〜（ｃ）に示すように、建物内の通路から天井に向けて撮影した場合に想定される平面構造３カテゴリ２０１、２０２、２０３に分け、それぞれのパターンの撮影画像を複数枚学習用画像として準備する。それぞれのカテゴリ毎に学習画像からＧｌｏｂａｌ特徴量を算出し、その特徴量を学習して識別器を作成する。入力画像がどのカテゴリに所属するか識別器にかけることで推定し、大よその消失点の位置（消失点存在候補領域）を推定する。 More specifically, a plurality of locations of the planar structure targeted in advance are photographed from various directions, and categorization is performed based on the photographing direction or the location of the vanishing point on the image. For example, as shown in FIGS. 1A to 1C, the planar structure is divided into three categories 201, 202, and 203 that are assumed when the image is taken from the passage in the building toward the ceiling, and the captured images of the respective patterns. Are prepared as learning images. A global feature value is calculated from the learning image for each category, and the feature value is learned to create a discriminator. The category to which the input image belongs is estimated by applying to the classifier, and the position of the vanishing point (vanishing point existence candidate region) is estimated.

消失点算出の際には、消失点存在候補領域を通過する直線のみに絞ることができる。これにより、消失点を形成する直線を、より正確に抽出することが可能となり、精密に消失点の位置を算出することができる。上記のように、消失点を算出する前に、入力画像が予めどのような平面構造の場所を撮影したものか推定することにより、消失点を形成する直線以外の直線の多い一枚の入力画像からであっても、ある程度、消失点位置の精度を保つことが可能となる。また、これによって、消失点を用いて画像中の３次元位置を推定し、擬似的に立体画像を生成することはもちろん、３次元位置推定値を用いて画像のセグメンテーションを行い、所望の領域を抽出することも可能となる。 When calculating the vanishing point, it is possible to narrow down only to a straight line passing through the vanishing point existence candidate region. As a result, the straight line forming the vanishing point can be extracted more accurately, and the position of the vanishing point can be accurately calculated. As described above, before calculating the vanishing points, by estimating or not the input image is obtained by photographing the location of any planar structure in advance, linear non-linear with many single input image to form a vanishing point Even from this point, the accuracy of the vanishing point position can be maintained to some extent. In addition, by this, it is possible to estimate the three-dimensional position in the image using the vanishing point and generate a pseudo three-dimensional image, and of course, perform segmentation of the image using the three-dimensional position estimated value to obtain a desired region. It is also possible to extract.

以下、本発明の一実施形態を、図面を参照して説明する。
図２は、本実施形態によるシーン構成判別器の構成を示すブロック図である。図２において、シーン構成判別器１０は、複数枚撮影した学習用画像をカテゴリ分けする学習用画像カテゴリ分類手段１１とシーン構成判別器作成手段１２とからなる。 Hereinafter, an embodiment of the present invention will be described with reference to the drawings.
FIG. 2 is a block diagram showing the configuration of the scene configuration discriminator according to this embodiment. In FIG. 2, the scene configuration discriminator 10 includes a learning image category classification unit 11 and a scene configuration discriminator creation unit 12 that categorize a plurality of learning images.

学習用画像カテゴリ分類手段１１は、まず、カテゴリを決め、各カテゴリに分類される学習用画像を複数用意する。カテゴリは、地下街の通路といったような予め対象としている場所を撮影方向毎、または消失点の場所毎に分類して作成する。例えば、図１（ａ）〜（ｃ）に示すような地下街の通路から天井と壁とを撮影した３つのカテゴリの例を元に説明する。 The learning image category classification unit 11 first determines a category, and prepares a plurality of learning images classified into each category. A category is created by classifying a target location such as a passage in an underground shopping street for each shooting direction or each vanishing point location. For example, description will be made based on examples of three categories in which a ceiling and a wall are photographed from a passage in an underground shopping street as shown in FIGS.

図１（ａ）において、２０１は、通路の方向正面に向かって撮影しているシーン、図１（ｂ）において、２０３は、向かって右側の地面に垂直な平面と天井とで構成されるシーン、図１（ｃ）において、２０２は、向かって左側の地面に垂直な平面と天井で構成されているシーンといった３つのカテゴリに分けることができる。この３つのカテゴリは、消失点の存在領域によって分類されている。 In FIG. 1 (a), 201 is a scene that is photographed in front of the direction of the passage, and in FIG. 1 (b), 203 is a scene that includes a plane perpendicular to the ground on the right side and a ceiling. In FIG. 1C, 202 can be divided into three categories such as a scene composed of a plane perpendicular to the ground on the left side and the ceiling. These three categories are classified according to the vanishing point existing area.

カテゴリ２０１は、消失点のｘ座標が領域２０４に、カテゴリ２０２は、消失点のｘ座標が領域２０５に、カテゴリ２０３は、消失点のｘ座標が領域２０６にある場合に分けている。これに限らず、床が写っているシーンなど、これ以外にも学習させることが可能である。図１（ａ）〜（ｃ）では、消失点１つで撮影方向毎に分類したが、消失点の個数毎に分類してもよい。例えば、消失点２つの例として、図４に示すように、消失点５０１と５０２の２つを持つようなシーンをカテゴリに加えてもよい。 The category 201 is divided when the x-coordinate of the vanishing point is in the region 204, the category 202 is divided into the x-coordinate of the vanishing point in the region 205, and the category 203 is divided when the x-coordinate of the vanishing point is in the region 206. However, the present invention is not limited to this, and other scenes such as a scene with a floor can be learned. In FIGS. 1A to 1C, although one vanishing point is classified for each photographing direction, the vanishing points may be classified for each number. For example, as an example of two vanishing points, a scene having two vanishing points 501 and 502 may be added to the category as shown in FIG.

次に、シーン構成判別器作成手段１２について説明する。シーン構成判別器作成手段１２は、特徴量算出手段１２１と該特徴量算出手段１２１により算出された特徴量を学習して判別器を作成する学習手段１２２とから構成されている。 Next, the scene configuration discriminator creating means 12 will be described. The scene configuration discriminator creating unit 12 includes a feature amount calculating unit 121 and a learning unit 122 that learns the feature amount calculated by the feature amount calculating unit 121 and creates a discriminator.

特徴量算出手段１２１は、まず、複数の解像度の画像を作成する。次に、図５（ａ）に示す入力画像３０１の各方向（垂直、水平、斜め方向）のＧａｂｏｒｆｉｌｔｅｒをかけた画像［文献１：J.G. Daugman: Uncertainty relations for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters, Journal of the Optical Society of America A, 1985, vol. 2, pp. 1160-1169.］を作成し、図５（ｂ）に示す、４×４の１６等分の画像３０２を作成する。 The feature amount calculation unit 121 first creates images with a plurality of resolutions. Next, an image obtained by applying a Gaborfilter in each direction (vertical, horizontal, and diagonal directions) of the input image 301 shown in FIG. 5A [Reference 1: JG Daugman: Uncertainty relations for resolution in space, spatial frequency, and orientation optimized. by two-dimensional visual cortical filters, Journal of the Optical Society of America A, 1985, vol. 2, pp. 1160-1169.], as shown in Fig. 5 (b). An image 302 is created.

Ｇａｂｏｒｆｉｌｔｅｒを施した画像（Ｇａｂｏｒ画像）３０２を２値化している場合には、１６等分にされた各領域で画素値が１の画素数を数える。Ｇａｂｏｒ画像３０２が図示するようにエッジ強度に応じた２５６色画像の場合、画素値を数えてもよい。または、分割された１６の各領域の平均値をとる。以上の処理を各解像度で行う。例えば、１／４、１／１０の２種類の解像度で行う場合、１枚の入力画像に対して、２（解像度）×１６（領域／枚）×４（方向の微分成分）＝１２８次元の特徴量となる。 In the case where the image (Gabor image) 302 to which the Gaborfilter is applied is binarized, the number of pixels having a pixel value of 1 is counted in each of the 16 divided regions. If the Gabor image 302 is a 256-color image corresponding to the edge strength as shown in the figure, the pixel value may be counted. Alternatively, an average value of each of the 16 divided areas is taken. The above processing is performed at each resolution. For example, when two resolutions of 1/4 and 1/10 are used, 2 (resolution) × 16 (region / sheet) × 4 (differential component of direction) = 128 dimensions for one input image This is a feature value.

なお、Ｇｌｏｂａｌ特徴量を算出する手法について説明したが、シーン構成を表現できるものであれば、どんな特徴量を使用しても構わない。 Although the method for calculating the global feature value has been described, any feature value may be used as long as the scene configuration can be expressed.

次に、学習手段１２２は、特徴量算出手段１２１により算出された特徴量と図１（ａ）〜（ｃ）に示したような各カテゴリを表現した教師データとを、ニューラルネットワーク（参考文献２：「パターン認識と学習の統計学」岩波書店 pp.23−31 2005年第5版）で学習させる。学習させる判別器は、ニューラルネットワークに限らず、多クラスＳＶＭ（参考文献３：「パターン認識と学習の統計学」岩波書店 pp.107−118 2005年第5版）など何を使っても構わない。 Next, the learning unit 122 uses the neural network (reference document 2) to display the feature amount calculated by the feature amount calculation unit 121 and the teacher data representing each category as shown in FIGS. : “Pattern recognition and learning statistics” Iwanami Shoten pp.23-31 2005 5th edition). The discriminator to be learned is not limited to a neural network, and any class such as multi-class SVM (Reference 3: “Statistics of Pattern Recognition and Learning” Iwanami Shoten pp.107-118 2005 5th Edition) may be used. .

次に、上述した手法により作成したシーン構成判別器を用いて消失点を算出する手法について説明する。 Next, a method for calculating the vanishing point using the scene configuration discriminator created by the above-described method will be described.

図３は、本実施形態による、入力画像から消失点を算出するための消失点算出装置の構成を示すブロック図である。また、図６は、本実施形態において、消失点算出方法を説明するための概念図である。 FIG. 3 is a block diagram illustrating a configuration of a vanishing point calculation device for calculating a vanishing point from an input image according to the present embodiment. FIG. 6 is a conceptual diagram for explaining a vanishing point calculation method in the present embodiment.

図３において、画像撮影手段２０は、図６（ａ）に示すような画像４０１を撮影する。シーン構成推定手段２１は、画像撮影手段２０により撮影された入力画像４０１から、図５（ｂ）に示すＧａｂｏｒ画像３０２と同様に特徴量を算出し、該特徴量を図２に示すシーン構成判別器１０に投入し、シーン構成判別器１０から出力されたカテゴリに応じて、図６（ｂ）に示すように、消失点のｘ座標の存在する候補領域である消失点存在候補領域４０２を推定する。 In FIG. 3, the image capturing means 20 captures an image 401 as shown in FIG. The scene configuration estimation unit 21 calculates the feature amount from the input image 401 captured by the image capturing unit 20 in the same manner as the Gabor image 302 illustrated in FIG. 5B, and the feature amount is determined as the scene configuration determination illustrated in FIG. The vanishing point existence candidate region 402, which is a candidate region where the x coordinate of the vanishing point exists, is estimated, as shown in FIG. 6B, in accordance with the category input to the device 10 and output from the scene configuration discriminator 10. To do.

消失点算出手段２２は、図６（ｃ）に示すように、消失点存在候補領域４０２を通過する直線のみを、直線検出等（参考文献４：「OpenCV」プログラミングブック毎日コミュニケーションズ 2007年pp．164-167）を施した画像４０５から直線群４０３として検出し、それらの直線群４０３からランダムに２直線取り出し、２直線の交点を求める。その交点を通過する直線を直線群４０３から算出し、その数を交点の得票数とする。上記の作業を、ランダムに２直線を選びながら繰り返す。そして、得票数の一番多い交点を消失点４０４とする。 As shown in FIG. 6C, the vanishing point calculating means 22 detects only a straight line passing through the vanishing point existence candidate region 402, etc. (Reference 4: “OpenCV” programming book Mainichi Communications 2007 pp.164). -167) is detected as a straight line group 403, two straight lines are randomly extracted from the straight line group 403, and an intersection of the two straight lines is obtained. A straight line passing through the intersection is calculated from the straight line group 403, and the number is obtained as the number of votes obtained at the intersection. The above operation is repeated while randomly selecting two straight lines. The intersection with the largest number of votes is defined as the vanishing point 404.

なお、消失点４０４は、このような手法で求めてもよいし、長さを信頼度とみなし、検出された直線群４０３のなかで一番長い直線と二番目に長い直線との交点を消失点と決めるなど、どのような手法を用いてもよい。 The vanishing point 404 may be obtained by such a method, or the length is regarded as the reliability, and the intersection of the longest straight line and the second longest straight line in the detected straight line group 403 is vanished. Any method may be used such as determining a point.

上述した本実施形態によれば、複数枚のある構造を持つシーンを撮影した画像から、予め消失点の存在領域などを元にシーン構成を学習させることにより、消失点を通る直線以外の直線が多数画像中に含まれていても、安定して消失点を算出することを可能にする。これにより、消失点を用いて一枚の非立体画像から３次元情報を推定することができる。擬似的に立体画像を生成することはもちろん、奥行きデータを用いて画像のセグメンテーションを行い、所望の領域を抽出することも可能となる。 According to the above-described embodiment, a straight line other than the straight line passing through the vanishing point is obtained by learning the scene configuration based on the area where the vanishing point exists in advance from an image obtained by photographing a scene having a plurality of structures. Even if it is included in a large number of images, the vanishing point can be calculated stably. Thereby, three-dimensional information can be estimated from one non-stereo image using a vanishing point. In addition to generating a pseudo stereoscopic image, it is possible to perform segmentation of an image using depth data and extract a desired region.

１０シーン構成判別器
１１学習用画像カテゴリ分類手段
１２シーン構成判別器作成手段
１２１特徴量算出手段
１２２学習手段
２０画像撮影手段
２１シーン構成推定手段
２２消失点算出手段 DESCRIPTION OF SYMBOLS 10 Scene structure discriminator 11 Learning image category classification | category means 12 Scene structure discriminator creation means 121 Feature-value calculation means 122 Learning means 20 Image photographing means 21 Scene structure estimation means 22 Vanishing point calculation means

Claims

A vanishing point calculating device that calculates a vanishing point from a single image of a building,
A category for each vanishing point position, and a learning image category classification means for classifying a plurality of captured images acquired for learning for each category,
Scene configuration discriminator creating means for extracting a feature quantity from a plurality of learning images classified into categories by the learning image category classification means, and creating a scene configuration discriminator by learning the characteristics of the feature quantity;
Image photographing means for photographing an input image composed of surfaces,
A feature amount is extracted from an input image photographed by the image photographing means, the feature amount is input to the scene configuration discriminator, and a vanishing point exists based on a category output from the scene configuration discriminator. A scene configuration estimating means for estimating a point presence candidate area;
A vanishing point calculating unit comprising: a vanishing point calculating unit that calculates a vanishing point based on the vanishing point existence candidate region estimated by the scene estimating unit.

The scene configuration estimation means includes
A feature amount calculating means for calculating a feature amount for each category from images of a plurality of resolutions;
The vanishing point calculation apparatus according to claim 1, further comprising: a learning unit that learns the feature amount calculated by the feature amount calculation unit and creates a discriminator.

The vanishing point calculating means is
Only a straight line passing through the vanishing point existence candidate region is detected as a line group from an image obtained by performing predetermined image processing on the input image, and two lines are randomly extracted from the line group, The process of obtaining the intersection of straight lines, calculating the straight line passing through the intersection from the group of straight lines, and setting the number as the number of votes of the intersection is repeated while randomly selecting two straight lines. The vanishing point is the intersection of the most
The vanishing point calculation apparatus according to claim 1 or 2, wherein

A vanishing point calculation method for calculating a vanishing point from a single image of a building,
A learning image category classification step for creating a category for each vanishing point position and classifying a plurality of captured images acquired for learning for each category;
A scene configuration discriminator creating step of extracting a feature amount from a plurality of learning images classified into categories in the learning image category classification step and creating a scene configuration discriminator by learning characteristics of the feature amount;
An image capturing step for capturing an input image composed of surfaces;
A feature amount is extracted from the input image captured in the image capturing step, the feature amount is input to the scene configuration discriminator, and a vanishing point exists based on a category output from the scene configuration discriminator. A scene configuration estimation step for estimating a point presence candidate region;
A vanishing point calculating step of calculating a vanishing point based on the vanishing point existence candidate region estimated in the scene estimating step.

The scene configuration estimation step includes:
A feature amount calculating step for calculating a feature amount for each category from images of a plurality of resolutions;
The vanishing point calculation method according to claim 4, further comprising a learning step of learning the feature amount calculated in the feature amount calculation step to create a discriminator.

The vanishing point calculating step includes:
Only a straight line passing through the vanishing point existence candidate region is detected as a line group from an image obtained by performing predetermined image processing on the input image, and two lines are randomly extracted from the line group, The process of obtaining the intersection of straight lines, calculating the straight line passing through the intersection from the group of straight lines, and setting the number as the number of votes of the intersection is repeated while randomly selecting two straight lines. The vanishing point is the intersection of the most
The vanishing point calculation method according to claim 4, wherein the vanishing point is calculated.

To the computer of the vanishing point calculation device that calculates the vanishing point from a single image of a building etc.,
Image category classification function for learning that creates a category for each vanishing point position and classifies a plurality of captured images acquired for learning for each category,
A scene configuration discriminator creating function for creating a scene configuration discriminating function by extracting feature amounts from a plurality of learning images classified into categories by the learning image category classification function and learning characteristics of the feature amounts;
Image capture function for capturing input images composed of
A feature amount is extracted from an input image captured by the image capturing function, the feature amount is input to the scene configuration determination function, and a vanishing point exists based on a category output from the scene configuration determination function. Scene composition estimation function for estimating point existence candidate areas,
A program for executing a vanishing point calculation function for calculating a vanishing point based on the vanishing point existence candidate region estimated by the scene estimation function.