JP7234021B2

JP7234021B2 - Image generation device, image generation system, image generation method, and program

Info

Publication number: JP7234021B2
Application number: JP2019079475A
Authority: JP
Inventors: 良徳大橋
Original assignee: Sony Interactive Entertainment Inc
Current assignee: Sony Interactive Entertainment Inc
Priority date: 2018-10-16
Filing date: 2019-04-18
Publication date: 2023-03-07
Anticipated expiration: 2039-04-18
Also published as: JP2020064592A

Description

この発明は、画像を生成する装置、システムおよび方法に関する。 The present invention relates to apparatus, systems and methods for generating images.

ゲーム機に接続されたヘッドマウントディスプレイを頭部に装着して、ヘッドマウントディスプレイに表示された画面を見ながら、コントローラなどを操作してゲームプレイすることが行われている。ヘッドマウントディスプレイを装着すると、ヘッドマウントディスプレイに表示される映像以外はユーザは見ないため、映像世界への没入感が高まり、ゲームのエンタテインメント性を一層高める効果がある。また、ヘッドマウントディスプレイに仮想現実（ＶＲ(Virtual Reality)）の映像を表示させ、ヘッドマウントディスプレイを装着したユーザが頭部を回転させると、３６０度見渡せる全周囲の仮想空間が表示されるようにすると、さらに映像への没入感が高まり、ゲームなどのアプリケーションの操作性も向上する。 A head-mounted display connected to a game machine is worn on the head, and a game is played by operating a controller or the like while viewing a screen displayed on the head-mounted display. When the user wears the head-mounted display, the user does not see anything other than the image displayed on the head-mounted display, so that the sense of immersion in the image world is enhanced, which has the effect of further enhancing the entertainment of the game. In addition, a virtual reality (VR) image is displayed on a head-mounted display, and when the user wearing the head-mounted display rotates his or her head, a 360-degree panoramic view of the virtual space is displayed. As a result, the feeling of being immersed in the video is further enhanced, and the operability of applications such as games is also improved.

また、非透過型ヘッドマウントディスプレイを装着したユーザは外界を直接見ることができなくなるが、ヘッドマウントディスプレイに搭載されたカメラによって外界の映像を撮影してディスプレイパネルに表示することのできるビデオ透過（ビデオシースルー）型ヘッドマウントディスプレイもある。ビデオ透過型ヘッドマウントディスプレイでは、カメラで撮影される外界の映像にコンピュータグラフィックス（ＣＧ(Computer Graphics)）によって生成された仮想世界のオブジェクトを重畳させることで拡張現実（ＡＲ(Augmented Reality)）の映像を生成して表示することもできる。拡張現実の映像は、現実世界から切り離された仮想現実とは違って、現実世界が仮想オブジェクトで拡張されたものであり、ユーザは現実世界とのつながりを意識しつつ、仮想世界を体験することができる。 In addition, although the user wearing the non-transmissive head-mounted display cannot directly see the outside world, the camera mounted on the head-mounted display can capture images of the outside world and display them on the display panel. There is also a video see-through) type head-mounted display. In a video transmissive head-mounted display, augmented reality (AR) is realized by superimposing virtual world objects generated by computer graphics (CG) on images of the outside world captured by a camera. You can also generate and display images. Unlike virtual reality, which is separated from the real world, augmented reality video is an extension of the real world with virtual objects, allowing users to experience the virtual world while being aware of the connection with the real world. can be done.

カメラ画像にＣＧによって生成された仮想オブジェクトを重畳させて拡張現実の映像を生成してヘッドマウントディスプレイに表示する場合、画像に対するポストプロセスの影響によって仮想オブジェクトの境界にエイリアシングが発生し、仮想世界と現実世界の境界が目立ち、一体感のあるＡＲ映像にならないことがある。また、仮想オブジェクトが現実空間に落とす影や現実空間への仮想オブジェクトの映り込みをＡＲ映像に反映させなければ、仮想世界と現実世界の一体感が得られず、仮想オブジェクトが現実世界の中で浮いているように見えてしまう。 When an augmented reality video is generated by superimposing a virtual object generated by CG on a camera image and displayed on a head-mounted display, aliasing occurs at the boundary of the virtual object due to the influence of post-processing on the image. The boundaries of the real world are conspicuous, and the AR image may not have a sense of unity. Also, unless the shadows cast by the virtual object on the real space and the reflection of the virtual object in the real space are reflected in the AR image, a sense of unity between the virtual world and the real world cannot be obtained, and the virtual object cannot be seen in the real world. It looks like it's floating.

本発明はこうした課題に鑑みてなされたものであり、その目的は、拡張現実の映像の品質を向上させることのできる画像生成装置、画像生成システム、および画像生成方法を提供することにある。 The present invention has been made in view of these problems, and its object is to provide an image generation device, an image generation system, and an image generation method that can improve the quality of augmented reality video.

上記課題を解決するために、本発明のある態様の画像生成装置は、仮想空間のオブジェクトおよび現実空間のオブジェクトをレンダリングするとともに、前記現実空間に対する前記仮想空間の光に関する表現をレンダリングしてコンピュータグラフィックス画像を生成するレンダリング部と、現実空間の撮影画像に前記コンピュータグラフィックス画像を重畳して仮の重畳画像を生成する重畳部と、前記現実空間の撮影画像の奥行き情報にもとづいて前記コンピュータグラフィックス画像をクロマキー処理してクロマキー画像を生成するクロマキー生成部と、前記仮の重畳画像に対して前記クロマキー画像によってマスクをかけることにより、前記現実空間の撮影画像に重畳して拡張現実画像を生成するために利用される合成クロマキー画像を生成する合成部とを含む。前記クロマキー生成部は、前記仮想空間のオブジェクトがレンダリングされていない現実空間の領域をクロマキー領域とする一方、前記仮想空間の光に関する表現が存在する現実空間の領域はクロマキー領域とはしない。 In order to solve the above-described problems, an image generating apparatus according to one aspect of the present invention renders an object in a virtual space and an object in a real space, and renders a representation of light in the virtual space with respect to the real space to generate a computer graphic image. a rendering unit that generates a physical image; a superimposing unit that generates a temporary superimposed image by superimposing the computer graphics image on the captured image of the real space; and the computer graphics based on depth information of the captured image of the real space. and a chromakey generation unit that generates a chromakey image by performing chromakey processing on the space image, and a chromakey generation unit that generates an augmented reality image by superimposing the virtual superimposed image on the captured image of the real space by masking the temporary superimposed image with the chromakey image. a compositing unit that generates a composite chromakey image that is used to The chromakey generation unit treats a real space area in which the object in the virtual space is not rendered as a chromakey area, but does not treat a real space area in which an expression related to light in the virtual space exists as a chromakey area.

本発明の別の態様は、画像生成システムである。この画像生成システムは、ヘッドマウントディスプレイと画像生成装置を含む画像生成システムであって、前記画像生成装置は、仮想空間のオブジェクトおよび現実空間のオブジェクトをレンダリングするとともに、前記現実空間に対する前記仮想空間の光に関する表現をレンダリングしてコンピュータグラフィックス画像を生成するレンダリング部と、前記ヘッドマウントディスプレイから伝送される現実空間の撮影画像に前記コンピュータグラフィックス画像を重畳して仮の重畳画像を生成する第１の重畳部と、前記ヘッドマウントディスプレイから伝送される前記現実空間の撮影画像の奥行き情報にもとづいて前記コンピュータグラフィックス画像をクロマキー処理してクロマキー画像を生成するクロマキー生成部と、前記仮の重畳画像に対して前記クロマキー画像によってマスクをかけることにより、前記現実空間の撮影画像に重畳して拡張現実画像を生成するために利用される合成クロマキー画像を生成する合成部とを含む。前記ヘッドマウントディスプレイは、前記現実空間の撮影画像に、前記画像生成装置から伝送される前記合成クロマキー画像を合成することにより、拡張現実画像を生成する第２の重畳部とを含む。前記クロマキー生成部は、前記仮想空間のオブジェクトがレンダリングされていない現実空間の領域をクロマキー領域とする一方、前記仮想空間の光に関する表現が存在する現実空間の領域はクロマキー領域とはしない。 Another aspect of the invention is an image generation system. This image generation system includes a head-mounted display and an image generation device. a rendering unit that renders an expression related to light to generate a computer graphics image; a chromakey generator for generating a chromakey image by chromakey processing the computer graphics image based on depth information of the captured image of the real space transmitted from the head-mounted display; and the temporary superimposed image. a synthesizing unit that generates a synthesized chromakey image that is superimposed on the captured image of the physical space and used to generate an augmented reality image by masking with the chromakey image. The head-mounted display includes a second superimposing unit that generates an augmented reality image by synthesizing the captured image of the physical space with the synthesized chromakey image transmitted from the image generating device. The chromakey generation unit treats a real space area in which the object in the virtual space is not rendered as a chromakey area, but does not treat a real space area in which an expression related to light in the virtual space exists as a chromakey area.

本発明のさらに別の態様は、画像生成方法である。この方法は、仮想空間のオブジェクトおよび現実空間のオブジェクトをレンダリングするとともに、前記現実空間に対する前記仮想空間の光に関する表現をレンダリングしてコンピュータグラフィックス画像を生成するレンダリングステップと、現実空間の撮影画像に前記コンピュータグラフィックス画像を重畳して仮の重畳画像を生成する重畳ステップと、前記現実空間の撮影画像の奥行き情報にもとづいて前記コンピュータグラフィックス画像をクロマキー処理してクロマキー画像を生成するクロマキー生成ステップと、前記仮の重畳画像に対して前記クロマキー画像によってマスクをかけることにより、前記現実空間の撮影画像に重畳して拡張現実画像を生成するために利用される合成クロマキー画像を生成する合成ステップとを含む。前記クロマキー生成ステップは、前記仮想空間のオブジェクトがレンダリングされていない現実空間の領域をクロマキー領域とする一方、前記仮想空間の光に関する表現が存在する現実空間の領域はクロマキー領域とはしない。 Yet another aspect of the invention is an image generation method. This method comprises a rendering step of rendering an object in a virtual space and an object in a real space, and rendering a representation of light in the virtual space with respect to the real space to generate a computer graphics image; A superimposing step of superimposing the computer graphics image to generate a temporary superimposed image, and a chromakey generating step of generating a chromakey image by performing chromakey processing on the computer graphics image based on depth information of the captured image of the physical space. and a synthesizing step of generating a synthesized chromakey image that is superimposed on the captured image of the physical space and used to generate an augmented reality image by masking the temporary superimposed image with the chromakey image. including. In the chromakey generating step, a real space area in which an object in the virtual space is not rendered is treated as a chromakey area, and a real space area in which light-related representation exists in the virtual space is not treated as a chromakey area.

なお、以上の構成要素の任意の組合せ、本発明の表現を方法、装置、システム、コンピュータプログラム、データ構造、記録媒体などの間で変換したものもまた、本発明の態様として有効である。 Any combination of the above constituent elements, and expressions of the present invention converted between methods, devices, systems, computer programs, data structures, recording media, etc. are also effective as aspects of the present invention.

本発明によれば、拡張現実の映像の品質を向上させることができる。 According to the present invention, it is possible to improve the quality of augmented reality video.

ヘッドマウントディスプレイの外観図である。1 is an external view of a head mounted display; FIG. 本実施の形態に係る画像生成システムの構成図である。1 is a configuration diagram of an image generation system according to an embodiment; FIG. 図１のヘッドマウントディスプレイに搭載されたカメラにより撮影されるカメラ画像の例を説明する図である。2 is a diagram illustrating an example of a camera image captured by a camera mounted on the head mounted display of FIG. 1; FIG. 図３のカメラ画像にＣＧによる仮想オブジェクトを重畳した拡張現実画像を説明する図である。4 is a diagram for explaining an augmented reality image in which a CG virtual object is superimposed on the camera image of FIG. 3; FIG. 図４の拡張現実画像に対してユーザが仮想オブジェクトに手を伸ばしている様子を示す図である。FIG. 5 is a diagram showing a state in which a user is reaching out to a virtual object with respect to the augmented reality image of FIG. 4; クロマキー合成に利用されるＣＧ画像を説明する図である。FIG. 4 is a diagram for explaining a CG image used for chromakey synthesis; 前提技術に係るヘッドマウントディスプレイの機能構成図である。1 is a functional configuration diagram of a head-mounted display according to a base technology; FIG. 前提技術に係る画像生成装置の機能構成図である。1 is a functional configuration diagram of an image generation device according to a base technology; FIG. カメラ画像にＣＧ画像を重畳して拡張現実画像を生成するための前提技術に係る画像生成システムの構成を説明する図である。1 is a diagram illustrating the configuration of an image generation system according to a base technology for superimposing a CG image on a camera image to generate an augmented reality image; FIG. 第１の実施の形態に係る画像生成装置の機能構成図である。1 is a functional configuration diagram of an image generation device according to a first embodiment; FIG. カメラ画像にＣＧ画像を重畳して拡張現実画像を生成するための第１の実施の形態に係る画像生成システムの構成を説明する図である。1 is a diagram illustrating the configuration of an image generation system according to a first embodiment for superimposing a CG image on a camera image to generate an augmented reality image; FIG. 第１の実施の形態に係る画像生成システムによってカメラ画像にＣＧ画像を重畳した拡張現実画像を説明する図である。FIG. 4 is a diagram illustrating an augmented reality image in which a CG image is superimposed on a camera image by the image generation system according to the first embodiment; 第１の実施の形態に係る画像生成システムによって利用される合成ＣＧクロマキー画像を説明する図である。FIG. 4 is a diagram for explaining a synthesized CG chromakey image used by the image generation system according to the first embodiment; FIG. 現実空間のポリゴンメッシュを変形して壁に穴を空ける例を説明する図である。FIG. 10 is a diagram illustrating an example of deforming a polygon mesh in a real space to make a hole in a wall; 現実空間の壁の穴に仮想オブジェクトが描画された例を説明する図である。FIG. 10 is a diagram illustrating an example in which a virtual object is drawn in a hole in a wall in real space; 第２の実施の形態に係る画像生成装置の機能構成図である。FIG. 10 is a functional configuration diagram of an image generating device according to a second embodiment; FIG. カメラ画像にＣＧ画像を重畳して拡張現実画像を生成するための第２の実施の形態に係る画像生成システムの構成を説明する図である。FIG. 10 is a diagram illustrating the configuration of an image generation system according to a second embodiment for superimposing a CG image on a camera image to generate an augmented reality image;

図１は、ヘッドマウントディスプレイ１００の外観図である。ヘッドマウントディスプレイ１００は、ユーザの頭部に装着してディスプレイに表示される静止画や動画などを鑑賞し、ヘッドホンから出力される音声や音楽などを聴くための表示装置である。 FIG. 1 is an external view of the head mounted display 100. FIG. The head-mounted display 100 is a display device worn on the user's head for viewing still images and moving images displayed on the display and for listening to sounds and music output from headphones.

ヘッドマウントディスプレイ１００に内蔵または外付けされたジャイロセンサや加速度センサなどによりヘッドマウントディスプレイ１００を装着したユーザの頭部の位置情報と頭部の回転角や傾きなどの姿勢（orientation）情報を計測することができる。 A gyro sensor or an acceleration sensor built in or external to the head mounted display 100 measures the position information of the head of the user wearing the head mounted display 100 and the orientation information such as the rotation angle and inclination of the head. be able to.

ヘッドマウントディスプレイ１００にはカメラユニットが搭載されており、ユーザがヘッドマウントディスプレイ１００を装着している間、外界を撮影することができる。 A camera unit is mounted on the head mounted display 100, and the user can photograph the outside world while wearing the head mounted display 100. FIG.

ヘッドマウントディスプレイ１００は、「ウェアラブルディスプレイ」の一例である。ここでは、ヘッドマウントディスプレイ１００に表示される画像の生成方法を説明するが、本実施の形態の画像生成方法は、狭義のヘッドマウントディスプレイ１００に限らず、めがね、めがね型ディスプレイ、めがね型カメラ、ヘッドフォン、ヘッドセット（マイクつきヘッドフォン）、イヤホン、イヤリング、耳かけカメラ、帽子、カメラつき帽子、ヘアバンドなどを装着した場合にも適用することができる。 Head-mounted display 100 is an example of a "wearable display." Here, a method of generating an image to be displayed on the head mounted display 100 will be described, but the image generating method of the present embodiment is not limited to the head mounted display 100 in a narrow sense. It can also be applied when wearing headphones, headsets (headphones with a microphone), earphones, earrings, ear cameras, hats, hats with cameras, hair bands, and the like.

図２は、本実施の形態に係る画像生成システムの構成図である。ヘッドマウントディスプレイ１００は、一例として、映像・音声をデジタル信号で伝送する通信インタフェースの標準規格であるＨＤＭＩ（登録商標）（High-Definition Multimedia Interface）などのインタフェース３００で画像生成装置２００に接続される。 FIG. 2 is a configuration diagram of an image generation system according to this embodiment. The head mounted display 100 is connected to the image generation device 200 via an interface 300 such as HDMI (registered trademark) (High-Definition Multimedia Interface), which is a communication interface standard for transmitting video and audio as digital signals. .

画像生成装置２００は、ヘッドマウントディスプレイ１００の現在の位置・姿勢情報から、映像の生成から表示までの遅延を考慮してヘッドマウントディスプレイ１００の位置・姿勢情報を予測し、ヘッドマウントディスプレイ１００の予測位置・姿勢情報を前提としてヘッドマウントディスプレイ１００に表示されるべき画像を描画し、ヘッドマウントディスプレイ１００に伝送する。 The image generation device 200 predicts the position/orientation information of the head-mounted display 100 from the current position/orientation information of the head-mounted display 100 in consideration of the delay from image generation to display. Based on the position/orientation information, an image to be displayed on the head mounted display 100 is drawn and transmitted to the head mounted display 100 .

画像生成装置２００の一例はゲーム機である。画像生成装置２００は、さらにネットワークを介してサーバに接続されてもよい。その場合、サーバは、複数のユーザがネットワークを介して参加できるゲームなどのオンラインアプリケーションを画像生成装置２００に提供してもよい。ヘッドマウントディスプレイ１００は、画像生成装置２００の代わりに、コンピュータや携帯端末に接続されてもよい。 An example of the image generation device 200 is a game machine. The image generation device 200 may also be connected to a server via a network. In that case, the server may provide the image generating device 200 with an online application such as a game in which multiple users can participate via a network. The head mounted display 100 may be connected to a computer or mobile terminal instead of the image generation device 200. FIG.

図３～図６を参照してカメラ画像にＣＧによる仮想オブジェクトを重畳した拡張現実画像について説明する。 An augmented reality image in which a CG virtual object is superimposed on a camera image will be described with reference to FIGS. 3 to 6. FIG.

図３は、ヘッドマウントディスプレイ１００に搭載されたカメラにより撮影されるカメラ画像の例を説明する図である。このカメラ画像は部屋を背景としてテーブルとその上にあるバスケット４００を撮影したものである。テーブルの表面には花柄の模様が付けられている。カメラ画像において背景はほとんど変化しないが、テーブルの上にあるバスケット４００はユーザが手を伸ばして動かすこともある。 FIG. 3 is a diagram illustrating an example of a camera image taken by a camera mounted on head mounted display 100. As shown in FIG. This camera image captures the table and the basket 400 on it with the room as the background. The surface of the table has a floral pattern. The background hardly changes in the camera image, but the basket 400 on the table may be reached and moved by the user.

図４は、図３のカメラ画像にＣＧによる仮想オブジェクトを重畳した拡張現実画像を説明する図である。テーブルの上にある現実のオブジェクトであるバスケット４００は、ＣＧによって生成された仮想オブジェクトであるティーポット４１０によって置き換えられ、カメラ画像にティーポット４１０が重畳される。これにより、ユーザは現実空間に仮想オブジェクトが描かれた拡張現実画像をヘッドマウントディスプレイ１００で見ることができる。 FIG. 4 is a diagram for explaining an augmented reality image in which a CG virtual object is superimposed on the camera image of FIG. A basket 400, which is a real object on the table, is replaced by a teapot 410, which is a virtual object generated by CG, and the teapot 410 is superimposed on the camera image. Thereby, the user can view an augmented reality image in which a virtual object is drawn in the real space on the head-mounted display 100 .

図５は、図４の拡張現実画像に対してユーザが仮想オブジェクトに手を伸ばしている様子を示す図である。拡張現実画像をヘッドマウントディスプレイ１００で見ているユーザが仮想オブジェクトであるティーポット４１０を触ろうとすると、ユーザの手がヘッドマウントディスプレイ１００に搭載されたカメラで撮影されるため、カメラ画像には手４２０が写り込む。手４２０が写り込んだカメラ画像に仮想オブジェクトであるティーポット４１０が重畳される。このとき、手４２０の上にティーポット４１０が重畳されて手４２０が見えなくなるなど不自然な拡張現実画像にならないように、ティーポット４１０と手４２０の位置関係を奥行き情報を用いて正しく判定する必要がある。 FIG. 5 is a diagram showing a state in which a user is reaching out to a virtual object with respect to the augmented reality image of FIG. When the user viewing the augmented reality image on the head-mounted display 100 tries to touch the virtual object teapot 410 , the user's hand is captured by the camera mounted on the head-mounted display 100 . is reflected. A teapot 410, which is a virtual object, is superimposed on the camera image in which the hand 420 is captured. At this time, it is necessary to correctly determine the positional relationship between the teapot 410 and the hand 420 using depth information so that the teapot 410 is superimposed on the hand 420 and the hand 420 is not visible, resulting in an unnatural augmented reality image. be.

そこで、カメラ画像の奥行き（デプス）情報を用いて、カメラ画像に写り込んだ物体と仮想オブジェクトの位置関係を判定し、奥行きを正しく反映した描画を行う。部屋の背景や既に存在がわかっているバスケット４００についてはあらかじめ奥行きがわかっているので、仮想オブジェクトとの位置関係はあらかじめ判定しておくことができるが、ユーザが手や足を伸ばしたときや、ユーザ以外の動体（たとえば、他人や犬、猫など）が視野に入ってくる場合などは、あらかじめ奥行きがわかっていないので、カメラ画像のデプス情報から奥行きをその都度判定する必要がある。 Therefore, the depth information of the camera image is used to determine the positional relationship between the object captured in the camera image and the virtual object, and the drawing that accurately reflects the depth is performed. Since the depth of the background of the room and the basket 400 whose existence is already known is known in advance, the positional relationship with the virtual object can be determined in advance. When a moving object other than the user (for example, another person, a dog, a cat, etc.) comes into the field of view, the depth is not known in advance, so the depth needs to be determined each time from the depth information of the camera image.

一般に、ＣＧ画像をカメラ画像に重畳する際、ＣＧ画像において背景など描画されない領域を特定の一色に塗りつぶしたクロマキー画像を作成し、クロマキー合成に利用する。クロマキーとして特定された色の領域（「クロマキー領域」と呼ぶ）は透明になるため、クロマキー画像をカメラ画像に重畳すると、クロマキー領域にはカメラ画像が表示される。 In general, when a CG image is superimposed on a camera image, a chromakey image is created by painting an area such as the background of the CG image that is not drawn with a specific color, and is used for chromakey synthesis. A region of a color specified as chromakey (referred to as a “chromakey region”) becomes transparent, so when the chromakey image is superimposed on the camera image, the camera image is displayed in the chromakey region.

図６は、クロマキー合成に利用されるＣＧ画像を説明する図である。図５の状態において、背景がクロマキーの特定色（たとえば赤）に塗りつぶされている。また、カメラ画像に写り込んだ手４２０がティーポット４１０よりも手前にあることから、ティーポット４１０の領域の内、手４２０によって隠れる領域もクロマキーの特定色に塗りつぶされている。このクロマキー画像をカメラ画像に重畳すると、クロマキーの特定色の部分は透明であるため、カメラ画像が残り、図５の拡張現実画像が得られる。 FIG. 6 is a diagram for explaining a CG image used for chromakey synthesis. In the state of FIG. 5, the background is filled with a specific chromakey color (eg, red). Further, since the hand 420 captured in the camera image is located in front of the teapot 410, the area of the teapot 410 hidden by the hand 420 is also filled with the specific chromakey color. When this chromakey image is superimposed on the camera image, the camera image remains because the specific color portion of the chromakey is transparent, and the augmented reality image shown in FIG. 5 is obtained.

図７は、前提技術に係るヘッドマウントディスプレイ１００の機能構成図である。 FIG. 7 is a functional configuration diagram of the head mounted display 100 according to the base technology.

制御部１０は、画像信号、センサ信号などの信号や、命令やデータを処理して出力するメインプロセッサである。入力インタフェース２０は、ユーザからの操作信号や設定信号を受け付け、制御部１０に供給する。出力インタフェース３０は、制御部１０から画像信号を受け取り、ディスプレイパネル３２に表示する。 The control unit 10 is a main processor that processes and outputs signals such as image signals and sensor signals, commands and data. The input interface 20 receives operation signals and setting signals from the user and supplies them to the control unit 10 . The output interface 30 receives image signals from the control unit 10 and displays them on the display panel 32 .

通信制御部４０は、ネットワークアダプタ４２またはアンテナ４４を介して、有線または無線通信により、制御部１０から入力されるデータを外部に送信する。通信制御部４０は、また、ネットワークアダプタ４２またはアンテナ４４を介して、有線または無線通信により、外部からデータを受信し、制御部１０に出力する。 The communication control unit 40 transmits data input from the control unit 10 to the outside by wired or wireless communication via the network adapter 42 or the antenna 44 . The communication control unit 40 also receives data from the outside by wired or wireless communication via the network adapter 42 or the antenna 44 and outputs the data to the control unit 10 .

記憶部５０は、制御部１０が処理するデータやパラメータ、操作信号などを一時的に記憶する。 The storage unit 50 temporarily stores data, parameters, operation signals, and the like processed by the control unit 10 .

姿勢センサ６４は、ヘッドマウントディスプレイ１００の位置情報と、ヘッドマウントディスプレイ１００の回転角や傾きなどの姿勢情報を検出する。姿勢センサ６４は、ジャイロセンサ、加速度センサ、角加速度センサなどを適宜組み合わせて実現される。３軸地磁気センサ、３軸加速度センサおよび３軸ジャイロ（角速度）センサの少なくとも１つ以上を組み合わせたモーションセンサを用いて、ユーザの頭部の前後、左右、上下の動きを検出してもよい。 The orientation sensor 64 detects position information of the head mounted display 100 and orientation information such as the rotation angle and inclination of the head mounted display 100 . The attitude sensor 64 is realized by appropriately combining a gyro sensor, an acceleration sensor, an angular acceleration sensor, and the like. A motion sensor that combines at least one of a 3-axis geomagnetic sensor, a 3-axis acceleration sensor, and a 3-axis gyro (angular velocity) sensor may be used to detect forward/backward, left/right, and up/down movements of the user's head.

外部入出力端子インタフェース７０は、ＵＳＢ（Universal Serial Bus）コントローラなどの周辺機器を接続するためのインタフェースである。外部メモリ７２は、フラッシュメモリなどの外部メモリである。 The external input/output terminal interface 70 is an interface for connecting peripheral devices such as a USB (Universal Serial Bus) controller. The external memory 72 is an external memory such as flash memory.

カメラユニット８０は、レンズ、イメージセンサ、測距センサなど撮影に必要な構成を含み、撮影された外界の映像と奥行き情報を制御部１０に供給する。制御部１０は、カメラユニット８０のフォーカスやズームなどを制御する。 The camera unit 80 includes components necessary for photographing, such as a lens, an image sensor, and a distance measuring sensor, and supplies the photographed image of the outside world and depth information to the control unit 10 . The control unit 10 controls focus, zoom, etc. of the camera unit 80 .

画像信号処理部８２は、カメラユニット８０により撮影されたＲａｗ画像に対してＲＧＢ変換（デモザイク処理）、ホワイトバランス、色補正、ノイズリダクションなどの画像信号処理（ＩＳＰ(Image Signal Processing)）を施し、さらにカメラユニット８０の光学系による歪みなどを取り除く歪み補正処理を施す。画像信号処理部８２は画像信号処理および歪み補正処理が施されたカメラ画像を制御部１０に供給する。 The image signal processing unit 82 performs image signal processing (ISP (Image Signal Processing)) such as RGB conversion (demosaicing), white balance, color correction, and noise reduction on the Raw image captured by the camera unit 80. Furthermore, distortion correction processing is performed to remove distortion due to the optical system of the camera unit 80. FIG. The image signal processing unit 82 supplies the camera image that has undergone image signal processing and distortion correction processing to the control unit 10 .

リプロジェクション部８４は、姿勢センサ６４が検出したヘッドマウントディスプレイ１００の最新の位置・姿勢情報にもとづき、カメラ画像に対してリプロジェクション処理を施し、ヘッドマウントディスプレイ１００の最新の視点位置・視線方向から見える画像に変換する。 The reprojection unit 84 performs reprojection processing on the camera image based on the latest position/orientation information of the head mounted display 100 detected by the orientation sensor 64, and performs reprojection processing on the camera image from the latest viewpoint position/line-of-sight direction of the head mounted display 100. Convert to visible image.

歪み処理部８６は、リプロジェクション処理が施されたカメラ画像に対してヘッドマウントディスプレイ１００の光学系で生じる歪みに合わせて画像を変形（distortion）させて歪ませる処理を施し、歪み処理が施されたカメラ画像を制御部１０に供給する。 The distortion processing unit 86 distorts the reprojection-processed camera image in accordance with the distortion generated in the optical system of the head mounted display 100 to distort the image. The captured camera image is supplied to the control unit 10 .

ＡＲ重畳部８８は、歪み処理が施されたカメラ画像に画像生成装置２００により生成されたＣＧ画像を重畳することで拡張現実画像を生成し、制御部１０に供給する。 The AR superimposing unit 88 generates an augmented reality image by superimposing a CG image generated by the image generating device 200 on the camera image subjected to distortion processing, and supplies the augmented reality image to the control unit 10 .

ＨＤＭＩ送受信部９０は、ＨＤＭＩにしたがって映像・音声のデジタル信号を画像生成装置２００との間で送受信する。ＨＤＭＩ送受信部９０は、画像信号処理部８２により画像信号処理および歪み補正処理が施されたＲＧＢ画像と奥行き情報を制御部１０から受け取り、ＨＤＭＩ伝送路で画像生成装置２００に送信する。ＨＤＭＩ送受信部９０は、画像生成装置２００により生成された画像をＨＤＭＩ伝送路で画像生成装置２００から受信し、制御部１０に供給する。 The HDMI transmitting/receiving unit 90 transmits/receives video/audio digital signals to/from the image generation device 200 according to HDMI. The HDMI transmission/reception unit 90 receives the RGB image and the depth information subjected to image signal processing and distortion correction processing by the image signal processing unit 82 from the control unit 10, and transmits them to the image generation device 200 through the HDMI transmission line. The HDMI transmission/reception unit 90 receives the image generated by the image generation device 200 from the image generation device 200 via the HDMI transmission line, and supplies the image to the control unit 10 .

制御部１０は、画像やテキストデータを出力インタフェース３０に供給してディスプレイパネル３２に表示させたり、通信制御部４０に供給して外部に送信させることができる。 The control unit 10 can supply image and text data to the output interface 30 for display on the display panel 32, and can supply the data to the communication control unit 40 for external transmission.

姿勢センサ６４が検出したヘッドマウントディスプレイ１００の現在の位置・姿勢情報は、通信制御部４０または外部入出力端子インタフェース７０を介して画像生成装置２００に通知される。あるいは、ＨＤＭＩ送受信部９０がヘッドマウントディスプレイ１００の現在の位置・姿勢情報を画像生成装置２００に送信してもよい。 The current position/orientation information of the head mounted display 100 detected by the orientation sensor 64 is notified to the image generation device 200 via the communication control section 40 or the external input/output terminal interface 70 . Alternatively, the HDMI transmitting/receiving section 90 may transmit the current position/orientation information of the head mounted display 100 to the image generation device 200 .

図８は、前提技術に係る画像生成装置２００の機能構成図である。同図は機能に着目したブロック図を描いており、これらの機能ブロックはハードウエアのみ、ソフトウエアのみ、またはそれらの組合せによっていろいろな形で実現することができる。 FIG. 8 is a functional configuration diagram of the image generation device 200 according to the base technology. The figure is a block diagram focusing on functions, and these functional blocks can be realized in various forms by hardware only, software only, or a combination thereof.

画像生成装置２００の少なくとも一部の機能をヘッドマウントディスプレイ１００に実装してもよい。あるいは、画像生成装置２００の少なくとも一部の機能を、ネットワークを介して画像生成装置２００に接続されたサーバに実装してもよい。 At least part of the functions of the image generation device 200 may be implemented in the head mounted display 100 . Alternatively, at least part of the functions of image generation device 200 may be implemented in a server connected to image generation device 200 via a network.

位置・姿勢取得部２１０は、ヘッドマウントディスプレイ１００の現在の位置・姿勢情報をヘッドマウントディスプレイ１００から取得する。 The position/posture acquisition unit 210 acquires the current position/posture information of the head mounted display 100 from the head mounted display 100 .

視点・視線設定部２２０は、位置・姿勢取得部２１０により取得されたヘッドマウントディスプレイ１００の位置・姿勢情報を用いて、ユーザの視点位置および視線方向を設定する。 The viewpoint/line-of-sight setting unit 220 uses the position/orientation information of the head mounted display 100 acquired by the position/orientation acquisition unit 210 to set the user's viewpoint position and line-of-sight direction.

ＨＤＭＩ送受信部２８０は、ヘッドマウントディスプレイ１００からカメラユニット８０により撮影された現実空間の映像の奥行き情報を受信し、デプス取得部２５０に供給する。 The HDMI transmitting/receiving unit 280 receives the depth information of the video of the real space captured by the camera unit 80 from the head mounted display 100 and supplies the depth information to the depth acquiring unit 250 .

画像生成部２３０は、画像記憶部２６０からコンピュータグラフィックスの生成に必要なデータを読み出し、仮想空間のオブジェクトをレンダリングしてＣＧ画像を生成し、デプス取得部２５０から提供される現実空間のカメラ画像の奥行き情報にもとづいてＣＧ画像からクロマキー画像を生成し、画像記憶部２６０に出力する。 The image generation unit 230 reads out data necessary for generating computer graphics from the image storage unit 260, renders the objects in the virtual space to generate a CG image, and converts the camera image in the real space provided by the depth acquisition unit 250 into a CG image. A chromakey image is generated from the CG image based on the depth information, and is output to the image storage unit 260 .

画像生成部２３０は、レンダリング部２３２と、クロマキー生成部２３５と、ポストプロセス部２３６と、リプロジェクション部２４０と、歪み処理部２４２とを含む。 The image generation unit 230 includes a rendering unit 232 , a chromakey generation unit 235 , a post-processing unit 236 , a reprojection unit 240 and a distortion processing unit 242 .

レンダリング部２３２は、視点・視線設定部２２０によって設定されたユーザの視点位置および視線方向にしたがって、ヘッドマウントディスプレイ１００を装着したユーザの視点位置から視線方向に見える仮想空間のオブジェクトをレンダリングしてＣＧ画像を生成し、クロマキー生成部２３５に与える。 The rendering unit 232 renders an object in the virtual space seen in the line-of-sight direction from the viewpoint position of the user wearing the head-mounted display 100 according to the user's line-of-sight position and line-of-sight direction set by the viewpoint/line-of-sight setting unit 220, and performs CG rendering. An image is generated and supplied to the chromakey generation unit 235 .

クロマキー生成部２３５は、デプス取得部２５０から与えられたカメラ画像の奥行き情報にもとづいてＣＧ画像からクロマキー画像を生成する。具体的には、現実空間のオブジェクトと仮想空間のオブジェクトの位置関係を判定し、ＣＧ画像において仮想オブジェクトの背景や仮想オブジェクトよりも手前にある現実空間のオブジェクトの部分を特定の１色（たとえば赤色）に塗りつぶしたクロマキー画像（「ＣＧクロマキー画像」と呼ぶ）を生成する。 The chromakey generation unit 235 generates a chromakey image from the CG image based on the depth information of the camera image given from the depth acquisition unit 250 . Specifically, the positional relationship between the object in the real space and the object in the virtual space is determined, and the background of the virtual object and the part of the object in the real space that is in front of the virtual object in the CG image are colored in a specific color (for example, red). ) to generate a chromakey image (referred to as a “CG chromakey image”).

ポストプロセス部２３６は、ＣＧクロマキー画像に対して、被写界深度調整、トーンマッピング、アンチエイリアシングなどのポストプロセスを施し、ＣＧクロマキー画像が自然で滑らかに見えるように後処理する。 A post-processing unit 236 performs post-processing such as depth-of-field adjustment, tone mapping, and anti-aliasing on the CG chroma-key image, and performs post-processing so that the CG chroma-key image looks natural and smooth.

リプロジェクション部２４０は、位置・姿勢取得部２１０からヘッドマウントディスプレイ１００の最新の位置・姿勢情報を受け取り、ポストプロセスが施されたＣＧクロマキー画像に対してリプロジェクション処理を施し、ヘッドマウントディスプレイ１００の最新の視点位置・視線方向から見える画像に変換する。 The reprojection unit 240 receives the latest position/orientation information of the head mounted display 100 from the position/orientation acquisition unit 210 , performs reprojection processing on the post-processed CG chromakey image, and performs reprojection processing on the head mounted display 100 . Convert to an image that can be seen from the latest viewpoint position and line-of-sight direction.

ここで、リプロジェクションについて説明する。ヘッドマウントディスプレイ１００にヘッドトラッキング機能をもたせて、ユーザの頭部の動きと連動して視点や視線方向を変えて仮想現実の映像を生成した場合、仮想現実の映像の生成から表示までに遅延があるため、映像生成時に前提としたユーザの頭部の向きと、映像をヘッドマウントディスプレイ１００に表示した時点でのユーザの頭部の向きとの間でずれが発生し、ユーザは酔ったような感覚（「ＶＲ酔い（Virtual Reality Sickness）」などと呼ばれる）に陥ることがある。 Here, reprojection will be described. When the head-mounted display 100 is provided with a head tracking function and a virtual reality image is generated by changing the viewpoint and line-of-sight direction in conjunction with the movement of the user's head, there is a delay from the generation of the virtual reality image to its display. Therefore, a deviation occurs between the orientation of the user's head assumed at the time of video generation and the orientation of the user's head when the video is displayed on the head-mounted display 100, and the user feels drunk. A sensation (called “Virtual Reality Sickness” or the like) may occur.

このように、ヘッドマウントディスプレイ１００の動きを検知し、ＣＰＵが描画コマンドを発行し、ＧＰＵ（Graphics Processing Unit）がレンダリングを実行し、描画された画像がヘッドマウントディスプレイ１００に出力されるまでには時間がかかる。描画がたとえば６０ｆｐｓ（フレーム／秒）のフレームレートで行われており、ヘッドマウントディスプレイ１００の動きを検知してから画像を出力するまでに１フレーム分の遅れが生じるとする。これはフレームレート６０ｆｐｓのもとでは、１６．６７ミリ秒ほどであり、人間がずれを感知するには十分な時間である。 In this way, the movement of the head mounted display 100 is detected, the CPU issues a drawing command, the GPU (Graphics Processing Unit) executes rendering, and the drawn image is output to the head mounted display 100. time consuming. It is assumed that drawing is performed at a frame rate of 60 fps (frames per second), for example, and that there is a delay of one frame between detection of movement of head-mounted display 100 and output of an image. At a frame rate of 60 fps, this is about 16.67 milliseconds, which is enough time for humans to perceive the shift.

そこで、「タイムワープ」または「リプロジェクション」と呼ばれる処理を行い、レンダリングした画像をヘッドマウントディスプレイ１００の最新の位置と姿勢に合わせて補正することで人間がずれを感知しにくいようにする。 Therefore, a process called "time warp" or "reprojection" is performed to correct the rendered image according to the latest position and orientation of the head mounted display 100, thereby making it difficult for humans to perceive the deviation.

歪み処理部２４２は、リプロジェクション処理が施されたＣＧクロマキー画像に対してヘッドマウントディスプレイ１００の光学系で生じる歪みに合わせて画像を変形（distortion）させて歪ませる処理を施し、画像記憶部２６０に記憶する。 The distortion processing unit 242 distorts the reprojection-processed CG chromakey image in accordance with the distortion occurring in the optical system of the head-mounted display 100 . memorize to

ＨＤＭＩ送受信部２８０は、画像記憶部２６０から画像生成部２３０により生成されたＣＧクロマキー画像のフレームデータを読み出し、ＨＤＭＩにしたがってヘッドマウントディスプレイ１００に伝送する。 The HDMI transmission/reception unit 280 reads the frame data of the CG chromakey image generated by the image generation unit 230 from the image storage unit 260, and transmits it to the head mounted display 100 according to HDMI.

図９は、カメラ画像にＣＧ画像を重畳して拡張現実画像を生成するための前提技術に係る画像生成システムの構成を説明する図である。ここでは、説明を簡単にするため、拡張現実画像を生成するためのヘッドマウントディスプレイ１００と画像生成装置２００の主な構成を図示して説明する。 FIG. 9 is a diagram illustrating the configuration of an image generation system according to the base technology for superimposing a CG image on a camera image to generate an augmented reality image. Here, in order to simplify the explanation, the main configurations of the head-mounted display 100 and the image generation device 200 for generating an augmented reality image will be illustrated and explained.

ヘッドマウントディスプレイ１００のカメラユニット８０により撮影された外界のカメラ画像とデプス情報は画像信号処理部８２に供給される。画像信号処理部８２は、カメラ画像に対して画像信号処理と歪み補正処理を施し、リプロジェクション部８４に与える。画像信号処理部８２は、デプス情報を画像生成装置２００に送信し、クロマキー生成部２３５に供給する。 A camera image of the outside world captured by the camera unit 80 of the head mounted display 100 and the depth information are supplied to the image signal processing section 82 . The image signal processing unit 82 performs image signal processing and distortion correction processing on the camera image, and supplies the processed image to the reprojection unit 84 . The image signal processing unit 82 transmits the depth information to the image generation device 200 and supplies it to the chromakey generation unit 235 .

画像生成装置２００のレンダリング部２３２は、ヘッドマウントディスプレイ１００を装着したユーザの視点位置・視線方向から見た仮想オブジェクトを生成し、クロマキー生成部２３５に与える。 The rendering unit 232 of the image generation device 200 generates a virtual object viewed from the viewpoint position/line-of-sight direction of the user wearing the head mounted display 100 , and provides it to the chromakey generation unit 235 .

クロマキー生成部２３５は、デプス情報にもとづきＣＧ画像からＣＧクロマキー画像を生成する。ポストプロセス部２３６はＣＧクロマキー画像にポストプロセスを施す。リプロジェクション部２４０はポストプロセスが施されたＣＧクロマキー画像を最新の視点位置・視線方向に合うように変換する。歪み処理部２４２はリプロジェクション後のＣＧクロマキー画像に歪み処理を施す。歪み処理後の最終的なＲＧＢ画像はヘッドマウントディスプレイ１００に送信され、ＡＲ重畳部８８に供給される。このＲＧＢ画像は、カメラ画像が重畳されるべき領域がクロマキー合成で指定する１色（たとえば赤色）で塗りつぶされた画像である。クロマキーに指定される１色についてはＣＧ画像として利用できないため、ＣＧ画像ではクロマキーに指定される１色を避けて別の色を使って表現する。たとえばクロマキー色と同じ色をＣＧ画像で使いたい場合、クロマキー色の１ビットを変更して用いればよい。 The chromakey generator 235 generates a CG chromakey image from the CG image based on the depth information. A post-processing unit 236 applies post-processing to the CG chromakey image. The reprojection unit 240 converts the post-processed CG chromakey image so as to match the latest viewpoint position and line-of-sight direction. A distortion processing unit 242 applies distortion processing to the CG chromakey image after reprojection. The final RGB image after distortion processing is transmitted to the head mounted display 100 and supplied to the AR superimposing section 88 . This RGB image is an image in which the area on which the camera image is to be superimposed is filled with one color (for example, red) designated by chromakey synthesis. Since one color specified as chroma key cannot be used as a CG image, the one color specified as chroma key is avoided in the CG image and another color is used. For example, if you want to use the same color as the chromakey color in the CG image, you can use it by changing 1 bit of the chromakey color.

ヘッドマウントディスプレイ１００のリプロジェクション部８４は、画像信号処理と歪み補正処理が施されたカメラ画像を最新の視点位置・視線方向に合うように変換し、歪み処理部８６に供給する。歪み処理部８６はリプロジェクション後のカメラ画像に歪み処理を施す。ＡＲ重畳部８８は、画像生成装置２００から供給されるＣＧクロマキー画像を歪み処理後のカメラ画像に重畳することにより、拡張現実画像を生成する。生成された拡張現実画像は、ディスプレイパネル３２に表示される。 The reprojection unit 84 of the head-mounted display 100 converts the camera image, which has been subjected to image signal processing and distortion correction processing, so as to match the latest viewpoint position and line-of-sight direction, and supplies it to the distortion processing unit 86 . A distortion processing unit 86 performs distortion processing on the reprojected camera image. The AR superimposing unit 88 generates an augmented reality image by superimposing the CG chromakey image supplied from the image generating device 200 on the camera image after distortion processing. The generated augmented reality image is displayed on the display panel 32 .

上述の前提技術に係る画像生成システムでは、クロマキー生成部２３５により生成されたＣＧクロマキー画像がポストプロセス部２３６によるポストプロセス、リプロジェクション部２４０によるリプロジェクション処理、歪み処理部２４２により歪み処理を受けるため、仮想オブジェクトの境界においてエイリアシングが発生したり、実際にはない偽色が発生し、ＡＲ重畳部８８によってカメラ画像に重畳されるときに不自然さが目立つようになる。また、一般的な画像の伝送インタフェースはＲＧＢには対応するが、アルファ値を含めたＲＧＢＡには対応しておらず、アルファ値を伝送できないことから半透明のＣＧ画像は表現できないという制約も受ける。仮想オブジェクトが現実空間に落とす影や現実空間への仮想オブジェクトの映り込みを表現するためには、半透明のＣＧ画像をカメラ画像に合成する必要がある。 In the image generation system according to the base technology described above, the CG chromakey image generated by the chromakey generation unit 235 undergoes post-processing by the post-processing unit 236, re-projection processing by the re-projection unit 240, and distortion processing by the distortion processing unit 242. , aliasing occurs at the boundary of the virtual object, and false colors that do not actually exist occur, and when superimposed on the camera image by the AR superimposing unit 88, the unnaturalness becomes conspicuous. In addition, although general image transmission interfaces support RGB, they do not support RGBA including alpha values. . In order to represent the shadow cast by the virtual object on the real space and the reflection of the virtual object in the real space, it is necessary to synthesize a translucent CG image with the camera image.

以下、前提技術に係る画像生成システムにおける課題を克服したいくつかの実施の形態に係る画像生成システムを説明するが、前提技術と重複する説明は適宜省略し、前提技術から改善した構成について説明する。 In the following, image generation systems according to several embodiments that overcome the problems in the image generation system according to the base technology will be described, but explanations overlapping with the base technology will be omitted as appropriate, and configurations improved from the base technology will be described. .

第１の実施の形態について説明する。ヘッドマウントディスプレイ１００の構成は図７に示したものと同じである。 A first embodiment will be described. The configuration of the head mounted display 100 is the same as that shown in FIG.

図１０は、第１の実施の形態に係る画像生成装置２００の機能構成図である。 FIG. 10 is a functional configuration diagram of the image generation device 200 according to the first embodiment.

ＨＤＭＩ送受信部２８０は、ヘッドマウントディスプレイ１００からカメラユニット８０により撮影されたカメラ画像とデプス情報を受信し、カメラ画像をカメラ画像取得部２５２に、デプス情報をデプス取得部２５０に供給する。なお、このデプス情報は、現実空間のカメラ画像の奥行き情報であり、「カメラデプス情報」と呼ぶ。 The HDMI transmission/reception unit 280 receives the camera image captured by the camera unit 80 and the depth information from the head mounted display 100 , and supplies the camera image to the camera image acquisition unit 252 and the depth information to the depth acquisition unit 250 . This depth information is depth information of the camera image in the physical space, and is called "camera depth information".

画像生成部２３０は、画像記憶部２６０からコンピュータグラフィックスの生成に必要なデータを読み出し、仮想空間のオブジェクトをレンダリングしてＣＧ画像を生成し、カメラ画像取得部２５２から提供されるカメラ画像にＣＧ画像を重畳して仮の重畳画像を生成するとともに、デプス取得部２５０から提供されるカメラデプス情報にもとづいてＣＧ画像からクロマキー画像を生成する。仮の重畳画像にはポストプロセス、リプロジェクション処理、歪み処理を施し、クロマキー画像にはリプロジェクション処理、歪み処理を施す。クロマキー画像にはポストプロセスを施さないことに留意する。最後にクロマキー画像で仮の重畳画像をマスクすることにより、最終的な合成ＣＧクロマキー画像を生成して画像記憶部２６０に出力する。 The image generation unit 230 reads data necessary for generating computer graphics from the image storage unit 260, renders objects in the virtual space to generate a CG image, and applies CG to the camera image provided from the camera image acquisition unit 252. Images are superimposed to generate a temporary superimposed image, and a chromakey image is generated from the CG image based on the camera depth information provided from the depth acquisition unit 250 . Post-processing, reprojection processing, and distortion processing are performed on the temporary superimposed image, and reprojection processing and distortion processing are performed on the chromakey image. Note that the chromakey image is not post-processed. Finally, by masking the temporary superimposed image with a chromakey image, a final synthesized CG chromakey image is generated and output to the image storage unit 260 .

画像生成部２３０は、レンダリング部２３２と、重畳部２３４と、クロマキー生成部２３５と、ポストプロセス部２３６と、リプロジェクション部２４０ａ、２４０ｂと、歪み処理部２４２ａ、２４２ｂと、合成部２４４とを含む。 The image generating unit 230 includes a rendering unit 232, a superimposing unit 234, a chromakey generating unit 235, a post-processing unit 236, reprojection units 240a and 240b, distortion processing units 242a and 242b, and a synthesizing unit 244. .

レンダリング部２３２は、視点・視線設定部２２０によって設定されたユーザの視点位置および視線方向にしたがって、ヘッドマウントディスプレイ１００を装着したユーザの視点位置から視線方向に見える仮想空間のオブジェクトをレンダリングしてＣＧ画像を生成する。レンダリング部２３２が仮想空間のオブジェクトをレンダリングする際、仮想オブジェクトの奥行き情報（「シーンデプス情報」と呼ぶ）を仮想空間レンダリング用のデプスバッファ（「シーンデプスバッファ」と呼ぶ）に書き込み、仮想オブジェクト間の前後関係を判定する。仮想オブジェクトが描画されない画素はシーンデプスバッファにおいて具体的なデプス値が書き込まれず、シーンデプス値は無限大（不定）である。 The rendering unit 232 renders an object in the virtual space seen in the line-of-sight direction from the viewpoint position of the user wearing the head-mounted display 100 according to the user's line-of-sight position and line-of-sight direction set by the viewpoint/line-of-sight setting unit 220, and performs CG rendering. Generate an image. When the rendering unit 232 renders an object in the virtual space, depth information of the virtual object (referred to as “scene depth information”) is written to a depth buffer for virtual space rendering (referred to as “scene depth buffer”), and determine the context of A specific depth value is not written in the scene depth buffer for pixels where no virtual object is drawn, and the scene depth value is infinite (undefined).

さらにレンダリング部２３２は、カメラユニット８０によって撮影される現実空間の現実オブジェクトをレンダリングする。現実世界の物体の形状情報や奥行き情報は、現実世界の空間を３Ｄスキャンして空間認識することで得られる。たとえば、赤外線パターン、ＳｔｒｕｃｔｕｒｅｄＬｉｇｈｔ、ＴＯＦ（Time Of Flight）などの方式のデプスセンサを用いて現実空間の奥行き情報を取得したり、ステレオカメラの視差情報から現実空間の奥行き情報を取得することができる。このように現実空間はあらかじめ３Ｄスキャンされ、ポリゴンメッシュ構造でモデリングされる。レンダリング部２３２により、現実空間の壁、床、天井、静止物体などがレンダリングされるが、色情報は設けずに白一色で描画する。レンダリング部２３２が現実空間のオブジェクトをレンダリングする際、現実オブジェクトの奥行き情報（「実空間デプス情報」と呼ぶ）を現実空間レンダリング用のデプスバッファ（「実空間デプスバッファと呼ぶ」）に書き込み、現実オブジェクト間の前後関係を判定する。 Furthermore, the rendering section 232 renders the physical object in the physical space captured by the camera unit 80 . Shape information and depth information of objects in the real world can be obtained by 3D scanning the space in the real world and recognizing the space. For example, the depth information of the real space can be obtained using depth sensors such as infrared pattern, structured light, and TOF (Time Of Flight), or the depth information of the real space can be obtained from the parallax information of a stereo camera. In this way, the real space is 3D scanned in advance and modeled with a polygon mesh structure. The rendering unit 232 renders walls, floors, ceilings, stationary objects, and the like in the real space, but renders them in white without providing color information. When the rendering unit 232 renders an object in the physical space, depth information of the physical object (referred to as “real space depth information”) is written to a depth buffer for physical space rendering (referred to as “real space depth buffer”), Determine context between objects.

シーンデプスバッファとは別に実空間デプスバッファを設ける理由は、現実空間のレンダリングの際にシーンデプスバッファにデプス値を書き込んだ場合、シーンデプス値と実空間デプス値の区別がつかなくなり、クロマキー領域とするか否かを判断できなくなるからである。 The reason why the real space depth buffer is provided separately from the scene depth buffer is that if the depth value is written to the scene depth buffer when rendering the real space, the scene depth value and the real space depth value cannot be distinguished, and the chroma key area and the This is because it becomes impossible to determine whether or not to

レンダリング部２３２は、現実空間に対する仮想空間の光に関する表現、具体的には仮想オブジェクトが現実オブジェクトに落とす影や現実空間への仮想オブジェクトの映り込み、手前にある仮想空間のオブジェクトの背後が透けて見える表現、仮想空間における仮想の光源によるライティング表現などを半透明のＣＧ画像として描画する。たとえばシャドウマッピングは光源からの深度マップを平面に投影する方法やレイトレーシングなどの手法を用いて影や映り込みを描画することができる。仮想オブジェクトの影や映り込みの半透明ＣＧ画像を現実空間の低解像度のカメラ画像に重畳することで現実空間に対する仮想オブジェクトの影や映り込みを表現することができる。現実空間のオブジェクトは白一色でレンダリングされているため、影や映り込みが描画された領域とは区別することができる。現実空間のレンダリング領域はクロマキー領域に指定されるが、影や映り込みが描画された領域はクロマキー領域には指定されない。 The rendering unit 232 expresses light in the virtual space with respect to the real space. Visible expressions, lighting expressions by virtual light sources in virtual space, etc. are drawn as translucent CG images. For example, shadow mapping can draw shadows and reflections by using methods such as a method of projecting a depth map from a light source onto a plane and methods such as ray tracing. By superimposing a translucent CG image of the shadow or reflection of the virtual object on the low-resolution camera image of the real space, the shadow or reflection of the virtual object in the real space can be expressed. Since objects in the real space are rendered in solid white, they can be distinguished from areas where shadows and reflections are drawn. A rendering area in the real space is specified as a chromakey area, but an area where shadows and reflections are drawn is not specified as a chromakey area.

レンダリング部２３２は、仮想オブジェクトと仮想オブジェクトの影や映り込みなど仮想空間の光に関する表現が描画されたＣＧ画像を重畳部２３４とクロマキー生成部２３５に与える。 The rendering unit 232 provides the superimposing unit 234 and the chromakey generating unit 235 with a CG image in which expressions related to light in the virtual space, such as the virtual object and the shadow or reflection of the virtual object, are drawn.

重畳部２３４は、カメラ画像取得部２５２から与えられた低解像度で遅延のあるカメラ画像にＣＧ画像を重畳して仮の重畳画像を生成し、ポストプロセス部２３６に与える。重畳部２３４は、シーンデプス値が無限大の領域（すなわち仮想オブジェクトが描画されていない領域）であって、実空間デプス値が書き込まれている領域に低解像度で遅延のあるカメラ画像を重畳する。これにより、影や映り込みが半透明で描画されているＣＧ画像に対して影や映り込みの色情報を残したまま低解像度で遅延のあるカメラ画像が重畳される。すなわち影や映り込みの色情報と低解像度で遅延のあるカメラ画像の色情報がアルファ合成される。 The superimposing unit 234 superimposes the CG image on the low-resolution camera image with delay provided from the camera image acquiring unit 252 to generate a temporary superimposed image, and supplies the temporary superimposed image to the post-processing unit 236 . The superimposition unit 234 superimposes a low-resolution camera image with a delay on the area where the scene depth value is infinite (that is, the area where the virtual object is not drawn) and where the real space depth value is written. . As a result, the camera image with low resolution and delay is superimposed on the CG image in which the shadows and reflections are rendered translucent while the color information of the shadows and reflections remains. That is, the color information of shadows and reflections and the color information of a low-resolution camera image with delay are alpha synthesized.

ここで、実空間デプス値が書き込まれておらず、シーンデプス値が書き込まれた領域にはカメラ画像が重畳されないことに留意する。後述のように現実空間のメッシュ構造を変形して穴などの空き領域を設けた場合、空き領域には実空間デプス値が書き込まれず、シーンデプス値が書き込まれているため、クロマキー色にはならないから、カメラ画像は重畳されない。代わりに空き領域に仮想空間をレンダリングして空き領域から向こう側が見えるような効果を与えることも可能になる。 Here, it should be noted that the real space depth value is not written and the camera image is not superimposed on the area where the scene depth value is written. As described later, when empty areas such as holes are created by transforming the mesh structure of the real space, the real space depth value is not written in the empty area, but the scene depth value is written, so the color will not be chroma key. Therefore, the camera image is not superimposed. Instead, it is possible to render the virtual space in the empty area and give the effect that the other side can be seen through the empty area.

ポストプロセス部２３６は、仮の重畳画像に対してポストプロセスを施し、仮の重畳画像が自然で滑らかに見えるように後処理する。 A post-processing unit 236 performs post-processing on the temporary superimposed image, and performs post-processing so that the temporary superimposed image looks natural and smooth.

第１のリプロジェクション部２４０ａは、位置・姿勢取得部２１０からヘッドマウントディスプレイ１００の最新の位置・姿勢情報を受け取り、ポストプロセスが施された仮の重畳画像に対してリプロジェクション処理を施し、ヘッドマウントディスプレイ１００の最新の視点位置・視線方向から見える画像に変換する。 The first reprojection unit 240a receives the latest position/orientation information of the head mounted display 100 from the position/orientation acquisition unit 210, performs reprojection processing on the post-processed provisional superimposed image, The image is converted into an image that can be seen from the latest viewpoint position/line-of-sight direction of the mount display 100 .

第１の歪み処理部２４２ａは、リプロジェクション処理が施された仮の重畳画像に対して歪み処理を施し、合成部２４４に与える。 The first distortion processing unit 242 a performs distortion processing on the temporary superimposed image that has been subjected to the reprojection processing, and supplies the result to the synthesizing unit 244 .

クロマキー生成部２３５は、デプス取得部２５０から与えられたカメラデプス情報、レンダリング部２３２から与えられたシーンデプス情報および実空間デプス情報にもとづいてクロマキー画像を生成する。具体的には、シーンデプス値が無限大、または、実空間デプス値が書き込まれている領域で、かつ、影や映り込みの描画領域以外の領域をクロマキー色に塗りつぶし、また、カメラデプス情報とシーンデプス情報を参照して仮想オブジェクトよりも手前にある現実オブジェクトの部分をクロマキー色に塗りつぶしたクロマキー画像を生成する。 The chromakey generation unit 235 generates a chromakey image based on the camera depth information provided from the depth acquisition unit 250, the scene depth information provided from the rendering unit 232, and the real space depth information. Specifically, the area where the scene depth value is infinite or the real space depth value is written, and the area other than the shadow and reflection drawing area is painted with chromakey color, and the camera depth information and A chromakey image is generated by referring to the scene depth information and painting the portion of the real object in front of the virtual object with the chromakey color.

クロマキー画像にはポストプロセスを施さない。 Chroma key images are not post-processed.

第２のリプロジェクション部２４０ｂは、位置・姿勢取得部２１０からヘッドマウントディスプレイ１００の最新の位置・姿勢情報を受け取り、クロマキー画像に対してリプロジェクション処理を施し、ヘッドマウントディスプレイ１００の最新の視点位置・視線方向から見える画像に変換する。 The second reprojection unit 240 b receives the latest position/orientation information of the head mounted display 100 from the position/orientation acquisition unit 210 , performs reprojection processing on the chromakey image, and obtains the latest viewpoint position of the head mounted display 100 .・Convert to an image that can be seen from the line of sight.

第２の歪み処理部２４２ｂは、リプロジェクション処理が施されたクロマキー画像に対して歪み処理を施し、合成部２４４に与える。 The second distortion processing unit 242b performs distortion processing on the reprojection-processed chromakey image, and supplies it to the synthesizing unit 244. FIG.

合成部２４４は、仮の重畳画像に対してクロマキー画像をマスクとして用いて合成し、合成ＣＧクロマキー画像を生成し、画像記憶部２６０に記憶する。 The synthesizing unit 244 synthesizes the temporary superimposed image using the chromakey image as a mask, generates a synthesized CG chromakey image, and stores it in the image storage unit 260 .

ここで、第１のリプロジェクション部２４０ａが仮の重畳画像に対してリプロジェクション処理を施す場合と、第１の歪み処理部２４２ａがリプロジェクション処理後の仮の重畳画像に対して歪み処理を施す場合は、ピクセルのサンプリング時に一般にバイリニア補間などの補間処理が行われる。それに対して、第２のリプロジェクション部２４０ｂがクロマキー画像に対してリプロジェクション処理を施す場合と、第２の歪み処理部２４２ｂがリプロジェクション処理後のクロマキー画像に対して歪み処理を施す場合は、ピクセルのサンプリング時にポイントサンプリングが行われる。クロマキー画像に対してバイリニア補間などの補間処理を行うと、クロマキー色が非クロマキー色に混合されて別の色になってしまい、クロマキー画像の意味がなくなるからである。 Here, the first reprojection unit 240a performs reprojection processing on the temporary superimposed image, and the first distortion processing unit 242a performs distortion processing on the temporary superimposed image after reprojection processing. In this case, interpolation processing such as bilinear interpolation is generally performed when sampling pixels. On the other hand, when the second reprojection unit 240b performs reprojection processing on the chromakey image, and when the second distortion processing unit 242b performs distortion processing on the chromakey image after reprojection processing, Point sampling occurs when pixels are sampled. This is because if interpolation processing such as bilinear interpolation is performed on the chromakey image, the chromakey color will be mixed with the non-chromakey color and become a different color, and the chromakey image will lose its meaning.

クロマキー画像に対するリプロジェクション処理と歪み処理において、上記のポイントサンプリング以外に以下の二つの方法を用いてもよい。第１の方法は、サンプリング時にクロマキー色と非クロマキー色の間でバイリニア補間を行って中間色のピクセル値をいったん算出するが、所定の閾値を設けて、補間後のピクセル値が閾値以下である場合は、ピクセル値をクロマキー色に設定し、補間後のピクセル値が閾値を超える場合は、ピクセル値を元の非クロマキー色に設定する。この方法によれば、補間によって生じる中間色は捨てられ、閾値を超えるかどうかで、クロマキー色になるか、元の非クロマキー色になるかが決まるため、クロマキー色が非クロマキー色に混合されることはない。 In reprojection processing and distortion processing for a chromakey image, the following two methods may be used in addition to the point sampling described above. In the first method, bilinear interpolation is performed between chromakey colors and non-chromakey colors at the time of sampling to calculate intermediate color pixel values. sets the pixel value to the chromakey color, and sets the pixel value to the original non-chromakey color if the interpolated pixel value exceeds the threshold. According to this method, intermediate colors caused by interpolation are discarded, and depending on whether or not they exceed the threshold, chromakey colors are mixed with non-chromakey colors, because chromakey colors or the original non-chromakey colors are determined. no.

第２の方法は、クロマキー画像を生成する際に、（ｕ，ｖ）が参照するポイントの近傍４ピクセルをサンプリングして、近傍４ピクセルのすべてがクロマキーの条件に当てはまる場合にクロマキー色に設定し、近傍４ピクセルのいずれかがクロマキーの条件に当てはまらない場合はクロマキー色には設定しない。近傍ピクセルの選び方として４ピクセル以外に９ピクセルを選んでもよい。この方法によれば、エッジ境界部分がクロマキー色によって塗りつぶされないようになるため、ＣＧ画像とカメラ画像が境界付近でより自然に融合して見える。 The second method is to sample 4 pixels near the point referred to by (u, v) when generating the chromakey image, and set the chromakey color if all of the 4 pixels near the point meet the chromakey conditions. , 4 pixels in the neighborhood do not meet the chromakey conditions, the chromakey color is not set. Nine pixels may be selected instead of four pixels as a method of selecting neighboring pixels. According to this method, since the edge boundary portion is not painted with the chromakey color, the CG image and the camera image appear to merge more naturally near the boundary.

ＨＤＭＩ送受信部２８０は、画像記憶部２６０から画像生成部２３０により生成された合成ＣＧクロマキー画像のフレームデータを読み出し、ＨＤＭＩにしたがってヘッドマウントディスプレイ１００に伝送する。 The HDMI transmission/reception unit 280 reads the frame data of the composite CG chromakey image generated by the image generation unit 230 from the image storage unit 260, and transmits it to the head mounted display 100 according to HDMI.

図１１は、カメラ画像にＣＧ画像を重畳して拡張現実画像を生成するための第１の実施の形態に係る画像生成システムの構成を説明する図である。 FIG. 11 is a diagram illustrating the configuration of an image generation system according to the first embodiment for superimposing a CG image on a camera image to generate an augmented reality image.

ヘッドマウントディスプレイ１００のカメラユニット８０により撮影された外界のカメラ画像とカメラデプス情報は画像信号処理部８２に供給される。画像信号処理部８２は、低遅延かつ高解像度のカメラ画像に対して画像信号処理と歪み補正処理を施し、リプロジェクション部８４に与える。さらに、画像信号処理部８２は、画像信号処理と歪み補正処理が施されたカメラ画像とカメラデプス情報を画像生成装置２００に送信し、カメラ画像は重畳部２３４に、カメラデプス情報はクロマキー生成部２３５に供給される。画像生成装置２００に送信されるカメラ画像は遅延があり、低解像度である。 A camera image of the outside world captured by the camera unit 80 of the head mounted display 100 and the camera depth information are supplied to the image signal processing section 82 . The image signal processing unit 82 performs image signal processing and distortion correction processing on the low-delay and high-resolution camera image, and supplies the processed image to the reprojection unit 84 . Further, the image signal processing unit 82 transmits the camera image subjected to the image signal processing and the distortion correction processing and the camera depth information to the image generation device 200. The camera image is transmitted to the superimposition unit 234, and the camera depth information is transmitted to the chromakey generation unit. 235. The camera images sent to the image generation device 200 are delayed and of low resolution.

画像生成装置２００のレンダリング部２３２は、現実空間の現実オブジェクトをレンダリングするとともに、ヘッドマウントディスプレイ１００を装着したユーザの視点位置・視線方向から見た仮想オブジェクトを生成し、現実空間に対する仮想空間の光に関する表現、具体的には仮想オブジェクトが現実オブジェクトに落とす影や仮想オブジェクトの映り込み、仮想オブジェクトの半透明化、仮想光源によるライティング表現などをレンダリングする。レンダリング部２３２は、生成されたＣＧ画像を重畳部２３４に与え、シーンデプス情報と実空間デプス情報をクロマキー生成部２３５に与える。 The rendering unit 232 of the image generation device 200 renders a real object in the real space, generates a virtual object seen from the viewpoint position/line-of-sight direction of the user wearing the head mounted display 100, and converts the light in the virtual space to the real space. It renders expressions related to objects, specifically shadows cast by virtual objects on real objects, reflections of virtual objects, translucency of virtual objects, and lighting expressions using virtual light sources. The rendering unit 232 provides the generated CG image to the superimposing unit 234 and provides the scene depth information and real space depth information to the chromakey generating unit 235 .

重畳部２３４は、ＣＧ画像にカメラ画像を重畳し、仮の重畳画像を生成し、ポストプロセス部２３６に与える。ここで、ヘッドマウントディスプレイ１００から提供されるカメラ画像は低解像度で遅延があってもよい。なぜなら、仮の重畳画像におけるカメラ画像の部分はクロマキー画像でマスクされて最終的には消去されるからである。重畳部２３４は、シーンデプス値は書き込まれていないが、実空間デプス値が書き込まれている領域にカメラ画像を重畳する。したがって、仮想オブジェクトが描画されていない領域と、影や映り込みなど仮想空間の光に関する表現が描画されている領域にカメラ画像が重畳される。影や映り込みなど仮想空間の光に関する表現が描画されている領域では影や映り込みなど仮想空間の光に関する表現の色情報が半透明のＣＧ画像として低解像度のカメラ画像に合成される。 The superimposing unit 234 superimposes the camera image on the CG image to generate a temporary superimposed image, and supplies it to the post-processing unit 236 . Here, the camera image provided from the head mounted display 100 may have a low resolution and a delay. This is because the part of the camera image in the temporary superimposed image is masked with the chromakey image and finally erased. The superimposing unit 234 superimposes the camera image on the area in which the scene depth value is not written but the real space depth value is written. Therefore, the camera image is superimposed on an area where no virtual object is drawn and an area where expressions related to light in the virtual space, such as shadows and reflections, are drawn. In a region where an expression related to the light in the virtual space, such as a shadow or reflection, is drawn, the color information of the expression related to the light in the virtual space, such as a shadow or reflection, is combined with the low-resolution camera image as a translucent CG image.

ポストプロセス部２３６は仮の重畳画像にポストプロセスを施す。リプロジェクション部２４０ａはポストプロセスが施された仮の重畳画像を最新の視点位置・視線方向に合うように変換する。歪み処理部２４２ａはリプロジェクション後の仮の重畳画像に歪み処理を施し、合成部２４４に与える。 A post-processing unit 236 applies post-processing to the temporary superimposed image. The reprojection unit 240a converts the post-processed provisional superimposed image so as to match the latest viewpoint position and line-of-sight direction. The distortion processing unit 242 a applies distortion processing to the temporary superimposed image after reprojection, and supplies it to the synthesizing unit 244 .

クロマキー生成部２３５は、カメラデプス情報、シーンデプス情報および実空間デプス情報にもとづきクロマキー画像を生成する。シーンデプス値が書き込まれていない領域はクロマキー色に塗りつぶされるが、影や映り込みなど仮想空間の光に関する表現が描画されている半透明のＣＧ領域はクロマキー色に塗りつぶされない。また、カメラデプス情報とシーンデプス情報にもとづいて現実空間のオブジェクトと仮想空間のオブジェクトの位置関係がリアルタイムで判定され、仮想オブジェクトよりも手前にある現実の移動物体（手やボールなど）の部分がクロマキー色に塗りつぶされる。リプロジェクション部２４０ｂはクロマキー画像を最新の視点位置・視線方向に合うように変換する。歪み処理部２４２ｂはリプロジェクション後のクロマキー画像に歪み処理を施し、合成部２４４に与える。 The chromakey generation unit 235 generates a chromakey image based on the camera depth information, scene depth information, and real space depth information. Areas in which scene depth values are not written are filled with the chromakey color, but translucent CG areas where representations related to light in the virtual space, such as shadows and reflections, are drawn are not painted with the chromakey color. In addition, based on the camera depth information and scene depth information, the positional relationship between the objects in the real space and the objects in the virtual space is determined in real time. Filled with chromakey color. The reprojection unit 240b converts the chromakey image so as to match the latest viewpoint position and line-of-sight direction. The distortion processing unit 242b applies distortion processing to the chromakey image after reprojection, and supplies it to the synthesizing unit 244. FIG.

合成部２４４は、クロマキー画像で仮の重畳画像をマスクすることで合成ＣＧクロマキー画像を生成する。ここで、仮の重畳画像はポストプロセスを施されているため、滑らかな画像となっており、カメラ画像とＣＧ画像の境界は目立たない。他方、クロマキー画像はポストプロセスが施されていないため、仮想オブジェクトの境界にエイリアシングや偽色が発生していない。したがって、ポストプロセスが施されていないクロマキー画像でポストプロセスが施された重畳画像をマスクすれば、仮想オブジェクトの境界にエイリアシングや偽色がなく、かつ自然で滑らかな合成ＣＧクロマキー画像が合成される。 The combining unit 244 generates a combined CG chromakey image by masking the temporary superimposed image with the chromakey image. Here, since the temporary superimposed image has undergone post-processing, it is a smooth image, and the boundary between the camera image and the CG image is inconspicuous. On the other hand, chromakey images are not post-processed, so there is no aliasing or false colors at the borders of virtual objects. Therefore, by masking the post-processed superimposed image with a non-post-processed chromakey image, a natural and smooth composite CG chroma-key image is synthesized without aliasing or false colors at the boundary of the virtual object. .

シーンデプス値が無限大であっても、影や映り込みが描画されている半透明のＣＧ領域はクロマキー色で塗りつぶされないため、影や映り込みの半透明のＣＧ領域は低解像度のカメラ画像に重畳された状態で合成ＣＧクロマキー画像の中に残される。 Even if the scene depth value is infinite, semi-transparent CG areas with shadows and reflections are not filled with chromakey colors, so the semi-transparent CG areas with shadows and reflections are rendered as low-resolution camera images. are left in the synthesized CG chromakey image in a superimposed state.

合成ＣＧクロマキー画像は特定の１色がクロマキーに指定されたＲＧＢ画像としてヘッドマウントディスプレイ１００に送信され、ＡＲ重畳部８８に供給される。 The synthesized CG chromakey image is transmitted to the head-mounted display 100 as an RGB image in which one specific color is designated as chromakey, and supplied to the AR superimposing unit 88 .

ヘッドマウントディスプレイ１００のリプロジェクション部８４は、画像信号処理と歪み補正処理が施された低遅延かつ高解像度のカメラ画像を最新の視点位置・視線方向に合うように変換し、歪み処理部８６に供給する。歪み処理部８６はリプロジェクション後の低遅延かつ高解像度のカメラ画像に歪み処理を施す。ＡＲ重畳部８８は、画像生成装置２００から供給される合成ＣＧクロマキー画像を歪み処理後の低遅延かつ高解像度のカメラ画像に重畳することにより、拡張現実画像を生成する。生成された拡張現実画像は、ディスプレイパネル３２に表示される。クロマキー色が指定された領域にはヘッドマウントディスプレイ１００側で低遅延かつ高解像度のカメラ画像が重畳される。 The reprojection unit 84 of the head-mounted display 100 converts the low-delay, high-resolution camera image that has undergone image signal processing and distortion correction processing so as to match the latest viewpoint position and line-of-sight direction, and sends it to the distortion processing unit 86. supply. A distortion processing unit 86 performs distortion processing on the low-delay and high-resolution camera image after reprojection. The AR superimposing unit 88 generates an augmented reality image by superimposing the synthesized CG chromakey image supplied from the image generating device 200 on the low-delay and high-resolution camera image after distortion processing. The generated augmented reality image is displayed on the display panel 32 . A low-delay and high-resolution camera image is superimposed on the area where the chromakey color is specified on the head-mounted display 100 side.

図１２は、第１の実施の形態に係る画像生成システムによってカメラ画像にＣＧ画像を重畳した拡張現実画像を説明する図である。図５と同様、ユーザが仮想オブジェクトであるティーポット４１０を触ろうとしており、手４２０が写り込んだカメラ画像に仮想オブジェクトであるティーポット４１０が重畳される。仮想オブジェクトであるティーポット４１０が現実のテーブルに落とす影４１２が半透明ＣＧとしてカメラ画像に重畳される形で描画されている。そのため影４１２の描画領域には現実のテーブル表面の花柄が透けて見える。 FIG. 12 is a diagram for explaining an augmented reality image in which a CG image is superimposed on a camera image by the image generation system according to the first embodiment. As in FIG. 5 , the user is about to touch the virtual object teapot 410 , and the virtual object teapot 410 is superimposed on the camera image in which the hand 420 is captured. A shadow 412 cast by a teapot 410, which is a virtual object, on a real table is drawn as semi-transparent CG so as to be superimposed on the camera image. Therefore, the flower pattern on the actual table surface can be seen through the drawing area of the shadow 412 .

図１３は、第１の実施の形態に係る画像生成システムによって利用される合成ＣＧクロマキー画像を説明する図である。合成ＣＧクロマキー画像は、仮想オブジェクト（ここではティーポット）の描画領域と仮想オブジェクトの影や映り込みの描画領域（ここではティーポットが花柄のテーブル表面に落とす影）がＣＧであり、それ以外はクロマキー色で塗りつぶされている。また、仮想オブジェクトよりも手前に現実の移動物体（ここではユーザの手）がある場合は手前の移動物体の部分もクロマキー色で塗りつぶされる。 FIG. 13 is a diagram for explaining a composite CG chromakey image used by the image generation system according to the first embodiment. In the synthesized CG chromakey image, the drawing area of the virtual object (here, the teapot) and the drawing area of the shadow and reflection of the virtual object (here, the shadow cast by the teapot on the floral table surface) are CG, and the rest is chromakey. filled with color. Also, if there is a real moving object (here, the user's hand) in front of the virtual object, the portion of the moving object in front is also painted with the chromakey color.

図１３の合成ＣＧクロマキー画像がヘッドマウントディスプレイ１００側で高解像度のカメラ画像に重畳されることで図１２の拡張現実画像が生成される。ティーポット４１０の影の部分は低解像度のカメラ画像と合成されているが、それ以外のカメラ画像はユーザの手の領域も含めて高解像度であることに留意する。 The augmented reality image shown in FIG. 12 is generated by superimposing the synthesized CG chromakey image shown in FIG. 13 on the high-resolution camera image on the head-mounted display 100 side. Note that the shaded portion of the teapot 410 is composited with the low resolution camera image, but the rest of the camera image is high resolution, including the area of the user's hand.

第１の実施の形態の画像生成システムを用いれば、現実空間の壁や天井、テーブルなどに仮想的に穴を空けてＣＧ画像を重畳することもできる。これを図１４および図１５を参照して説明する。 By using the image generation system of the first embodiment, it is possible to virtually make a hole in a wall, ceiling, table, or the like in the real space and superimpose a CG image. This will be described with reference to FIGS. 14 and 15. FIG.

図１４は、現実空間のポリゴンメッシュを変形して壁に穴を空ける例を説明する図である。３Ｄスキャンによって現実空間のポリゴンメッシュが得られるが、これを変形することで図１４のように壁に穴４３０を設けることができる。穴４３０の別の表現方法として、アルファマスクのあるテクスチャを利用してもよい。 FIG. 14 is a diagram illustrating an example of deforming a polygon mesh in the real space to make a hole in a wall. A polygon mesh in the real space is obtained by 3D scanning, and by deforming it, a hole 430 can be provided in the wall as shown in FIG. Another representation of the hole 430 may utilize a texture with an alpha mask.

穴４３０には現実空間の物体が何も存在していないため、レンダリング部２３２が現実空間のレンダリングをする際、実空間デプスバッファの穴４３０に対応する領域にはデプス情報が書き込まれない。一方、レンダリング部２３２は、穴４３０に仮想的なオブジェクトをレンダリングすることができ、その際、シーンデプスバッファの穴４３０に対応する領域にデプス情報が書き込まれる。したがって、重畳部２３４がカメラ画像をＣＧ画像に重畳する際、実空間デプス値が設定されていない穴４３０にはカメラ画像が重畳されることがなく、シーンデプス値が設定されている穴４３０にＣＧ画像が表示される。 Since no physical space object exists in the hole 430 , depth information is not written to the area corresponding to the hole 430 in the real space depth buffer when the rendering unit 232 renders the physical space. On the other hand, the rendering unit 232 can render a virtual object in the hole 430, and depth information is written in the area corresponding to the hole 430 in the scene depth buffer. Therefore, when the superimposing unit 234 superimposes the camera image on the CG image, the camera image is not superimposed on the hole 430 for which the real space depth value is not set, and the hole 430 for which the scene depth value is set. A CG image is displayed.

図１５は、現実空間の壁の穴に仮想オブジェクトが描画された例を説明する図である。図１４の穴４３０に対応して図１５では仮想的な穴４４０がＣＧで描画され、仮想的な穴４４０の向こう側に仮想的な自動車がＣＧで描画されている。 FIG. 15 is a diagram illustrating an example in which a virtual object is drawn in a hole in a wall in real space. A virtual hole 440 is drawn by CG in FIG. 15 corresponding to the hole 430 in FIG. 14 , and a virtual car is drawn by CG on the other side of the virtual hole 440 .

第１の実施の形態の画像生成システムによれば、カメラ画像とＣＧ画像の境界の違和感をポストプロセスによって緩和するとともに、クロマキー画像にはポストプロセスを施さないため、エイリアシングや偽色のないクロマキー画像を生成して、高品質の合成ＣＧクロマキー画像を生成できる。この合成ＣＧクロマキー画像をカメラ画像に重畳して拡張現実画像を生成するため、不自然さのない拡張現実画像を生成できる。また、ＣＧ画像に半透明部分がある場合もカメラ画像に重畳してポスト処理する際に半透明処理を施すことができるので、半透明部分もレンダリングによるレイテンシはあるが表現することができる。また、合成ＣＧクロマキー画像は、特定の１色をクロマキーとしたＲＧＢ画像であるため、ＲＧＢ画像を伝送することのできるＨＤＭＩなどの一般的な通信インタフェースで伝送することができる。 According to the image generation system of the first embodiment, since post-processing alleviates discomfort at the boundary between the camera image and the CG image, and post-processing is not applied to the chromakey image, the chromakey image is free from aliasing and false colors. can be generated to generate a high-quality synthetic CG chromakey image. Since an augmented reality image is generated by superimposing this synthesized CG chromakey image on a camera image, an augmented reality image without unnaturalness can be generated. Also, even if the CG image has a semi-transparent part, it can be superimposed on the camera image and subjected to post-processing. Also, since the synthesized CG chromakey image is an RGB image with one specific color as the chromakey, it can be transmitted through a general communication interface such as HDMI capable of transmitting RGB images.

なお、上記の説明では、合成ＣＧクロマキー画像は、特定の１色をクロマキーとしたＲＧＢ画像であったが、クロマキー画像のクロマキー色をＲＧＢＡのアルファ成分に格納してもよい。即ち、ポストプロセスの中間処理までは、画像のクロマキー色となる部分をＲＧＢＡのアルファ成分を透過にして処理を行い、ポストプロセスの最終段階でアルファ成分が透過である部分の色をクロマキー色に置き換えてもよい。これにより、仮の重畳画像とクロマキー画像を格納するために二つの別個のフレームバッファを使う必要がなくなり、一つのフレームバッファ上で仮の重畳画像とクロマキー画像の処理を行うことができる。 In the above description, the synthesized CG chromakey image was an RGB image with one specific color as the chromakey, but the chromakey color of the chromakey image may be stored in the RGBA alpha component. That is, until the intermediate processing of the post-process, the chromakey color part of the image is processed with the RGBA alpha component transparent, and in the final stage of the post-process, the color of the part where the alpha component is transparent is replaced with the chromakey color. may This eliminates the need to use two separate frame buffers for storing the temporary superimposed image and the chromakey image, and the temporary superimposed image and the chromakey image can be processed in one frame buffer.

第２の実施の形態について説明する。ヘッドマウントディスプレイ１００の構成は基本的には図７に示したものと同じであるが、リプロジェクション部８４は、カメラ画像用の第１のリプロジェクション部８４ａと合成ＣＧクロマキー画像用の第２のリプロジェクション部８４ｂを有する。 A second embodiment will be described. The configuration of the head-mounted display 100 is basically the same as that shown in FIG. It has a reprojection part 84b.

図１６は、第２の実施の形態に係る画像生成装置２００の機能構成図である。 FIG. 16 is a functional configuration diagram of an image generation device 200 according to the second embodiment.

ＨＤＭＩ送受信部２８０は、ヘッドマウントディスプレイ１００からカメラユニット８０により撮影されたカメラ画像とデプス情報を受信し、カメラ画像をカメラ画像取得部２５２に、デプス情報をデプス取得部２５０に供給する。 The HDMI transmission/reception unit 280 receives the camera image captured by the camera unit 80 and the depth information from the head mounted display 100 , and supplies the camera image to the camera image acquisition unit 252 and the depth information to the depth acquisition unit 250 .

画像生成部２３０は、画像記憶部２６０からコンピュータグラフィックスの生成に必要なデータを読み出し、仮想空間のオブジェクトをレンダリングしてＣＧ画像を生成し、カメラ画像取得部２５２から提供されるカメラ画像にＣＧ画像を重畳して仮の重畳画像を生成するとともに、デプス取得部２５０から提供されるカメラデプス情報にもとづいてＣＧ画像からクロマキー画像を生成する。仮の重畳画像にはポストプロセスを施すが、クロマキー画像にはポストプロセスを施さない。最後にクロマキー画像で仮の重畳画像をマスクすることにより、最終的な合成ＣＧクロマキー画像を生成して画像記憶部２６０に出力する。 The image generation unit 230 reads data necessary for generating computer graphics from the image storage unit 260, renders objects in the virtual space to generate a CG image, and applies CG to the camera image provided from the camera image acquisition unit 252. Images are superimposed to generate a temporary superimposed image, and a chromakey image is generated from the CG image based on the camera depth information provided from the depth acquisition unit 250 . A post-process is applied to the temporary superimposed image, but a post-process is not applied to the chromakey image. Finally, by masking the temporary superimposed image with a chromakey image, a final synthesized CG chromakey image is generated and output to the image storage unit 260 .

画像生成部２３０は、レンダリング部２３２と、重畳部２３４と、クロマキー生成部２３５と、ポストプロセス部２３６と、合成部２４４とを含む。 The image generating section 230 includes a rendering section 232 , a superimposing section 234 , a chromakey generating section 235 , a post-processing section 236 and a synthesizing section 244 .

レンダリング部２３２は、視点・視線設定部２２０によって設定されたユーザの視点位置および視線方向にしたがって、ヘッドマウントディスプレイ１００を装着したユーザの視点位置から視線方向に見える仮想空間のオブジェクトをレンダリングしてＣＧ画像を生成する。レンダリング部２３２が仮想空間のオブジェクトをレンダリングする際、シーンデプス情報をシーンデプスバッファに書き込む。 The rendering unit 232 renders an object in the virtual space seen in the line-of-sight direction from the viewpoint position of the user wearing the head-mounted display 100 according to the user's line-of-sight position and line-of-sight direction set by the viewpoint/line-of-sight setting unit 220, and performs CG rendering. Generate an image. When rendering an object in the virtual space, the rendering unit 232 writes scene depth information to the scene depth buffer.

さらにレンダリング部２３２は、カメラユニット８０によって撮影される現実空間の現実オブジェクトをレンダリングする。レンダリング部２３２が現実空間のオブジェクトをレンダリングする際、現実オブジェクトの実空間デプス情報を実空間デプスバッファに書き込む。 Furthermore, the rendering section 232 renders the physical object in the physical space captured by the camera unit 80 . When the rendering unit 232 renders the physical space object, it writes the physical space depth information of the physical object to the physical space depth buffer.

重畳部２３４は、カメラ画像取得部２５２から与えられた低解像度で遅延のあるカメラ画像にＣＧ画像を重畳して仮の重畳画像を生成し、ポストプロセス部２３６に与える。重畳部２３４は、シーンデプス値が無限大の領域であって、実空間デプス値が書き込まれている領域に低解像度で遅延のあるカメラ画像を重畳する。 The superimposing unit 234 superimposes the CG image on the low-resolution camera image with delay provided from the camera image acquiring unit 252 to generate a temporary superimposed image, and supplies the temporary superimposed image to the post-processing unit 236 . The superimposing unit 234 superimposes a low-resolution camera image with a delay on an area where the scene depth value is infinite and where the real space depth value is written.

クロマキー生成部２３５は、デプス取得部２５０から与えられたカメラデプス情報、レンダリング部２３２から与えられたシーンデプス情報および実空間デプス情報にもとづいてクロマキー画像を生成する。 The chromakey generation unit 235 generates a chromakey image based on the camera depth information provided from the depth acquisition unit 250, the scene depth information provided from the rendering unit 232, and the real space depth information.

図１７は、カメラ画像にＣＧ画像を重畳して拡張現実画像を生成するための第２の実施の形態に係る画像生成システムの構成を説明する図である。 FIG. 17 is a diagram illustrating the configuration of an image generation system according to the second embodiment for superimposing a CG image on a camera image to generate an augmented reality image.

ヘッドマウントディスプレイ１００のカメラユニット８０により撮影された外界のカメラ画像とカメラデプス情報は画像信号処理部８２に供給される。画像信号処理部８２は、低遅延かつ高解像度のカメラ画像に対して画像信号処理と歪み補正処理を施し、リプロジェクション部８４に与える。さらに、画像信号処理部８２は、画像信号処理と歪み補正処理が施されたカメラ画像とカメラデプス情報を画像生成装置２００に送信し、カメラ画像は重畳部２３４に、デプス情報はクロマキー生成部２３５に供給される。画像生成装置２００に送信されるカメラ画像は遅延があり、低解像度である。 A camera image of the outside world captured by the camera unit 80 of the head mounted display 100 and the camera depth information are supplied to the image signal processing section 82 . The image signal processing unit 82 performs image signal processing and distortion correction processing on the low-delay and high-resolution camera image, and supplies the processed image to the reprojection unit 84 . Further, the image signal processing unit 82 transmits the camera image subjected to the image signal processing and the distortion correction processing and the camera depth information to the image generation device 200 , and transmits the camera image to the superimposition unit 234 and the depth information to the chromakey generation unit 235 . supplied to The camera images sent to the image generation device 200 are delayed and of low resolution.

画像生成装置２００のレンダリング部２３２は、現実空間の現実オブジェクトをレンダリングするとともに、ヘッドマウントディスプレイ１００を装着したユーザの視点位置・視線方向から見た仮想オブジェクトを生成し、仮想オブジェクトが現実オブジェクトに落とす影や仮想オブジェクトの映り込みなどのライティング表現をレンダリングする。レンダリング部２３２は、生成されたＣＧ画像を重畳部２３４に与え、シーンデプス情報と実空間デプス情報をクロマキー生成部２３５に与える。 The rendering unit 232 of the image generating device 200 renders a real object in the real space, generates a virtual object seen from the viewpoint position/line-of-sight direction of the user wearing the head mounted display 100, and renders the virtual object onto the real object. Render lighting expressions such as shadows and reflections of virtual objects. The rendering unit 232 provides the generated CG image to the superimposing unit 234 and provides the scene depth information and real space depth information to the chromakey generating unit 235 .

重畳部２３４は、ＣＧ画像にカメラ画像を重畳し、仮の重畳画像を生成し、ポストプロセス部２３６に与える。第１の実施の形態と同様、ヘッドマウントディスプレイ１００から提供されるカメラ画像は低解像度で遅延があってもよい。 The superimposing unit 234 superimposes the camera image on the CG image to generate a temporary superimposed image, and supplies it to the post-processing unit 236 . As in the first embodiment, the camera image provided from the head-mounted display 100 may be of low resolution and delayed.

ポストプロセス部２３６は仮の重畳画像にポストプロセスを施し、合成部２４４に与える。 The post-processing unit 236 applies post-processing to the temporary superimposed image and supplies it to the synthesizing unit 244 .

クロマキー生成部２３５は、カメラデプス情報、シーンデプス情報および実空間デプス情報にもとづきクロマキー画像を生成し、合成部２４４に与える。 The chromakey generation unit 235 generates a chromakey image based on the camera depth information, the scene depth information, and the real space depth information, and provides the chromakey image to the synthesis unit 244 .

合成部２４４は、クロマキー画像で重畳画像をマスクすることで合成ＣＧクロマキー画像を生成する。第１の実施の形態と同様、ポストプロセスが施されていないクロマキー画像でポストプロセスが施された重畳画像をマスクすれば、仮想オブジェクトの境界にエイリアシングや偽色がなく、かつ自然で滑らかな合成ＣＧクロマキー画像が合成される。 The combining unit 244 generates a combined CG chromakey image by masking the superimposed image with the chromakey image. As in the first embodiment, by masking the post-processed superimposed image with a non-post-processed chromakey image, there is no aliasing or false color at the boundary of the virtual object, and natural and smooth synthesis is achieved. A CG chromakey image is synthesized.

合成ＣＧクロマキー画像は特定の１色がクロマキーに指定されたＲＧＢ画像としてヘッドマウントディスプレイ１００に送信され、リプロジェクション部８４ｂに供給される。 The synthesized CG chromakey image is transmitted to the head-mounted display 100 as an RGB image in which one specific color is designated as chromakey, and supplied to the reprojection unit 84b.

ヘッドマウントディスプレイ１００の第１のリプロジェクション部８４ａは、画像信号処理と歪み補正処理が施された低遅延かつ高解像度のカメラ画像を最新の視点位置・視線方向に合うように変換し、ＡＲ重畳部８８に供給する。 The first reprojection unit 84a of the head-mounted display 100 converts the low-delay, high-resolution camera image that has been subjected to image signal processing and distortion correction processing so as to match the latest viewpoint position and line-of-sight direction, and superimposes the AR image. 88.

ヘッドマウントディスプレイ１００の第２のリプロジェクション部８４ｂは、合成ＣＧクロマキー画像を最新の視点位置・視線方向に合うように変換し、ＡＲ重畳部８８に供給する。 The second reprojection unit 84 b of the head-mounted display 100 converts the synthesized CG chromakey image so as to match the latest viewpoint position/line-of-sight direction, and supplies it to the AR superimposition unit 88 .

ここで、ヘッドマウントディスプレイ１００において、カメラ画像用の第１のリプロジェクション部８４ａと合成ＣＧクロマキー画像用の第２のリプロジェクション部８４ｂに分かれているのは、画像生成装置２００のレンダリングには時間がかかり、リプロジェクションによって補正すべき差分量が異なるからである。たとえば、第１のリプロジェクション部８４ａで１フレーム先のリプロジェクションを行うのに対して、第２のリプロジェクション部８４ｂでは２フレーム先のリプロジェクションを行う必要がある。 Here, in the head mounted display 100, the reason why the first reprojection unit 84a for the camera image and the second reprojection unit 84b for the synthesized CG chroma key image are divided is that the rendering of the image generation device 200 takes time. This is because the amount of difference to be corrected differs depending on the reprojection. For example, while the first reprojection section 84a reprojects one frame forward, the second reprojection section 84b needs to reproject two frames forward.

ＡＲ重畳部８８は、第２のリプロジェクション部８４ｂによりリプロジェクション処理が施された合成ＣＧクロマキー画像を、第１のリプロジェクション部８４ａによりリプロジェクション処理が施された低遅延かつ高解像度のカメラ画像に重畳することにより、拡張現実画像を生成し、歪み処理部８６に供給する。クロマキー色が指定された領域にはヘッドマウントディスプレイ１００側で低遅延かつ高解像度のカメラ画像が重畳される。 The AR superimposing unit 88 converts the composite CG chromakey image reprojected by the second reprojection unit 84b into a low-delay and high-resolution camera image reprojected by the first reprojection unit 84a. , an augmented reality image is generated and supplied to the distortion processing unit 86 . A low-delay and high-resolution camera image is superimposed on the area where the chromakey color is specified on the head-mounted display 100 side.

歪み処理部８６は拡張現実画像に歪み処理を施す。生成された拡張現実画像は、ディスプレイパネル３２に表示される。 A distortion processor 86 applies distortion processing to the augmented reality image. The generated augmented reality image is displayed on the display panel 32 .

第２の実施の形態の画像生成システムによれば、第１の実施の形態と同様に、合成ＣＧクロマキー画像をカメラ画像に重畳して拡張現実画像を生成するため、不自然さのない拡張現実画像を生成できるという利点の他、次の利点がある。第１の実施の形態とは異なり、ヘッドマウントディスプレイ１００側でカメラ画像と合成ＣＧクロマキー画像に対してリプロジェクション処理を施すため、ディスプレイパネル３２に表示する直前の視点位置・視線方向に合うようにカメラ画像と合成ＣＧクロマキー画像を変換でき、高い精度で追従性のある拡張現実画像を提供できる。また、画像生成装置２００側のリプロジェクション処理の負担を軽減できるので、画像生成装置２００側ではレンダリングにより多くのリソースをかけることができる。 According to the image generation system of the second embodiment, similar to the first embodiment, since an augmented reality image is generated by superimposing a synthetic CG chromakey image on a camera image, augmented reality without unnaturalness can be generated. In addition to the advantage of being able to generate images, there are the following advantages. Unlike the first embodiment, since the head-mounted display 100 performs reprojection processing on the camera image and the synthesized CG chromakey image, the viewpoint position and line-of-sight direction are adjusted to match the viewpoint position and line-of-sight direction immediately before being displayed on the display panel 32 . It can convert a camera image and a synthesized CG chromakey image, and can provide an augmented reality image with high accuracy and followability. In addition, since the load of the reprojection processing on the image generation device 200 side can be reduced, more resources can be used for rendering on the image generation device 200 side.

第１の実施の形態および第２の実施の形態において、クロマキー画像にはポストプロセス部２３６によるポストプロセスを施さない構成を説明したが、変形例として、クロマキー画像に対して視点合わせやスケーリングのためにポストプロセスを適用する構成としてもよい。この場合、画素を補間する際に周囲の画素の平均値を使うなどの方法を採用すると、境界の色がクロマキー色とは異なる色に変わってしまう。そこでクロマキー色を変えないようにポストプロセスを適用することが必要である。あるいは、境界の色が変わってしまうような通常のポストプロセスをクロマキー画像に適用した後、元のクロマキー色と完全に一致する領域のみをマスクとして利用するようにしてもよい。 In the first and second embodiments, a configuration was described in which the chromakey image was not post-processed by the post-processing unit 236. However, as a modified example, the chromakey image may be adjusted for viewpoint adjustment or scaling. may be configured to apply post-processing to . In this case, if a method such as using the average value of surrounding pixels is adopted when interpolating pixels, the color of the boundary changes to a color different from the chromakey color. Therefore, it is necessary to apply a post-process so as not to change the chromakey color. Alternatively, after applying a normal post-process to the chromakey image that changes the color of the border, only the areas that exactly match the original chromakey color may be used as a mask.

以上、本発明を実施の形態をもとに説明した。実施の形態は例示であり、それらの各構成要素や各処理プロセスの組合せにいろいろな変形例が可能なこと、またそうした変形例も本発明の範囲にあることは当業者に理解されるところである。 The present invention has been described above based on the embodiments. It should be understood by those skilled in the art that the embodiments are examples, and that various modifications can be made to combinations of each component and each treatment process, and that such modifications are within the scope of the present invention. .

上記の説明では、ポストプロセスとして、被写界深度調整、トーンマッピング、アンチエイリアシングなどの処理を例示したが、歪み処理、単純な拡大・縮小、台形変換なども含めてポストプロセスと呼んでもよい。 In the above description, processing such as depth-of-field adjustment, tone mapping, and anti-aliasing are examples of post-processing, but distortion processing, simple enlargement/reduction, trapezoidal transformation, and the like may also be called post-processing.

１０制御部、２０入力インタフェース、３０出力インタフェース、３２ディスプレイパネル、４０通信制御部、４２ネットワークアダプタ、４４アンテナ、５０記憶部、６４姿勢センサ、７０外部入出力端子インタフェース、７２外部メモリ、８０カメラユニット、８２画像信号処理部、８４リプロジェクション部、８６歪み処理部、８８ＡＲ重畳部、１００ヘッドマウントディスプレイ、２００画像生成装置、２１０位置・姿勢取得部、２２０視点・視線設定部、２３０画像生成部、２３２レンダリング部、２３４重畳部、２３５クロマキー生成部、２３６ポストプロセス部、２３８シーンデプスバッファ、２３９実空間デプスバッファ、２４０リプロジェクション部、２４２歪み処理部、２４４合成部、２５０デプス取得部、２５２カメラ画像取得部、２６０画像記憶部、２８０ＨＤＭＩ送受信部、３００インタフェース。 10 control unit 20 input interface 30 output interface 32 display panel 40 communication control unit 42 network adapter 44 antenna 50 storage unit 64 attitude sensor 70 external input/output terminal interface 72 external memory 80 camera unit , 82 image signal processing unit, 84 reprojection unit, 86 distortion processing unit, 88 AR superimposition unit, 100 head mounted display, 200 image generation device, 210 position/orientation acquisition unit, 220 viewpoint/line of sight setting unit, 230 image generation unit , 232 rendering unit, 234 superimposition unit, 235 chromakey generation unit, 236 post-processing unit, 238 scene depth buffer, 239 real space depth buffer, 240 reprojection unit, 242 distortion processing unit, 244 synthesis unit, 250 depth acquisition unit, 252 Camera image acquisition unit 260 Image storage unit 280 HDMI transmission/reception unit 300 Interface.

Claims

a rendering unit that renders an object in a virtual space and an object in a real space, and renders a representation of light in the virtual space with respect to the real space to generate a computer graphics image;
a superimposing unit that superimposes the computer graphics image on the captured image of the real space to generate a temporary superimposed image;
a chromakey generating unit that generates a chromakey image by performing chromakey processing on the computer graphics image based on the depth information of the captured image of the physical space;
a synthesizing unit that generates a synthesized chromakey image that is used to generate an augmented reality image by superimposing the virtual superimposed image with the chromakey image to superimpose the captured image of the physical space on the superimposed image. ,
The chromakey generation unit is characterized in that a region of the real space in which the object in the virtual space is not rendered is set as a chromakey region, while a region of the real space in which an expression related to light in the virtual space exists is not set as a chromakey region. image generation device.

The representation of the light in the virtual space with respect to the real space includes the shadow or reflection of the object in the virtual space on the object in the real space, the representation in which the back of the object in the virtual space can be seen through, and the representation of the virtual space in the virtual space. 2. The image generating apparatus according to claim 1, wherein the image generating apparatus is at least one lighting representation by a light source.

The rendering unit deforms the polygon mesh structure obtained by spatially recognizing the physical space to generate a free space in which the virtual space is rendered, and then renders the physical space based on the deformed polygon mesh structure. 2. The image generation device of claim 1, wherein the device renders an object.

further comprising a post-processing unit for post-processing the temporary superimposed image;
The synthesizing unit generates the synthetic chromakey image by masking the post-processed temporary superimposed image with the chromakey image that has not been post-processed. Item 4. The image generation device according to any one of Items 1 to 3.

further comprising a reprojection unit that converts the post-processed temporary superimposed image and the chromakey image so as to match a new viewpoint position or line-of-sight direction;
The composition unit generates the composite chromakey image by masking the temporary superimposed image that has been subjected to the reprojection processing with the chromakey image that has been subjected to the reprojection processing. Item 5. The image generation device according to any one of Items 1 to 4.

The captured image of the physical space used by the superimposing unit to generate the temporary superimposed image has a lower resolution than the captured image of the physical space used to generate the augmented reality image. 6. The image generation device according to any one of claims 1 to 5.

7. The image generating apparatus according to claim 6, wherein the representation of light in said virtual space with respect to said real space is superimposed on said captured image with low resolution as a translucent computer graphics image.

An image generation system including a head-mounted display and an image generation device,
The image generation device is
a rendering unit that renders an object in a virtual space and an object in a real space, and renders a representation of light in the virtual space with respect to the real space to generate a computer graphics image;
a first superimposing unit that superimposes the computer graphics image on the captured image of the real space transmitted from the head-mounted display to generate a temporary superimposed image;
a chromakey generating unit that generates a chromakey image by chromakey processing the computer graphics image based on depth information of the captured image of the real space transmitted from the head-mounted display;
a synthesizing unit that generates a synthesized chromakey image that is used to generate an augmented reality image by superimposing the virtual superimposed image with the chromakey image to superimpose the captured image of the physical space on the superimposed image. ,
The head mounted display is
a second superimposing unit that generates an augmented reality image by synthesizing the composite chromakey image transmitted from the image generation device with the captured image of the physical space;
The chromakey generation unit is characterized in that a region of the real space in which the object in the virtual space is not rendered is set as a chromakey region, while a region of the real space in which an expression related to light in the virtual space exists is not set as a chromakey region. image generation system.

a rendering step of rendering a virtual space object and a real space object, and rendering a representation of light in said virtual space relative to said real space to generate a computer graphics image;
a superimposing step of superimposing the computer graphics image on a photographed image of the real space to generate a temporary superimposed image;
a chromakey generating step of generating a chromakey image by chromakey processing the computer graphics image based on the depth information of the captured image of the physical space;
and a synthesizing step of generating a synthesized chromakey image that is superimposed on the captured image of the physical space and used to generate an augmented reality image by masking the temporary superimposed image with the chromakey image. ,
In the chromakey generating step, a real space area in which an object in the virtual space is not rendered is set as a chromakey area, and a real space area in which an expression related to light in the virtual space exists is not set as a chromakey area. image generation method.

a rendering function that renders an object in a virtual space and an object in a real space and renders a representation of light in the virtual space with respect to the real space to generate a computer graphics image;
a superimposing function that superimposes the computer graphics image on the captured image of the real space to generate a temporary superimposed image;
a chromakey generation function for generating a chromakey image by performing chromakey processing on the computer graphics image based on depth information of the captured image of the physical space;
a synthesizing function for generating a synthesized chromakey image that is used to generate an augmented reality image by superimposing the virtual superimposed image with the chromakey image and superimposing it on the captured image of the physical space; to realize
The chromakey generation function is characterized in that a region of the real space in which the object in the virtual space is not rendered is set as a chromakey region, while a region of the real space in which an expression related to light in the virtual space exists is not set as a chromakey region. A program to