JP2017201742A

JP2017201742A - Processing device, and image determining method

Info

Publication number: JP2017201742A
Application number: JP2016092622A
Authority: JP
Inventors: Yasushi Okumura; Hiroyuki Segawa; Ikuo Kobayashi
Original assignee: Sony Interactive Entertainment LLC
Current assignee: Sony Interactive Entertainment LLC
Priority date: 2016-05-02
Filing date: 2016-05-02
Publication date: 2017-11-09
Anticipated expiration: 2036-05-02
Also published as: JP6580516B2

Abstract

PROBLEM TO BE SOLVED: To provide a technique for making effective use of viewing data acquired by a robot. SOLUTION: An image recording unit 220 records image data that was captured by a camera whose line-of-sight direction moves in conjunction with the movement of a face, and to which vector information indicating the line-of-sight direction has been added. A sensor information acquiring unit 204 acquires attitude information obtained by detecting the attitude of an HMD 100a worn on the head of a user B. A line-of-sight direction determining unit 208 determines the line-of-sight direction of a virtual camera from the attitude information. An image determining unit 210 determines the image data to be provided to the user on the basis of the line-of-sight direction of the virtual camera and the vector information added to the image data. A viewing data providing unit 214 provides the HMD 100a with the image data determined by the image determining unit 210. SELECTED DRAWING: Figure 15

Description

The present invention relates to a technique for moving a robot in accordance with a user's movement and using viewing data generated by the robot.

Head mounted displays (HMDs) are used in various fields. Giving an HMD a head tracking function and updating the display screen in conjunction with the posture of the user's head enhances the sense of immersion in the video world.

Japanese Patent Laid-Open No. 2015-95045

In recent years, a technology called tele-existence has emerged, in which a robot placed at a remote location serves as the user's alter ego. The remote robot transmits image data and audio data of its surroundings to the user, and reproducing them on the user's side lets the user communicate with the people around the robot with a sense of presence, as if the user were at the robot's location.

The present inventors focused on the possibilities opened up by combining tele-existence with an HMD, and developed techniques that enhance the convenience and usefulness of a tele-existence system.

The present invention has been made in view of these circumstances, and its object is to provide a structure for a remotely operated robot, a technique for processing viewing data acquired by the robot, and a technique for making effective use of viewing data acquired by the robot.

In order to solve the above problems, a processing device according to one aspect of the present invention includes: a recording unit that records image data captured by a camera whose line-of-sight direction moves in conjunction with the movement of a face, the image data having vector information indicating the line-of-sight direction added to it; an acquisition unit that acquires posture information obtained by detecting the posture of a head mounted display worn on a user's head; a line-of-sight direction determining unit that determines the line-of-sight direction of a virtual camera from the posture information; an image determining unit that determines the image data to be provided to the user on the basis of the line-of-sight direction of the virtual camera and the vector information added to the image data recorded in the recording unit; and a providing unit that provides the image data determined by the image determining unit to the head mounted display.

Another aspect of the present invention is a method of reading image data from a recording unit and determining an image to be provided to a user, the recording unit holding image data captured by a camera whose line-of-sight direction moves in conjunction with the movement of a face, with vector information indicating the line-of-sight direction added to the image data. The method includes the steps of: acquiring posture information obtained by detecting the posture of a head mounted display worn on the user's head; determining the line-of-sight direction of a virtual camera from the posture information; determining the image data to be provided to the user on the basis of the line-of-sight direction of the virtual camera and the vector information added to the image data recorded in the recording unit; and providing the image data to the head mounted display.
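The determining step above can be illustrated with a minimal Python sketch. It assumes the simplest possible rule, namely picking the recorded image whose attached line-of-sight vector is closest to the virtual camera's direction; the text here does not state the actual selection rule, and the names `RecordedImage`, `gaze_from_yaw_pitch`, and `select_image`, as well as the yaw/pitch axis convention, are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass
class RecordedImage:
    image_id: str
    gaze_vector: tuple  # unit vector (x, y, z) stored with the image


def gaze_from_yaw_pitch(yaw_deg, pitch_deg):
    """Convert an HMD yaw/pitch (degrees) into a unit line-of-sight vector.

    Assumed convention: +z is straight ahead, +x is right, +y is up.
    """
    yaw = math.radians(yaw_deg)
    pitch = math.radians(pitch_deg)
    return (
        math.cos(pitch) * math.sin(yaw),
        math.sin(pitch),
        math.cos(pitch) * math.cos(yaw),
    )


def select_image(records, camera_gaze):
    """Return the record whose gaze vector has the largest dot product
    with the virtual camera's gaze, i.e. the smallest angular distance."""
    def alignment(record):
        return sum(a * b for a, b in zip(record.gaze_vector, camera_gaze))
    return max(records, key=alignment)


records = [
    RecordedImage("front", (0.0, 0.0, 1.0)),
    RecordedImage("right", (1.0, 0.0, 0.0)),
    RecordedImage("up", (0.0, 1.0, 0.0)),
]
chosen = select_image(records, gaze_from_yaw_pitch(80.0, 0.0))
```

A nearest-direction lookup is only one option; a real implementation might instead blend or crop several recorded images to build each frame.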

Any combination of the above components, and any conversion of the expression of the present invention between a method, an apparatus, a system, a computer program, a recording medium on which a computer program is readably recorded, a data structure, and the like, is also effective as an aspect of the present invention.

According to the present invention, it is possible to provide a structure for a remotely operated robot, a technique for processing viewing data acquired by the robot, and a technique for making effective use of viewing data acquired by the robot.

A diagram showing a configuration example of the information processing system in the embodiment.
A diagram showing an example of a usage scene of the robot.
A diagram showing an example of the external shape of the HMD.
A diagram showing functional blocks of the HMD.
A diagram showing the external configuration of the robot.
A diagram showing the configuration of the insertion member.
A diagram showing a cross section of the robot.
A diagram showing an example of the posture of the housing of the robot.
A diagram showing an example of the posture of the housing of the robot.
A diagram showing functional blocks of the robot.
A diagram showing the circuit configuration of the phase difference amplifying device included in the audio processing unit.
A diagram for explaining the phase difference between signal waveforms.
A diagram for explaining the principle of amplifying the phase difference between input signal waveforms.
A diagram showing functional blocks of the robot for realizing the applied technique.
A diagram showing functional blocks of the processing device.
A diagram for explaining an omnidirectional panoramic image.
A diagram for explaining the captured image data recorded in the image recording unit.
A diagram showing the relationship between the frame image generated by the image determining unit and the image data.

FIG. 1 shows a configuration example of an information processing system 1 in the embodiment. The information processing system 1 includes a robot 10 and a head mounted display device (HMD) 100 that a user A wears on the head. The HMD 100 includes a display panel 102 for both eyes, earphones 104 for both ears, and a microphone 106. The earphones 104 are used here as the audio output means, but headphones shaped to rest against the ears may be used instead. The HMD 100 is connected to a network 4 via an access point (AP) 2. The AP 2 functions as a wireless access point and a router. The HMD 100 connects to the AP 2 using a known wireless communication protocol, but may be connected by a cable.

The robot 10 includes an actuator device 12 and a housing 20 that is driven by the actuator device 12 so that its posture can be changed. A right camera 14a, a left camera 14b, a right microphone 16a, a left microphone 16b, and a speaker 18 are mounted in the housing 20. Hereinafter, the right camera 14a and the left camera 14b are referred to as the "camera 14" when they are not particularly distinguished, and the right microphone 16a and the left microphone 16b are referred to as the "microphone 16" when they are not particularly distinguished. In the embodiment, the camera 14 and the microphone 16 are provided in the housing 20 driven by the actuator device 12, but the speaker 18 may be provided, for example, in the hemispherical housing 36 of the actuator device 12. The robot 10 is connected to the network 4 via an access point (AP) 3. The robot 10 connects to the AP 3 using a known wireless communication protocol, but may be connected by a cable.

In the information processing system 1, the HMD 100 and the robot 10 are communicably connected via the network 4, and the robot 10 operates as the alter ego, so to speak, of the user A. The movement of the HMD 100 worn by the user A is transmitted to the robot 10, and the actuator device 12 moves the housing 20 in conjunction with the movement of the HMD 100. For example, when the user A nods the head back and forth, the actuator device 12 moves the housing 20 back and forth, and when the user A shakes the head left and right, the actuator device 12 moves the housing 20 left and right. As a result, people around the robot 10 can communicate with the user A with the feeling that the user A is present on the spot.

The right camera 14a and the left camera 14b are arranged on the front surface of the housing 20 at a predetermined lateral interval. The right camera 14a and the left camera 14b constitute a stereo camera: the right camera 14a captures right-eye images at a predetermined cycle, and the left camera 14b captures left-eye images at a predetermined cycle. The captured right-eye and left-eye images are transmitted to the HMD 100 of the user A in real time. The HMD 100 displays the received right-eye image on the right-eye display panel and the received left-eye image on the left-eye display panel. The user A can thus see, in real time, video in the direction in which the housing 20 of the robot 10 is facing.

The right microphone 16a and the left microphone 16b are arranged in the housing 20 at a predetermined lateral interval. The right microphone 16a and the left microphone 16b constitute a stereo microphone, and because they are laterally separated, the time at which sound reaches each microphone differs according to the position of the sound source. This difference in arrival time appears as a phase difference between the audio signals generated by the right microphone 16a and the left microphone 16b. To increase the phase difference between the two audio signals, the right microphone 16a and the left microphone 16b are preferably placed as far apart as possible, specifically on the two side surfaces of the housing 20.
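As a rough illustration of how microphone spacing turns into a phase difference, the following Python sketch uses the standard far-field approximation, in which a source at azimuth θ from straight ahead produces an arrival time difference of d·sin(θ)/c for microphones separated by d. The function names, the speed-of-sound constant, and the far-field assumption are not from the source text.

```python
import math

SPEED_OF_SOUND_M_S = 343.0  # approximate value for air at about 20 degrees C


def arrival_time_difference(mic_spacing_m, azimuth_deg):
    """Far-field arrival time difference (seconds) between two microphones.

    azimuth_deg is measured from straight ahead; 90 means the source is
    directly to one side, on the line through both microphones.
    """
    return mic_spacing_m * math.sin(math.radians(azimuth_deg)) / SPEED_OF_SOUND_M_S


def phase_difference_rad(mic_spacing_m, azimuth_deg, frequency_hz):
    """Phase difference (radians) that the time difference produces
    for a pure tone of the given frequency."""
    dt = arrival_time_difference(mic_spacing_m, azimuth_deg)
    return 2.0 * math.pi * frequency_hz * dt
```

Under this model the phase difference grows in proportion to the spacing, which is consistent with the preference for placing the microphones on opposite side surfaces of the housing.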

The audio signals generated by the right microphone 16a and the left microphone 16b are processed as described later and transmitted in real time to the HMD 100 of the user A as right-ear audio data and left-ear audio data. The HMD 100 outputs the received right-ear audio data from the right-ear earphone 104 and the received left-ear audio data from the left-ear earphone 104. The user A can thus hear the sound around the robot 10 in real time.

It is known that humans perceive the left-right position of a sound source from the difference in the arrival times of the sound wave at the two ears. In practice, however, this perception depends not only on the arrival time difference but also on the shape of the pinna, which collects sound waves, the shape of the external auditory canal, which conveys them to the middle ear, and so on. Furthermore, when a sound source is to the right or left of a person's front, the face lies in the path that the sound wave takes to reach the farther pinna, so the arrival time difference becomes larger than the difference in distance from the sound source alone would produce.

The front surface of the housing 20, on the other hand, is flat, and the microphone 16 has no shape corresponding to a pinna or an external auditory canal, so the difference in sound arrival time corresponds substantially to the difference in distance between the sound source and each microphone. In the embodiment, the right microphone 16a and the left microphone 16b are arranged on the two side surfaces of the housing 20 so that they are as far apart as possible. Even so, the inventors' experiments showed that when the audio signal generated by the right microphone 16a and the audio signal generated by the left microphone 16b are simply amplified and output from the right-ear and left-ear earphones, the left-right position of the sound source cannot be perceived well.

In other words, the experiments showed that, compared with the sound humans are accustomed to hearing, the phase difference between the audio signals generated by the right microphone 16a and the left microphone 16b is too small for the left-right direction to be perceived. The robot 10 therefore has a mechanism that amplifies the phase difference between the audio signals of the right microphone 16a and the left microphone 16b and provides the HMD 100 with audio data that is closer to the sound heard by human ears. This mechanism is described later.

In the HMD 100, the microphone 106 generates an audio signal from the voice of the user A. The audio data of the user A is transmitted to the robot 10 in real time, and the robot 10 outputs the received audio data from the speaker 18. People around the robot 10 can thus hear the voice of the user A in real time.

In this way, in the information processing system 1, the robot 10 is remotely operated by the user A and reproduces the movement of the user A's face and the user A's voice, while the user A can view and hear the images and sound around the robot through the HMD 100. The user A and the people around the robot 10 can therefore communicate in real time. Such an information processing system 1 is useful in a variety of environments.

FIG. 2 shows an example of a usage scene of the robot 10. In this example, a meeting is being held in a room, and the robot 10, acting as the alter ego of the user A, is placed on a table. The robot 10 faces the four people in front of it, and the camera 14 captures those four people within its angle of view. The robot 10 transmits the images captured by the camera 14 to the HMD 100 of the user A in real time. The user A participates in the meeting while watching the state of the room through the display panel 102 of the HMD 100. When the user A speaks, the voice is transmitted to the robot 10 in real time, and the robot 10 outputs it from the speaker 18.

As described above, the robot 10 also transmits to the HMD 100, in real time, audio data in which the phase difference between the audio signals generated by the left and right microphones 16 has been amplified. The user A can thereby perceive whether a person speaking in the room is to the right of, to the left of, or directly in front of the direction in which the housing 20 is facing. When the user A senses that a person on the right has spoken, the user A turns the head to the right. The housing 20 of the robot 10 then also turns to the right in conjunction with the movement of the user A's head, so the camera 14 captures the participant sitting on the right.

Because the robot 10, as the alter ego, moves in conjunction with the user A, the user A can participate in the meeting from a remote location as if actually in the room. The participants who are actually in the room can likewise communicate naturally with the user A through the user A's voice and the movement of the housing 20. The usage scene shown in FIG. 2 is only an example; in other usage scenes as well, the user A can obtain viewing data from the robot 10 while at a remote location.

FIG. 3 shows an example of the external shape of the HMD 100. In this example, the HMD 100 consists of an output mechanism unit 110 and a wearing mechanism unit 112. The wearing mechanism unit 112 includes a wearing band 108 that, when put on by the user, goes around the head and fixes the HMD 100 to it. The wearing band 108 is made of a material, or has a structure, that allows its length to be adjusted to the circumference of the user's head.

The output mechanism unit 110 includes a housing 114 shaped to cover the left and right eyes when the user wears the HMD 100, and contains a display panel 102 positioned directly in front of the eyes. The display panel 102 may be a liquid crystal panel, an organic EL panel, or the like. Inside the housing 114, a pair of left and right optical lenses is provided between the display panel 102 and the user's eyes to widen the user's viewing angle.

The HMD 100 further includes earphones 104 that are inserted into the user's ears when the HMD is worn. The earphones 104 are one example of audio output means, and the HMD 100 may include headphones instead. In that case, the HMD 100 and the headphones may be formed integrally or as separate bodies.

The HMD 100 transmits to the robot 10 the sensor information detected by the posture sensor and the audio data obtained by encoding the audio signal from the microphone 106. It also receives the image data and audio data generated by the robot 10 and outputs them from the display panel 102 and the earphones 104.

The HMD 100 shown in FIG. 3 is an immersive (non-transmissive) display device that completely covers both eyes, but it may be a transmissive display device. Its shape may be the hat type illustrated or an eyeglass type.

FIG. 4 shows functional blocks of the HMD 100. The control unit 120 is a main processor that processes and outputs various signals and data, such as image signals, audio signals, and sensor information, as well as commands. The storage unit 122 temporarily stores the data, commands, and the like that the control unit 120 processes. The posture sensor 124 detects posture information, such as the rotation angle and tilt of the HMD 100, at a predetermined cycle. The posture sensor 124 includes at least a three-axis acceleration sensor and a three-axis gyro sensor. The microphone 106 converts the user's voice into an electrical signal to generate an audio signal.
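The text says only that the posture sensor 124 combines a three-axis accelerometer with a three-axis gyro; it does not say how the two are fused. One common approach, shown here purely as an assumed example, is a complementary filter that integrates the gyro rate for short-term accuracy and corrects drift with the tilt implied by gravity. The function names, the axis convention, and the `alpha` weight are hypothetical.

```python
import math


def accel_to_pitch_roll(ax, ay, az):
    """Estimate pitch and roll (radians) from a gravity reading.

    Assumed convention: with the device level and at rest, the
    accelerometer reads (0, 0, +g).
    """
    pitch = math.atan2(-ax, math.hypot(ay, az))
    roll = math.atan2(ay, az)
    return pitch, roll


def complementary_update(angle_prev, gyro_rate, accel_angle, dt, alpha=0.98):
    """One filter step for a single axis.

    angle_prev  : previous angle estimate (rad)
    gyro_rate   : angular rate from the gyro (rad/s)
    accel_angle : angle implied by the accelerometer (rad)
    dt          : time since the previous sample (s)
    alpha       : weight given to the integrated gyro estimate
    """
    gyro_estimate = angle_prev + gyro_rate * dt
    return alpha * gyro_estimate + (1.0 - alpha) * accel_angle
```

Yaw cannot be corrected from gravity alone, so a filter of this kind only stabilizes pitch and roll; yaw would rely on gyro integration or an additional reference.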

The communication control unit 126 exchanges signals and data with the robot 10 by wired or wireless communication via a network adapter or an antenna. The communication control unit 126 receives from the control unit 120 the posture information detected by the posture sensor 124 and the audio data obtained by encoding the audio signal from the microphone 106, and transmits them to the robot 10. The communication control unit 126 also receives image data and audio data from the robot 10 and supplies them to the control unit 120. On receiving the image data and audio data from the robot 10, the control unit 120 supplies the image data to the display panel 102 for display and the audio data to the earphones 104 for audio output.

FIG. 5 shows the external configuration of the robot 10. The housing 20 accommodates the camera 14, the microphone 16, and the speaker 18. The camera 14 and the speaker 18 are provided on the front surface of the housing, and the microphone 16 is provided on the side surfaces. The housing 20 has a protective cover 19. When the robot 10 is not in use, the protective cover 19 is placed in a closed position covering the front surface of the housing to protect the camera 14 and the speaker 18. In the state shown in FIG. 5, the protective cover 19 is in an open position rotated approximately 180 degrees from the closed position, so the camera 14 is exposed and can capture the surroundings. The protective cover 19 preferably has a stopper mechanism that holds it in the open position.

The housing 20 is supported by the actuator device 12 so that its posture can be changed. The actuator device 12 includes a leg portion 40, a hemispherical housing 36 supported on the upper part of the leg portion 40, and a drive mechanism 50 for driving the housing 20. The drive mechanism 50 includes a first arc-shaped arm 32 in which a first elongated through hole 32a is formed along its length, a second arc-shaped arm 34 in which a second elongated through hole 34a is formed along its length, and a pedestal 30 that rotatably supports the first arc-shaped arm 32 and the second arc-shaped arm 34 with the two arms crossing each other. The upper side of the pedestal 30 is covered with a cover 38, and the motors that rotate the first arc-shaped arm 32 and the second arc-shaped arm 34 are arranged in the space covered by the cover 38. The pedestal 30 is rotatably supported with respect to the housing 36, and a motor that rotates the pedestal 30 is arranged in the housing 36.

The first arc-shaped arm 32 and the second arc-shaped arm 34 are formed in a semicircular shape, and both ends of each arm are supported by the pedestal 30 so that the arms share the same center of rotation. The diameter of the semicircular first arc-shaped arm 32 is slightly larger than that of the semicircular second arc-shaped arm 34, and the first arc-shaped arm 32 is arranged on the outer peripheral side of the second arc-shaped arm 34. The first arc-shaped arm 32 and the second arc-shaped arm 34 may be arranged orthogonally on the pedestal 30. In the embodiment, the line connecting the two ends of the first arc-shaped arm 32 supported by the pedestal 30 is orthogonal to the line connecting the two ends of the second arc-shaped arm 34 supported by the pedestal 30. The insertion member 42 is inserted through the first elongated through hole 32a and the second elongated through hole 34a and is located where the two holes cross. As the first arc-shaped arm 32 and the second arc-shaped arm 34 rotate, the insertion member 42 slides within the first elongated through hole 32a and the second elongated through hole 34a.

FIG. 6 shows the configuration of the insertion member 42. So that it stays inserted through the first elongated through hole 32a and the second elongated through hole 34a, the insertion member 42 includes a first restricting portion 42a that is wider than the first elongated through hole 32a and a second restricting portion 42b that is wider than the second elongated through hole 34a. The first restricting portion 42a is located above the first elongated through hole 32a and the second restricting portion 42b is located below the second elongated through hole 34a, which prevents the insertion member 42 from dropping out of the two holes. To allow the insertion member 42 to be attached, either the first restricting portion 42a or the second restricting portion 42b may be formed separately from the shaft portion 42c and fixed to the end of the shaft portion 42c after the shaft portion 42c has been inserted through the first elongated through hole 32a and the second elongated through hole 34a.

The shaft portion 42c is the part inserted through the first elongated through hole 32a and the second elongated through hole 34a, and it is always located where the two holes cross. Rotation of the shaft portion 42c is restricted within the first elongated through hole 32a and the second elongated through hole 34a. In the embodiment, the shaft portion 42c has a rectangular cross section slightly narrower than the widths of the first elongated through hole 32a and the second elongated through hole 34a, which restricts its rotation within the holes, but the rotation of the shaft portion 42c may be restricted by other means. For example, a rail may be provided on the inner peripheral surface of the second arc-shaped arm 34 and a rail groove in the second restricting portion 42b, so that the engagement of the rail with the rail groove restricts the rotation of the shaft portion 42c. The housing 20 is attached to the first restricting portion 42a, and because the rotation of the shaft portion 42c is restricted, the housing 20 can be held in a desired posture.

Because the shaft portion 42c is narrower than the first through long hole 32a and the second through long hole 34a, it can slide within both holes. As the first arc-shaped arm 32 and the second arc-shaped arm 34 rotate, the insertion member 42 can therefore move along the first through long hole 32a and along the second through long hole 34a.

FIG. 7 shows cross sections of the robot 10. FIG. 7(a) shows a cross section cut along the second arc-shaped arm 34 with the first arc-shaped arm 32 and the second arc-shaped arm 34 standing upright at 90 degrees to the pedestal 30, and FIG. 7(b) shows a cross section cut along the first arc-shaped arm 32 in the same state.

The first motor 52 is provided to rotate the first arc-shaped arm 32, and the second motor 54 is provided to rotate the second arc-shaped arm 34. The first motor 52 and the second motor 54 are mounted on the pedestal 30, so when the pedestal 30 rotates they rotate together with it. The third motor 56 is provided to rotate the pedestal 30 and is disposed in the housing 36. The first motor 52, the second motor 54, and the third motor 56 are driven by power supplied from a power supply device (not shown).

With the first motor 52 rotating the first arc-shaped arm 32, the second motor 54 rotating the second arc-shaped arm 34, and the third motor 56 rotating the pedestal 30, the actuator device 12 can change the orientation and posture of the housing 20 attached to the insertion member 42.

FIGS. 8 and 9 show examples of the posture of the housing 20 in the robot 10.
FIGS. 8(a) and 8(b) show examples in which the housing 20 is tilted to the left and right. FIGS. 9(a) and 9(b) show examples in which the housing 20 is tilted forward and backward. In this way, the drive mechanism 50 of the robot 10 can make the housing 20 take an arbitrary posture. The posture of the housing 20 is controlled by adjusting the drive amounts of the first motor 52 and the second motor 54, and the orientation of the housing 20 is controlled by adjusting the drive amount of the third motor 56.

図１０は、ロボット１０の機能ブロックを示す。ロボット１０は、外部からの入力を受け付けて処理する入力系統２２と、外部への出力を処理する出力系統２４とを備える。入力系統２２は、受信部６０、センサ情報取得部６２、動き検出部６４、視線方向決定部６６、アクチュエータ制御部６８、音声データ取得部７０および音声処理部７２を備える。また出力系統２４は、画像処理部８０、音声処理部８２および送信部９０を備える。 FIG. 10 shows functional blocks of the robot 10. The robot 10 includes an input system 22 that receives and processes input from the outside, and an output system 24 that processes output to the outside. The input system 22 includes a reception unit 60, a sensor information acquisition unit 62, a motion detection unit 64, a line-of-sight direction determination unit 66, an actuator control unit 68, an audio data acquisition unit 70, and an audio processing unit 72. The output system 24 includes an image processing unit 80, an audio processing unit 82, and a transmission unit 90.

図１０において、さまざまな処理を行う機能ブロックとして記載される各要素は、ハードウェア的には、回路ブロック、メモリ、その他のＬＳＩで構成することができ、ソフトウェア的には、メモリにロードされたプログラムなどによって実現される。したがって、これらの機能ブロックがハードウェアのみ、ソフトウェアのみ、またはそれらの組合せによっていろいろな形で実現できることは当業者には理解されるところであり、いずれかに限定されるものではない。 In FIG. 10, each element described as a functional block for performing various processes can be configured by a circuit block, a memory, and other LSIs in terms of hardware, and loaded in the memory in terms of software. Realized by programs. Therefore, it is understood by those skilled in the art that these functional blocks can be realized in various forms by hardware only, software only, or a combination thereof, and is not limited to any one.

As described above, the HMD 100 transmits to the robot 10 the sensor information detected by the posture sensor 124 and the audio data obtained by encoding the audio signal generated by the microphone 106, and the reception unit 60 receives the sensor information and the audio data. The audio data acquisition unit 70 acquires the received audio data, and the audio processing unit 72 performs audio processing and outputs the result from the speaker 18. The robot 10 thereby reproduces the voice of the user A in real time, so that people around the robot 10 can hear the voice of the user A.

センサ情報取得部６２は、ＨＭＤ１００の姿勢センサ１２４が検出した姿勢情報を取得する。動き検出部６４は、ユーザＡの頭部に装着されたＨＭＤ１００の姿勢を検出する。視線方向決定部６６は、動き検出部６４により検出されたＨＭＤ１００の姿勢に応じて筐体２０のカメラ１４の視線方向を定める。 The sensor information acquisition unit 62 acquires posture information detected by the posture sensor 124 of the HMD 100. The motion detection unit 64 detects the posture of the HMD 100 attached to the user A's head. The line-of-sight direction determination unit 66 determines the line-of-sight direction of the camera 14 of the housing 20 according to the attitude of the HMD 100 detected by the motion detection unit 64.

The motion detection unit 64 performs head tracking processing that detects the posture of the head of the user wearing the HMD 100. The head tracking processing is performed to link the field of view displayed on the display panel 102 of the HMD 100 to the posture of the user's head. In the head tracking processing of the embodiment, the rotation angle of the HMD 100 with respect to a horizontal reference direction and its tilt angle with respect to the horizontal plane are detected. The horizontal reference direction may be set, for example, as the direction the HMD 100 is facing when it is powered on.

視線方向決定部６６は、動き検出部６４により検出されたＨＭＤ１００の姿勢に応じて、視線方向を定める。この視線方向は、ユーザＡの視線方向であり、ひいては分身であるロボット１０のカメラ１４の視線方向（光軸方向）である。 The line-of-sight direction determination unit 66 determines the line-of-sight direction according to the posture of the HMD 100 detected by the motion detection unit 64. This line-of-sight direction is the line-of-sight direction of the user A, and by extension, the line-of-sight direction (optical axis direction) of the camera 14 of the robot 10 that is a substitute.

To link the line-of-sight direction (optical axis direction) of the camera 14 to the line-of-sight direction of the user A, a reference posture of the robot 10 needs to be set in advance. FIG. 5 shows a state in which the first arc-shaped arm 32 and the second arc-shaped arm 34 stand upright at 90 degrees to the pedestal 30; this state may be set as the horizontal direction, and the direction in which the front surface of the housing 20 faces when the robot 10 is powered on may be set as the horizontal reference direction. The robot 10 may also have a posture sensor, like the HMD 100, so that it can set the horizontal direction autonomously.

With the reference postures of the HMD 100 and the robot 10 set, the line-of-sight direction determination unit 66 may use the rotation angle and tilt angle detected by the motion detection unit 64 directly as the line-of-sight direction (optical axis direction) of the camera 14. When the motion detection unit 64 detects the rotation angle and tilt angle of the HMD 100, the line-of-sight direction determination unit 66 determines the line-of-sight direction of the HMD 100 as a vector (x, y, z) in three-dimensional coordinates; the line-of-sight direction of the camera 14 of the robot 10 may then be determined as the same (x, y, z), or as (x', y', z') with some correction applied.
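As an illustration of how a rotation angle and a tilt angle can be turned into a line-of-sight vector (x, y, z), the following Python sketch assumes a frame in which z is the horizontal reference direction, x points to the right, and y points up. The axis convention, the sign of the rotation angle, and the function name `gaze_vector` are assumptions made for this example; the description does not specify them.

```python
import math

def gaze_vector(rotation_deg, tilt_deg):
    """Convert head-tracking angles to a unit line-of-sight vector (x, y, z).

    Assumed convention (not specified in the description):
      - rotation_deg: angle about the vertical axis, measured from the
        horizontal reference direction (+z), positive toward the right (+x)
      - tilt_deg: angle above the horizontal plane, positive upward (+y)
    """
    yaw = math.radians(rotation_deg)
    pitch = math.radians(tilt_deg)
    x = math.cos(pitch) * math.sin(yaw)
    y = math.sin(pitch)
    z = math.cos(pitch) * math.cos(yaw)
    return (x, y, z)

# Looking along the reference direction, then 90 degrees to the right.
forward = gaze_vector(0.0, 0.0)
right = gaze_vector(90.0, 0.0)
```

Under this convention the same vector could be passed unchanged to the camera side, or adjusted first if a correction such as (x', y', z') is wanted.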

アクチュエータ制御部６８は、視線方向決定部６６で決定された視線方向となるようにカメラ１４の向きを制御する。具体的にアクチュエータ制御部６８は、第１モータ５２、第２モータ５４、第３モータ５６に供給する電力を調整して、ＨＭＤ１００の動きに、筐体２０の動きを追従させる。アクチュエータ制御部６８によるモータ駆動制御は、リアルタイムに実施され、したがって筐体２０の向きは、ユーザＡの視線の向きと同じように動かされる。 The actuator control unit 68 controls the orientation of the camera 14 so that the line-of-sight direction determined by the line-of-sight direction determination unit 66 is obtained. Specifically, the actuator control unit 68 adjusts the power supplied to the first motor 52, the second motor 54, and the third motor 56 so that the movement of the housing 20 follows the movement of the HMD 100. The motor drive control by the actuator control unit 68 is performed in real time, and therefore the direction of the housing 20 is moved in the same way as the direction of the line of sight of the user A.

According to the actuator device 12 of the embodiment, the housing 20 is driven about the rotation centers of the first arc-shaped arm 32 and the second arc-shaped arm 34, and this motion is the same as that of a human neck. The actuator device 12 reproduces the neck movement of the user A with a simple structure in which two semicircular arms cross each other.

People convey intent through the movement of their necks. In Japan, for example, nodding the head vertically expresses affirmation and shaking it horizontally expresses negation. Because the actuator device 12 moves the housing 20 in the same way as the neck of the user A moves, people around the robot 10 can also sense the intent of the user A from the movement of the housing 20. Being able to reproduce the neck movement of the user A with a simple structure is therefore very useful in tele-existence technology.

Next, the output system 24 will be described.
In the output system 24, the right camera 14a and the left camera 14b are pointed in the direction controlled by the actuator device 12, and each captures images within its angle of view. The right camera 14a and the left camera 14b may be spaced apart by, for example, the average distance between the eyes of an adult. The right-eye image data captured by the right camera 14a and the left-eye image data captured by the left camera 14b are transmitted from the transmission unit 90 to the HMD 100 and displayed on the right half and the left half of the display panel 102, respectively. These images form parallax images as seen from the right eye and the left eye, and displaying them in the two regions into which the display panel 102 is divided allows the image to be viewed stereoscopically. Because the user A views the display panel 102 through optical lenses, the image processing unit 80 may generate image data in which the optical distortion of the lenses has been corrected in advance and supply it to the HMD 100.

右カメラ１４ａおよび左カメラ１４ｂは、所定の周期（たとえば１／６０秒）で撮影を行い、送信部９０は遅延なく画像データをＨＭＤ１００に送信する。これによりユーザＡはロボット１０の周囲の状況をリアルタイムで見ることができ、また顔の向きを変えることで、見たい方向を見ることができる。 The right camera 14a and the left camera 14b capture images at a predetermined cycle (for example, 1/60 seconds), and the transmission unit 90 transmits image data to the HMD 100 without delay. As a result, the user A can see the situation around the robot 10 in real time, and can also see the desired direction by changing the direction of the face.

The right microphone 16a and the left microphone 16b convert sound around the robot 10 into electrical signals to generate audio signals. Hereinafter, the audio signal generated by the right microphone 16a is called the "first audio signal", and the audio signal generated by the left microphone 16b is called the "second audio signal". As described above, the right microphone 16a and the left microphone 16b are spaced apart laterally in the housing 20, so a phase difference arises between the first audio signal generated by the right microphone 16a and the second audio signal generated by the left microphone 16b.

Through experiments, the present inventor found that when the first audio signal and the second audio signal are encoded with their phase difference unchanged and provided to the HMD 100, the user cannot recognize the direction of the sound source; that is, it is difficult to tell whether the sound is coming from the right or from the left. In the experiments, the lateral width of the housing 20 was set to about the width of an adult human face (16 cm), but because the microphones 16 cannot reproduce the sound transmission structure of the human ear, it was concluded that the phase difference between the first audio signal and the second audio signal alone is not sufficient for a person to perceive the direction of the sound source.
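To give a feel for the size of the phase difference produced by a 16 cm microphone spacing, the following Python sketch uses a simple far-field model. The model, the speed-of-sound constant, and the function name `inter_mic_phase_deg` are assumptions for illustration only: the sketch ignores diffraction around the housing and is not the measurement method used in the experiments described above.

```python
import math

SPEED_OF_SOUND_M_S = 343.0  # approximate value in air at about 20 degrees C

def inter_mic_phase_deg(spacing_m, azimuth_deg, frequency_hz):
    """Phase difference (degrees) between two microphones for a distant source.

    Far-field assumption: the extra path to the farther microphone is
    spacing * sin(azimuth), where azimuth is measured from straight ahead.
    """
    path_diff_m = spacing_m * math.sin(math.radians(azimuth_deg))
    delay_s = path_diff_m / SPEED_OF_SOUND_M_S
    return 360.0 * frequency_hz * delay_s

# 16 cm spacing, source directly to one side, 100 Hz tone.
phase_100hz = inter_mic_phase_deg(0.16, 90.0, 100.0)
```

In this model the phase difference grows in proportion to frequency and to the sine of the source azimuth, so low-frequency components from sources near the front produce only a few degrees of difference.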

One conceivable solution is to widen the housing 20 laterally to increase the phase difference between the first audio signal and the second audio signal, but this would make the housing 20 heavier and require more powerful motors in the actuator device 12. Widening the housing 20 would also make the spacing between the right microphone 16a and the left microphone 16b larger than the spacing between a person's ears, so the audio signals obtained would differ from what a person actually hears.

The present inventor therefore devised a way to solve this problem by amplifying the phase difference between the first audio signal and the second audio signal. As described below, the audio processing unit 82 has a function of amplifying the phase difference between the first audio signal generated by the right microphone 16a and the second audio signal generated by the left microphone 16b. Because the robot 10 needs to deliver the microphone audio to the HMD 100 in real time, the audio processing unit 82 implements the phase difference amplification function with a hardware circuit.

FIG. 11 shows the circuit configuration of the phase difference amplifying device 82a included in the audio processing unit 82. The phase difference amplifying device 82a is an analog circuit device that amplifies and outputs the phase difference between the first audio signal $v_R$ generated by the right microphone 16a and the second audio signal $v_L$ generated by the left microphone 16b.

When the first audio signal $v_R$ is input from the right microphone 16a, the first amplifier 84a outputs a first positive-phase signal $V_R^+$ obtained by amplifying the first audio signal $v_R$ and a first negative-phase signal $V_R^-$ obtained by inverting and amplifying the first audio signal $v_R$. The first amplifier 84a may consist of an operational amplifier that amplifies and outputs the positive-phase component of the input signal and an operational amplifier that amplifies and outputs the negative-phase component, or of a single operational amplifier with two output terminals that output the positive-phase and negative-phase components.

Likewise, when the second audio signal $v_L$ is input from the left microphone 16b, the second amplifier 84b outputs a second positive-phase signal $V_L^+$ obtained by amplifying the second audio signal $v_L$ and a second negative-phase signal $V_L^-$ obtained by inverting and amplifying the second audio signal $v_L$. Like the first amplifier 84a, the second amplifier 84b may consist of two operational amplifiers that output the positive-phase component and the negative-phase component respectively, or of a single operational amplifier that outputs both components.

The first adder 86a outputs an output signal $V_{rOUT}$ obtained by adding the first positive-phase signal $V_R^+$ multiplied by a first coefficient (α) and the second negative-phase signal $V_L^-$ multiplied by a second coefficient (β). Here, α and β are greater than 0 and no greater than 1. α and β are set to different values, and in this example α > β. The output signal $V_{rOUT}$ is expressed by the following equation.
$$V_{rOUT} = \alpha \times V_R^+ + \beta \times V_L^-$$

The first adder 86a may be an adding circuit that adds the output of a voltage dividing circuit that divides the first positive-phase signal $V_R^+$ by a factor of α and the output of a voltage dividing circuit that divides the second negative-phase signal $V_L^-$ by a factor of β, or it may be an operational amplifier that adds a voltage signal obtained by multiplying the first positive-phase signal $V_R^+$ by α and a voltage signal obtained by multiplying the second negative-phase signal $V_L^-$ by β.

The second adder 86b outputs an output signal $V_{lOUT}$ obtained by adding the second positive-phase signal $V_L^+$ multiplied by the first coefficient (α) and the first negative-phase signal $V_R^-$ multiplied by the second coefficient (β). The output signal $V_{lOUT}$ is expressed by the following equation.
$$V_{lOUT} = \alpha \times V_L^+ + \beta \times V_R^-$$

The second adder 86b may be an adding circuit that adds the output of a voltage dividing circuit that divides the second positive-phase signal $V_L^+$ by a factor of α and the output of a voltage dividing circuit that divides the first negative-phase signal $V_R^-$ by a factor of β, or it may be an operational amplifier that adds a voltage signal obtained by multiplying the second positive-phase signal $V_L^+$ by α and a voltage signal obtained by multiplying the first negative-phase signal $V_R^-$ by β.

The third amplifier 88a multiplies the output signal $V_{rOUT}$ of the first adder 86a by a third coefficient (γ) and outputs $V_{ROUT}$, and the fourth amplifier 88b multiplies the output signal $V_{lOUT}$ of the second adder 86b by the third coefficient (γ) and outputs $V_{LOUT}$. In the audio processing unit 82, the output signals $V_{ROUT}$ and $V_{LOUT}$ from the phase difference amplifying device 82a are each audio-encoded and transmitted from the transmission unit 90 to the HMD 100 as right-ear audio data and left-ear audio data.

FIG. 12 is a diagram for explaining the phase difference between signal waveforms. FIG. 12(a) shows the relationship between the waveforms of the first audio signal $v_R$ generated by the right microphone 16a and the second audio signal $v_L$ generated by the left microphone 16b. For convenience of explanation, it shows the relationship between the first positive-phase signal $V_R^+$ and the second positive-phase signal $V_L^+$, obtained by amplifying the first audio signal $v_R$ and the second audio signal $v_L$ by the same factor. In this input waveform, the sound source is located to the right of the housing 20 of the robot 10: the phase of the first positive-phase signal $V_R^+$ is slightly ahead of that of the second positive-phase signal $V_L^+$, and the amplitude of the first positive-phase signal $V_R^+$ is larger.

FIG. 12(b) shows the relationship between the waveforms of the output signal $V_{rOUT}$ of the first adder 86a and the output signal $V_{lOUT}$ of the second adder 86b. Compared with the phase difference of the input waveforms shown in FIG. 12(a), the phase difference of the adder output waveforms shown in FIG. 12(b) is wider, that is, amplified.

FIG. 13 is a diagram for explaining the principle of amplifying the phase difference of the input signal waveforms. FIG. 13(a) represents the first positive-phase signal $V_R^+$ and the first negative-phase signal $V_R^-$, and the second positive-phase signal $V_L^+$ and the second negative-phase signal $V_L^-$, in a two-dimensional coordinate system. The phase difference between the first positive-phase signal $V_R^+$ and the second positive-phase signal $V_L^+$ is θ.

FIG. 13(b) shows the output signal $V_{rOUT}$ of the first adder 86a and the output signal $V_{lOUT}$ of the second adder 86b. As described above, $V_{rOUT}$ and $V_{lOUT}$ are expressed as
$$V_{rOUT} = \alpha \times V_R^+ + \beta \times V_L^-$$
$$V_{lOUT} = \alpha \times V_L^+ + \beta \times V_R^-$$
In FIG. 13(b), α = 1.0 and β = 0.6 are set.

As shown in FIG. 13(b), the phase difference between $V_{rOUT}$ and $V_{lOUT}$ is θ', which is larger than the phase difference θ shown in FIG. 13(a). In this way, the phase difference amplifying device 82a amplifies the phase difference between the two input audio signals.
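The widening of the phase difference can be checked with a short phasor calculation. The following is a sketch under assumptions that the description does not state: both positive-phase signals are sinusoids with the same angular frequency ω and the same amplitude A, and each negative-phase signal is the exact negation of the corresponding positive-phase signal.

```latex
\begin{aligned}
V_R^+ &= A e^{j(\omega t + \theta/2)}, \qquad V_L^+ = A e^{j(\omega t - \theta/2)}, \qquad V_R^- = -V_R^+, \qquad V_L^- = -V_L^+ \\
V_{rOUT} &= \alpha V_R^+ + \beta V_L^-
          = A e^{j\omega t}\left[(\alpha-\beta)\cos\tfrac{\theta}{2} + j(\alpha+\beta)\sin\tfrac{\theta}{2}\right] \\
V_{lOUT} &= \alpha V_L^+ + \beta V_R^-
          = A e^{j\omega t}\left[(\alpha-\beta)\cos\tfrac{\theta}{2} - j(\alpha+\beta)\sin\tfrac{\theta}{2}\right] \\
\theta' &= 2\arctan\!\left(\frac{\alpha+\beta}{\alpha-\beta}\,\tan\frac{\theta}{2}\right)
\end{aligned}
```

Under these assumptions, with α = 1.0 and β = 0.6 the factor (α + β)/(α − β) equals 4, so for small θ the output phase difference θ' is roughly 4θ, and the ratio θ'/θ decreases as θ grows. Unequal input amplitudes, as in FIG. 12(a), would change the exact values.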

According to a simulation by the present inventor, when the phase difference of the input signals is 15 degrees, the phase difference of the output signals is four times that, 60 degrees; when the input phase difference is 30 degrees, the output phase difference is three times that, 90 degrees; and when the input phase difference is 45 degrees, the output phase difference is about 2.7 times that, 120 degrees.
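The trend in these figures can be reproduced approximately with a few lines of Python. This is a minimal sketch, not the inventor's simulation: it assumes equal-amplitude sinusoidal inputs represented as complex phasors, negative-phase signals that are exact negations of the positive-phase signals, and α = 1.0, β = 0.6. The function name `amplified_phase_deg` is hypothetical.

```python
import cmath
import math

def amplified_phase_deg(theta_deg, alpha=1.0, beta=0.6):
    """Phase difference between the two adder outputs for a given input phase difference.

    Assumptions for this sketch: equal-amplitude sinusoidal inputs, and
    negative-phase signals that are exact negations of the positive-phase ones.
    """
    half = math.radians(theta_deg) / 2.0
    v_r_pos = cmath.exp(1j * half)    # V_R^+ leads by theta/2
    v_l_pos = cmath.exp(-1j * half)   # V_L^+ lags by theta/2
    v_r_neg, v_l_neg = -v_r_pos, -v_l_pos
    v_r_out = alpha * v_r_pos + beta * v_l_neg   # first adder 86a
    v_l_out = alpha * v_l_pos + beta * v_r_neg   # second adder 86b
    # Phase of v_r_out relative to v_l_out.
    return math.degrees(cmath.phase(v_r_out * v_l_out.conjugate()))

for theta in (15.0, 30.0, 45.0):
    print(theta, round(amplified_phase_deg(theta), 1))
```

Under these idealized assumptions the outputs come to roughly 56, 94, and 118 degrees, close to but not identical with the rounded values quoted above; the conditions of the original simulation are not given, so some difference is to be expected.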

According to this simulation result, the smaller the phase difference, the larger the amplification factor. In the actual housing 20, the phase difference of the input signals is about 5 to 20 degrees, and because the phase difference amplifying device 82a can provide a large amplification factor in this range, it can widen the phase difference of the output signals to the point where the user can distinguish the direction of the sound source. The output signals $V_{ROUT}$ and $V_{LOUT}$ from the phase difference amplifying device 82a are each audio-encoded and transmitted from the transmission unit 90 to the HMD 100 as right-ear audio data and left-ear audio data.

ＨＭＤ１００において、右耳用音声データは、右耳用のイヤホン１０４から音声として出力され、左耳用音声データは、左耳用のイヤホン１０４から音声として出力される。ユーザＡは、位相差を増幅された音声を両耳から聞くことで、音源の方向を認識する。ユーザＡは、右側から声が聞こえてきたと感じれば、顔を右側に向ける。このときユーザＡの顔の動きに連動してロボット１０の筐体２０が右側を向くため（図２参照）、ロボット１０のカメラ１４は、右側の環境を撮影して、撮影画像データをリアルタイムでＨＭＤ１００に送信する。これによりユーザＡは、発声した人の顔を見ながら話すことができ、従来にない優れたユーザインタフェースを実現できる。 In the HMD 100, right ear audio data is output as audio from the right ear earphone 104, and left ear audio data is output as audio from the left ear earphone 104. User A recognizes the direction of the sound source by listening to the sound with the amplified phase difference from both ears. If the user A feels that a voice has been heard from the right side, the user A turns his face to the right side. At this time, since the housing 20 of the robot 10 faces the right side in conjunction with the movement of the face of the user A (see FIG. 2), the camera 14 of the robot 10 captures the environment on the right side and captures the captured image data in real time. Send to HMD100. As a result, the user A can speak while looking at the face of the person who speaks, and can realize an unprecedented excellent user interface.

なお上記した例では、α＝１．０、β＝０．６と設定したが、α、βの値は、実験により適切に設定されることが好ましい。図５に示すように、右マイク１６ａおよび左マイク１６ｂは、筐体２０の側面を窪ませた位置であって、前面からみて奥側の位置に設けている。マイク１６における音波の伝達構造は、筐体側面の形状に依存するため、α、βの比は、実験により最適に求められることが好ましい。 In the above example, α = 1.0 and β = 0.6 are set. However, it is preferable that the values of α and β are appropriately set by experiments. As shown in FIG. 5, the right microphone 16 a and the left microphone 16 b are provided at positions where the side surface of the housing 20 is recessed and at the back side when viewed from the front. Since the sound wave transmission structure in the microphone 16 depends on the shape of the side surface of the housing, the ratio of α and β is preferably determined optimally through experiments.

なお図５において、マイク１６は、後板１７の横方向の内側に配置されている。これは後板１７に、前方からの音波と後方からの音波の周波数特性を異ならせ、後方からの高域成分を低減させる役割をもたせるためである。つまり後板１７は、マイク１６に対して人の耳介のような機能をもち、後方からの音波が後板１７を回り込んでマイク１６に到達するようにしている。なお前方からの音波と後方からの音波の周波数特性を異ならせるために、後板１７は、さらに上下方向および横方向に広げられて形成されてもよい。マイク１６の後方に後板１７のような音波遮蔽体を形成することで、ユーザＡは、音源の前後方向の位置を聞き分けることも可能となる。 In FIG. 5, the microphone 16 is disposed inside the rear plate 17 in the lateral direction. This is because the rear plate 17 has a function of making the frequency characteristics of the sound wave from the front and the sound wave from the rear different from each other and reducing the high frequency component from the rear. In other words, the rear plate 17 has a function like a human auricle with respect to the microphone 16, and a sound wave from the rear wraps around the rear plate 17 and reaches the microphone 16. In order to make the frequency characteristics of the sound wave from the front and the sound wave from the rear different, the rear plate 17 may be formed to be further expanded in the vertical direction and the horizontal direction. By forming a sound wave shield such as the rear plate 17 behind the microphone 16, the user A can also recognize the position of the sound source in the front-rear direction.

In this way, in the information processing system 1, the user A can communicate freely and in real time with the people around the robot 10 by using the robot 10 as his or her alter ego. The following proposes a technique that further increases the usefulness of the information processing system 1.

Techniques have long been known for generating an omnidirectional panoramic image by stitching together images captured while changing the tilt of a camera. Recently, dedicated pan-tilt cameras have also come onto the market, so that even individuals can capture omnidirectional panoramic images.

情報処理システム１において、ロボット１０は、ユーザＡの頭部の動きに応じた視線方向にカメラ１４を向けて、周囲を撮影する。ユーザＡが様々な方向を向くことで、カメラ１４が様々な方向を撮影する。この撮影画像に、視線方向を表現する３次元ベクトルを付加して記録しておくことで、仮想的な全天球パノラマ画像を生成することが可能となる。 In the information processing system 1, the robot 10 photographs the surroundings by directing the camera 14 in the line-of-sight direction according to the movement of the user A's head. As the user A faces various directions, the camera 14 captures various directions. It is possible to generate a virtual omnidirectional panoramic image by adding a three-dimensional vector representing the line-of-sight direction to this captured image and recording it.

図１４は、ロボット１０の機能ブロックの変形例を示す。この機能ブロックは、図１０に示す機能ブロックを前提としており、その中で視線方向決定部６６から画像処理部８０に対して、決定した視線方向が供給されることを示している。 FIG. 14 shows a modification of the functional block of the robot 10. This functional block is based on the functional block shown in FIG. 10, and shows that the determined visual line direction is supplied from the visual line direction determining unit 66 to the image processing unit 80.

While the user A is using the robot 10, the transmission unit 90 transmits the image data for both eyes and the audio data for both ears (hereinafter sometimes collectively called "viewing data") to the HMD 100 of the user A via the network 4. At this time, the transmission unit 90 also transmits the same viewing data to the processing device 200 via the network 4 and the router 5, and the processing device 200 records the viewing data of the user A.

While recording user A's viewing data, the processing device 200 generates an omnidirectional panoramic image in real time from user A's image data, and has a function of providing the HMD 100a of user B, who is different from user A, with an image corresponding to user B's line-of-sight direction. The HMD 100a has the same configuration as the HMD 100 described so far. The processing device 200 may be configured as a single server, for example, or as a group of servers providing a cloud service.

To enable the processing device 200 to generate an omnidirectional panoramic image, the image processing unit 80 adds to each piece of frame image data both vector information indicating the line-of-sight direction supplied from the line-of-sight direction determination unit 66 and shooting time information indicating the elapsed time from the shooting start point. The vector information indicates the line-of-sight direction of the camera 14 of the robot 10. The shooting time information only needs to express the time from the shooting start point, and may be, for example, a frame number indicating the order in which frames were captured.
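
As an illustration of the additional information described above, the following is a minimal sketch of a frame record carrying vector information and shooting time information. The names `TaggedFrame` and `tag_frame`, the byte-string pixel payload, and the 60 fps default are assumptions made for this example and do not appear in the specification.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class TaggedFrame:
    """One frame image together with its additional information (hypothetical layout)."""
    pixels: bytes                     # encoded frame image; the encoding is an assumption
    gaze: Tuple[float, float, float]  # vector information: camera line-of-sight (x, y, z)
    shot_time: float                  # shooting time information: seconds since the shooting start point


def tag_frame(pixels: bytes, gaze: Tuple[float, float, float],
              frame_number: int, fps: float = 60.0) -> TaggedFrame:
    """Attach the gaze vector and an elapsed time derived from the frame number."""
    return TaggedFrame(pixels=pixels, gaze=gaze, shot_time=frame_number / fps)
```

Deriving the elapsed time from a frame number reflects the remark that a frame number may serve as shooting time information; a clock value could be stored instead.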

In this technique, while user A is using the robot 10, user B wears the HMD 100a, and image data and audio data generated from user A's viewing data supplied by the robot 10 are provided to the HMD 100a. If user A's viewing data were simply to be played back as is, the processing device 200 would only need to stream the received viewing data unchanged to user B's HMD 100a. In this technique, however, the processing device 200 reconstructs an image based on user B's line-of-sight direction from the omnidirectional panoramic image built from user A's image data, so that it can be provided to user B's HMD 100a. The audio data is streamed to user B's HMD 100a.

FIG. 15 shows the functional blocks of the processing device 200. The processing device 200 includes a reception unit 202, a sensor information acquisition unit 204, a motion detection unit 206, a line-of-sight direction determination unit 208, an image determination unit 210, an audio determination unit 212, a viewing data providing unit 214, a transmission unit 216, and a recording unit 218. The recording unit 218 includes an image recording unit 220 and an audio recording unit 222. When the reception unit 202 receives the viewing data transmitted from the robot 10, the image recording unit 220 sequentially records the received image data, and the audio recording unit 222 sequentially records the received audio data. Each frame image of the image data has the vector information and shooting time information from the time of capture added to it.

User B transmits an instruction to play back user A's viewing data to the processing device 200 through the HMD 100a. On receiving the playback instruction, the processing device 200 starts playback processing of the viewing data. The audio determination unit 212 determines the audio data to be provided to user B: it immediately reads the audio data recorded in the audio recording unit 222 and provides it to the viewing data providing unit 214. In other words, the audio determination unit 212 streams the audio data provided from the robot 10 to the HMD 100a. User B can therefore hear, through the earphone 104 of the HMD 100a, the same audio that user A is hearing.

During playback processing by the processing device 200, the reception unit 202 receives sensor information transmitted from the HMD 100a worn by user B, and the sensor information acquisition unit 204 acquires the received sensor information. This sensor information is posture information obtained by the posture sensor 124 detecting the posture of the HMD 100a. The motion detection unit 206 detects the posture of the HMD 100a worn on user B's head. The line-of-sight direction determination unit 208 determines the line-of-sight direction of the virtual camera in the omnidirectional panoramic image according to the posture of the HMD 100a detected by the motion detection unit 206. The image determination unit 210 determines the image data to be provided to user B: using the plurality of image data recorded in the image recording unit 220, it composites the image that would be captured by the virtual camera pointed in the determined line-of-sight direction, and thereby generates the image data.

The viewing data providing unit 214 provides viewing data, which combines the image data determined by the image determination unit 210 and the audio data determined by the audio determination unit 212, from the transmission unit 216 to user B's HMD 100a.

In FIG. 15, each element described as a functional block that performs various processes can be configured, in terms of hardware, by circuit blocks, memories, and other LSIs, and is realized, in terms of software, by a program loaded into memory or the like. It will therefore be understood by those skilled in the art that these functional blocks can be realized in various forms by hardware only, by software only, or by a combination of the two, and they are not limited to any one of these.

The processing device 200 generates an omnidirectional panoramic image. Accordingly, when user B turns the head left or right to rotate the horizontal line of sight, the panoramic image in the left or right direction is displayed on the display panel 102 of the HMD 100a, and when user B tilts the head up or down to tilt the line of sight vertically, the panoramic image in the upward or downward direction is displayed on the display panel 102 of the HMD 100a.

FIG. 16 is a diagram for explaining the omnidirectional panoramic image generated by the processing device 200. This technique realizes a virtual environment in which user B is located at the center of a sphere and the visible image changes as user B changes the line-of-sight direction. The image determination unit 210 stitches together the image data recorded in the image recording unit 220 to generate the omnidirectional panoramic image.

In the embodiment, to simplify the description, the robot 10 does not zoom the camera 14 and acquires image data at a fixed magnification. The image determination unit 210 therefore constructs the omnidirectional panoramic image by pasting the image data onto the inner surface of the sphere according to the vector information added to the image data. Where multiple pieces of image data overlap, the region is overwritten with the most recent image data, which makes it possible to construct an omnidirectional panoramic image close to the real-time situation.

In practice, to reduce the processing load, the image generation processing of the image determination unit 210 does not reconstruct the entire omnidirectional panoramic image every time. Instead, it dynamically generates the frame image 7 that would be captured from the center point 9 at which user B is located. At this time, the image determination unit 210 preferably sets the shooting range (angle of view) of the virtual camera 8 so that it corresponds to the shooting range (angle of view) of the actual camera 14 of the robot 10. As a result, at any moment when user A's line-of-sight direction and user B's line-of-sight direction coincide, user B sees the same image as user A.

In this way, the image determination unit 210 performs image stitching processing using the vector information set as metadata in the image data, and generates the frame image 7 for the shooting range determined by user B's line-of-sight direction. The motion detection unit 206 performs head tracking of user B to detect the rotation angle and tilt of user B's head (in practice, of the HMD 100a). Here, the rotation angle of the HMD 100a is the rotation angle relative to a reference direction in the horizontal plane, and the reference direction may be set, for example, as the direction the HMD 100a is facing when it is powered on. The tilt of the HMD 100a is its inclination angle relative to the horizontal plane. A known technique may be used for the head tracking processing; the motion detection unit 206 detects the rotation angle and tilt of the HMD 100a from the sensor information detected by the posture sensor of the HMD 100a.

The line-of-sight direction determination unit 208 determines the posture of the virtual camera 8 in the virtual sphere according to the detected rotation angle and tilt of the HMD 100a. The virtual camera 8 is arranged so as to capture the inner surface of the virtual sphere from the center point 9 of the virtual sphere, and the line-of-sight direction determination unit 208 may determine the orientation of the optical axis of the virtual camera 8 so that it coincides with the optical-axis direction of the camera 14 of the robot 10.
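
One way to turn the detected rotation angle and tilt into a line-of-sight vector is sketched below. The axis convention (x to the right, y up, z along the reference direction) and the function name `gaze_vector` are assumptions for illustration; the specification does not prescribe a particular coordinate system or conversion.

```python
import math


def gaze_vector(rotation_deg: float, tilt_deg: float):
    """Convert a horizontal rotation angle and a tilt angle into a unit (x, y, z) vector.

    Assumed convention: rotation is measured in the horizontal plane from the
    reference direction (+z) toward +x, and tilt is measured upward from the
    horizontal plane toward +y.
    """
    yaw = math.radians(rotation_deg)
    pitch = math.radians(tilt_deg)
    x = math.cos(pitch) * math.sin(yaw)
    y = math.sin(pitch)
    z = math.cos(pitch) * math.cos(yaw)
    return (x, y, z)
```

With this convention, a rotation of 0 degrees and a tilt of 0 degrees yields the reference direction itself, and the result always has unit length.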

It was explained earlier that, in the robot 10, once the line-of-sight direction determination unit 66 determines the line-of-sight direction of user A's HMD 100 as a three-dimensional coordinate vector (x, y, z), it may set the line-of-sight direction of the camera 14 of the robot 10 to the same (x, y, z). Likewise, in the processing device 200, once the line-of-sight direction determination unit 208 determines the line-of-sight direction of user B's HMD 100a as a three-dimensional coordinate vector (x, y, z), it may set the line-of-sight direction of the virtual camera 8 to the same (x, y, z). If the line-of-sight direction determination unit 66 obtains the line-of-sight direction of the camera 14 by correcting the line-of-sight direction of the HMD 100 with a predetermined conversion formula, the line-of-sight direction determination unit 208 may likewise obtain the line-of-sight direction of the virtual camera 8 by correcting the line-of-sight direction of the HMD 100a with the same conversion formula. By handling the respective three-dimensional coordinate systems in this way, at any moment when user A's line-of-sight direction and user B's line-of-sight direction coincide, user B sees the same image as user A.

Having generated the frame image 7 of the virtual camera 8, the image determination unit 210 applies optical distortion correction for the optical lens and supplies the image data to the viewing data providing unit 214. Although FIG. 16 shows a single virtual camera 8, in practice two virtual cameras 8 are arranged, one for the left eye and one for the right eye, and their respective image data are generated from the left-eye image data and the right-eye image data provided by the robot 10.

FIG. 17 is a diagram for explaining the captured image data recorded in the image recording unit 220. For convenience of explanation, a plurality of image data for one eye is shown, and the image data is arranged on a two-dimensional plane after an affine transformation appropriate to user B's line-of-sight direction has been applied. User B's line-of-sight direction is described later.

The image determination unit 210 has a function of joining the overlapping portions of the captured images to generate an omnidirectional panoramic image. A known technique may be used for joining captured images, such as that described in Japanese Patent No. 5865388 by the same applicant. The following describes a method for selecting which of the plurality of captured image data recorded in the image recording unit 220 to use.

FIG. 17 shows five pieces of image data I1 to I5. The (x, y, z) included in each piece of image data represents the line-of-sight direction of the camera 14 at the time of capture (vector information), and "t" represents the shooting time information. Image data I1 has vector information (x1, y1, z1) and shooting time information t1 as additional information. Similarly, image data I2 has vector information (x2, y2, z2) and shooting time information t2, image data I3 has vector information (x3, y3, z3) and shooting time information t3, image data I4 has vector information (x4, y4, z4) and shooting time information t4, and image data I5 has vector information (x5, y5, z5) and shooting time information t5 as additional information.

The shooting time information t1 to t5, which is additional information, expresses the elapsed time from the shooting start point (time 0) and satisfies t1 < t2 < t3 < t4 < t5. Among image data I1 to I5, therefore, image data I1 was captured first and image data I5 was captured last. The image determination unit 210 selects the image data used to generate the composite image based on the shooting time information and on the line-of-sight direction of the virtual camera 8 determined by the line-of-sight direction determination unit 208.

Specifically, from the line-of-sight direction of the virtual camera 8 determined by the line-of-sight direction determination unit 208, that is, the direction in which user B wearing the HMD 100a is facing, the image determination unit 210 determines the shooting range (the angle of view of the virtual camera 8) to be cut out of the omnidirectional panoramic image, and extracts, based on the vector information added to the image data, the image data that contains any image falling within the shooting range.

FIG. 18 is a diagram showing the relationship between the frame image 7 to be generated by the image determination unit 210 and the image data. In FIGS. 17 and 18, each piece of image data I1 to I5 is mapped, based on its vector information, onto a two-dimensional plane orthogonal to the line-of-sight direction (X, Y, Z) of the virtual camera 8, and the position of each piece of image data I1 to I5 is defined by four vertex coordinates in that plane. From the line-of-sight direction (X, Y, Z) of the virtual camera 8, the image determination unit 210 determines the position of the angle of view of the virtual camera 8 in the omnidirectional panoramic image (the shooting range), and determines the four vertex coordinates of the frame image 7 in the two-dimensional plane orthogonal to the line-of-sight direction. The image determination unit 210 then extracts, from the image data recorded in the image recording unit 220, the image data included in the frame image 7. As illustrated, image data I1 to I5 each contain part of the image included in the frame image 7, and are therefore extracted as image data falling within the shooting range of the virtual camera 8.
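
The extraction step can be approximated by comparing line-of-sight vectors directly. The sketch below is a simplification rather than the projection onto a two-dimensional plane described above: it assumes that each captured image and the virtual camera share the same circular angle of view `fov_deg`, so two footprints overlap when the angle between their optical axes is smaller than `fov_deg`. The function names and the 90-degree default are hypothetical.

```python
import math


def angle_between(a, b) -> float:
    """Angle in degrees between two non-zero 3D vectors."""
    dot = sum(p * q for p, q in zip(a, b))
    norm_a = math.sqrt(sum(p * p for p in a))
    norm_b = math.sqrt(sum(q * q for q in b))
    cos_angle = max(-1.0, min(1.0, dot / (norm_a * norm_b)))  # guard against rounding
    return math.degrees(math.acos(cos_angle))


def extract_in_range(frames, camera_gaze, fov_deg: float = 90.0):
    """Return the names of frames whose footprint overlaps the virtual camera's range.

    frames: iterable of (name, gaze_vector) pairs, where gaze_vector is the
    vector information recorded with the frame.
    """
    return [name for name, gaze in frames
            if angle_between(gaze, camera_gaze) < fov_deg]
```

A rectangular angle of view, as implied by the four vertex coordinates in the text, would need a per-axis test or a polygon intersection on the projection plane instead of this single angular threshold.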

For regions where multiple pieces of image data overlap, the image determination unit 210 generates the composite image by giving priority to the image data with the later shooting time information. In the example shown in FIG. 18, the frame image 7 is composited by writing image data into it in order of shooting time, earliest first, that is, starting from image data I1, and successively overwriting with newer image data.

In this way, for regions where multiple pieces of image data overlap, the image determination unit 210 generates the composite image using the image data whose shooting time information is closer to the current time. For example, if image data I4 and image data I5 overlap within the shooting range, image data I5, which has the later shooting time, is used for the overlapping portion. This makes it possible to generate the composite image from image data close to the current time, and to provide user B with a composite image close to the current time.
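
The overwrite order described above can be sketched as follows. For brevity the example records only which image supplies each pixel of the frame, and it assumes each image has already been mapped to an axis-aligned rectangle in frame pixel coordinates; the dictionary keys and the function name `composite` are hypothetical.

```python
def composite(frame_w: int, frame_h: int, images):
    """Build a frame by writing images earliest-first so that later ones overwrite.

    images: list of dicts with keys
      'name' - identifier such as "I4"
      't'    - shooting time information
      'rect' - (x0, y0, x1, y1) in frame pixel coordinates, end-exclusive
    Returns a frame_h x frame_w grid holding the name of the image that
    supplied each pixel, or None where no image covers the pixel.
    """
    canvas = [[None] * frame_w for _ in range(frame_h)]
    for img in sorted(images, key=lambda i: i["t"]):  # earliest shooting time first
        x0, y0, x1, y1 = img["rect"]
        for y in range(max(0, y0), min(frame_h, y1)):
            for x in range(max(0, x0), min(frame_w, x1)):
                canvas[y][x] = img["name"]  # a newer image overwrites an older one
    return canvas
```

Where I4 and I5 overlap, the pixel ends up attributed to I5 regardless of the order in which the two images are passed in, because the list is sorted by shooting time before writing.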

In this image playback application, depending on the direction user B faces, there may be too little image data to generate the frame image 7. In particular, immediately after the robot 10 starts shooting, there is little image data to begin with, so the image determination unit 210 may be unable to generate a frame image 7 for user B's line-of-sight direction. Although it would not happen in practice, if user A did not move the HMD 100 at all during the shooting period, all the image data recorded in the image recording unit 220 would have the same vector information; if user B then faced the direction directly opposite to user A, for example, no image data would fall within the shooting range of the virtual camera 8 for that line-of-sight direction.

In such a case, the image determination unit 210 may generate image data in which a message stating that an image in user B's line-of-sight direction cannot be generated is superimposed on the received image data of user A, and provide it from the viewing data providing unit 214 to the HMD 100a. For example, if a predetermined proportion (for example, 30%) or more of the image in user B's line-of-sight direction cannot be composited, the image determination unit 210 may refrain from generating a composite image and instead supply the image data seen by user A, together with the above message, to the viewing data providing unit 214.

In addition, because the image determination unit 210 composites the frame image 7 from a plurality of image data, the generated frame image 7 is a patchwork, and its visibility may suffer. Therefore, if, for example, a predetermined proportion (for example, 50%) of the image within the shooting range cannot be formed from a single piece of image data, the image determination unit 210 may, as described above, generate image data in which the message stating that an image in user B's line-of-sight direction cannot be generated is superimposed on user A's image data.
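
The two fallback conditions described above can be expressed as a single check on a composited frame. This is a sketch under assumptions: the frame is represented as a grid holding, for each pixel, the identifier of the image that supplied it (or `None` where nothing was available), and the function name `should_fall_back` is hypothetical. The 30% and 50% defaults are the example values given in the text.

```python
from collections import Counter


def should_fall_back(canvas, max_missing: float = 0.30,
                     min_single_share: float = 0.50) -> bool:
    """Decide whether to show user A's image with a message instead of a composite.

    Returns True when either
      - at least max_missing of the frame could not be composited, or
      - no single piece of image data supplies at least min_single_share of the frame.
    """
    cells = [cell for row in canvas for cell in row]
    if not cells:
        return True
    missing_ratio = sum(cell is None for cell in cells) / len(cells)
    if missing_ratio >= max_missing:
        return True
    counts = Counter(cell for cell in cells if cell is not None)
    largest_share = max(counts.values(), default=0) / len(cells)
    return largest_share < min_single_share
```

When the function returns `True`, the caller would supply user A's image data with the superimposed message rather than the composite.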

In the example above, the image determination unit 210 was described as generating the composite image by giving priority to image data with later shooting time information. However, if a predetermined proportion or more of the frame image 7 can be formed by using image data with earlier shooting time information, that earlier image data may be used instead.

Furthermore, because the environment the robot 10 is capturing changes as time passes, providing user B with a composite image that uses past image data may be undesirable. The image determination unit 210 may therefore perform the image extraction processing so that image data captured a predetermined time or more ago is not included in the composite image.
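
Excluding old image data amounts to a filter on shooting time information applied before compositing. In the sketch below, `now` is the current elapsed time on the same clock as the shooting time information, and `max_age` is the predetermined time; both names, the function name `drop_stale`, and the `(name, shot_time)` pair layout are assumptions.

```python
def drop_stale(frames, now: float, max_age: float):
    """Keep only frames captured less than max_age before now.

    frames: iterable of (name, shot_time) pairs.
    """
    return [(name, t) for name, t in frames if now - t < max_age]
```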

The above describes an example in which user B uses user A's viewing data in real time. An applied technique is described below. In the applied technique, the processing device 200 records the viewing data not for real-time playback of user A's viewing data but for secondary use.

For secondary use of the viewing data, in the robot 10 the image processing unit 80 adds shooting time information and vector information to each piece of frame image data, and the audio processing unit 82 adds to the audio data recording time information indicating the elapsed time from the recording start point. Because shooting (video recording) by the camera 14 and audio recording by the microphone 16 start at the same time, the shooting start point and the recording start point indicate the same moment. The shooting time information and the recording time information may be time information generated by a clock generation unit in the robot 10. The additional information may be attached to the image data and audio data in any format, as long as the processing device 200 can refer to it when generating viewing data for playback.

In this applied technique, after user A has finished using the robot 10, another user B (who may be user A) wears the HMD 100a, and image data and audio data generated from user A's viewing data recorded in the processing device 200 are provided to the HMD 100a. As described in the embodiment, the processing device 200 constructs an omnidirectional panoramic image from user A's viewing data and reconstructs from it an image based on user B's line-of-sight direction, so that the image can be provided to user B's HMD 100a. The robot 10 is not used in this usage environment.

Referring to FIG. 15, the image recording unit 220 holds the image data transmitted from the robot 10, and the audio recording unit 222 holds the audio data transmitted from the robot 10. In this applied technique, the image recording unit 220 and the audio recording unit 222 are in a state in which all of the viewing data transmitted from the robot 10 to user A has already been recorded. The image data has shooting time information and the vector information from the time of capture added to it, and the audio data has recording time information added to it.

User B transmits an instruction to play back user A's viewing data to the processing device 200 through the HMD 100a. On receiving the playback instruction, the processing device 200 starts playback processing of the viewing data. If the recording unit 218 holds one hour of viewing data, user B may be allowed to start playback from any point within that hour. In this case, the reception unit 202 accepts the time designation from user B and supplies it to the image determination unit 210 and the audio determination unit 212.

The audio determination unit 212 reads from the audio recording unit 222 the audio data whose recording time information corresponds to the playback time information, which indicates the elapsed time from the playback start point, and provides it to the viewing data providing unit 214. The playback start point means the start point of playback of the viewing data, and therefore indicates the same moment as the shooting start point and the recording start point. That is, the audio determination unit 212 reads from the audio recording unit 222 the audio data whose recording time information matches the playback time information, and provides it to the viewing data providing unit 214.
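
Looking up audio by playback time can be sketched with a binary search over recording times. The example assumes audio is stored as chunks sorted by recording time information and that "corresponds to" means the most recent chunk starting at or before the playback time; the chunk layout and the name `audio_for` are hypothetical.

```python
import bisect


def audio_for(playback_time: float, chunks):
    """Return the audio chunk whose recording time matches the playback time.

    chunks: list of (record_time, data) pairs sorted by record_time.
    Picks the last chunk with record_time <= playback_time, or None if the
    playback time precedes every chunk.
    """
    times = [t for t, _ in chunks]
    index = bisect.bisect_right(times, playback_time) - 1
    return chunks[index][1] if index >= 0 else None
```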

During playback processing by the processing device 200, the reception unit 202 receives sensor information transmitted from the HMD 100a worn by user B, and the sensor information acquisition unit 204 acquires the received sensor information. This sensor information is posture information obtained by the posture sensor 124 detecting the posture of the HMD 100a. The motion detection unit 206 detects the posture of the HMD 100a worn on user B's head. The line-of-sight direction determination unit 208 determines the line-of-sight direction of the virtual camera according to the posture of the HMD 100a detected by the motion detection unit 206. Using the plurality of image data recorded in the image recording unit 220, the image determination unit 210 composites the image that would be captured by the virtual camera pointed in the determined line-of-sight direction. The viewing data providing unit 214 provides viewing data, which combines the image data composited by the image determination unit 210 and the audio data read by the audio determination unit 212, from the transmission unit 216 to the HMD 100a.

The image determination unit 210 stitches together the images that user A saw at or before the playback time of the viewing data being played back by user B, and dynamically generates the frame image 7 that would be captured from the center point 9 at which user B is located.

The images that user A saw at or before user B's playback time are explained as follows. If the image recording unit 220 holds one hour of image data from the shooting start point, the playback time measured from user B's playback start point is some moment within that hour. For example, if the playback time is 15 minutes from the start of playback, the images with shooting time information of 15 minutes or less, that is, the images captured in the first 15 minutes after the shooting start point, are the images that user A saw at or before the playback time. In other words, when the 15-minute point is being played back, the image determination unit 210 generates the frame image 7 using image data whose shooting time information is within 15 minutes of the start of shooting, and when the 45-minute point is being played back, it generates the frame image 7 using image data whose shooting time information is within 45 minutes of the start of shooting.

Referring to FIG. 18, the image determination unit 210 extracts image data whose shooting time information is at or before the playback time information, and does not extract image data whose shooting time information is later than the playback time information. For example, if the time information for reproducing the frame image 7 is after time t3 and before time t4, the image determination unit 210 extracts the image data I1 to I3 and does not extract the image data I4 and I5. By generating the composite image from image data whose shooting time information is at or before the playback time information, the image determination unit 210 avoids showing user B any image shot after the playback time.
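The extraction rule above can be illustrated with a minimal Python sketch. The `RecordedImage` type, the field `shot_at`, the function `extract_viewable`, and the concrete times (t1 to t5 at 10-second intervals) are all hypothetical and chosen only for illustration; the publication does not specify a data layout.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RecordedImage:
    """Hypothetical record: one image plus its shooting time information."""
    name: str
    shot_at: float  # seconds elapsed since the shooting start point


def extract_viewable(images, playback_time):
    """Keep only images shot at or before the playback time."""
    return [img for img in images if img.shot_at <= playback_time]


# Assumed example: I1..I5 shot at t1..t5 = 10, 20, 30, 40, 50 seconds.
recorded = [RecordedImage(f"I{i}", float(i * 10)) for i in range(1, 6)]

# A playback time between t3 (30 s) and t4 (40 s) yields I1 to I3 only.
names = [img.name for img in extract_viewable(recorded, playback_time=35.0)]
print(names)  # ['I1', 'I2', 'I3']
```

Filtering before compositing keeps the "no future images" guarantee independent of whatever stitching method is applied afterwards.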

Because the viewing data providing unit 214 transmits to the HMD 100a the audio data whose recording time information corresponds to the playback time, user B hears audio synchronized with the playback time. User B is therefore generally aware of the situation up to the playback time, and can understand what is being displayed as long as the provided image data is synthesized from image data at or before the playback time. If, however, the provided image data were synthesized from image data later than the playback time, user B would be shown images of a situation he or she is not yet aware of, which would likely cause confusion. For this reason, the image determination unit 210 avoids showing user B any image shot after the playback time.

For portions where a plurality of image data overlap, the image determination unit 210 generates the composite image using the image data whose shooting time information is closer to the playback time information. For example, if the image data I1 and the image data I2 overlap within the shooting range, the image data I2, which was shot later, is used for the overlapping portion. The composite image is thus generated from image data close to the playback time information, so that user B is provided with an image synthesized from the image data nearest to the playback time.
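As a rough illustration of this overlap rule, the sketch below models each image as a dictionary of canvas cells and paints the usable images in ascending order of shooting time, so that a later image overwrites an earlier one wherever they overlap. The cell-dictionary representation and the function name `composite` are assumptions made for the example; actual image stitching involves alignment and blending that are not modeled here.

```python
def composite(images, playback_time):
    """Build a canvas from (shot_at, cells) pairs.

    Images shot after playback_time are ignored. The rest are painted
    oldest first, so the most recent image wins in overlapping cells.
    """
    canvas = {}
    usable = [im for im in images if im[0] <= playback_time]
    for _shot_at, cells in sorted(usable, key=lambda im: im[0]):
        canvas.update(cells)
    return canvas


# Hypothetical data: I1 and I2 overlap at cell (0, 1); I4 is shot later.
i1 = (10.0, {(0, 0): "I1", (0, 1): "I1"})
i2 = (20.0, {(0, 1): "I2", (0, 2): "I2"})
i4 = (40.0, {(0, 2): "I4"})

frame = composite([i4, i1, i2], playback_time=35.0)
print(frame[(0, 1)])  # I2
```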

The present invention has been described above based on an embodiment. Those skilled in the art will understand that the embodiment is illustrative, that various modifications are possible in the combinations of its components and processing steps, and that such modifications also fall within the scope of the present invention.

In the embodiment, the image determination unit 210 performs image stitching and generates the frame image 7 for the shooting range determined from user B's line-of-sight direction. In a modification, the image determination unit 210 does not perform image stitching; instead, it determines the image data to be provided to user B based on the line-of-sight direction of the virtual camera 8 and the vector information added to the image data recorded in the image recording unit 220.

In this modification, the image determination unit 210 determines, as the image data to be provided to user B, the image data to which vector information corresponding to the line-of-sight direction of the virtual camera 8 is added. Vector information corresponding to the line-of-sight direction of the virtual camera 8 includes vector information that matches the line-of-sight direction of the virtual camera 8 and vector information that can be regarded as substantially matching it. Specifically, the image determination unit 210 may determine that the line-of-sight direction of the virtual camera 8 and the vector information substantially match when the angle between them is within a predetermined angle (for example, 10 degrees).
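One way to express the "substantially matching" test is to compare the angle between two direction vectors against a threshold. The following is a minimal sketch assuming both directions are given as non-zero 3D vectors; the function names and the tuple representation are hypothetical, and only the 10-degree example value comes from the text.

```python
import math


def angle_between_deg(u, v):
    """Angle in degrees between two non-zero 3D vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    # Clamp to [-1, 1] to guard against floating-point rounding.
    cos_theta = max(-1.0, min(1.0, dot / norm))
    return math.degrees(math.acos(cos_theta))


def substantially_matches(view_dir, image_vec, max_angle_deg=10.0):
    """True when the image's vector lies within the threshold angle."""
    return angle_between_deg(view_dir, image_vec) <= max_angle_deg


view = (1.0, 0.0, 0.0)
near = (math.cos(math.radians(5)), math.sin(math.radians(5)), 0.0)
far = (math.cos(math.radians(20)), math.sin(math.radians(20)), 0.0)
print(substantially_matches(view, near), substantially_matches(view, far))
```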

When user A's viewing data is reproduced synchronously, the image determination unit 210 determines, as the image data to be provided to user B, the image data having the latest shooting time information among the image data to which vector information corresponding to the line-of-sight direction of the virtual camera 8 is added. This makes it possible to provide user B with an image close to the current time.

When no image data with vector information corresponding to the line-of-sight direction of the virtual camera 8 is recorded in the image recording unit 220, the image determination unit 210 may determine, as the image data to be provided to user B, image data whose vector information can be regarded as substantially matching in its (x, y) components, ignoring the component in the height direction (z-axis direction). Vector information that can be regarded as matching is vector information whose (x, y) components are within a predetermined angle (for example, 7 degrees). By comparing only the (x, y) components, the image determination unit 210 can more easily find image data corresponding to the line-of-sight direction of the virtual camera 8, which helps avoid a situation in which no image data can be provided to user B.
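The fallback described above can be sketched as a two-stage search: first a full 3D angle test, then, only if nothing matches, a test on the horizontal (x, y) projection. The function names, the return shape, and the treatment of a direction pointing straight up or down (its horizontal bearing is taken as 0 by `atan2`) are assumptions of this sketch; the 10-degree and 7-degree values are the examples given in the text.

```python
import math


def angle_3d_deg(u, v):
    """Angle in degrees between two non-zero 3D vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return math.degrees(math.acos(max(-1.0, min(1.0, dot / norm))))


def horizontal_angle_deg(u, v):
    """Angle between the (x, y) projections of u and v, ignoring z."""
    diff = abs(math.degrees(math.atan2(u[1], u[0]) - math.atan2(v[1], v[0])))
    diff %= 360.0
    return min(diff, 360.0 - diff)


def pick_with_fallback(view_dir, vectors, tol_3d=10.0, tol_xy=7.0):
    """Return (matching vectors, which test produced them)."""
    full = [v for v in vectors if angle_3d_deg(view_dir, v) <= tol_3d]
    if full:
        return full, "3d"
    relaxed = [v for v in vectors if horizontal_angle_deg(view_dir, v) <= tol_xy]
    return relaxed, "xy"


view = (1.0, 0.0, 0.0)
tilted_up = (1.0, 0.05, 1.0)  # about 45 degrees off in 3D, about 3 degrees in (x, y)
print(pick_with_fallback(view, [tilted_up]))
```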

When user A's viewing data is reused secondarily, the image determination unit 210 determines the image data to be provided to user B from among the images that user A saw at or before user B's playback time of the viewing data. That is, from the image data whose shooting time information is at or before the playback time information, the image determination unit 210 determines the image data to which vector information corresponding to the line-of-sight direction of the virtual camera 8 is added. If a plurality of image data qualify, the image determination unit 210 preferably selects the image data whose shooting time information is closest to the playback time information.
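Putting the time condition and the direction condition together, a selection routine for secondary use might look like the sketch below. The record layout (dictionaries with `name`, `shot_at`, and `vec` keys) and the function names are hypothetical. Because every candidate is shot at or before the playback time, the candidate closest to the playback time is simply the one with the largest shooting time.

```python
import math


def angle_deg(u, v):
    """Angle in degrees between two non-zero 3D vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return math.degrees(math.acos(max(-1.0, min(1.0, dot / norm))))


def select_for_replay(view_dir, records, playback_time, max_angle_deg=10.0):
    """Pick the matching record shot closest to, but not after, playback_time."""
    candidates = [
        r for r in records
        if r["shot_at"] <= playback_time
        and angle_deg(view_dir, r["vec"]) <= max_angle_deg
    ]
    if not candidates:
        return None
    return max(candidates, key=lambda r: r["shot_at"])


# Hypothetical recorded data.
records = [
    {"name": "I1", "shot_at": 10.0, "vec": (1.0, 0.0, 0.0)},
    {"name": "I2", "shot_at": 20.0, "vec": (1.0, 0.05, 0.0)},
    {"name": "I3", "shot_at": 30.0, "vec": (0.0, 1.0, 0.0)},
    {"name": "I4", "shot_at": 40.0, "vec": (1.0, 0.0, 0.0)},
]

chosen = select_for_replay((1.0, 0.0, 0.0), records, playback_time=35.0)
print(chosen["name"])  # I2
```

Passing `float("inf")` as `playback_time` reduces this to picking the latest matching image, which corresponds to the synchronous reproduction case.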

Consider a case in which image data captured while user A turned his or her head sideways is recorded in the image recording unit 220. If user B, slightly later than user A, turns his or her head in the direction opposite to user A, the images that user A saw may be played back in reverse on the HMD 100a. In that case the time series of the image data is reversed, which may feel unnatural to user B. Therefore, when a continuous change in user B's line-of-sight direction would cause user A's image data to be played back in reverse, the image determination unit 210 may hold the current image data fixed rather than changing the image data it provides.
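One simple way to realize this hold behavior is to remember the shooting time of the image currently provided and to ignore any candidate that is older. This is only a sketch under a simplifying assumption: any candidate with an earlier shooting time is treated as reverse playback, whereas the text only describes the case of a continuous change in line-of-sight direction. The class name and the `(shot_at, name)` tuple are hypothetical.

```python
class MonotonicImageSelector:
    """Holds the current image whenever the next candidate would go back in time."""

    def __init__(self):
        self.current = None  # (shot_at, name) of the image being provided

    def update(self, candidate):
        """Offer a candidate (shot_at, name), or None if nothing matched.

        Returns the image that should actually be provided.
        """
        if candidate is None:
            return self.current
        if self.current is None or candidate[0] >= self.current[0]:
            self.current = candidate
        return self.current


selector = MonotonicImageSelector()
# User B sweeps against user A's head turn: candidates arrive newest first.
shown = [selector.update(c)[1] for c in [(30.0, "I3"), (20.0, "I2"), (10.0, "I1"), (40.0, "I4")]]
print(shown)  # ['I3', 'I3', 'I3', 'I4']
```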

To enhance the usefulness of the information processing system 1, the robot 10 may further include an input sensor that accepts input from the outside, such as a tactile sensor or a vibration sensor. In the functional blocks shown in FIG. 10, the input sensor is provided in the output system 24, and the sensor information of the input sensor is transmitted from the transmission unit 90 to the HMD 100. The HMD 100 may include output means for outputting the sensor information, and may convert the sensor information into vibration or the like and convey it to user A.

In the information processing system 1, the robot 10 has been described as moving the housing 20 in conjunction with the movement of user A's neck, but the system may further include means for conveying user A's state, such as facial expression. For example, the HMD 100 includes a sensor that detects the movement of the eyes and eyebrows of user A wearing it, means for analyzing tone of voice, and the like. The movement of the eyes and eyebrows expresses the user's facial expression, and the tone of voice expresses the user's psychological state. Information on the movement of the eyes and eyebrows and/or the tone of voice is transmitted from the HMD 100 to the robot 10, and the robot 10 may drive a facial expression unit provided on the housing 20 to reproduce user A's facial expression, psychological state, and the like. The facial expression unit may be a drive unit (for example, one imitating the shape of eyebrows) formed above the camera 14 on the front surface of the housing 20, and the drive unit is driven based on the information transmitted from the HMD 100. The facial expression unit may also be a display that expresses user A's facial expression and psychological state in color, conveying them by changing the display color.

Description of symbols: 1 ... information processing system, 10 ... robot, 12 ... actuator device, 14a ... right camera, 14b ... left camera, 16a ... right microphone, 16b ... left microphone, 20 ... housing, 22 ... input system, 24 ... output system, 30 ... pedestal, 32 ... first arc-shaped arm, 32a ... first elongated through hole, 34 ... second arc-shaped arm, 34a ... second elongated through hole, 36 ... housing, 38 ... cover, 40 ... leg portion, 42 ... insertion member, 42a ... first restricting portion, 42b ... second restricting portion, 42c ... shaft portion, 50 ... drive mechanism, 52 ... first motor, 54 ... second motor, 56 ... third motor, 60 ... reception unit, 62 ... sensor information acquisition unit, 64 ... motion detection unit, 66 ... line-of-sight direction determination unit, 68 ... actuator control unit, 70 ... audio data acquisition unit, 72 ... audio processing unit, 80 ... image processing unit, 82 ... audio processing unit, 82a ... phase difference amplification device, 84a ... first amplifier, 84b ... second amplifier, 86a ... first adder, 86b ... second adder, 88a ... third amplifier, 88b ... fourth amplifier, 90 ... transmission unit, 92 ... image recording device, 100 ... HMD, 102 ... display panel, 104 ... earphone, 106 ... microphone, 108 ... mounting band, 110 ... output mechanism unit, 112 ... mounting mechanism unit, 114 ... housing, 120 ... control unit, 122 ... storage unit, 124 ... posture sensor, 126 ... communication control unit, 200 ... processing device, 202 ... reception unit, 204 ... sensor information acquisition unit, 206 ... motion detection unit, 208 ... line-of-sight direction determination unit, 210 ... image determination unit, 212 ... audio determination unit, 214 ... viewing data providing unit, 216 ... transmission unit, 218 ... recording unit, 220 ... image recording unit, 222 ... audio recording unit.

Claims

A processing apparatus comprising:
a recording unit that records image data captured by a camera whose line-of-sight direction moves in conjunction with the movement of a face, the image data having added thereto vector information indicating the line-of-sight direction;
an acquisition unit that acquires posture information obtained by detecting the posture of a head mounted display worn on a user's head;
a line-of-sight direction determination unit that determines the line-of-sight direction of a virtual camera from the posture information;
an image determination unit that determines image data to be provided to the user based on the line-of-sight direction of the virtual camera and the vector information added to the image data recorded in the recording unit; and
a providing unit that provides the image data determined by the image determination unit to the head mounted display.

The processing apparatus according to claim 1,
wherein the image determination unit determines, as the image data to be provided, image data to which vector information corresponding to the line-of-sight direction of the virtual camera is added.

The processing apparatus according to claim 1,
wherein the image determination unit generates image data by synthesizing, from a plurality of image data recorded in the recording unit, an image captured by the virtual camera.

The processing apparatus according to claim 3,
wherein shooting time information indicating the elapsed time from the shooting start point is added to the image data, and
the image determination unit generates a composite image using, for overlapping portions of a plurality of image data, the image data having the later shooting time information.

An image determination method for reading image data from a recording unit and providing it to a user, the recording unit recording image data captured by a camera whose line-of-sight direction moves in conjunction with the movement of a face, the image data having added thereto vector information indicating the line-of-sight direction, the method comprising:
acquiring posture information obtained by detecting the posture of a head mounted display worn on the user's head;
determining the line-of-sight direction of a virtual camera from the posture information;
determining image data to be provided to the user based on the line-of-sight direction of the virtual camera and the vector information added to the image data recorded in the recording unit; and
providing the image data to the head mounted display.

A program for causing a computer to realize a function of reading image data from a recording unit and determining an image to be provided to a user, the recording unit recording image data captured by a camera whose line-of-sight direction moves in conjunction with the movement of a face, the image data having added thereto vector information indicating the line-of-sight direction, the program causing the computer to realize:
a function of acquiring posture information obtained by detecting the posture of a head mounted display worn on the user's head;
a function of determining the line-of-sight direction of a virtual camera from the posture information;
a function of determining image data to be provided to the user based on the line-of-sight direction of the virtual camera and the vector information added to the image data recorded in the recording unit; and
a function of providing the image data to the head mounted display.