JP2015220534A

JP2015220534A - Auxiliary apparatus, auxiliary system and auxiliary method for communication, and program

Info

Publication number: JP2015220534A
Application number: JP2014101551A
Authority: JP
Inventors: 真弓松原; Mayumi Matsubara; 正豪原島; Masatake Harashima; 余平　哲也; Tetsuya Yohira; 哲也余平; 雄也遠藤; Yuya Endo
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2014-05-15
Filing date: 2014-05-15
Publication date: 2015-12-07
Anticipated expiration: 2034-05-15
Also published as: JP6543891B2

Abstract

PROBLEM TO BE SOLVED: To provide a technique which can adjust the degree of presenting the existence of the self to an opposite party.SOLUTION: There is provided a communication auxiliary apparatus for assisting communication between a video conference terminal 10A disposed in a first base 11A and a video conference terminal 10B disposed in a second base 11B through a communication network N. The first base includes: a wireless communication unit 53 for receiving a second base state information which represents an estimated state of the second base; a sense quantity range calculation unit 55a for calculating the range of a sense amount which a first participant 13 located in the first base is perceptible; an intention amount calculation unit 55b for calculating the intention amount of a second participant located in the second base, on the basis of the second base state information and the range of the sense amount of the first participant; and a stimulus output unit 57 for outputting a stimulus according to the intention amount so that can be perceived by the first participant.

Description

本発明は、通信ネットワークを介して、遠隔地間でのコミュニケーションを補助するのに好適なコミュニケーション補助装置、コミュニケーション補助システム、コミュニケーション補助方法及びプログラムに関する。 The present invention relates to a communication assistance device, a communication assistance system, a communication assistance method, and a program suitable for assisting communication between remote locations via a communication network.

近年、通信ネットワーク技術が向上し、遠隔地とのコミュニケーションが手軽に行えるようになっている。例えば、遠隔地間で行われるテレビ会議では、音声と映像を送受信することにより、拠点を移動せずに会議を実施することができるため、空間的制約も少なく需要も高まっており、既に多くの場面で活用されている。
また、携帯電話のメールやチャットなど、文字を送受信することで、遠隔地間で手軽にコミュニケーションを行うことが可能となっている。
しかし、音声や映像ならびに文字を用いた遠隔コミュニケーションにおいては、参加者が実際に１つの空間を共有していない。
例えば、テレビ会議では、一つの拠点内でテレビ会議が白熱すると、異なる拠点にいる参加者の存在を忘れてしまう場合があり、一つの拠点内で会議が盛り上がってしまうといった問題が発生し易い。
例えば、テレビ会議では、一つの拠点で異なる拠点の様子を映像としてモニタに表示するが、映像には限りがあるので、充分に目視確認が可能な映像情報（非言語情報）の伝達ができていないため、このような問題が起こり易い。 In recent years, communication network technology has improved and communication with remote locations can be easily performed. For example, in teleconferences held between remote locations, by transmitting and receiving audio and video, the conference can be carried out without moving the base, so there is little space restriction and demand is increasing. It is used in the scene.
In addition, it is possible to easily communicate between remote locations by sending and receiving characters such as e-mail and chat on mobile phones.
However, in remote communication using voice, video, and characters, participants do not actually share one space.
For example, in a video conference, when a video conference is heated in one base, there is a case where the presence of a participant in a different base may be forgotten, and a problem that the conference is excited in one base is likely to occur.
For example, in a video conference, the situation of different sites at one site is displayed as a video on the monitor, but since the video is limited, video information (non-linguistic information) that can be sufficiently visually confirmed can be transmitted. Such a problem is likely to occur.

特許文献１には、映像に非言語情報を付加して会議を行う一例として、参加者が発言者に対して同調の度合いを表す情報に基づいて、次に発言しようとしている参加者の候補が選択されて各参加者に提示させる。
また、非特許文献１では、移動機能とカメラによる映像撮影機能を有するロボットによって、遠隔地にいる参加者がロボットを操縦し、自分（参加者）がどこを見ているのかを異なる拠点の参加者に提示させている。さらに見られているという意識を異なる拠点の参加者に持たせることで存在感を表現させている。 In Patent Document 1, as an example of performing a conference by adding non-linguistic information to a video, a participant candidate who is going to speak next is based on information indicating a degree of synchronization with the speaker. Selected and presented to each participant.
Further, in Non-Patent Document 1, a robot having a moving function and a camera video shooting function allows a remote participant to control the robot and determine where he (participant) is looking at. To present. Furthermore, the presence is expressed by giving the participants of different bases the awareness of being seen.

特許文献１は、参加者が発言しようと試みた場合に対してのみ有効である。一方、参加者が発言するまでではないが、自分の存在を異なる拠点の参加者に気にして欲しい場合には無効である。つまり、自分が異なる拠点の参加者に提示したい自分の存在を表す度合いを調整できないという問題があった。
さらに、テレビ会議で資料のみをモニタに表示させた場合、参加者がモニタに表示されていないので、異なる拠点の参加者に自分の存在を気にして欲しくても気にしてもらえないといった問題があった。
また、非特許文献１は、さりげなく存在感を提示することは難しい。テレビ会議の最中にロボットが動き回った場合、参加者の感覚に与える刺激が強く、必要以上にロボットの存在感が提示される。したがって、異なる拠点の参加者に自分が提示したい存在の度合いを調整できないといった問題があった。
以上のように、従来の遠隔コミュニケーションでは、相手に自分の存在を提示する度合いを表す量を調整できず、遠隔地の参加者の存在感が低下するという問題があった。逆に、ロボットを用いた場合、遠隔地の参加者が存在感を強く提示されるという問題があった。
本発明は、上記に鑑みてなされたもので、その目的としては、相手に自分の存在を提示する度合いの調整を行うことが可能な技術を提供することにある。 Patent Document 1 is effective only when a participant attempts to speak. On the other hand, it is not effective when the participant wants to be aware of his / her presence by a participant at a different base, not until the participant speaks. In other words, there is a problem that the degree of expressing one's presence that he wants to present to participants at different bases cannot be adjusted.
In addition, when only materials are displayed on a monitor in a video conference, the participants are not displayed on the monitor, so there is a problem that participants in different locations may not care if they want to be aware of their existence. there were.
Further, it is difficult for Non-Patent Document 1 to present a presence casually. When the robot moves around during the video conference, the stimulus given to the participant's sense is strong, and the presence of the robot is presented more than necessary. Therefore, there is a problem that it is impossible to adjust the degree of existence that one wants to present to participants at different bases.
As described above, in the conventional remote communication, there is a problem that the amount representing the degree of presenting the presence of the user to the other party cannot be adjusted, and the presence of the participants in the remote place is lowered. On the other hand, when using a robot, there was a problem that participants at remote locations were strongly presented.
The present invention has been made in view of the above, and an object thereof is to provide a technique capable of adjusting the degree of presenting his / her presence to the other party.

請求項１記載の発明は、上記課題を解決するため、第１拠点に配置された第１会議端末および第２拠点に配置された第２会議端末の間でネットワークを経由したコミュニケーションを補助するコミュニケーション補助装置であって、前記第１拠点には、推定された前記第２拠点の状況を表す第２拠点状況情報を受信する受信手段と、前記第１拠点にいる第１参加者が知覚可能な感覚量の範囲を算出する感覚量範囲算出手段と、前記第２拠点状況情報と前記第１参加者の感覚量の範囲に基づいて、前記第２拠点にいる第２参加者の意思量を算出する意思量算出手段と、前記意思量に応じた刺激を前記第１参加者が知覚できるように出力する刺激出力手段と、を備えることを特徴とする。 In order to solve the above problem, the invention according to claim 1 is a communication for assisting communication via a network between the first conference terminal arranged at the first base and the second conference terminal arranged at the second base. An auxiliary device, wherein the first site is perceivable by a receiving means for receiving second site status information representing the estimated status of the second site and a first participant at the first site Based on the sensory amount range calculating means for calculating the sensory amount range, the second site status information and the sensory amount range of the first participant, the intention amount of the second participant at the second site is calculated. Intention amount calculating means, and stimulus output means for outputting the stimulus corresponding to the intention amount so that the first participant can perceive.

本発明によれば、第２拠点状況情報と第１参加者の感覚量の範囲に基づいて、第２拠点にいる第２参加者意思量を算出し、当該意思量に応じた刺激を第１参加者が知覚できるように出力することで、相手に自分の存在を提示する度合いの調整を行うことができる。 According to the present invention, the second participant intention amount at the second base is calculated based on the second site status information and the range of the first participant's sense amount, and the first stimulus corresponding to the intention amount is calculated. By outputting so that a participant can perceive, it is possible to adjust the degree of presenting his / her presence to the other party.

本発明の第１実施形態に係るコミュニケーション補助システムがテレビ会議システムに使用されている構成について説明するための模式図である。It is a schematic diagram for demonstrating the structure by which the communication assistance system which concerns on 1st Embodiment of this invention is used for the video conference system. 本発明の第１実施形態に係るコミュニケーション補助システムを示すブロック図である。It is a block diagram which shows the communication assistance system which concerns on 1st Embodiment of this invention. 本発明の第１実施形態に係るコミュニケーション補助システムの動作を表すフローチャートである。It is a flowchart showing operation | movement of the communication assistance system which concerns on 1st Embodiment of this invention. 本発明の第２実施形態に係るコミュニケーション補助システムの動作を表すフローチャートであり、特に、図３に示すステップＳ２００の具体的な例を示すフローチャートである。It is a flowchart showing operation | movement of the communication assistance system which concerns on 2nd Embodiment of this invention, and is especially a flowchart which shows the specific example of step S200 shown in FIG. 本発明の第３実施形態に係るコミュニケーション補助システムの動作を表すフローチャートであり、特に、図４に示す第１拠点状況算出処理（ステップＳ２１０）の具体的な例を表すフローチャートである。It is a flowchart showing operation | movement of the communication assistance system which concerns on 3rd Embodiment of this invention, and is a flowchart showing the specific example of the 1st location situation calculation process (step S210) shown in FIG. 4 especially. 本発明の第４実施形態に係るコミュニケーション補助システムの動作を表すフローチャートであり、特に、図４に示す第２拠点状況算出処理（ステップＳ２５０）の具体的な例を説明するためのフローチャートである。It is a flowchart showing operation | movement of the communication assistance system which concerns on 4th Embodiment of this invention, and is especially a flowchart for demonstrating the specific example of the 2nd site | part condition calculation process (step S250) shown in FIG. 本発明の第５実施形態に係るコミュニケーション補助システムの動作を表すフローチャートであり、特に、図５とは異なる図４に示す第１拠点状況算出処理（ステップＳ２１０）の具体的な例を説明したフローチャートである。It is a flowchart showing operation | movement of the communication assistance system which concerns on 5th Embodiment of this invention, and the flowchart explaining the specific example of the 1st site | part condition calculation process (step S210) shown in FIG. 4 different from FIG. 5 especially. It is. 本発明の第６実施形態に係るコミュニケーション補助システムの動作を表すフローチャートであり、特に、図３に示す刺激量算出処理（ステップＳ３００）の具体的な例を説明するためのフローチャートである。It is a flowchart showing operation | movement of the communication assistance system which concerns on 6th Embodiment of this invention, and is especially a flowchart for demonstrating the specific example of the stimulus amount calculation process (step S300) shown in FIG. 本発明の第７実施形態に係るコミュニケーション補助システムに用いる、図１に示す第２参加者を表すアバターロボットの具体的な例を説明した図である。It is a figure explaining the specific example of the avatar robot showing the 2nd participant shown in FIG. 1 used for the communication assistance system which concerns on 7th Embodiment of this invention. 第１拠点１１Ａの第１参加者１３と第３参加者１４の感覚Ｅの分布のさりげない範囲を具体的に説明するためのグラフで示した図である。It is the figure shown with the graph for demonstrating concretely the casual range of distribution of the sense E of the 1st participant 13 of the 1st base 11A, and the 3rd participant. （ａ）は第１拠点１１Ａの第１参加者１３と第３参加者１４の感覚Ｅの分布から第１拠点１１Ａの感覚Ｅのさりげない範囲を具体的に説明するためのグラフ図であり、（ｂ）は第１拠点１１Ａに沢山の参加がいた場合参加者毎の感覚Ｅの分布のさりげないと感じる範囲から、第２参加者が与えたいと思っている感覚Ｅを算出する方法を説明するためのグラフで示した図である。(A) is a graph for specifically explaining the casual range of the sensation E of the first base 11A from the distribution of the sensation E of the first participant 13 and the third participant 14 of the first base 11A. (B) explains how to calculate the sensation E that the second participant wants to give from the range where the distribution of the sensation E per participant is felt casual when there is a lot of participation at the first base 11A. It is the figure shown with the graph for doing. （ａ）は、第１拠点１１Ａに存在する複数参加者のさりげないと感じる感覚の最大値Ｅ＿ｍａｘの確率密度の具体例を説明するためのグラフで示した図であり、（ｂ）は第１拠点１１Ａに存在する複数参加者のさりげないと感じる感覚の最小値Ｅ＿ｍｉｎの確率密度の具体例を説明した図であり、（ｃ）は第１拠点１１Ａのテレビ会議の拠点状況情報に応じて、第１拠点１１Ａのさりげないと感じる感覚量の範囲Ｅ＿ｍａｘとＥ＿ｍｉｎを更新する具体的な例を説明するためのグラフで示した図である。(A) is the figure shown with the graph for demonstrating the specific example of the probability density of the maximum value E_max of the sensation which the multiple participant who exists in the 1st base 11A feels casually, (b) is 1st. It is the figure explaining the specific example of the probability density of the minimum value E_min of the sensation that a plurality of participants existing in the base 11A feel casually, (c) according to the base situation information of the video conference of the first base 11A, It is the figure shown with the graph for demonstrating the specific example which updates the range E_max and E_min of the amount of sensations which the 1st base 11A feels casually. 第２拠点１１Ｂの第２参加者１５が自分の存在に気付いて欲しいと思っている度合いを反映し、第１拠点参加者に提示する感覚量Ｅを算出する具体的な例を説明するためのグラフで示した図である。Reflecting the degree that the second participant 15 at the second base 11B wants to be aware of his / her presence, a specific example for calculating the sensory amount E to be presented to the first base participant It is the figure shown by the graph.

本発明は、遠隔地間でのコミュニケーションによる会議や会話の進行を止めることなく、さりげなく自分の存在感を相手側に提示することで、遠隔地という空間が異なることによる存在感の差を低減できるようにし、これにより通信ネットワークを介して行われるコミュニケーションの一体感を提供することを可能にする。
一般に、人間の感覚量は刺激強度と関係性を有している。例えばフェヒナーの法則では、感覚量は刺激強度の対数に比例する。
ここで、定数Ｋ、刺激量Ｒとすると、感覚量（心理量）Ｅは、Ｅ＝Ｋ・ｌｏｇ（Ｒ）と表すことができる。
従って、遠隔地の参加者に与える刺激量を調整すれば、自分の存在感の量を調整できることが期待できるため、遠隔地の参加者にさりげない存在感の提示が可能になる。すなわち、相手に自分の存在を提示する度合いの調整を行うことができる。 The present invention reduces the difference in presence due to the difference in remote space by presenting its presence casually to the other party without stopping the progress of conferences and conversations by communication between remote locations This makes it possible to provide a sense of unity of communication performed over a communication network.
In general, the amount of human sense has a relationship with the stimulus intensity. For example, according to Fechner's law, the sensory amount is proportional to the logarithm of the stimulus intensity.
Here, if the constant K and the stimulus amount R are given, the sensory amount (psychological amount) E can be expressed as E = K · log (R).
Therefore, if the amount of stimulation given to the remote participant is adjusted, it can be expected that the amount of presence can be adjusted, so that a casual presence can be presented to the remote participant. That is, it is possible to adjust the degree of presenting one's presence to the other party.

本発明の実施形態について図面を参照して説明する。
＜第１実施形態＞
図１は本発明の第１実施形態に係るコミュニケーション補助システムがテレビ会議システムに使用されている構成の一例について説明するための模式図である。
ここで、図１を参照して、コミュニケーション補助システムの機能構成について説明する。
図１に示すコミュニケーション補助システムは、第１拠点１１Ａ、第２拠点１１Ｂに夫々設置されたテレビ会議端末（会議端末）１０Ａ、テレビ会議端末（会議端末）１０Ｂと、これらテレビ会議端末１０Ａ、テレビ会議端末１０Ｂが接続される通信ネットワークＮで構成される。
テレビ会議端末１０Ａ、テレビ会議端末１０Ｂは、夫々撮像装置４、マイク５、スピーカ６、ディスプレイ７、処理装置８を備えている。 Embodiments of the present invention will be described with reference to the drawings.
<First Embodiment>
FIG. 1 is a schematic diagram for explaining an example of a configuration in which the communication assistance system according to the first embodiment of the present invention is used in a video conference system.
Here, with reference to FIG. 1, the functional configuration of the communication assist system will be described.
1 includes a video conference terminal (conference terminal) 10A and a video conference terminal (conference terminal) 10B installed at the first base 11A and the second base 11B, the video conference terminal 10A, and the video conference, respectively. The communication network N is connected to the terminal 10B.
The video conference terminal 10A and the video conference terminal 10B each include an imaging device 4, a microphone 5, a speaker 6, a display 7, and a processing device 8.

例えば、第１拠点１１Ａに設置されたテレビ会議端末１０Ａの撮像装置４で撮影された画像（静止画像又は動画像）は、撮像装置４と処理装置８との連携動作により画像データに補正処理を施し、当該テレビ会議端末１０Ａ内のディスプレイ７に画像を表示する。同時に、この補正処理を施した画像データは、通信ネットワークＮに接続された第２拠点１１Ｂに設置されている相手側のテレビ会議端末１０Ｂにも伝送されて、そのディスプレイ７に画像が表示される。
例えば、第２拠点１１Ｂに設置されているテレビ会議端末１０Ｂにおいて、操作者が操作部（図示しない）の拡大／縮小ボタンを押下すると、当該テレビ会議端末１０Ｂ内に設けられた同じく撮像装置４と処理装置８と連携動作によりデジタルズーム処理が施される。この結果、デジタルズーム画像が当該テレビ会議端末１０Ｂのディスプレイ７に表示される。同時に、通信ネットワークＮに接続された第１拠点１１Ａに設置されているテレビ会議端末１０Ａにも画像データが伝送されて、そのディスプレイ７に画像表示される。 For example, an image (still image or moving image) captured by the imaging device 4 of the video conference terminal 10 A installed at the first base 11 A is subjected to a correction process on the image data by a cooperative operation between the imaging device 4 and the processing device 8. The image is displayed on the display 7 in the video conference terminal 10A. At the same time, the image data subjected to this correction processing is also transmitted to the video conference terminal 10B on the other side installed at the second base 11B connected to the communication network N, and an image is displayed on the display 7. .
For example, in the video conference terminal 10B installed at the second base 11B, when the operator presses an enlarge / reduce button of an operation unit (not shown), the same imaging device 4 provided in the video conference terminal 10B. Digital zoom processing is performed in cooperation with the processing device 8. As a result, the digital zoom image is displayed on the display 7 of the video conference terminal 10B. At the same time, the image data is also transmitted to the video conference terminal 10A installed at the first base 11A connected to the communication network N and displayed on the display 7 thereof.

図１において、第１拠点１１Ａには第１参加者１３、第３参加者１４が存在し、第２拠点１１Ｂには、第２参加者１５が存在している。さらに、第１拠点１１Ａには第２拠点１１Ｂに存在する第２参加者１５を表すアバターロボット１７が移動自在に配置される。
本発明は、３人以上で２つの異なる拠点間においてコミュニケーションを行う場合に効力を発揮するものである。以下、遠隔地に配置されているテレビ会議端末１０Ａ、１０Ｂを用いてテレビ会議を行う場合を一例にして説明する。
図１は、異なる拠点間においてコミュニケーションを行う場合の具体的な例である。
第１拠点１１Ａでの会話が白熱した状況、つまり、第１参加者１３と第３参加者１４がテレビ会議の会話が盛り上がり、第２参加者１５の存在を忘れて会話を継続している状況を想定する。
第２参加者１５は、第１参加者１３と第３参加者１４に向けて発言し、第１参加者１３と第３参加者１４との会話中に割り込むことで、第１参加者１３と第３参加者１４に第２参加者１５の存在を知らしめることも可能である。
しかし、第２参加者１５は、テレビ会議の進行を停止することは望んでおらず、発言はしたくないと思っていると仮定する。しかし、自分の存在には気付いて欲しいとは思っている。このような場合に、第１拠点１１Ａに存在する、第２参加者１５を表すアバターロボット１７がさりげなく第２参加者１５の存在を提示することで、テレビ会議の進行を妨げることなく、第２参加者１５の存在を、第１参加者１３と第３参加者１４に伝えることができる。本発明は、上記のような状況において使用する発明である。 In FIG. 1, a first participant 13 and a third participant 14 exist at the first base 11A, and a second participant 15 exists at the second base 11B. Further, an avatar robot 17 representing the second participant 15 existing at the second base 11B is movably disposed at the first base 11A.
The present invention is effective when three or more people communicate between two different sites. Hereinafter, a case where a video conference is performed using the video conference terminals 10 A and 10 B arranged at remote locations will be described as an example.
FIG. 1 is a specific example when communication is performed between different bases.
The situation where the conversation at the first base 11A was incandescent, that is, the situation where the first participant 13 and the third participant 14 were excited about the video conference conversation and forgot the existence of the second participant 15 and continued the conversation. Is assumed.
The second participant 15 speaks to the first participant 13 and the third participant 14, and interrupts during the conversation between the first participant 13 and the third participant 14, It is also possible to inform the third participant 14 of the presence of the second participant 15.
However, it is assumed that the second participant 15 does not want to stop the video conference, and does not want to speak. However, I want you to be aware of my existence. In such a case, the avatar robot 17 representing the second participant 15 present at the first base 11A casually presents the presence of the second participant 15, so that the progress of the video conference is not hindered. The presence of the two participants 15 can be communicated to the first participant 13 and the third participant 14. The present invention is used in the above situation.

図１において、本発明の第１実施形態に係るコミュニケーション補助システムは、第１拠点状況計測推定ユニット２０、中継ユニット３０、第２拠点状況計測推定ユニット４０、刺激量算出出力ユニット５０等のコミュニケーション補助装置を備えている。
図２は本発明の第１実施形態に係るコミュニケーション補助システムを示すブロック図である。
図２において、コミュニケーション補助システムは、第１拠点状況計測推定ユニット２０、第２拠点状況計測推定ユニット４０、刺激量算出出力ユニット５０を備えている。
第１拠点状況計測推定ユニット２０は、第１拠点状況計測部２１、第１拠点状況取込部２３、第１拠点状況推定部２５、無線通信部２７を備える。第１拠点状況計測部２１は、第１拠点１１Ａの状況を計測する。第１拠点状況取込部２３は、第１拠点状況計測部２１で計測した第１拠点１１Ａの状況を取り込む。第１拠点状況推定部２５は、第１拠点状況取込部２３で取り込まれた第１拠点１１Ａの状況から第１拠点１１Ａの状況の情報を示す第１拠点状況情報を推定する。無線通信部２７は、第１拠点状況推定部２５で推定された第１拠点状況情報を無線通信部５１に送信する。このように、第１拠点状況計測推定ユニット２０は、第１拠点１１Ａの状況を表す第１拠点状況情報を推定し、無線通信部２７から刺激量算出出力ユニット５０に設けられた無線通信部５１に第１拠点状況情報を送信する。 In FIG. 1, the communication assistance system according to the first embodiment of the present invention is a communication assistance system such as a first site situation measurement estimation unit 20, a relay unit 30, a second site situation measurement estimation unit 40, and a stimulus amount calculation output unit 50. Equipment.
FIG. 2 is a block diagram showing the communication assist system according to the first embodiment of the present invention.
In FIG. 2, the communication assistance system includes a first site situation measurement estimation unit 20, a second site situation measurement estimation unit 40, and a stimulus amount calculation output unit 50.
The first site status measurement estimation unit 20 includes a first site status measurement unit 21, a first site status capture unit 23, a first site status estimation unit 25, and a wireless communication unit 27. The first site status measurement unit 21 measures the status of the first site 11A. The first site status capture unit 23 captures the status of the first site 11 A measured by the first site status measurement unit 21. The first site status estimating unit 25 estimates first site status information indicating the status information of the first site 11A from the status of the first site 11A captured by the first site status capturing unit 23. The wireless communication unit 27 transmits the first site status information estimated by the first site status estimation unit 25 to the radio communication unit 51. In this way, the first site status measurement estimation unit 20 estimates the first site status information representing the status of the first site 11A, and the radio communication unit 51 provided in the stimulus amount calculation output unit 50 from the radio communication unit 27. To send the first site status information.

中継ユニット３０は、通信ネットワークＮを介して送受信される通信情報を第２拠点状況計測推定ユニット４０と刺激量算出出力ユニット５０との間で無線中継する。
第２拠点状況計測推定ユニット４０は、第２拠点状況計測部４１、第２拠点状況取込部４３、第２拠点状況推定部４５、第２拠点通信制御部４７を備える。第２拠点状況計測部４１は、第２拠点１１Ｂの状況を計測する。第２拠点状況取込部４３は、第２拠点状況計測部４１で計測した第２拠点１１Ｂの状況を取り込む。第２拠点状況推定部４５は、第２拠点状況取込部４３で取り込まれた第２拠点１１Ｂの状況から第２拠点１１Ｂの状況の情報を示す第２拠点状況情報を推定する。第２拠点通信制御部４７は、中継ユニット３０に第２拠点状況情報を送信する。このように、第２拠点状況計測推定ユニット４０は、第２拠点１１Ｂの状況を表す第２拠点状況情報を推定し、第２拠点通信制御部４７から中継ユニット３０に第２拠点状況情報を送信する。
中継ユニット３０は、第２拠点通信制御部４７から通信ネットワークＮを経由して送信された第２拠点状況情報を受信し、当該第２拠点状況情報を刺激量算出出力ユニット５０に送信する。
刺激量算出出力ユニット５０は、無線通信部５１、無線通信部５３、刺激量算出部５５、刺激出力部５７を備えている。 The relay unit 30 wirelessly relays communication information transmitted / received via the communication network N between the second site situation measurement estimation unit 40 and the stimulus amount calculation output unit 50.
The second site status measurement estimation unit 40 includes a second site status measurement unit 41, a second site status capture unit 43, a second site status estimation unit 45, and a second site communication control unit 47. The second site status measurement unit 41 measures the status of the second site 11B. The second site status capturing unit 43 captures the status of the second site 11B measured by the second site status measuring unit 41. The second site status estimating unit 45 estimates second site status information indicating the status information of the second site 11B from the status of the second site 11B captured by the second site status capturing unit 43. The second site communication control unit 47 transmits the second site status information to the relay unit 30. As described above, the second site status measurement estimation unit 40 estimates the second site status information representing the status of the second site 11B, and transmits the second site status information from the second site communication control unit 47 to the relay unit 30. To do.
The relay unit 30 receives the second site status information transmitted from the second site communication control unit 47 via the communication network N, and transmits the second site status information to the stimulus amount calculation output unit 50.
The stimulus amount calculation output unit 50 includes a wireless communication unit 51, a wireless communication unit 53, a stimulus amount calculation unit 55, and a stimulus output unit 57.

無線通信部５１は、第１拠点状況計測推定ユニット２０から第１拠点状況情報を受信し、刺激量算出部５５に出力する。
無線通信部５３は、第２拠点状況計測推定ユニット４０から第２拠点状況情報を受信し、刺激量算出部５５に出力する。
刺激量算出部５５は、第１拠点１１Ａにいる第１参加者が知覚可能な感覚量の範囲を算出する感覚量範囲算出部５５ａ、第２拠点状況情報、第１参加者の感覚量の範囲に基づいて、第２拠点１１Ｂにいる第２参加者の言動に現れる会議への参加意思の程度を表す意思量を算出する意思量算出部５５ｂ、当該意思量に応じた刺激量を算出する刺激量算出部５５ｃを備えている。
刺激量算出部５５は、第２拠点１１Ｂの第２参加者１５が第１拠点１１Ａにいる参加者に向けて提示したい感覚量から刺激量を算出しているが、本発明はこれに限定されるものではない。すなわち、第２拠点状況情報、第１参加者の感覚量の範囲に基づいて、第２拠点１１Ｂにいる第２参加者の言動に現れる会議への参加意思の程度を表す意思量を意思量算出部５５ｂにより算出し、刺激出力部５７により意思量に応じた刺激を第１参加者が知覚できるように出力してもよい。
刺激出力部５７は、刺激量に応じた刺激を第１参加者が知覚できるように出力する。
なお、各ユニットに設けられた各無線通信部は、第１拠点１１Ａにおいてユニット間で無線通信を利用したデータ通信を行うためのものである。第１拠点状況計測推定ユニット２０に設けられた無線通信部２７は、刺激量算出出力ユニット５０に設けられた無線通信部５１に対向している。一方、中継ユニット３０は、刺激量算出出力ユニット５０に設けられた無線通信部５３に対向している。 The wireless communication unit 51 receives the first site status information from the first site status measurement estimation unit 20 and outputs the first site status information to the stimulus amount calculation unit 55.
The wireless communication unit 53 receives the second site status information from the second site status measurement estimation unit 40 and outputs the second site status information to the stimulus amount calculation unit 55.
The stimulus amount calculation unit 55 calculates a sensory amount range that can be perceived by the first participant at the first base 11A, second base state information, and the first participant's sensory amount range. The intention amount calculating unit 55b that calculates the amount of intention representing the degree of intention to participate in the conference that appears in the behavior of the second participant in the second base 11B, the stimulus that calculates the amount of stimulation according to the amount of intention An amount calculation unit 55c is provided.
The stimulus amount calculation unit 55 calculates the stimulus amount from the sense amount that the second participant 15 of the second base 11B wants to present to the participant at the first base 11A, but the present invention is not limited to this. It is not something. That is, based on the second site status information and the range of the amount of sensation of the first participant, the amount of intention representing the degree of intention to participate in the conference that appears in the behavior of the second participant at the second site 11B is calculated. It may be calculated by the unit 55b and output by the stimulus output unit 57 so that the first participant can perceive the stimulus according to the intention amount.
The stimulus output unit 57 outputs a stimulus corresponding to the stimulus amount so that the first participant can perceive it.
Each wireless communication unit provided in each unit is for performing data communication using wireless communication between units at the first base 11A. The wireless communication unit 27 provided in the first site situation measurement estimation unit 20 faces the wireless communication unit 51 provided in the stimulus amount calculation output unit 50. On the other hand, the relay unit 30 faces the wireless communication unit 53 provided in the stimulus amount calculation output unit 50.

詳しくは、第１拠点状況計測推定ユニット２０は、図１に示す第１拠点１１Ａに存在する第１参加者１３と第３参加者１４の状況を計測し推定する。また、第１拠点状況計測推定ユニット２０は、図１に示す第１拠点１１Ａに存在する第１参加者１３と第３参加者１４とアバターロボット１７（第２参加者１５を表す）の状況を計測し推定する。
第１拠点状況計測推定ユニット２０は、内部にＲＯＭ、ＲＡＭ及びＣＰＵを有し、ＲＯＭからオペレーティングシステムＯＳを読み出してＲＡＭ上に展開してＯＳを起動し、ＯＳ管理下において、ＲＯＭからプログラムを読み出し、状況データ収集処理を実行する。
第２拠点状況計測推定ユニット４０は、図１に示す第２拠点１１Ｂに存在する第３参加者１４の状況を計測し推定する。
第２拠点状況計測推定ユニット４０は、内部にＲＯＭ、ＲＡＭ及びＣＰＵを有し、ＲＯＭからオペレーティングシステムＯＳを読み出してＲＡＭ上に展開してＯＳを起動し、ＯＳ管理下において、ＲＯＭからプログラムを読み出し、状況データ収集処理を実行する。
刺激量算出出力ユニット５０は、第１拠点状況計測推定ユニット２０で推定された第１拠点１１Ａの状況と、第２拠点状況計測推定ユニット４０で推定された第２拠点１１Ｂの状況と、から図１に示す第２参加者１５を表すアバターロボット１７が、第２参加者１５の存在をさりげなく提示するための刺激量を算出し出力する。
刺激量算出出力ユニット５０は、内部にＲＯＭ、ＲＡＭ及びＣＰＵを有し、ＲＯＭからオペレーティングシステムＯＳを読み出してＲＡＭ上に展開してＯＳを起動し、ＯＳ管理下において、ＲＯＭからプログラムを読み出し、刺激量算出処理を実行する。 Specifically, the first site situation measurement estimation unit 20 measures and estimates the statuses of the first participant 13 and the third participant 14 existing at the first site 11A shown in FIG. In addition, the first site status measurement estimation unit 20 indicates the status of the first participant 13, the third participant 14, and the avatar robot 17 (representing the second participant 15) existing in the first site 11A shown in FIG. Measure and estimate.
The first site situation measurement estimation unit 20 has a ROM, a RAM, and a CPU inside, reads the operating system OS from the ROM, expands it on the RAM, starts the OS, and reads the program from the ROM under OS management. The status data collection process is executed.
The second site status measurement estimation unit 40 measures and estimates the status of the third participant 14 existing at the second site 11B shown in FIG.
The second site status measurement estimation unit 40 has a ROM, a RAM, and a CPU inside, reads the operating system OS from the ROM, expands it on the RAM, starts the OS, and reads the program from the ROM under OS management. The status data collection process is executed.
The stimulus amount calculation output unit 50 is based on the situation of the first base 11A estimated by the first base situation measurement estimation unit 20 and the situation of the second base 11B estimated by the second base situation measurement estimation unit 40. The avatar robot 17 representing the second participant 15 shown in FIG. 1 calculates and outputs a stimulus amount for casually presenting the presence of the second participant 15.
The stimulus amount calculation output unit 50 has a ROM, a RAM, and a CPU inside, reads the operating system OS from the ROM, expands the RAM on the RAM, starts the OS, reads the program from the ROM under the OS management, and stimulates The amount calculation process is executed.

第１拠点状況計測部２１と第２拠点状況計測部４１は、例えばマイクとカメラを用いて音声信号と映像信号を計測する。又は、マイクと加速度センサを用いて音声信号と加速度の信号を計測する。第１拠点状況計測部２１と第２拠点状況計測部４１は、同じ種類の信号を用いてもよいし、異なる種類の信号を用いてもよい。具体的には、第１拠点状況計測部２１と第２拠点状況計測部４１は、共にマイクとカメラを用いて音声信号と映像信号を計測してもよい。また、第１拠点状況計測部２１と第２拠点状況計測部４１は、共にマイクと加速度センサを用いて音声信号と加速度の信号を計測してもよい。また、第１拠点状況計測部２１はマイクとカメラを用いて音声信号と映像信号を計測してもよく、第２拠点状況計測部４１はマイクと加速度センサを用いて音声信号と加速度の信号を計測してもよい。また、第１拠点状況計測部２１はマイクと加速度センサを用いて音声信号と加速度の信号を計測してもよく、第２拠点状況計測部４１はマイクとカメラを用いて音声信号と映像信号を計測してもよい。 The first site status measuring unit 21 and the second site status measuring unit 41 measure an audio signal and a video signal using, for example, a microphone and a camera. Alternatively, a voice signal and an acceleration signal are measured using a microphone and an acceleration sensor. The first site status measuring unit 21 and the second site status measuring unit 41 may use the same type of signal or different types of signals. Specifically, both the first site situation measuring unit 21 and the second site situation measuring unit 41 may measure an audio signal and a video signal using a microphone and a camera. Further, both the first site situation measuring unit 21 and the second site situation measuring unit 41 may measure a voice signal and an acceleration signal using a microphone and an acceleration sensor. Further, the first site situation measuring unit 21 may measure an audio signal and a video signal using a microphone and a camera, and the second site situation measuring unit 41 uses the microphone and an acceleration sensor to obtain an audio signal and an acceleration signal. You may measure. Further, the first site situation measuring unit 21 may measure an audio signal and an acceleration signal using a microphone and an acceleration sensor, and the second site situation measuring unit 41 uses the microphone and the camera to obtain an audio signal and a video signal. You may measure.

第１拠点状況取込部２３は、第１拠点状況計測部２１から得られたデータを取り込む。第２拠点状況取込部４３は、第２拠点状況計測部４１から得られたデータを取り込む。
第１拠点状況取込部２３と第２拠点状況取込部４３の夫々は、例えば、音声信号を参加者の発言内容や発音の有無を検出するためにノイズを除去する。また、参加者の発言内容や発音の有無を検出するために適切なサンプリング周期で信号を処理する。また、映像信号からノイズを除去し、図１に示す第１参加者１３と第２参加者１５と第３参加者１４とアバターロボット１７の姿勢や動きの信号のデータを取り込みやすくする前処理を行う。また、加速度の信号からノイズを除去し、第１参加者１３、第２参加者１５と第３参加者１４とアバターロボット１７の姿勢や動きの信号を抽出しやすくする前処理を行う。 The first site status capturing unit 23 captures data obtained from the first site status measuring unit 21. The second site status capturing unit 43 captures data obtained from the second site status measuring unit 41.
Each of the first site status capturing unit 23 and the second site status capturing unit 43 removes noise in order to detect, for example, the speech content of the participant and the presence or absence of pronunciation of the audio signal. In addition, the signal is processed at an appropriate sampling period in order to detect the content of the participant's speech and the presence or absence of pronunciation. Also, pre-processing is performed to remove noise from the video signal and to easily capture data of posture and movement signals of the first participant 13, the second participant 15, the third participant 14, and the avatar robot 17 shown in FIG. Do. In addition, noise is removed from the acceleration signal, and preprocessing is performed to make it easier to extract the posture and movement signals of the first participant 13, the second participant 15, the third participant 14, and the avatar robot 17.

第１拠点状況推定部２５は、第１拠点状況取込部２３から得られた信号に基づいて第１拠点１１Ａの状況を推定する。第２拠点状況推定部４５は、第２拠点状況取込部４３から得られた信号に基づいて第２拠点１１Ｂの状況を推定する。
例えば、音声信号から図１に示す第１参加者１３と第２参加者１５と第３参加者１４の発言の内容や、発言している時間や、発言していない時間や、発言の音量から、第１拠点１１Ａのテレビ会議の白熱の度合いや、第１拠点１１Ａのテレビ会議の参加の状況や、第２拠点１１Ｂのテレビ会議の参加状況や、第２参加者１５の状況を推定する。
具体的には、第１拠点状況推定部２５は、例えば、映像信号から、図１に示す第１参加者１３と第２参加者１５と第３参加者１４のテレビ会議中の体の動きや、予め定められた特徴的な動作、具体的には、うなずく、みつめる、頬杖をつくなどの動作や、発言者を特定し、その発言者の視線の先を特定し、発言相手を特定することなどからテレビ会議の状況を推定する。
例えば、映像信号から、図１に示す第１参加者１３と第３参加者１４と、第２参加者１５を表すアバターロボット１７（ロボット部）との相対距離から第１拠点１１Ａの状況を推定する。例えば、加速度の信号から、第１参加者１３と第２参加者１５と第３参加者１４のテレビ会議中の体の動きや、予め定められた特徴的な動作、具体的には、うなずく、みつめる、頬杖をつくなどの動作からテレビ会議の状況を推定する。動作や音声の有無や回数のみならず、その動作の大きさについても考慮し、状況の推定に反映させる。
具体的には、第１拠点状況推定部２５では、第１拠点１１Ａの第１参加者１３と第３参加者１４が、第２参加者１５を表すアバターロボット１７や、遠隔テレビ会議を行う時、多くの場合利用される第２参加者１５の姿を投影させた映像など、に視線を向けることなく、第１拠点１１Ａの第１参加者１３と第３参加者１４のみで会話を続けたりし、第２参加者１５に向けた発言を行わないという状況を、音声信号や映像信号や、加速度の信号を用いて音と体の動きや視線から推定する。 The first site status estimating unit 25 estimates the status of the first site 11A based on the signal obtained from the first site status capturing unit 23. The second site status estimating unit 45 estimates the status of the second site 11B based on the signal obtained from the second site status capturing unit 43.
For example, from the voice signal, the content of the speech of the first participant 13, the second participant 15 and the third participant 14 shown in FIG. 1, the time of speech, the time of speech, and the volume of speech The degree of incandescence of the video conference at the first site 11A, the status of participation in the video conference at the first site 11A, the status of participation in the video conference at the second site 11B, and the status of the second participant 15 are estimated.
Specifically, the first site situation estimation unit 25, for example, from the video signal, the body movement during the video conference of the first participant 13, the second participant 15, and the third participant 14 shown in FIG. Predetermined characteristic actions, specifically nodding, staring, cheek sticking, etc., identifying the speaker, identifying the point of the speaker's line of sight, and identifying the other party The situation of the video conference is estimated from the above.
For example, the situation of the first base 11A is estimated from the relative distance between the first participant 13 and the third participant 14 shown in FIG. 1 and the avatar robot 17 (robot unit) representing the second participant 15 from the video signal. To do. For example, from the acceleration signal, the body movement of the first participant 13, the second participant 15 and the third participant 14 during the video conference, a predetermined characteristic action, specifically, nodding, Estimate the status of video conferencing from actions such as staring and cheek sticks. Considering not only the presence / absence and number of times of motion and voice, but also the size of motion, it is reflected in the estimation of the situation.
Specifically, in the first site situation estimation unit 25, when the first participant 13 and the third participant 14 of the first site 11A perform an avatar robot 17 representing the second participant 15 or a remote video conference. The conversation can be continued only with the first participant 13 and the third participant 14 at the first base 11A without directing the line of sight to the projected image of the second participant 15 used in many cases. Then, the situation in which the speech toward the second participant 15 is not performed is estimated from the sound, the movement of the body, and the line of sight using the audio signal, the video signal, and the acceleration signal.

例えば、第１拠点状況推定部２５では、第１拠点１１Ａの第１参加者１３と第３参加者１４のみで会話を続けている時間の長さや、会話の音量、動作の大きさ、から、第２拠点１１Ｂの第２参加者１５の存在を忘れてテレビ会議に白熱している度合いを推定する。また、第１拠点状況推定部２５では、第１拠点１１Ａの第１参加者１３と第３参加者１４と、第２参加者１５を表すアバターロボット１７との距離から、後述する刺激量算出出力ユニット５０から出力する刺激量を調整する。例えば、第１拠点１１Ａの第１参加者１３と第３参加者１４と、第２参加者１５を表すアバターロボット１７との距離が離れている場合には、第２参加者１５を表すアバターロボットが出力する刺激量を、第１拠点１１Ａの第１参加者１３と第３参加者１４が感じにくい状況と推定する。上記感じにくいと推定された場合には、第２参加者１５を表すアバターロボット１７が大きな刺激を出力する。
例えば、第１拠点状況推定部２５では、第１拠点１１Ａの第１参加者１３と第３参加者１４と、第２参加者１５を表すアバターロボット１７との距離が近い場合には、第１拠点１１Ａの第１参加者１３と第３参加者１４が感じやすい状況と推定する。その結果、第２参加者１５を表すアバターロボット１７は小さな刺激を出力することとなる。上記感じやすいと推定された場合には、これは、同じ刺激を出力した場合に、第１拠点１１Ａの第１参加者１３と第３参加者１４と、第２参加者１５を表すアバターロボット１７との距離に応じて、第１拠点１１Ａの第１参加者１３と第３参加者１４が感じる感覚の量が異なることに対応するためである。 For example, in the first site situation estimation unit 25, from the length of time during which the conversation continues only with the first participant 13 and the third participant 14 of the first site 11A, the volume of the conversation, and the size of the operation, Forgetting the presence of the second participant 15 at the second base 11B, the degree of incandescence in the video conference is estimated. Further, the first site status estimation unit 25 calculates a stimulus amount calculation output described later from the distances between the first participant 13 and the third participant 14 at the first site 11A and the avatar robot 17 representing the second participant 15. The amount of stimulation output from the unit 50 is adjusted. For example, when the distance between the first participant 13 and the third participant 14 at the first base 11 A and the avatar robot 17 representing the second participant 15 is long, the avatar robot representing the second participant 15. Is estimated to be a situation in which the first participant 13 and the third participant 14 at the first base 11A are difficult to feel. When it is estimated that it is difficult to feel, the avatar robot 17 representing the second participant 15 outputs a large stimulus.
For example, in the 1st base situation estimation part 25, when the distance of the 1st participant 13 of the 1st base 11A, the 3rd participant 14, and the avatar robot 17 showing the 2nd participant 15 is near, it is 1st. It is estimated that the first participant 13 and the third participant 14 at the base 11A can easily feel. As a result, the avatar robot 17 representing the second participant 15 outputs a small stimulus. When it is estimated that it is easy to feel, this means that when the same stimulus is output, the avatar robot 17 representing the first participant 13, the third participant 14, and the second participant 15 at the first base 11 A. This is because the amount of sensation felt by the first participant 13 and the third participant 14 at the first base 11 A varies depending on the distance to the first base 11 A.

第２拠点状況推定部４５では、第２拠点１１Ｂの第２参加者１５が、予め定められた時間以上発言しておらず、頬杖をついたり、うなずいたり、ため息をついたりなどの特徴的な動作（挙動、言動）を行い、第１拠点１１Ａの第１参加者１３と第３参加者１４に自分の存在に気づいて欲しいと思っている状況を、音声信号や映像信号や加速度の信号を用いて音と体の動きや視線から推定する。動作や音声の有無や回数のみならず、その動作の大きさについても考慮し、拠点状況情報の推定に反映させる。第２拠点１１Ｂの第２参加者１５が発言していない時間の長さや、動作の大きさ、から、第２拠点１１Ｂの第２参加者１５が自分の存在に気づいて欲しいと思っている度合いを推定する。 In the second site status estimation unit 45, the second participant 15 of the second site 11B has not spoken for a predetermined time, and has a characteristic such as wearing a cheek stick, nodding or sighing. An action (behavior, behavior) is performed, and a situation in which the first participant 13 and the third participant 14 of the first base 11A want to be aware of their existence is expressed by an audio signal, a video signal, or an acceleration signal. Used to estimate from sound and body movements and line of sight. Considering not only the presence / absence and frequency of actions and voices but also the magnitude of the actions, it is reflected in the estimation of the base situation information. The degree to which the second participant 15 at the second base 11B wants to be aware of his / her presence from the length of time that the second participant 15 at the second base 11B is not speaking and the size of the movement. Is estimated.

刺激量算出部５５は、第１拠点状況取込部２３で推定された拠点状況情報と第２拠点状況取込部４３で推定されたと拠点状況情報から刺激量を算出する。具体的には、第２拠点１１Ｂの第２参加者１５が第１拠点１１Ａの第１参加者１３と第３参加者１４に自分の存在に気づいて欲しいと思っている拠点状況情報であると推定された時、かつ、第１拠点１１Ａの第１参加者１３と第３参加者１４が議論に白熱し第２拠点１１Ｂの第２参加者１５の存在を忘れている拠点状況情報であると推定された時、第１拠点１１Ａの第１参加者１３と第３参加者１４に、第２拠点１１Ｂの第２参加者１５の存在をさりげなく提示するために、刺激量を算出する。
例えば、刺激量算出部５５は、第１拠点状況取込部２３で推定された拠点状況情報と第２拠点状況取込部４３で推定されたと拠点状況情報に基づいて刺激量を算出する。刺激量算出部５５は、出力する刺激量を、第１拠点１１Ａの第１参加者１３と第３参加者１４が議論に白熱し第２拠点１１Ｂの第２参加者１５の存在を忘れている度合いと、第２拠点１１Ｂの第２参加者１５が第１拠点１１Ａの第１参加者１３と第３参加者１４に自分の存在に気づいて欲しいと思っている度合いによって調節する。例えば、第２拠点１１Ｂの第２参加者１５が自分の存在に気づいて欲しいと思っている度合いが大きい場合には、予め定められた刺激の範囲の中から大きい刺激量を算出する。第２拠点１１Ｂの第２参加者１５が自分の存在に気づいて欲しいと思っている度合いが小さい場合には、予め定められた刺激の範囲の中から小さい刺激量を算出する。第１拠点１１Ａの第１参加者１３と第３参加者１４が議論に白熱し第２拠点１１Ｂの第２参加者１５の存在を忘れている度合いが大きい場合には、予め定められた刺激の範囲の中から大きい刺激量を算出する。第１拠点１１Ａの第１参加者１３と第３参加者１４が議論に白熱し第２拠点１１Ｂの第２参加者１５の存在を忘れている度合いが小さい場合には、予め定められた刺激の範囲の中から小さい刺激量を算出する。
例えば、第１拠点１１Ａの第１参加者１３と第３参加者１４と、第２参加者１５を表すアバターロボット１７との距離が離れている場合には第２参加者１５を表すアバターロボット１７が大きな刺激量を算出し、距離が近い場合には第２参加者１５を表すアバターロボット１７が小さな刺激量を算出する。 The stimulus amount calculating unit 55 calculates the stimulus amount from the base state information estimated by the first base state capturing unit 23 and the base state information estimated by the second base state capturing unit 43. Specifically, the second participant 15 at the second base 11B is the base situation information that the first participant 13 and the third participant 14 at the first base 11A want to be aware of their existence. When it is estimated, it is the base situation information that the first participant 13 and the third participant 14 of the first base 11A are heated in the discussion and have forgotten the presence of the second participant 15 of the second base 11B. When estimated, in order to casually present the presence of the second participant 15 at the second base 11B to the first participant 13 and the third participant 14 at the first base 11A, the stimulation amount is calculated.
For example, the stimulus amount calculation unit 55 calculates the stimulus amount based on the base state information estimated by the first base state capture unit 23 and the base state information estimated by the second base state capture unit 43. The stimulus amount calculation unit 55 inclines the output stimulus amount by the first participant 13 and the third participant 14 at the first base 11A and forgets the presence of the second participant 15 at the second base 11B. The degree and the degree that the second participant 15 of the second base 11B wants the first participant 13 and the third participant 14 of the first base 11A to be aware of their presence are adjusted. For example, when the second participant 15 at the second base 11B wants to be aware of his / her presence, the stimulation amount is calculated from a predetermined stimulation range. When the second participant 15 at the second base 11B wants to be aware of his / her presence is small, a small stimulus amount is calculated from a predetermined stimulus range. When the first participant 13 and the third participant 14 at the first base 11A are incandescent in the discussion and forget about the presence of the second participant 15 at the second base 11B, Calculate a large amount of stimulation from the range. If the first participant 13 and the third participant 14 at the first base 11A are incandescent in the discussion and the degree of forgetting the presence of the second participant 15 at the second base 11B is small, Calculate a small amount of stimulus from the range.
For example, when the distance between the first participant 13 and the third participant 14 at the first base 11 A and the avatar robot 17 representing the second participant 15 is long, the avatar robot 17 representing the second participant 15. Calculates a large stimulus amount, and when the distance is short, the avatar robot 17 representing the second participant 15 calculates a small stimulus amount.

刺激量算出部５５は、上記算出された刺激量から、刺激出力部５７に応じた信号への変換を行う部である。後に説明するが、例えば刺激出力部５７がモータなどの振動を利用する場合、上記算出された刺激量に応じて振幅と周波数を算出する。例えば刺激出力部５７が発光ダイオードＬＥＤなどの光を発する場合、上記算出された刺激量に応じて出力する光量や電流、電圧の値を算出する。刺激出力部５７がＬＥＤなどの色を出力する場合、算出された刺激量に応じて出力する色の濃淡、例えば濃い赤や薄い赤、や、色の変化、例えば青や赤、を出力する。刺激出力部５７は、第１参加者が目視することにより感知可能な光波信号を発生する。
刺激出力部５７がスピーカなどの音を出力する場合、上記算出された刺激量に応じて出力する音量を算出する。刺激出力部５７がスピーカなどの音を出力する場合、上記算出された刺激量に応じて出力する言語又は音声の内容やその音量を算出する。具体的には、「あーあ」「んー」等の言語又は音声を選んだり、その音声の音量を算出したりする。刺激出力部５７は、第１参加者が聴取することにより感知可能な音波信号を発生する。
刺激出力部５７は、第１拠点状況取込部２３で推定された拠点状況情報と第２拠点状況取込部４３で推定された拠点状況情報に基づいて刺激量算出部５５で算出された刺激量を出力する。例えば、図１に示す第２参加者１５を表すアバターロボット１７に、モータやＬＥＤやスピーカなどのデバイスが具備されており、上記デバイスが、第１拠点状況取込部２３で推定された拠点状況情報と第２拠点状況取込部４３で推定されたと拠点状況情報に基づいて刺激量算出部５５で算出された刺激量を出力する。 The stimulus amount calculation unit 55 is a unit that converts the calculated stimulus amount into a signal corresponding to the stimulus output unit 57. As will be described later, for example, when the stimulus output unit 57 uses vibration of a motor or the like, the amplitude and the frequency are calculated according to the calculated stimulus amount. For example, when the stimulus output unit 57 emits light such as a light emitting diode LED, the light amount, current, and voltage values to be output are calculated according to the calculated stimulus amount. When the stimulus output unit 57 outputs a color such as an LED, it outputs a shade of color to be output according to the calculated stimulus amount, such as dark red or light red, or a color change such as blue or red. The stimulus output unit 57 generates a light wave signal that can be sensed by visual observation by the first participant.
When the stimulus output unit 57 outputs sound from a speaker or the like, the sound output volume is calculated according to the calculated stimulus amount. When the stimulus output unit 57 outputs a sound from a speaker or the like, the language or voice content to be output or the volume thereof is calculated according to the calculated stimulus amount. Specifically, a language or voice such as “Ah” or “Nh” is selected, and the volume of the voice is calculated. The stimulus output unit 57 generates a sound wave signal that can be sensed by listening to the first participant.
The stimulus output unit 57 calculates the stimulus calculated by the stimulus amount calculation unit 55 based on the site status information estimated by the first site status capturing unit 23 and the site status information estimated by the second site status capturing unit 43. Output quantity. For example, the avatar robot 17 representing the second participant 15 shown in FIG. 1 is equipped with devices such as motors, LEDs, and speakers, and the above-mentioned devices are estimated by the first site status capturing unit 23. The stimulus amount calculated by the stimulus amount calculation unit 55 based on the information and the site state information estimated by the second site state capturing unit 43 is output.

図３は本発明の第１実施形態に係るコミュニケーション補助システムの動作を表すフローチャートである。
まずステップＳ１００では状況入力処理を行う。図２に示す第１拠点状況計測部２１と、第１拠点状況取込部２３と、第２拠点状況計測部４１と、第２拠点状況取込部４３とを用いて、図１に示す第１拠点１１Ａと第２拠点１１Ｂの状況を計測して取り込む。ここで、状況入力処理に用いる入力は、図２に示す第１拠点状況計測部２１と第２拠点状況計測部４１と、から得られる計測データである。
ステップＳ１００では、上記得られた計測データを、ステップＳ２００で用いるデータへ変換する。
具体的には、例えば、ステップＳ１００は、第２拠点１１Ｂの第２参加者１５が第１拠点１１Ａの第１参加者１３と第３参加者１４に自分の存在に気づいて欲しいと思っている状況かどうか、ならびに第１拠点１１Ａの第１参加者１３と第３参加者１４が議論に白熱し第２拠点１１Ｂの第２参加者１５の存在を忘れている状況であるかどうかを計測し取り込むステップである。例えばマイクとカメラを用いて音声信号と映像信号を入力する。又は、マイクと加速度センサを用いて音声信号と加速度の信号を入力する。ここで、音声信号を参加者の発言内容や発音の有無を検出するためにノイズを除去したり、参加者の発言内容や発音の有無を検出するために適切なサンプリング周期で信号を処理したり、映像信号からノイズを除去し、図１に示す第１参加者１３と第２参加者１５と第３参加者１４とアバターロボット１７の姿勢や動きの信号を抽出したりする。
ステップＳ１００の出力は、上記状況入力処理を行うステップＳ１００で処理されたデータである。 FIG. 3 is a flowchart showing the operation of the communication assist system according to the first embodiment of the present invention.
First, in step S100, status input processing is performed. The first site status measurement unit 21, the first site status acquisition unit 23, the second site status measurement unit 41, and the second site status acquisition unit 43 shown in FIG. Measure and capture the status of the first base 11A and the second base 11B. Here, the input used for the situation input process is measurement data obtained from the first site situation measurement unit 21 and the second site situation measurement unit 41 shown in FIG.
In step S100, the obtained measurement data is converted into data used in step S200.
Specifically, for example, in step S100, the second participant 15 at the second base 11B wants the first participant 13 and the third participant 14 at the first base 11A to be aware of their presence. It is measured whether it is a situation and whether the first participant 13 and the third participant 14 at the first base 11A are in a heated discussion and have forgotten the presence of the second participant 15 at the second base 11B. This is the step to capture. For example, an audio signal and a video signal are input using a microphone and a camera. Alternatively, an audio signal and an acceleration signal are input using a microphone and an acceleration sensor. Here, noise is removed from the audio signal in order to detect the content of the participant's speech and the presence of pronunciation, or the signal is processed at an appropriate sampling period to detect the content of the participant's speech and the presence of pronunciation. Then, noise is removed from the video signal, and signals of posture and movement of the first participant 13, the second participant 15, the third participant 14, and the avatar robot 17 shown in FIG. 1 are extracted.
The output of step S100 is the data processed in step S100 for performing the situation input process.

状況算出処理を行うステップＳ２００では、図２に示す第１拠点状況推定部２５と、第２拠点状況推定部４５を用いて、図１に示す第１拠点１１Ａと第２拠点１１Ｂの拠点状況情報を推定し算出する。
ステップＳ２００の入力は、ステップＳ１００の出力である。
ステップＳ２００の処理は、図２に示す第１拠点状況推定部２５と、第２拠点状況推定部４５を用いて、図１に示す第１拠点１１Ａと第２拠点１１Ｂの拠点状況情報を推定し算出する。
ステップＳ２００の出力は、上記推定された図１に示す第１拠点１１Ａと第２拠点１１Ｂの拠点状況情報である。
具体的には、例えば、ステップＳ２００では、第２拠点１１Ｂの第２参加者１５が第１拠点１１Ａの第１参加者１３と第３参加者１４に自分の存在に気づいて欲しいと思っている度合いを算出する、ならびに第１拠点１１Ａの第１参加者１３と第３参加者１４が議論に白熱し、第２拠点１１Ｂの第２参加者１５の存在を忘れている度合いを算出する。
ステップＳ２００の入力は、例えばマイクやカメラや加速度センサを用いて得られた音声信号や映像信号や加速度の信号からノイズを除去し、参加者の発言内容や発音の有無を検出するために適切なサンプリング周期で信号を処理された信号である。 In step S200 for performing the situation calculation process, the site status information of the first site 11A and the second site 11B shown in FIG. 1 is used by using the first site status estimation unit 25 and the second site status estimation unit 45 shown in FIG. Is estimated and calculated.
The input of step S200 is the output of step S100.
The process of step S200 estimates the site status information of the first site 11A and the second site 11B shown in FIG. 1 using the first site status estimation unit 25 and the second site status estimation unit 45 shown in FIG. calculate.
The output of step S200 is the estimated site status information of the first site 11A and the second site 11B shown in FIG.
Specifically, for example, in step S200, the second participant 15 at the second base 11B wants the first participant 13 and the third participant 14 at the first base 11A to be aware of their presence. The degree is calculated, and the degree to which the first participant 13 and the third participant 14 at the first base 11A are incite of discussion and forgetting the presence of the second participant 15 at the second base 11B is calculated.
The input in step S200 is appropriate for removing noise from audio signals, video signals, and acceleration signals obtained using, for example, a microphone, a camera, and an acceleration sensor, and detecting the participant's speech content and the presence or absence of pronunciation. It is a signal obtained by processing a signal at a sampling period.

ステップＳ２００では、例えば音声信号から図１に示す第１参加者１３と第２参加者１５と第３参加者１４の発言の内容や、発言している時間、発言していない時間や、発言の音量から、図１に示す第１拠点１１Ａのテレビ会議の白熱の度合いや、図１に示す第１拠点１１Ａのテレビ会議の参加の拠点状況情報を推定する。例えば、映像信号から、図１に示す第１参加者１３と第２参加者１５と第３参加者１４のテレビ会議中の体の動きや、予め定められた特徴的な動作、具体的にはうなずく、みつめる、頬杖をつくなどの動作や、視線の先を特定し誰に向かって会話しているのか等の情報からテレビ会議の拠点状況情報を推定する。例えば、動作や音声の有無や回数のみならず、その動作の大きさについても考慮し、拠点状況情報を推定する。
ステップＳ２００の出力は、上記推定された図１に示す第１拠点１１Ａのテレビ会議の拠点状況情報や、図１に示す第１拠点１１Ａのテレビ会議の拠点状況情報である。例えば、図１に示す第１拠点１１Ａのテレビ会議の白熱度合いや、第２拠点１１Ｂにいる第２参加者１５が自分の存在に気づいて欲しいと思っている度合いである。例えば、図１に示す第１拠点１１Ａの第１参加者１３と第３参加者１４が第２拠点１１Ｂの第２参加者１５の存在を忘れていると推定された拠点状況情報かどうかや、第２拠点１１Ｂの参加者が自分の存在に気づいて欲しいと思っていると推定された状態かどうかである。例えば図１に示す第２参加者１５を表すアバターロボット１７と第１拠点１１Ａの第１参加者１３と第３参加者１４の距離により、第２参加者１５に気づきやすい拠点状況情報かどうかを表す値である。 In step S200, for example, the speech contents of the first participant 13, the second participant 15 and the third participant 14 shown in FIG. 1 from the audio signal, the time of speech, the time of speech, From the volume, the degree of incandescence of the video conference at the first site 11A shown in FIG. 1 and the site status information of participation in the video conference at the first site 11A shown in FIG. 1 are estimated. For example, from the video signal, the body movement during the video conference of the first participant 13, the second participant 15, and the third participant 14 shown in FIG. Estimate the video conference site status information from information such as nodding, staring, and cheek sticking, and identifying the point of gaze and who is talking to. For example, the base situation information is estimated in consideration of not only the presence / absence and number of times of the motion and voice, but also the size of the motion.
The output of step S200 is the estimated location status information of the video conference of the first location 11A shown in FIG. 1 and the location status information of the video conference of the first location 11A shown in FIG. For example, the degree of incandescence of the video conference at the first base 11A shown in FIG. 1 or the degree that the second participant 15 at the second base 11B wants to be aware of their presence. For example, whether the first participant 13 and the third participant 14 of the first base 11A shown in FIG. 1 are the base situation information estimated to have forgotten the presence of the second participant 15 of the second base 11B, Whether or not it is estimated that the participant at the second base 11B wants to be aware of his / her presence. For example, based on the distance between the avatar robot 17 representing the second participant 15 shown in FIG. 1 and the first participant 13 and the third participant 14 in the first site 11A, it is determined whether the site status information is easily noticed by the second participant 15. The value to represent.

刺激量算出処理を行うステップＳ３００は、図２に示す第１拠点状況推定部２５で推定された拠点状況情報と第２拠点状況推定部４５で推定されたと拠点状況情報に基づいて、刺激量算出部５７を用いて、刺激量を算出する。
ステップＳ３００の入力は、上記ステップＳ２００の出力である。ステップＳ３００は、図２に示す第１拠点状況推定部２５で推定された拠点状況情報と第２拠点状況推定部４５で推定されたと拠点状況情報に基づいて、刺激量算出部５７を用いて、刺激量を算出する。ステップＳ３００の出力は、上記処理にて算出された刺激量を表す値である。
具体的には、例えば、ステップＳ３００では、図１に示す第１拠点１１Ａのテレビ会議の拠点状況情報と図１に示す第２拠点１１Ｂのテレビ会議の拠点状況情報に応じて刺激量を算出する。
ステップＳ３００の入力は、例えば、図１に示す第１拠点１１Ａの第１参加者１３と第３参加者１４が第２拠点１１Ｂの第２参加者１５の存在を忘れていると推定された拠点状況情報かどうかや、第２拠点１１Ｂの参加者が自分の存在に気づいて欲しいと思っていると推定された状態かどうかである。例えば図１に示す第２参加者１５を表すアバターロボット１７と第１拠点１１Ａの第１参加者１３と第３参加者１４の距離により、第２参加者１５に気づきやすい拠点状況情報かどうかを表す値である。例えば、図１に示す第１拠点１１Ａのテレビ会議の拠点状況情報や、図１の第１拠点１１Ａのテレビ会議の拠点状況情報である。例えば、図１に示す第１拠点１１Ａのテレビ会議の白熱度合いや、第２拠点１１Ｂにいる第２参加者１５が自分の存在に気づいて欲しいと思っている度合いである。
ステップＳ３００では、第２拠点状況情報、第１参加者１３の感覚量の範囲に基づいて、第２拠点１１Ｂにいる第２参加者１５の言動に現れる会議への参加意思の程度を表す意思量を算出する。
ステップＳ３００では、例えば、図１に示す第１拠点１１Ａのテレビ会議の白熱度合いや、第２拠点１１Ｂにいる第２参加者１５が自分の存在に気づいて欲しいと思っている度合いに応じて図１に示す第１拠点１１Ａにいる第１参加者１３と第３参加者１４に提示する刺激量を算出する。
ステップＳ３００の出力は、例えば、図１に示す第１拠点１１Ａのテレビ会議の白熱度合いや、第２拠点１１Ｂにいる第２参加者１５が自分の存在に気づいて欲しいと思っている度合いに応じて図１に示す第１拠点１１Ａにいる第１参加者１３と第３参加者１４に提示する刺激量である。 In step S300 for performing the stimulus amount calculation process, the stimulus amount is calculated based on the site status information estimated by the first site status estimation unit 25 and the site status information estimated by the second site status estimation unit 45 shown in FIG. The stimulation amount is calculated using the unit 57.
The input of step S300 is the output of step S200. Step S300 uses the stimulus amount calculation unit 57 based on the site status information estimated by the first site status estimation unit 25 and the site status information estimated by the second site status estimation unit 45 shown in FIG. Calculate the amount of stimulation. The output of step S300 is a value representing the stimulus amount calculated in the above process.
Specifically, for example, in step S300, the amount of stimulation is calculated according to the video conference site status information of the first site 11A shown in FIG. 1 and the video conference site status information of the second site 11B shown in FIG. .
The input of step S300 is, for example, the base estimated that the first participant 13 and the third participant 14 of the first base 11A shown in FIG. 1 have forgotten the presence of the second participant 15 of the second base 11B. Whether it is situation information or whether it is estimated that the participant of the second base 11B wants to be aware of his / her existence. For example, based on the distance between the avatar robot 17 representing the second participant 15 shown in FIG. 1 and the first participant 13 and the third participant 14 in the first site 11A, it is determined whether the site status information is easily noticed by the second participant 15. The value to represent. For example, it is the site status information of the video conference of the first site 11A shown in FIG. 1 and the site status information of the video conference of the first site 11A of FIG. For example, the degree of incandescence of the video conference at the first base 11A shown in FIG. 1 or the degree that the second participant 15 at the second base 11B wants to be aware of their presence.
In step S300, based on the second site status information and the range of the amount of sensation of the first participant 13, a will amount representing the degree of intention to participate in the conference that appears in the behavior of the second participant 15 at the second site 11B. Is calculated.
In step S300, for example, the diagram shows the degree of incandescence of the video conference at the first base 11A shown in FIG. 1 or the degree that the second participant 15 at the second base 11B wants to be aware of their presence. The stimulation amount to be presented to the first participant 13 and the third participant 14 in the first base 11A shown in FIG.
The output of step S300 depends on, for example, the incandescence of the video conference at the first base 11A shown in FIG. 1 or the degree that the second participant 15 at the second base 11B wants to be aware of his / her presence. The amount of stimulation presented to the first participant 13 and the third participant 14 at the first base 11A shown in FIG.

刺激出力処理を行うステップＳ４００では、図２に示す刺激量算出部５５で算出された刺激量を用いて刺激を出力する。
ステップＳ４００の入力は、ステップＳ３００の出力である。ステップＳ４００は、ステップＳ３００にて出力された値を算出された刺激出力部５７の入力値に変換する。
ステップＳ４００の出力は、図２に示す刺激出力部５７で算出された刺激量を用いて実際に刺激を出力するものである。
具体的には、例えば、ステップＳ４００では、上記ステップＳ３００の出力である刺激を実際に刺激の出力として使用する機器の信号へと変換し、実際に刺激を出力する。
ステップＳ４００の入力は、例えば、図１に示す第１拠点１１Ａのテレビ会議の白熱度合いや、第２拠点１１Ｂにいる第２参加者１５が自分の存在に気づいて欲しいと思っている度合いに応じて図１に示す第１拠点１１Ａにいる第１参加者１３と第３参加者１４に提示する刺激量である。 In step S400 for performing the stimulus output process, a stimulus is output using the stimulus amount calculated by the stimulus amount calculation unit 55 shown in FIG.
The input of step S400 is the output of step S300. In step S400, the value output in step S300 is converted into the calculated input value of the stimulus output unit 57.
The output of step S400 is to actually output a stimulus using the stimulus amount calculated by the stimulus output unit 57 shown in FIG.
Specifically, for example, in step S400, the stimulus that is the output of step S300 is converted into a signal of a device that is actually used as the stimulus output, and the stimulus is actually output.
The input in step S400 depends on, for example, the incandescence of the video conference at the first base 11A shown in FIG. 1 or the degree that the second participant 15 at the second base 11B wants to be aware of his / her presence. The amount of stimulation presented to the first participant 13 and the third participant 14 at the first base 11A shown in FIG.

ステップＳ４００では、例えば、図２に示す刺激出力部５７で算出された刺激量を振動モータに出力する場合、上記入力された刺激量を振幅と周波数に変換する。振幅は固定で周波数に上記入力された刺激量を変換してもよいし、周波数は固定で振幅に上記入力された刺激量を変換してもよい。
ステップＳ４００の出力は、例えば、図２に示す刺激出力部５７で算出された刺激量を振動モータに出力する場合、上記処理にて変換された振幅と周波数にて振動モータを振動させることである。
なお、本実施形態によれば、第２拠点１１Ｂの第２参加者１５が第１拠点１１Ａにいる参加者に向けて提示したい感覚量から刺激量を算出しているが、本発明はこれに限定されるものではない。すなわち、第２拠点状況情報、第１参加者の感覚量の範囲に基づいて、第２拠点１１Ｂにいる第２参加者の言動に現れる会議への参加意思の程度を表す意思量を意思量算出部５５ｂにより算出し、刺激出力部５７により意思量に応じた刺激を第１参加者が知覚できるように出力してもよい。これにより、第２拠点状況情報、第１参加者の感覚量の範囲に基づいて、遠隔地の参加者の言動に現れる会議への参加意思の程度を表す意思量を提示する度合いの調整を行うことができる。
また、第１拠点状況情報、第２拠点状況情報、第１参加者の感覚量の範囲に基づいて、遠隔地の参加者の言動に現れる会議への参加意思の程度を表す意思量を提示する度合いの調整を行うことができる。 In step S400, for example, when the stimulus amount calculated by the stimulus output unit 57 shown in FIG. 2 is output to the vibration motor, the input stimulus amount is converted into an amplitude and a frequency. The amplitude may be fixed and the input stimulus amount may be converted into a frequency, or the frequency may be fixed and the input stimulus amount may be converted into an amplitude.
The output of step S400 is, for example, to vibrate the vibration motor with the amplitude and frequency converted in the above process when the stimulus amount calculated by the stimulus output unit 57 shown in FIG. 2 is output to the vibration motor. .
In addition, according to this embodiment, although the 2nd participant 15 of the 2nd base 11B calculates the stimulus amount from the sensory amount which he would like to show toward the participant in the 1st base 11A, this invention does this. It is not limited. That is, based on the second site status information and the range of the amount of sensation of the first participant, the amount of intention representing the degree of intention to participate in the conference that appears in the behavior of the second participant at the second site 11B is calculated. It may be calculated by the unit 55b and output by the stimulus output unit 57 so that the first participant can perceive the stimulus according to the intention amount. This adjusts the degree of presenting the amount of intention representing the degree of willingness to participate in the conference that appears in the speech of the remote participant based on the second site status information and the range of the sensory amount of the first participant. be able to.
In addition, based on the first site status information, the second site status information, and the range of the sense amount of the first participant, an intention amount representing the degree of intention to participate in the conference that appears in the speech and behavior of the remote participant is presented. The degree can be adjusted.

＜第２実施形態＞
図４は本発明の第２実施形態に係るコミュニケーション補助システムの動作を表すフローチャートであり、特に、図３に示すステップＳ２００の具体的な例を示すフローチャートである。
ステップＳ２１０は第１拠点状況算出処理を行うためのサブルーチンであり、ステップＳ２５０は第２拠点状況算出処理を行うためのサブルーチンである。図３に示すステップＳ２００は、第１拠点状況算出処理Ｓ２１０と、第２拠点状況算出処理Ｓ２５０からなるステップである。
ステップＳ２１０の入力は、図２に示す第１拠点状況取込部２３により取り込まれた情報である。一方、ステップＳ２１０では、図２に示す第１拠点状況推定部２５を用いて第１拠点１１Ａの拠点状況情報を算出する。ステップＳ２１０の出力は、上記処理で算出された第１拠点１１Ａの拠点状況情報である。
具体的には、例えば、ステップＳ２１０の入力は、図１に示す第１拠点１１Ａの第１参加者１３と第３参加者１４の状況を計測した情報である。例えば、ノイズの除去や適切なサンプリング周期へ信号変換されたような、前処理を加えた音声信号や映像信号や加速度の信号のデータである。例えば、図１に示す第１参加者１３と第２参加者１５と第３参加者１４のテレビ会議中の体の動きや、予め定められた特徴的な動作、具体的にはうなずく、みつめる、頬杖をつくなどの動作や、視線の先を特定し誰に向かって会話しているのか等のデータである。 Second Embodiment
FIG. 4 is a flowchart showing the operation of the communication assistance system according to the second embodiment of the present invention, and in particular, is a flowchart showing a specific example of step S200 shown in FIG.
Step S210 is a subroutine for performing the first site status calculation process, and step S250 is a subroutine for performing the second site status calculation process. Step S200 shown in FIG. 3 is a step including a first site status calculation process S210 and a second site status calculation process S250.
The input in step S210 is information captured by the first site status capturing unit 23 illustrated in FIG. On the other hand, in step S210, the site status information of the first site 11A is calculated using the first site status estimation unit 25 shown in FIG. The output of step S210 is the site status information of the first site 11A calculated by the above processing.
Specifically, for example, the input in step S210 is information obtained by measuring the situation of the first participant 13 and the third participant 14 in the first base 11A shown in FIG. For example, it is data of an audio signal, a video signal, and an acceleration signal subjected to preprocessing such as noise removal and signal conversion into an appropriate sampling period. For example, the first participant 13, the second participant 15, and the third participant 14 shown in FIG. This is data such as the action of putting a cheek cane and the person who is talking to the person with the point of gaze specified.

ステップＳ２１０では、例えば、第１拠点１１Ａの第１参加者１３と第３参加者１４が、第２参加者１５を表すアバターロボット１７や、多くの場合遠隔テレビ会議に存在する第２参加者１５の姿を投影する映像に視線を向けることなく、第１拠点１１Ａの第１参加者１３と第３参加者１４のみで会話を続けたりし、第２参加者１５に向けた発言を行わないという拠点状況情報を、音声信号や映像信号や、加速度の信号を用いて音と体の動きや視線から推定し算出する。例えば、第１拠点１１Ａの第１参加者１３と第３参加者１４と、第２参加者１５を表すアバターロボット１７との距離を算出する。
ステップＳ２１０の出力は、第１拠点１１Ａの第１参加者１３と第３参加者１４が第２拠点１１Ｂの第２参加者１５の存在を忘れてテレビ会議に白熱している度合いを表す値である。
ステップＳ２５０の入力は、図２に示す第２拠点状況取込部４３により取り込まれたデータである。
ステップＳ２５０では、図２に示す第２拠点状況推定部４５を用いて図１に示す第２拠点１１Ｂの拠点状況情報を推定し算出する。 In step S210, for example, the first participant 13 and the third participant 14 of the first base 11A are the avatar robot 17 representing the second participant 15, or the second participant 15 often present in a remote video conference. Continuing the conversation with only the first participant 13 and the third participant 14 at the first base 11A without giving a gaze to the image that projects the figure, and not making a statement toward the second participant 15 The site status information is estimated and calculated from sound, body movement, and line of sight using audio signals, video signals, and acceleration signals. For example, the distance between the first participant 13 and the third participant 14 at the first base 11A and the avatar robot 17 representing the second participant 15 is calculated.
The output of step S210 is a value representing the degree to which the first participant 13 and the third participant 14 at the first base 11A forget the presence of the second participant 15 at the second base 11B and are incandescent in the video conference. is there.
The input in step S250 is data captured by the second site status capturing unit 43 shown in FIG.
In step S250, the base state information of the second base 11B shown in FIG. 1 is estimated and calculated using the second base state estimation unit 45 shown in FIG.

ステップＳ２５０の出力は、上記処理で算出された図１に示す第２拠点１１Ｂの拠点状況情報を表すデータある。
具体的には、例えば、ステップＳ２５０の入力は、図１に示す第２拠点１１Ｂの第２参加者１５の拠点状況情報を計測した情報である。例えば、ノイズの除去や適切なサンプリング周期へ信号変換されたような、前処理を加えた音声信号や映像信号や加速度の信号のデータである。例えば、図１に示す第２拠点１１Ｂの第２参加者１５の音声信号であったり、例えば、頬杖をついたり、うなずいたり、ため息をついたりなどの特徴的な動作の信号である。
ステップＳ２５０では、図１に示す第２拠点１１Ｂにいる第２参加者１５の拠点状況情報を算出する。例えば第２拠点１１Ｂの第２参加者１５のテレビ会議の参加が、発言を予め定められた時間以上発言しておらず、例えば、頬杖をついたり、うなずいたり、ため息をついたりなどの特徴的な動作を行い、第１拠点１１Ａの第１参加者１３と第３参加者１４に存在に気づいて欲しいと思っている拠点状況情報を、音声信号や映像信号や加速度の信号を用いて音と体の動きや視線から推定し算出する処理である。
ステップＳ２５０の出力は、例えば第２拠点１１Ｂの第２参加者１５が発言するまでではないが、第１拠点１１Ａの第１参加者１３と第３参加者１４に自分の存在を気にして欲しい又は気づいて欲しいと思っている拠点状況情報の度合いを表す値である。
このように、第２拠点状況情報、第１参加者の感覚量の範囲に基づいて、遠隔地の参加者の言動に現れる会議への参加意思の程度を表す意思量を提示する度合いの調整を行うことができる。 The output of step S250 is data representing the base state information of the second base 11B shown in FIG.
Specifically, for example, the input in step S250 is information obtained by measuring the base state information of the second participant 15 in the second base 11B shown in FIG. For example, it is data of an audio signal, a video signal, and an acceleration signal subjected to preprocessing such as noise removal and signal conversion into an appropriate sampling period. For example, it is an audio signal of the second participant 15 at the second base 11B shown in FIG. 1, or a characteristic operation signal such as wearing a cheek stick, nodding or sighing.
In step S250, the base situation information of the second participant 15 in the second base 11B shown in FIG. 1 is calculated. For example, the participation of the second participant 15 of the second base 11B in the video conference does not speak for more than a predetermined time, such as wearing a cheek stick, nodding or sighing. The base situation information that the first participant 13 and the third participant 14 of the first base 11A want to be aware of the presence of the sound is transmitted using sound signals, video signals, and acceleration signals. This is a process of estimating and calculating from body movement and line of sight.
The output of step S250 is not until the second participant 15 of the second base 11B speaks, for example, but the first participant 13 and the third participant 14 of the first base 11A want to care about their existence. Or, it is a value that represents the degree of base situation information that you want to notice.
Thus, based on the second site status information and the range of the first participant's sense amount, the degree of presenting the amount of intention representing the degree of intention to participate in the conference that appears in the speech of the remote participant is adjusted. It can be carried out.

＜第３実施形態＞
図５は本発明の第３実施形態に係るコミュニケーション補助システムの動作を表すフローチャートであり、特に、図４に示す第１拠点状況算出処理（Ｓ２１０）の具体的な例を表すフローチャートである。
まず、図５に示すフローチャートについて概略的に説明する。
ステップＳ２１１では、第２参加者１５が予め定められた時間以上発言していないか判定する。第２参加者１５が予め定められた時間以上発言していない場合は、ステップＳ２１２に進み、変数の初期化を行う。ステップＳ２１３では、参加者の挙動に特徴的動作があるか判定する。特徴的動作がある場合はステップＳ２１４に進み、発言者の特定と視線検出の処理を行う。
ステップＳ２１５では、発言者の視線の先は第２参加者１５に相当する場所以外であるか判定する。発言者の視線の先は第２参加者１５に相当する場所以外である場合はステップＳ２１６に進み、検出された第１拠点１１Ａの特徴的動作の回数を更新する。次いで、ステップＳ２１７では、検出された第１拠点１１Ａの特徴的動作の総量を更新する。次いで、ステップＳ２１８では、ループ変数を更新する。次いで、ステップＳ２１９では、次のステップで考慮する時刻が０より小さいか判定する。
次のステップで考慮する時刻が０より小さい場合は、ステップＳ２２０に進み、検出された第１拠点１１Ａの特徴的動作の回数が予め定められた特徴的動作の回数の閾値より大きいか判定する。
第１拠点１１Ａの特徴的動作の回数が予め定められた特徴的動作の回数の閾値より大きい場合は、ステップＳ２２１に進み、第１拠点１１Ａの拠点状況情報を表すフラグを算出する。次いで、ステップＳ２２２では、算出されたデータを出力する。 <Third Embodiment>
FIG. 5 is a flowchart showing the operation of the communication assistance system according to the third embodiment of the present invention, and in particular, is a flowchart showing a specific example of the first site situation calculation process (S210) shown in FIG.
First, the flowchart shown in FIG. 5 will be schematically described.
In step S211, it is determined whether the second participant 15 has spoken for a predetermined time. If the second participant 15 has not spoken for a predetermined time or longer, the process proceeds to step S212, and variables are initialized. In step S213, it is determined whether there is a characteristic action in the behavior of the participant. If there is a characteristic action, the process proceeds to step S214, where the speaker is identified and the line of sight is detected.
In step S 215, it is determined whether the point of the speaker's line of sight is other than the place corresponding to the second participant 15. If the point of sight of the speaker is other than the place corresponding to the second participant 15, the process proceeds to step S216, and the number of detected characteristic actions of the first base 11A is updated. Next, in step S217, the detected total amount of characteristic operations of the first base 11A is updated. Next, in step S218, the loop variable is updated. Next, in step S219, it is determined whether the time taken into consideration in the next step is smaller than zero.
When the time to be considered in the next step is smaller than 0, the process proceeds to step S220, and it is determined whether the number of detected characteristic actions of the first base 11A is larger than a predetermined threshold value of the number of characteristic actions.
When the number of characteristic actions of the first site 11A is larger than a predetermined threshold value of the number of characteristic actions, the process proceeds to step S221, and a flag representing the site status information of the first site 11A is calculated. Next, in step S222, the calculated data is output.

ここで、図５に示すフローチャートに用いる変数の定義について説明する。
Ｔｉｍｅ＿ｃｕｒｅｎｔは、現在の時刻を表す。Ｔｉｍｅ＿ｉｎｔｅｒｖａｌは、予め定められた時間の間隔を表す値で、現在の時刻からこの時間の間隔毎に特長的動作の検出を行うために使用する。Ｔｉｍｅ＿ｎｏｎＶｏｉｃｅは、予め定められた時間の長さを表す値で、第２参加者１５がこの値以上に無音区間であるかを判定するステップで使用する。Ｆｌａｇ＿１１Ａは、第１拠点１１Ａの拠点状況情報を表すフラグであり、０か１の値をとる。
１の時に、第１拠点１１Ａの拠点状況情報は第２拠点１１Ｂの拠点状況情報を忘れて議論に白熱している状態を表す。それ以外のときは０の値をとる。ｊは、ループの変数を表す。ＭｏｔｉｏｎＶａｌｕｅ＿ａｖｅｒａｇｅは、予め定められた特徴的動作の大きさの平均値を表す。ＭｏｔｉｏｎＣＴＳ＿Ｔｈｒｅｓｈｏｌｄは、予め定められた特徴的動作の回数の閾値を表す。ＭｏｔｉｏｎＶａｌｕｅ＿ｄｅｔｅｃｔｉｏｎは、検出された特徴的動作の量を表す。ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａは、検出された第１拠点１１Ａの特徴的動作の総量を表す。ＭｏｔｉｏｎＣＴＳ＿１１Ａは、検出された第１拠点１１Ａの特徴的動作の回数を表す。 Here, the definition of variables used in the flowchart shown in FIG. 5 will be described.
Time_current represents the current time. Time_interval is a value that represents a predetermined time interval, and is used to detect a characteristic action at each time interval from the current time. Time_nonVoice is a value that represents a predetermined length of time, and is used in the step of determining whether the second participant 15 is in a silent section greater than this value. Flag_11A is a flag representing the base state information of the first base 11A, and takes a value of 0 or 1.
At 1, the site status information of the first site 11A represents a state in which the site status information of the second site 11B is forgotten and the discussion is heated. Otherwise, it takes a value of 0. j represents a variable of the loop. MotionValue_average represents an average value of predetermined characteristic motion magnitudes. MotionCTS_Threshold represents a predetermined threshold value of the number of characteristic operations. MotionValue_detection represents the amount of characteristic motion detected. MotionValue_11A represents the total amount of characteristic operations of the detected first base 11A. MotionCTS_11A represents the number of detected characteristic operations of the first base 11A.

図５に示すフローチャートにおいて、第２参加者１５が予め定められた時間以上発言していないか判定するステップＳ２１１の入力は、第２拠点１１Ｂの拠点状況情報を計測したデータである。例えば、第２拠点１１Ｂの第２参加者１５のマイクにより計測された音声信号である。
第２参加者１５が予め定められた時間以上発言していないか判定するステップＳ２１１の処理は、第２参加者１５が予め定められた時間Ｔｉｍｅ＿ｎｏｎＶｏｉｃｅ以上に発言していないか判定する処理である。現在の時刻Ｔｉｍｅ＿ｃｕｒｅｎｔからＴｉｍｅ＿ｃｕｒｅｎｔ−Ｔｉｍｅ＿ｎｏｎＶｏｉｃｅまで第２参加者１５が予め定められた音量以上の音声を発していない場合、第２参加者１５が予め定められた時間以上発言していない、と判定される。第２参加者１５が予め定められた時間以上発言していないか判定するステップＳ２１１の出力は、「はい」又は「いいえ」である。「はい」である場合はＳ２１２へ、「いいえ」である場合は終了のステップへ進む。加えて、現在の時刻Ｔｉｍｅ＿ｃｕｒｅｎｔを出力する。
次いで、変数の初期化を行うステップＳ２１２の入力は、Ｓ２１１の出力である。変数の初期化を行うステップＳ２１２の処理は、変数の初期化を行う処理である。具体的には、第１拠点１１Ａの拠点状況情報を表すフラグＦｌａｇ＿１１Ａと、検出された第１拠点１１Ａの特徴的動作の総量ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａと、検出された第１拠点１１Ａの特徴的動作の回数ＭｏｔｉｏｎＣＴＳ＿１１Ａと、ループの変数ｊと、の値を０にすることである。変数の初期化を行うステップＳ２１２の出力は、Ｆｌａｇ＿１１ＡとＭｏｔｉｏｎＶａｌｕｅ＿１１ＡとＭｏｔｉｏｎＣＴＳ＿１１Ａとｊと、Ｓ２１１で出力されたＴｉｍｅ＿ｃｕｒｅｎｔの値である。 In the flowchart shown in FIG. 5, the input in step S211 for determining whether or not the second participant 15 has spoken for a predetermined time or more is data obtained by measuring the site status information of the second site 11B. For example, it is an audio signal measured by the microphone of the second participant 15 at the second base 11B.
The process of step S211 for determining whether or not the second participant 15 has spoken for a predetermined time or more is a process for determining whether or not the second participant 15 has spoken for a predetermined time Time_nonVoice or more. When the second participant 15 does not utter a sound higher than a predetermined volume from the current time Time_current to Time_current-Time_nonVoice, it is determined that the second participant 15 does not speak for a predetermined time or more. . The output of step S211 for determining whether or not the second participant 15 has spoken for a predetermined time is “Yes” or “No”. If “yes”, the process proceeds to S212, and if “no”, the process proceeds to an end step. In addition, the current time Time_current is output.
Next, the input of step S212 for initializing variables is the output of S211. The process of step S212 for initializing variables is a process for initializing variables. Specifically, a flag Flag_11A representing the base state information of the first base 11A, a detected total amount of characteristic motion of the first base 11A, MotionValue_11A, and a detected number of characteristic actions of the first base 11A, MotionCTS_11A, The value of the variable j of the loop is set to 0. The output of step S212 for initializing variables is Flag_11A, MotionValue_11A, MotionCTS_11A and j, and the value of Time_current output in S211.

次いで、特徴的動作があるか判定するステップＳ２１３の入力は、Ｓ２１２の出力と、予め定められた正規化された特徴的動作の信号である。特徴的動作があるか判定するステップＳ２１３の処理は、時刻｛Ｔｉｍｅ＿ｃｕｒｅｎｔ − ｊ×予め定められた時間の間隔Ｔｉｍｅ＿ｉｎｔｅｒｖａｌ｝から、時刻｛Ｔｉｍｅ＿ｃｕｒｅｎｔ − （ｊ＋１）×Ｔｉｍｅ＿ｉｎｔｅｒｖａｌ｝まで、の間に予め定められた特徴的動作があるか判定する処理である。
具体的には、加速度センサの信号を用いる場合、予め定められた正規化された特徴的動作、例えば、うなずくという動作の、加速度センサのｘ、ｙ、ｚの信号を用意し、パターン認識の処理により、上記時刻の区間に、うなずくという動作の信号とマッチしている信号があるかを判定する。上記の予め定められた特徴的動作は、１つの動作に限ることなく、複数の動作を用意しておいてもよい。検出したい動作の数だけ用意する必要がある。 Next, the input of step S213 for determining whether or not there is a characteristic action is the output of S212 and a signal of a predetermined normalized characteristic action. The process of step S213 for determining whether or not there is a characteristic action is predetermined between time {Time_current−j × predetermined time interval Time_interval} and time {Time_current− (j + 1) × Time_interval}. This is a process for determining whether or not there is a characteristic action.
Specifically, when using the signal of the acceleration sensor, a predetermined normalized characteristic operation, for example, an operation of nodding is prepared, and signals of x, y, and z of the acceleration sensor are prepared, and pattern recognition processing is performed. Thus, it is determined whether or not there is a signal that matches the nodding operation signal in the time interval. The predetermined characteristic operation is not limited to one operation, and a plurality of operations may be prepared. It is necessary to prepare as many actions as you want to detect.

このフローチャートでは、検出したい特徴的動作が１つである場合を例に説明したものである。特徴的動作があるか判定するステップＳ２１３の出力は、「はい」、又は「いいえ」、である。上記処理にて特徴的動作が抽出された場合は「はい」、されなかった場合は「いいえ」、を出力する。「はい」である場合はステップＳ２１４へ、「いいえ」である場合はＳ２１８のステップに進む。
加えて、上記処理にて得られた特徴的動作のデータである特徴的動作情報の種類、回数、その大きさを出力する。具体的には、特徴的動作の大きさは、検出された特徴的動作の量ＭｏｔｉｏｎＶａｌｕｅ＿ｄｅｔｅｃｔｉｏｎの変数に保持する。例えば、大きくうなずいた場合、予め定められた正規化された加速度センサの信号より大きな信号となる。加速度センサの信号、具体的には波形の振幅などのデータにより、ステップＳ２１３の処理にて大きさを算出し、出力する。
加えて、ｊ、Ｆｌａｇ＿１１Ａ、ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａ、ＭｏｔｉｏｎＣＴＳ＿１１Ａ、Ｔｉｍｅ＿ｃｕｒｅｎｔの値も出力する。これらの変数の値は、その度、出力してもよいし、データベースで管理してもよい。今後、出力や入力に関するデータについては必要に応じて省略する。 In this flowchart, the case where there is one characteristic operation to be detected is described as an example. The output of step S213 for determining whether or not there is a characteristic action is “Yes” or “No”. If a characteristic action is extracted by the above process, “Yes” is output, and if it is not, “No” is output. If “yes”, the process proceeds to step S214. If “no”, the process proceeds to step S218.
In addition, the type, number of times, and the size of characteristic action information that is characteristic action data obtained by the above processing are output. Specifically, the magnitude of the characteristic action is held in a variable of the detected characteristic action amount MotionValue_detection. For example, when nodding loudly, the signal is larger than a signal of a predetermined normalized acceleration sensor. The magnitude is calculated in the process of step S213 based on the acceleration sensor signal, specifically the data such as the amplitude of the waveform, and output.
In addition, the values of j, Flag_11A, MotionValue_11A, MotionCTS_11A, and Time_current are also output. The values of these variables may be output each time or may be managed in a database. In the future, output and input data will be omitted as necessary.

次いで、発言者の特定と視線検出の処理を行うステップＳ２１４の入力は、ステップＳ２１３の出力である。発言者の特定と視線検出の処理を行うステップＳ２１４の処理は、第１拠点１１Ａで発言している発言者の特定と、発言している発言者の発言相手を視線検出により特定する処理である。
具体的には、例えば、音声信号により、第１拠点１１Ａにいる第１参加者１３と第３参加者１４のうち、どちら、又は両方が会話しているのか、又はだれも会話していないのかを判定する。１人以上の発話が確認された場合、映像信号により、視線を検出し、発言相手を特定する処理を行う。発言者の特定と視線検出の処理を行うステップＳ２１４の出力は、発言者とその視線の先を表すデータである。
発言者の視線の先は第２参加者１５に相当する場所以外であるか判定するステップＳ２１５の入力は、ステップＳ２１４の出力である。 Next, the input in step S214 for performing the speaker identification and line-of-sight detection processing is the output in step S213. The process of step S214 for performing the process of specifying the speaker and detecting the line of sight is a process of specifying the speaker who is speaking at the first base 11A and specifying the speaking partner of the speaker who is speaking by detecting the line of sight. .
Specifically, for example, by voice signal, which one or both of the first participant 13 and the third participant 14 at the first base 11A are talking, or no one is talking. Determine. When one or more utterances are confirmed, a line of sight is detected from the video signal, and a process of specifying the speaking partner is performed. The output of step S214, which performs speaker identification and line-of-sight detection processing, is data representing the speaker and the point of the line of sight.
The input of step S215 for determining whether the point of the speaker's line of sight is outside the place corresponding to the second participant 15 is the output of step S214.

発言者の視線の先は、第２参加者１５に相当する場所以外であるか判定するステップＳ２１５の処理は、ステップＳ２１４の出力である、発言者の視線の先が第２参加者１５を表すものでないかを判定する処理である。発言者の視線の先は第２参加者１５に相当する場所以外であるか判定するステップＳ２１５の出力は、「はい」又は「いいえ」である。「はい」である場合はＳ２２０へ、「いいえ」である場合は終了のステップに進む。
例えば、図１に示す第１拠点１１Ａには第２参加者１５を表すアバターロボット１７が存在しているとする。具体的には、発言者の視線の先がアバターロボット１７である場合は、発言者の視線の先は第２参加者１５を表すものであるため、判定は「いいえ」、となる。例えば、多くの遠隔テレビ会議で用いられているように、スクリーンに遠隔地参加者の様子を映す場合、発言者の視線の先がスクリーンであった場合、判定は「いいえ」となる。逆に、発言者が、自分と同じ拠点にいる参加者に向けて発言している場合、具体的には、第１拠点１１Ａの第１参加者１３が第３参加者１４に向けて発言している場合は、判定は、「はい」、になる。 The process of step S215 for determining whether the point of the speaker's line of sight is other than the place corresponding to the second participant 15 is the output of step S214. The point of the speaker's line of sight represents the second participant 15. This is a process for determining whether or not a thing is present. The output of step S215 for determining whether the point of sight of the speaker is outside the place corresponding to the second participant 15 is “Yes” or “No”. If “yes”, the process proceeds to S220, and if “no”, the process proceeds to an end step.
For example, it is assumed that an avatar robot 17 representing the second participant 15 exists in the first base 11A shown in FIG. Specifically, when the speaker's line of sight is the avatar robot 17, the determination is “No” because the speaker's line of sight represents the second participant 15. For example, as used in many remote video conferences, when a remote participant is shown on the screen, if the speaker's line of sight is the screen, the determination is “No”. Conversely, when the speaker is speaking to a participant who is at the same base as himself, specifically, the first participant 13 at the first base 11A speaks to the third participant 14. If yes, the determination is “yes”.

次いで、検出された第１拠点１１Ａの特徴的動作情報の回数を更新するステップＳ２１６の入力は、ステップＳ２１５の出力、「はい」、である。加えて、ステップＳ２１３で検出した特徴的動作情報のデータである。検出された第１拠点１１Ａの特徴的動作情報の回数を更新するステップＳ２１６の処理は、検出された第１拠点１１Ａの特徴的動作情報の回数ＭｏｔｉｏｎＣＴＳ＿１１Ａの値を更新する処理である。
具体的には、例えば、ステップＳ２１３にて検出された特徴的動作情報が１つである場合、ＭｏｔｉｏｎＣＴＳ＿１１Ａの値に１を加える。仮に、ステップＳ２１３にて検出された特徴的動作情報が１つ以上である場合には、その数に応じて値を加えてもよい。検出された第１拠点１１Ａの特徴的動作情報の回数を更新するステップＳ２１６の出力は、更新されたＭｏｔｉｏｎＣＴＳ＿１１Ａの値である。
次いで、検出された第１拠点１１Ａの特徴的動作情報の総量を更新するステップＳ２１７の入力は、ステップＳ２１３で検出した特徴的動作情報のデータである。具体的には、例えば、検出された特徴的動作情報の量Ｍｏｔｉｏｎ＿ｄｅｔｅｃｔｉｏｎである。検出された第１拠点１１Ａの特徴的動作情報の総量を更新するステップＳ２１７の処理は、ステップＳ２１７の入力データから特徴的動作情報の総量ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａを更新する処理を行う。 Next, the input of step S216 for updating the number of detected characteristic operation information of the first base 11A is the output of step S215, “Yes”. In addition, the characteristic operation information data detected in step S213. The process of step S216 for updating the number of detected characteristic motion information of the first base 11A is a process of updating the value of the detected number of characteristic motion information of the first base 11A MotionCTS_11A.
Specifically, for example, when there is only one characteristic motion information detected in step S213, 1 is added to the value of MotionCTS_11A. If there is one or more characteristic motion information detected in step S213, a value may be added according to the number. The output of step S216 for updating the number of detected characteristic operation information of the first base 11A is the updated value of MotionCTS_11A.
Next, the input in step S217 for updating the total amount of characteristic operation information of the first base 11A detected is the data of characteristic operation information detected in step S213. Specifically, for example, the amount of detected characteristic motion information Motion_detection. The process of step S217 for updating the detected total amount of characteristic operation information of the first base 11A performs a process of updating the total amount of characteristic operation information MotionValue_11A from the input data in step S217.

例えば、ステップＳ２１３にて検出された特徴的動作情報が１つである場合、具体的には、その検出された特徴的動作情報の大きさの平均値がＭｏｔｉｏｎＶａｌｕｅ＿ａｖｅｒａｇｅである場合、検出された第１拠点１１Ａの特徴的動作情報の総量ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａの値は、現在のＭｏｔｉｏｎＶａｌｕｅ＿１１Ａの値に、ＭｏｔｉｏｎＶａｌｕｅ＿ｄｅｔｅｃｔｉｏｎ÷ＭｏｔｉｏｎＶａｌｕｅ＿ａｖｅｒａｇｅの値を加えたものである。
ステップＳ２１３にて検出された特徴的動作情報が大きい場合、ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａの値は大きな値が加えられることになる。これは、議論が白熱すれば体の動きも大きくなることを反映した値である。検出された第１拠点１１Ａの特徴的動作情報の総量を更新するステップＳ２１７の出力は、上記処理にて算出され、更新された、第１拠点１１Ａの特徴的動作情報の総量ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａである。 For example, when the number of characteristic motion information detected in step S213 is one, specifically, when the average value of the size of the detected characteristic motion information is MotionValue_average, the first detected The total value MotionValue_11A of the characteristic operation information of the base 11A is obtained by adding the value of MotionValue_detection / MotionValue_average to the current value of MotionValue_11A.
When the characteristic motion information detected in step S213 is large, a large value is added to the value of MotionValue_11A. This is a value that reflects the fact that the body movement increases as the discussion gets heated. The output of step S217 for updating the detected total amount of characteristic operation information of the first base 11A is the total amount MotionValue_11A of the characteristic operation information of the first base 11A calculated and updated by the above-described processing.

次いで、ループ変数を更新するステップＳ２１８の入力は、現在のｊの値である。ループ変数を更新するステップＳ２１８の処理は、ｊに１を加える処理である。ループ変数を更新するステップＳ２１８の出力は、更新されたｊの値である。これは、ステップＳ２１３にて特徴的動作情報を検出する時刻の区間を更新するために行う処理である。
次のステップで考慮する時刻が０より小さいか判定するステップＳ２１９の入力は、ステップＳ２１８の出力であるｊと、Ｔｉｍｅ＿ｃｕｒｅｎｔとＴｉｍｅ＿ｉｎｔｅｒｖａｌである。次のステップで考慮する時刻が０より小さいか判定するステップＳ２１９の処理は、次のステップで行うステップＳ２１３にて特徴的動作情報を検出する時刻の区間が、０より小さいかを判定するステップである。具体的には、Ｔｉｍｅ＿ｃｕｒｅｎｔ −（ｊ＋１）×Ｔｉｍｅ＿ｉｎｔｅｒｖａｌの値が０より小さいかを判定する。０より小さい場合は、「はい」、０以上である場合は「いいえ」、となる。
次のステップで考慮する時刻が０より小さいか判定するステップＳ２１９の出力は、「はい」、又は「いいえ」である。「はい」である場合はステップＳ２２０へ、「いいえ」である場合はステップＳ２１３のステップへ進む。
次いで、検出された第１拠点１１Ａの特徴的動作情報の回数が予め定められた特徴的動作情報の回数の閾値より大きいか判定するステップＳ２２０の入力は、ステップＳ２１９の「はい」、とＭｏｔｉｏｎＣＴＳ＿１００と予め定められた特徴的動作情報の回数の閾値ＭｏｔｉｏｎＣＴＳ＿Ｔｈｒｅｓｈｏｌｄの値である。 Next, the input of step S218 for updating the loop variable is the current value of j. The process of step S218 for updating the loop variable is a process of adding 1 to j. The output of step S218 for updating the loop variable is the updated value of j. This is a process performed to update the time interval for detecting characteristic motion information in step S213.
The input of step S219 for determining whether the time taken into consideration in the next step is smaller than 0 is j, Time_current and Time_interval which are the outputs of step S218. The process of step S219 for determining whether the time to be considered in the next step is smaller than 0 is a step for determining whether the time interval for detecting characteristic motion information in step S213 performed in the next step is smaller than 0. is there. Specifically, it is determined whether the value of Time_current− (j + 1) × Time_interval is smaller than zero. If it is less than 0, “Yes” is set, and if it is 0 or more, “No” is set.
The output of step S219 for determining whether the time to be considered in the next step is less than 0 is “Yes” or “No”. If “yes”, the process proceeds to step S220, and if “no”, the process proceeds to step S213.
Next, the input in step S220 for determining whether or not the number of detected characteristic operation information of the first base 11A is greater than a predetermined threshold value of the number of characteristic operation information is “Yes” in step S219, and MotionCTS_100. This is the value of the threshold value MotionCTS_Threshold of the number of times of predetermined characteristic motion information.

検出された第１拠点１１Ａの特徴的動作情報の回数が予め定められた特徴的動作情報の回数の閾値より大きいか判定するステップＳ２２０の処理は、ステップＳ２１６にて更新されたＭｏｔｉｏｎＣＴＳ＿１００の値がＭｏｔｉｏｎＣＴＳ＿Ｔｈｒｅｓｈｏｌｄより大きいかを判定する処理を行う。
ＭｏｔｉｏｎＣＴＳ＿１００の値がＭｏｔｉｏｎＣＴＳ＿Ｔｈｒｅｓｈｏｌｄより大きい場合には「はい」、小さい場合には「いいえ」と判定する。例えば、第１拠点１１Ａにいる第１参加者１３が、かつステップＳ２１４、ステップＳ２１５で第２参加者１５を表すもの以外に視線を向けた状態で、特徴的動作情報の回数が多く検出された場合、具体的には、第２拠点１１Ｂの第２参加者１５を忘れて議論が白熱している可能性がある状態である。その回数が予め定められた回数以上である場合には、議論が白熱している状態であると判定する処理を行っている。検出された第１拠点１１Ａの特徴的動作情報の回数が予め定められた特徴的動作情報の回数の閾値より大きいか判定するステップＳ２２０の出力は、上記処理にて判定された「はい」、又は「いいえ」である。 In the process of step S220 for determining whether or not the number of detected characteristic motion information of the first base 11A is larger than a predetermined threshold value of the number of characteristic motion information, the value of MotionCTS_100 updated in step S216 is MotionCTS_Threshold. A process for determining whether the value is larger is performed.
When the value of MotionCTS_100 is larger than MotionCTS_Threshold, it is determined as “Yes”, and when it is smaller, it is determined as “No”. For example, a large number of characteristic motion information was detected in a state where the first participant 13 at the first base 11A turned his gaze to something other than the one representing the second participant 15 in steps S214 and S215. In this case, specifically, the second participant 15 at the second base 11B is forgotten and there is a possibility that the discussion is incandescent. When the number of times is equal to or greater than a predetermined number, processing is performed to determine that the discussion is in a heated state. The output of step S220 for determining whether the number of detected characteristic operation information of the first base 11A is greater than a predetermined threshold value of the number of characteristic operation information is “Yes” determined in the above process, or “No”.

第１拠点１１Ａの拠点状況情報を表すフラグを算出するステップＳ２２１の入力は、ステップＳ２２０の出力、「はい」、である。第１拠点１１Ａの拠点状況情報を表すフラグを算出するステップＳ２２１の処理は、第１拠点１１Ａの拠点状況情報を表すフラグＦｌａｇ＿１１Ａの値を算出する処理である。Ｆｌａｇ＿１１Ａは、０か１の値をとる。Ｆｌａｇ＿１１Ａは、１の時、第１拠点１１Ａの拠点状況情報は第２拠点１１Ｂの状況を忘れて議論に白熱している状態を表す。それ以外のときは０の値をとる。具体的には、例えば、ステップＳ２２０にて、第２拠点１１Ｂの第２参加者１５を忘れて議論が白熱している状態であると判定された場合には１の値を設定する。それ以外の時には０の値を設定する処理を行う。第１拠点１１Ａの拠点状況情報を表すフラグを算出するステップＳ２２１の出力は、Ｆｌａｇ＿１１Ａの値である。
算出されたデータを出力するステップＳ２２２の入力は、Ｆｌａｇ＿１１Ａと、ＭｏｔｉｏｎＣＴＳ＿１１Ａと、ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａの値である。算出されたデータを出力するステップＳ２２２の処理は、入力データを出力することである。これまでのステップにて算出、更新されたデータの値を出力する処理を行う。算出されたデータを出力するステップＳ２２２の出力は、Ｆｌａｇ＿１１Ａと、ＭｏｔｉｏｎＣＴＳ＿１１Ａと、ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａの値である。 The input of step S221 for calculating the flag representing the site status information of the first site 11A is the output of step S220, “Yes”. The process of step S221 for calculating the flag indicating the site status information of the first site 11A is a process of calculating the value of the flag Flag_11A indicating the site status information of the first site 11A. Flag_11A takes a value of 0 or 1. When Flag_11A is 1, the base state information of the first base 11A represents a state in which the situation of the second base 11B is forgotten and heated to discussion. Otherwise, it takes a value of 0. Specifically, for example, when it is determined in step S220 that the second participant 15 at the second base 11B is forgotten and the discussion is heated, a value of 1 is set. In other cases, a process of setting a value of 0 is performed. The output of step S221 for calculating the flag representing the site status information of the first site 11A is the value of Flag_11A.
The input of step S222 for outputting the calculated data is the values of Flag_11A, MotionCTS_11A, and MotionValue_11A. The process of step S222 for outputting the calculated data is to output the input data. A process for outputting the data value calculated and updated in the previous steps is performed. The output of step S222 for outputting the calculated data is the values of Flag_11A, MotionCTS_11A, and MotionValue_11A.

例えば、第１拠点１１Ａの状況が第２拠点１１Ｂの状況を忘れて議論に白熱している状態である場合、具体的には、Ｆｌａｇ＿１１Ａは１の値をとる。白熱している度合いが大きい場合には、ＭｏｔｉｏｎＣＴＳ＿１１Ａと、ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａの値が大きくなり、白熱している度合いが小さい場合には、ＭｏｔｉｏｎＣＴＳ＿１１Ａと、ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａの値は小さくなる。
このように、第１特徴的動作情報、発言者情報、及び対話相手情報を第１拠点状況情報として用いることで、遠隔地の参加者の言動に現れる会議への参加意思の程度を表す意思量を提示する度合いの調整を行うことができる。
また、所定時間内における第２拠点１１Ｂの音声信号が所定音量以下である場合に、第２拠点１１Ｂの映像信号に基づいて第２特徴的動作情報を抽出するので、遠隔地の参加者の言動に現れる会議への参加意思の程度を表す意思量を算出することができる。 For example, when the status of the first site 11A is in a state where the status of the second site 11B is forgotten and the discussion is heated, specifically, Flag_11A takes a value of 1. When the degree of incandescence is large, the values of MotionCTS_11A and MotionValue_11A are large, and when the degree of incandescence is small, the values of MotionCTS_11A and MotionValue_11A are small.
In this way, the first characteristic action information, the speaker information, and the conversation partner information are used as the first site situation information, and the will amount representing the degree of intention to participate in the conference that appears in the behavior of the remote participant The degree of presenting can be adjusted.
In addition, when the audio signal of the second base 11B within a predetermined time is below a predetermined volume, the second characteristic motion information is extracted based on the video signal of the second base 11B. It is possible to calculate the amount of intention that represents the degree of intention to participate in the conference that appears.

＜第４実施形態＞
図６は、本発明の第４実施形態に係るコミュニケーション補助システムの動作を表すフローチャートであり、特に、図４に示す第２拠点状況算出処理（ステップＳ２５０）の具体的な例を説明するためのフローチャートである。
まず、図６に示すフローチャートについて概略的に説明する。
ステップＳ２５１では、第２参加者が予め定められた時間以上発言していないか判定する。ステップＳ２５２では、変数の初期化を行う。ステップＳ２５３では、特徴的動作情報があるか判定する。ステップＳ２５４では、検出された第２拠点１１Ｂの特徴的動作情報の回数を更新する。ステップＳ２５５では、検出された第２拠点１１Ｂの特徴的動作情報の総量を更新する。ステップＳ２５６では、ループ変数を更新する。ステップＳ２５７では、次のステップで考慮する時刻が０より小さいか判定する。ステップＳ２５８では、検出された第２拠点１１Ｂの特徴的動作情報の回数が予め定められた特徴的動作情報の回数の閾値より大きいか判定する。ステップＳ２５９では、第２拠点１１Ｂの拠点状況情報を表すフラグを算出する。ステップＳ２６０では、算出されたデータを出力する。 <Fourth embodiment>
FIG. 6 is a flowchart showing the operation of the communication assistance system according to the fourth embodiment of the present invention, and in particular, for explaining a specific example of the second site situation calculation process (step S250) shown in FIG. It is a flowchart.
First, the flowchart shown in FIG. 6 will be schematically described.
In step S251, it is determined whether the second participant has spoken for a predetermined time. In step S252, variables are initialized. In step S253, it is determined whether there is characteristic operation information. In step S254, the number of detected characteristic operation information of the second base 11B is updated. In step S255, the total amount of characteristic operation information of the detected second base 11B is updated. In step S256, the loop variable is updated. In step S257, it is determined whether the time taken into consideration in the next step is less than zero. In step S258, it is determined whether the number of detected characteristic motion information of the second base 11B is greater than a predetermined threshold value of the number of characteristic motion information. In step S259, a flag representing the base state information of the second base 11B is calculated. In step S260, the calculated data is output.

ここで、図６に示すフローチャートに用いる変数の定義について説明する。
なお、変数の定義のうち、図５を参照して説明した変数については省略する。
Ｆｌａｇ＿１１Ｂは、第２拠点１１Ｂの拠点状況情報を表すフラグ。０か１の値をとる。１の時に、第２拠点１１Ｂの拠点状況情報は第１拠点１１Ａにいる参加者に発言するまでではないが自分の存在に気づいて欲しいと思っている拠点状況情報を表す。それ以外のときは０の値をとる。ｉは、ループの変数を表す。ＭｏｔｉｏｎＶａｌｕｅ＿１１Ｂは、検出された第２拠点１１Ｂの特徴的動作情報の総量を表す。ＭｏｔｉｏｎＣＴＳ＿１１Ｂは、検出された第２拠点１１Ｂの特徴的動作情報の回数を表す。
第２参加者１５が予め定められた時間以上発言していないか判定するステップＳ２５１は、図５に示すステップＳ２１１と同じである。
変数の初期化を行うステップＳ２５２は、図４に示すステップＳ２１２と同様の内容である。ステップＳ２５２では、検出された第２拠点１１Ｂの特徴的動作情報の総量ＭｏｔｉｏｎＶａｌｕｅ＿１１Ｂ、検出された第２拠点１１Ｂの特徴的動作情報の回数ＭｏｔｉｏｎＣＴＳ＿１１Ｂと、ループの変数ｉと、第２拠点１１Ｂの拠点状況情報を表すフラグＦｌａｇ＿１１Ｂを用いており、ＭｏｔｉｏｎＶａｌｕｅ＿１１ＢはステップＳ２１２のＭｏｔｉｏｎＶａｌｕｅ＿１１Ａに、ＭｏｔｉｏｎＣＴＳ＿１１ＢはステップＳ２１２のＭｏｔｉｏｎＣＴＳ＿１１Ａに、ｉはステップＳ２１２のｊに、Ｆｌａｇ＿１１ＢはステップＳ２１２のＦｌａｇ＿１１Ａに相当する。 Here, the definition of variables used in the flowchart shown in FIG. 6 will be described.
Of the variable definitions, the variables described with reference to FIG. 5 are omitted.
Flag_11B is a flag representing the base state information of the second base 11B. It takes a value of 0 or 1. At 1, the site status information of the second site 11B represents the site status information that the user wants to be aware of, but not to speak to the participants at the first site 11A. Otherwise, it takes a value of 0. i represents a variable of the loop. MotionValue_11B represents the total amount of characteristic operation information of the detected second base 11B. MotionCTS_11B represents the number of detected characteristic operation information of the second base 11B.
Step S251 for determining whether or not the second participant 15 has spoken for a predetermined time or more is the same as step S211 shown in FIG.
Step S252 for initializing variables has the same contents as step S212 shown in FIG. In step S252, the total amount of detected characteristic motion information of the second base 11B, MotionValue_11B, the number of detected characteristic motion information of the second base 11B, MotionCTS_11B, the loop variable i, and the base state of the second base 11B The flag Flag_11B representing the information is used. The MotionValue_11B corresponds to the MotionValue_11A in step S212, the MotionCTS_11B corresponds to the MotionCTS_11A in step S212, i corresponds to j in the step S212, and Flag_11B corresponds to Flag_11A in the step S212.

特徴的動作情報があるか判定するステップＳ２５３は、図４に示すステップＳ２１３と同様の内容である。ただし、図４に示すステップＳ２１３では、第１拠点１１Ａの第１参加者１３と第３参加者１４の、テレビ会議に白熱している様子を表す特徴的動作情報を抽出していたが、ステップＳ２５３で対象とする特徴的動作情報は、第２拠点１１Ｂの第２参加者１５が、自分の存在にきづいて欲しいと思った時にみせる特徴的動作情報である。例えば、ため息をつく動作など、テレビ会議においていかれていると思った時にみせる特徴的動作情報である。
次いで、検出された第２拠点１１Ｂの特徴的動作情報の回数を更新するステップＳ２５４は、図５のステップＳ２１６と同様の内容である。ただし、用いる変数は、図６の変数の定義にて説明した第２拠点１１Ｂに関する変数である。
検出された第２拠点１１Ｂの特徴的動作情報の総量を更新するステップＳ２５５は、図５に示すステップＳ２１７と同様の内容である。ただし、用いる変数は、図６の変数の定義にて説明した第２拠点１１Ｂに関する変数である。
ループ変数を更新するステップＳ２５６は、図５に示すステップＳ２１８と同様の内容である。ただし用いる変数はｉである。
次のステップで考慮する時刻が０より小さいか判定するステップＳ２５７は、図５に示すステップＳ２１９と同様の内容である。 Step S253 for determining whether or not there is characteristic operation information is the same as step S213 shown in FIG. However, in step S213 shown in FIG. 4, characteristic operation information representing the state in which the first participant 13 and the third participant 14 of the first base 11A are incandescent in the video conference is extracted. The characteristic operation information targeted in S253 is characteristic operation information that is displayed when the second participant 15 at the second base 11B wants to know his / her presence. For example, it is characteristic operation information that is shown when the user thinks he / she is in a video conference, such as a sighing operation.
Next, Step S254 for updating the number of detected characteristic operation information of the second base 11B is the same as Step S216 in FIG. However, the variable to be used is a variable related to the second base 11B described in the definition of the variable in FIG.
Step S255 for updating the detected total amount of characteristic operation information of the second base 11B is the same as step S217 shown in FIG. However, the variable to be used is a variable related to the second base 11B described in the definition of the variable in FIG.
Step S256 for updating the loop variable has the same contents as step S218 shown in FIG. However, the variable used is i.
Step S257 for determining whether the time to be considered in the next step is less than 0 is the same as step S219 shown in FIG.

次いで、検出された第２拠点１１Ｂの特徴的動作情報の回数が予め定められた特徴的動作情報の回数の閾値より大きいか判定するステップＳ２５８は、図５に示すステップＳ２２０と同様の内容である。ただし、用いる変数は、図６の変数の定義にて説明した第２拠点１１Ｂに関する変数である。
次いで、第２拠点１１Ｂの拠点状況情報を表すフラグを算出するステップＳ２５９は、図５のステップＳ２２１と同様の内容である。ただし、用いる変数は、図６の変数の定義にて説明した第２拠点１１Ｂに関する変数である。Ｆｌａｇ＿１１Ｂは、第２拠点１１Ｂの拠点状況情報を表すフラグである。１の時、第２拠点１１Ｂの拠点状況情報は第１拠点１１Ａにいる参加者に発言するまでではないが自分の存在に気づいて欲しいと思っている拠点状況情報を表す。それ以外のときは０の値をとる。
次いで、算出されたデータを出力するステップＳ２６０は、図５に示すステップＳ２２２と同様の内容である。ただし、用いる変数は、図６の変数の定義にて説明した第２拠点１１Ｂに関する変数である。具体的には、例えば、第２拠点１１Ｂの第２参加者１５が、第１拠点１１Ａの第１参加者１３と第３参加者１４に、発言するほどではないが、自分の存在を提示したいと思っている度合いが大きい場合には、ＭｏｔｉｏｎＣＴＳ＿１１Ａと、ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａの値が大きくなり、発言するほどではないが、自分の存在を提示したいと思っている度合いが小さい場合には、ＭｏｔｉｏｎＣＴＳ＿１１Ａと、ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａの値は小さくなる。
このように、所定時間内における第２拠点１１Ｂの音声信号が所定音量以下である場合に、第２拠点１１Ｂの映像信号に基づいて第２特徴的動作情報を抽出するので、遠隔地の参加者の言動に現れる会議への参加意思の程度を表す意思量を算出することができる。 Next, step S258 for determining whether or not the number of detected characteristic motion information of the second base 11B is greater than a predetermined threshold value of the number of characteristic motion information is the same as step S220 shown in FIG. . However, the variable to be used is a variable related to the second base 11B described in the definition of the variable in FIG.
Next, Step S259 for calculating a flag representing the base state information of the second base 11B has the same contents as Step S221 in FIG. However, the variable to be used is a variable related to the second base 11B described in the definition of the variable in FIG. Flag_11B is a flag representing the base state information of the second base 11B. At 1, the site status information of the second site 11B represents the site status information that the user wants to be aware of, but not to speak to the participants at the first site 11A. Otherwise, it takes a value of 0.
Next, Step S260 for outputting the calculated data has the same contents as Step S222 shown in FIG. However, the variable to be used is a variable related to the second base 11B described in the definition of the variable in FIG. Specifically, for example, the second participant 15 at the second base 11B does not speak to the first participant 13 and the third participant 14 at the first base 11A, but wants to present his / her presence. When the degree of thinking is large, the values of MotionCTS_11A and MotionValue_11A are large and not enough to speak, but when the degree of wanting to present their presence is small, the values of MotionCTS_11A and MotionValue_11A The value becomes smaller.
As described above, when the audio signal of the second base 11B within a predetermined time is below a predetermined volume, the second characteristic operation information is extracted based on the video signal of the second base 11B. It is possible to calculate the amount of intention representing the degree of intention to participate in the conference that appears in

＜第５実施形態＞
図７は本発明の第５実施形態に係るコミュニケーション補助システムの動作を表すフローチャートであり、特に、図５とは異なる図４に示す第１拠点状況算出処理（ステップＳ２１０）の具体的な例を説明したフローチャートである。
ステップＳ２２３では、第１拠点１１Ａにいる参加者とアバターロボット１７との距離Ｄｉｓｔａｎｃｅ＿１７を算出する処理を行う。ステップＳ２２４では、Ｆｌａｇ＿１１Ａ、ＭｏｔｉｏｎＣＴＳ＿１１Ａ、ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａ、Ｄｉｓｔａｎｃｅ＿１７を出力する。
ここで、図７に示すフローチャートに用いる変数の定義について説明する。
なお、図５、図６で定義済みの変数については省略する。
Ｄｉｓｔａｎｃｅ＿１７は、第１拠点１１Ａにいる参加者とアバターロボット１７との距離を表す。
図７に示すフローチャートは図５に示すフローチャートとほぼ同じである。異なる点は、図５に示すフローチャートには存在しなかった、第１拠点１１Ａにいる参加者とアバターロボット１７との距離Ｄｉｓｔａｎｃｅ＿１７を算出する処理を行うステップＳ２２３があることである。また、図５に示すステップＳ２２２に代わって、図５に示すステップＳ２２２で出力していたデータに第１拠点１１Ａの参加者とアバターロボット１７との距離Ｄｉｓｔａｎｃｅ＿１７を加えて出力しているステップＳ２２４があることである。 <Fifth Embodiment>
FIG. 7 is a flowchart showing the operation of the communication assistance system according to the fifth embodiment of the present invention. In particular, a specific example of the first site situation calculation process (step S210) shown in FIG. 4 different from FIG. It is the flowchart demonstrated.
In step S223, a process of calculating a distance Distance_17 between the participant at the first base 11A and the avatar robot 17 is performed. In step S224, Flag_11A, MotionCTS_11A, MotionValue_11A, and Distance_17 are output.
Here, the definition of variables used in the flowchart shown in FIG. 7 will be described.
Note that variables already defined in FIGS. 5 and 6 are omitted.
Distance_17 represents the distance between the participant at the first site 11A and the avatar robot 17.
The flowchart shown in FIG. 7 is almost the same as the flowchart shown in FIG. The difference is that there is step S223 for performing a process of calculating the distance Distance_17 between the participant at the first base 11A and the avatar robot 17, which did not exist in the flowchart shown in FIG. Further, instead of step S222 shown in FIG. 5, step S224 in which the distance Distance_17 between the participant at the first base 11A and the avatar robot 17 is added to the data outputted in step S222 shown in FIG. That is.

第１拠点１１Ａの状況を計測し、算出するステップにおいて、例えばステップＳ２１３、第１参加者１３と第３参加者１４の状況に加えて、第２参加者１５を表すアバターロボット１７の状況も計測し、後のステップにてこの拠点状況情報についても算出することで、図５に示す場合より、詳しく第１拠点１１Ａの拠点状況情報を算出することができる。
具体的には、ステップＳ２１３では、映像データから、第１拠点１１Ａの第１参加者１３と第２参加者１５を表すアバターロボット１７との距離と、第１拠点１１Ａの第３参加者１４と第２参加者１５を表すアバターロボット１７との距離と、を計測し、上記計測された距離の平均を算出し、変数Ｄｉｓｔａｎｃｅ＿１７に代入する処理を行う。
例えば、距離の平均のみならず、第２参加者１５を表すアバターロボット１７と、第１拠点１１Ａにいる参加者のうちいずれか近いほうの距離を変数Ｄｉｓｔａｎｃｅ＿１７に代入してもよいし、いずれか遠いほうの距離を変数Ｄｉｓｔａｎｃｅ＿１７に代入してもよいし、距離を予め定められた関数を用いて算出した値を代入してもよい。 In the step of measuring and calculating the situation of the first base 11A, for example, in addition to the situation of step S213, the first participant 13 and the third participant 14, the situation of the avatar robot 17 representing the second participant 15 is also measured. However, by calculating the site status information in a later step, the site status information of the first site 11A can be calculated in more detail than in the case shown in FIG.
Specifically, in step S213, the distance between the first participant 13 at the first base 11A and the avatar robot 17 representing the second participant 15 and the third participant 14 at the first base 11A are determined from the video data. The distance to the avatar robot 17 representing the second participant 15 is measured, the average of the measured distances is calculated, and the process of substituting for the variable Distance_17 is performed.
For example, not only the average of distances but also the closest distance between the avatar robot 17 representing the second participant 15 and the participants at the first base 11A may be substituted into the variable Distance_17. The far distance may be substituted into the variable Distance_17, or a value calculated using a predetermined function for the distance may be substituted.

第１拠点１１Ａにいる参加者とアバターロボット１７との距離Ｄｉｓｔａｎｃｅ＿１７は、刺激を出力するアバターロボット１７が、同程度の刺激を出力したとしても、Ｄｉｓｔａｎｃｅ＿１７の値が大きければ、つまり、第１拠点１１Ａにいる参加者とアバターロボット１７との距離が遠ければ、第１拠点１１Ａの参加者が感じる感覚量が小さくなる。
逆に、Ｄｉｓｔａｎｃｅ＿１７の値が小さければ、つまり、第１拠点１１Ａの参加者とアバターロボット１７との距離が近ければ、第１拠点１１Ａにいる参加者が感じる感覚量は大きくなる、ということに対応するために用いる値である。第１拠点１１Ａの参加者とアバターロボット１７との距離Ｄｉｓｔａｎｃｅ＿１７に応じて、第１拠点１１Ａの参加者に提示する刺激量を調整することを可能とするために必要なステップである。上記の例は、距離に関してだが、距離以外の値を用いてもよい。例えば、第１拠点１１Ａの参加者と第２参加者の１００２を表すアバターロボット１７が共に動いている場合、相対速度をもって上記ステップを行ってもよい。
このように、第１参加者とロボット部１７の間の距離を第１拠点状況情報として用いることで、遠隔地の参加者の言動に現れる会議への参加意思の程度を表す意思量を提示する度合いの調整を行うことができる。 The distance Distance_17 between the participant in the first site 11A and the avatar robot 17 is equal to the distance 1 from the first site 11A, even if the avatar robot 17 that outputs the stimulus outputs the same degree of stimulus. If the distance between the participant who is in the room and the avatar robot 17 is long, the amount of sensation felt by the participant at the first base 11A becomes small.
Conversely, if the value of Distance_17 is small, that is, if the distance between the participant at the first base 11A and the avatar robot 17 is short, the amount of sensation felt by the participant at the first base 11A increases. It is a value used to This is a step necessary to enable adjustment of the amount of stimulation to be presented to the participant at the first base 11A according to the distance Distance_17 between the participant at the first base 11A and the avatar robot 17. The above example relates to distance, but values other than distance may be used. For example, when the avatar robot 17 representing the participant of the first base 11A and the second participant 1002 is moving together, the above steps may be performed with a relative speed.
In this way, by using the distance between the first participant and the robot unit 17 as the first site status information, an intention amount representing the degree of willingness to participate in the conference that appears in the behavior of the remote participant is presented. The degree can be adjusted.

＜第６実施形態＞
図８は、本発明の第６実施形態に係るコミュニケーション補助システムの動作を表すフローチャートであり、特に、図３に示す刺激量算出処理を行うステップＳ３００の具体的な例を説明するためのフローチャートである。
まず、図８に示すフローチャートについて概略的に説明する。
ステップＳ３０１では、フラグを判定する。ステップＳ３０２では、データの取り込みを行う。ステップＳ３０３では、第１拠点１１Ａの拠点状況情報から感覚量の範囲を算出する処理を行う。ステップＳ３０４では、第２拠点１１Ｂの拠点状況情報から感覚量Ｅを算出する処理を行う。ステップＳ３０５では、刺激量Ｒの算出を行う。
ステップＳ３０６では、刺激出力部５７のための信号変換処理を行う。ステップＳ３０７では、刺激出力部５７への出力処理を行う。 <Sixth Embodiment>
FIG. 8 is a flowchart showing the operation of the communication assistance system according to the sixth embodiment of the present invention, and in particular, a flowchart for explaining a specific example of step S300 for performing the stimulus amount calculation process shown in FIG. is there.
First, the flowchart shown in FIG. 8 will be schematically described.
In step S301, a flag is determined. In step S302, data is captured. In step S303, a process of calculating the range of the sensory amount from the site status information of the first site 11A is performed. In step S304, a process for calculating the sensory amount E from the site status information of the second site 11B is performed. In step S305, the stimulus amount R is calculated.
In step S306, signal conversion processing for the stimulus output unit 57 is performed. In step S307, output processing to the stimulus output unit 57 is performed.

ここで、図８に示すフローチャートに用いる変数の定義について説明する。
なお、図５、図６、図７を参照して、定義済みの変数については省略する。
Ｅは、第１参加者１３の感覚量を表す。Ｅ＿ｍａｘは、感覚量の閾値の最大値を表す。Ｅ＿ｍｉｎは、感覚量の閾値の最小値を表す。Ｅ＿ｍａｘ＿ｂａｓｉｓは、予め定められた基準となる感覚量の閾値の最大値を表す。Ｅ＿ｍｉｎ＿ｂａｓｉｓは、予め定められた基準となる感覚量の閾値の最小値を表す。ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａ＿ｂａｓｉｓは、予め定められた基準となる第１拠点１１Ａの拠点状況情報を表す値を表す。ＭｏｔｉｏｎＣＴＳ＿１１Ａ＿ｂａｓｉｓは、予め定められた基準となる第１拠点１１Ａの拠点状況情報を表す値を表す。Ｍｏｔｉｏｎ＿ｍａｘ＿ｂａｓｉｓは、予め定められた基準となる第２拠点１１Ｂの拠点状況情報の最大値を表す。Ｍｏｔｉｏｎ＿ｍｉｎ＿ｂａｓｉｓは、予め定められた基準となる第２拠点１１Ｂの拠点状況情報の最小値を表す。Ｗ＿ｃｔｓは、重み付けの値を表す。Ｗ＿ｖａｌｕｅは、重み付けの値を表す。Ｒは、刺激量を表す。Ｋは、予め定められた定数を表す。Ｒ＿ｂａｓｉｓは、予めさだめられた刺激量の基準値を表す。
Ａは、振幅の値を表す。Ａ＿ｂａｓｉｓは、予め定められた振幅の基準値を表す。 Here, the definition of variables used in the flowchart shown in FIG. 8 will be described.
In addition, with reference to FIG.5, FIG.6, FIG.7, it omits about the predefined variable.
E represents the sensory amount of the first participant 13. E_max represents the maximum value of the sensory threshold. E_min represents the minimum value of the sensory threshold. E_max_basis represents the maximum threshold value of the sensory amount that is a predetermined reference. E_min_basis represents the minimum threshold value of the sensory amount that is a predetermined reference. MotionValue_11A_basis represents a value representing the base state information of the first base 11A that is a predetermined reference. MotionCTS_11A_basis represents a value representing the base state information of the first base 11A that is a predetermined reference. Motion_max_basis represents the maximum value of the base state information of the second base 11B, which is a predetermined reference. Motion_min_basis represents the minimum value of the base state information of the second base 11B, which is a predetermined reference. W_cts represents a weighting value. W_value represents a weighting value. R represents the amount of stimulation. K represents a predetermined constant. R_basis represents the reference value of the stimulation amount that has been determined in advance.
A represents an amplitude value. A_basis represents a reference value of a predetermined amplitude.

フラグを判定するステップＳ３０１の入力は、図５に示すステップＳ２２２の出力である第１拠点１１Ａの拠点状況情報を表すフラグＦｌａｇ＿１１Ａと、図６に示すステップＳ２６０の出力である第２拠点１１Ｂの拠点状況情報を表すフラグＦｌａｇ＿１１Ｂである。フラグを判定するステップＳ３０１の処理は、Ｆｌａｇ＿１１ＡとＦｌａｇ＿１１Ｂの乗算が１であるかを判定するステップである。上記、乗算が１である場合は、第１拠点１１Ａの第１参加者１３と第３参加者１４がテレビ会議に白熱している状況、かつ、第２拠点１１Ｂの第２参加者１５が自分の存在にきづいて欲しいと思っている状況である。フラグを判定するステップＳ３０１の出力は、「はい」、又は「いいえ」である。上記処理にて上記乗算の積が１である場合は「はい」、それ以外である場合は「いいえ」を出力する。「はい」である場合はステップＳ３０２へ、「いいえ」である場合は終了のステップへ進む。
データの取り込みを行うステップＳ３０２の入力は、図５に示すステップＳ２２２の出力と、図６に示すステップＳ２６０の出力である。仮に、Ｄｉｓｔａｎｃｅ＿１７の計測も行っている場合には、図６に示すステップＳ２６０の代わりに図７に示すステップＳ２２４の出力を用いる。データの取り込みを行うステップＳ３０２の処理は、データを取り込む処理である。具体的には、ＭｏｔｉｏｎＣＴＳ＿１１Ａ、ＭｏｔｉｏｎＣＴＳ＿１１Ｂ、ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａ、ＭｏｔｉｏｎＶａｌｕｅ＿１１Ｂ、場合によってはＤｉｓｔａｎｃｅ＿１７も加えた上記変数である。データの取り込みを行うステップＳ３０２の出力は、上記変数の出力である。例えば、第１拠点１１Ａの拠点状況情報を表すデータ、具体的には、第１拠点１１Ａの第１参加者１３と第３参加者１４がテレビ会議に白熱している度合いを表すデータと、第２拠点１１Ｂの第２参加者１５が自分の存在にきづいて欲しいと思っている度合いを表すデータを取り込むステップである。 The input of the step S301 for determining the flag is the flag Flag_11A indicating the base state information of the first base 11A that is the output of the step S222 shown in FIG. 5, and the base of the second base 11B that is the output of the step S260 shown in FIG. It is a flag Flag_11B that represents status information. The process of step S301 for determining the flag is a step of determining whether the multiplication of Flag_11A and Flag_11B is 1. When the multiplication is 1, when the first participant 13 and the third participant 14 at the first base 11A are incandescent in the video conference, and the second participant 15 at the second base 11B is himself It is a situation that I want you to become aware of. The output of step S301 for determining the flag is “Yes” or “No”. In the above processing, “Yes” is output when the product of the multiplication is 1, and “No” is output otherwise. If “yes”, the process proceeds to step S302, and if “no”, the process proceeds to an end step.
The input of step S302 for capturing data is the output of step S222 shown in FIG. 5 and the output of step S260 shown in FIG. If the distance_17 is also measured, the output of step S224 shown in FIG. 7 is used instead of step S260 shown in FIG. The process of step S302 for fetching data is a process for fetching data. More specifically, these variables include MotionCTS_11A, MotionCTS_11B, MotionValue_11A, MotionValue_11B, and, in some cases, Distance_17. The output of step S302 for fetching data is the output of the variable. For example, data representing the site status information of the first site 11A, specifically, data representing the degree to which the first participant 13 and the third participant 14 of the first site 11A are incandescent in the video conference, This is a step of taking in data representing the degree to which the second participant 15 at the two bases 11B wants to know his / her presence.

次いで、第１拠点１１Ａの拠点状況情報から感覚量の範囲を算出する処理を行うステップＳ３０３の入力は、第１拠点１１Ａの拠点状況情報を表すデータ、例えば第１拠点１１Ａの第１参加者１３と第３参加者１４がテレビ会議に白熱している度合いを表すデータＭｏｔｉｏｎＶａｌｕｅ＿１１Ａと、ＭｏｔｉｏｎＣＴＳ＿１１Ａと、感覚量の閾値の最大値Ｅ＿ｍａｘと感覚量の閾値の最小値Ｅ＿ｍｉｎの値である。
加えて、予め基準となる第１拠点１１Ａにいる参加者がテレビ会議に白熱している度合いを表すデータＭｏｔｉｏｎＶａｌｕｅ＿１１Ａ＿ｂａｓｉｓと、ＭｏｔｉｏｎＣＴＳ＿１１Ａ＿ｂａｓｉｓと、予め、基準となる感覚量の閾値の値、Ｅ＿ｍａｘ＿ｂａｓｉｓ、Ｅ＿ｍｉｎ＿ｂａｓｉｓと、重み付けの値として、Ｗ＿ｃｔｓとＷ＿ｖａｌｕｅも入力のデータとして用意する。
第１拠点１１Ａの状況から感覚量の範囲を算出する処理を行うステップＳ３０３の処理は、第１拠点１１Ａの第１参加者１３と第３参加者１４がテレビ会議に白熱している度合いを表すデータに応じて、感覚量の閾値の最大値Ｅ＿ｍａｘと感覚量の閾値の最小値Ｅ＿ｍｉｎの値を算出し、更新する処理を行うステップである。例えば、以下の式（１）（２）を用いて、感覚量の閾値の最大値Ｅ＿ｍａｘと感覚量の閾値の最小値Ｅ＿ｍｉｎの値を算出し、更新する。 Next, the input of step S303 for performing the process of calculating the range of the sensory amount from the site status information of the first site 11A is data representing the site status information of the first site 11A, for example, the first participant 13 of the first site 11A. And the values MotionValue_11A, MotionCTS_11A, and the maximum value E_max of the sensory amount threshold and the minimum value E_min of the threshold of the sensory amount.
In addition, the data MotionValue_11A_basis, the MotionCTS_11A_basis representing the degree to which the participants at the first base 11A serving as the reference are incandescent in the video conference, the threshold value of the reference sensory amount, E_max_basis, E_min_basis, in advance, W_cts and W_value are also prepared as input data as weighting values.
The process of step S303 that performs the process of calculating the range of the sensory amount from the situation of the first base 11A represents the degree to which the first participant 13 and the third participant 14 of the first base 11A are incandescent in the video conference. This is a step of performing processing for calculating and updating the maximum value E_max of the threshold value of the sensory amount and the minimum value E_min of the threshold value of the sensory amount according to the data. For example, the following values (1) and (2) are used to calculate and update the maximum value E_max of the sensory amount threshold value and the minimum value E_min of the sensory amount threshold value.

Ｅ＿ｍａｘ＝｛Ｗ＿ｃｔｓ×（ＭｏｔｉｏｎＣＴＳ＿１１Ａ÷ＭｏｔｉｏｎＣＴＳ＿１１Ａ＿ｂａｓｉｓ）＋Ｗ＿ｖａｌｕｅ（ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａ÷ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａ＿ｂａｓｉｓ）｝×Ｅ＿ｍａｘ＿ｂａｓｉｓ（１）
Ｅ＿ｍｉｎ＝｛Ｗ＿ｃｔｓ×（ＭｏｔｉｏｎＣＴＳ＿１１Ａ÷ＭｏｔｉｏｎＣＴＳ＿１１Ａ＿ｂａｓｉｓ）＋Ｗ＿ｖａｌｕｅ（ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａ÷ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａ＿ｂａｓｉｓ）｝×Ｅ＿ｍｉｎ＿ｂａｓｉｓ（２） E_max = {W_cts × (MotionCTS_11A ÷ MotionCTS_11A_basis) + W_value (MotionValue_11A ÷ MotionValue_11A_basis)} × E_max_basis (1)
E_min = {W_cts * (MotionCTS_11A / MotionCTS_11A_basis) + W_value (MotionValue_11A / MotionValue_11A_basis)} * E_min_basis (2)

Ｗ＿ｃｔｓとＷ＿ｖａｌｕｅは足して１になる値をとり、０から１までの値をとるとする。そうすると、感覚量の閾値に反映させる、ＭｏｔｉｏｎＣＴＳ＿１１Ａと、ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａの割合を調整することができる。基準となる感覚量の閾値の値、Ｅ＿ｍａｘ＿ｂａｓｉｓとＥ＿ｍｉｎ＿ｂａｓｉｓは、Ｅｍａｘ＿ｂａｓｉｓ＞Ｅ＿ｍｉｎ＿ｂａｓｉｓを満たす。また、Ｅ＿ｍａｘ＿ｂａｓｉｓ、Ｅ＿ｍｉｎ＿ｂａｓｉｓは、極端に大きすぎず、かつ小さすぎる値であってはならない。
具体的には、例えば、ヒトが予め決められた条件下で、さりげないと感じる感覚量の値であって、例えば、小さい振幅で振動しているものがヒトの視野に入る位置にあった状態の時、例えば１０人中８人が振動していることを感知できる振幅の振動が与える感覚量をＥ＿ｍａｘ＿ｂａｓｉｓ、１０人中２人が振動していることを感知できる振幅の振動が与える感覚量をＥ＿ｍｉｎ＿ｂａｓｉｓとする。
Ｅ＿ｍａｘ＿ｂａｓｉｓ、Ｅ＿ｍｉｎ＿ｂａｓｉｓは、ヒトが感知できない感覚量であってはならないし、だれしもが、確実に沢山の量を感知できる感覚量であってもならない。誰もが気づく大きな振幅で振動しては、さりげない状態ではないということである。
Ｅ＿ｍａｘ＿ｂａｓｉｓ、Ｅ＿ｍｉｎ＿ｂａｓｉｓの値を基準にして算出するＥ＿ｍａｘとＥ＿ｍｉｎの値も同様である。上記式は、ＭｏｔｉｏｎＣＴＳ＿１１Ａと、ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａの値を重み付けし、加算した値を感覚量の閾値に反映させた例だが、加算のみならず、乗算でもよい。
このほかの関係式を用いて感覚量の閾値を算出してもよい。第１拠点１１Ａの拠点状況情報から感覚量の範囲を算出する処理を行うステップＳ３０３の出力は、上記処理にて算出され、更新された感覚量の閾値の最大値Ｅ＿ｍａｘと感覚量の閾値の最小値Ｅ＿ｍｉｎの値である。具体的には、第１拠点１１Ａの拠点状況情報を表すデータ、例えば第１拠点１１Ａの第１参加者１３と第３参加者１４がテレビ会議に白熱している度合いが高ければ、Ｅ＿ｍａｘとＥ＿ｍｉｎの値は共に増加の傾向にある。具体的には、第１拠点１１Ａの拠点状況情報を表すデータ、例えば第１拠点１１Ａの第１参加者１３と第３参加者１４がテレビ会議に白熱している度合いが低ければ、Ｅ＿ｍａｘとＥ＿ｍｉｎの値は小さく設定される。 It is assumed that W_cts and W_value take a value that becomes 1 and take a value from 0 to 1. If it does so, the ratio of MotionCTS_11A and MotionValue_11A reflected on the threshold value of sensory amount can be adjusted. The threshold values of the reference sensory amount, E_max_basis and E_min_basis, satisfy Emax_basis> E_min_basis. Also, E_max_basis and E_min_basis must not be too large and too small.
Specifically, for example, a value of a sensory amount that a human feels casually under a predetermined condition, for example, a state in which something that vibrates with a small amplitude is in a position that enters the human visual field For example, E_max_basis represents the amount of sensation given by vibration of amplitude that can sense that 8 out of 10 people vibrate, and the amount of sensation given by vibration of amplitude that can sense that 2 out of 10 people vibrate. E_min_basis.
E_max_basis and E_min_basis must not be a sensory amount that cannot be sensed by humans, and cannot be a sensory amount by which anyone can reliably sense a large amount. When it vibrates with a large amplitude that everyone notices, it is not in a casual state.
The same applies to the values of E_max and E_min calculated based on the values of E_max_basis and E_min_basis. The above formula is an example in which the values of MotionCTS_11A and MotionValue_11A are weighted and the added value is reflected in the threshold value of the sensory amount.
The threshold value of the sensory amount may be calculated using another relational expression. The output of step S303 for performing the process of calculating the range of the sensory amount from the site status information of the first site 11A is calculated by the above process, and the updated maximum value E_max of the sensory amount threshold and the minimum of the sensory amount threshold are calculated. This is the value E_min. Specifically, if data indicating the site status information of the first base 11A, for example, the first participant 13 and the third participant 14 of the first base 11A are incandescent in the video conference, E_max and E_min Both values tend to increase. Specifically, if the degree of the data indicating the base state information of the first base 11A, for example, the first participant 13 and the third participant 14 of the first base 11A is incandescent in the video conference is low, E_max and E_min The value of is set small.

次いで、第２拠点１１Ｂの拠点状況情報から感覚量Ｅを算出する処理を行うステップＳ３０４の入力は、具体的には、例えば、ステップＳ３０２の出力であるＥ＿ｍａｘとＥ＿ｍｉｎの値と、予め定められた基準となる第２拠点１１Ｂの拠点状況情報の最大値Ｍｏｔｉｏｎ＿ｍａｘ＿ｂａｓｉｓと予め定められた基準となる第２拠点１１Ｂの拠点状況情報の最小値Ｍｏｔｉｏｎ＿ｍｉｎ＿ｂａｓｉｓと、第２拠点１１Ｂの拠点状況情報を表すＭｏｔｉｏｎＣＴＳ＿１１ＢとＭｏｔｉｏｎＶａｌｕｅ＿１１Ｂと、重み付けの値Ｗ＿ｃｔｓとＷ＿ｖａｌｕｅである。第２拠点１１Ｂの拠点状況情報から感覚量Ｅを算出する処理を行うステップＳ３０４の処理は、上記入力データから、第１拠点１１Ａにいる参加者に提示する感覚量Ｅを算出するステップである。例えば、感覚量Ｅを下記の式にて求める処理である。 Next, the input of step S304 for performing the process of calculating the sensation amount E from the site status information of the second site 11B is specifically, for example, the values of E_max and E_min that are the outputs of step S302, and predetermined values. The maximum value Motion_max_basis of the base status information of the second base 11B serving as a reference, the minimum value Motion_min_basis of the base status information of the second base 11B serving as a predetermined reference, and MotionCTS_11B and MotionValue_11B representing the base status information of the second base 11B And weight values W_cts and W_value. The process of step S304, which performs the process of calculating the sensory amount E from the site status information of the second site 11B, is a step of calculating the sensory amount E to be presented to the participant at the first site 11A from the input data. For example, it is a process for obtaining the sensory amount E by the following equation.

Ｅ＝｛（Ｗ＿ｖａｌｕｅ×ＭｏｔｉｏｎＶａｌｕｅ＿１１Ｂ＋Ｗ＿ｃｔｓ×ＭｏｔｉｏｎＣＴＳ＿１１Ｂ）÷（Ｍｏｔｉｏｎ＿ｍａｘ＿ｂａｓｉｓ＋Ｍｏｔｉｏｎ＿ｍｉｎ＿ｂａｓｉｓ）｝×（Ｅ＿ｍａｘ＋Ｅ＿ｍｉｎ）（３）
上記式（３）に示す（Ｗ＿ｖａｌｕｅ×ＭｏｔｉｏｎＶａｌｕｅ＿１１Ｂ＋Ｗ＿ｃｔｓ×ＭｏｔｉｏｎＣＴＳ＿１１Ｂ）の部分では、第２拠点１１Ｂの拠点状況情報、つまり第２参加者１５が自分の存在に気づいて欲しいと思っている度合いについて、回数と量を重み付け数Ｗで重み付けし、第２拠点１１Ｂの拠点状況情報を数値化する。 E = {(W_value × MotionValue — 11B + W_cts × MotionCTS — 11B) ÷ (Motion_max_basis + Motion_min_basis)} × (E_max + E_min) (3)
In the part of (W_value × MotionValue_11B + W_cts × MotionCTS_11B) shown in the above equation (3), the number of times and the number of times the degree of desire that the second participant 15 wants to be aware of the presence of the second base 11B The amount is weighted by the weighting number W, and the base state information of the second base 11B is digitized.

上記式（３）に示す÷（Ｍｏｔｉｏｎ＿ｍａｘ＿ｂａｓｉｓ＋Ｍｏｔｉｏｎ＿ｍｉｎ＿ｂａｓｉｓ）｝の部分では、予め定められた基準となる第２拠点１１Ｂの拠点状況情報の最大値Ｍｏｔｉｏｎ＿ｍａｘ＿ｂａｓｉｓと予め定められた基準となる第２拠点１１Ｂの拠点状況情報の最小値Ｍｏｔｉｏｎ＿ｍｉｎ＿ｂａｓｉｓとの値と比較して、上記第２拠点１１Ｂの拠点状況情報を数値化した値がどの程度かを算出する。
上記式（３）に示す×（Ｅ＿ｍａｘ＋Ｅ＿ｍｉｎ）の部分では、ステップＳ３０３にて更新された感覚量の範囲の中で、上記第２拠点１１Ｂの拠点状況情報を数値化した値がどの程度かを算出した値がどの程度かを算出している。
ただし、Ｍｏｔｉｏｎ＿ｍａｘ＿ｂａｓｉｓ＞Ｍｏｔｉｏｎ＿ｍｉｎ＿ｂａｓｉｓを満たす必要がある。簡単にいうと、予め定められたＭｏｔｉｏｎ＿ｂａｓｉｓの範囲からみて、第２拠点１１Ｂの拠点状況情報がどの程度かを算出し、その程度を感覚量の範囲へ変換する処理を行うステップである。上記の式はあくまで例であり、このほかの関係式を用いてもよい。第２拠点１１Ｂの拠点状況情報から感覚量Ｅを算出する処理を行うステップＳ３０４の出力は、感覚量Ｅの値である。 In the part of ÷ (Motion_max_basis + Motion_min_basis)} shown in the above equation (3), the maximum value Motion_max_basis of the base state information of the second base 11B serving as the predetermined reference and the base state of the second base 11B serving as the predetermined reference Compared with the value of the information minimum value Motion_min_basis, the degree of the value obtained by quantifying the base state information of the second base 11B is calculated.
In the part of x (E_max + E_min) shown in the above equation (3), it is calculated how much the numerical value of the base state information of the second base 11B is within the range of the sensory amount updated in step S303. The degree of the calculated value is calculated.
However, it is necessary to satisfy Motion_max_basis> Motion_min_basis. In short, it is a step of performing a process of calculating how much the base state information of the second base 11B is calculated from the range of the predetermined Motion_basis and converting the degree into the range of the sensory amount. The above formula is merely an example, and other relational expressions may be used. The output of step S304 for performing the process of calculating the sensory amount E from the base state information of the second base 11B is the value of the sensory amount E.

刺激量Ｒの算出を行うステップＳ３０５の入力は、例えば、ステップＳ３０４で算出された感覚量Ｅの値と、予め定められた定数Ｋの値である。刺激量Ｒの算出を行うステップＳ３０５の処理は、例えば、人間の感覚量と与えられる刺激量との関係を定義したフェフィナーの法則を用いると、刺激量Ｒ＝１０＾（感覚量Ｅ／予め定められた定数Ｋ）により、感覚量Ｒを算出する処理である。
上記式（３）は、第２拠点１１Ｂの第２参加者１５が第１拠点１１Ａにいる参加者に向けて提示したい感覚量Ｅから刺激量Ｒを算出する式である。第２拠点１１Ｂの参加者が自分の存在を提示したいと思っている度合いが大きければ感覚量Ｅの値がステップＳ３０４にて大きく算出されるため、ステップＳ３０５で算出される刺激量Ｒも大きくなる。また、第１拠点１１Ａで第２参加者１５を忘れてテレビ会議が白熱している場合は、ステップＳ３０３で算出される感覚量の範囲Ｅ＿ｍａｘとＥ＿ｍｉｎの値が大きく算出されるため、ステップＳ３０４で算出される感覚量Ｅも大きくなり、ステップＳ３０５で算出される刺激量Ｒも大きくなる。刺激量Ｒが小さくなる場合はその逆である。上記は、フェフィナーの法則をもちいて感覚量Ｅから刺激量Ｒを算出したれいだが、そのほかの変換の式を用いてもよい。刺激量Ｒの算出を行うステップＳ３０５の出力は、刺激量Ｒの値である。
なお、本実施形態によれば、第２拠点１１Ｂの第２参加者１５が第１拠点１１Ａにいる参加者に向けて提示したい感覚量Ｅから刺激量Ｒを算出しているが、本発明はこれに限定されるものではない。すなわち、第２拠点状況情報、第１参加者の感覚量の範囲に基づいて、第２拠点１１Ｂにいる第２参加者の言動に現れる会議への参加意思の程度を表す意思量を意思量算出部５５ｂにより算出し、刺激出力部５７により意思量に応じた刺激を第１参加者が知覚できるように出力してもよい。これにより、第２拠点状況情報、第１参加者の感覚量の範囲に基づいて、遠隔地の参加者の言動に現れる会議への参加意思の程度を表す意思量を提示する度合いの調整を行うことができる。 The input in step S305 for calculating the stimulus amount R is, for example, the value of the sensory amount E calculated in step S304 and the value of a predetermined constant K. The process of step S305 for calculating the stimulus amount R is, for example, using the Feffiner's law that defines the relationship between the human sense amount and the given stimulus amount, and the stimulus amount R = 10 ^ (sensory amount E / predetermined The sensory amount R is calculated based on the constant K).
The above equation (3) is an equation for calculating the stimulation amount R from the sensory amount E that the second participant 15 of the second base 11B wants to present to the participant at the first base 11A. If the degree of desire that the participant at the second base 11B wants to present his / her presence is large, the value of the sensory amount E is greatly calculated in step S304, and therefore the stimulus amount R calculated in step S305 is also large. . In addition, when the second conference 15 is forgotten at the first base 11A and the video conference is incandescent, the sensory amount ranges E_max and E_min calculated in step S303 are calculated to be large, so in step S304. The sensory amount E calculated is also increased, and the stimulus amount R calculated in step S305 is also increased. The reverse is true when the stimulus amount R is small. In the above description, the stimulus amount R is calculated from the sensory amount E using Feffiner's law, but other conversion formulas may be used. The output of step S305 for calculating the stimulus amount R is the value of the stimulus amount R.
According to the present embodiment, the stimulation amount R is calculated from the sensory amount E that the second participant 15 at the second base 11B wants to present to the participant at the first base 11A. It is not limited to this. That is, based on the second site status information and the range of the amount of sensation of the first participant, the amount of intention representing the degree of intention to participate in the conference that appears in the behavior of the second participant at the second site 11B is calculated. It may be calculated by the unit 55b and output by the stimulus output unit 57 so that the first participant can perceive the stimulus according to the intention amount. This adjusts the degree of presenting the amount of intention representing the degree of willingness to participate in the conference that appears in the speech of the remote participant based on the second site status information and the range of the sensory amount of the first participant. be able to.

次いで、刺激出力部５７のための信号変換処理を行うステップＳ３０６の入力は、ステップＳ３０５の出力である刺激量Ｒの値である。刺激出力部５７のための信号変換処理を行うステップＳ３０６の処理は、例えば刺激出力部５７が振動モータである場合、刺激量Ｒを振幅と周波数へ変換する処理である。例えば刺激量Ｒを振幅へ変換する処理である。例えば、刺激量Ｒを周波数へ変換する処理である。例えば基準となる振幅Ａ＿ｂａｓｉｓがあり、基準となる刺激量Ｒ＿ｂａｓｉｓがあった場合、出力する振幅Ａは、以下の式（４）で算出される。
Ａ＝Ａ＿ｂａｓｉｓ×（Ｒ÷Ｒ＿ｂａｓｉｓ）（４）
上記の例は振幅の例だが、周波数にして当てはめてもよい。また、刺激部がＬＥＤだった場合、ＬＥＤに流す電流の量を上記の式と同様に算出し、明るさを調整してもよい。
上記の式（４）はあくまで例であり、他の関係式を用いてもよい。刺激量Ｒから刺激部への入力値として適切な値へと変換する処理を行えばよい。刺激出力部５７のための信号変換処理を行うステップＳ３０６の出力は、上記処理にて算出された刺激出力部５７へ入力できる信号のデータである。例えば、振幅の値である。例えば周波数の値である。例えば電流の値である。例えば電圧の値である。出力の値は１つであってもよいし、複数であってもよい。例えば、振幅と周波数の値であってもよい。 Next, the input of step S306 that performs signal conversion processing for the stimulus output unit 57 is the value of the stimulus amount R that is the output of step S305. The process of step S306 for performing the signal conversion process for the stimulus output unit 57 is a process of converting the stimulus amount R into an amplitude and a frequency when the stimulus output unit 57 is a vibration motor, for example. For example, it is a process of converting the stimulus amount R into an amplitude. For example, it is a process of converting the stimulus amount R into a frequency. For example, when there is a reference amplitude A_basis and there is a reference stimulus amount R_basis, the output amplitude A is calculated by the following equation (4).
A = A_basis × (R ÷ R_basis) (4)
The above example is an example of amplitude, but may be applied as a frequency. When the stimulating unit is an LED, the brightness may be adjusted by calculating the amount of current flowing through the LED in the same manner as in the above formula.
The above formula (4) is merely an example, and other relational expressions may be used. A process of converting the stimulus amount R into an appropriate value as an input value to the stimulus unit may be performed. The output of step S306 for performing signal conversion processing for the stimulus output unit 57 is data of a signal that can be input to the stimulus output unit 57 calculated in the above processing. For example, the amplitude value. For example, a frequency value. For example, the current value. For example, a voltage value. There may be one output value or a plurality of output values. For example, amplitude and frequency values may be used.

次いで、刺激出力部５７への出力処理を行うステップＳ３０７の入力は、ステップＳ３０６の出力であり、刺激出力部５７のための信号変換処理を行ったデータである。例えば、刺激出力部５７が振動モータである場合、振幅の値である。刺激出力部５７への出力処理を行うステップＳ３０７の処理は、上記入力データを刺激出力部５７へ入力し、刺激出力を実際に行う処理である。例えば、振動モータに、振幅の値の指令を入力する処理である。刺激出力部５７への出力処理を行うステップＳ３０７の出力は、刺激出力部５７が出力をしていることである。例えば、上記処理にて振動モータに入力された振幅にて振動モータが振動している状態である。上記は、振動モータについて、振幅の値を変化させる様子について具体的に説明したが、ＬＥＤなどを用いて光量や色を変化させるために電流値や抵抗値や電圧を変化させてもよい。また、スピーカを用いて音量を変化させるような出力の処理をおこなってもよい。
このように、第１参加者が感知可能な光波信号、音波信号、振動、臭気のうち、少なくとも１つを発生することで、遠隔地の参加者の言動に現れる会議への参加意思の程度を表す意思量を提示することができる。 Next, the input of step S307 for performing output processing to the stimulus output unit 57 is the output of step S306, which is data that has undergone signal conversion processing for the stimulus output unit 57. For example, when the stimulus output unit 57 is a vibration motor, the value is an amplitude value. The process of step S307 for performing the output process to the stimulus output unit 57 is a process for inputting the input data to the stimulus output unit 57 and actually performing the stimulus output. For example, this is a process of inputting an amplitude value command to the vibration motor. The output of step S307 for performing output processing to the stimulus output unit 57 is that the stimulus output unit 57 is outputting. For example, the vibration motor is oscillating with the amplitude input to the vibration motor in the above processing. In the above description, the state of changing the amplitude value of the vibration motor has been specifically described. However, the current value, resistance value, and voltage may be changed in order to change the light amount and color using an LED or the like. Further, output processing may be performed such that the volume is changed using a speaker.
In this way, by generating at least one of the light wave signal, sound wave signal, vibration, and odor that can be sensed by the first participant, the degree of intention to participate in the conference that appears in the speech and behavior of the remote participant is increased. The amount of intention to represent can be presented.

＜第７実施形態＞
図９は、本発明の第７実施形態に係るコミュニケーション補助システムに用いる、図１に示す第２参加者１５を表すアバターロボット１７の具体的な例を説明した図である。
１７は第２参加者１５を表すアバターロボット、１７ａはアバターロボットの耳に位置する刺激部、１７ｂはアバターロボットの目に位置する刺激部、１７ｃはアバターロボットの口に位置する刺激部、１７ｄはアバターロボットの手に位置する刺激部、１７ｅはアバターロボットの足に位置する刺激部、１７ｆはアバターロボットの内部に位置する刺激部である。
例えば、第２参加者１５を表すアバターロボット１７は、視覚を刺激する場合、刺激部１７ａ〜１７ｆのいずれか又は一部又はすべてが動いたり、色が変わったり、状態が時間に応じて変化するような刺激部を具備するアバターロボットである。
例えば、第２参加者１５を表すアバターロボット１７は、聴覚を刺激するとは、刺激部１７ａ〜１７ｆのいずれか又は一部又はすべてから出力される音が時間に応じて変化するような刺激部を具備するアバターロボットである。
例えば、第２参加者１５を表すアバターロボット１７は、触覚を刺激するとは、刺激部１７ａ〜１７ｆのいずれか又は一部又はすべてが動いたりし、状態が時間に応じて変化し、ロボットに触れると状態の違いを認識することができるような刺激部を具備するアバターロボットである。刺激出力部５７は、第１参加者が接触することにより感知可能な振動を発生する。 <Seventh embodiment>
FIG. 9 is a diagram illustrating a specific example of the avatar robot 17 representing the second participant 15 shown in FIG. 1 used in the communication assist system according to the seventh embodiment of the present invention.
17 is an avatar robot representing the second participant 15, 17 a is a stimulating unit located in the avatar robot's ear, 17 b is a stimulating unit located in the eyes of the avatar robot, 17 c is a stimulating unit located in the mouth of the avatar robot, and 17 d is A stimulation unit located in the hand of the avatar robot, 17e is a stimulation unit located on the foot of the avatar robot, and 17f is a stimulation unit located inside the avatar robot.
For example, when the avatar robot 17 representing the second participant 15 stimulates vision, any or a part or all of the stimulation units 17a to 17f move, the color changes, or the state changes according to time. It is an avatar robot provided with such a stimulating unit.
For example, when the avatar robot 17 representing the second participant 15 stimulates hearing, a stimulating unit in which sound output from any one or a part or all of the stimulating units 17a to 17f changes according to time. It is an avatar robot provided.
For example, when the avatar robot 17 representing the second participant 15 stimulates the sense of touch, any or a part or all of the stimulation units 17a to 17f move, the state changes according to time, and the robot is touched. It is an avatar robot having a stimulation unit that can recognize the difference in state. The stimulus output unit 57 generates a vibration that can be sensed when the first participant contacts.

例えば、第２参加者１５を表すアバターロボット１７は、臭覚を刺激するとは、刺激部１７ａ〜１７ｆのいずれか又は一部又はすべてから時間変化に応じて匂いを発し、臭覚を用いて状態の違いを認識することができるような刺激部を具備するアバターロボットである。刺激出力部５７は、第１参加者が嗅取することにより感知可能な臭気を発生する。
例えば、第２参加者１５を表すアバターロボット１７は、アバターロボットの耳に位置する刺激部１７ａが、振動モータであるアバターロボットである。
例えば、第２参加者１５を表すアバターロボット１７は、アバターロボットの目に位置する刺激部１７ｂが、ＬＥＤであるアバターロボットである。
例えば、第２参加者１５を表すアバターロボット１７は、アバターロボットの口に位置する刺激部１７ｃがスピーカであるアバターロボットである。
例えば、第２参加者１５を表すアバターロボット１７は、アバターロボットの手に位置する刺激部１７ｄが振動モータであるアバターロボットである。
上記の説明はある特定の刺激部に限って説明したものだが、上記の刺激部は、図９に示す刺激部１７ａ〜１７ｆの少なくとも１つでもよい。
このように、第１参加者とロボット部１７の間の距離を第１拠点状況情報として用いることで、遠隔地の参加者の言動に現れる会議への参加意思の程度を表す意思量を提示する度合いの調整を行うことができる。 For example, when the avatar robot 17 representing the second participant 15 stimulates an olfaction, the odor is emitted from any or a part or all of the stimulation units 17a to 17f in accordance with a time change, and the difference in state using the olfaction It is an avatar robot provided with a stimulation unit that can recognize the The stimulus output unit 57 generates an odor that can be sensed when the first participant smells it.
For example, the avatar robot 17 representing the second participant 15 is an avatar robot in which the stimulation unit 17a located at the ear of the avatar robot is a vibration motor.
For example, the avatar robot 17 representing the second participant 15 is an avatar robot in which the stimulation unit 17b located in the eyes of the avatar robot is an LED.
For example, the avatar robot 17 representing the second participant 15 is an avatar robot in which the stimulation unit 17c located in the mouth of the avatar robot is a speaker.
For example, the avatar robot 17 representing the second participant 15 is an avatar robot in which the stimulation unit 17d located in the hand of the avatar robot is a vibration motor.
Although the above description is limited to a specific stimulation unit, the stimulation unit may be at least one of the stimulation units 17a to 17f illustrated in FIG.
In this way, by using the distance between the first participant and the robot unit 17 as the first site status information, an intention amount representing the degree of willingness to participate in the conference that appears in the behavior of the remote participant is presented. The degree can be adjusted.

＜第８実施形態＞
図１０〜図１３は、本発明の第８実施形態に係るコミュニケーション補助システムに用いる、第１拠点１１Ａのテレビ会議の白熱度合いを反映した、第１拠点１１Ａの感覚Ｅの分布について説明するためのグラフ図である。
図１０は、第１拠点１１Ａの第１参加者１３と第３参加者１４の感覚Ｅの分布のさりげない範囲を具体的に説明するためのグラフ図である。
例えば、計測された第１拠点１１Ａの会議の白熱度合いから、会議参加者毎に感覚Ｅの分布をもち、さりげないと感じる感覚の範囲Ｅ＿ｍｉｎからＥ＿ｍａｘをもっているとする。 <Eighth Embodiment>
10 to 13 are diagrams for explaining the distribution of the sensation E of the first base 11A reflecting the incandescence of the video conference of the first base 11A used in the communication assist system according to the eighth embodiment of the present invention. FIG.
FIG. 10 is a graph for specifically explaining a casual range of the distribution of the sense E of the first participant 13 and the third participant 14 at the first base 11A.
For example, from the measured incandescence of the conference at the first base 11A, it is assumed that the conference E has a distribution of sensation E for each conference participant, and has a range of sensations E_min to E_max.

図１１（ａ）は、第１拠点１１Ａの第１参加者１３と第３参加者１４の感覚Ｅの分布から第１拠点１１Ａの感覚Ｅのさりげない範囲を具体的に説明するためのグラフ図である。
例えば、第１拠点１１Ａにいる第１参加者１３と第３参加者１４の２人ともがさりげないと感じる範囲を、第１拠点１１Ａの感覚Ｅのさりげない範囲とする。上記は参加者のＡＮＤをとった値となっているが、そのほかの方法で複数参加者の感覚Ｅの分布から第１拠点１１Ａの感覚Ｅのさりげない範囲を算出してもよい。下記の式（５）（６）で算出することもできる。 FIG. 11A is a graph for specifically explaining a casual range of the sensation E of the first base 11A from the distribution of the sensation E of the first participant 13 and the third participant 14 of the first base 11A. It is.
For example, a range in which both the first participant 13 and the third participant 14 at the first base 11A feel casual is set as a casual range of the sense E of the first base 11A. Although the above is a value obtained by taking the AND of the participants, the casual range of the sensation E of the first base 11A may be calculated from the distribution of the sensation E of the plurality of participants by other methods. It can also be calculated by the following formulas (5) and (6).

Ｅ＿ｍａｘ＝ｍｉｎ［Ｅ＿ｍａｘ＿１３、Ｅ＿ｍａｘ＿１４］（５）
Ｅ＿ｍｉｎ＝ｍａｘ［Ｅ＿ｍｉｎ＿１３、Ｅ＿ｍａｘ＿１４］（６） E_max = min [E_max_13, E_max_14] (5)
E_min = max [E_min_13, E_max_14] (6)

図１１（ｂ）は、第１拠点１１Ａに沢山の参加がいた場合、参加者毎の感覚Ｅの分布のさりげないと感じる範囲から、第２参加者が与えたいと思っている感覚Ｅを算出する方法を説明するためのグラフ図である。
たとえば、第１拠点１１Ａに沢山参加者がいて、図１１（ｂ）のように感覚Ｅの分布のさりげないと感じる範囲が参加者毎に存在する場合を考える。例えば、第２参加者が自分の存在に気付いて欲しいと思っている度合いがとても高い場合には、すべて参加者がさりげないと思っている感覚Ｅの値を算出したりする方法が具体例の一つである。
例えば、第２参加者が自分の存在に気付いて欲しいと思っている度合いがとても低い場合には、数人がさりげないと思っており、残りのヒトはＥ＿ｍｉｎより小さい値であるような感覚Ｅの値を算出したりする方法が具体例の一つである。 FIG. 11 (b) calculates the sensation E that the second participant wants to give from the range where the distribution of the sensation E per participant is casual when there are many participants at the first base 11A. It is a graph for demonstrating the method to do.
For example, let us consider a case where there are many participants at the first base 11A, and there is a range where each participant feels that the distribution of the sensation E is casual as shown in FIG. For example, when the second participant wants to be aware of his / her presence is very high, a specific example is a method of calculating the value of sensation E that the participant thinks is casual. One.
For example, if the second participant wants to be aware of his / her presence is very low, he / she thinks that some people are casual, and the remaining humans have a sense E that is less than E_min. One method is to calculate the value of.

図１２（ａ）は、第１拠点１１Ａに存在する複数参加者のさりげないと感じる感覚の最大値Ｅ＿ｍａｘの確率密度の具体例を説明するためのグラフ図である。
例えば、第１拠点１１Ａの感覚Ｅの範囲の更新は、図１２（ａ）に示すような確率密度分布を用いて更新する方法がある。例えば、第１拠点１１Ａが盛り上がっている場合には、確率密度が高くなるようなＥ＿ｍａｘの値を、第１拠点１１Ａのさりげないと感じる感覚の最大値Ｅ＿ｍａｘとして採用したりする方法がある。逆に、第１拠点１１Ａが盛り上がっていない場合には、確率密度が低くなるようなＥ＿ｍａｘの値を、第１拠点１１Ａのさりげないと感じる感覚の最大値Ｅ＿ｍａｘとして採用したりする方法がある。
図１２（ｂ）は、第１拠点１１Ａに存在する複数参加者のさりげないと感じる感覚の最小値Ｅ＿ｍｉｎの確率密度の具体例を説明した図である。
第１拠点１１Ａの感覚Ｅの範囲の更新において、さりげないと感じる感覚の最小値について説明した図である。具体的な方法は（ａ）と同様である。 FIG. 12A is a graph for explaining a specific example of the probability density of the maximum sensation E_max of a plurality of participants who feels casually in the first base 11A.
For example, the range of the sensation E of the first base 11A can be updated using a probability density distribution as shown in FIG. For example, when the first site 11A is exciting, there is a method of adopting the E_max value that increases the probability density as the maximum value E_max of the sense that the first site 11A feels casual. Conversely, when the first site 11A is not excited, there is a method of adopting the E_max value that reduces the probability density as the maximum value E_max that the first site 11A feels casual.
FIG. 12B is a diagram illustrating a specific example of the probability density of the minimum value E_min of the sensation felt by a plurality of participants present at the first base 11A.
It is the figure explaining the minimum value of the sensation felt casually in the update of the range of the sensation E of the 1st base 11A. The specific method is the same as (a).

図１２（ｃ）は、予め定められた第１拠点１１Ａのさりげないと感じる感覚Ｅの範囲がＥ＿ｍａｘ＿ｂａｓｉｓとＥ＿ｍｉｎ＿ｂａｓｉｓとして定められており、第１拠点１１Ａのテレビ会議の拠点状況情報に応じて、第１拠点１１Ａのさりげないと感じる感覚量の範囲Ｅ＿ｍａｘとＥ＿ｍｉｎを更新する具体的な例を説明するためのグラフ図である。
予め定められた第１拠点１１Ａのさりげないと感じる感覚Ｅの範囲がＥ＿ｍａｘ＿ｂａｓｉｓとＥ＿ｍｉｎ＿ｂａｓｉｓとして定められており、例えば第１拠点１１Ａのテレビ会議が白熱している場合には、第１拠点１１Ａのさりげないと感じる感覚量の範囲Ｅ＿ｍａｘとＥ＿ｍｉｎは大きい値へと変換、又は算出される。下記式を用いて算出される。 In FIG. 12 (c), a predetermined range E of the first base 11A that is felt casually is defined as E_max_basis and E_min_basis, and according to the base state information of the video conference of the first base 11A, It is a graph for demonstrating the specific example which updates the range E_max and E_min of the amount of sensations which one base 11A feels casually.
The predetermined range E of the first base 11A that is felt casually is defined as E_max_basis and E_min_basis. For example, when the video conference at the first base 11A is incandescent, the first base 11A is casual. The sensory amount ranges E_max and E_min that are felt to be absent are converted or calculated into large values. It is calculated using the following formula.

Ｅ＿ｍａｘ＝ｆ（ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａ、ＭｏｔｉｏｎＣｏｕｎｔ＿１１Ａ、Ｅ＿ｍａｘ＿ｂａｓｉｓ）（７）
Ｅ＿ｍｉｎ＝ｆ（ＭｏｔｉｏｎＶａｌｕｅ＿１１Ａ、ＭｏｔｉｏｎＣｏｕｎｔ＿１１Ａ、Ｅ＿ｍｉｎ＿ｂａｓｉｓ）（８）
ただしｆは予めさだめられた関数である。 E_max = f (MotionValue_11A, MotionCount_11A, E_max_basis) (7)
E_min = f (MotionValue_11A, MotionCount_11A, E_min_basis) (8)
Here, f is a function that has been preliminarily set.

図１３は、第２拠点１１Ｂの第２参加者１５が自分の存在に気付いて欲しいと思っている度合いを反映し、第１拠点参加者に提示する感覚量Ｅを算出する具体的な例を説明するためのグラフ図である。
例えば、第２参加者１５が自分の存在に気付いて欲しいと思っている度合いは、図１３に示す範囲の中で、上記度合いによってその範囲の中で値を調整することで算出する。例えば、自分の存在に強く気付いて欲しいと思っている場合には、Ｅ＿ｍａｘに近い値をとる。自分の存在に気付いて欲しいと思っている度合いが小さい場合にはＥ＿ｍｉｎに近い値をとる。 FIG. 13 is a specific example of calculating the sensory amount E to be presented to the first site participant, reflecting the degree to which the second participant 15 at the second site 11B wants to be aware of his / her presence. It is a graph for demonstrating.
For example, the degree that the second participant 15 wants to be aware of his / her presence is calculated by adjusting the value in the range according to the above degree in the range shown in FIG. For example, if you want to be strongly aware of your presence, take a value close to E_max. When the degree of wanting to be aware of one's presence is small, the value is close to E_min.

上記これまでの実施例すべてについて、以下に記載の内容であってもよい。
例えば、第２拠点１１Ｂにいる参加者１５の自分の存在に気づいて欲しい度合いはこれまでの実施例では、第２参加者の動作をもって算出していたが、例えば、動作に現れない思いを脳波などの信号をもって計測し算出してもよい。
また、刺激Ｒの算出処理は、第２参加者１５を表すアバターロボット１７と、第１拠点１１Ａの第１参加者１３と第３参加者１４との距離、又は第２参加者１５を表すアバターロボット１７と、第１拠点１１Ａの第１参加者１３との距離、又は、第２参加者１５を表すアバターロボット１７と、第１拠点１１Ａの第３参加者１４との距離に比例する強さで算出する処理であってもよい。
また、第１拠点１１Ａの拠点状況情報を計測し、上記算出処理、判定処理を行う際の話者は、複数いてもよい。
このように、第２拠点状況情報、第１参加者の感覚量の分布に基づいて、遠隔地の参加者の言動に現れる会議への参加意思の程度を表す意思量を提示する度合いの調整を行うことができる。 The contents described below may be applied to all the above embodiments.
For example, the degree to which the participant 15 at the second base 11B wants to be aware of his / her own presence has been calculated based on the action of the second participant in the embodiments so far. It may be measured and calculated with a signal such as
In addition, the calculation process of the stimulus R is the distance between the avatar robot 17 representing the second participant 15 and the first participant 13 and the third participant 14 at the first base 11A, or the avatar representing the second participant 15. Strength proportional to the distance between the robot 17 and the first participant 13 at the first base 11A, or the distance between the avatar robot 17 representing the second participant 15 and the third participant 14 at the first base 11A. It may be a process to calculate by.
Further, there may be a plurality of speakers when the site status information of the first site 11A is measured and the calculation process and the determination process are performed.
Thus, based on the second site status information and the distribution of the amount of sensation of the first participant, the degree of presenting the amount of intention representing the degree of intention to participate in the conference that appears in the speech of the remote participant is adjusted. It can be carried out.

＜本発明の実施態様例と効果＞
＜第１態様＞
本態様のコミュニケーション補助装置は、第１拠点１１Ａに配置されたテレビ会議端末１０Ａおよび第２拠点１１Ｂに配置されたテレビ会議端末１０Ｂとの間で通信ネットワークＮを経由したコミュニケーションを補助するコミュニケーション補助装置であって、第１拠点１１Ａには、推定された第２拠点１１Ｂの状況を表す第２拠点状況情報を受信する無線通信部５３と、第１拠点１１Ａにいる第１参加者１３が知覚可能な感覚量の範囲を算出する感覚量範囲算出部５５ａと、第２拠点状況情報と第１参加者の感覚量の範囲に基づいて、第２拠点１１Ｂにいる第２参加者の意思量を算出する意思量算出部５５ｂと、意思量に応じた刺激を第１参加者が知覚できるように出力する刺激出力部５７と、を備えることを特徴とする。
本態様によれば、第２拠点状況情報と第１参加者の感覚量の範囲に基づいて、遠隔地の参加者の言動に現れる会議への参加意思の程度を表す意思量を提示する度合いの調整を行うことができる。すなわち、相手に自分の存在を提示する度合いの調整を行うことができる。 <Examples of Embodiments and Effects of the Present Invention>
<First aspect>
The communication assistance device of this aspect is a communication assistance device that assists communication via the communication network N between the video conference terminal 10A disposed at the first base 11A and the video conference terminal 10B disposed at the second base 11B. In the first base 11A, the wireless communication unit 53 that receives the second base state information representing the estimated state of the second base 11B and the first participant 13 in the first base 11A can perceive. The amount of intention of the second participant at the second base 11B is calculated based on the sensory amount range calculation unit 55a that calculates the range of the sensory amount, the second site status information, and the range of the sensory amount of the first participant. And a stimulus output unit 57 that outputs a stimulus according to the intention amount so that the first participant can perceive the stimulus.
According to this aspect, based on the second site situation information and the range of the first participant's sense amount, the degree of intention indicating the degree of intention to participate in the conference that appears in the speech and behavior of the remote participant Adjustments can be made. That is, it is possible to adjust the degree of presenting one's presence to the other party.

＜第２態様＞
本態様のコミュニケーション補助装置は、第１拠点１１Ａの状況を表す第１拠点状況情報を推定する第１拠点状況推定部２５を備え、意思量算出部５５ｂは、第１拠点状況情報、第２拠点状況情報および第１参加者の感覚量の範囲に基づいて、第２参加者の意思量を算出することを特徴とする。
本態様によれば、第１拠点状況情報、第２拠点状況情報、第１参加者の感覚量の範囲に基づいて、遠隔地の参加者の言動に現れる会議への参加意思の程度を表す意思量を提示する度合いの調整を行うことができる。すなわち、相手に自分の存在を提示する度合いの調整を行うことができる。 <Second aspect>
The communication assistance device of this aspect includes a first site status estimation unit 25 that estimates first site status information representing the status of the first site 11A, and the intention amount calculation unit 55b includes the first site status information and the second site. The amount of intention of the second participant is calculated based on the situation information and the range of the sense amount of the first participant.
According to this aspect, the intention indicating the degree of willingness to participate in the conference that appears in the behavior of the remote participant based on the first site status information, the second site status information, and the range of the sense amount of the first participant The degree of presenting the amount can be adjusted. That is, it is possible to adjust the degree of presenting one's presence to the other party.

＜第３態様＞
本態様のコミュニケーション補助装置は、感覚量範囲算出部５５ａは、第１参加者が知覚可能な感覚量の分布を算出し、参加意思量算出部５５ｂは、第２拠点状況情報、第１参加者の感覚量の分布に基づいて、意思量を算出することを特徴とする。
本態様によれば、第２拠点状況情報、第１参加者の感覚量の分布に基づいて、遠隔地の参加者の言動に現れる会議への参加意思の程度を表す意思量を提示する度合いの調整を行うことができる。すなわち、相手に自分の存在を提示する度合いの調整を行うことができる。 <Third aspect>
In the communication assisting device of this aspect, the sensory amount range calculating unit 55a calculates the distribution of sensory amounts that can be perceived by the first participant, and the participation intention calculating unit 55b includes the second site situation information and the first participant. The intention amount is calculated based on the distribution of the sense amount.
According to this aspect, based on the second site situation information and the distribution of the sense amount of the first participant, the degree of presenting the amount of intention representing the degree of intention to participate in the conference that appears in the speech of the remote participant Adjustments can be made. That is, it is possible to adjust the degree of presenting one's presence to the other party.

＜第４態様＞
本態様のコミュニケーション補助装置は、第１拠点状況推定部２５は、第１拠点１１Ａの音声信号、及び映像信号を計測する第１音声映像計測手段と、第１拠点１１Ａの映像信号に基づいて第１特徴的動作情報を抽出する第１特徴的動作抽出手段と、第１拠点１１Ａの音声信号と映像信号に基づいて発言者情報と当該発言者の対話相手情報を特定する特定手段と、対話相手情報が第２参加者を表すか否かを判定する判定手段と、を備え、第１拠点状況推定部２５は、第１特徴的動作情報、発言者情報、及び対話相手情報を第１拠点状況情報として出力することを特徴とする。
本態様によれば、第１特徴的動作情報、発言者情報、及び対話相手情報を第１拠点状況情報として用いることで、遠隔地の参加者の言動に現れる会議への参加意思の程度を表す意思量を提示する度合いの調整を行うことができる。すなわち、相手に自分の存在を提示する度合いの調整を行うことができる。 <4th aspect>
In the communication assisting device of this aspect, the first site situation estimation unit 25 is based on the first audio / video measurement unit that measures the audio signal and the video signal of the first site 11A and the video signal of the first site 11A. A first characteristic action extracting means for extracting one characteristic action information; a specifying means for specifying the speaker information and the conversation partner information of the speaker based on the audio signal and the video signal of the first base 11A; Determination means for determining whether or not the information represents a second participant, and the first site situation estimation unit 25 receives the first characteristic motion information, the speaker information, and the conversation partner information as the first site situation. It is output as information.
According to this aspect, by using the first characteristic operation information, the speaker information, and the conversation partner information as the first site situation information, the degree of intention to participate in the conference that appears in the behavior of the remote participant is expressed. The degree of presenting the will can be adjusted. That is, it is possible to adjust the degree of presenting one's presence to the other party.

＜第５態様＞
本態様のコミュニケーション補助装置は、刺激出力部５７は、刺激を出力するように構成されたロボット部１７を備え、第１拠点状況推定部２５は、第１参加者とロボット部１７の間の距離を計測する測距手段を備え、距離を第１拠点状況情報として推定することを特徴とする。
本態様によれば、第１参加者とロボット部１７の間の距離を第１拠点状況情報として用いることで、遠隔地の参加者の言動に現れる会議への参加意思の程度を表す意思量を提示する度合いの調整を行うことができる。すなわち、相手に自分の存在を提示する度合いの調整を行うことができる。 <5th aspect>
In the communication assisting device of this aspect, the stimulus output unit 57 includes the robot unit 17 configured to output a stimulus, and the first site situation estimation unit 25 is a distance between the first participant and the robot unit 17. A distance measuring means for measuring the distance, and the distance is estimated as the first site status information.
According to this aspect, by using the distance between the first participant and the robot unit 17 as the first site situation information, the will amount representing the degree of intention to participate in the conference that appears in the speech and behavior of the remote participant. The degree of presentation can be adjusted. That is, it is possible to adjust the degree of presenting one's presence to the other party.

＜第６態様＞
本態様のコミュニケーション補助装置は、刺激出力部５７は、第１参加者が目視することにより感知可能な光波信号を発生する光波発生手段、第１参加者が聴取することにより感知可能な音波信号を発生する音波発生手段、第１参加者が接触することにより感知可能な振動を発生する振動発生手段、第１参加者が嗅取することにより感知可能な臭気を発生する臭気発生手段のうち、少なくとも１つであることを特徴とする。
本態様によれば、第１参加者が感知可能な光波信号、音波信号、振動、臭気のうち、少なくとも１つを発生することで、遠隔地の参加者の言動に現れる会議への参加意思の程度を表す意思量を提示することができる。すなわち、相手に自分の存在を提示する度合いの調整を行うことができる。 <Sixth aspect>
In the communication assisting device of this aspect, the stimulus output unit 57 generates a light wave signal that can be sensed by visual observation by the first participant, and a sound wave signal that can be sensed by listening to the first participant. At least of a sound wave generating means for generating, a vibration generating means for generating a perceivable vibration when the first participant contacts, and an odor generating means for generating a perceptible odor when the first participant sniffs It is characterized by being one.
According to this aspect, by generating at least one of a light wave signal, a sound wave signal, vibration, and odor that can be sensed by the first participant, the intention of participating in the conference that appears in the speech and behavior of the remote participant An intention amount representing the degree can be presented. That is, it is possible to adjust the degree of presenting one's presence to the other party.

＜第７態様＞
本態様のコミュニケーション補助装置は、第２拠点状況推定部４５は、第２拠点１１Ｂの状況として音声信号、及び映像信号を計測する第２音声映像計測手段と、所定時間内における第２拠点１１Ｂの音声信号が所定音量以下である場合に、第２拠点１１Ｂの映像信号に基づいて第２特徴的動作情報を抽出する第２特徴的動作抽出手段と、第２特徴的動作情報を第２拠点状況情報として第１拠点１１Ａ側に送信する送信手段と、を備えることを特徴とする。
本態様によれば、所定時間内における第２拠点１１Ｂの音声信号が所定音量以下である場合に、第２拠点１１Ｂの映像信号に基づいて第２特徴的動作情報を抽出するので、遠隔地の参加者の言動に現れる会議への参加意思の程度を表す意思量を算出することができる。 <Seventh aspect>
In the communication assistance device of this aspect, the second site situation estimation unit 45 includes a second audio / video measurement unit that measures an audio signal and a video signal as the status of the second site 11B, and the second site 11B within a predetermined time. When the audio signal is less than or equal to a predetermined volume, the second characteristic action extracting means for extracting the second characteristic action information based on the video signal of the second place 11B, and the second characteristic action information as the second condition Transmission means for transmitting information to the first base 11A side.
According to this aspect, the second characteristic operation information is extracted based on the video signal of the second base 11B when the audio signal of the second base 11B within a predetermined time is below the predetermined volume. It is possible to calculate the amount of intention representing the degree of intention to participate in the conference that appears in the behavior of the participant.

＜第８態様＞
本態様のコミュニケーション補助システムは、第１乃至６態様の何れか１つに記載のコミュニケーション補助装置を刺激量算出出力ユニット５０とし、第７態様に記載のコミュニケーション補助装置を第２拠点状況推定ユニット４０として備えることを特徴とする。
本態様によれば、刺激量算出出力ユニット５０と、第２拠点状況推定ユニット４０とを用いてコミュニケーション補助システムを構成することができる。 <Eighth aspect>
In the communication assistance system according to this aspect, the communication assistance apparatus according to any one of the first to sixth aspects is used as the stimulus amount calculation output unit 50, and the communication assistance apparatus according to the seventh aspect is the second site situation estimation unit 40. It is characterized by providing as.
According to this aspect, a communication assistance system can be configured using the stimulus amount calculation output unit 50 and the second site situation estimation unit 40.

＜第９態様＞
本態様のコミュニケーション補助方法は、第１拠点１１Ａに配置された第１会議端末１０Ａおよび第２拠点１１Ｂに配置された第２会議端末１０Ｂの間で通信ネットワークＮを経由したコミュニケーションを補助するコミュニケーション補助方法であって、第１拠点１１Ａには、推定された第２拠点１１Ｂの状況を表す第２拠点状況情報を受信する受信ステップＳ２００と、第１拠点１１Ａにいる第１参加者１３が知覚可能な感覚量の範囲を算出する感覚量範囲算出ステップＳ３００と、第２拠点状況情報と第１参加者１３の感覚量の範囲に基づいて、第２拠点１１Ｂにいる第２参加者１５の意思量を算出する意思量算出ステップＳ３００と、意思量に応じた刺激を第１参加者１３が知覚できるように出力する刺激出力ステップＳ４００と、を備えることを特徴とする。
本態様によれば、第２拠点状況情報、第１参加者の感覚量の範囲に基づいて、遠隔地の参加者の言動に現れる会議への参加意思の程度を表す意思量を提示する度合いの調整を行うことができる。すなわち、相手に自分の存在を提示する度合いの調整を行うことができる。 <Ninth aspect>
The communication assistance method of this aspect is a communication assistance that assists communication via the communication network N between the first conference terminal 10A arranged at the first base 11A and the second conference terminal 10B arranged at the second base 11B. In this method, the first base 11A can perceive the receiving step S200 for receiving the second base state information indicating the estimated state of the second base 11B and the first participant 13 in the first base 11A. The amount of intention of the second participant 15 at the second base 11B based on the sensory amount range calculating step S300 for calculating the range of the sensory amount, the second site status information and the range of the sensory amount of the first participant 13 An intention amount calculating step S300 for calculating the stimulus, a stimulus output step S400 for outputting the stimulus according to the intention amount so that the first participant 13 can perceive, Characterized in that it comprises.
According to this aspect, based on the second site status information and the range of the first participant's sense amount, the degree of presenting the intention amount indicating the degree of intention to participate in the conference that appears in the speech and behavior of the remote participant Adjustments can be made. That is, it is possible to adjust the degree of presenting one's presence to the other party.

＜第１０態様＞
本態様のプログラムは、第９態様の各ステップをプロセッサに実行させることを特徴とする。
本態様によれば、各ステップをＣＰＵに実行させることができるので、遠隔地の参加者の言動に現れる会議への参加意思の程度を表す意思量を提示する度合いの調整を行うことができる。 <10th aspect>
The program according to this aspect is characterized by causing a processor to execute each step according to the ninth aspect.
According to this aspect, since each step can be executed by the CPU, it is possible to adjust the degree of presenting the amount of intention representing the degree of intention to participate in the conference that appears in the speech and behavior of the remote participants.

４…撮像装置、５…マイク、６…スピーカ、７…ディスプレイ、８…処理装置、１０…テレビ会議端末、１１Ａ…第１拠点、１１Ｂ…第２拠点、１３…第１参加者、１４…第３参加者、１５…第２参加者、１７…アバターロボット、２０…第１拠点状況計測推定ユニット、２１…第１拠点状況計測部、２３…第１拠点状況取込部、２５…第１拠点状況推定部、２７…無線通信部、３０…中継ユニット、４０…第２拠点状況計測推定ユニット、４１…第２拠点状況計測部、４３…第２拠点状況取込部、４５…第２拠点状況推定部、５０…刺激量算出出力ユニット、５５…刺激量算出部、５５ａ…感覚量範囲算出部、５５ｂ…意思量算出部、５５ｃ…刺激量算出部、５７…刺激出力部、Ｎ…通信ネットワーク DESCRIPTION OF SYMBOLS 4 ... Imaging device, 5 ... Microphone, 6 ... Speaker, 7 ... Display, 8 ... Processing device, 10 ... Video conference terminal, 11A ... First base, 11B ... Second base, 13 ... First participant, 14 ... First 3 participants, 15 ... second participants, 17 ... avatar robot, 20 ... first site situation measurement estimation unit, 21 ... first site situation measurement unit, 23 ... first site situation capture unit, 25 ... first site Situation estimation unit, 27 ... wireless communication unit, 30 ... relay unit, 40 ... second site status measurement estimation unit, 41 ... second site status measurement unit, 43 ... second site status capture unit, 45 ... second site status Estimator 50 ... Stimulus amount calculation output unit 55 ... Stimulus amount calculation unit 55a ... Sense amount range calculation unit 55b ... Intention amount calculation unit 55c ... Stimulus amount calculation unit 57 ... Stimulus output unit N ... Communication network

特許第５２１１００１号Patent No. 521101

ｈｔｔｐ：／／ＷＷＷ．ｏｋｉ．ｃｏｍ／ｊｐ／ｏｔｒ／２００８／ｎ２１３／ｐｄｆ／２１３＿ｒ０６．ｐｄｆhttp: // WWW. oki. com / jp / otr / 2008 / n213 / pdf / 213_r06. pdf

Claims

A communication assistance device for assisting communication via a network between a first conference terminal arranged at a first site and a second conference terminal arranged at a second site,
In the first base,
Receiving means for receiving second site status information representing the estimated status of the second site;
A sensory amount range calculating means for calculating a range of a sensory amount perceivable by the first participant at the first base;
An intention amount calculating means for calculating an intention amount of the second participant at the second base based on the second base situation information and the range of the sensory amount of the first participant;
And a stimulus output means for outputting a stimulus according to the amount of intention so that the first participant can perceive the communication assist device.

First estimation means for estimating first site status information representing the status of the first site,
The intention amount calculating means includes:
2. The communication according to claim 1, wherein the amount of intention of the second participant is calculated based on the first location status information, the second location status information, and the range of the sensory amount of the first participant. Auxiliary device.

The sensory amount range calculating means calculates a distribution of sensory amounts perceivable by the first participant,
The communication assistance device according to claim 1, wherein the intention amount calculating unit calculates the intention amount based on the second site situation information and a distribution of a sense amount of the first participant.

The first estimating means includes
First audio and video measurement means for measuring the audio signal and video signal of the first base;
First characteristic action extracting means for extracting first characteristic action information based on the video signal of the first base;
Identifying means for identifying speaker information and conversation partner information of the speaker based on the audio signal and video signal of the first site;
Determination means for determining whether or not the conversation partner information represents the second participant,
The first estimating means includes
The communication assisting apparatus according to claim 1, wherein the first characteristic operation information, the speaker information, and the conversation partner information are output as the first site status information.

The stimulus output means includes
A robot unit configured to output the stimulus,
The first estimating means includes
Ranging means for measuring the distance between the first participant and the robot unit,
The communication assisting apparatus according to claim 1, wherein the distance is estimated as the first site status information.

The stimulus output means includes
A light wave generating means for generating a light wave signal that can be sensed by visual observation of the first participant;
Sound wave generating means for generating a sound wave signal that can be sensed by listening to the first participant
Vibration generating means for generating a perceptible vibration when the first participant contacts;
The communication assisting device according to claim 1, wherein the communication assisting device is at least one of odor generating means for generating a odor that can be sensed by smelling the first participant.

The second estimating means includes
A second audio / video measuring means for measuring an audio signal and a video signal as the status of the second base;
Second characteristic motion extraction means for extracting second characteristic motion information based on the video signal of the second site when the audio signal of the second site within a predetermined time is below a predetermined volume;
The communication auxiliary device according to claim 1, further comprising: a transmission unit configured to transmit the second characteristic operation information to the first site as the second site status information.

The communication auxiliary device according to any one of claims 1 to 6 is a first unit,
A communication support system comprising the communication support device according to claim 7 as a second unit.

A communication assistance method for assisting communication via a network between a first conference terminal arranged at a first site and a second conference terminal arranged at a second site,
In the first base,
A receiving step of receiving second site status information representing the estimated status of the second site;
A sensory amount range calculating step of calculating a range of a sensory amount perceivable by the first participant at the first base;
An intention amount calculating step for calculating an intention amount of the second participant at the second base based on the second base situation information and the range of the sensory amount of the first participant;
And a stimulus output step of outputting a stimulus according to the amount of intention so that the first participant can perceive.

A program for causing a processor to execute each step according to claim 9.