JP2011077839A

JP2011077839A - Information sharing system

Info

Publication number: JP2011077839A
Application number: JP2009227466A
Authority: JP
Inventors: Koichi Sugiyama; 晃一杉山
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 2009-09-30
Filing date: 2009-09-30
Publication date: 2011-04-14

Abstract

<P>PROBLEM TO BE SOLVED: To suppress the number of terminals possible to directly attain two-way communication of video or sound or the like with a server on the basis of user's utterance request detection and reduce transmission delay, communication failure, and server's load. <P>SOLUTION: A server is connected with a plurality of information terminals via network and the information terminals are grouped into one or plural terminal groups according to the predetermined set condition. One terminal group includes one representative terminal possible to directly receive shared information from the server and a participating terminal for receiving distribution of shared information from the representative terminal. The information terminal is provided with an utterance detection section for detecting an utterance request of users and an utterance request section for transmitting utterance request information to a server. The server is provided with a two-way communication control section that switches a communication state between the information terminal and the server to a two-way communication state possible to directly and mutually transmit and receive the shared information when the server receives the utterance request information. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

この発明は、情報共有システムに関し、特に、ネットワークを介して接続された複数の情報端末間で、共通の映像や音声等の情報を伝送する情報共有システムに関する。 The present invention relates to an information sharing system, and more particularly, to an information sharing system that transmits information such as common video and audio between a plurality of information terminals connected via a network.

従来からネットワークを介して、複数の情報端末間で映像や音声等の情報の伝送を行う通信システムや、専用のサーバ等を介して遠隔地にある複数の情報端末間で音声会議やビデオ会議を行う通信システムが利用されている。
たとえば、従来のビデオ会議システムでは、多地点接続装置（ＭＣＵ：Ｍｕｌｔｉ−ｐｏｉｎｔＣｏｎｔｒｏｌＵｎｉｔ）および、ＭＣＵ機能を有したサーバに、各地点ごとに設置された情報端末や参加者ごとの情報端末が、インターネット，ＬＡＮ，あるいは専用回線等のネットワークを介して接続され、情報端末毎に入力された映像，音声，文書などの情報がＭＣＵに集積され、ＭＣＵにおいて各情報のデコード（複号化），ミキシング（合成），エンコード（符号化）等の一連の処理が行われ、各情報端末に共通の情報として配信される。
一般的に、各情報端末を利用している会議への参加者は、共通の映像を見て、共通の音声を聞きながら会議を行うものとする。 Conventionally, a communication system for transmitting information such as video and audio between a plurality of information terminals via a network, a voice conference and a video conference between a plurality of information terminals at remote locations via a dedicated server, etc. A communication system is used.
For example, in a conventional video conference system, a multipoint connection unit (MCU: Multi-point Control Unit) and a server having an MCU function include an information terminal installed at each point and an information terminal for each participant. Connected via a network such as the Internet, LAN, or a dedicated line, information such as video, audio, and documents input for each information terminal is accumulated in the MCU, and each piece of information is decoded (decoded) and mixed in the MCU. A series of processes such as (combining) and encoding (encoding) are performed, and the information is distributed to each information terminal as common information.
In general, it is assumed that participants in a conference using each information terminal perform a conference while watching a common video and listening to a common voice.

また、特許文献１には、複数の会議端末を多地点接続装置（ＭＣＵ）を介して接続した多地点会議システムにおいて、多地点接続装置が会議への参加端末をグループ毎に管理して、会議端末から多地点接続装置への上り方向のデータ送信は、そのグループの代表者端末のみが行い、一方多地点接続装置から会議端末への下り方向のデータ送信は、全会議参加端末に行い、また、グループの代表者端末を切り替える場合には、会議端末からの上りセッションを設定するための接続要求信号を、ＭＣＵを介して交換することにより、同じグループ内の他会議端末に切り替えを行い、多地点会議装置が管理するセッション数を抑えるようにした多地点会議システムのセッション管理方法が提案されている。 Further, in Patent Document 1, in a multipoint conference system in which a plurality of conference terminals are connected via a multipoint connection device (MCU), the multipoint connection device manages the terminals participating in the conference for each group. Upstream data transmission from the terminal to the multipoint connection device is performed only by the representative terminal of the group, while downlink data transmission from the multipoint connection device to the conference terminal is performed to all conference participation terminals, and When switching the representative terminal of the group, the connection request signal for setting the uplink session from the conference terminal is exchanged via the MCU, thereby switching to another conference terminal in the same group. There has been proposed a session management method for a multipoint conference system in which the number of sessions managed by the point conference device is suppressed.

また、特許文献２には、通信元端末及び通信先相手に対応して通信セッションの重要度と復旧要否の情報を予め設定して、セッション重要度テーブルに記憶しておき、緊急セッションなどの重要度の高いセッションの接続要求を受信すると、既存セッションのうち重要度のもっとも低いものを選択して切断するとともに、切断された通信セッションの情報を、セッション復旧テーブルに記憶しておき、セッションリソースに空きができ次第、セッション復旧テーブル１４０に記憶された通信セッションの情報に従い、セッションを自動的に復旧させるゲートウェイ装置が提案されている。 Also, in Patent Document 2, information on the importance level of a communication session and information on necessity of recovery corresponding to a communication source terminal and a communication destination partner are set in advance and stored in a session importance level table. When a connection request for a session with high importance is received, the lowest priority of existing sessions is selected and disconnected, and information on the disconnected communication session is stored in the session recovery table, and session resources are stored. A gateway device has been proposed that automatically recovers a session according to information on a communication session stored in the session recovery table 140 as soon as a space is available.

また、特許文献３には、複数の会議端末の中に、他の会議端末から送信されてくる音声データのミキシングを行う機能を有するものが少なくとも１台含まれている場合には、遠隔会議の開催に先立って、多地点音声接続装置に、当該ミキシング機能を有する会議端末を当該多地点音声接続装置と直接通信セッションを確立する子端末として優先的に選択させ、遠隔会議へ参加する会議端末の数の増加に伴って、各会議端末から送信される音声データのミキシングを行う多地点音声接続装置に係る処理負荷が増加することを回避するようにした遠隔会議システムが提案されている。 In Patent Document 3, when at least one of a plurality of conference terminals having a function of mixing audio data transmitted from other conference terminals is included, Prior to the event, the multipoint audio connection device causes the conference terminal having the mixing function to be preferentially selected as a child terminal that establishes a direct communication session with the multipoint audio connection device, and the conference terminal participating in the remote conference There has been proposed a remote conference system that avoids an increase in processing load associated with a multipoint audio connection device that mixes audio data transmitted from each conference terminal as the number increases.

特開２００５−８０２１２号公報JP 2005-80212 A 特開２００９−４９５５０号公報JP 2009-49550 A 特開２００９−１００１５４号公報JP 2009-1000015 A

しかし、従来のＭＣＵを利用した多地点会議システムでは、ＭＣＵに同時に接続される情報端末の数が増加すればするほど、ＭＣＵと各情報端末間の通信回線で送受信されるトラフィックが増加するので、情報伝送の遅延や、ノイズの発生による通信エラー等の通信障害が起こりやすくなり、円滑な会議の進行ができなくなる場合もある。
さらに、ＭＣＵでは、接続される情報端末が増加すればするほど、エンコード（符号化）、デコード（複号化）等の処理をすべき情報量が増加するので、ＭＣＵにかかる負荷が大きくなり、それにより各情報端末への情報伝送に遅延をもたらすことになる。 However, in a multipoint conference system using a conventional MCU, as the number of information terminals simultaneously connected to the MCU increases, traffic transmitted and received on the communication line between the MCU and each information terminal increases. Communication failures such as information transmission delays and communication errors due to noise are likely to occur, and smooth conferences may not be possible.
Furthermore, in the MCU, as the number of connected information terminals increases, the amount of information to be processed such as encoding (encoding) and decoding (decoding) increases, so the load on the MCU increases, This causes a delay in information transmission to each information terminal.

また、特許文献１に記載されたシステムでは、ＭＣＵと各会議端末との間のセッション数を抑えることができるが、ＭＣＵから会議端末への下り方向のデータ送信は、ＭＣＵからそれぞれの会議端末に対して行われるため、会議端末数が増加すれば、この下り方向のデータ送信にかかるトラフィックが増加し、ＭＣＵ側で処理すべき情報が増加するので、情報伝送の遅延等の問題が発生しやすくなり、ＭＣＵの負荷も大きくなる。 Moreover, in the system described in Patent Document 1, the number of sessions between the MCU and each conference terminal can be suppressed, but downlink data transmission from the MCU to the conference terminal is performed from the MCU to each conference terminal. Therefore, if the number of conference terminals increases, traffic for this data transmission in the downstream direction increases and information to be processed on the MCU side increases, so problems such as information transmission delay are likely to occur. As a result, the load on the MCU also increases.

また、特許文献２のゲートウェイ装置では、予め規定された最大セッション数の範囲内で、通信セッションの重要度に対応してセッションの接続と切断を切り替えているので、その最大セッション数を越えるような情報端末が接続された場合には、単にセッションの接続と切断を切り替える処理をするだけでは、要求される会議を円滑にできない場合がある。 Moreover, in the gateway apparatus of patent document 2, since connection / disconnection of a session is switched according to the importance of a communication session within the range of the predetermined maximum number of sessions, the maximum number of sessions may be exceeded. When an information terminal is connected, there may be a case where a requested conference cannot be smoothly performed by simply switching between session connection and disconnection.

また、特許文献３に記載されたシステムでは、多地点音声接続装置に対する会議端末の接続形態を、ツリー構造としているので、多地点音声接続装置にかかる処理負荷の増加が回避できたとしても、ミキシング機能を有する会議端末を少なくとも１つ以上必要とし、また、ミキシング機能を有する会議端末に接続される会議端末の数が増加すれば、そのミキシング機能を有する会議端末の処理負担が増加し、音声データの遅延等が発生しやすくなり、円滑な会議の進行ができなくなるおそれがある。 Further, in the system described in Patent Document 3, since the connection form of the conference terminal with respect to the multipoint audio connection apparatus has a tree structure, even if an increase in processing load on the multipoint audio connection apparatus can be avoided, mixing is performed. If at least one conference terminal having a function is required and the number of conference terminals connected to the conference terminal having a mixing function increases, the processing load on the conference terminal having the mixing function increases, and audio data There is a risk that a delay in the meeting will occur and smooth progress of the conference may not be possible.

そこで、この発明は、以上のような事情を考慮してなされたものであり、サーバに接続されている複数の情報端末のうち、発言要求をした利用者の情報端末を、優先的にサーバと直接双方向通信できる状態にして、実質的にサーバと直接情報通信を行う情報端末の数を抑制して、情報伝送の遅延および、通信エラー等の通信障害の発生の減少、サーバが処理する情報量の低減によるサーバの負荷の軽減を図ることのできる情報共有システムを提供することを課題とする。 Therefore, the present invention has been made in consideration of the above circumstances, and among the plurality of information terminals connected to the server, the information terminal of the user who has requested to speak is preferentially designated as the server. Information that is processed by the server, reducing the number of information terminals that can communicate directly with the server and reducing the number of information terminals that are in direct communication with the server and reducing the occurrence of communication failures such as communication errors. It is an object of the present invention to provide an information sharing system that can reduce the load on the server by reducing the amount.

この発明は、サーバと、複数の情報端末とがネットワークを介して接続され、前記情報端末において入力された端末情報をサーバに送信し、前記サーバが、各情報端末から送信された端末情報を用いて共有情報を生成し、前記共有情報を前記情報端末へ配信する情報共有システムであって、前記複数の情報端末が、予め所定の設定条件に基づいて、１または複数個の端末群にグループ化され、１つの端末群は、前記サーバから前記共有情報を直接受信可能な１つの代表端末と、前記代表端末から前記共有情報の配信を受ける参加端末とを含み、前記各情報端末が、利用者が発言要求をしたことを検知する発言検知部と、前記発言要求をしたことを示す情報をサーバへ送信する発言要求部と、前記サーバからの通知によって、情報端末の動作状態を変更する端末モード切替部とを備え、前記サーバが、前記情報端末から前記発言要求情報を受信した場合に、その情報端末とサーバとの間の通信状態を、前記端末情報と前記共有情報とを相互に直接送受信可能な双方向通信状態に切り替える双方向通信制御部を備え、前記サーバと双方向通信状態となり利用者の発言が許可される情報端末は、同時には１つだけ存在するように、サーバが管理することを特徴とする情報共有システムを提供するものである。
これにより、サーバに直接接続され双方向通信状態となる情報端末の数が増加することを抑制しているので、情報伝送遅延を低減し、通信エラー等による通信障害を低減させ、サーバで行う情報配信処理にかかる負荷を軽減させることができる。 In the present invention, a server and a plurality of information terminals are connected via a network, terminal information input in the information terminal is transmitted to the server, and the server uses terminal information transmitted from each information terminal. The information sharing system generates shared information and distributes the shared information to the information terminals, wherein the plurality of information terminals are grouped into one or a plurality of terminal groups based on predetermined setting conditions in advance. One terminal group includes one representative terminal that can directly receive the shared information from the server, and a participating terminal that receives distribution of the shared information from the representative terminal. Each information terminal is a user The speech detection unit that detects that the speech request has been made, the speech request unit that transmits information indicating that the speech request has been made to the server, and the operation status of the information terminal according to the notification from the server A terminal mode switching unit that changes the communication status between the information terminal and the server when the server receives the message request information from the information terminal, and the terminal information and the shared information. A two-way communication control unit that switches to a two-way communication state capable of direct transmission / reception between each other, and there is only one information terminal that is in a two-way communication state with the server and is allowed to speak by a user at the same time. An information sharing system managed by a server is provided.
This suppresses an increase in the number of information terminals that are directly connected to the server and enter a two-way communication state, thereby reducing information transmission delays, reducing communication failures due to communication errors, etc. The load on the distribution process can be reduced.

また、前記発言要求をした情報端末であって、サーバとの通信状態が双方向通信状態となっている第１の情報端末が存在する場合に、第２の情報端末の発言検知部が新たな発言要求を検知した場合、前記第２の情報端末の発言要求部は、その新たな発言要求を示す情報をサーバへ送信し、前記サーバが、第２の情報端末からの発言要求情報を受信した場合に、前記双方向通信制御部が、前記第１の情報端末との双方向通信状態を解除し、新たな発言要求をした前記第２の情報端末とサーバとの間の通信状態を、双方向通信状態に切り替えることを特徴とする。
これによれば、新たな発言要求をした情報端末が出現しても、双方向通信状態の情報端末の数が増加することはなく、情報伝送遅延や通信障害が増加することもなく、サーバの負荷も増加することがない。 In addition, when there is a first information terminal that has made a request for speech and the communication state with the server is a two-way communication state, the speech detection unit of the second information terminal has a new one. When a speech request is detected, the speech request unit of the second information terminal transmits information indicating the new speech request to the server, and the server receives the speech request information from the second information terminal. The bidirectional communication control unit cancels the bidirectional communication state with the first information terminal and sets both the communication state between the second information terminal and the server that has made a new speech request. It is characterized by switching to a bidirectional communication state.
According to this, even if an information terminal that makes a new request for speech appears, the number of information terminals in the two-way communication state does not increase, information transmission delay and communication failure do not increase, and the server The load will not increase.

さらに、前記利用者による発言要求の検知が、ほぼ同時に複数の情報端末の発言検知部で行われた場合、前記サーバの双方向通信制御部は、その複数の情報端末からそれぞれ送信されてきた発言要求情報のうち、最も早く受信した発言要求情報を送信してきた情報端末との通信状態を、双方向通信状態に切り替えるか、あるいは、受信した複数の発言要求情報を確認し、最も早い時刻に利用者の発言要求が検知されたと判断される情報端末との通信状態を、双方向通信状態に切り替えることを特徴とする。
これにより、利用者の発言権の制御を適切にすることができ、円滑な会議の進行ができる。 Further, when the speech request is detected by the user at the speech detection units of the plurality of information terminals almost simultaneously, the bidirectional communication control unit of the server transmits the messages transmitted from the plurality of information terminals, respectively. Of the request information, switch the communication state with the information terminal that sent the earliest received speech request information to the two-way communication state, or check multiple received speech request information and use it at the earliest time The communication state with the information terminal that is determined to have detected the user's speech request is switched to the bidirectional communication state.
Thereby, control of a user's right to speak can be made appropriate and a smooth conference can proceed.

また、前記情報端末は、利用者の操作による入力信号を入力する入力部と、カメラにより撮影された映像を入力する映像入力部と、マイクから音声を入力する音声入力部とをさらに備え、前記発言検知部が、前記発言要求を意味する入力信号、発言要求を意味する入力音声、または発言要求を意味する入力映像のいずれか１つ以上を検出した場合に、利用者が発言要求をしたと判断することを特徴とする。
これによれば、利用者の発言要求を適格に検知できる。 The information terminal further includes an input unit that inputs an input signal by a user operation, a video input unit that inputs video captured by a camera, and an audio input unit that inputs audio from a microphone. When the speech detection unit detects at least one of the input signal indicating the speech request, the input voice indicating the speech request, or the input video indicating the speech request, the user makes a speech request. It is characterized by judging.
According to this, a user's speech request can be detected appropriately.

また、前記グループ化するための前記設定条件は、各情報端末が配置される接続拠点情報であり、前記サーバは、情報端末毎に前記接続拠点情報を記憶した記憶部を備えることを特徴とする。ここで、接続拠点情報とは、地名や、後述する拠点ＩＤ等を意味する。 Further, the setting condition for grouping is connection base information in which each information terminal is arranged, and the server includes a storage unit that stores the connection base information for each information terminal. . Here, the connection base information means a place name, a base ID described later, and the like.

また、前記１つの端末群に属する代表端末は、サーバからストリーミング配信により前記共有情報を受信することを特徴とする。これにより、サーバから情報端末へ、共有情報の片方向通信をすることができる。
前記１つの端末群に属する代表端末は、その端末群に属する前記参加端末に対し、サーバから受信した共有情報を、ストリーミング配信することを特徴とする。
前記１つの端末群に属する代表端末は、その端末群には属さない他の情報端末へも、前記サーバから受信した共有情報を、ストリーミング配信することを特徴とする。これにより、サーバの情報配信にかかる負荷をより軽減させることができる。 The representative terminal belonging to the one terminal group receives the shared information from the server by streaming delivery. Thereby, one-way communication of shared information can be performed from the server to the information terminal.
The representative terminal belonging to the one terminal group performs streaming distribution of the shared information received from the server to the participating terminals belonging to the terminal group.
The representative terminal belonging to the one terminal group performs streaming delivery of the shared information received from the server to other information terminals not belonging to the terminal group. Thereby, the load concerning the information delivery of a server can be reduced more.

前記グループ化された１つの端末群が、単一の情報端末から構成されている場合、その情報端末は自らが代表端末となり、その端末群は参加端末を含まない端末群であることを特徴とする。これにより、他の情報端末と同じ端末群にグループ化されなかった単一の情報端末でも、代表端末となることにより、サーバから共有情報を受信できる。 When the grouped one terminal group is composed of a single information terminal, the information terminal itself is a representative terminal, and the terminal group is a terminal group that does not include a participating terminal. To do. Accordingly, even a single information terminal that is not grouped into the same terminal group as other information terminals can receive shared information from the server by becoming a representative terminal.

また、この発明は、サーバと、複数の情報端末とがネットワークを介して接続され、前記情報端末において入力された端末情報をサーバに送信し、前記サーバが、各情報端末から送信された端末情報を用いて共有情報を生成し、前記共有情報を前記情報端末へ配信する情報共有システムの情報共有方法であって、前記複数の情報端末が、予め所定の設定条件に基づいて、１または複数個の端末群にグループ化され、１つの端末群は、前記サーバから前記共有情報を直接受信可能な１つの代表端末と、前記代表端末から前記共有情報の配信を受ける参加端末とを含む場合に、前記情報端末が、利用者が発言要求をしたことを検知する発言検知ステップと、前記発言要求をしたことを示す情報をサーバへ送信する発言要求ステップと、前記サーバが、前記情報端末から前記発言要求情報を受信した場合に、その情報端末とサーバとの間の通信状態を、前記端末情報と前記共有情報を相互に直接送受信可能な双方向通信状態に切り替え、サーバと双方向通信状態となり利用者の発言が許可される情報端末は、同時には１つだけ存在するように、管理する通信制御ステップと、前記サーバからの通知によって、発言要求を検知した情報端末の通信状態を双方向通信状態に変更する端末モード切替ステップとを備えたことを特徴とする情報共有方法を提供するものである。
さらに、この発明は、コンピュータに、前記情報共有方法の各ステップを実行させるための情報共有システムのプログラムを提供するものである。 In the present invention, a server and a plurality of information terminals are connected via a network, terminal information input in the information terminal is transmitted to the server, and the server transmits terminal information transmitted from each information terminal. An information sharing method of an information sharing system that generates shared information using a message and distributes the shared information to the information terminal, wherein the plurality of information terminals are preliminarily set based on a predetermined setting condition. When one terminal group includes one representative terminal that can directly receive the shared information from the server and a participating terminal that receives distribution of the shared information from the representative terminal, A message detection step in which the information terminal detects that a user has requested a message; a message requesting step in which information indicating that the user has requested the message is transmitted to a server; and the server When the request information is received from the information terminal, the communication state between the information terminal and the server is switched to a bidirectional communication state in which the terminal information and the shared information can be directly transmitted and received, The information terminal that is in the two-way communication state and is allowed to speak by the user at the same time is controlled by the communication control step to be managed and the information terminal that has detected the speech request by the notification from the server. An information sharing method comprising a terminal mode switching step for changing a communication state to a two-way communication state is provided.
Furthermore, the present invention provides a program of an information sharing system for causing a computer to execute each step of the information sharing method.

この発明によれば、サーバと双方向通信状態となり、利用者の発言が許可される情報端末を、同時には１つだけ存在するようにしているので、サーバと情報端末間で送受信される情報量を比較的少なくすることができ、それによって情報の伝送遅延を低減させ、また通信エラー等による通信障害を低減させることができ、サーバと直接双方向通信を行う情報端末の数を抑制することにより、サーバの情報配信処理にかかる負荷を軽減させることができる。 According to the present invention, since only one information terminal that is in a two-way communication state with the server and is allowed to speak by a user exists at the same time, the amount of information transmitted and received between the server and the information terminal By reducing the number of information terminals that perform two-way communication directly with the server, it is possible to reduce the transmission delay of information and thereby reduce communication failures due to communication errors, etc. The load on the server information distribution process can be reduced.

本発明の情報共有システムの一実施例の概略構成図である。It is a schematic block diagram of one Example of the information sharing system of this invention. 本発明の情報端末の一実施例の構成ブロック図である。It is a block diagram of the configuration of an embodiment of the information terminal of the present invention. 本発明のサーバの一実施例の構成ブロック図である。It is a block diagram of the configuration of an embodiment of the server of the present invention. 第１実施例のサーバ端末間通信状態（実施例１−１）の説明図である。It is explanatory drawing of the communication state between server terminals (Example 1-1) of 1st Example. 第１実施例のサーバ端末間通信状態（実施例１−２）の説明図である。It is explanatory drawing of the communication state between server terminals (Example 1-2) of 1st Example. 第１実施例のサーバ端末間通信状態（実施例１−３）の説明図である。It is explanatory drawing of the communication state between server terminals (Example 1-3) of 1st Example. 第１実施例のサーバ端末間通信状態（実施例１−４）の説明図である。It is explanatory drawing of the communication state between server terminals (Example 1-4) of 1st Example. 第１実施例のサーバ端末間通信状態（実施例１−５）の説明図である。It is explanatory drawing of the communication state between server terminals (Example 1-5) of 1st Example. 第１実施例のサーバ端末間通信状態（実施例１−６）の説明図である。It is explanatory drawing of the communication state between server terminals (Example 1-6) of 1st Example. サーバの記憶部に格納されるデータベースの説明図である。It is explanatory drawing of the database stored in the memory | storage part of a server. 第１実施例で用いられる端末データベースの変化の説明図である。It is explanatory drawing of the change of the terminal database used by 1st Example. 第１実施例で用いられる端末データベースの変化の説明図である。It is explanatory drawing of the change of the terminal database used by 1st Example. 第２実施例のサーバ端末間通信状態（実施例２−１）の説明図である。It is explanatory drawing of the communication state between server terminals (Example 2-1) of 2nd Example. 第２実施例のサーバ端末間通信状態（実施例２−２）の説明図である。It is explanatory drawing of the communication state between server terminals of Example 2 (Example 2-2). 第２実施例のサーバ端末間通信状態（実施例２−３）の説明図である。It is explanatory drawing of the communication state between server terminals (Example 2-3) of 2nd Example. 第２実施例のサーバ端末間通信状態（実施例２−４）の説明図である。It is explanatory drawing of the communication state between server terminals (Example 2-4) of 2nd Example. 第２実施例のサーバ端末間通信状態（実施例２−５）の説明図である。It is explanatory drawing of the communication state between server terminals (Example 2-5) of 2nd Example. 第２実施例のサーバ端末間通信状態（実施例２−６）の説明図である。It is explanatory drawing of the communication state between server terminals (Example 2-6) of 2nd Example. 第２実施例で用いられる端末データベースの変化の説明図である。It is explanatory drawing of the change of the terminal database used in 2nd Example. 第２実施例で用いられる端末データベースの変化の説明図である。It is explanatory drawing of the change of the terminal database used in 2nd Example. 端末におけるモード切替処理のフローチャートである。It is a flowchart of the mode switching process in a terminal. 代表端末におけるストリーミング配信処理のフローチャートである。It is a flowchart of the streaming delivery process in a representative terminal. サーバにおける端末状態検出制御処理のフローチャートである。It is a flowchart of the terminal state detection control process in a server. 端末における発言検知要求処理のフローチャートである。It is a flowchart of the speech detection request process in a terminal. サーバにおける発言端末検出制御処理のフローチャートである。It is a flowchart of the speech terminal detection control process in a server. サーバと端末間の通信シーケンス（端末モード切替）の説明図である。It is explanatory drawing of the communication sequence (terminal mode switching) between a server and a terminal. サーバと端末間の通信シーケンス（発言検知制御）の説明図である。It is explanatory drawing of the communication sequence (utterance detection control) between a server and a terminal. 端末の動作モードと、サーバと端末間の通信状態の説明図である。It is explanatory drawing of the operation mode of a terminal, and the communication state between a server and a terminal.

以下、図面を使用して本発明の実施の形態を説明する。なお、以下の実施例の記載によって、この発明が限定されるものではない。
＜この発明の情報共有システムの構成＞
図１に、この発明の情報共有システムの一実施例の概略構成図を示す。
ここでは、映像，音声，文書などの情報を共有して、複数の情報端末間で画像音声等による会議を行う情報共有システム（以下、会議システムとも呼ぶ）について、説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings. In addition, this invention is not limited by description of the following examples.
<Configuration of Information Sharing System of the Invention>
FIG. 1 shows a schematic configuration diagram of an embodiment of an information sharing system according to the present invention.
Here, an information sharing system (hereinafter also referred to as a conference system) in which information such as video, audio, and documents is shared and a conference is performed between a plurality of information terminals using image audio and the like will be described.

図１において、本発明の情報共有システムは、主として、１台のサーバ１と、複数台の情報端末（以下、会議端末、または単に端末とも呼ぶ）２とから構成される。
サーバ１と、各情報端末２とは、ネットワークを介して接続される。
サーバ１は、従来の会議システムでも利用されている会議制御機能のうち、少なくとも多地点接続装置（ＭＣＵ）と同等の機能を有する装置であり、主として、接続された端末の状態監視、接続制御、各種情報の通信制御、映像や音声の符号化、映像や音声の復号化処理、ユーザ認証、機器認証、通信暗号化等を行うものである。
また、端末２は、会議に参加する利用者が在席する部屋（オフィスや会議室など）に設置される装置であり、主として、映像および音声による利用者自身の認識処理、端末の動作状態や入力操作の確認処理、入力される各種情報の加工変換や通信制御処理を行うものである。 In FIG. 1, the information sharing system of the present invention is mainly composed of one server 1 and a plurality of information terminals (hereinafter also referred to as conference terminals or simply terminals) 2.
The server 1 and each information terminal 2 are connected via a network.
The server 1 is a device having at least a function equivalent to that of a multipoint connection device (MCU) among conference control functions also used in a conventional conference system, and mainly monitors the status of connected terminals, connection control, Various types of information communication control, video and audio encoding, video and audio decoding processing, user authentication, device authentication, communication encryption, and the like are performed.
The terminal 2 is a device installed in a room (such as an office or a conference room) in which a user who participates in a conference is present, and mainly recognizes the user himself / herself through video and audio, It performs input operation confirmation processing, processing conversion of various input information, and communication control processing.

情報端末２では、会議に必要な情報が入力され、入力された情報はサーバに送信される。入力される端末情報としては、後述するが、主として、キーボードやマウス等の入力信号、マイクからの入力音声、カメラからの入力映像である。これらの入力された情報は、符号化（エンコード）処理をした後、サーバに送信してもよい。また、通信経路における機密性、秘匿性を考慮し、暗号化処理を行ってもよい。
サーバ１は、上記のような端末から送信された端末情報を用いて共有情報を生成し、この共有情報を、各情報端末へ配信する。 In the information terminal 2, information necessary for the conference is input, and the input information is transmitted to the server. As will be described later, the input terminal information is mainly input signals such as a keyboard and a mouse, input sound from a microphone, and input video from a camera. The input information may be transmitted to the server after being encoded. In addition, encryption processing may be performed in consideration of confidentiality and confidentiality in the communication path.
The server 1 generates shared information using the terminal information transmitted from the terminal as described above, and distributes the shared information to each information terminal.

サーバ１と端末２（ＴＥ１〜ＴＥｎ）との接続は、たとえば有線の通信ケーブルによって直接接続してもよく、また、いわゆる無線ＬＡＮ等の無線通信方式によって接続してもよい。
また、各種のネットワーク（ＬＡＮ（Ｎ１），インターネット（Ｎ２），専用通信網（Ｎ３）など）を介して、サーバ１と端末２とを接続してもよい。
ある端末（ＴＥ１）を利用する利用者が会議に参加する場合、その端末ＴＥ１とサーバ１との間の通信回線を接続し、リンクを確立させた後、端末ＴＥ１に付属したカメラやマイクからその利用者の画像や音声あるいは会議に必要な文書や図面データなどを、サーバ１に送信する。
このとき、通信路における伝送量軽減や、リアルタイム（低遅延）通信を行うために、所定の規格のコーデック（例えば、映像であればＭＰＥＧ４、Ｈ．２６３、Ｈ．２６４、音声であればＧ．７１１、Ｇ．７２２、Ｇ．７２９等）を利用して、送信されるデータ量を圧縮符号化する処理（エンコード）が行われる。 The server 1 and the terminals 2 (TE1 to TEn) may be connected directly by, for example, a wired communication cable, or may be connected by a wireless communication method such as a so-called wireless LAN.
Further, the server 1 and the terminal 2 may be connected via various networks (LAN (N1), Internet (N2), dedicated communication network (N3), etc.).
When a user who uses a certain terminal (TE1) participates in the conference, the communication line between the terminal TE1 and the server 1 is connected and the link is established, and then the camera or microphone attached to the terminal TE1 is used. The user's image and sound, or documents and drawing data necessary for the conference are transmitted to the server 1.
At this time, a codec of a predetermined standard (for example, MPEG4, H.263, H.264 for video, and G. for audio for video transmission reduction and real-time (low delay) communication). 711, G.722, G.729, etc.), a process (encoding) for compressing and encoding the amount of data to be transmitted is performed.

サーバ１では、端末ＴＥ１から送られてきた映像，音声等の端末情報と、その他の端末２（ＴＥ２，…ＴＥｎ）から送られてきた同様の情報を復号化（デコード）した後、合成（ミキシング）する処理を行う。この合成処理を行った情報（合成情報）が、共有情報となる。共有情報は、現在接続が確立されている複数の端末２に向かって、配信される。
サーバ１では、受信した複数の映像のミキシングを行う前に、各端末２から送られてきた映像の復号化処理（デコード）を行い、さらに画像サイズの調整（リサイズ）を行い、その後１つの共有情報として配信するために複数の映像を合成する処理が行われる。
たとえば、４人の利用者が会議に参加していたとすると、４台の端末２から送信されてきた各利用者の人物画像の映像データをそれぞれデコードし、１画面に分割表示できるようにリサイズし、４つの映像を合成（ミキシング）する。
さらに、ミキシングした後の合成映像は、サーバで符号化処理（エンコード）が行われた後、共有情報としてサーバ１から端末２へ送信される。 The server 1 decodes (decodes) the terminal information such as video and audio sent from the terminal TE1 and similar information sent from the other terminals 2 (TE2,... TEn), and then synthesizes (mixing). ) Is performed. Information (composition information) on which this composition processing has been performed becomes shared information. The shared information is distributed toward a plurality of terminals 2 that are currently connected.
In the server 1, before mixing a plurality of received videos, the video sent from each terminal 2 is decoded (decoded), further adjusted in image size (resized), and then shared by one user. A process of synthesizing a plurality of videos for distribution as information is performed.
For example, if four users are participating in a conference, the video data of each person's person image transmitted from the four terminals 2 is decoded and resized so that it can be divided and displayed on one screen. Four images are combined (mixed).
Furthermore, the mixed video after mixing is encoded (encoded) by the server, and then transmitted from the server 1 to the terminal 2 as shared information.

このようなミキシングされた共有情報が送信されてきた各端末２では、受信情報の復号化処理（デコード）が行われ、その端末に付属した表示装置やスピーカから、画像や音声が出力される。
たとえば、現在接続が確立されている端末２が４台であれば、ミキシングされた後の共有情報は、サーバ１から時分割多重化処理等を用いてリアルタイムで、４つの端末にそれぞれ配信される。 In each terminal 2 to which such mixed shared information has been transmitted, received information is decoded (decoded), and an image or sound is output from a display device or speaker attached to the terminal.
For example, if there are four terminals 2 that are currently connected, the shared information after mixing is distributed from the server 1 to each of the four terminals in real time using a time division multiplexing process or the like. .

しかし、接続が確立されている端末数がさらに増加した場合、たとえばその端末数が２０台の場合、円滑に会議を進行させるためには、２０台のそれぞれの端末２で、同じようにリアルタイムで、共有情報の画像や文書を見て、共有情報の音声を開く必要があるので、サーバ１はより高速に情報の配信処理を行う必要がある。 However, when the number of terminals that have been established further increases, for example, when the number of terminals is 20, in order to make the conference proceed smoothly, each of the 20 terminals 2 is similarly real-time. Since it is necessary to open the sound of the shared information by looking at the image or document of the shared information, the server 1 needs to perform information distribution processing at a higher speed.

一般に、通信回線として電話回線を用いたＡＤＳＬやＩＳＤＮによる通信、光ファイバーを用いたＦＴＴＨによる通信、無線通信、その他のより高速の専用回線を用いた通信が利用できるが、いずれも回線そのものの通信速度や一定時間に伝送できる通信量に限界があるので、円滑な会議の進行にほとんど問題がなく、情報伝送の遅延が問題とならずに、リアルタイムで情報配信ができる最大端末数も、利用する通信回線の太さ（通信帯域）や、通信に用いられる映像や音声のデータ量および、端末そのもののＣＰＵ性能等によって上限値が決まってくる。 In general, ADSL or ISDN communication using a telephone line as a communication line, FTTH communication using an optical fiber, wireless communication, or other communication using a higher-speed dedicated line can be used. Because there is a limit to the amount of communication that can be transmitted in a certain period of time, there is almost no problem in smooth conference proceedings, and the maximum number of terminals that can deliver information in real time without causing problems in information transmission is also used. The upper limit is determined by the line thickness (communication bandwidth), the amount of video and audio data used for communication, the CPU performance of the terminal itself, and the like.

この発明は、情報共有システムの会議に参加可能な端末が非常に多数存在する場合でも、サーバ１に対して同時に接続が確立され、サーバから直接的に共有情報の配信が行われる端末の実質的な数をできるだけ減少させて、リアルタイム性を保持した情報の配信を行い、円滑な会議の進行をさまたげることのないようにする。 In the present invention, even when there are a large number of terminals that can participate in a conference of the information sharing system, a connection is established to the server 1 at the same time, and the shared information is distributed directly from the server. The number is reduced as much as possible, and information that maintains real-time characteristics is distributed so that smooth progress of the conference is not hindered.

この発明では、上記のようなサーバに接続される実質的な端末数の減少は、次のような２つの処理によって実現させることを特徴とする。 The present invention is characterized in that the substantial reduction in the number of terminals connected to the server as described above is realized by the following two processes.

（１）端末のグループ化によるサーバから見た接続端末数の減少化
同一ルータに接続されたネットワーク上に複数の端末が存在する場合など、複数の端末が属するグループを設定し、そのグループの中の１台の端末を代表端末に設定し、サーバと直接通信する端末を代表端末のみに制限することにより、サーバと直接共有情報の通信を行う端末の数を減少させる。
１つのグループ内に属する代表端末以外の端末は、原則として、共有情報を受信した代表端末から、その受信情報を配信してもらうようにする。 (1) Reduction of the number of connected terminals as seen from the server by grouping of terminals When there are multiple terminals on a network connected to the same router, set a group to which multiple terminals belong, Is set as a representative terminal, and the number of terminals directly communicating with the server is reduced by limiting the number of terminals directly communicating with the server to only the representative terminal.
In principle, terminals other than the representative terminal belonging to one group are allowed to receive the received information from the representative terminal that has received the shared information.

ここで、グループ化とは、ネットワークに接続される複数の情報端末を、予め所定の設定条件に基づいて、１または複数個の端末群に分割することを意味する。
また、１つの端末群は、１つの端末のみからなるものや、２つ以上の端末を含むものが存在し、原則として、１つの端末群の中には、サーバから共有情報を直接受信可能な１つの代表端末を有するものとする。
また、１つの端末群に複数の端末が属している場合には、上記のような１つの代表端末と、この代表端末から共有情報の配信を受ける参加端末とが含まれる。
ただし、後述するように、単一の端末から構成される端末群の場合、その単一の端末は自らが代表端末となる場合があり、このとき、その端末群は、参加端末を含まない端末群である。
また、そのような単一の端末は、代表端末とならない場合もあり、このとき、他の端末群に属する代表端末から共有情報の配信を受けるようにしてもよい。 Here, grouping means dividing a plurality of information terminals connected to the network into one or a plurality of terminal groups based on predetermined setting conditions in advance.
One terminal group includes only one terminal or includes two or more terminals. In principle, one terminal group can directly receive shared information from a server. Assume that there is one representative terminal.
Further, when a plurality of terminals belong to one terminal group, one representative terminal as described above and a participating terminal that receives distribution information from the representative terminal are included.
However, as will be described later, in the case of a terminal group composed of a single terminal, the single terminal itself may be a representative terminal. At this time, the terminal group does not include a participating terminal. A group.
In addition, such a single terminal may not be a representative terminal, and at this time, the shared information may be received from a representative terminal belonging to another terminal group.

また、グループ化するための設定条件は、種々のものが考えられ、１つに限定されるものではない。たとえば各情報端末が配置される接続拠点情報を用いることができる。接続拠点情報としては、たとえば、都道府県市町村などの地名，郵便番号，住所，電話番号，企業名，事業所名，団体名および、これらに準じた特定の数値コードなどがあり、後述する拠点ＩＤがこれに相当する。
拠点ＩＤが設定条件として用いられる場合、同一の拠点ＩＤを持つ端末が、１つの端末群として、グループ化される。この地理的拠点情報は、各端末ごとに、サーバの記憶部に予め記憶される。
また、設定条件は、予め固定的に決めておいてもよいが、システムの管理者や利用者が、会議を始める前に設定変更できるようにしてもよい。
また、本稿では記載しないが、端末のＩＰアドレスにおける特定数字列部分、すなわちドメインおよびネットワークセグメント情報等を用いて同一拠点であることを判断するようにしても良い。 Various setting conditions for grouping are conceivable and are not limited to one. For example, connection base information where each information terminal is arranged can be used. Examples of connection base information include place names such as prefectures, municipalities, postal codes, addresses, telephone numbers, company names, business office names, organization names, and specific numerical codes based on these. Corresponds to this.
When the base ID is used as the setting condition, terminals having the same base ID are grouped as one terminal group. This geographical base information is stored in advance in the storage unit of the server for each terminal.
The setting conditions may be fixedly determined in advance, but the system administrator or the user may be able to change the settings before starting the conference.
Further, although not described in this paper, it may be determined that the same base is used by using a specific numeric string portion in the IP address of the terminal, that is, domain and network segment information.

１つの端末群に属する代表端末は、サーバから共有情報を直接受信するが、その受信方法は、サーバからのストリーミング配信によるものとする。これにより、通常の会議システムにおける双方向通信を行う場合に比べ、伝送される情報量を少なくした効率的な片方向通信が可能となる。
また、代表端末と同じ端末群に属する参加端末に対しても、サーバから受信した共有情報を、同じように、ストリーミング配信をするようにする。
さらに、後述する第２実施例のように、代表端末の端末群には属さない他の情報端末へも、サーバから受信した共有情報を、ストリーミング配信するようにしてもよい。 The representative terminals belonging to one terminal group directly receive the shared information from the server, and the reception method is assumed to be streaming distribution from the server. This enables efficient one-way communication with a reduced amount of information to be transmitted, compared to the case of performing two-way communication in a normal conference system.
In addition, the shared information received from the server is also distributed in a streaming manner to participating terminals that belong to the same terminal group as the representative terminal.
Further, the shared information received from the server may be stream-distributed to other information terminals that do not belong to the terminal group of representative terminals as in the second embodiment described later.

（２）発言検知による端末状態の切り替え
利用者による発言要求があった場合、その利用者が利用している端末は、サーバと画像および音声等からなる情報の双方向通信が可能な状態に切り替える。
すでに発言要求をしている端末（既存発言端末と呼ぶ）が存在するときは、その既存発言端末を、発言要求のない端末に切り替え、その代わりに、新たに発言要求をした端末を、既存発言端末に切り替える。 (2) Switching the terminal state based on speech detection When a user makes a speech request, the terminal used by the user switches to a state in which bidirectional communication of information consisting of images, sounds, etc. is possible with the server. .
When there is a terminal that has already made a speech request (referred to as an existing speech terminal), the existing speech terminal is switched to a terminal that does not have a speech request, and instead, a terminal that has newly made a speech request is replaced with an existing speech terminal. Switch to the terminal.

たとえば、発言要求をした端末であって、サーバとの通信状態が双方向通信状態となっている第１の情報端末が存在する場合において、これとは異なる第２の情報端末の後述する発言検知部が新たな発言要求を検知した場合、第２の情報端末の後述する発言要求部が、その新たな発言要求を示す情報（発言要求情報）をサーバへ送信する。
サーバがこの発言要求情報を受信した場合、後述するサーバの双方向通信制御部が、第１の情報端末との双方向通信状態を解除し、新たな発言要求をした第２の情報端末とサーバとの間の通信状態を、双方向通信状態に切り替える。 For example, in the case where there is a first information terminal that has made a request for speech and the communication state with the server is a two-way communication state, speech detection of a second information terminal that is different from this is described later. When the unit detects a new message request, a message request unit (to be described later) of the second information terminal transmits information indicating the new message request (message request information) to the server.
When the server receives this message request information, the server, which will be described later, cancels the two-way communication state with the first information terminal, and the second information terminal and the server requesting a new message The communication state between is switched to the two-way communication state.

ここで、発言要求をして発言権を持った既存発言端末は、システム全体の中では、常に１台のみとする。また、新たに既存発言端末となった端末は、サーバと、画像および音声等の情報の双方向通信を行い、既存発言端末でなくなった端末は、サーバからのストリーミング配信を受けるか、あるいは、代表端末からのストリーミング配信を受けるようにする。 Here, it is assumed that there is always only one existing speech terminal that has a right to make a speech request in the entire system. In addition, a terminal that has newly become an existing speech terminal performs two-way communication of information such as images and sounds with the server, and a terminal that is no longer an existing speech terminal receives streaming distribution from the server or is represented Receive streaming distribution from the terminal.

また、利用者による発言要求の検知が、ほぼ同時に複数の情報端末の発言検知部で行われる場合もある。この場合、サーバの双方向通信制御部は、その複数の情報端末からそれぞれ送信されてきた発言要求情報のうち、最も早く受信した発言要求情報を送信してきた情報端末との通信状態を、双方向通信状態に切り替えるようにすればよい。
あるいは、受信した複数の発言要求情報の中に、利用者が発言要求をした時刻情報が含まれている場合、それぞれの発言要求情報を確認し、最も早い時刻に利用者の発言要求が検知されたと判断される情報端末との通信状態を、双方向通信状態に切り替えるようにしてもよい。 Moreover, the detection of the speech request by the user may be performed almost simultaneously by the speech detection units of a plurality of information terminals. In this case, the bidirectional communication control unit of the server indicates the communication state with the information terminal that has transmitted the speech request information received earliest among the speech request information respectively transmitted from the plurality of information terminals. What is necessary is just to switch to a communication state.
Alternatively, when the time information at which the user requests to speak is included in the plurality of received speech request information, the respective speech request information is confirmed, and the user's speech request is detected at the earliest time. The communication state with the information terminal determined to have been switched may be switched to the bidirectional communication state.

＜端末の動作モードの説明＞
端末の動作状態のモード（動作モード、あるいは単にモードとも呼ぶ）として、主として、次の４つのモードを設けるものとする。
（ａ）参加端末モード（Ｍ１）
（ｂ）代表端末モード（Ｍ２）
（ｃ）発言要求端末モード（Ｍ３）
（ｄ）発表端末モード（Ｍ４） <Description of terminal operation mode>
Assume that the following four modes are mainly provided as modes of operation states of terminals (also referred to as operation modes or simply modes).
(A) Participating terminal mode (M1)
(B) Representative terminal mode (M2)
(C) Speech request terminal mode (M3)
(D) Presentation terminal mode (M4)

参加端末モード（Ｍ１）とは、ネットワークに接続され、サーバあるいは代表端末モードの端末（代表端末と呼ぶ）から、画像および音声等からなる共有情報のストリーミング配信を受け、自らはサーバ等に対して画像および音声の送信を行わない動作状態を意味する。
したがって、参加端末モードの端末（参加端末と呼ぶ）は、サーバあるいは代表端末との間で、共有情報のストリーミング受信による片方向通信のみを行うことができる。
ただし、後述するように、この参加端末において、利用者が発言をすれば、発言要求端末モード（Ｍ３）へ切り替えられ、サーバと、画像および音声の双方向通信を行えるようになる。 Participating terminal mode (M1) is connected to a network and receives streaming distribution of shared information consisting of images and sounds from a server or a terminal in representative terminal mode (referred to as a representative terminal). This means an operating state where image and sound are not transmitted.
Therefore, a terminal in the participating terminal mode (referred to as a participating terminal) can perform only one-way communication by streaming reception of shared information with a server or a representative terminal.
However, as will be described later, in this participating terminal, when the user speaks, the user is switched to the speech requesting terminal mode (M3), and bidirectional communication of images and sounds with the server can be performed.

代表端末モード（Ｍ２）とは、ネットワークに接続され、サーバから画像および音声等からなる共有情報のストリーミング配信を受け、かつ、同一グループ内の他の参加端末、あるいは同一グループ以外の他の端末に対して、共有情報のストリーミング配信を行うことが可能な動作状態を意味する。
したがって、代表端末は、同一グループ内にただ１つ存在し、サーバに対して、直接ストリーミング受信による片方向通信を行う端末である。
同一グループに属する複数の端末は、たとえば、同一の拠点ＩＤを持つ端末に相当する。 The representative terminal mode (M2) is connected to a network, receives streaming distribution of shared information including images and sounds from a server, and is sent to other participating terminals in the same group or other terminals other than the same group. On the other hand, it means an operating state in which streaming distribution of shared information can be performed.
Therefore, there is only one representative terminal in the same group, and a terminal that performs one-way communication by direct streaming reception to the server.
A plurality of terminals belonging to the same group correspond to terminals having the same base ID, for example.

発言要求端末モード（Ｍ３）とは、ネットワークに接続され、サーバとの間で、画像および音声等からなる情報の双方向通信が可能な動作状態を意味する。
発言要求端末モードになれる端末は、サーバに発言要求をした後、サーバから発言を許可された端末である。
また、サーバと双方向通信状態となり利用者の発言が許可される端末は、同時には１つだけ存在するように、サーバによって管理される。この管理は、たとえば後述するように、発言要求端末フラグによって行えばよい。
発言要求端末モード（以下、発言端末モードとも呼ぶ）の状態にある端末を、発言要求端末あるいは発言端末と呼ぶ。
発言が許可されれば、上記した参加端末や代表端末も、発言端末となれる。 The message request terminal mode (M3) means an operating state that is connected to a network and capable of bidirectional communication of information including images and sounds with a server.
A terminal that can enter the speech request terminal mode is a terminal that is allowed to speak by the server after making a speech request to the server.
In addition, the server is managed by the server so that there is only one terminal that is in a bidirectional communication state with the server and is allowed to speak by the user. This management may be performed by a speech request terminal flag, as will be described later.
A terminal in the speech request terminal mode (hereinafter also referred to as speech terminal mode) is called a speech request terminal or speech terminal.
If speaking is permitted, the above-described participating terminals and representative terminals can also be speaking terminals.

発表端末モード（Ｍ４）とは、会議において、たとえば、プレゼンテーション等で、あるテーマについての発表を行う利用者もしくは議長等の端末の動作モードであり、その端末は、予めサーバに発表端末として設定されている。後述するように、サーバの記憶部に、「発表端末フラグ」がＯＮに設定されている端末が、発表端末である。
この発表端末は、原則としてサーバに接続されたとき、サーバに対して、画像やマイクにより入力された音声を送信のみする片方向通信を行う。このとき、サーバからの画像等の共有情報の受信は行わない。
また、送信される画像は、所定の規格に適合した方式で符号化（エンコード）された後、サーバに送信される。
また、この発表端末は、発言要求のあった発言端末がネットワークに接続された後は、サーバに対して、画像および音声等の情報による双方向通信を行うように切り替えられる。 The presentation terminal mode (M4) is an operation mode of a terminal of a user or a chairperson who makes a presentation about a certain theme in a meeting, for example, at a presentation, and the terminal is set in advance as a presentation terminal in the server. ing. As will be described later, a terminal whose “presentation terminal flag” is set to ON in the storage unit of the server is a presentation terminal.
When this announcement terminal is connected to the server, in principle, it performs one-way communication that only transmits images and voices input through a microphone to the server. At this time, shared information such as images from the server is not received.
Further, the image to be transmitted is encoded (encoded) by a method conforming to a predetermined standard, and then transmitted to the server.
In addition, this announcement terminal is switched to perform bidirectional communication with information such as images and sounds after the requesting terminal is connected to the network.

図２８に、これらの４つの動作モードと、サーバおよび他の端末との通信状態の関係の説明図を示す。
図２８において、参加端末モード（Ｍ１）の端末（参加端末）は、サーバからストリーミング配信を受ける片方向通信か、あるいは、他の端末からストリーミング配信を受ける片方向通信を行うことを示している。
サーバおよび他の端末のどちらからストリーミング配信を受けるかは、その参加端末の接続形態（グループ内の端末か、グループに属さない端末か）によって決定される。
たとえば、一つのグループ内に属する参加端末は、そのグループの代表端末からストリーミング配信を受ける。
また、どのグループにも属さない単一の参加端末は、原則として、サーバからストリーミング配信を受ける。 FIG. 28 shows an explanatory diagram of the relationship between these four operation modes and the communication state with the server and other terminals.
In FIG. 28, a terminal (participating terminal) in the participating terminal mode (M1) indicates performing one-way communication that receives streaming distribution from a server or one-way communication that receives streaming distribution from another terminal.
Whether to receive streaming distribution from the server or another terminal is determined depending on the connection form of the participating terminal (terminal in the group or terminal not belonging to the group).
For example, participating terminals belonging to one group receive streaming distribution from the representative terminal of the group.
A single participating terminal that does not belong to any group receives streaming distribution from the server in principle.

代表端末モード（Ｍ２）の端末は、サーバからのストリーミング配信を受け、さらに、同一拠点（同一グループ）内もしくは別拠点における他の端末へストリーミング配信を行う。
また、発言要求端末モード（Ｍ３）の端末は、サーバと双方向通信を行うことを示している。
さらに、発表端末モード（Ｍ４）の端末は、サーバに接続されたときは、サーバへデータ送信のみを行う片方向通信を行い、その後、他の端末で発言が行われるような会議進行時においては、サーバと双方向通信を行う。 The terminal in the representative terminal mode (M2) receives streaming distribution from the server, and further performs streaming distribution to other terminals in the same base (same group) or another base.
The terminal in the message request terminal mode (M3) indicates that bidirectional communication is performed with the server.
Furthermore, when the terminal in the presentation terminal mode (M4) is connected to the server, the terminal performs one-way communication that only transmits data to the server, and thereafter, at the time of the conference in which a speech is performed at another terminal. Two-way communication with the server.

以上のように、端末の動作状態を示す４つのモードは、それぞれサーバや他の端末との間の通信状態が異なる。この発明では、各端末が、サーバからの要求や発言検知等のイベントの発生に基づいて、その動作モードを切り替えることにより、ネットワークを流れるトラフィックをできるだけ少なくなるようにし、情報の伝送遅延を減少し、通信障害を減少させ、サーバにおける映像，音声のエンコード，デコード等にかかる処理の負荷をできるだけ少なくすることを特徴とする。 As described above, the four modes indicating the operation state of the terminal have different communication states with the server and other terminals. In the present invention, each terminal switches its operation mode based on the occurrence of an event such as a request from the server or detection of speech, thereby reducing traffic flowing through the network as much as possible and reducing information transmission delay. The present invention is characterized in that communication troubles are reduced and processing load on video / audio encoding / decoding in the server is reduced as much as possible.

以下に、この発明における端末の動作モードの切替処理や、発言要求時のサーバと端末間の通信状態の変更等の実施例について、説明する。
以下の実施例においては、１つのグループに属する複数の端末は、同一拠点に属する複数の端末であるとする。
複数の端末が同一拠点であるか否かは、サーバの記憶部に予め登録されるものとし、同一拠点の複数の端末は、すべて同一の拠点ＩＤを持つものとする。
また、１つのグループに存在する１つの代表端末は、そのグループ内で最初に電源が投入され、会議に必要なアプリケーションプログラムが起動された状態とする。
また、端末が、サーバへ自己の動作モードの動作状態通知を行い、さらに代表端末検出要求を行うことにより、サーバが、代表端末検出処理を行い代表端末を決定するものとする。代表端末であることは、サーバの記憶部に設定されるものとする。後述する代表端末フラグが、「ＯＮ」の場合、その端末が代表端末に設定されていることを示す。 Hereinafter, embodiments of the operation mode switching process of the terminal and the change of the communication state between the server and the terminal at the time of a speech request will be described.
In the following embodiments, a plurality of terminals belonging to one group are assumed to be a plurality of terminals belonging to the same base.
Whether or not a plurality of terminals are at the same site is registered in advance in the storage unit of the server, and a plurality of terminals at the same site all have the same site ID.
In addition, one representative terminal in one group is in a state where power is first turned on in the group and an application program necessary for the conference is activated.
In addition, the terminal notifies the server of the operation state of its own operation mode, and further issues a representative terminal detection request, so that the server performs representative terminal detection processing and determines the representative terminal. The representative terminal is set in the storage unit of the server. When a representative terminal flag described later is “ON”, it indicates that the terminal is set as the representative terminal.

また、あるグループの代表端末が決定した後、その同一グループに属する他の端末は参加端末として動作する。すなわち、同一グループに属し参加端末となった端末は、原則として同一グループの代表端末からのストリーミング配信を受ける。
ただし、代表端末を持たず、同一拠点ＩＤを持つ他の端末のなかった単一の参加端末は、原則として、自らが代表端末となり、サーバからストリーミング配信を受けるようにする。
また、そのような単一の参加端末は、他のグループに属する代表端末から、ストリーミング配信を受けてもよい。この場合、その代表端末は、同一グループに属する参加端末へのストリーミング配信と、そのグループに属さない単一の参加端末へのストリーミング配信とを行う。
なお、代表端末がネットワークから切断された場合、原則として、同一グループ内の他の参加端末のいずれかが、代表端末となるようにすればよい。 Further, after a representative terminal of a certain group is determined, other terminals belonging to the same group operate as participating terminals. In other words, terminals that belong to the same group and become participating terminals receive streaming distribution from the representative terminal of the same group in principle.
However, as a general rule, a single participating terminal that does not have a representative terminal and does not have other terminals having the same base ID is itself a representative terminal and receives streaming distribution from the server.
Moreover, such a single participating terminal may receive streaming distribution from a representative terminal belonging to another group. In this case, the representative terminal performs streaming distribution to participating terminals belonging to the same group and streaming distribution to a single participating terminal not belonging to the group.
When the representative terminal is disconnected from the network, in principle, any of the other participating terminals in the same group may be the representative terminal.

また、発言要求のあった端末は、サーバから許可を受けることにより発言端末となることができる。
上記したように、既存発言端末がなければ、発言要求のあった端末がそのまま発言端末となり、サーバと双方向通信を行う。一方、既存発言端末がすでに存在していた場合は、サーバからの切替要求により、既存発言端末を参加端末に切り替え、かつ新たに発言要求のあった端末を発言端末に切り替える。
また、代表端末における利用者が発言要求をした場合、その代表端末は発言端末となり、サーバと双方向通信を行うようになるが、他の参加端末へ行っていたストリーミング配信があれば、そのストリーミング配信もそのまま継続して行うものとする。
さらに、発言端末であった代表端末が、発言端末でなくなった場合、代表端末モードの端末に戻り、サーバとの通信状態は、双方向通信状態から、ストリーミング配信を受ける片方向通信状態へ切り替えられる。 Also, a terminal that has made a speech request can become a speech terminal by receiving permission from the server.
As described above, if there is no existing speech terminal, the terminal that made the speech request becomes the speech terminal as it is and performs bidirectional communication with the server. On the other hand, when the existing speech terminal already exists, the existing speech terminal is switched to the participating terminal and the terminal that newly requested the speech is switched to the speech terminal in response to a switching request from the server.
In addition, when a user at the representative terminal makes a speech request, the representative terminal becomes a speech terminal and performs two-way communication with the server. However, if there is streaming distribution that has been performed to other participating terminals, the streaming is performed. Distribution is also continued as it is.
Further, when the representative terminal that is the speaking terminal is no longer the speaking terminal, the terminal returns to the terminal in the representative terminal mode, and the communication state with the server is switched from the two-way communication state to the one-way communication state that receives streaming distribution. .

＜通信状態の切替えの説明＞
ここで、双方向通信から片方向通信への通信状態の切替えは、主として、次のような２つの方法のうち、いずれかの方法が考えられる。
（１）通信プロトコルを固定したまま、所定の通信シーケンスは継続するが、発言要求のない端末（参加端末もしくは代表端末）から送信されてきた映像および音声についての一連の信号処理を中止する。具体的には、サーバにおいて、その端末についての映像復号化処理（デコード）と、音声復号化処理（デコード）を停止する。
この場合、たとえば通信プロトコルとして、ＳＩＰやＨ．３２３、ＨＴＴＰ等の規格が採用されていたとすれば、その通信プロトコルを利用した通信シーケンスは継続して行う。この方法では、サーバ端末間にて、ストリーミング配信のための通信プロトコルを別途用意する必要がなく、従来の通信プロトコルをそのまま利用できるという利点がある。 <Description of switching of communication status>
Here, the switching of the communication state from the bidirectional communication to the unidirectional communication is mainly considered to be one of the following two methods.
(1) A predetermined communication sequence is continued with the communication protocol fixed, but a series of signal processing for video and audio transmitted from a terminal (participating terminal or representative terminal) without a request for speech is stopped. Specifically, the server stops the video decoding process (decoding) and the audio decoding process (decoding) for the terminal.
In this case, for example, SIP or H.264 is used as a communication protocol. If a standard such as H.323 or HTTP is adopted, a communication sequence using the communication protocol is continuously performed. This method has an advantage that it is not necessary to separately prepare a communication protocol for streaming delivery between server terminals, and a conventional communication protocol can be used as it is.

（２）通信プロトコルを、ストリーミング配信のプロトコルに変更する。
たとえば、ＳＩＰやＨ．３２３、ＨＴＴＰ等の通信プロトコルを中止し、映像、音声に関するストリーミング配信の通信プロトコルに変更する。ストリーミング配信とは、サーバから端末へ、一方向的に映像、音声等のデータを送信する通信方式である。
ストリーミング配信としては、所定の規格（ｍｍｓ，ｈｔｔｐ，ｒｔｍｐ，ｒｔｓｐ等）のいずれかを採用することが望ましい。 (2) The communication protocol is changed to a streaming distribution protocol.
For example, SIP and H.264. 323, the communication protocol such as HTTP is stopped, and the communication protocol is changed to the streaming distribution related to video and audio. Streaming distribution is a communication method in which data such as video and audio is unidirectionally transmitted from a server to a terminal.
For streaming delivery, it is desirable to employ any one of predetermined standards (mms, http, rtmp, rtsp, etc.).

上記いずれの方法も、端末とサーバ間で所定の通信プロトコルに基づいて、データの送信と受信が行われるが、サーバから端末に対して、一方向的に、映像および音声の送信が行われるものであるので、通信回線を流れるトラフィックを減少させることができる。したがって、データ伝送の遅延を減少させ、通信エラー等の通信障害も削減させることができ、その結果実質的にサーバに接続された端末の数を減少させることができるので、円滑な会議の進行が可能となる。
また、逆に、片方向通信から双方向通信への切り替えは、上記（１）または（２）とは逆の処理を行えばよい。 In any of the above methods, data is transmitted and received between the terminal and the server based on a predetermined communication protocol, but video and audio are transmitted unidirectionally from the server to the terminal. Therefore, the traffic flowing through the communication line can be reduced. Therefore, the delay in data transmission can be reduced, and communication failures such as communication errors can be reduced. As a result, the number of terminals connected to the server can be substantially reduced. It becomes possible.
On the contrary, the switching from the one-way communication to the two-way communication may be performed in the reverse process of (1) or (2).

＜発言要求検知の説明＞
この発明では、利用者の発言要求を検知することにより、サーバと端末との通信状態を切り替える。
一般に、参加者の発言権を管理する会議システムでは、端末利用者（参加者）が発言権を得るためには、発言要求および、要求に対する承認・許可をする方式が用いられる。発言要求をする方法としては、たとえば、次のようなものが考えられる。
（１）「発言ボタン」の押下げ（入力操作）による要求
（２）「音声」入力による要求
（３）「画像認識」による要求 <Explanation of speech request detection>
In this invention, the communication state between the server and the terminal is switched by detecting a user's request for speech.
In general, in a conference system that manages a participant's speaking right, a terminal user (participant) obtains a speaking right and uses a method for requesting a speech and approving / permitting the request. As a method for requesting a speech, for example, the following can be considered.
(1) Request by pressing (speech button) (input operation) (2) Request by inputting "voice" (3) Request by "image recognition"

ここで、（１）の場合、たとえば、キーボード等に設けられた入力キーのうち、いずれかのキーを発言要求用のコマンドキーとして割り当てればよい。
このコマンドキーを押下げた場合、発言要求の入力操作がされたと判断し、発言要求をサーバへ送信する。 Here, in the case of (1), for example, one of the input keys provided on the keyboard or the like may be assigned as a command key for requesting speech.
When this command key is pressed, it is determined that an input operation for a speech request has been performed, and the speech request is transmitted to the server.

また、（２）の場合、たとえば、利用者がマイクに向かって発声した音声の検知、もしくは、発声した音声から、発言要求に対応付けられた特定の単語を音声認識にて検知してもよい。
これらの入力音声もしくは音声認識結果から、発言要求されたことが検出されれば、発言要求をサーバへ送信する。
また、（３）の場合、発言要求として予め対応付けられた動作、たとえば、利用者が挙手するような動作や手を振るような動作を、カメラに向かって行うようにしてもよい。
このとき、カメラから入力された画像の認識を行い、挙手するような動作が、所定の発言要求の動作に一致するような動作であることが検出されれば、発言要求をサーバへ送信する。 In the case of (2), for example, a voice uttered by the user toward the microphone may be detected, or a specific word associated with the utterance request may be detected by voice recognition from the uttered voice. .
If it is detected from these input speeches or speech recognition results that a speech request is made, the speech request is transmitted to the server.
In the case of (3), an operation associated in advance as a speech request, for example, an operation of raising a user's hand or an operation of waving may be performed toward the camera.
At this time, if the image input from the camera is recognized and it is detected that the operation of raising the hand matches the operation of the predetermined speech request, the speech request is transmitted to the server.

＜情報端末の構成＞
図２に、本発明の情報端末の構成ブロック図を示す。
情報端末２（ＴＥ）は、主として、入力部１１，映像入力部２１，表示部３１，音声入力部４１，音声出力部４５，通信部５１，認証部５５，記憶部６１，信号処理部７１，および制御部８０から構成される。 <Configuration of information terminal>
FIG. 2 shows a configuration block diagram of the information terminal of the present invention.
The information terminal 2 (TE) mainly includes an input unit 11, a video input unit 21, a display unit 31, a voice input unit 41, a voice output unit 45, a communication unit 51, an authentication unit 55, a storage unit 61, a signal processing unit 71, And a control unit 80.

入力部１１は、キーボード１２，マウス１３，タブレット１４などのポインティングデバイスに加え、人検知センサ１５を備え、これらの入力を受信し、制御部８０へ送信するための信号に変換する入力処理部１６とを備える。
人検知センサ１５は、たとえば、熱感知センサ，赤外線センサ，超音波センサなどを用いることができる。人検知センサの代わりに、後述するカメラ２２やマイク４２の入力信号を元に端末の前に人がいるかどうかを検知するようにしても良い。
入力部１１からは、主として利用者の操作による入力信号が入力され、この入力信号が、発言要求情報の生成に用いられる。たとえば、発言要求に対応付けられたコマンドキーの押下操作が行われ、対応する入力信号が入力された場合、その情報端末の利用者が発言要求をしたことを示す発言要求情報が生成される。 The input unit 11 includes a human detection sensor 15 in addition to a pointing device such as a keyboard 12, a mouse 13, and a tablet 14. The input processing unit 16 receives these inputs and converts them into signals for transmission to the control unit 80. With.
As the human detection sensor 15, for example, a heat detection sensor, an infrared sensor, an ultrasonic sensor, or the like can be used. Instead of the human detection sensor, it may be detected whether there is a person in front of the terminal based on input signals from a camera 22 and a microphone 42 described later.
From the input unit 11, an input signal mainly by a user's operation is input, and this input signal is used for generating the speech request information. For example, when a command key associated with a speech request is pressed and a corresponding input signal is input, speech request information indicating that the user of the information terminal has made a speech request is generated.

映像入力部２１は、カメラにより撮影された映像を入力する部分であり、たとえば、端末の近傍の映像や、端末の前にいる会議に参加している者（利用者）の映像を入力する部分であり、カメラ２２と、カメラで撮影された映像をデジタル信号に変換する映像処理部２３とを備える。入力された映像６６は、記憶部６１に一時記憶される。この入力映像６６も、上記した発言要求情報の生成に用いられる。 The video input unit 21 is a part for inputting video captured by the camera. For example, a part for inputting video near the terminal or video of a person (user) participating in a meeting in front of the terminal. And a camera 22 and a video processing unit 23 for converting a video shot by the camera into a digital signal. The input video 66 is temporarily stored in the storage unit 61. This input video 66 is also used to generate the above-mentioned speech request information.

表示部３１は、利用者の自画像を表示したり、会議に参加している他者の人物画像等を表示する部分であり、ＬＣＤ，ＣＲＴなどのディスプレイ３２と、ディスプレイ３２へ表示させる表示信号を生成する表示処理部３３とを備える。 The display unit 31 is a part that displays a user's own image or a person image of another person participating in the conference, and displays a display 32 such as an LCD or CRT and a display signal to be displayed on the display 32. And a display processing unit 33 to be generated.

音声入力部４１は、マイクから端末ＴＥの近傍の音声を入力する部分であり、マイク４２と、マイクから入力された音声をデジタル信号に変換する入力音声処理部４３とを備える。入力された音声６７は、記憶部６１に一時記憶される。
この入力音声６７も、上記した発言要求情報の生成に用いられる。 The sound input unit 41 is a part that inputs sound near the terminal TE from a microphone, and includes a microphone 42 and an input sound processing unit 43 that converts sound input from the microphone into a digital signal. The input voice 67 is temporarily stored in the storage unit 61.
This input voice 67 is also used to generate the above-described speech request information.

音声出力部４５は、スピーカ４６，出力音声処理部４７とを備え、たとえば、サーバから送られてきた音声を、スピーカから出力する部分である。 The audio output unit 45 includes a speaker 46 and an output audio processing unit 47. For example, the audio output unit 45 is a part that outputs audio transmitted from the server from the speaker.

通信部５１は、サーバあるいは他の端末ＴＥとの間で、データ送信やデータ受信を行う部分であり、種々のネットワークを介して所定の通信プロトコル（例えば、Ｈ．３２３、ＳＩＰ、ＨＴＴＰ等）でデータ通信ができるように、データの通信制御を行う。また、会議システムでは、音声および映像の通信を行うが、必要に応じてデータの秘匿化や、データ量の削減のために、データの暗号化処理（例えば、ＡＥＳ：ＡｄｖａｎｃｅｄＥｎｃｒｙｐｔｉｏｎＳｔａｎｄａｒｄ、３ＤＥＳ：Ｔｒｉｐｌｅ−ＤａｔａＥｎｃｒｙｐｔｉｏｎＳｔａｎｄａｒｄ等）、データの圧縮処理（符号化もしくはエンコードとも呼ぶ。例えば、映像であればＭＰＥＧ４、Ｈ．２６３、Ｈ．２６４、音声であればＧ．７１１、Ｇ．７２２、Ｇ．７２９等）を行う。
したがって、通信部５１は、通信暗号部５２および通信復号部５３を備えることが好ましい。
また、通信部５１は、端末モード切替部８２によって変更された後の動作状態に従って、サーバとの間の通信状態を変更する。
たとえば、映像および音声を双方向に送受信する双方向通信状態や、映像および音声をサーバから一方向的に受信する片方向通信状態に変更する。 The communication unit 51 is a part that performs data transmission and data reception with a server or another terminal TE, and uses a predetermined communication protocol (for example, H.323, SIP, HTTP, etc.) via various networks. Data communication control is performed so that data communication is possible. In the conference system, voice and video communications are performed. Data encryption processing (for example, AES: Advanced Encryption Standard, 3DES: Triple) is performed to conceal data and reduce the amount of data as necessary. -Data Encryption Standard, etc.), data compression processing (also called encoding or encoding), for example, MPEG4, H.263, H.264 for video, G.711, G.722, G.729 for audio. Etc.).
Therefore, the communication unit 51 preferably includes a communication encryption unit 52 and a communication decryption unit 53.
In addition, the communication unit 51 changes the communication state with the server according to the operation state after being changed by the terminal mode switching unit 82.
For example, the state is changed to a bidirectional communication state in which video and audio are transmitted and received bidirectionally, and a one-way communication state in which video and audio are received unidirectionally from the server.

認証部５５は、利用者の特定や端末および会議システムへのログイン認証を行う部分であり、主として事前に登録・管理されているＩＤ，パスワード入力により、認証を行う。 The authentication unit 55 is a part that performs user authentication and login authentication to the terminal and the conference system, and performs authentication mainly by inputting an ID and a password registered and managed in advance.

記憶部６１は、各種の入力情報（入力映像６６，入力音声６７など），表示情報，出力情報，送受信データ，信号処理されたデータなどの情報を記憶する部分であり、音声認識や画像認識などを行うために予め格納されている認識データベース６２を備える。
認識データベース６２は、主として音声認識を行うための利用者の音声特徴データと、画像認識を行うための画像特徴データとから構成される。端末ＴＥを複数の利用者が利用する場合、その利用者ごとに、その認識情報を格納しておき、利用者の人物認識をしてもよい。本稿には図示しないが、認識データベース６２には、利用者ごとのパスワード情報（データ自体は暗号化されていることが望ましい）を格納しても良い。
記憶部６１には、ＲＯＭ，ＲＡＭなどの半導体素子，ハードディスク，ＣＤ，ＤＶＤ，ＵＳＢメモリなどの記憶媒体などが用いられる。
また、記憶部６１には、端末ＴＥの各機能を実行させるためのプログラムも記憶される。 The storage unit 61 is a part that stores information such as various types of input information (input video 66, input voice 67, etc.), display information, output information, transmission / reception data, signal processed data, and the like, such as voice recognition and image recognition. A recognition database 62 stored in advance is provided.
The recognition database 62 mainly includes user's voice feature data for performing voice recognition and image feature data for performing image recognition. When a plurality of users use the terminal TE, the recognition information may be stored for each user to recognize the person of the user. Although not shown in this paper, the recognition database 62 may store password information for each user (the data itself is preferably encrypted).
As the storage unit 61, a semiconductor element such as a ROM or a RAM, a storage medium such as a hard disk, a CD, a DVD, or a USB memory is used.
The storage unit 61 also stores a program for executing each function of the terminal TE.

信号処理部７１は、入力された映像および音声を、サーバに対して送信するデータに変換（符号化処理，エンコード処理とも呼ぶ）したり、サーバ等から送信されてきた映像および音声を、ディスプレイに表示およびスピーカから出力できるようにもとのデータに変換（復号化処理，デコード処理とも呼ぶ）したりする部分である。
主として、映像符号化部７２，映像復号化部７３，音声符号化部７４，音声復号化部７５から構成される。
信号処理部７１には、たとえば、所定の規格のコーデックに適合した符号化および復号化を実行することのできるコーデックＬＳＩが用いられる。符号化および復号化処理はＬＳＩのようなハードウェアコーデックではなく、ソフトウェアコーデックによって実現してもよい。 The signal processing unit 71 converts input video and audio into data to be transmitted to the server (also referred to as encoding processing and encoding processing), and displays video and audio transmitted from the server or the like on a display. It is a part that converts (also called decoding processing or decoding processing) into original data that can be output from the display and speaker.
It mainly comprises a video encoding unit 72, a video decoding unit 73, an audio encoding unit 74, and an audio decoding unit 75.
For the signal processing unit 71, for example, a codec LSI capable of executing encoding and decoding conforming to a codec of a predetermined standard is used. Encoding and decoding processing may be realized by a software codec instead of a hardware codec such as an LSI.

制御部８０は、ＣＰＵ，ＲＯＭ，ＲＡＭ，Ｉ／Ｏコントローラ，タイマー等を含むマイクロコンピュータに相当する部分であり、ＣＰＵが、記憶部６１に記憶されたプログラムをＲＡＭ（主記憶）に読み出すことにより、各種ハードウェアを有機的に動作させて、端末の各機能を実行させる。
図２では、制御部８０の行う機能として、端末状態通知部８１，端末モード切替部８２等を示しているが、これらの機能ブロック（８１〜８９）の各機能は、ＣＰＵが、その機能ごとのプログラムに基づいて、入力部１１、映像入力部２１、音声入力部４１などから得られた情報を用いて、それぞれ所定の処理を実行することにより実現される。 The control unit 80 is a part corresponding to a microcomputer including a CPU, a ROM, a RAM, an I / O controller, a timer, and the like. When the CPU reads a program stored in the storage unit 61 into a RAM (main memory) The various functions of the terminal are executed by organically operating various hardware.
In FIG. 2, the terminal state notification unit 81, the terminal mode switching unit 82, and the like are shown as the functions performed by the control unit 80. This is realized by executing predetermined processes using information obtained from the input unit 11, the video input unit 21, the audio input unit 41, and the like based on the program.

端末状態通知部８１は、情報端末の動作状態を、サーバへ送信する部分である。たとえば上記した４つのモード（Ｍ１，Ｍ２，Ｍ３，Ｍ４）のうちいずれかの状態であることを示す状態情報を生成し、通信部５１を介して、サーバに送信する。
たとえば、発言要求が検知された場合、端末が発言要求端末モードとなったことを示す状態情報を、サーバに送信する。 The terminal state notification unit 81 is a part that transmits the operation state of the information terminal to the server. For example, state information indicating that one of the four modes (M1, M2, M3, M4) is in the state is generated and transmitted to the server via the communication unit 51.
For example, when a speech request is detected, status information indicating that the terminal has entered the speech request terminal mode is transmitted to the server.

端末モード切替部８２は、サーバからの通知や、あるいは自己のハードウェア等の状態によって、自己の端末の動作状態を変更する部分であり、たとえば上記４つのモードのうちいずれかに変更する部分である。
たとえば、発言要求があり、発言が許可された場合は、その端末の現在の動作状態を、発言要求端末モードに変更する。この動作状態の変更により、サーバとの間の通信を双方向通信に切替える。 The terminal mode switching unit 82 is a part that changes the operation state of its own terminal according to the notification from the server or the state of its own hardware, for example, a part that changes to one of the above four modes. is there.
For example, when there is a speech request and speech is permitted, the current operating state of the terminal is changed to the speech request terminal mode. By changing the operation state, communication with the server is switched to bidirectional communication.

また、双方向通信制御部８９は、予め定められた通信プロトコルに従って、通信部５１を介して入力映像および入力音声を、サーバへ送信し、かつサーバから送られてくる映像および音声等からなる共有情報を、受信する部分である。
代表端末検出要求部８３は、サーバに対して、同一拠点内の代表端末を検出するように要求する部分であり、検出された代表端末の情報をサーバから受信する。
発言検知部８４は、上記したようないずれかの方法により、端末の利用者が発言要求をしたことを検知する部分である。すなわち、発言要求を意味する入力信号，入力音声，または入力映像のいずれか１つ以上を検出した場合に、利用者が発言要求をしたと判断する。
発言要求部８５は、検知された発言要求をしたことを示す発言要求情報を、サーバに送信する部分である。 In addition, the bidirectional communication control unit 89 transmits the input video and input audio to the server via the communication unit 51 according to a predetermined communication protocol, and is shared by the video and audio transmitted from the server. This is the part that receives information.
The representative terminal detection request unit 83 is a part that requests the server to detect a representative terminal in the same base, and receives information on the detected representative terminal from the server.
The speech detection unit 84 is a portion that detects that the user of the terminal has made a speech request by any of the methods described above. That is, when any one or more of an input signal, an input voice, and an input video that mean a speech request is detected, it is determined that the user has requested the speech.
The speech request unit 85 is a part that transmits speech request information indicating that a detected speech request has been made to the server.

ストリーミング要求部８６は、代表端末に対して、ストリーミング情報（音声および映像）をその端末に配信するように要求する部分である。
ストリーミング受信部８７は、サーバあるいは代表端末から配信されたストリーミング情報を受信する部分である。 The streaming request unit 86 is a part that requests the representative terminal to distribute streaming information (audio and video) to the terminal.
The streaming receiving unit 87 is a part that receives streaming information distributed from a server or a representative terminal.

ストリーミング配信部８８は、端末が代表端末である場合に、他の端末に対してストリーミング情報を送信する部分である。
ここで、ストリーミング配信とは、ストリーミング情報である映像や音声を一方向に送信することを意味する。
ストリーミング配信される情報端末は、自端末の映像、音声データの符号化や送信を行なわないので、双方向通信を行う場合よりも、端末にかかる負荷は少なくなる。 The streaming delivery unit 88 is a part that transmits streaming information to other terminals when the terminal is a representative terminal.
Here, streaming distribution means that video and audio, which are streaming information, are transmitted in one direction.
An information terminal that is distributed by streaming does not encode or transmit the video and audio data of the terminal itself, so the load on the terminal is less than that in the case of performing bidirectional communication.

以上が、端末の主要な機能ブロックであるが、制御部８０によって実現される機能は、これに限るものではない。
これらの機能は、主としてソフトウェアによって実現されるものであるので、必要に応じて他の機能を実行するプログラムを記憶部６１に格納し、そのプログラムに基づいてＣＰＵがハードウェアを有機的に動作させることにより、その機能を実行させてもよい。
また、プログラムは、予めＲＯＭ等に固定記憶して提供してもよいが、その後に削除，追加，変更，アップデート等をするために、ハードディスク等の書きかえ可能な記憶装置に記憶してもよく、ＣＤやＤＶＤなどの記録媒体によって提供してもよく、あるいはネットワークを介してサーバ等からダウンロードするようにしてもよい。 The above is the main functional block of the terminal, but the function realized by the control unit 80 is not limited to this.
Since these functions are mainly realized by software, a program for executing other functions is stored in the storage unit 61 as necessary, and the CPU organically operates the hardware based on the program. The function may be executed.
In addition, the program may be provided in a fixed storage in a ROM or the like in advance, but may be stored in a rewritable storage device such as a hard disk for subsequent deletion, addition, change, update, etc. The program may be provided by a recording medium such as a CD or a DVD, or may be downloaded from a server or the like via a network.

＜サーバの構成＞
図３に、本発明のサーバの構成ブロック図を示す。
サーバ１は、主として、入力部１０１，表示部１１１，通信部１２１，認証部１２５，ＭＣＵ部１３１，記憶部１７１，信号処理部１５１，および制御部１６１とから構成される。 <Server configuration>
FIG. 3 shows a configuration block diagram of the server of the present invention.
The server 1 mainly includes an input unit 101, a display unit 111, a communication unit 121, an authentication unit 125, an MCU unit 131, a storage unit 171, a signal processing unit 151, and a control unit 161.

入力部１０１は、各種情報を入力する部分であり、キーボード１０２，マウス１０３，タブレット１０４およびこれらの入力デバイスからの入力を、制御部１６１へ送信することのできる信号に変換する入力処理部１０５とを備える。
表示部１１１は、ＬＣＤ，ＣＲＴなどのディスプレイ１１２と、ディスプレイに表示する表示信号を生成する表示処理部１１３とからなる。
ただし、サーバ１を完全に無人で運用する場合は、入力部や表示部はなくてもよい。 The input unit 101 is a part for inputting various types of information. The input unit 105 converts input from the keyboard 102, mouse 103, tablet 104, and these input devices into signals that can be transmitted to the control unit 161, and Is provided.
The display unit 111 includes a display 112 such as an LCD or a CRT, and a display processing unit 113 that generates a display signal to be displayed on the display.
However, when the server 1 is operated completely unattended, the input unit and the display unit may not be provided.

通信部１２１は、ネットワークを介して、端末ＴＥとデータの送受信をする部分であり、端末と同様に、所定の通信プロトコル（例えば、Ｈ．３２３、ＳＩＰ、ＨＴＴＰ等）でデータ通信し、映像および音声を所定のコーデック（例えば、映像であればＭＰＥＧ４、Ｈ．２６３、Ｈ．２６４、音声であればＧ．７１１、Ｇ．７２２、Ｇ．７２９等）で符号化し、所定の規格（例えば、ＡＥＳ：ＡｄｖａｎｃｅｄＥｎｃｒｙｐｔｉｏｎＳｔａｎｄａｒｄ、３ＤＥＳ：Ｔｒｉｐｌｅ−ＤａｔａＥｎｃｒｙｐｔｉｏｎＳｔａｎｄａｒｄ等）で暗号化処理をする通信暗号部１２２と通信復号部１２３とを備える。
認証部１２５は、端末ＴＥと同様に、利用者の特定や端末および会議システムへのログイン認証を行う部分である。 The communication unit 121 is a part that transmits / receives data to / from the terminal TE via the network, and performs data communication with a predetermined communication protocol (for example, H.323, SIP, HTTP, etc.), video, Audio is encoded with a predetermined codec (for example, MPEG4, H.263, H.264 for video, G.711, G.722, G.729 for audio, etc.), and a predetermined standard (for example, AES) : Advanced Encryption Standard, 3DES: Triple-Data Encryption Standard, etc.).
Similar to the terminal TE, the authentication unit 125 is a part that performs user authentication and login authentication to the terminal and the conference system.

ＭＣＵ部１３１は、いわゆる多地点接続装置（ＭＣＵ）で行われている処理を実行する部分であり、主として、複数の映像信号を合成する映像ミキシング部１３２，映像のサイズを変更する映像リサイズ部１３３，複数の音声信号を合成する音声ミキシング部１３４とから構成される。ＭＣＵ部１３１は信号処理部１５１と共通で構成されていてもよい。 The MCU unit 131 is a part that executes processing performed in a so-called multipoint connection unit (MCU), and mainly includes a video mixing unit 132 that combines a plurality of video signals, and a video resizing unit 133 that changes the size of the video. , And a voice mixing unit 134 that synthesizes a plurality of voice signals. The MCU unit 131 may be configured in common with the signal processing unit 151.

記憶部１７１は、サーバで実行される処理で利用される各種データを記憶する部分であり、さらに、サーバに接続される端末を特定する情報（装置ＩＤ１７３，拠点ＩＤ１７４，ＩＰアドレス１７５，発表端末フラグ１７６，代表端末フラグ１７７，発言要求端末フラグ１７８）を格納した端末データベース１７２や、端末を利用する利用者を特定する情報（ユーザＩＤ１７９，拠点ＩＤ１７４，ＩＰアドレス１７５，メールアドレス１８０）を格納したユーザデータベース１８１を記憶する部分である。
この端末データベース１７２と、ユーザデータベース１８１は、サーバと端末との間で、双方向通信するとき、サーバから端末へ大容量の音声映像等の情報を一方向送信するとき、あるいは、代表端末の検出処理，既存発言端末検出処理等で利用される。
この他に、サーバの各機能を実行するのに必要なプログラムも、記憶部１７１に記憶される。
記憶部１７１には、ＲＯＭ，ＲＡＭ，ハードディスク，ＣＤやＤＶＤなどの記録媒体などが用いられる。 The storage unit 171 is a part that stores various data used in processing executed by the server, and further specifies information (device ID 173, base ID 174, IP address 175, announcement terminal flag) that is connected to the server. 176, representative terminal flag 177, speech request terminal flag 178) and a user database storing information (user ID 179, site ID 174, IP address 175, mail address 180) for identifying a user who uses the terminal. This is the part that stores the database 181.
The terminal database 172 and the user database 181 are used for two-way communication between the server and the terminal, for transmitting one-way information such as a large volume of audio / video from the server to the terminal, or for detecting the representative terminal. Used in processing, existing speech terminal detection processing, etc.
In addition to this, programs necessary for executing each function of the server are also stored in the storage unit 171.
As the storage unit 171, a ROM, a RAM, a hard disk, a recording medium such as a CD or a DVD, or the like is used.

信号処理部１５１は、端末から送られてきた映像および音声を、一旦もとの信号に戻す変換処理（復号化処理，デコード処理）をしたり、複数の端末へ映像や音声を送信するためのデータ変換処理（符号化処理，エンコード処理）をする部分である。
この信号処理部１５１は、端末ＴＥと同様に、主として映像符号化部１５２，映像復号化部１５３，音声符号化部１５４，音声復号化部１５５から構成され、コーデックＬＳＩもしくはソフトウェアコーデックを用いて実現することができる。 The signal processing unit 151 performs conversion processing (decoding processing, decoding processing) for temporarily returning the video and audio sent from the terminal to the original signal, and transmits video and audio to a plurality of terminals. This is the part that performs data conversion processing (encoding processing, encoding processing).
Similar to the terminal TE, the signal processing unit 151 mainly includes a video encoding unit 152, a video decoding unit 153, an audio encoding unit 154, and an audio decoding unit 155, and is realized using a codec LSI or a software codec. can do.

制御部１６１は、ＣＰＵ，ＲＯＭ，ＲＡＭ，Ｉ／Ｏコントローラ，タイマー等からなるマイクロコンピュータに相当する部分であり、ＣＰＵが、記憶部１７１に記憶されたプログラムに基づいて、各種ハードウェアを有機的に動作させることにより、サーバの各機能が実現される。
図３では、制御部１６１の機能として、端末状態判断部１６２と、通信方法切替部１６３と、代表端末検出部１６４と、既存発言端末検出部１６５と、ストリーミング配信部１６６と、双方向通信制御部１６７とを示している。 The control unit 161 is a part corresponding to a microcomputer including a CPU, a ROM, a RAM, an I / O controller, a timer, and the like. The CPU performs various hardware operations based on a program stored in the storage unit 171. Each function of the server is realized by operating the server.
In FIG. 3, as a function of the control unit 161, a terminal state determination unit 162, a communication method switching unit 163, a representative terminal detection unit 164, an existing speech terminal detection unit 165, a streaming distribution unit 166, and bidirectional communication control Part 167.

端末状態判断部１６２は、主として、端末から送られてくる動作状態のモードを示す状態情報を受信して、その端末が現在どのモードになっているかを判断する部分である。
通信方法切替部１６３は、情報端末の端末状態通知部によって送信されてきた端末の動作状態に基づいて、その情報端末に対する通信方法を切り替える部分である。通信方法としては、主として、双方向通信状態、画像音声等の送信をサーバから端末への方向のみとする片方向通信（ストリーミング配信）状態、データ通信の中止状態などがある。
また、サーバと情報端末との間の通信状態は、情報端末の動作状態に従って、各情報端末ごとに変更させられる。 The terminal state determination unit 162 is a part that mainly receives state information indicating the mode of the operation state sent from the terminal and determines which mode the terminal is currently in.
The communication method switching unit 163 is a part that switches the communication method for the information terminal based on the operation state of the terminal transmitted by the terminal state notification unit of the information terminal. As communication methods, there are mainly a bidirectional communication state, a one-way communication (streaming distribution) state in which transmission of video and audio and the like is only in the direction from the server to the terminal, and a data communication stop state.
The communication state between the server and the information terminal is changed for each information terminal according to the operation state of the information terminal.

また、代表端末検出部１６４は、記憶部１７１の端末データベース１７２に記憶された拠点ＩＤ１７４を用いて、代表端末とすべき端末が、同一拠点内に存在するか否かを検出する部分である。 The representative terminal detection unit 164 is a part that detects whether or not a terminal to be a representative terminal exists in the same base using the base ID 174 stored in the terminal database 172 of the storage unit 171.

図１０に、サーバの記憶部に格納されている各データベースの内容の一実施例の説明図を示す。 FIG. 10 shows an explanatory diagram of an embodiment of the contents of each database stored in the storage unit of the server.

図１０（ａ）は、サーバの記憶部に格納されている端末データベース１７２の一実施例を示している。
ここでは、端末ごとに、装置ＩＤ１７３と、拠点ＩＤ１７４と、ＩＰアドレス１７５と、発表端末フラグ１７６と、代表端末フラグ１７７と、発言要求端末フラグ１７８とが、関連付けて記憶されている。 FIG. 10A shows an example of the terminal database 172 stored in the storage unit of the server.
Here, for each terminal, a device ID 173, a base ID 174, an IP address 175, an announcement terminal flag 176, a representative terminal flag 177, and a speech request terminal flag 178 are stored in association with each other.

図１０（ｂ）は、サーバの記憶部に格納されているユーザデータベース１８１の一実施例を示している。
ここでは、ユーザ名１７９と、拠点ＩＤ１７４と、メールアドレス１８０と、ＩＰアドレス１７５とが関連付けて記憶されている。
なお、各データベースに格納される情報の項目は、以上のものに限られず、他に必要な項目があれば、追加して記憶してもよい。 FIG. 10B shows an example of the user database 181 stored in the storage unit of the server.
Here, the user name 179, the base ID 174, the mail address 180, and the IP address 175 are stored in association with each other.
Note that the items of information stored in each database are not limited to those described above, and any other necessary items may be additionally stored.

図１０において、代表端末フラグ１７７は、端末が代表端末であるか否かを識別するための情報であり、「ＯＮ」のとき、その端末が代表端末であることを意味し、「ＯＦＦ」のとき、その端末は代表端末でないことを意味する。
図１０においては、４つの端末のうち、拠点ＩＤが「関西」となっている２つの端末が、同一拠点の端末であり、この同一拠点に含まれる２つの端末のうち、装置ＩＤ＝「Ｇ５３１８２」の端末の代表端末フラグ１７７が「ＯＮ」であるので、この端末が代表端末であることを意味する。
ここで、同一拠点が１つのグループを形成し、１つのグループには、代表端末は１つだけ存在する。 In FIG. 10, a representative terminal flag 177 is information for identifying whether or not the terminal is a representative terminal. When “ON”, it means that the terminal is a representative terminal, and “OFF” Sometimes the terminal is not a representative terminal.
In FIG. 10, of the four terminals, two terminals whose base ID is “Kansai” are terminals of the same base, and among the two terminals included in the same base, the device ID = “G53182” The representative terminal flag 177 of the terminal “” is “ON”, which means that this terminal is the representative terminal.
Here, the same base forms one group, and there is only one representative terminal in one group.

グループ化は、たとえば、図１０のように、比較的近くの位置に設置されている端末同士を１つのグループとして設定するものであり、同じ拠点に属する複数の端末は、１つのグループに属するものとして扱う。
また、同一ルータあるいは同一中継サーバの配下のネットワークに属する複数の端末を、１つのグループに属するものとして扱ってもよく、さらに、ＩＰアドレスを構成する数字列のうち、特定の数字列部分が同一の端末（同一ドメイン内の端末）を１つのグループに属するものとして扱ってもよい。
以下の説明では、同一拠点ＩＤを持つ複数の端末は、同一のグループに属するものとして扱うことにする。 In the grouping, for example, as shown in FIG. 10, terminals installed at relatively close positions are set as one group, and a plurality of terminals belonging to the same base belong to one group. Treat as.
In addition, a plurality of terminals belonging to a network under the same router or the same relay server may be treated as belonging to one group, and a specific numeric string portion of the numeric strings constituting the IP address is the same. Terminals (terminals in the same domain) may be treated as belonging to one group.
In the following description, a plurality of terminals having the same base ID are treated as belonging to the same group.

また、図１０の場合、同一拠点に含まれる２つの端末は、１つのグループに属する端末とみなされ、サーバから見た場合、そのグループに属する代表端末のみがサーバに接続されているかのように扱われる。すなわち、サーバから映像音声等の情報を送信する場合、サーバから、同一拠点（同一グループ）に属する代表端末のみにその情報がストリーミングで送信され、その同一拠点に属する代表端末でない他の参加端末にはサーバから直接送信されない。
代表端末でない参加端末には、原則として、同一拠点に属する代表端末から、ストリーミング配信で、情報を送信するものとする。 In the case of FIG. 10, two terminals included in the same site are regarded as terminals belonging to one group, and as viewed from the server, only the representative terminals belonging to the group are connected to the server. Be treated. That is, when information such as video and audio is transmitted from the server, the information is transmitted from the server to only the representative terminals belonging to the same base (the same group), and is transmitted to other participating terminals that are not representative terminals belonging to the same base. Is not sent directly from the server.
In principle, information is transmitted to participating terminals that are not representative terminals by streaming distribution from representative terminals belonging to the same base.

また、同一拠点内の代表端末は、原則として同じグループ内に属する他の参加端末に対してのみストリーミング配信を行うが、他の拠点に属する端末ＴＥｎからストリーミング配信の要求を受けた場合は、代表端末は、他の拠点のその端末ＴＥｎに対して、ストリーミング配信をしてもよい。 Further, in principle, the representative terminal in the same base performs streaming distribution only to other participating terminals belonging to the same group. However, when a streaming distribution request is received from a terminal TEn belonging to another base, The terminal may perform streaming delivery to the terminal TEn at another base.

また、既存発言端末検出部１６５は、すでに、発言端末となっている端末があるか否かを検出する部分であり、端末データベースの発言要求端末フラグ１７８を利用して検出する。
図１０に示す「発言要求端末フラグ」が「ＯＮ」となっている端末があれば、その端末が発言端末であることを示し、「ＯＦＦ」の場合は、発言端末でないことを示している。
図１０の場合は、端末ＴＥ４が発言要求端末であることを示している。 Further, the existing speech terminal detection unit 165 is a portion that detects whether or not there is a terminal that is already a speech terminal, and detects it using the speech request terminal flag 178 in the terminal database.
If there is a terminal whose “utterance request terminal flag” shown in FIG. 10 is “ON”, it indicates that the terminal is a speaking terminal, and “OFF” indicates that it is not a speaking terminal.
In the case of FIG. 10, it is indicated that the terminal TE4 is a speech request terminal.

ストリーミング配信部１６６は、端末に対して、映像および音声等からなる共有情報を、ストリーミングとして送信する部分である。
双方向通信制御部１６７は、上記した端末の双方向通信制御部８９と同様に、所定の通信プロトコルに従って、サーバと端末間の双方向の通信を制御する部分である。
また、情報端末から発言要求情報を受信した場合に、その情報端末とサーバとの間の通信状態を、端末情報と共有情報とを相互に直接送受信可能な双方向通信状態に切り替える。
以上が、サーバの主な機能である。 The streaming distribution unit 166 is a part that transmits shared information including video and audio to the terminal as streaming.
The bidirectional communication control unit 167 is a part that controls bidirectional communication between the server and the terminal according to a predetermined communication protocol, like the bidirectional communication control unit 89 of the terminal described above.
When speech request information is received from an information terminal, the communication state between the information terminal and the server is switched to a bidirectional communication state in which terminal information and shared information can be directly transmitted and received.
The above is the main function of the server.

＜第１実施例：サーバ端末間通信状態の説明＞
以下に、サーバ端末間通信状態の一実施例として、６つのサーバと端末間の通信状態を説明する。
ここでは、サーバ１に対し、４台の端末（ＴＥ１〜ＴＥ４）が、ネットワークを介して接続されているものとして説明する。ただし、接続される端末の数は、４台に限定されるものではない。
図４に、第１実施例において、サーバ端末間通信状態の説明図（実施例１−１）を示す。
図４では、１台の端末ＴＥ１のみが、電源ＯＮとされた状態を示している。
この端末ＴＥ１は、発表端末であるものとし、サーバには、端末ＴＥ１の発表端末フラグが、「ＯＮ」に予め設定されているものとする。
この場合、発表端末ＴＥ１とサーバとの通信状態は、端末ＴＥ１からサーバへの片方向通信のみが可能な状態となっている。
すなわち、端末ＴＥ１のカメラで撮影された自画像の入力映像（ＳＤ１）が、端末で符号化（エンコード）された後、所定の通信プロトコルで、サーバへ送信される。この片方向通信では、端末からサーバに対して画像および音声のデータ送信のみが可能で、サーバから端末への画像および音声のデータ送信は行われない。 <First embodiment: description of communication state between server terminals>
Below, the communication state between six servers and a terminal is demonstrated as an Example of the communication state between server terminals.
Here, description will be made assuming that four terminals (TE1 to TE4) are connected to the server 1 via a network. However, the number of connected terminals is not limited to four.
FIG. 4 shows an explanatory diagram (embodiment 1-1) of a communication state between server terminals in the first embodiment.
FIG. 4 shows a state in which only one terminal TE1 is turned on.
It is assumed that the terminal TE1 is a presentation terminal, and the presentation terminal flag of the terminal TE1 is set to “ON” in advance in the server.
In this case, the communication state between the presentation terminal TE1 and the server is a state in which only one-way communication from the terminal TE1 to the server is possible.
That is, the input image (SD1) of the self-portrait photographed with the camera of the terminal TE1 is encoded by the terminal and then transmitted to the server using a predetermined communication protocol. In this one-way communication, only image and audio data transmission from the terminal to the server is possible, and image and audio data transmission from the server to the terminal is not performed.

その他の端末（ＴＥ２，ＴＥ３，ＴＥ４）は、まだ電源ＯＦＦの状態で、サーバに対して未接続状態であるものとする。
また、端末ＴＥ２は、同一拠点に属する他の端末はなく、単一の端末とする。
端末ＴＥ３とＴＥ４は、同一拠点に属し、１つのグループにグループ化される端末とする。
図４の状態では、これらの３つの端末（ＴＥ２，ＴＥ３，ＴＥ４）は、送信される画像（送信画像ＳＤ２，ＳＤ３，ＳＤ４）はなく、さらに、受信画像（ＲＤ２，ＲＤ３，ＲＤ４）もない。
また、図４では、サーバ１に対して１台の端末ＴＥ１のみが接続されており、しかも片方向通信状態なので、端末ＴＥ１のディスプレイには、表示される受信画像はない。 The other terminals (TE2, TE3, TE4) are still in a power-off state and are not connected to the server.
The terminal TE2 is a single terminal without other terminals belonging to the same base.
Terminals TE3 and TE4 are terminals belonging to the same base and grouped into one group.
In the state of FIG. 4, these three terminals (TE2, TE3, TE4) do not have images to be transmitted (transmission images SD2, SD3, SD4) and further have no reception images (RD2, RD3, RD4).
Further, in FIG. 4, only one terminal TE1 is connected to the server 1, and since it is in a one-way communication state, there is no received image displayed on the display of the terminal TE1.

図１１（ａ）に、この実施例１−１の場合における端末データベースの３つのフラグの状態の説明図を示す。
図１１（ａ）に示すように、端末ＴＥ１の発表端末フラグのみが「ＯＮ」であり、他のフラグは、すべて「ＯＦＦ」である。
また、拠点ＩＤにおいて、「関西」となっている２つの端末（ＴＥ３，ＴＥ４）は、同一拠点、すなわち同一のグループに属する端末として扱われる。 FIG. 11A shows an explanatory diagram of the states of the three flags of the terminal database in the case of the embodiment 1-1.
As shown in FIG. 11A, only the announcement terminal flag of the terminal TE1 is “ON”, and all other flags are “OFF”.
In addition, the two terminals (TE3, TE4) having “Kansai” in the base ID are treated as terminals belonging to the same base, that is, the same group.

図５に、第１実施例において、サーバ端末間通信状態（実施例１−２）の説明図を示す。
ここでは、図４の状態から、端末ＴＥ３が電源ＯＮされ、起動された状態を示している。端末ＴＥ３は、会議に必要なアプリケーションプログラムが起動され、サーバとの通信が可能な状態になったとする。
他の端末ＴＥ２とＴＥ４は、電源ＯＦＦの状態で、サーバに未接続であるとする。
端末ＴＥ３は、２台の端末（ＴＥ３，ＴＥ４）が属する１つの同一拠点の中で、最初に起動した端末であるとする。このとき、端末ＴＥ３は、サーバに対して、代表端末検出要求を送信する。
図４の状態のときに、この同一拠点内には、代表端末となっている端末がまだ存在しなかったとすると、サーバから代表端末未検出が通知されるので、端末ＴＥ３は、自らが代表端末となる。
この後、代表端末となった端末ＴＥ３は、その同一拠点の代表端末となったことをサーバへ通知し、サーバからストリーミング配信を受けることができるように、動作モードを代表端末モードへ切り替える。
サーバは、デコードからエンコードまでの一連の信号処理を行って、現在受信している端末ＴＥ１の映像ＳＤ１のみからなる合成画像を生成し、端末ＴＥ３へ、ストリーミングにより配信する。この合成画像が、共有情報の一つである。 FIG. 5 is an explanatory diagram of a communication state between server terminals (embodiment 1-2) in the first embodiment.
Here, the state where the terminal TE3 is powered on and activated from the state of FIG. 4 is shown. The terminal TE3 is assumed to be in a state where an application program necessary for the conference is activated and communication with the server is possible.
It is assumed that the other terminals TE2 and TE4 are not connected to the server in a power-off state.
The terminal TE3 is assumed to be the first activated terminal in one same site to which the two terminals (TE3, TE4) belong. At this time, the terminal TE3 transmits a representative terminal detection request to the server.
In the state of FIG. 4, if there is no terminal serving as a representative terminal in the same base, the server TE3 notifies that the representative terminal has not been detected. It becomes.
Thereafter, the terminal TE3, which has become the representative terminal, notifies the server that it has become the representative terminal of the same base, and switches the operation mode to the representative terminal mode so that streaming distribution can be received from the server.
The server performs a series of signal processing from decoding to encoding, generates a composite image including only the video SD1 of the terminal TE1 currently being received, and distributes it to the terminal TE3 by streaming. This composite image is one piece of shared information.

端末ＴＥ３では、ストリーミング配信により受信した画像ＲＤ３を、ディスプレイに表示する。ここで受信画像ＲＤ３は、たとえば１画面を４分割して、その１つの分割画面（図５では左上の画面）に、端末ＴＥ１の自画像を表示したものである。
図５の状態におけるフラグは、図１１（ｂ）に示すように、端末ＴＥ３の代表端末フラグが、「ＯＦＦ」から「ＯＮ」に設定変更される。
なお、発表端末ＴＥ１では、サーバと片方向通信をしているので、サーバからの受信画像ＲＤ１は受信されないが、端末ＴＥ１側で、その自画像のみからなる受信画像ＲＤ１を生成して、端末ＴＥ１のディスプレイに表示してもよい。 The terminal TE3 displays the image RD3 received by streaming delivery on the display. Here, the received image RD3 is obtained by, for example, dividing one screen into four and displaying the self-portrait of the terminal TE1 on the one divided screen (the upper left screen in FIG. 5).
As for the flag in the state of FIG. 5, the representative terminal flag of the terminal TE3 is changed from “OFF” to “ON” as shown in FIG. 11B.
In addition, since the presentation terminal TE1 performs one-way communication with the server, the reception image RD1 from the server is not received, but the terminal TE1 side generates a reception image RD1 including only the self-portrait of the terminal TE1. You may display on a display.

図６に、第１実施例において、サーバ端末間通信状態（実施例１−３）の説明図を示す。
ここでは、図５の状態から、端末ＴＥ２とＴＥ４とが、電源ＯＮされ、起動された状態を示している。
代表端末ＴＥ３は、サーバから合成画像のストリーミング配信を受け、同一拠点内の他の端末（ＴＥ４）に対しては、ストリーミング配信を行うことが可能な状態となっている。
まず、同一拠点に属する端末ＴＥ４は、すでに代表端末ＴＥ３があるので、参加端末となり、代表端末ＴＥ３からストリーミング配信を受ける。サーバからストリーミング配信される受信画像ＲＤ３と、代表端末ＴＥ３からストリーミング配信される受信画像ＲＤ４とは、同じ共有情報である。
代表端末ＴＥ３から送られてくる画像ＲＤ４は、図５と同じように、端末ＴＥ１の自画像のみからなる４分割画面の映像であり、端末ＴＥ４は、この受信画像ＲＤ４をディスプレイに表示する。
参加端末ＴＥ４は、代表端末ＴＥ３からのストリーミング配信のみを受け、自らサーバに対して映像等を送信することができないので、送信画像ＳＤ４はない。 FIG. 6 is an explanatory diagram of a communication state between server terminals (embodiment 1-3) in the first embodiment.
Here, from the state of FIG. 5, the terminals TE2 and TE4 are in a state where the power is turned on and activated.
The representative terminal TE3 receives streaming distribution of the composite image from the server, and is in a state where streaming distribution can be performed to other terminals (TE4) in the same base.
First, since the terminal TE4 belonging to the same base already has the representative terminal TE3, it becomes a participating terminal and receives streaming distribution from the representative terminal TE3. The received image RD3 that is streamed from the server and the received image RD4 that is streamed from the representative terminal TE3 are the same shared information.
Similarly to FIG. 5, the image RD4 sent from the representative terminal TE3 is a four-divided screen image consisting only of the self-portrait of the terminal TE1, and the terminal TE4 displays the received image RD4 on the display.
Since the participating terminal TE4 receives only streaming distribution from the representative terminal TE3 and cannot transmit video or the like to the server itself, there is no transmission image SD4.

次に、端末ＴＥ２は、単一の端末であるので、サーバに代表端末検出要求を送信した後、未検出が通知されるので、自らが代表端末となる。代表端末となった端末ＴＥ２は、サーバからストリーミング配信を受ける。端末ＴＥ２も、サーバから受信した画像ＲＤ２は、端末ＴＥ１の自画像のみからなる４分割画像である。
また、端末ＴＥ２も、サーバへ映像等の送信はできないので、送信画像ＳＤ２はない。
一方、サーバでは、端末ＴＥ２が代表端末となったことが設定され、図１１（ｃ）に示すように、端末ＴＥ２の代表端末フラグが「ＯＮ」となる。
この図６の状態では、４台の端末がすべて起動状態となっているが、そのうち１台の端末ＴＥ１が、サーバに対して、送信のみを行う片方向通信状態であり、２台の端末（ＴＥ２，ＴＥ３）が、サーバに対して、ストリーミング受信を行う片方向通信状態であり、もう１台の端末ＴＥ４は、サーバと直接映像等の通信は行わない端末である。 Next, since the terminal TE2 is a single terminal, it transmits a representative terminal detection request to the server, and is notified of non-detection. The terminal TE2 that has become the representative terminal receives streaming distribution from the server. Also for terminal TE2, image RD2 received from the server is a four-divided image consisting only of the self-portrait of terminal TE1.
Also, since the terminal TE2 cannot transmit video or the like to the server, there is no transmission image SD2.
On the other hand, in the server, it is set that the terminal TE2 has become the representative terminal, and as shown in FIG. 11C, the representative terminal flag of the terminal TE2 is set to “ON”.
In the state of FIG. 6, all four terminals are in an activated state, but one terminal TE1 is in a one-way communication state in which only transmission is performed to the server, and two terminals ( TE2 and TE3) are in a one-way communication state in which streaming reception is performed with respect to the server, and the other terminal TE4 is a terminal that does not perform communication such as video directly with the server.

ただし、図６の状態では、３つの端末（ＴＥ２，ＴＥ３，ＴＥ４）は、いずれも自らの自画像（ＳＤ２，ＳＤ３，ＳＤ４）を送信できない状態であり、発表端末ＴＥ１の利用者が一方的に、発表内容を他の端末の利用者に対して説明する状態と言える。
また、図６の状態では、端末の状態（動作モード）によって通信方向を制限しているので、４つの端末すべてがサーバと双方向通信を行う場合に比べて、サーバの負荷を軽減でき、さらにネットワークを流れるトラフィックを低減させることにより、伝送遅延を少なくし、通信障害の発生も低減できる。 However, in the state of FIG. 6, none of the three terminals (TE2, TE3, TE4) can transmit their own images (SD2, SD3, SD4), and the user of the presentation terminal TE1 unilaterally, It can be said that the contents of the announcement are explained to users of other terminals.
In the state of FIG. 6, the communication direction is limited by the state (operation mode) of the terminal, so that the load on the server can be reduced compared to the case where all four terminals perform bidirectional communication with the server. By reducing the traffic flowing through the network, transmission delay can be reduced and the occurrence of communication failures can be reduced.

図７に、第１実施例において、サーバ端末間通信状態（実施例１−４）の説明図を示す。
ここでは、図６の状態において、端末ＴＥ２の利用者が発言要求操作をし、端末ＴＥ２からサーバへ、発言要求があり、その発言が許可された場合を示している。
端末ＴＥ２は、代表端末でもあり、かつ発言端末となっている。発言端末となった端末ＴＥ２は、サーバと映像および音声の双方向通信を行う状態に切り替えられる。
また、サーバでは、図１２（ｄ）に示すように、端末ＴＥ２の発言要求端末フラグが「ＯＮ」に設定される。
さらに、端末ＴＥ２は、サーバとの双方向通信が行えるようになったので、その自画像（ＳＤ２）をサーバに送信する。
サーバでは、端末ＴＥ１から送られてくる映像ＳＤ１と、端末ＴＥ２から送られてくる映像ＳＤ２とについて、デコード，リサイズ，ミキシング，およびエンコードという一連の信号処理を行って、この２つの端末（ＴＥ１，ＴＥ２）の映像からなる合成画像を生成し、端末へ送信する。 FIG. 7 is an explanatory diagram of a communication state between server terminals (embodiment 1-4) in the first embodiment.
Here, in the state of FIG. 6, the user of the terminal TE2 performs a speech request operation, and there is a speech request from the terminal TE2 to the server, and the speech is permitted.
The terminal TE2 is also a representative terminal and a speaking terminal. The terminal TE2 that has become the speaking terminal is switched to a state in which bidirectional communication of video and audio is performed with the server.
In the server, as shown in FIG. 12D, the speech request terminal flag of the terminal TE2 is set to “ON”.
Furthermore, since the terminal TE2 can perform bidirectional communication with the server, the terminal TE2 transmits its own image (SD2) to the server.
The server performs a series of signal processing such as decoding, resizing, mixing, and encoding on the video SD1 sent from the terminal TE1 and the video SD2 sent from the terminal TE2, and these two terminals (TE1, TE1) are processed. A composite image composed of the video of TE2) is generated and transmitted to the terminal.

図７の状態では、発表端末ＴＥ１は、他に発言端末となった端末が出現したので、サーバと双方向通信ができるモードに切り替えられるものとする。
この状態では、サーバから３つの端末（ＴＥ１，ＴＥ２，ＴＥ３）に対して合成画像が送信される。このうち、代表端末ＴＥ３に対する送信は、ストリーミング送信である。
また、参加端末ＴＥ４へは、図６と同様に、代表端末ＴＥ３からのストリーミング配信で、同じ合成画像が送信される。これにより、４台の端末では、いずれも同じ合成画像が表示され、発表端末ＴＥ１の自画像と、現在発言している利用者の端末ＴＥ２の自画像とが分割表示された画面を見ながら、会議を進行することができる。
この場合も、４台の端末がサーバと双方向通信する場合に比べて、伝送遅延と通信障害を低減でき、サーバの負荷を軽減できる。
また、発言要求により、その発言要求した端末とサーバとの通信状態を双方向通信に切り替えているので、トラフィックは増加するが、その増加は最小限必要な範囲に抑制でき、かつ発言している利用者の自画像を各端末で見ることが可能となるので、会議に必要な情報を得ることができ、円滑な会議の進行が可能である。 In the state of FIG. 7, it is assumed that the presentation terminal TE <b> 1 is switched to a mode in which bidirectional communication with the server can be performed because another terminal that has become a speech terminal has appeared.
In this state, the composite image is transmitted from the server to the three terminals (TE1, TE2, TE3). Among these, transmission to the representative terminal TE3 is streaming transmission.
Similarly to FIG. 6, the same composite image is transmitted to the participating terminal TE4 by streaming delivery from the representative terminal TE3. As a result, the same composite image is displayed on all four terminals, and the conference is held while viewing the screen on which the self-portrait of the presentation terminal TE1 and the self-portrait of the terminal TE2 of the user currently speaking are displayed separately. Can proceed.
Also in this case, the transmission delay and communication failure can be reduced and the load on the server can be reduced as compared with the case where four terminals communicate with the server in two ways.
In addition, because the communication state between the terminal that requested the speech and the server is switched to bidirectional communication by the speech request, the traffic increases, but the increase can be suppressed to the minimum necessary range and the speech is made. Since the user's self-portrait can be viewed on each terminal, information necessary for the conference can be obtained, and the conference can proceed smoothly.

図８に、第１実施例において、サーバ端末間通信状態（実施例１−５）の説明図を示す。
ここでは、図７の状態において、端末ＴＥ３の利用者が発言要求操作をし、その発言要求が許可されて、発言端末が、端末ＴＥ２からＴＥ３へ変更された場合を示している。
まず、今まで発言端末であった端末ＴＥ２（既存発言要求端末）は、代表端末（図６と同じ状態）に戻り、サーバとの通信状態も、サーバからストリーミング配信を受ける片方向通信状態に戻る。
したがって、端末ＴＥ２の自画像ＳＤ２の送信は、中止される。
一方、端末ＴＥ３は、発言端末となり、サーバと双方向通信を行う。
端末ＴＥ３は、サーバへ映像等を送信できるようになったので、その自画像ＳＤ３をサーバへ送信する。サーバでは、２つの端末の自画像（ＴＥ１，ＴＥ３）を合成した画像が生成され、この合成画像を３つの端末（ＴＥ１，ＴＥ２，ＴＥ３）へ送信する。 FIG. 8 is an explanatory diagram of the communication state between the server terminals (Example 1-5) in the first example.
Here, in the state of FIG. 7, the user of the terminal TE3 performs a speech request operation, the speech request is permitted, and the speech terminal is changed from the terminal TE2 to the TE3.
First, the terminal TE2 (existing speech request terminal), which has been a speech terminal until now, returns to the representative terminal (the same state as in FIG. 6), and the communication state with the server also returns to the one-way communication state that receives streaming distribution from the server. .
Accordingly, transmission of the self-portrait SD2 of the terminal TE2 is stopped.
On the other hand, the terminal TE3 becomes a speech terminal and performs bidirectional communication with the server.
Since the terminal TE3 can transmit video and the like to the server, the terminal TE3 transmits the self-portrait SD3 to the server. The server generates an image obtained by combining the self-portraits (TE1, TE3) of the two terminals, and transmits the combined image to the three terminals (TE1, TE2, TE3).

また、サーバでは、図１２（ｅ）に示すように、ＴＥ２の発言要求端末フラグが「ＯＮ
」から「ＯＦＦ」に変更され、ＴＥ３の発言要求端末フラグが「ＯＮ」に設定される。
また、参加端末ＴＥ４は、代表端末でもある端末ＴＥ３からのストリーミング配信により、同じ合成画像を受信する。これにより、４台の端末では、２つの端末（ＴＥ１，ＴＥ３）の自画像からなる同じ受信画像が受信され、それぞれのディスプレイに表示される。
この図８の場合は、双方向通信可能な発言端末が、端末ＴＥ２からＴＥ３へ変更されただけであるので、図７の場合と同程度に、伝送遅延の低減と通信障害の低減と、サーバの負荷を軽減できる。 In the server, as shown in FIG. 12E, the message request terminal flag of TE2 is “ON”.
”Is changed to“ OFF ”, and the speech request terminal flag of TE3 is set to“ ON ”.
In addition, the participating terminal TE4 receives the same composite image by streaming delivery from the terminal TE3 which is also the representative terminal. Thereby, in the four terminals, the same received image consisting of the self-portraits of the two terminals (TE1, TE3) is received and displayed on the respective displays.
In the case of FIG. 8, the speaking terminal capable of bidirectional communication is merely changed from the terminal TE2 to the TE3. Therefore, as in the case of FIG. Can reduce the load.

図９に、第１実施例において、サーバ端末間通信状態（実施例１−６）の説明図を示す。
ここでは、図８の状態において、端末ＴＥ４の利用者が発言要求操作をし、その発言要求が許可されて、発言端末が端末ＴＥ４に変更された場合を示している。
まず、端末ＴＥ３（既存発言要求端末）は、発言端末から代表端末へ戻り、サーバとの通信状態は、サーバからストリーミング配信を受ける通信状態に変更される。
端末ＴＥ３からサーバへ、その自画像ＳＤ３は送信できなくなる。
また、今まで参加端末であった端末ＴＥ４は、発言端末となるので、サーバと双方向通信ができるように切り替えられる。すなわち、端末ＴＥ３から端末ＴＥ４へのストリーミング配信は中止され、端末ＴＥ４は、直接サーバと通信を行う状態となり、その自画像（ＳＤ４）をサーバへ送信できるようになる。 FIG. 9 is an explanatory diagram of the communication state between the server terminals (Example 1-6) in the first example.
Here, in the state of FIG. 8, a user of the terminal TE4 performs a speech request operation, the speech request is permitted, and the speech terminal is changed to the terminal TE4.
First, the terminal TE3 (existing speech request terminal) returns from the speech terminal to the representative terminal, and the communication state with the server is changed to a communication state in which streaming distribution is received from the server.
The self-portrait SD3 cannot be transmitted from the terminal TE3 to the server.
Further, since the terminal TE4 which has been a participating terminal until now becomes a speaking terminal, the terminal TE4 is switched so as to be able to perform two-way communication with the server. That is, the streaming delivery from the terminal TE3 to the terminal TE4 is stopped, and the terminal TE4 enters a state of directly communicating with the server, and can transmit its own image (SD4) to the server.

サーバでは、図１２（ｆ）に示すように、端末ＴＥ３の発言要求端末フラグを「ＯＦＦ」に変更し、端末ＴＥ４の発言要求端末フラグを「ＯＮ」に設定する。
また、サーバでは、２つの端末（ＴＥ１，ＴＥ４）の自画像を合成して、その合成映像を、４つの端末へ送信する。
各端末では、２つの端末（ＴＥ１，ＴＥ４）の自画像からなる同一の受信画像を受信して、それぞれのディスプレイに表示する。
この図９の場合、サーバと２つの端末（ＴＥ１，ＴＥ４）との通信状態は双方向通信となり、サーバと２つの端末（ＴＥ２，ＴＥ３）との通信状態は、サーバから端末へのストリーミング配信となる。
したがって、４つの端末がすべて双方向通信を行う場合に比べて、トラフィックは少ないので、伝送遅延の減少，通信障害の減少，およびサーバの負荷の軽減ができる。 As shown in FIG. 12F, the server changes the message request terminal flag of the terminal TE3 to “OFF” and sets the message request terminal flag of the terminal TE4 to “ON”.
Further, the server synthesizes the self-portraits of the two terminals (TE1, TE4) and transmits the synthesized video to the four terminals.
Each terminal receives the same received image made up of the self-portraits of the two terminals (TE1, TE4) and displays them on the respective displays.
In the case of FIG. 9, the communication state between the server and the two terminals (TE1, TE4) is bidirectional communication, and the communication state between the server and the two terminals (TE2, TE3) Become.
Therefore, the traffic is less than when all four terminals perform two-way communication, so that transmission delay can be reduced, communication failure can be reduced, and server load can be reduced.

＜第１実施例の通信シーケンスの説明＞
図２６に、端末の動作モードを切り替える場合の通信シーケンスの一実施例の概略説明図を示す。
ここで、情報端末ＴＥ１が、現在参加端末モード（Ｍ１）であったとする。
図２６（ａ）は、代表端末の検出要求をサーバに送信したときに、代表端末が検出された場合のシーケンスを示し、図２６（ｂ）は、代表端末が検出されず、端末ＴＥ１が自ら代表端末に動作モードを変更する場合のシーケンスを示している。 <Description of Communication Sequence of First Embodiment>
FIG. 26 shows a schematic explanatory diagram of an embodiment of a communication sequence when the operation mode of the terminal is switched.
Here, it is assumed that the information terminal TE1 is currently in the participating terminal mode (M1).
FIG. 26A shows a sequence when a representative terminal is detected when a request for detecting a representative terminal is transmitted to the server, and FIG. 26B shows that the representative terminal is not detected and the terminal TE1 itself The sequence for changing the operation mode to the representative terminal is shown.

図２６（ａ）において、まず、端末ＴＥ１が、自己が属することができるグループがあるか否かを確認するために、サーバに対して、自己の現在の動作モードが参加端末モード（Ｍ１）であることを示す端末状態通知を送信し、さらに、代表端末検出要求を送信する。
端末状態通知には、たとえば、端末ＴＥ１の装置ＩＤ，拠点ＩＤ，ＩＤアドレス，参加端末モードであることを示す情報などが含まれる。 In FIG. 26 (a), first, in order for the terminal TE1 to confirm whether there is a group to which the terminal TE1 can belong, the current operation mode of the terminal TE1 is the participating terminal mode (M1). A terminal status notification indicating the presence is transmitted, and a representative terminal detection request is further transmitted.
The terminal status notification includes, for example, the device ID of the terminal TE1, the base ID, the ID address, information indicating the participation terminal mode, and the like.

端末ＴＥ１から端末状態通知を受信したサーバは、端末データベースの中の端末ＴＥ１の３つのフラグを、すべて「ＯＦＦ」に設定し、参加端末モードであることを示す状態にする。
また、代表端末検出要求を受信したサーバは、端末データベースを利用して端末ＴＥ１とグループ化できる代表端末があるか否か調べる。
端末データベースの中に、端末ＴＥ１と同じ拠点ＩＤを持ち、かつ代表端末フラグが「ＯＮ」となっている端末（ＴＥ２）があったとすれば、この２つの端末（ＴＥ１，ＴＥ２）を、同一グループにグループ化できる。このとき代表端末（ＴＥ２）が検出されたことになるので、サーバは、端末ＴＥ１に対して、代表端末が見つかったことを示す代表端末検出通知を送信する。この通知には、代表端末ＴＥ２の装置ＩＤとＩＰアドレスとが含まれる。 The server that has received the terminal state notification from the terminal TE1 sets all the three flags of the terminal TE1 in the terminal database to “OFF”, and sets the state indicating the participation terminal mode.
Also, the server that has received the representative terminal detection request checks whether there is a representative terminal that can be grouped with the terminal TE1 using the terminal database.
If there is a terminal (TE2) having the same base ID as the terminal TE1 and the representative terminal flag “ON” in the terminal database, these two terminals (TE1, TE2) are assigned to the same group. Can be grouped into Since the representative terminal (TE2) is detected at this time, the server transmits a representative terminal detection notification indicating that the representative terminal is found to the terminal TE1. This notification includes the device ID and IP address of the representative terminal TE2.

端末ＴＥ１は、サーバからの代表端末検出通知により、自己が属することのできるグループ（同一拠点）が存在し、そのグループの代表端末が端末ＴＥ２であることを知る。
その後、端末ＴＥ１は、代表端末ＴＥ２に対して、ストリーミング要求を送信し、通信状態をストリーミング受信が可能な状態とする。
ストリーミング要求を受信した端末ＴＥ２は、サーバからストリーミング配信により送られてきた受信画像ＲＤ２を、端末ＴＥ１に対して、ストリーミング配信する。
端末ＴＥ１は、代表端末ＴＥ２からストリーミング配信により送られてくる映像や音声等を受信し、受信画像ＲＤ１をディスプレイに表示する。
この場合、端末ＴＥ１は、サーバから直接映像等の配信を受けることはなく、端末ＴＥ２から、映像や音声等の共有情報をストリーミング配信による片方向通信（ストリーミング受信）で取得する。これにより、端末ＴＥ１は、サーバと直接通信を行わない状態であっても、共有情報を受信することができる。 The terminal TE1 knows from the representative terminal detection notification from the server that there is a group (same base) to which the terminal TE1 can belong and that the representative terminal of the group is the terminal TE2.
After that, the terminal TE1 transmits a streaming request to the representative terminal TE2 so that the communication state can be received by streaming.
The terminal TE2 that has received the streaming request distributes the received image RD2 sent from the server by streaming distribution to the terminal TE1.
The terminal TE1 receives video, audio, etc. sent from the representative terminal TE2 by streaming delivery, and displays the received image RD1 on the display.
In this case, the terminal TE1 does not receive distribution of video or the like directly from the server, and acquires shared information such as video and audio from the terminal TE2 by one-way communication (streaming reception) by streaming distribution. Thereby, the terminal TE1 can receive the shared information even when the terminal TE1 does not communicate directly with the server.

図２６（ｂ）において、サーバが代表端末の検出処理を行ったが、端末ＴＥ１とグループ化できる他の端末がなく、代表端末が検出できなかったとする。このとき、サーバから端末ＴＥ１に対して、代表端末の未検出通知が送信される。
この未検出通知を受けた端末ＴＥ１は、自らを代表端末モードに切り替え、代表端末モードに移行したことを示す端末状態通知を、サーバに送信する。端末ＴＥ１は、サーバからのストリーミング配信を受信できる状態にする。
この通知を受けたサーバは、端末ＴＥ１の動作モードが代表端末モードに変更されたことを検出し、端末ＴＥ１の代表端末フラグを「ＯＮ」に設定する。その後、サーバは、端末ＴＥ１に対して、ストリーミング配信を行う。
これにより、端末ＴＥ１は、グループに属さない単一の代表端末として、サーバからのストリーミング配信により、映像や音声等の共有情報を受信する。 In FIG. 26B, it is assumed that the server performs the representative terminal detection process, but there is no other terminal that can be grouped with the terminal TE1, and the representative terminal cannot be detected. At this time, an undetected notification of the representative terminal is transmitted from the server to the terminal TE1.
The terminal TE1 that has received this non-detection notification switches itself to the representative terminal mode, and transmits a terminal state notification indicating that the terminal TE1 has shifted to the representative terminal mode to the server. The terminal TE1 is in a state where it can receive streaming delivery from the server.
Upon receiving this notification, the server detects that the operation mode of the terminal TE1 has been changed to the representative terminal mode, and sets the representative terminal flag of the terminal TE1 to “ON”. Thereafter, the server performs streaming delivery to the terminal TE1.
Thereby, the terminal TE1 receives shared information such as video and audio by streaming delivery from the server as a single representative terminal that does not belong to the group.

図２６（ａ）の通信シーケンスは、たとえば、図６の端末ＴＥ４とサーバとの間で行われる通信シーケンスに相当し、図２６（ｂ）の通信シーケンスは、図６の端末ＴＥ２とサーバとの間で行われる通信シーケンスに相当する。 The communication sequence in FIG. 26 (a) corresponds to, for example, a communication sequence performed between the terminal TE4 and the server in FIG. 6, and the communication sequence in FIG. 26 (b) is performed between the terminal TE2 and the server in FIG. This corresponds to a communication sequence performed between the two.

図２７に、端末で発言要求がされた場合の発言検知制御について、通信シーケンスの一実施例の概略説明図を示す。
図２７（ａ）は、参加端末モードであって、まだ発言要求をしていない端末ＴＥ１の利用者が、発言要求をした場合の通信シーケンスを示している。
図２７（ｂ）は、現在発言権を有する発言要求端末モードである端末（既存発言端末）ＴＥ１が存在している状態において、他の端末ＴＥ２の利用者が、新たに発言要求をした場合の通信シーケンスを示している。
図２７（ａ）では、端末ＴＥ１は、現在参加端末モードであり、サーバから映像等の共有情報を直接受信できない状態であるとする。このとき、サーバから共有情報の直接受信はできないが、端末ＴＥ１とサーバ間では、発言要求や端末状態通知などの情報は、所定の通信プロトコルを介して送受信できるものとする。 FIG. 27 shows a schematic explanatory diagram of an embodiment of a communication sequence regarding speech detection control when a speech request is made at a terminal.
FIG. 27A shows a communication sequence when the user of the terminal TE1 who is in the participating terminal mode and has not yet requested a speech makes a speech request.
FIG. 27B shows a case where a user of another terminal TE2 makes a new speech request in a state where the terminal (existing speech terminal) TE1 that is in the speech requesting terminal mode that currently has the right to speak is present. A communication sequence is shown.
In FIG. 27A, it is assumed that the terminal TE1 is currently in a participating terminal mode and cannot receive shared information such as video directly from the server. At this time, shared information cannot be directly received from the server, but information such as a speech request and a terminal status notification can be transmitted and received between the terminal TE1 and the server via a predetermined communication protocol.

図２７（ａ）において、端末ＴＥ１の利用者が、発言要求を意味する入力操作，音声入力，あるいは挙手等の動作による映像入力をしたとする。
このとき、端末ＴＥ１は、発言要求がされたことを検知し、サーバに対して発言要求を意味するデータを送信する。サーバは、この発言要求を受信すると、端末データベースの発言要求端末フラグにより、現在発言権を持つ端末がないか確認する（既存発言端末検出）。
ここで、発言要求端末フラグが「ＯＮ」となっている既存発言端末が検出されたとすると、サーバはその既存発言端末（ＴＥ２）に対して、参加端末への切替要求を送信する。この切替要求を受けた既存発言端末（ＴＥ２）は、自己の動作モードを、発言要求端末モードから参加端末モードへ切りかえる。
この後、既存発言端末（ＴＥ２）は、サーバとの通信状態を、双方向通信状態から、サーバあるいは同一拠点（同一グループ）の代表端末から一方向的にストリーミング配信を受ける片方向通信状態に変更する。 In FIG. 27A, it is assumed that the user of the terminal TE1 inputs a video by an operation such as an input operation, a voice input, or a hand raising meaning a speech request.
At this time, the terminal TE1 detects that a speech request has been made, and transmits data representing the speech request to the server. When the server receives this speech request, the server confirms whether there is a terminal that currently has the right to speak based on the speech request terminal flag in the terminal database (existing speech terminal detection).
Here, if an existing speech terminal whose speech request terminal flag is “ON” is detected, the server transmits a request for switching to a participating terminal to the existing speech terminal (TE2). Receiving this switching request, the existing speech terminal (TE2) switches its own operation mode from the speech request terminal mode to the participation terminal mode.
Thereafter, the existing speech terminal (TE2) changes the communication state with the server from the bidirectional communication state to a one-way communication state in which streaming distribution is received unidirectionally from a representative terminal at the server or the same base (same group). To do.

さらに、サーバは、発言要求をした端末ＴＥ１に対し、発言要求端末モードへの切替許可通知を送信する。
この切替許可通知を受信した端末ＴＥ１は、自己の動作モードを、参加端末モードから、発言要求端末モードに切り替える。
また、サーバは、端末データベースの発言要求端末フラグについて、端末ＴＥ１を「ＯＮ」とし、端末ＴＥ２を「ＯＦＦ」に変更する。
この後、サーバと、端末ＴＥ１とは、この両者間の通信状態を、映像等の共有情報を双方向通信できる状態に切り替える。これにより、端末ＴＥ１からサーバへ、自己の映像や音声を送信することができ、サーバから端末ＴＥ１へ、端末ＴＥ１の映像等を含む共有情報を、所定のプロトコルに従って送信することができるようになる。
発言権を得た端末ＴＥ１の利用者の発言情報は、サーバに接続され会議に参加している他のすべての端末に対して、共有情報として送信される。 Further, the server transmits a notification of permission to switch to the speech request terminal mode to the terminal TE1 that has requested speech.
The terminal TE1 that has received this switching permission notification switches its operation mode from the participating terminal mode to the speech requesting terminal mode.
Also, the server changes the terminal TE1 to “ON” and the terminal TE2 to “OFF” for the message request terminal flag in the terminal database.
Thereafter, the server and the terminal TE1 switch the communication state between the two to a state in which shared information such as video can be bidirectionally communicated. As a result, the user's own video and audio can be transmitted from the terminal TE1 to the server, and the shared information including the video of the terminal TE1 can be transmitted from the server to the terminal TE1 according to a predetermined protocol. .
The speech information of the user of the terminal TE1 who has obtained the right to speak is transmitted as shared information to all other terminals connected to the server and participating in the conference.

図２７（ｂ）において、端末ＴＥ１は既存発言端末であり、サーバと双方向通信状態にあるとする。
図２７（ｂ）では、端末ＴＥ１がすでに既存発言端末となっているので、端末データベースの端末ＴＥ１の発言要求端末フラグが「ＯＮ」となっている。
このとき、他の端末ＴＥ２の利用者が発言要求を意味する入力操作をしたとする。他の端末ＴＥ２において、図２７（ａ）に示したのと同様に、サーバへ発言要求が送信され、発言要求を受信したサーバは、既存発言端末を検出する処理を行う。
サーバは、端末データベースにより、端末ＴＥ１が既存発言端末であることを検出するので、既存発言端末ＴＥ１に対して、参加端末への切替要求を送信する。 In FIG. 27B, it is assumed that the terminal TE1 is an existing speech terminal and is in a two-way communication state with the server.
In FIG. 27B, since the terminal TE1 is already an existing speech terminal, the speech request terminal flag of the terminal TE1 in the terminal database is “ON”.
At this time, it is assumed that a user of another terminal TE2 performs an input operation meaning a speech request. In the other terminal TE2, as in the case shown in FIG. 27 (a), a speech request is transmitted to the server, and the server that has received the speech request performs a process of detecting an existing speech terminal.
Since the server detects from the terminal database that the terminal TE1 is an existing speech terminal, the server transmits a request for switching to a participating terminal to the existing speech terminal TE1.

この切替要求を受信した端末ＴＥ１は、自己の動作モードを参加端末モードへ切り替え、自己端末を参加端末モードへ切り替えたことを示す端末状態通知を、サーバへ送信する。
この後、端末ＴＥ１は、参加端末モードとなるので、サーバあるいは同一拠点（同一グループ）の代表端末からのストリーミング配信を受ける片方向通信状態となる。
一方、端末ＴＥ１から端末状態通知（参加端末モード）を受信したサーバは、端末ＴＥ１との通信状態をたとえばサーバからストリーミング配信を行う片方向通信状態に切り替える。また、端末ＴＥ１の発言要求端末フラグを「ＯＦＦ」に設定する。これにより、端末ＴＥ１とサーバ間の通信状態は、サーバから端末ＴＥ１へ、映像等をストリーミング配信する片方向通信状態となる。
以上が、この発明の発言検知制御に関する通信シーケンスである。 The terminal TE1 that has received this switching request switches its own operation mode to the participating terminal mode, and transmits a terminal state notification indicating that its own terminal has been switched to the participating terminal mode to the server.
Thereafter, since the terminal TE1 enters the participation terminal mode, the terminal TE1 enters a one-way communication state in which streaming distribution is received from a representative terminal at the server or the same base (same group).
On the other hand, the server that has received the terminal state notification (participating terminal mode) from the terminal TE1 switches the communication state with the terminal TE1 to, for example, a one-way communication state in which streaming distribution is performed from the server. In addition, the speech request terminal flag of the terminal TE1 is set to “OFF”. As a result, the communication state between the terminal TE1 and the server becomes a one-way communication state in which video or the like is streamed from the server to the terminal TE1.
The communication sequence related to the speech detection control of the present invention has been described above.

発言要求をした端末は、サーバとの間で双方向通信状態となるが、発言要求端末モードとなれる端末は、サーバに接続され管理されているすべての端末の中で１つのみに制限することにより、サーバと双方向通信が可能な端末の数を少なくすることができる。
したがって、サーバと双方向通信が可能な端末を制限することにより、すべての端末がサーバと双方向通信する場合に比べて、ネットワークを流れるトラフィックを少なくすることができるので、伝送遅延の低減，通信障害の減少，およびサーバの処理負担の軽減ができる。 The terminal that has made a speech request enters a two-way communication state with the server, but the number of terminals that can enter the speech request terminal mode is limited to only one of all terminals connected to the server and managed. Thus, the number of terminals capable of bidirectional communication with the server can be reduced.
Therefore, by limiting the terminals that can communicate bidirectionally with the server, traffic flowing through the network can be reduced compared to when all terminals communicate bidirectionally with the server. Reduces failures and server processing load.

＜情報端末におけるモード切替処理等の説明＞
図２１に、この発明の端末におけるモード切替処理の一実施例のフローチャートを示す。
ステップＳ１において、端末の初期状態が、参加端末モードに設定されていたとする。参加端末モードは、上記したように、映像や音声等からなる共有情報のストリーミング配信を受けることが可能な状態である。
ステップＳ２において、端末の端末状態通知部８１が、サーバに、端末の現在の動作状態（動作モード）を通知する（端末状態通知処理）。
現在、参加端末モードなので、端末の装置ＩＤとともに、参加端末モードであることを示す情報が、サーバに送信される。
ステップＳ３において、代表端末検出要求部８３が、サーバに、代表端末検出要求を送信する。
ステップＳ４において、サーバから送られてくる代表端末が検出されたことを示す情報（代表端末検出通知）が受信されたか否か、チェックする。
受信された場合、ステップＳ５へ進み、受信されなかった場合あるいは代表端末が検出されなかったことを示す情報（未検出通知）が受信された場合は、ステップＳ９へ進む。 <Description of mode switching processing in information terminal>
FIG. 21 shows a flowchart of an embodiment of the mode switching process in the terminal of the present invention.
In step S1, it is assumed that the initial state of the terminal is set to the participating terminal mode. As described above, the participating terminal mode is a state in which streaming distribution of shared information including video and audio can be received.
In step S2, the terminal state notification unit 81 of the terminal notifies the server of the current operation state (operation mode) of the terminal (terminal state notification process).
Since it is currently a participating terminal mode, information indicating that it is in the participating terminal mode is transmitted to the server together with the device ID of the terminal.
In step S3, the representative terminal detection request unit 83 transmits a representative terminal detection request to the server.
In step S4, it is checked whether or not information (representative terminal detection notification) indicating that a representative terminal sent from the server has been received has been received.
If received, the process proceeds to step S5, and if not received or if information indicating that the representative terminal has not been detected (not-detected notification) is received, the process proceeds to step S9.

検出されたことを示す情報（検出通知）が受信された場合、この端末と同じグループに属する代表端末が存在し、その代表端末のＩＰアドレス等がわかるので、ステップＳ５において、ストリーミング要求部８６が、代表端末に対して、ストリーミング配信を要求する情報を送信する（ストリーミング要求）。
代表端末は、サーバからの検出通知の中に含まれる代表端末の装置ＩＤやＩＰアドレス等により特定できる。
ステップＳ６において、ストリーミング受信部８７が、代表端末から送られてくるストリーミング配信を受ける（ストリーミング受信）。
ステップＳ７において、ステップＳ３と同様に、再度サーバに対して、代表端末検出要求を送信する。この処理は、その後、代表端末が何らかの理由で通信回線への接続が切断されたり、電源ＯＦＦ状態となっていないかを確認するためのものである。
ステップＳ８において、サーバからの代表端末の検出通知を受信したか否かチェックし、同じ代表端末が存在することを示す検出通知を受信した場合は、処理を終了する。 When the information indicating the detection (detection notification) is received, there is a representative terminal belonging to the same group as this terminal, and the IP address and the like of the representative terminal can be known. In step S5, the streaming request unit 86 The information requesting streaming delivery is transmitted to the representative terminal (streaming request).
The representative terminal can be identified by the device ID, IP address, etc. of the representative terminal included in the detection notification from the server.
In step S6, the streaming receiving unit 87 receives streaming delivery sent from the representative terminal (streaming reception).
In step S7, a representative terminal detection request is transmitted to the server again, as in step S3. This process is for confirming whether the representative terminal is disconnected from the communication line for some reason or is not turned off.
In step S8, it is checked whether a representative terminal detection notification is received from the server. If a detection notification indicating that the same representative terminal is present is received, the process ends.

一方、検出通知を受けていない場合、あるいは代表端末の未検出通知を受信した場合、ステップＳ９へ進む。
ステップＳ９において、代表端末がなかったので、その端末自らを代表端末とするために、端末モード切替部８２が、動作モードを、参加端末モードから代表端末モードに切り替える。これにより、サーバからストリーミング配信を受けることができ、かつ他の端末に対してストリーミング配信を行えるモードに切り替える。
ステップＳ１０において、端末状態通知部８１が、サーバに、自己の動作モードを、代表端末モードに切り替えたことを示す端末状態通知を送信する。
ステップＳ１１において、ストリーミング受信部８７が、サーバからストリーミング配信を受ける（ストリーミング受信）。
以上が端末におけるモード切替処理の一実施例であるが、各ステップの処理は、図２６の通信シーケンスに示したものと対応する。 On the other hand, if no detection notification has been received, or if a non-detection notification from the representative terminal has been received, the process proceeds to step S9.
In step S9, since there is no representative terminal, the terminal mode switching unit 82 switches the operation mode from the participating terminal mode to the representative terminal mode in order to make the terminal itself a representative terminal. Thereby, it switches to the mode which can receive streaming delivery from a server and can perform streaming delivery with respect to another terminal.
In step S10, the terminal state notification unit 81 transmits to the server a terminal state notification indicating that its own operation mode has been switched to the representative terminal mode.
In step S11, the streaming receiving unit 87 receives streaming distribution from the server (streaming reception).
The above is one embodiment of the mode switching process in the terminal. The process in each step corresponds to that shown in the communication sequence of FIG.

図２２に、この発明の代表端末となった端末のストリーミング配信処理のフローチャートを示す。
ステップＳ２１において、他の端末から、ストリーミング配信要求を受信したか否か、チェックする。受信した場合は、ステップＳ２２に進み、ストリーミング配信部８８が、要求をしてきた端末に対し、共有情報のストリーミング配信を行う。 FIG. 22 shows a flowchart of the streaming distribution processing of the terminal that has become the representative terminal of the present invention.
In step S21, it is checked whether a streaming distribution request has been received from another terminal. If received, the process proceeds to step S22, where the streaming distribution unit 88 performs streaming distribution of the shared information to the requesting terminal.

＜サーバにおける端末状態検出等の説明＞
図２３に、サーバの端末状態検出制御処理のフローチャートを示す。
ステップＳ１０１において、端末状態判断部１６２が、端末状態情報を受信したか否か、チェックする。受信した場合は、ステップＳ１０２へ進み、そうでない場合は、ステップＳ１０１をループする。
ステップＳ１０２において、端末状態判断部１６２が、受信した端末状態が、参加端末モードか否かチェックする。参加端末モードである場合は、ステップＳ１０３へ進み、そうでない場合は、処理を終了する。 <Description of terminal status detection in the server>
FIG. 23 shows a flowchart of the server terminal state detection control process.
In step S101, the terminal state determination unit 162 checks whether or not terminal state information has been received. If received, the process proceeds to step S102, and if not, step S101 is looped.
In step S102, the terminal state determination unit 162 checks whether or not the received terminal state is a participating terminal mode. If it is the participation terminal mode, the process proceeds to step S103, and if not, the process ends.

ステップＳ１０３において、代表端末検出部１６４が、代表端末検出処理を行う。
ここでは、まず、端末から代表端末検出要求が送信されてきたか否かチェックし、かかる要求が受信された場合に、端末データベース１７２を調べて、代表端末の有無をチェックする。代表端末検出要求を送信してきた端末の拠点ＩＤと同じ拠点ＩＤを持つ端末が端末データベースの中にないか否かを調べる。あれば、その端末の代表端末フラグ１７７が「ＯＮ」となっているかどうか調べる。 In step S103, the representative terminal detection unit 164 performs a representative terminal detection process.
Here, first, it is checked whether or not a representative terminal detection request has been transmitted from the terminal. When such a request is received, the terminal database 172 is checked to check the presence or absence of the representative terminal. It is checked whether there is a terminal in the terminal database having the same base ID as the base ID of the terminal that has transmitted the representative terminal detection request. If there is, it is checked whether the representative terminal flag 177 of the terminal is “ON”.

その代表端末フラグが「ＯＮ」となっていれば、代表端末検出要求を送信してきた端末と同一拠点（同一グループ）に属する他の端末であって、かつ代表端末であるものが存在することが検出される。
一方、同一拠点ＩＤを持つ端末が他になければ、同じグループに属する端末がなく、代表端末も存在しないことが検出される。 If the representative terminal flag is “ON”, there may be other terminals that belong to the same base (same group) as the terminal that transmitted the representative terminal detection request and that are representative terminals. Detected.
On the other hand, if there are no other terminals having the same base ID, it is detected that there are no terminals belonging to the same group and there is no representative terminal.

ステップＳ１０４において、代表端末が検出されたか否かチェックする。検出された場合は、ステップＳ１０５へ進み、そうでない場合は、ステップＳ１０６へ進む。
ステップＳ１０５において、検出された代表端末の装置ＩＤとＩＰアドレスを含む検出通知を、検出要求をしてきた端末に送信する。
一方、ステップＳ１０６において、代表端末が検出されなかったことを示す情報（未検出通知）を、検出要求をしてきた端末に送信する。
以上のサーバの処理も、図２６の通信シーケンスに示したものと対応する。 In step S104, it is checked whether a representative terminal is detected. If detected, the process proceeds to step S105, and if not, the process proceeds to step S106.
In step S105, a detection notification including the detected device ID and IP address of the representative terminal is transmitted to the terminal that has made the detection request.
On the other hand, in step S106, information indicating that the representative terminal has not been detected (not-detected notification) is transmitted to the terminal that has made the detection request.
The above server processing also corresponds to that shown in the communication sequence of FIG.

＜端末における発言検知等の説明＞
図２４に、この発明の端末の発言検知および要求処理のフローチャートを示す。
ステップＳ３１において、端末の動作モードの初期状態が、参加端末モードであったとする。
ステップＳ３２において、発言検知部８４が、その端末の利用者が発言を意味する入力操作，音声入力あるいは映像入力（挙手動作）のいずれかをしたことを検知したか否か、チェックする（発言検知処理）。
発言検知をした場合は、ステップＳ３３へ進み、そうでない場合は処理を終了する。
ステップＳ３３において、発言要求部８５が、サーバに、発言要求があったことを意味する情報（発言要求情報）を送信する（発言要求処理）。
参加端末モードでは、自画像等をサーバへ送信することはできないが、この発言要求情報は、所定の通信プロトコルによって、サーバへ送信される。 <Explanation such as speech detection on the terminal>
FIG. 24 shows a flowchart of speech detection and request processing of the terminal of the present invention.
In step S31, it is assumed that the initial state of the operation mode of the terminal is the participation terminal mode.
In step S32, the speech detection unit 84 checks whether it has detected that the user of the terminal has performed an input operation, speech input, or video input (raising hand operation) meaning speech (speech detection). processing).
If speech is detected, the process proceeds to step S33, and if not, the process ends.
In step S33, the speech request unit 85 transmits information (speech request information) indicating that a speech request has been made to the server (speech request processing).
In the participation terminal mode, the self-portrait or the like cannot be transmitted to the server, but this request information is transmitted to the server by a predetermined communication protocol.

ステップＳ３４において、サーバから発言要求端末モードへの切替許可通知を受信すると、端末モード切替部８２が、自己の動作モードを、参加端末モードから、発言要求端末モードへ切り替える。これにより、この端末は、既存発言端末となる。
ステップＳ３５において、双方向通信制御部８９が、通信状態を、サーバと音声および映像の双方向通信ができる状態に変更する。この後、端末からサーバに対して、端末のカメラで撮影された自画像や入力音声を送信することができるようになる。
また、サーバから送られてくる受信画像などの共有情報を直接受信できるようになる。
ステップＳ３６において、サーバからの参加端末モードへの状態切替要求が受信されたか否か、チェックする。これは、たとえば、既存発言端末となっている状態で、他の端末が発言要求をした場合に対応した処理である。 In step S34, when a notification of permission to switch to the speech request terminal mode is received from the server, the terminal mode switching unit 82 switches its own operation mode from the participating terminal mode to the speech request terminal mode. Thereby, this terminal becomes an existing speech terminal.
In step S35, the two-way communication control unit 89 changes the communication state to a state in which two-way communication of audio and video with the server is possible. Thereafter, a self-portrait or input sound captured by the terminal camera can be transmitted from the terminal to the server.
In addition, shared information such as a received image sent from the server can be directly received.
In step S36, it is checked whether or not a status switching request to the participating terminal mode is received from the server. This is a process corresponding to a case where, for example, another terminal requests to speak in a state where the terminal is an existing speaking terminal.

この切替要求が受信された場合、ステップＳ３７へ進み、受信されなかった場合は処理を終了する。
ステップＳ３７において、端末モード切替部８２が、動作モードを、参加端末モードへ切り替える。参加端末モードになった端末は、サーバからあるいは、代表端末がある場合にはその代表端末からストリーミング配信を受ける片方向通信状態になる。
ステップＳ３８において、端末状態通知部８１が、サーバに、その端末の動作モードが、参加端末モードに変更されたことを通知する。
ステップＳ３９において、ストリーミング受信部８７が、サーバからストリーミング配信により送られてくる音声や映像等の共有情報を受信する。 When this switching request is received, the process proceeds to step S37, and when it is not received, the process ends.
In step S37, the terminal mode switching unit 82 switches the operation mode to the participating terminal mode. The terminal in the participation terminal mode enters a one-way communication state in which streaming distribution is received from the server or, if there is a representative terminal, from the representative terminal.
In step S38, the terminal state notification unit 81 notifies the server that the operation mode of the terminal has been changed to the participating terminal mode.
In step S39, the streaming receiver 87 receives shared information such as audio and video sent from the server by streaming delivery.

＜サーバにおける発言端末検出等の説明＞
図２５に、この発明のサーバにおける発言端末検出制御処理のフローチャートを示す。
ステップＳ１２１において、端末状態判断部１６２が、端末から端末状態情報を受信したか否か、チェックする。受信した場合、ステップＳ１２２へ進み、そうでない場合、ステップＳ１２１をループする。
ステップＳ１２２において、受信した端末状態情報が、発言要求端末モードへの移行を意味する発言要求である場合、ステップＳ１２３へ進み、そうでない場合処理を終了する。
発言要求を送信してきた端末を、発言要求端末と呼ぶ。
ステップＳ１２３において、既存発言端末検出部１６５が、既存発言端末検出処理を行う。ここでは、端末データベース１７２の発言要求端末フラグ１７８の中に、「ＯＮ」となっている端末があるか否か、チェックする。発言要求端末フラグ１７８が「ＯＮ」となっているものがあれば、すでに発言権を持つ端末が存在しており、既存発言端末が検出されたことを意味する。
一方、発言要求端末フラグ１７８が、すべて「ＯＦＦ」の場合、現在、既存発言端末は存在せず、検出されなかったことを意味する。 <Explanation of Talking Terminal Detection in Server>
FIG. 25 shows a flowchart of the speaking terminal detection control process in the server of the present invention.
In step S121, the terminal state determination unit 162 checks whether terminal state information has been received from the terminal. If received, the process proceeds to step S122, and if not, step S121 is looped.
In step S122, if the received terminal state information is a message request that means a shift to the message request terminal mode, the process proceeds to step S123, and if not, the process ends.
A terminal that has transmitted a speech request is referred to as a speech request terminal.
In step S123, the existing speech terminal detection unit 165 performs an existing speech terminal detection process. Here, it is checked whether or not there is a terminal that is “ON” in the speech request terminal flag 178 of the terminal database 172. If there is one in which the request terminal flag 178 is “ON”, it means that there is already a terminal having the right to speak, and an existing speaking terminal has been detected.
On the other hand, if all of the speech request terminal flags 178 are “OFF”, it means that there is no existing speech terminal and has not been detected.

ステップＳ１２４において、既存発言端末が検出されたか否かチェックし、検出された場合ステップＳ１２７へ進み、検出されなかった場合はステップＳ１２５へ進む。
ステップＳ１２５において、発言要求をしてきた発言要求端末に、発言要求端末モードへの切替許可通知を送信する。ここで、端末データベースの中の発言要求端末の発言要求端末フラグを、「ＯＮ」に設定する。
ステップＳ１２６において、双方向通信制御部１６７が、通信状態を、発言要求端末と、音声および映像を双方向通信できる状態にする。この後、サーバと発言要求端末との間では、相互に、映像等の情報が送受信される。 In step S124, it is checked whether or not an existing speech terminal has been detected. If it is detected, the process proceeds to step S127.
In step S125, a notification of permission to switch to the speech request terminal mode is transmitted to the speech request terminal that has requested the speech. Here, the speech request terminal flag of the speech request terminal in the terminal database is set to “ON”.
In step S126, the two-way communication control unit 167 sets the communication state to a state where voice and video can be two-way communicated with the speech request terminal. Thereafter, information such as video is transmitted and received between the server and the speech requesting terminal.

一方、ステップＳ１２７において、既存発言端末を参加端末へ移行させる設定をする。ここでは、端末データベースの中の既存発言端末の発言要求端末フラグ１７８を、「ＯＮ」から「ＯＦＦ」に変更する。
ステップＳ１２８において、通信方法切替部１６３が、既存発言端末に対して、参加端末モードへの切替要求を送信する。
ステップＳ１２９において、上記切替要求に応答して、既存発言端末であった端末から、参加端末モードへ移行したことを示す端末状態通知を受信したか否か、チェックする。受信した場合は、ステップＳ１３０へ進み、そうでない場合は、ステップＳ１２９をループする。 On the other hand, in step S127, a setting is made to shift the existing speech terminal to the participating terminal. Here, the speech request terminal flag 178 of the existing speech terminal in the terminal database is changed from “ON” to “OFF”.
In step S128, the communication method switching unit 163 transmits a request for switching to the participating terminal mode to the existing speech terminal.
In step S129, in response to the switching request, it is checked whether or not a terminal state notification indicating that the terminal has changed to the participating terminal mode has been received from a terminal that has been an existing speaking terminal. If received, the process proceeds to step S130. If not, step S129 is looped.

ステップＳ１３０において、通信方法切替部１６３が、参加端末モードに切り替えられた端末に対して、ストリーミング配信を行うようにする。
ステップＳ１３１において、ステップＳ１２２で新たに発言要求をしてきた端末に対して、発言要求端末モードへの切替許可通知を送信する。ここで、この新たな発言要求端末の発言要求端末フラグを、「ＯＮ」に設定する。
ステップＳ１３２において、双方向通信制御部１６７が、新たに発言要求端末となった端末との間の通信状態を、双方向通信状態にする。これにより、サーバと双方向通信ができる端末が変更されるが、双方向通信状態の端末の数に変化はない。 In step S130, the communication method switching unit 163 performs streaming delivery to the terminal switched to the participating terminal mode.
In step S131, a notification of permission to switch to the speech request terminal mode is transmitted to the terminal that has newly requested speech in step S122. Here, the speech request terminal flag of the new speech request terminal is set to “ON”.
In step S132, the bidirectional communication control unit 167 sets the communication state with the terminal that has newly become the speech requesting terminal to the bidirectional communication state. As a result, the terminals capable of bidirectional communication with the server are changed, but the number of terminals in the bidirectional communication state is not changed.

したがって、発言要求をする端末が別の端末に変更になったとしても、サーバと双方向通信する端末の数は同一なので、ネットワークを流れるトラフィックに大きな増減はなく、伝送遅延や通信障害の発生も増加することはなく、サーバの処理負荷についても大きな変化はない。 Therefore, even if the requesting terminal is changed to another terminal, the number of terminals that communicate bidirectionally with the server is the same, so there is no significant increase or decrease in traffic flowing through the network, and transmission delays or communication failures may occur. There is no increase, and there is no significant change in the processing load on the server.

また、この発明では、原則として発言要求できる端末に制限はなく、サーバに接続し通信が可能な端末であればよい。また、発言要求をする時期についても、原則としていつでも可能としてもよい。新たに発言要求をした端末があれば、サーバと映像等の双方向通信が可能な端末が、その新たな発言要求をした端末にすぐに切り替えられる。 Moreover, in this invention, there is no restriction | limiting in the terminal which can request a speech in principle, What is necessary is just a terminal which can be connected to a server and can communicate. In addition, the timing for requesting speech may be possible at any time in principle. If there is a terminal that has newly requested a speech, a terminal capable of bidirectional communication such as video with the server is immediately switched to the terminal that has requested the new speech.

また、既存発言端末がある場合は、その既存発言端末をたとえば参加端末モードに切り替えることにより、サーバとの双方向通信を解除し、サーバから端末への片方向通信状態となるので、その既存発言端末であった端末からの自画像の送信は行えなくなる。
しかし、新たな発言端末からの自画像や入力音声等の情報が、共有情報として、すべての端末に送信されるので、発言者の情報が参加者全員に伝送でき、円滑な会議の進行が可能となる。
また、発言要求を検出して、双方向通信をする端末の数を制限することにより、ネットワークを流れるトラフィックも、すべての端末がサーバと双方向通信する場合に比べて、少なくすることができるので、伝送遅延の低減と、通信障害の発生の低減と、サーバの処理負荷を軽減することができる。 Also, if there is an existing speech terminal, switching the existing speech terminal to the participation terminal mode, for example, cancels bidirectional communication with the server and enters a one-way communication state from the server to the terminal. The self-portrait cannot be transmitted from the terminal that was the terminal.
However, since information such as the self-portrait and input voice from the new speaking terminal is transmitted to all terminals as shared information, the information of the speaking person can be transmitted to all participants, and a smooth conference can be proceeded. Become.
In addition, by detecting the request to speak and limiting the number of terminals that perform two-way communication, the traffic flowing through the network can also be reduced compared to when all terminals communicate with the server. Reduction of transmission delay, occurrence of communication failure, and server processing load can be reduced.

＜第２実施例：サーバ端末間通信状態の説明＞
図１３から図２０に、この発明の第２実施例のサーバと端末間の通信状態の説明図を示す。
第１実施例では、代表端末から他の端末へストリーミング配信を行う場合、そのストリーミング配信を受ける端末は、同一グループ（同一拠点）に属する端末に制限していた。
たとえば、図７で、代表端末ＴＥ３からストリーミング配信する相手の端末ＴＥ４は、同一拠点に属する端末であった。
この第２実施例では、グループ化は、同一拠点に属する複数の端末に対して行うが、グループ化によって代表端末となった端末からストリーミング配信をする対象の端末としては、同一グループに属する端末に加えて、そのグループに属さない端末であって、グループ化できる端末を持たない単一の端末として存在する端末も対象とする。 <Second embodiment: description of communication state between server terminals>
FIG. 13 to FIG. 20 are explanatory diagrams of the communication state between the server and the terminal according to the second embodiment of the present invention.
In the first embodiment, when streaming delivery is performed from a representative terminal to another terminal, the terminals that receive the streaming delivery are limited to terminals belonging to the same group (same base).
For example, in FIG. 7, the partner terminal TE4 that performs streaming delivery from the representative terminal TE3 is a terminal that belongs to the same base.
In this second embodiment, grouping is performed for a plurality of terminals belonging to the same base. However, as a terminal to which streaming distribution is performed from a terminal that has become a representative terminal by grouping, terminals belonging to the same group are assigned. In addition, terminals that do not belong to the group and exist as a single terminal that does not have terminals that can be grouped are also targeted.

図１３に、第２実施例のサーバ端末間通信状態（実施例２−１）の説明図を示す。
この図１３の通信状態は、図４に示したものと同一であり、発表端末である端末ＴＥ１のみがサーバに接続されている。
図１４に、第２実施例のサーバ端末間通信状態（実施例２−２）の説明図を示す。この図１４の通信状態は、図５に示したものと同一であり、端末ＴＥ３が、同一拠点内の代表端末となった状態を示している。
図１９と図２０に、第２実施例の各通信状態における端末データベースの３つのフラグの状態を示す。
図１９（ａ）の実施例２−１のフラグの状態と、図１９（ｂ）の実施例２−２のフラグの状態は、それぞれ図１１（ａ），（ｂ）と同一である。 FIG. 13 is an explanatory diagram of a communication state between server terminals (embodiment 2-1) of the second embodiment.
The communication state in FIG. 13 is the same as that shown in FIG. 4, and only the terminal TE1, which is the presentation terminal, is connected to the server.
In FIG. 14, explanatory drawing of the communication state between server terminals (Example 2-2) of 2nd Example is shown. The communication state shown in FIG. 14 is the same as that shown in FIG. 5, and the terminal TE3 is a representative terminal in the same base.
19 and 20 show the states of the three flags of the terminal database in each communication state of the second embodiment.
The state of the flag in Example 2-1 in FIG. 19A and the state of the flag in Example 2-2 in FIG. 19B are the same as those in FIGS. 11A and 11B, respectively.

図１５に、第２実施例のサーバ端末間通信状態（実施例２−３）の説明図を示す。
図１５は、第１実施例の図６に対応するが、端末ＴＥ２が代表端末とはならず、参加端末として起動させられ、すでに代表端末となっていた端末ＴＥ３から、ストリーミング配信を受ける点が異なる。
図１５の場合、端末ＴＥ２が、電源ＯＮにされ、起動させられると、サーバに対して、代表端末検知要求を行う。しかし、サーバで代表端末を探しても、この端末ＴＥ２とグループ化できる代表端末がなかったとすると、サーバは、グループ化とは関係なく、サーバに接続された端末の中に、すでに代表端末となっている端末がないか調べる。
図１５においては、端末ＴＥ２と同一拠点ではないが、すでに代表端末となっている端末ＴＥ３が見つけられる。 FIG. 15 is an explanatory diagram of a communication state between server terminals (embodiment 2-3) of the second embodiment.
FIG. 15 corresponds to FIG. 6 of the first embodiment, but the point that the terminal TE2 does not become a representative terminal, is activated as a participating terminal, and receives streaming distribution from the terminal TE3 that has already been the representative terminal. Different.
In the case of FIG. 15, when the terminal TE2 is turned on and activated, it makes a representative terminal detection request to the server. However, if there is no representative terminal that can be grouped with the terminal TE2 even if the server searches for the representative terminal, the server is already a representative terminal among the terminals connected to the server, regardless of the grouping. Check if there is a terminal.
In FIG. 15, a terminal TE3 that is not the same base as the terminal TE2 but is already a representative terminal can be found.

そこで、サーバは、端末ＴＥ２に代表端末の未検出を通知する代わりに、代表端末を端末ＴＥ３とする情報を含む代表端末検出通知を、端末ＴＥ２へ送信する。この代表端末検出通知を受信した端末ＴＥ２は、代表端末ＴＥ３からストリーミング配信を受けることが可能な動作モードに切り替える。
一方、サーバは、代表端末ＴＥ３に対して、同一拠点（同一グループ）に属さない端末ＴＥ２に対してもストリーミング配信を行うことを要求する情報（グループ外端末へのストリーミング配信要求）を、送信する。
この要求情報を受信した端末ＴＥ３は、同一拠点内の参加端末ＴＥ４へのストリーミング配信を継続したまま、要求のあったグループ外の端末ＴＥ２へのストリーミング配信も行う。
すなわち、図１５に示すように、端末ＴＥ３は、サーバからのストリーミング配信で受信画像ＲＤ３を受信した後、端末ＴＥ４に対して、受信画像ＲＤ４をストリーミング配信し、端末ＴＥ２に対しても、受信画像ＲＤ２をストリーミング配信する。ここで、３つの受信画像（ＲＤ２，ＲＤ３，ＲＤ４）は、サーバから送信される共有情報と同一の情報である。 Therefore, instead of notifying the terminal TE2 that the representative terminal has not been detected, the server transmits a representative terminal detection notification including information indicating that the representative terminal is the terminal TE3 to the terminal TE2. The terminal TE2 that has received the representative terminal detection notification switches to an operation mode in which streaming distribution can be received from the representative terminal TE3.
On the other hand, the server transmits, to the representative terminal TE3, information (streaming distribution request to a terminal outside the group) that requests that the terminal TE2 not belonging to the same base (the same group) perform streaming distribution. .
The terminal TE3 that has received the request information also performs streaming delivery to the terminal TE2 outside the requested group while continuing streaming delivery to the participating terminal TE4 in the same base.
That is, as shown in FIG. 15, after receiving the received image RD3 by streaming delivery from the server, the terminal TE3 performs streaming delivery of the received image RD4 to the terminal TE4, and also receives the received image RD4 to the terminal TE2. RD2 is streamed. Here, the three received images (RD2, RD3, RD4) are the same information as the shared information transmitted from the server.

この実施例２−３では、端末ＴＥ２は代表端末ではなく参加端末であるので、図１９（ｃ）に示すように、端末ＴＥ２の代表端末フラグは、「ＯＦＦ」のままである。
この図１５の場合、端末ＴＥ３は、ストリーミング配信する相手の端末が２つとなるため、配信処理にかかる負荷が増大する。
しかし、サーバは、端末ＴＥ３に対してのみストリーミング配信すればよく、図６に示したように端末ＴＥ２に対するストリーミング配信をしなくてよいので、ストリーミング配信処理にかかる負荷が少なくてよい。
すなわち、この第２実施例の図１５の場合、第１実施例よりもサーバにおける処理の負荷を軽減できる。 In Example 2-3, since the terminal TE2 is not a representative terminal but a participating terminal, the representative terminal flag of the terminal TE2 remains “OFF” as illustrated in FIG. 19C.
In the case of FIG. 15, since the terminal TE3 has two counterpart terminals for streaming delivery, the load on the delivery process increases.
However, the server only needs to distribute the streaming to the terminal TE3, and does not have to perform the streaming distribution to the terminal TE2 as shown in FIG. 6, so the load on the streaming distribution process may be small.
That is, in the case of FIG. 15 of the second embodiment, the processing load on the server can be reduced as compared with the first embodiment.

図１６に、第２実施例のサーバ端末間通信状態（実施例２−４）の説明図を示す。
この図１６では、図１５の通信状態において、端末ＴＥ２の利用者が発言要求をした場合を示している。
このとき、端末ＴＥ２が発言端末となり、発表端末ＴＥ１とともに、２つの端末（ＴＥ１，ＴＥ２）がサーバと双方向通信が可能な状態となる。
ここで、図２０（ｄ）に示すように、端末ＴＥ２の発言要求端末フラグが、「ＯＮ」に設定される。
この通信状態は、図７とほぼ同様であるが、図７とは端末ＴＥ２が代表端末でない点が異なる。この場合は、サーバと双方向通信する端末が２つとなるので、トラフィックは増加するが、双方向通信が可能となった２つの端末（ＴＥ１，ＴＥ２）の自画像（ＳＤ１，ＳＤ２）を含む共有情報が、すべての端末へ送信されることになる。 FIG. 16 is an explanatory diagram of a communication state between server terminals (embodiment 2-4) of the second embodiment.
FIG. 16 shows a case where the user of terminal TE2 makes a speech request in the communication state of FIG.
At this time, the terminal TE2 becomes a speaking terminal, and the two terminals (TE1, TE2) together with the presentation terminal TE1 are in a state capable of bidirectional communication with the server.
Here, as shown in FIG. 20D, the speech request terminal flag of the terminal TE2 is set to “ON”.
This communication state is almost the same as that in FIG. 7 except that the terminal TE2 is not a representative terminal. In this case, since there are two terminals for two-way communication with the server, the traffic increases, but the shared information including the self-portraits (SD1, SD2) of the two terminals (TE1, TE2) capable of two-way communication. Will be sent to all terminals.

図１７に、第２実施例のサーバ端末間通信状態（実施例２−５）の説明図を示す。
この図１７では、図１６の通信状態において、端末ＴＥ３の利用者が発言要求をした場合を示している。このとき、発言端末が、ＴＥ２からＴＥ３へ変更される。
したがって、端末ＴＥ３とサーバとの間で、双方向通信が行われる。
また、端末ＴＥ２は、もとの参加端末へ戻り、サーバとの双方向通信状態が解除され、図１５と同様に端末ＴＥ３からストリーミング配信を受けるようになる。
また、図２０（ｅ）に示すように、発言要求端末フラグは、端末ＴＥ２が「ＯＦＦ」となり、端末ＴＥ３が「ＯＮ」に変更される。
図１６では、サーバと双方向通信を行う端末が２つで、サーバからストリーミング配信を受ける端末が１つ存在したが、図１７ではサーバと双方向通信する端末が２つだけとなり、サーバのストリーミング配信にかかる負荷がなくなり、負荷が軽減される。 In FIG. 17, explanatory drawing of the communication state between server terminals (Example 2-5) of 2nd Example is shown.
FIG. 17 shows a case where the user of the terminal TE3 makes a speech request in the communication state of FIG. At this time, the speaking terminal is changed from TE2 to TE3.
Therefore, bidirectional communication is performed between the terminal TE3 and the server.
In addition, the terminal TE2 returns to the original participating terminal, cancels the bidirectional communication state with the server, and receives streaming delivery from the terminal TE3 as in FIG.
Also, as shown in FIG. 20 (e), in the speech request terminal flag, the terminal TE2 is changed to “OFF” and the terminal TE3 is changed to “ON”.
In FIG. 16, there are two terminals that perform two-way communication with the server and one terminal that receives streaming distribution from the server. However, in FIG. 17, there are only two terminals that perform two-way communication with the server, and the server streaming is performed. The load on distribution is eliminated and the load is reduced.

図１８に、第２実施例のサーバ端末間通信状態（実施例２−６）の説明図を示す。
この図１８では、図１７の通信状態において、端末ＴＥ４の利用者が発言要求をした場合を示している。このとき、発言端末が、ＴＥ３からＴＥ４へ変更され、端末ＴＥ４とサーバ間で、双方向通信が行われるようになる。
一方、端末ＴＥ３は、もとの代表端末に戻り、サーバからストリーミング配信を受ける。また、端末ＴＥ３から端末ＴＥ２へのストリーミング配信は、そのまま継続される。
ここで、図２０（ｆ）に示すように、発言要求端末フラグは、端末ＴＥ３が「ＯＦＦ」で、端末ＴＥ４が「ＯＮ」に変更される。
この場合、サーバと端末間の通信状態は、双方向通信する端末が２つで、ストリーミング配信を受ける端末が１つとなり、サーバは、図１６と同様の負荷がかかる状態となる。
また、端末ＴＥ２は、グループ化できる他の端末がなく、代表端末ともなっていないが、１つのグループに属する他の代表端末ＴＥ３からストリーミング配信を受けるようにしているので、サーバの処理負荷を軽減することができる。 FIG. 18 shows an explanatory diagram of the communication state between server terminals (Example 2-6) of the second example.
FIG. 18 shows a case where the user of terminal TE4 makes a speech request in the communication state of FIG. At this time, the speaking terminal is changed from TE3 to TE4, and bidirectional communication is performed between the terminal TE4 and the server.
On the other hand, the terminal TE3 returns to the original representative terminal and receives streaming distribution from the server. Further, the streaming distribution from the terminal TE3 to the terminal TE2 is continued as it is.
Here, as shown in FIG. 20F, the speech request terminal flag is changed to “OFF” for the terminal TE3 and “ON” for the terminal TE4.
In this case, the communication state between the server and the terminal is two terminals for two-way communication and one terminal for receiving streaming distribution, and the server is in a state where a load similar to that in FIG. 16 is applied.
Also, the terminal TE2 has no other terminals that can be grouped and is not a representative terminal. However, since the terminal TE2 receives streaming distribution from another representative terminal TE3 belonging to one group, the processing load on the server is reduced. be able to.

以上のように、第２実施例の場合も、グループ化により、サーバもしくは、同一拠点（同一グループ）の代表端末からのストリーミング配信および、発言要求の検出によってサーバと双方向通信できる発言要求端末を１つに限定しているので、全端末がサーバと双方向通信する場合に比べて、伝送遅延を低減させ、通信障害を低減させることができ、さらにグループ化できる端末を持たない単一の端末に対して、他の代表端末からストリーミング配信をするようにしているので、サーバの共有情報を配信するのにかかる負荷を軽減させることができる。 As described above, in the case of the second embodiment as well, a message request terminal that can perform two-way communication with a server by streaming distribution from a server or a representative terminal of the same base (same group) and detection of a message request by grouping. Since it is limited to one, all terminals can reduce transmission delay and communication failure compared to the case where two-way communication with a server is performed, and a single terminal having no terminals that can be grouped On the other hand, since streaming delivery is performed from another representative terminal, it is possible to reduce the load required to deliver the shared information of the server.

また、現実には、サーバに接続される端末の台数は、５台以上の場合もあり、さらに、端末の動作モードも各端末ごとに変化するので、端末からサーバに送信される情報や、サーバから端末へ送信される合成画像も、図４〜図９、図１３〜１８等以外にも種々のパターンが考えられる。
また、サーバに接続された複数の端末がすべて双方向通信可能な状態であれば、サーバと各端末間で伝送されるデータ量は多く、サーバで処理すべきデータ量も多いため、伝送遅延や通信障害が起こる可能性は高くなるが、この発明のように、端末のグループ化と、端末における発言要求を検出することにより、サーバとの通信状態を変更するようにすれば、サーバに接続された大容量の情報の通信を行う端末の実質的な数を減少させることができ、伝送遅延の減少と通信障害の発生を低減させることができ、サーバの負荷を軽減できる。 In reality, the number of terminals connected to the server may be 5 or more, and the operation mode of the terminal changes for each terminal. Therefore, information transmitted from the terminal to the server, server In addition to FIGS. 4 to 9, FIGS. 13 to 18, and the like, various patterns can be considered for the composite image transmitted from the terminal to the terminal.
Also, if all of the terminals connected to the server are capable of bidirectional communication, the amount of data transmitted between the server and each terminal is large, and the amount of data to be processed by the server is large. Although there is a high possibility that a communication failure will occur, if the communication state with the server is changed by detecting the grouping of terminals and the request to speak at the terminal as in the present invention, the connection to the server is established. In addition, the substantial number of terminals that communicate large amounts of information can be reduced, the transmission delay can be reduced and the occurrence of communication failures can be reduced, and the load on the server can be reduced.

１サーバ
２情報端末（端末）
１１入力部
１２キーボード
１３マウス
１４タブレット
１５人検知センサ
１６入力処理部
２１映像入力部
２２カメラ
２３映像処理部
３１表示部
３２ディスプレイ
３３表示処理部
４１音声入力部
４２マイク
４３入力音声処理部
４５音声出力部
４６スピーカ
４７出力音声処理部
５１通信部
５２通信暗号部
５３通信復号部
５５認証部
６１記憶部
６２認識データベース
６３入力映像
６４入力音声
７１信号処理部
７２映像符号化部
７３映像復号化部
７４音声符号化部
７５音声復号化部
８０制御部
８１端末状態通知部
８２端末モード切替部
８３代表端末検出要求部
８４発言検知部
８５発言要求部
８６ストリーミング要求部
８７ストリーミング受信部
８８ストリーミング配信部
８９双方向通信制御部
１０１入力部
１１１表示部
１２１通信部
１３１ＭＣＵ部
１５１信号処理部
１６１制御部
１６２端末状態判断部
１６３通信方法切替部
１６４代表端末検出部
１６５既存発言端末検出部
１６６ストリーミング配信部
１６７双方向通信制御部
１７１記憶部
１７２端末データベース
１７３装置ＩＤ
１７４拠点ＩＤ
１７５ＩＰアドレス
１７６発表端末フラグ
１７７代表端末フラグ
１７８発言要求端末フラグ
１７９ユーザＩＤ
１８０メールアドレス
１８１ユーザデータベース 1 server 2 information terminal (terminal)
DESCRIPTION OF SYMBOLS 11 Input part 12 Keyboard 13 Mouse 14 Tablet 15 Human detection sensor 16 Input processing part 21 Video input part 22 Camera 23 Video processing part 31 Display part 32 Display 33 Display processing part 41 Audio | voice input part 42 Microphone 43 Input audio | voice processing part 45 Audio | voice output Unit 46 speaker 47 output audio processing unit 51 communication unit 52 communication encryption unit 53 communication decoding unit 55 authentication unit 61 storage unit 62 recognition database 63 input video 64 input audio 71 signal processing unit 72 video encoding unit 73 video decoding unit 74 audio Encoding unit 75 Speech decoding unit 80 Control unit 81 Terminal state notification unit 82 Terminal mode switching unit 83 Representative terminal detection request unit 84 Statement detection unit 85 Statement request unit 86 Streaming request unit 87 Streaming reception unit 88 Streaming distribution unit 89 Bidirectional Communication control unit 01 input unit 111 display unit 121 communication unit 131 MCU unit 151 signal processing unit 161 control unit 162 terminal state determination unit 163 communication method switching unit 164 representative terminal detection unit 165 existing speech terminal detection unit 166 streaming distribution unit 167 bidirectional communication control unit 171 Storage unit 172 Terminal database 173 Device ID
174 Base ID
175 IP address 176 Announcement terminal flag 177 Representative terminal flag 178 Speak request terminal flag 179 User ID
180 E-mail address 181 User database

Claims

A server and a plurality of information terminals are connected via a network,
Transmitting the terminal information input in the information terminal to a server;
The server generates shared information using terminal information transmitted from each information terminal,
An information sharing system for distributing the shared information to the information terminal,
The plurality of information terminals are grouped into one or a plurality of terminal groups based on predetermined setting conditions in advance, and one terminal group includes one representative terminal that can directly receive the shared information from the server And a participating terminal that receives distribution of the shared information from the representative terminal,
Each information terminal, a speech detection unit that detects that the user has made a speech request,
A speech request unit that transmits information indicating that the speech request has been made to the server;
A terminal mode switching unit that changes the operating state of the information terminal according to the notification from the server;
When the server receives the message request information from the information terminal, the communication state between the information terminal and the server can be directly transmitted / received between the terminal information and the shared information. A two-way communication control unit to switch to
An information sharing system, wherein the server manages such that only one information terminal that is in a two-way communication state with the server and is allowed to speak by the user exists at the same time.

If there is a first information terminal that has made a request for speech and the communication state with the server is a two-way communication state, the speech detection unit of the second information terminal makes a new speech request. , The message request unit of the second information terminal transmits information indicating the new message request to the server,
When the server receives speech request information from the second information terminal, the bidirectional communication control unit cancels the bidirectional communication state with the first information terminal and makes a new speech request. The information sharing system according to claim 1, wherein the communication state between the second information terminal and the server is switched to a two-way communication state.

When detection of a speech request by the user is performed at a speech detection unit of a plurality of information terminals almost simultaneously,
The bidirectional communication control unit of the server sets the communication state with the information terminal that has transmitted the speech request information received earliest among the speech request information transmitted from each of the plurality of information terminals, as the bidirectional communication state. Or switch to
The communication state with the information terminal which is judged that the user's speech request is detected at the earliest time is confirmed, and the two-way speech request information is switched to the bidirectional communication state. 2. Information sharing system.

The information terminal further includes an input unit that inputs an input signal by a user operation, a video input unit that inputs video captured by a camera, and an audio input unit that inputs audio from a microphone.
When the speech detection unit detects any one or more of an input signal meaning the speech request, an input voice meaning the speech request, or an input video meaning the speech request, the user makes a speech request The information sharing system according to claim 1, wherein the information sharing system is determined.

The setting condition for grouping is connection base information where each information terminal is arranged,
The information sharing system according to claim 1, wherein the server includes a storage unit that stores the connection base information for each information terminal.

6. The information sharing system according to claim 1, wherein representative terminals belonging to the one terminal group receive the shared information by streaming distribution from a server.

7. The information sharing according to claim 1, wherein the representative terminal belonging to the one terminal group performs streaming distribution of the shared information received from the server to the participating terminals belonging to the terminal group. system.

8. The information sharing system according to claim 7, wherein the representative terminal belonging to the one terminal group performs streaming distribution of the shared information received from the server to other information terminals not belonging to the terminal group.

When the grouped one terminal group is composed of a single information terminal, the information terminal itself is a representative terminal, and the terminal group is a terminal group that does not include a participating terminal. The information sharing system according to claim 1.

A server and a plurality of information terminals are connected via a network,
Transmitting the terminal information input in the information terminal to a server;
The server generates shared information using terminal information transmitted from each information terminal,
An information sharing method of an information sharing system for delivering the shared information to the information terminal,
The plurality of information terminals are grouped into one or a plurality of terminal groups based on predetermined setting conditions in advance, and one terminal group includes one representative terminal that can directly receive the shared information from the server , Including a participating terminal that receives the distribution of the shared information from the representative terminal,
A speech detection step in which the information terminal detects that the user has requested speech;
A speech requesting step of transmitting information indicating that the speech request has been made to a server;
When the server receives the speech request information from the information terminal, the communication state between the information terminal and the server is changed to a bidirectional communication state in which the terminal information and the shared information can be directly transmitted and received. A communication control step for managing the information terminal so that there is only one information terminal at a time, which is in a two-way communication state with the server and allowed to speak by the user;
An information sharing method comprising: a terminal mode switching step of changing a communication state of an information terminal that has detected a speech request to a bidirectional communication state based on a notification from the server.

A program of an information sharing system for causing a computer to execute each step of the information sharing method according to claim 10.