JP2011030063A

JP2011030063A - Video conference system, server apparatus, and video conference program

Info

Publication number: JP2011030063A
Application number: JP2009175274A
Authority: JP
Inventors: Thitiporn Lertrusdachakul; ティティポーンルートラットデーチャークン
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2009-07-28
Filing date: 2009-07-28
Publication date: 2011-02-10
Anticipated expiration: 2029-07-28
Also published as: JP5316286B2

Abstract

PROBLEM TO BE SOLVED: To provide a video conference system, server apparatus and video conference program by which a video conference can be carried out inexpensively. SOLUTION: A main speaker determining section 21 of a server apparatus 3 determines a main speaker of a video conference and, based on a determination result of the main speaker determining section 21, a video display control section 22 of the server apparatus 3 specifies a first main speaker and a second main speaker. The video display control section 22 then controls a display device 11 of any other point in such a way that video images of the first main speaker and the second main speaker are displayed at different positions on left and right sides, controls a display device 11 at a point, where the first main speaker is located, in such a way that the video image of the second main speaker is displayed at a left/right position different from a video display position of the second main speaker on the display device 11 of the other point, and controls a display device 11 of a point, where the second main speaker is located, in such a way that the video image of the first main speaker is displayed at a left/right position different from the video display position of the first main speaker on the display device 11 of the other point. COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は、複数の地点にいる出席者間でビデオ会議を行うためのビデオ会議システム，サーバ装置，及びビデオ会議プログラムに関する。 The present invention relates to a video conference system, a server device, and a video conference program for conducting a video conference between attendees at a plurality of points.

従来より、撮像装置と表示装置を用いて複数の地点にいる出席者の映像を撮影及び伝送することにより、複数の地点にいる出席者間でビデオ会議を行うビデオ会議システムが知られている。このようなビデオ会議システムでは、同じ会議を行っているという臨場感を出席者に与えるために、出席者の視線を互いに一致させることが重要な課題である。 2. Description of the Related Art Conventionally, a video conference system is known in which a video conference is performed between attendees at a plurality of locations by capturing and transmitting images of attendees at the plurality of locations using an imaging device and a display device. In such a video conference system, in order to give attendees a sense of realism that the same conference is being held, it is an important issue to match attendees' line of sight with each other.

このような背景から、複数の通信先の映像を１つの画面に合成して表示する表示装置と、会話中の通信先の映像と視線が一致する角度から通信元の正面の映像を撮影する複数の正面用撮像装置と、会話中の出席者同士の視線が一致する角度から通信元の横顔の映像を撮影する複数の横顔用撮像装置を備えるビデオ会議システムが提案されている（特許文献１参照）。 From such a background, a display device that synthesizes and displays a plurality of communication destination images on a single screen, and a plurality of images that capture a front image of the communication source from an angle at which the line of sight coincides with the image of the communication destination in conversation. A video conferencing system has been proposed that includes a front-side imaging device and a plurality of side-view imaging devices that capture a video of a communication source's profile from an angle at which the lines of sight of attendees in conversation match. ).

このビデオ会議システムは、出席者（通信元）がどの通信先の出席者と会話をしているのかを検出する。そしてビデオ会議システムは、検出結果に応じて、複数の正面用撮像装置により撮影された映像のうちの対応する映像を会話中の通信先に送信し、複数の横顔用撮像装置により撮影された映像のうちの対応する映像を会話中の通信先以外の通信先に送信する。 This video conferencing system detects which attendee (conversation source) is talking to which attendee. The video conferencing system then transmits a corresponding video out of the video captured by the plurality of front imaging devices to the communication destination during the conversation according to the detection result, and the video captured by the plurality of profile imaging devices. Is transmitted to a communication destination other than the communication destination in conversation.

従来のビデオ会議システムによれば、少なくとも｛（ビデオ会議の出席者数−１）×２｝台の撮像装置を出席者がいる各地点に設置しなければならないために、出席者数の増加に伴い撮像装置の必要設置台数が増加し、ビデオ会議を安価に行うことが困難になる。 According to the conventional video conferencing system, at least {(number of attendees of video conference −1) × 2} must be installed at each point where attendees are present. Along with this, the required number of installed image pickup devices increases, making it difficult to conduct video conferencing at low cost.

本発明は、上記に鑑みてなされたものであって、その目的は、ビデオ会議を安価に行うことが可能なビデオ会議システム，サーバ装置，及びビデオ会議プログラムを提供することにある。 The present invention has been made in view of the above, and an object of the present invention is to provide a video conference system, a server device, and a video conference program capable of performing a video conference at low cost.

上述した課題を解決し、目的を達成するために、本発明は、ビデオ会議の出席者がいる地点毎に配置された複数のビデオ会議端末装置と、電気通信回線を介して複数のビデオ会議端末装置に接続されたサーバ装置とを備え、各ビデオ会議端末装置は、少なくとも２地点の他の出席者の映像を左右異なる位置に表示する表示装置と、表示装置の表示画面に対面する出席者の映像を撮影する撮像装置とを備え、サーバ装置は、ビデオ会議の主要発言者を判定する主要発言者判定部と、表示装置が表示する出席者の映像を制御する映像表示制御部とを備え、映像表示制御部は、主要発言者判定部の判定結果に基づいて第１及び第２の主要発言者を特定し、撮像装置により撮影された第１の主要発言者と前記第２の主要発言者の映像が左右異なる位置に表示されるように、第１及び前記第２の主要発言者以外の出席者がいる地点の表示装置を制御し、撮像装置により撮影された第２の主要発言者の映像が、第１及び第２の主要発言者以外の出席者がいる地点の表示装置における、第２の主要発言者の映像表示位置とは異なる左右位置に表示されるように、第１の主要発言者がいる地点の表示装置を制御し、撮像装置により撮影された第１の主要発言者の映像が、第１及び第２の主要発言者以外の出席者がいる地点の表示装置における、第１の主要発言者の映像表示位置とは異なる左右位置に表示されるように、第２の主要発言者がいる地点の表示装置を制御する。 In order to solve the above-described problems and achieve the object, the present invention provides a plurality of video conference terminal devices arranged at points where attendees of a video conference are present, and a plurality of video conference terminals via telecommunication lines. Each of the video conference terminal devices includes a display device that displays images of other attendees at least at two different positions on the left and right sides of the attendant facing the display screen of the display device. An image pickup device that captures video, and the server device includes a main speaker determination unit that determines a main speaker of a video conference, and a video display control unit that controls a video of an attendee displayed by the display device, The video display control unit identifies the first and second main speakers based on the determination result of the main speaker determination unit, and the first main speaker and the second main speaker captured by the imaging device. The position of the video is different The display device of the point where the attendees other than the first and second main speakers are present is controlled so that the video of the second main speaker captured by the imaging device is displayed. The display of the point where the first main speaker is located so that it is displayed in the left and right positions different from the video display position of the second main speaker in the display device of the point where the attendees other than the second main speaker are present An image of the first main speaker in a display device at a point where an attendee other than the first and second main speakers is present, with the video of the first main speaker captured by the imaging device controlled by the apparatus The display device at the point where the second main speaker is located is controlled so as to be displayed at the left and right positions different from the display position.

上述した課題を解決し、目的を達成するために、本発明は、複数の地点にいるビデオ会議の出席者の中からビデオ会議の主要発言者を判定する主要発言者判定部と、ビデオ会議の出席者がいる地点毎に配置された表示装置に表示させる出席者の映像を制御する映像表示制御部とを備え、映像表示制御部は、主要発言者判定部の判定結果に基づいて第１及び第２の主要発言者を特定し、第１の主要発言者と第２の主要発言者の映像が左右異なる位置に表示されるように、第１及び第２の主要発言者以外の出席者がいる地点の表示装置を制御し、第２の主要発言者の映像が、第１及び第２の主要発言者以外の出席者がいる地点の表示装置における、第２の主要発言者の映像表示位置とは異なる左右位置に表示されるように、第１の主要発言者がいる地点の表示装置を制御し、第１の主要発言者の映像が、第１及び第２の主要発言者以外の出席者がいる地点の表示装置における、第１の主要発言者の映像表示位置とは異なる左右位置に表示されるように、第２の主要発言者がいる地点の表示装置を制御する。 In order to solve the above-described problems and achieve the object, the present invention provides a main speaker determination unit that determines a main speaker of a video conference from among video conference attendees at a plurality of points, A video display control unit for controlling the video of the attendee to be displayed on a display device arranged at each point where the attendee is present, the video display control unit based on the determination result of the main speaker determination unit Attendees other than the first and second main speakers are identified so that the second main speaker is identified and the images of the first main speaker and the second main speaker are displayed at different positions. The video display position of the second main speaker in the display device of the point where the attendees other than the first and second main speakers are present is controlled. The first main speaker is displayed so that it is displayed at a different left and right position. The display device of the first main speaker is controlled, and the video display position of the first main speaker in the display device of the point where attendees other than the first and second main speakers are present is controlled. Controls the display device at the point where the second main speaker is located so that they are displayed at different left and right positions.

上述した課題を解決し、目的を達成するために、本発明は、複数の地点にいるビデオ会議の出席者の中からビデオ会議の主要発言者を判定する主要発言者判定ステップと、ビデオ会議の出席者がいる地点毎に配置された表示装置に表示させる出席者の映像を制御する映像表示制御ステップとをコンピュータに実行させ、映像表示制御ステップは、主要発言者判定ステップの結果に基づいて第１及び第２の主要発言者を特定するステップと、第１の主要発言者と第２の主要発言者の映像が左右異なる位置に表示されるように、第１及び第２の主要発言者以外の出席者がいる地点の表示装置を制御するステップと、第２の主要発言者の映像が、第１及び第２の主要発言者以外の出席者がいる地点の表示装置における、第２の主要発言者の映像表示位置とは異なる左右位置に表示されるように、第１の主要発言者がいる地点の表示装置を制御するステップと、第１の主要発言者の映像が、第１及び前記第２の主要発言者以外の出席者がいる地点の表示装置における、第１の主要発言者の映像表示位置とは異なる左右位置に表示されるように、第２の主要発言者がいる地点の表示装置を制御するステップとを含む。 In order to solve the above-described problems and achieve the object, the present invention provides a main speaker determination step for determining a main speaker of a video conference from among video conference attendees at a plurality of points, And a video display control step for controlling the video of the attendee to be displayed on the display device arranged at each point where the attendee is present. The video display control step is performed based on the result of the main speaker determination step. Steps for identifying the first and second main speakers, and other than the first and second main speakers so that the images of the first main speaker and the second main speaker are displayed at different positions on the left and right Controlling the display device at the point where the attendee is present and the video of the second main speaker is the second main in the display device at the point where attendees other than the first and second main speakers are present Video display of the speaker A step of controlling the display device at the point where the first main speaker is located so that the left and right positions are different from the position of the device, and images of the first main speaker are the first and second main messages. The display device at the point where the second main speaker is located is controlled such that the display device is displayed at a left and right position different from the video display position of the first main speaker in the display device where the attendee other than the present person is present. Steps.

本発明によれば、撮像装置の必要設置台数がビデオ会議の出席者数に依存しなくなるので、ビデオ会議を安価に行うことができる。 According to the present invention, since the required number of installed image pickup devices does not depend on the number of attendees of the video conference, the video conference can be performed at a low cost.

図１は、本発明の実施形態となるビデオ会議システムの構成を示すブロック図である。FIG. 1 is a block diagram showing a configuration of a video conference system according to an embodiment of the present invention. 図２は、図１に示す表示装置の画面構成を示す模式図である。FIG. 2 is a schematic diagram showing a screen configuration of the display device shown in FIG. 図３は、図１に示す左カメラと右カメラのレイアウトを示す模式図である。FIG. 3 is a schematic diagram showing a layout of the left camera and the right camera shown in FIG. 図４は、図１に示す主要発言者判定部の内部構成を示すブロック図である。FIG. 4 is a block diagram showing an internal configuration of the main speaker determination unit shown in FIG. 図５は、図１に示す表示装置の表示出力例を示す模式図である。FIG. 5 is a schematic diagram illustrating a display output example of the display device illustrated in FIG. 1. 図６（ａ）は、出席者が右画面領域を見ている際に右カメラにより撮影される映像を説明するための図、図６（ｂ）は、出席者が右画面領域を見ている際に左カメラにより撮影される映像を説明するための図、図６（ｃ）は、出席者が左画面領域を見ている際に右カメラにより撮影される映像を説明するための図、図６（ｄ）は、出席者が左画面領域を見ている際に左カメラにより撮影される映像を説明するための図である。FIG. 6A is a diagram for explaining an image captured by the right camera when the attendee is looking at the right screen area, and FIG. 6B is a view of the attendee looking at the right screen area. FIG. 6C is a diagram for explaining an image photographed by the right camera when the attendee is looking at the left screen area. FIG. 6D is a diagram for explaining an image captured by the left camera when the attendee is looking at the left screen area. 図７は、地点Ｐ１〜Ｐ４の表示装置に表示される出席者の映像の一例を示す模式図である。FIG. 7 is a schematic diagram illustrating an example of attendee images displayed on the display devices at the points P1 to P4. 図８は、映像表示制御処理の流れを示すフローチャート図である。FIG. 8 is a flowchart showing the flow of the video display control process. 図９は、図８に示すステップＳ４の処理のサブルーチンを示すフローチャート図である。FIG. 9 is a flowchart showing a subroutine of the process in step S4 shown in FIG. 図１０は、図８に示すステップＳ５の処理のサブルーチンを示すフローチャート図である。FIG. 10 is a flowchart showing a subroutine of the process in step S5 shown in FIG. 図１１は、図１０に示すステップＳ２３の処理のサブルーチンを示すフローチャート図である。FIG. 11 is a flowchart showing a subroutine of the process in step S23 shown in FIG. 図１２は、図１０に示すステップＳ２４の処理のサブルーチンを示すフローチャート図である。FIG. 12 is a flowchart showing a subroutine of the process in step S24 shown in FIG. 図１３は、図１０に示すステップＳ２５の処理のサブルーチンを示すフローチャート図である。FIG. 13 is a flowchart showing a subroutine of the process of step S25 shown in FIG. 図１４は、図８に示すステップＳ６の処理のサブルーチンを示すフローチャート図である。FIG. 14 is a flowchart showing a subroutine of the process in step S6 shown in FIG. 図１５は、図８に示すステップＳ８の処理のサブルーチンを示すフローチャート図である。FIG. 15 is a flowchart showing a subroutine of the process in step S8 shown in FIG. 図１６は、他地点ｎにいる出席者，第１の主要発言者，及び第２の主要発言者の表示装置の表示画面例を示す模式図である。FIG. 16 is a schematic diagram illustrating a display screen example of the display device of the attendees at the other point n, the first main speaker, and the second main speaker. 図１７は、主要発言者の変化に伴う表示画面の変化の一例を示す図である。FIG. 17 is a diagram illustrating an example of a change in the display screen accompanying a change in the main speaker.

以下、図面を参照して、本発明の実施形態となるビデオ会議システムの構成及びその動作について説明する。 Hereinafter, the configuration and operation of a video conference system according to an embodiment of the present invention will be described with reference to the drawings.

〔ビデオ会議システムの構成〕
始めに、図１を参照して、本発明の実施形態となるビデオ会議システムの構成について説明する。 [Configuration of video conferencing system]
First, the configuration of the video conference system according to the embodiment of the present invention will be described with reference to FIG.

本発明の実施形態となるビデオ会議システム１は、図１に示すように、ビデオ会議の出席者がいる地点毎に設けられたビデオ会議端末装置２と、ビデオ会議端末装置２の動作を制御するサーバ装置３とを備え、ビデオ会議端末装置２とサーバ装置３は、電気通信回線４を介して相互に情報通信可能なように構成されている。 As shown in FIG. 1, the video conference system 1 according to the embodiment of the present invention controls the video conference terminal device 2 provided at each point where the attendee of the video conference is present and the operation of the video conference terminal device 2. The video conference terminal device 2 and the server device 3 are configured to be capable of communicating information with each other via the telecommunication line 4.

〔ビデオ会議端末装置の構成〕
ビデオ会議端末装置２は、表示装置１１と撮像装置１２を備える。表示装置１１は、液晶ディスプレイ装置やCRT(Cathode Ray Tube)装置等の公知の表示装置により構成され、図２に示すように、左画面領域３１，右画面領域３２，下画面領域３３，及び発言率表示領域３４，３５を有する。左画面領域３１，右画面領域３２，及び下画面領域３３は、サーバ装置３から送信されたビデオ会議の出席者の映像を表示する。本実施形態では、左画面領域３２と右画面領域３３は、図３に示すように、それぞれの中心位置から表示画面の中心位置までの距離（図３に示す距離ａ）が同じになる位置に配置されている。 [Configuration of video conference terminal]
The video conference terminal device 2 includes a display device 11 and an imaging device 12. The display device 11 is configured by a known display device such as a liquid crystal display device or a CRT (Cathode Ray Tube) device, and as shown in FIG. 2, a left screen region 31, a right screen region 32, a lower screen region 33, and a statement Rate display areas 34 and 35 are provided. The left screen area 31, the right screen area 32, and the lower screen area 33 display images of attendees of the video conference transmitted from the server device 3. In the present embodiment, as shown in FIG. 3, the left screen area 32 and the right screen area 33 are located at the same distance from the center position to the center position of the display screen (distance a shown in FIG. 3). Has been placed.

発言率表示領域３４及び発言率表示領域３５はそれぞれ、左画面領域３１及び右画面領域３２に表示されている出席者の発言時間のビデオ会議時間中に占める割合（発言率）を表示する。本実施形態では、発言率表示領域３４，３５に表示されている黒色のバーの長さが出席者の発言率を示す。すなわち本実施形態では、黒色のバーが発言率表示領域３４，３５に表示されていない場合、出席者の発言率は０％となり、黒色のバーが発言率表示領域３４，３５の左端から右端まで表示されている場合には、出席者の発言率は１００％となる。 The speech rate display area 34 and the speech rate display area 35 display the ratio (speech rate) of the attendee's speech time displayed in the left screen region 31 and the right screen region 32 during the video conference time, respectively. In this embodiment, the length of the black bar displayed in the speech rate display areas 34 and 35 indicates the speech rate of the attendee. That is, in this embodiment, when the black bar is not displayed in the speech rate display areas 34 and 35, the attendance rate of the attendee is 0%, and the black bar extends from the left end to the right end of the speech rate display areas 34 and 35. If displayed, the attendance rate for attendees is 100%.

撮像装置１２は、表示装置１１に対面している出席者（操作者）の映像を撮影する左カメラ１２ａと右カメラ１２ｂを有する。左カメラ１２ａは、図３に示すように、操作者が左画面領域３１に表示されている出席者の映像を見ている時の視線方向と撮像方向がなす角度がθとなる位置に配置され、操作者の正面近くの映像を角度θで撮影する。一方、右カメラ１２ｂは、操作者が左画面領域３１に表示されている出席者の映像を見ている時の視線方向と撮像方向がなす角度がβとなる位置に配置され、左画面領域３１を見ている操作者の横顔の映像を角度βで撮影する。 The imaging device 12 includes a left camera 12a and a right camera 12b that capture images of attendees (operators) facing the display device 11. As shown in FIG. 3, the left camera 12 a is arranged at a position where the angle formed by the line-of-sight direction and the imaging direction when the operator is viewing the attendee's image displayed in the left screen area 31 is θ. Then, an image near the front of the operator is taken at an angle θ. On the other hand, the right camera 12b is arranged at a position where the angle formed by the line-of-sight direction and the imaging direction when the operator is viewing the attendee's video displayed in the left screen area 31 is β. An image of the profile of the operator who is watching is taken at an angle β.

左画面領域３１又は右画面領域３２に表示されている出席者の視線と操作者の視線を一致させるためには、角度θが可能な限り小さくなるように左カメラ１２ａ及び右ガメラ１２ｂを配置することが望ましい。また操作者の横顔を正しく撮影するためには、角度βが６０〜９０°の範囲内に収まるように左カメラ１２ａ及び右カメラ１２ｂを配置することが望ましい。但し、操作者の正面映像と横顔映像を区別しやすくするためには、角度βが角度θの少なくとも２倍以上であることが望ましい。 In order to match the line of sight of the attendee displayed in the left screen area 31 or the right screen area 32 with the line of sight of the operator, the left camera 12a and the right gamer 12b are arranged so that the angle θ is as small as possible. It is desirable. In order to correctly photograph the operator's profile, it is desirable to arrange the left camera 12a and the right camera 12b so that the angle β falls within the range of 60 to 90 °. However, in order to make it easy to distinguish the front image and the profile image of the operator, it is desirable that the angle β is at least twice the angle θ.

〔サーバ装置の構成〕
サーバ装置３は、ワークステーション等の公知の情報処理装置により構成され、主要発言者判定部２１と映像表示制御部２２を備える。主要発言者判定部２１と映像表示制御部２２の機能は、情報処理装置内部のCPU(Central Processing Unit)が記憶媒体からビデオ会議プログラムを読み出して実行することにより、サーバ装置３上で実現されるようになっている。 [Configuration of server device]
The server device 3 is configured by a known information processing device such as a workstation, and includes a main speaker determination unit 21 and a video display control unit 22. The functions of the main speaker determination unit 21 and the video display control unit 22 are realized on the server device 3 by a CPU (Central Processing Unit) inside the information processing device reading out and executing a video conference program from a storage medium. It is like that.

上記ビデオ会議プログラムは、インストール可能な形式又は実行可能な形式のファイルでCD-ROM，フレキシブルディスク（FD），CD-R，DVD(Digital Versatile Disk)等のコンピュータで読み取り可能な記録媒体に記録されて提供される。またビデオ会議プログラムをインターネット等のネットワークに接続されたコンピュータ上に格納し、ネットワーク経由でダウンロードさせることにより提供するように構成してもよい。またビデオ会議プログラムをインターネット等のネットワーク経由で提供又は配布するように構成してもよい。またビデオ会議プログラムをROM等に予め組み込んで提供するように構成してもよい。 The video conferencing program is a file in an installable or executable format and is recorded on a computer-readable recording medium such as a CD-ROM, flexible disk (FD), CD-R, DVD (Digital Versatile Disk). Provided. Further, the video conference program may be stored on a computer connected to a network such as the Internet and provided by being downloaded via the network. Further, the video conference program may be provided or distributed via a network such as the Internet. Further, the video conference program may be provided by being incorporated in advance in a ROM or the like.

主要発言者判定部２１は、図４に示すように、音量検出部４１，音量履歴記憶部４２，主要発言書特定部４３，及び発言時間計算部４４を含む。音量検出部４１は、ビデオ会議の出席者がいる各地点のビデオ会議端末装置２から出席者の映像を受信し、受信した映像情報に基づいて各出席者の発話音量を検出する。音量履歴記憶部４２は、音量検出部４１により検出された各出席者の発話音量を単位時間毎に記憶する。主要発言者特定部４３は、音量履歴記憶部４２に記憶されている各出席者の単位時間毎の発話音量に基づいて所定時間内の平均発話音量（例えば最近５〜１０秒間の平均発話音量）を算出し、算出された平均発話音量が最も大きい出席者を主要発言者として特定する。発言時間計算部４４は、音量履歴記憶部４２に記憶されている各出席者の単位時間毎の発話音量に基づいて各出席者の発言率を算出する。 As shown in FIG. 4, the main speaker determination unit 21 includes a volume detection unit 41, a volume history storage unit 42, a main message book specification unit 43, and a speech time calculation unit 44. The volume detection unit 41 receives the attendee's video from the video conference terminal device 2 at each point where the video conference attendee is present, and detects the speech volume of each attendee based on the received video information. The volume history storage unit 42 stores the speech volume of each attendee detected by the volume detection unit 41 for each unit time. The main speaker specifying unit 43 determines the average utterance volume within a predetermined time based on the utterance volume per unit time of each attendee stored in the volume history storage unit 42 (for example, the average utterance volume for the last 5 to 10 seconds). And the attendee with the highest calculated average utterance volume is identified as the main speaker. The speech time calculation unit 44 calculates the speech rate of each attendee based on the speech volume per unit time of each attendee stored in the volume history storage unit 42.

映像表示制御部２２は、主要発言者判定部２１（主要発言者特定部４３及び発言時間計算部４４）の処理結果に基づいて、各地点の表示装置１１に表示する映像や情報を制御する。詳しくは後述するが、地点Ｐ１〜Ｐ４にいる出席者間でビデオ会議を行っている場合において、地点Ｐ１の出席者が地点Ｐ２の出席者と会話をしている時には、映像表示制御部２２は、例えば図５に示すように、地点Ｐ１の表示装置１１の左画面領域３１及び右画面領域３２にそれぞれ地点Ｐ２及び地点Ｐ３の出席者の映像を表示する。 The video display control unit 22 controls the video and information displayed on the display device 11 at each point based on the processing result of the main speaker determination unit 21 (the main speaker specifying unit 43 and the speech time calculation unit 44). As will be described in detail later, when a video conference is being held between attendees at the points P1 to P4, when the attendant at the point P1 has a conversation with the attendant at the point P2, the video display control unit 22 For example, as shown in FIG. 5, the images of the attendees at the points P2 and P3 are displayed on the left screen region 31 and the right screen region 32 of the display device 11 at the point P1, respectively.

出席者が右画面領域３２を見ている場合、右カメラ１２ｂは図６（ａ）に示すような出席者が小さく左を向いている映像を撮影し、左カメラ１２ａは図６（ｂ）に示すような出席者が大きく左を向いている映像を撮影する。一方、出席者が左画面領域３１を見ている場合には、右カメラ１２ｂは図６（ｃ）に示すような出席者が大きく右を向いている映像を撮影し、左カメラ１２ａは図６（ｄ）に示すような出席者が小さく右を向いている映像を撮影する。 When the attendee is looking at the right screen area 32, the right camera 12b captures a video in which the attendee is small and facing the left as shown in FIG. 6A, and the left camera 12a is shown in FIG. 6B. Take a picture of the attendee as shown, pointing to the left. On the other hand, when the attendee is looking at the left screen area 31, the right camera 12b captures an image in which the attendee is greatly facing right as shown in FIG. 6C, and the left camera 12a is shown in FIG. Shoot an image in which the attendee is small and facing right as shown in (d).

従って、地点Ｐ１の出席者が地点Ｐ２の出席者と会話をしている場合、映像表示制御部２２は、図７に示すように、地点Ｐ１の表示装置１１の左画面領域３１及び右画面領域３２にはそれぞれ地点Ｐ２の出席者が小さく右を向いている映像及び地点Ｐ３の出席者が大きく左を向いている映像を表示する。また映像表示制御部２２は、地点Ｐ２の表示装置１１の左画面領域３１及び右画面領域３２にはそれぞれ地点Ｐ４の出席者が大きく右を向いている映像及び地点Ｐ１の出席者が小さく左を向いている映像を表示する。また映像表示制御部２２は、地点Ｐ３及び地点Ｐ４の表示装置１１の左画面領域３１及び右画面領域３２にはそれぞれ地点Ｐ１の出席者が大きく右を向いている映像及び地点Ｐ２の出席者が大きく左を向いている映像を表示する。 Therefore, when the attendee at the point P1 is talking to the attendee at the point P2, the video display control unit 22 performs the left screen region 31 and the right screen region of the display device 11 at the point P1, as shown in FIG. In 32, an image in which the attendee at the point P2 is small and facing right and an image in which the attendee at the point P3 is facing large and left are displayed. In addition, the video display control unit 22 displays the video in which the attendee at the point P4 is greatly facing the right and the attendant at the point P1 is small in the left screen region 31 and the right screen region 32 of the display device 11 at the point P2. Display the video you are facing. In addition, the video display control unit 22 includes a video in which the attendee at the point P1 is greatly facing the right and the attendant at the point P2 in the left screen region 31 and the right screen region 32 of the display device 11 at the point P3 and the point P4, respectively. Display a video that is facing large left.

〔映像表示制御処理〕
このようなビデオ会議システム１では、サーバ装置３が以下に示す映像表示制御処理を実行することにより、ビデオ会議を安価に行うことを可能にする。以下、図８に示すフローチャートを参照して、この映像表示制御処理を実行する際のサーバ装置３の動作について説明する。 [Video display control processing]
In such a video conference system 1, the server device 3 can perform a video conference at a low cost by executing the following video display control process. The operation of the server device 3 when executing this video display control process will be described below with reference to the flowchart shown in FIG.

図８に示すフローチャートは、サーバ装置３に対しビデオ会議の開始が指示されたタイミングで開始となり、映像表示制御処理はステップＳ１の処理に進む。 The flowchart shown in FIG. 8 starts when the server apparatus 3 is instructed to start a video conference, and the video display control process proceeds to step S1.

ステップＳ１の処理では、映像表示制御部２２が、映像表示制御処理の初期設定を実行する。具体的には、映像表示制御部２２は、ビデオ会議の主催者によって入力されたビデオ会議の出席者の主要順序に基づいて、主要順序が１番目の出席者を「第１の主要発言者」，主要順序が２番目の出席者を「第２の主要発言者」，主要順序が３番目の出席者がいる地点を「デフォルト地点」に設定する。 In the process of step S1, the video display control unit 22 executes initial setting of the video display control process. Specifically, the video display control unit 22 designates the attendee whose primary order is the first as the “first primary speaker” based on the primary order of the attendees of the video conference input by the video conference organizer. , The second participant in the main order is set as “second main speaker”, and the point where the third participant in the main order is set as “default point”.

映像表示制御部２２は、「第１の主要発言者」の表示装置１１の左画面領域３１及び右画面領域３２にそれぞれ右カメラ１２ｂにより撮影された「第２の主要発言者」の映像及び左カメラ１２ａにより撮影された「デフォルト地点」にいる出席者の映像を表示するように「第１の主要発言者」の表示装置１１の表示画面を設定する。映像表示制御部２２は、「第２の主要発言者」の表示装置１１の左画面領域３１及び右画面領域３２にそれぞれ右カメラ１２ｂにより撮影された「デフォルト地点」にいる出席者の映像及び左カメラ１２ａにより撮影された「第１の主要発言者」の映像を表示するように「第２の主要発言者」の表示装置１１の表示画面を設定する。 The video display control unit 22 displays the video of the “second main speaker” and the left captured by the right camera 12b in the left screen region 31 and the right screen region 32 of the display device 11 of “first main speaker”, respectively. The display screen of the display device 11 of the “first main speaker” is set so as to display the video of the attendee at the “default location” photographed by the camera 12a. The video display control unit 22 displays the video of the attendee at the “default location” taken by the right camera 12b in the left screen area 31 and the right screen area 32 of the display device 11 of “second main speaker” and the left. The display screen of the display device 11 of the “second main speaker” is set so as to display the video of the “first main speaker” taken by the camera 12a.

映像表示制御部２２は、「第１の主要発言者」と「第２の主要発言者」以外の出席者（「デフォルト地点」を含む「他地点」にいる出席者）の表示装置１１の左画面領域３１及び右画面領域３２にそれぞれ右カメラ１２ｂにより撮影された「第１の主要発言者」の映像及び左カメラ１２ａにより撮影された「第２の主要発言者」の映像を表示するように「他地点」の表示装置１１の表示画面を設定する。そして映像表示制御部２２は、各出席者について設定した情報、すなわち「第１の主要発言者」，「第２の主要発言者」，「デフォルト地点」，及び「他地点」の種別を示す分類情報と、表示装置１１の左画面領域３１及び右画面領域３２に表示する映像を示す表示情報を各出席者の属性情報として記憶する。これにより、ステップＳ１の処理は完了し、映像表示制御処理はステップＳ２の処理に進む。 The video display control unit 22 displays the left side of the display device 11 of attendees other than “first main speaker” and “second main speaker” (participants in “other locations” including “default location”). An image of “first main speaker” captured by the right camera 12b and an image of “second main speaker” captured by the left camera 12a are displayed in the screen area 31 and the right screen area 32, respectively. The display screen of the “other point” display device 11 is set. The video display control unit 22 then classifies the information set for each attendee, that is, the types of “first main speaker”, “second main speaker”, “default location”, and “other locations”. Information and display information indicating video to be displayed on the left screen area 31 and the right screen area 32 of the display device 11 are stored as attribute information of each attendee. Thereby, the process of step S1 is completed and the video display control process proceeds to the process of step S2.

ステップＳ２の処理では、映像表示制御部２２が、主要発言者判定部２１から主要発言者の情報を取得する。これにより、ステップＳ２の処理は完了し、映像表示制御処理はステップＳ３の処理に進む。 In the process of step S 2, the video display control unit 22 acquires information on the main speaker from the main speaker determination unit 21. Thereby, the process of step S2 is completed and the video display control process proceeds to the process of step S3.

ステップＳ３の処理では、映像表示制御部２２が、出席者の属性情報とステップＳ２の処理により取得した主要発言者の情報とを比較することにより、主要発言者が変化したか否かを判別する。判別の結果、主要発言者が変化していない場合、映像表示制御部２２は映像表示制御処理をステップＳ７の処理に進める。一方、主要発言者が変化した場合には、映像表示制御部２２は映像表示制御処理をステップＳ４の処理に進める。 In the process of step S3, the video display control unit 22 determines whether or not the main speaker has changed by comparing the attendee's attribute information with the information of the main speaker acquired by the process of step S2. . If the main speaker is not changed as a result of the determination, the video display control unit 22 advances the video display control process to the process of step S7. On the other hand, when the main speaker changes, the video display control unit 22 advances the video display control process to the process of step S4.

ステップＳ４の処理では、映像表示制御部２２が、新しい主要発言者（「第１の主要発言者」と「第２の主要発言者」）を特定し、特定結果に基づいて各出席者の属性情報を更新する（主要発言者特定処理）。この主要発言者特定処理の詳細については、図９に示すフローチャートを参照して後述する。これにより、ステップＳ４の処理は完了し、映像表示制御処理はステップＳ５の処理に進む。 In the process of step S4, the video display control unit 22 specifies new main speakers (“first main speaker” and “second main speaker”), and attributes of each attendee based on the specified result. Update information (main speaker identification process). Details of the main speaker specifying process will be described later with reference to a flowchart shown in FIG. Thereby, the process of step S4 is completed, and the video display control process proceeds to the process of step S5.

ステップＳ５の処理では、映像表示制御部２２が、ステップＳ４の処理により更新された各出席者の属性情報に従って、各出席者の表示装置１１の左画面領域３１及び右画面領域３２に映像を表示する出席者を決定する（映像表示地点決定処理）。この映像表示地点決定処理の詳細については、図１０乃至図１３に示すフローチャートを参照して後述する。これにより、ステップＳ５の処理は完了し、映像表示制御処理はステップＳ６とステップＳ７の処理に進む。 In the process of step S5, the video display control unit 22 displays the video in the left screen area 31 and the right screen area 32 of each attendee's display device 11 in accordance with the attribute information of each attendee updated in the process of step S4. Attendees to be determined (video display point determination processing). Details of this video display point determination processing will be described later with reference to the flowcharts shown in FIGS. Thereby, the process of step S5 is completed, and the video display control process proceeds to the processes of step S6 and step S7.

ステップＳ６の処理では、映像表示制御部２２が、ステップＳ５の処理結果に基づいて、各出席者の表示装置１１の左画面領域３１及び右画面領域３２に表示する出席者の映像（左カメラ１２ａ又は右カメラ１２ｂにより撮影された映像）を特定する（映像特定処理）。この映像特定処理の詳細については図１４に示すフローチャートを参照して後述する。これにより、ステップＳ６の処理は完了し、映像表示制御処理はステップＳ８の処理に進む。 In the process of step S6, the video display control unit 22 displays the attendee video (left camera 12a) displayed on the left screen area 31 and the right screen area 32 of the display device 11 of each attendee based on the processing result of step S5. Alternatively, the image captured by the right camera 12b) is identified (image identification process). Details of this video specifying process will be described later with reference to a flowchart shown in FIG. Thereby, the process of step S6 is completed, and the video display control process proceeds to the process of step S8.

ステップＳ７の処理では、映像表示制御部２２が、主要発言者判定部２１から「第１の主要発言者」，「第２の主要発言者」，及び「デフォルト地点」にいる出席者の発言率に関する情報を取得する。これにより、ステップＳ７の処理は完了し、映像表示制御処理はステップＳ８の処理に進む。 In the process of step S7, the video display control unit 22 sends a speech rate of attendees at the “first main speaker”, “second main speaker”, and “default location” from the main speaker determination unit 21. Get information about. Thereby, the process of step S7 is completed, and the video display control process proceeds to the process of step S8.

ステップＳ８の処理では、映像表示制御部２２が、ステップＳ６の処理結果に基づいて各地点の表示装置１１に送信する映像を選択し、ステップＳ７の処理により取得した発言率に関する情報と共に選択された映像を各地点の表示装置１１に送信する（情報送信処理）。この情報送信処理の詳細については、図１５に示すフローチャートを参照して後述する。これにより、ステップＳ８の処理は完了し、映像表示制御処理はステップＳ９の処理に進む。 In the process of step S8, the video display control unit 22 selects the video to be transmitted to the display device 11 at each point based on the process result of step S6, and is selected together with the information regarding the speech rate acquired by the process of step S7. The video is transmitted to the display device 11 at each point (information transmission process). Details of this information transmission processing will be described later with reference to the flowchart shown in FIG. Thereby, the process of step S8 is completed, and the video display control process proceeds to the process of step S9.

ステップＳ９の処理では、映像表示制御部２２が、ビデオ会議の終了指示が入力されたか否かを判別する。判別の結果、ビデオ会議の終了指示が入力されていない場合、映像表示制御部２２は映像表示制御処理をステップＳ２の処理に戻す。一方、ビデオ会議の終了指示が入力された場合には、映像表示制御部２２は一連の映像表示制御処理を終了する。 In the process of step S9, the video display control unit 22 determines whether or not a video conference end instruction has been input. As a result of the determination, when the video conference end instruction is not input, the video display control unit 22 returns the video display control process to the process of step S2. On the other hand, when a video conference end instruction is input, the video display control unit 22 ends the series of video display control processing.

〔主要発言者特定処理〕
次に、図９に示すフローチャートを参照して、上記ステップＳ４の主要発言者特定処理について詳しく説明する。 [Key speaker identification processing]
Next, the main speaker specifying process in step S4 will be described in detail with reference to the flowchart shown in FIG.

図９に示すフローチャートは、ステップＳ３の処理において主要発言者が変化したと判別されたタイミングで開始となり、主要発言者特定処理はステップＳ１１の処理に進む。 The flowchart shown in FIG. 9 starts at the timing when it is determined that the main speaker has changed in the process of step S3, and the main speaker specifying process proceeds to the process of step S11.

ステップＳ１１の処理では、映像表示制御部２２が、各出席者の属性情報を読み出す。これにより、ステップＳ１１の処理は完了し、主要発言者特定処理はステップＳ１２の処理に進む。 In the process of step S11, the video display control unit 22 reads the attribute information of each attendee. Thereby, the process of step S11 is completed, and the main speaker specifying process proceeds to the process of step S12.

ステップＳ１２の処理では、映像表示制御部２２が、属性情報内の分類情報を元分類情報（主要発言者が変化する前の各出席者の属性情報）に置き換える。これにより、ステップＳ１２の処理は完了し、主要発言者特定処理はステップＳ１３の処理に進む。 In the process of step S12, the video display control unit 22 replaces the classification information in the attribute information with the original classification information (attribute information of each attendee before the main speaker changes). Thereby, the process of step S12 is completed, and the main speaker specifying process proceeds to the process of step S13.

ステップＳ１３の処理では、映像表示制御部２２が、元分類情報で「第２の主要発言者」に分類されている出席者の分類情報を「第１の主要発言者」に設定する。これにより、ステップＳ１３の処理は完了し、主要発言者特定処理はステップＳ１４の処理に進む。 In the processing of step S13, the video display control unit 22 sets the classification information of attendees classified as “second main speaker” in the original classification information as “first main speaker”. Thereby, the process of step S13 is completed, and the main speaker specifying process proceeds to the process of step S14.

ステップＳ１４の処理では、映像表示制御部２２が、新しい主要発言者の分類情報を「第２の主要発言者」に設定する。これにより、ステップＳ１４の処理は完了し、主要発言者特定処理はステップＳ１４の処理は完了し、主要発言者特定処理はステップＳ１５の処理に進む。 In the processing of step S14, the video display control unit 22 sets the classification information of the new main speaker as “second main speaker”. Thereby, the process in step S14 is completed, the main speaker specifying process is completed in step S14, and the main speaker specifying process proceeds to the process in step S15.

ステップＳ１５の処理では、映像表示制御部２２が、ステップＳ１３とステップＳ１４の処理により「第１の主要発言者」及び「第２の主要発言者」に設定された出席者と元分類情報において「第１の主要発言者」及び「デフォルト地点」にいる出席者に設定されている出席者を除く出席者の中で、初期設定処理において設定された主要順位が最も高い出席者の分類情報を「デフォルト地点」に設定する。これにより、ステップＳ１５の処理は完了し、主要発言者特定処理はステップＳ１６の処理に進む。 In the process of step S15, the video display control unit 22 sets “first main speaker” and “second main speaker” set in “first main speaker” and the original classification information by the processes of steps S13 and S14. Among the attendees excluding the attendees set as attendees at the “first primary speaker” and “default location”, the classification information of the attendee with the highest priority set in the initial setting process is “ Set to “Default point”. Thereby, the process of step S15 is completed, and the main speaker specifying process proceeds to the process of step S16.

ステップＳ１６の処理では、映像表示制御部２２が、ステップＳ１５の処理により「デフォルト地点」にいる出席者に分類された出席者を含む「第１の主要発言者」と「第２の主要発言者」以外の出席者の分類情報を「他地点」に設定する。これにより、ステップＳ１６の処理は完了し、主要発言者特定処理はステップＳ１７の処理に進む。 In the process of step S16, the video display control unit 22 includes the “first main speaker” and the “second main speaker” including the attendees classified as attendees at the “default location” by the process of step S15. Set the classification information of attendees other than "Other points". Thereby, the process of step S16 is completed, and the main speaker specifying process proceeds to the process of step S17.

ステップＳ１７の処理では、映像表示制御部２２が、ステップＳ１２乃至ステップＳ１６の処理結果に基づいて、各出席者の属性情報を更新する。これにより、ステップＳ１７の処理は完了し、一連の主要発言者特定処理は完了する。 In the process of step S17, the video display control unit 22 updates the attribute information of each attendee based on the process results of steps S12 to S16. Thereby, the process of step S17 is completed and a series of main speaker specific processes are completed.

〔映像表示地点決定処理〕
次に、図１０に示すフローチャートを参照して、上記ステップＳ５の映像表示地点決定処理について詳しく説明する。 [Video display point determination processing]
Next, the video display point determination process in step S5 will be described in detail with reference to the flowchart shown in FIG.

図１０に示すフローチャートは、ステップＳ４の処理が完了したタイミングで開始となり、映像表示地点決定処理はステップＳ２１の処理に進む。 The flowchart shown in FIG. 10 starts at the timing when the process of step S4 is completed, and the video display point determination process proceeds to the process of step S21.

ステップＳ２１の処理では、映像表示制御部２２が、各出席者の属性情報を読み出す。これにより、ステップＳ２１の処理は完了し、映像表示地点決定処理はステップＳ２２の処理に進む。 In step S21, the video display control unit 22 reads the attribute information of each attendee. Thereby, the process of step S21 is completed, and the video display point determination process proceeds to the process of step S22.

ステップＳ２２の処理では、映像表示制御部２２が、ステップＳ２１の処理により読み出された属性情報から各出席者の分類情報，元分類情報、及び表示情報を抽出する。これにより、ステップＳ２２の処理は完了し、映像表示地点決定処理はステップＳ２３の処理に進む。 In the process of step S22, the video display control unit 22 extracts the classification information, the original classification information, and the display information of each attendee from the attribute information read out by the process of step S21. Thereby, the process of step S22 is completed, and the video display point determination process proceeds to the process of step S23.

ステップＳ２３の処理では、映像表示制御部２２が、ステップＳ２２の処理により抽出された分類情報及び表示情報に基づいて、「他地点」の表示装置１１に映像を表示する出席者の地点を決定する（第１表示地点決定処理）。この第１表示地点決定処理の詳細については、図１１に示すフローチャートを参照して後述する。これにより、ステップＳ２３の処理は完了し、映像表示地点決定処理はステップＳ２４の処理に進む。 In the process of step S23, the video display control unit 22 determines the location of the attendee who displays the video on the display device 11 of “other location” based on the classification information and the display information extracted by the process of step S22. (First display point determination process). The details of the first display point determination process will be described later with reference to the flowchart shown in FIG. Thereby, the process of step S23 is completed, and the video display point determination process proceeds to the process of step S24.

ステップＳ２４の処理では、映像表示制御部２２が、ステップＳ２２の処理により抽出された分類情報及び表示情報に基づいて、「第１の主要発言者」の表示装置１１に映像を表示する出席者の地点を決定する（第２表示地点決定処理）。この第２表示地点決定処理の詳細については、図１２に示すフローチャートを参照して後述する。これにより、ステップＳ２４の処理は完了し、映像表示地点決定処理はステップＳ２５の処理に進む。 In the process of step S24, the video display control unit 22 displays the video on the display device 11 of the “first main speaker” based on the classification information and the display information extracted by the process of step S22. A point is determined (second display point determination process). Details of the second display point determination process will be described later with reference to the flowchart shown in FIG. Thereby, the process of step S24 is completed, and the video display point determination process proceeds to the process of step S25.

ステップＳ２５の処理では、映像表示制御部２２が、ステップＳ２２の処理により抽出された分類情報及び表示情報に基づいて、「第２の主要発言者」の表示装置１１に映像を表示する出席者の地点を決定する（第３表示地点決定処理）。この第３表示地点決定処理の詳細については、図１３に示すフローチャートを参照して後述する。これにより、ステップＳ２５の処理は完了し、一連の映像表示地点決定処理は完了する。
〔第１表示地点決定処理〕
次に、図１１に示すフローチャートを参照して、上記ステップＳ２３の第１表示地点決定処理について詳しく説明する。 In the process of step S25, the video display control unit 22 displays the video on the display device 11 of the “second main speaker” based on the classification information and the display information extracted by the process of step S22. A point is determined (third display point determination process). The details of the third display point determination process will be described later with reference to the flowchart shown in FIG. Thereby, the process of step S25 is completed and a series of video display point determination processes are completed.
[First display point determination process]
Next, the first display spot determination process in step S23 will be described in detail with reference to the flowchart shown in FIG.

図１１に示すフローチャートは、ステップＳ２２の処理が完了したタイミングで開始となり、第１表示地点決定処理はステップＳ３１の処理に進む。 The flowchart shown in FIG. 11 starts at the timing when the process of step S22 is completed, and the first display point determination process proceeds to the process of step S31.

ステップＳ３１の処理では、映像表示制御部２２が、ステップＳ２２の処理により抽出された表示情報に基づいて、「第１の主要発言者」の映像が表示されている画面領域（左画面領域３１又は右画面領域３２）を検出する。これにより、ステップＳ３１の処理は完了し、第１表示地点決定処理はステップＳ３２の処理に進む。 In the process of step S31, the video display control unit 22 displays the screen area (the left screen area 31 or the video image of the “first main speaker”) based on the display information extracted by the process of step S22. The right screen area 32) is detected. Thereby, the process of step S31 is completed and a 1st display point determination process progresses to the process of step S32.

ステップＳ３２の処理では、映像表示制御部２２が、ステップＳ３１の処理により検出された画面領域とは反対の画面領域に「第２の主要発言者」の映像を表示するように表示画面を設定する。これにより、ステップＳ３２の処理は完了し、第１表示地点決定処理はステップＳ３３の処理に進む。 In the process of step S32, the video display control unit 22 sets the display screen so that the video of “second main speaker” is displayed in the screen area opposite to the screen area detected by the process of step S31. . Thereby, the process of step S32 is completed and a 1st display point determination process progresses to the process of step S33.

ステップＳ３３の処理では、映像表示制御部２２が、ステップＳ３２の処理により「第２の主要発言者」の映像を表示する画面と設定された画面領域とは反対の画面領域に「第１の主要発言者」の映像を表示するように表示画面を設定する。これにより、ステップＳ３３の処理は完了し、第１表示地点決定処理はステップＳ３４の処理に進む。 In the process of step S33, the video display control unit 22 displays “first main speaker” in a screen area opposite to the screen area that is set to the screen that displays the video of “second main speaker” in the process of step S32. Set the display screen to display the "Speaker" video. Thereby, the process of step S33 is completed, and the first display point determination process proceeds to the process of step S34.

ステップＳ３４の処理では、映像表示制御部２２が、ステップＳ３２，Ｓ３３の処理結果に基づいて、「他地点」にいる出席者として分類されている出席者の表示情報（属性情報）を更新する。これにより、ステップＳ３４の処理は完了し、一連の第１表示地点決定処理は終了する。
〔第２表示地点決定処理〕
次に、図１２に示すフローチャートを参照して、上記ステップＳ２４の第２表示地点決定処理について詳しく説明する。 In the process of step S34, the video display control unit 22 updates the display information (attribute information) of attendees classified as attendees at “other locations” based on the processing results of steps S32 and S33. Thereby, the process of step S34 is completed and a series of 1st display point determination processes are complete | finished.
[Second display point determination process]
Next, the second display spot determination process in step S24 will be described in detail with reference to the flowchart shown in FIG.

図１２に示すフローチャートは、ステップＳ２３の処理が完了したタイミングで開始となり、第２表示地点決定処理はステップＳ４１の処理に進む。 The flowchart shown in FIG. 12 starts at the timing when the process of step S23 is completed, and the second display point determination process proceeds to the process of step S41.

ステップＳ４１の処理では、映像表示制御部２２が、ステップＳ２２の処理により抽出された表示情報に基づいて、「第２の主要発言者」の映像が表示されている画面領域（左画面領域３１又は右画面領域３２）を検出する。これにより、ステップＳ４１の処理は完了し、第２表示地点決定処理はステップＳ４２の処理に進む。 In the process of step S41, the video display control unit 22 displays the screen area (the left screen area 31 or the video image of the “second main speaker”) based on the display information extracted by the process of step S22. The right screen area 32) is detected. Thereby, the process of step S41 is completed and the second display point determination process proceeds to the process of step S42.

ステップＳ４２の処理では、映像表示制御部２２が、ステップＳ４１の処理により検出された画面領域とは反対の画面領域に「第２の主要発言者」の映像を表示するように表示画面を設定する。これにより、ステップＳ４２の処理は完了し、第２表示地点決定処理はステップＳ４３の処理に進む。 In the process of step S42, the video display control unit 22 sets the display screen to display the video of “second main speaker” in the screen area opposite to the screen area detected by the process of step S41. . Thereby, the process of step S42 is completed and a 2nd display point determination process progresses to the process of step S43.

ステップＳ４３の処理では、映像表示制御部２２が、ステップＳ４２の処理により「第２の主要発言者」の映像を表示する画面と設定された画面領域とは反対の画面領域に「デフォルト地点」にいる出席者の映像を表示するように表示画面を設定する。これにより、ステップＳ４３の処理は完了し、第２表示地点決定処理はステップＳ４４の処理に進む。 In the process of step S43, the video display control unit 22 sets the screen that displays the video of “second main speaker” in the process of step S42 to the “default location” in the screen area opposite to the set screen area. Set the display screen to display the attendee's video. Thereby, the process of step S43 is completed, and the second display point determination process proceeds to the process of step S44.

ステップＳ４４の処理では、映像表示制御部２２が、ステップＳ４２，Ｓ４３の処理結果に基づいて、「第１の主要発言者」に分類されている出席者の表示情報（属性情報）を更新する。これにより、ステップＳ４４の処理は完了し、一連の第２表示地点決定処理は終了する。
〔第３表示地点決定処理〕
次に、図１３に示すフローチャートを参照して、上記ステップＳ２５の第３表示地点決定処理について詳しく説明する。 In the process of step S44, the video display control unit 22 updates the display information (attribute information) of attendees classified as “first main speaker” based on the processing results of steps S42 and S43. Thereby, the process of step S44 is completed and a series of 2nd display point determination processes are complete | finished.
[Third display point determination process]
Next, the third display point determination process in step S25 will be described in detail with reference to the flowchart shown in FIG.

図１３に示すフローチャートは、ステップＳ２４の処理が完了したタイミングで開始となり、第３表示地点決定処理はステップＳ５１の処理に進む。 The flowchart shown in FIG. 13 starts at the timing when the process of step S24 is completed, and the third display point determination process proceeds to the process of step S51.

ステップＳ５１の処理では、映像表示制御部２２が、ステップＳ２２の処理により抽出された表示情報に基づいて、「第１の主要発言者」の映像が表示されている画面領域（左画面領域３１又は右画面領域３２）を検出する。これにより、ステップＳ５１の処理は完了し、第３表示地点決定処理はステップＳ５２の処理に進む。 In the process of step S51, the video display control unit 22 displays the screen area (the left screen area 31 or the video image of the “first main speaker”) based on the display information extracted by the process of step S22. The right screen area 32) is detected. Thereby, the process of step S51 is completed, and the third display point determination process proceeds to the process of step S52.

ステップＳ５２の処理では、映像表示制御部２２が、ステップＳ５１の処理により検出された画面領域とは反対の画面領域に「第１の主要発言者」の映像を表示するように表示画面を設定する。これにより、ステップＳ５２の処理は完了し、第３表示地点決定処理はステップＳ５３の処理に進む。 In the process of step S52, the video display control unit 22 sets the display screen so that the video of “first main speaker” is displayed in the screen area opposite to the screen area detected by the process of step S51. . Thereby, the process of step S52 is completed, and the third display point determination process proceeds to the process of step S53.

ステップＳ５３の処理では、映像表示制御部２２が、ステップＳ５２の処理により「第１の主要発言者」の映像を表示する画面と設定された画面領域に対して反対の画面領域に「デフォルト地点」にいる出席者の映像を表示するように表示画面を設定する。これにより、ステップＳ５３の処理は完了し、第３表示地点決定処理はステップＳ５４の処理に進む。 In the process of step S53, the video display control unit 22 sets “default point” in the screen area opposite to the screen area set as the screen displaying the video of “first main speaker” by the process of step S52. Set the display screen to display the video of attendees in Thereby, the process of step S53 is completed, and the third display point determination process proceeds to the process of step S54.

ステップＳ５４の処理では、映像表示制御部２２が、ステップＳ５２，Ｓ５３の処理結果に基づいて、「第２の主要発言者」に分類されている出席者の表示情報（属性情報）を更新する。これにより、ステップＳ５４の処理は完了し、一連の第３表示地点決定処理は終了する。
〔映像特定処理〕
次に、図１４に示すフローチャートを参照して、上記ステップＳ６の映像特定処理について詳しく説明する。 In the process of step S54, the video display control unit 22 updates the display information (attribute information) of attendees classified as “second main speaker” based on the processing results of steps S52 and S53. Thereby, the process of step S54 is completed and a series of 3rd display point determination processes are complete | finished.
[Video specific processing]
Next, the video specifying process in step S6 will be described in detail with reference to the flowchart shown in FIG.

図１４に示すフローチャートは、ステップＳ５の処理が完了したタイミングで開始となり、映像表示地点決定処理はステップＳ６１の処理に進む。 The flowchart shown in FIG. 14 starts at the timing when the process of step S5 is completed, and the video display point determination process proceeds to the process of step S61.

ステップＳ６１の処理では、映像表示制御部２２が、各出席者の属性情報を読み出す。これにより、ステップＳ６１の処理は完了し、映像特定処理はステップＳ６１の処理に進む。 In the process of step S61, the video display control unit 22 reads the attribute information of each attendee. Thereby, the process of step S61 is completed, and the video specifying process proceeds to the process of step S61.

ステップＳ６２の処理では、映像表示制御部２２が、ステップＳ６１の処理により読み出された属性情報から各出席者の分類情報，元分類情報、及び表示情報を抽出する。これにより、ステップＳ６２の処理は完了し、映像特定処理はステップＳ６３の処理に進む。 In the process of step S62, the video display control unit 22 extracts the classification information, the original classification information, and the display information of each attendee from the attribute information read out in the process of step S61. Thereby, the process of step S62 is completed, and the video specifying process proceeds to the process of step S63.

ステップＳ６３の処理では、映像表示制御部２２が、「第２の主要発言者」の映像が表示される「他地点」の表示装置１１の画面領域と同じ側にある撮像装置により撮影された「第２の主要発言者」の映像を、「第１の主要発言者」の表示装置１１に表示する「第２の主要発言者」の映像に設定する。これにより、ステップＳ６３の処理は完了し、映像特定処理はステップＳ６４の処理に進む。 In the process of step S 63, the video display control unit 22 is photographed by the imaging device on the same side as the screen area of the display device 11 of “other point” on which the video of “second main speaker” is displayed. The video of “second main speaker” is set to the video of “second main speaker” displayed on display device 11 of “first main speaker”. Thereby, the process of step S63 is completed, and the video specifying process proceeds to the process of step S64.

ステップＳ６４の処理では、映像表示制御部２２が、「第１の主要発言者」の映像が表示される「他地点」の表示装置１１の画面領域と同じ側にある撮像装置により撮影された「第１の主要発言者」の映像を、「第２の主要発言者」の表示装置１１に表示する「第１の主要発言者」の映像に設定する。これにより、ステップＳ６４の処理は完了し、映像特定処理はステップＳ６５の処理に進む。 In the process of step S 64, the video display control unit 22 is photographed by the imaging device on the same side as the screen area of the display device 11 of “other point” on which the video of “first main speaker” is displayed. The video of “first main speaker” is set to the video of “first main speaker” displayed on display device 11 of “second main speaker”. Thereby, the process of step S64 is completed and the video specifying process proceeds to the process of step S65.

ステップＳ６５の処理では、映像表示制御部２２が、「第２の主要発言者」の映像が表示される「他地点」の表示装置１１の画面領域と同じ側にある撮像装置により撮影された「第１の主要発言者」の映像を、「他地点」の表示装置１１に表示する「第１の主要発言者」の映像に設定する。これにより、ステップＳ６５の処理は完了し、映像特定処理はステップＳ６６の処理に進む。 In the process of step S65, the video display control unit 22 is photographed by the imaging device on the same side as the screen area of the display device 11 of “other point” on which the video of “second main speaker” is displayed. The video of “first main speaker” is set to the video of “first main speaker” displayed on display device 11 of “other location”. Thereby, the process of step S65 is completed, and the video specifying process proceeds to the process of step S66.

ステップＳ６６の処理では、映像表示制御部２２が、「第１の主要発言者」の映像が表示される「他地点」の表示装置１１の画面領域と同じ側にある撮像装置により撮影された「第２の主要発言者」の映像を、「他地点」の表示装置１１に表示する「第２の主要発言者」の映像に設定する。これにより、ステップＳ６６の処理は完了し、映像特定処理はステップＳ６７の処理に進む。 In the process of step S 66, the video display control unit 22 is photographed by the imaging device on the same side as the screen area of the display device 11 of “other point” on which the video of “first main speaker” is displayed. The video of “second main speaker” is set to the video of “second main speaker” displayed on display device 11 of “other location”. Thereby, the process of step S66 is completed, and the video specifying process proceeds to the process of step S67.

ステップＳ６７の処理では、映像表示制御部２２が、ステップＳ６２の処理により抽出された分類情報に基づいて、「デフォルト地点」にいる出席者として分類された出席者を検出する。これにより、ステップＳ６７の処理は完了し、映像特定処理はステップＳ６８の処理に進む。 In the process of step S67, the video display control unit 22 detects the attendees classified as attendees at the “default location” based on the classification information extracted by the process of step S62. Thereby, the process of step S67 is completed, and the video specifying process proceeds to the process of step S68.

ステップＳ６８の処理では、映像表示制御部２２が、「第１の主要発言者」の映像が表示される「他地点」の表示装置１１の画面領域と同じ側にある撮像装置により撮影された「デフォルト地点」にいる出席者の映像を、「第１の主要発言者」の表示装置１１に表示する「デフォルト地点」にいる出席者の映像に設定する。これにより、ステップＳ６８の処理は完了し、映像特定処理はステップＳ６９の処理に進む。 In the process of step S 68, the video display control unit 22 is photographed by the imaging device on the same side as the screen area of the display device 11 of “other point” on which the video of “first main speaker” is displayed. The video of the attendee at the “default location” is set to the video of the attendee at the “default location” displayed on the display device 11 of the “first main speaker”. Thereby, the process of step S68 is completed and the video specifying process proceeds to the process of step S69.

ステップＳ６９の処理では、映像表示制御部２２が、「第２の主要発言者」の映像が表示される「他地点」の表示装置１１の画面領域と同じ側にある撮像装置により撮影された「デフォルト地点」にいる出席者の映像を、「第２の主要発言者」の表示装置１１に表示する「デフォルト地点」にいる出席者の映像に設定する。これにより、ステップＳ６８の処理は完了し、映像特定処理はステップＳ６９の処理に進む。 In the process of step S 69, the video display control unit 22 is photographed by the imaging device on the same side as the screen area of the display device 11 of “other point” where the video of “second main speaker” is displayed. The video of the attendee at the “default location” is set as the video of the attendee at the “default location” displayed on the display device 11 of the “second main speaker”. Thereby, the process of step S68 is completed and the video specifying process proceeds to the process of step S69.

ステップＳ７０の処理では、映像表示制御部２２が、「第１の主要発言者」，「第２の主要発言者」，及び「デフォルト地点」にいる出席者以外の出席者の両側にある撮像装置により撮影された映像を、「第１の主要発言者」及び「第２の主要発言者」の表示装置１１の下画面領域３３に表示する出席者の映像に設定する。これにより、ステップＳ７０の処理は完了し、映像特定処理はステップＳ７１の処理に進む。 In the process of step S70, the image display control unit 22 has the imaging devices on both sides of the attendees other than the attendees at the “first main speaker”, the “second main speaker”, and the “default location”. Is set as the attendee's video to be displayed in the lower screen area 33 of the display device 11 of “first main speaker” and “second main speaker”. Thereby, the process of step S70 is completed, and the video specifying process proceeds to the process of step S71.

ステップＳ７１の処理では、映像表示制御部２２が、「第１の主要発言者」，「第２の主要発言者」，及び「他地点ｎ」にいる出席者以外の出席者の両側にある撮像装置により撮影された映像を、「他地点ｎ」の表示装置１１の下画面領域３３に表示する出席者の映像に設定する。これにより、ステップＳ７１の処理は完了し、一連の映像特定処理は終了する。 In the process of step S71, the video display control unit 22 captures images on both sides of attendees other than the attendees at the “first main speaker”, the “second main speaker”, and the “other point n”. The video shot by the device is set as the video of the attendee to be displayed in the lower screen area 33 of the display device 11 of “other point n”. Thereby, the process of step S71 is completed, and a series of video specifying processes ends.

〔情報送信処理〕
最後に、図１５に示すフローチャートを参照して、ステップＳ８の情報送信処理について詳しく説明する。 [Information transmission processing]
Finally, the information transmission process in step S8 will be described in detail with reference to the flowchart shown in FIG.

図１５に示すフローチャートは、ステップＳ６，７の処理が完了したタイミングで開始となり、情報送信処理はステップＳ８１の処理に進む。 The flowchart shown in FIG. 15 starts at the timing when the processes in steps S6 and S7 are completed, and the information transmission process proceeds to the process in step S81.

ステップＳ８１の処理では、映像表示制御部２２が、各出席者の撮像装置１２の左カメラ１２ａ及び右カメラ１２ｂにより撮影された映像を受信する。これにより、ステップＳ８１の処理は完了し、情報送信処理はステップＳ８２の処理に進む。 In the process of step S81, the video display control unit 22 receives videos taken by the left camera 12a and the right camera 12b of the imaging device 12 of each attendee. Thereby, the process of step S81 is completed, and the information transmission process proceeds to the process of step S82.

ステップＳ８２の処理では、映像表示制御部２２が、ステップＳ６の処理により設定された情報とステップＳ７の処理により取得された発言率の情報を取得する。これにより、ステップＳ８２の処理は完了し、情報送信処理はステップＳ８３の処理に進む。 In the process of step S82, the video display control unit 22 acquires the information set by the process of step S6 and the information of the speech rate acquired by the process of step S7. Thereby, the process of step S82 is completed and the information transmission process proceeds to the process of step S83.

ステップＳ８３の処理では、映像表示制御部２２が、ステップＳ８２の処理により取得した情報に基づいて、ステップＳ８１の処理により受信した映像の中から各出席者の表示装置１１に送信する映像を選択する。これにより、ステップＳ８３の処理は完了し、情報送信処理はステップＳ８４の処理に進む。 In the process of step S83, the video display control unit 22 selects a video to be transmitted to each attendee's display device 11 from the video received by the process of step S81, based on the information acquired by the process of step S82. . Thereby, the process of step S83 is completed, and the information transmission process proceeds to the process of step S84.

ステップＳ８４の処理では、映像表示制御部２２が、ステップＳ８３の処理結果に基づいて、左映像領域３１，右映像領域３２，及び下映像領域３３に表示する映像を発言率に関する情報と共に各出席者の表示装置１１に送信する。具体的には、図１６に示すように、他地点ｎの表示装置１１の左画面領域３１及び右画面領域３２にそれぞれ「第１の主要発言者」及び「第２の主要発言者」の映像を表示する場合、映像表示制御部２２は、「第２の主要発言者」（「第１の主要発言者」）の映像が表示されている右画面領域３２（左画面領域３１）と同じ側に設定されている右カメラ１２ｂ（左カメラ１２ａ）により撮影された「第１の主要発言者」（「第２の主要発言者」）の映像を「第２に主要発言者」（第１の主要発言者）の表示装置１１に表示する「第１の主要発言者」（「第２の主要発言者」）の映像として送信する。また他地点ｎが「デフォルト地点」でない場合、映像表示制御部２２は、他地点ｎの左カメラ１２ａ及び右カメラ１２ｂにより撮影された映像を各地点に送信し、他地点ｎが「デフォルト地点」である場合には、「第１の主要発言者」の映像が表示されている左画面領域３１と同じ側に設定されている左カメラ１２ａにより撮影された他地点ｎにいる出席者の映像を「第１の主要発言者」及び「第２の主要発言者」の表示装置１１に表示する「デフォルト地点」にいる出席者の映像として送信する。これにより、ステップＳ８４の処理は完了し、一連の情報送信処理は終了する。 In the process of step S84, the video display control unit 22 displays the video to be displayed in the left video area 31, the right video area 32, and the lower video area 33 together with information on the speech rate based on the processing result of step S83. To the display device 11. Specifically, as shown in FIG. 16, videos of “first main speaker” and “second main speaker” in the left screen region 31 and the right screen region 32 of the display device 11 at another point n, respectively. Is displayed on the same side as the right screen area 32 (left screen area 31) on which the video of “second main speaker” (“first main speaker”) is displayed. The video of the “first main speaker” (“second main speaker”) taken by the right camera 12b (left camera 12a) set to “second main speaker” (first It is transmitted as an image of “first main speaker” (“second main speaker”) displayed on the display device 11 of the main speaker. When the other point n is not the “default point”, the video display control unit 22 transmits the video shot by the left camera 12a and the right camera 12b at the other point n to each point, and the other point n is the “default point”. Is the video of the attendee at another point n taken by the left camera 12a set on the same side as the left screen area 31 on which the video of “first main speaker” is displayed. It is transmitted as an image of the attendee at the “default location” displayed on the display device 11 of “first main speaker” and “second main speaker”. Thereby, the process of step S84 is completed and a series of information transmission processes are completed.

以上の映像表示制御処理をより具体的に説明すると以下のようになる。いま地点Ａ〜Ｆにいる出席者間でビデオ会議を行う場合を考える。初期化処理においてビデオ会議の主催者が出席者の主要順序を地点Ｆ，地点Ｃ，地点Ａ，地点Ｄ，地点Ｂ，地点Ｅの順に設定したとすると、「第１の主要発言者」，「第２の主要発言者」，及び「デフォルト地点」にいる出席者は順に地点Ｆ，地点Ｃ，及び地点Ａにいる出席者となる。従ってこの段階では、図１７に示すように、他地点の表示装置１１の左画面領域３１及び右画面領域３２にはそれぞれ地点Ｃ及び地点Ｆの出席者の映像が表示され、「第１の主要発言者」の表示装置１１の左画面領域３１及び右画面領域３２にはそれぞれ地点Ａ及び地点Ｃの出席者の映像が表示され、「第２の主要発言者」の表示装置１１の左画面領域３１及び右画面領域３２にはそれぞれ地点Ｆ及び地点Ａの出席者の映像が表示される。 The above video display control process will be described more specifically as follows. Consider a video conference between attendees at points A through F. If the organizer of the video conference sets the main order of attendees in the order of point F, point C, point A, point D, point B, and point E in the initialization process, “first main speaker”, “ The attendees at “second main speaker” and “default location” are the attendees at location F, location C, and location A in that order. Accordingly, at this stage, as shown in FIG. 17, the images of the attendees at the points C and F are displayed on the left screen region 31 and the right screen region 32 of the display device 11 at another point, respectively. In the left screen area 31 and the right screen area 32 of the “speaker” display device 11, the images of the attendees at the points A and C are displayed, respectively, and the left screen region of the “second main speaker” display device 11. In 31 and the right screen area 32, images of attendees at the points F and A are displayed, respectively.

次に、主要発言者が図１７に示すように地点Ｂ→地点Ｃ→地点Ｂ→地点Ｄ→地点Ｆ→地点Ａの順に変化したとすると、主要発言者が特定された段階で上述した出席者の分類（「第１の主要発言者」，「第２の主要発言者」，及び「デフォルト地点」にいる出席者）を元分類に置き換えた後、地点Ｃ及び地点Ｂにいる出席者をそれぞれ「第１の主要発言者」及び「第２の主要発言者」に分類する。そして他地点の表示装置１１の左画面領域３１及び右画面領域３２にはそれぞれ地点Ｃ及び地点Ｂの出席者の映像が表示され、「第１の主要発言者」の表示位置は変化させないようにする。また「第１の主要発言者」（地点Ｃにいる出席者）の表示装置１１の左画面領域３１及び右画面領域３２にはそれぞれ「第２の主要発言者」（地点Ｂにいる出席者）及び「デフォルト地点」にいる出席者（地点Ｄにいる出席者）の映像が表示され、「第２の主要発言者」（地点Ｂにいる出席者）の表示装置１１の左画面領域３１及び右画面領域３２にはそれぞれ「デフォルト地点にいる出席者」及び「第１の主要発言者」の映像が表示される。なおこの場合、「デフォルト地点」は、「第１の主要発言者」と「第２の主要発言者」を除いた（望ましくは元分類情報において「第１の主要発言者」及び「デフォルト地点」にいる出席者に分類された出席者をさらに除いた）出席者の中で主要順序が最も高い出席者がいる地点を示す。 Next, assuming that the main speaker changes in the order of point B → point C → point B → point D → point F → point A as shown in FIG. 17, the attendee described above at the stage where the main speaker is specified. After replacing the category of (the first primary speaker, the second primary speaker, and the attendees at the default location) with the original classification, the attendees at location C and location B were each It is classified into “first main speaker” and “second main speaker”. The video images of the attendees at the points C and B are displayed on the left screen region 31 and the right screen region 32 of the display device 11 at other points, respectively, so that the display position of the “first main speaker” is not changed. To do. In addition, in the left screen area 31 and the right screen area 32 of the display device 11 of the “first main speaker” (the attendee at the point C), a “second main speaker” (the attendee at the point B), respectively. And the video of the attendee at the “default location” (the attendee at the location D) is displayed, and the left screen area 31 and the right of the display device 11 of the “second main speaker” (the attendee at the location B) In the screen area 32, images of “attendees at the default location” and “first main speaker” are displayed. In this case, the “default point” excludes “first main speaker” and “second main speaker” (preferably “first main speaker” and “default point” in the original classification information). (Excluding attendees categorized as attendees in) (showing the location of the attendee with the highest primary order among attendees).

次に、主要発言者が地点Ｂにいる出席者から地点Ｃにいる出席者に変化した場合、主要発言者が変化したタイミングで、主要発言者が地点Ｂにいる出席者である時の分類情報を元分類に置き換えた後、地点Ｂ及び地点Ｃにいる出席者をそれぞれ「第１の主要発言者」及び「第２の主要発言者」に分類する。そして他地点の表示装置１１の左画面領域３１及び右画面領域３２にはそれぞれ地点Ｃ及び地点Ｂの出席者の映像が表示される。また「第１の主要発言者」（地点Ｂにいる出席者）の表示装置１１の左画面領域３１及び右画面領域３２にはそれぞれ「デフォルト地点」にいる出席者（地点Ｆにいる出席者）及び「第２の主要発言者」（地点Ｃにいる出席者）の映像が表示され、「第２の主要発言者」（地点Ｃにいる出席者）の表示装置１１の左画面領域３１及び右画面領域３２にはそれぞれ「第１の主要発言者」及び「デフォルト地点にいる出席者」の映像が表示される。以下、同様の処理を繰り返す。 Next, when the main speaker changes from an attendee at point B to an attendee at point C, the classification information when the main speaker is an attendee at point B at the timing when the main speaker changes Is replaced with the original classification, and the attendees at point B and point C are classified as "first main speaker" and "second main speaker", respectively. In the left screen area 31 and the right screen area 32 of the display device 11 at another point, the images of the attendees at the point C and the point B are displayed, respectively. The left screen region 31 and the right screen region 32 of the display device 11 of the “first main speaker” (the attendee at the point B) each have an attendee at the “default point” (the attendee at the point F). And the video of the “second main speaker” (the attendee at the point C) is displayed, and the left screen region 31 and the right of the display device 11 of the “second main speaker” (the attendee at the point C) In the screen area 32, images of “first main speaker” and “attendees at the default location” are displayed. Thereafter, the same processing is repeated.

以上の説明から明らかなように、本発明の実施形態となるビデオ会議システム１では、サーバ装置３の主要発言者判定部２１が、ビデオ会議の主要発言者を判定し、サーバ装置３の映像表示制御部２２が、主要発言者判定部２１の判定結果に基づいて、「第１の主要発言者」と「第２の主要発言者」を特定する。そして映像表示制御部２２は、「第１の主要発言者」と「第２の主要発言者」の映像が左右異なる位置に表示されるように「他地点」の表示装置１１を制御し、「第２の主要発言者」の映像が「他地点」の表示装置１１における「第２の主要発言者」の映像表示位置とは異なる左右位置に表示されるように「第１の主要発言者」がいる地点の表示装置１１を制御し、「第１の主要発言者」の映像が「他地点」の表示装置１１における「第１の主要発言者」の映像表示位置とは異なる左右位置に表示されるように「第２の主要発言者」がいる地点の表示装置１１を制御する。 As is clear from the above description, in the video conference system 1 according to the embodiment of the present invention, the main speaker determination unit 21 of the server device 3 determines the main speaker of the video conference and displays the video on the server device 3. The control unit 22 identifies the “first main speaker” and the “second main speaker” based on the determination result of the main speaker determination unit 21. Then, the video display control unit 22 controls the display device 11 for “other points” so that the videos of the “first main speaker” and the “second main speaker” are displayed at different positions on the left and right. The “first main speaker” is displayed so that the video of the “second main speaker” is displayed at the left and right positions different from the video display position of the “second main speaker” on the display device 11 of “other points”. The display device 11 at the point where the sound is present is controlled, and the video of the “first main speaker” is displayed at the left and right positions different from the video display position of the “first main speaker” on the display device 11 of “other points”. In this manner, the display device 11 at the point where the “second main speaker” is present is controlled.

すなわち、本発明の実施形態となるビデオ会議システム１では、サーバ装置３の映像表示制御部２２が、主要発言者判定部２１により判定された主要発言者に基づいて、各地点の表示装置１１に映像を表示する出席者を決定する。そしてこのような構成によれば、撮像装置の必要設置台数がビデオ会議の出席者数に依存しなくなるので、ビデオ会議を安価に行うことができる。また会議の途中で出席者が増加した場合であっても、増えた出席者分の撮像装置を必ずしも追加する必要がないので、ビデオ会議を円滑に進行することができる。また特殊な広視野角の曲面スクリーン等の特別な装置を用いることなく、主要発言者が向き合っている視線一致映像を他の出席者の表示装置１１に表示させることができる。 That is, in the video conference system 1 according to the embodiment of the present invention, the video display control unit 22 of the server device 3 controls the display device 11 at each point based on the main speaker determined by the main speaker determination unit 21. Decide who attends to view the video. According to such a configuration, since the necessary number of installed image pickup devices does not depend on the number of attendees of the video conference, the video conference can be performed at a low cost. Even if the number of attendees increases during the conference, it is not always necessary to add imaging devices for the increased attendees, so that the video conference can proceed smoothly. Further, it is possible to display the line-of-sight matching video in which the main speaker faces the other attendee's display device 11 without using a special device such as a curved screen with a special wide viewing angle.

また本発明の実施形態となるビデオ会議システム１では、映像表示制御部２２は、主要発言者が変化した場合、「第２の主要発言者」の映像を新たな主要発言者の映像に置き換えるように「他地点」の表示装置１１を制御する。このような構成によれば、主要発言者が変化したとしても、主要発言者が変化する前の「第１の主要発言者」の映像は同じ位置に表示されるので、出席者の表示位置の変化が小さくなり、ビデオ会議を円滑に進行させることができる。 In the video conference system 1 according to the embodiment of the present invention, the video display control unit 22 replaces the video of the “second main speaker” with the video of the new main speaker when the main speaker changes. The “other point” display device 11 is controlled. According to such a configuration, even if the main speaker changes, the video of the “first main speaker” before the main speaker changes is displayed at the same position. The change is reduced and the video conference can proceed smoothly.

また本発明の実施形態となるビデオ会議システム１では、主要発言者判定部２１は、ビデオ会議に出席している出席者の発話音量に基づいて、ビデオ会議の主要発言者を判定するので、主要発言者を正確に判定することができる。また本発明の実施形態となるビデオ会議システム１では、主要発言者判定部２１は、ビデオ会議の会議時間中に占める各出席者の発言時間の割合を各出席者の発言率として算出し、映像表示制御部２２は、表示装置１１に映像を表示する出席者の発言率に関する情報を主要発言者特定部２１から取得し、取得した発言率に関する情報を出席者の映像と共に表示するように表示装置１１を制御するので、出席者の発言を予想及び比較することが可能となり、操作者が適切な行動が行うことが可能となる。 In the video conference system 1 according to the embodiment of the present invention, the main speaker determination unit 21 determines the main speaker of the video conference based on the utterance volume of the attendee attending the video conference. The speaker can be accurately determined. Further, in the video conference system 1 according to the embodiment of the present invention, the main speaker determination unit 21 calculates the ratio of the speaking time of each attendee during the conference time of the video conference as the speaking rate of each attendee, and the video The display control unit 22 acquires information on the speech rate of the attendee who displays the video on the display device 11 from the main speaker specifying unit 21, and displays the acquired information on the speech rate together with the video of the attendee. 11, it is possible to predict and compare the attendees' statements, and the operator can take appropriate actions.

また本発明の実施形態となるビデオ会議システム１では、撮像装置は、左カメラ１２ａ及び右カメラ１２ｂは、表示装置１１に表示されている２人の出席者の映像のうちの一方の映像を見ている時の操作者の視線方向と右カメラ１２ｂの撮像方向がなす角度βが、視線方向と左カメラ１２ａの撮像方向がなす角度θの少なくとも２倍以上、且つ、６０度以上乃至９０度以下の範囲内になる位置に配置されているので、主要発言者が向き合っている視線一致映像を正確に撮影することができる。 In the video conference system 1 according to the embodiment of the present invention, the imaging device is such that the left camera 12a and the right camera 12b view one of the images of the two attendees displayed on the display device 11. The angle β formed by the operator's line-of-sight direction and the imaging direction of the right camera 12b is at least twice the angle θ formed by the line-of-sight direction and the imaging direction of the left camera 12a, and 60 degrees or more and 90 degrees or less. Therefore, it is possible to accurately shoot a line-of-sight image in which the main speaker is facing.

以上、本発明者によってなされた発明を適用した実施の形態について説明したが、この実施の形態による本発明の開示の一部をなす記述及び図面により本発明は限定されることはない。すなわち上記実施の形態に基づいて当業者等によりなされる他の実施の形態、実施例及び運用技術等は全て本発明の範疇に含まれる。 As mentioned above, although embodiment which applied the invention made | formed by this inventor was demonstrated, this invention is not limited with the description and drawing which make a part of indication of this invention by this embodiment. That is, other embodiments, examples, operational techniques, and the like made by those skilled in the art based on the above-described embodiments are all included in the scope of the present invention.

１ビデオ会議システム
２ビデオ会議端末装置
３サーバ装置
４電気通信回線
１１表示装置
１２撮像装置
１２ａ左カメラ
１２ｂ右カメラ
２１主要発言者判定部
２２映像表示制御部
３１左画面領域
３２右画面領域
３３下画面領域
３４，３５発言率表示領域
４１音量検出部
４２音量履歴記憶部
４３主要発言者特定部
４４発言時間計算部 DESCRIPTION OF SYMBOLS 1 Video conference system 2 Video conference terminal device 3 Server apparatus 4 Electric communication line 11 Display apparatus 12 Imaging device 12a Left camera 12b Right camera 21 Main speaker determination part 22 Video display control part 31 Left screen area 32 Right screen area 33 Lower screen Areas 34 and 35 Speech rate display area 41 Volume detection unit 42 Volume history storage unit 43 Main speaker identification unit 44 Speech time calculation unit

特許第３５８７１０６号公報Japanese Patent No. 3587106

Claims

A plurality of video conference terminal devices arranged at each point where the attendees of the video conference are present;
A server device connected to the plurality of video conference terminal devices via a telecommunication line,
Each video conference terminal device
A display device for displaying images of other attendees at least at two different positions on the left and right;
An imaging device for capturing images of attendees facing the display screen of the display device,
The server device
A main speaker determination unit for determining a main speaker of the video conference;
A video display control unit for controlling the video of attendees displayed by the display device,
The video display control unit
Identifying the first and second main speakers based on the determination result of the main speaker determination unit,
Attendees other than the first and second main speakers such that the images of the first main speaker and the second main speaker captured by the imaging device are displayed at different positions on the left and right. Control the display device at the point where
The video of the second main speaker taken at the display device at the point where the attendees other than the first and second main speakers are present is the video of the second main speaker captured by the imaging device. Controlling the display device at the point where the first main speaker is located so that the left and right positions are different from the display position;
The video of the first main speaker taken at the display device at the point where the attendees other than the first and second main speakers are present is the video of the first main speaker captured by the imaging device. A video conference system, wherein a display device at a point where the second main speaker is located is controlled so as to be displayed at a left and right position different from a display position.

The video display control unit determines whether or not the main speaker has changed based on the determination result of the main speaker determination unit. If the main speaker has changed, the video display control unit displays the video of the second main speaker. 2. The video conference system according to claim 1, wherein a display device at a point where attendees other than the first and second main speakers are present is controlled so as to be replaced with a video of a new main speaker.

The said main speaker determination part determines the main speaker of the said video conference based on the utterance volume of the participant who attends the said video conference, The Claim 1 or Claim 2 characterized by the above-mentioned. Video conferencing system.

The main speaker determination unit calculates a speech rate of each attendee during the meeting time of the video conference as a speech rate of each attendee, and the video display control unit displays a video on the display device. The information about the speaking rate of the attending attendee is acquired from the main speaker determining unit, and the display device is controlled to display the acquired information regarding the speaking rate together with the video of the attendee. The video conference system according to claim 3.

The imaging device includes a first imaging device that captures an attendee's video from a first imaging direction and a second imaging device that captures an attendee's video from a second imaging direction. The second imaging device has an angle formed by the operator's line-of-sight direction and the first imaging direction when viewing one of the two attendee images displayed on the display device. 5. The apparatus according to any one of claims 1 to 4, wherein the lens is disposed at a position that is at least twice as large as an angle formed by the line-of-sight direction and the second imaging direction and within a range of 60 degrees to 90 degrees. The video conference system according to any one of the above.

A main speaker determination unit that determines a main speaker of a video conference from among video conference attendees at a plurality of points;
A video display control unit that controls the video of attendees to be displayed on a display device arranged at each point where the attendees of the video conference are present,
The video display control unit
Identifying the first and second main speakers based on the determination result of the main speaker determination unit,
A display device for a point where attendees other than the first and second main speakers are present so that the images of the first main speaker and the second main speaker are displayed at different positions on the left and right. Control
The left and right positions where the video of the second main speaker is different from the video display position of the second main speaker in a display device at a point where attendees other than the first and second main speakers are present The display device of the point where the first main speaker is located,
Left and right positions where the video of the first main speaker is different from the video display position of the first main speaker in a display device at a point where attendees other than the first and second main speakers are present The server device is characterized in that the display device at the point where the second main speaker is located is controlled so as to be displayed on the screen.

A main speaker determination step for determining a video conference main speaker from video conference attendees at a plurality of points;
Causing the computer to execute a video display control step of controlling the video of the attendee to be displayed on a display device arranged at each point where the attendee of the video conference is present;
The video display control step includes:
Identifying first and second primary speakers based on the result of the primary speaker determination step;
A display device for a point where attendees other than the first and second main speakers are present so that the images of the first main speaker and the second main speaker are displayed at different positions on the left and right. Controlling step;
The left and right positions where the video of the second main speaker is different from the video display position of the second main speaker in a display device at a point where attendees other than the first and second main speakers are present Controlling a display device at the point where the first primary speaker is present, as shown in FIG.
Left and right positions where the video of the first main speaker is different from the video display position of the first main speaker in a display device at a point where attendees other than the first and second main speakers are present A video conferencing program comprising: controlling a display device at a point where the second main speaker is located so as to be displayed on the screen.