JPH04237288A

JPH04237288A - Audio signal output method for plural-picture window display

Info

Publication number: JPH04237288A
Application number: JP3005243A
Authority: JP
Inventors: Yuichi Fujino; 雄一藤野; Naofumi Inmaki; 印牧　直文; Kazunori Shimamura; 和典島村
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1991-01-21
Filing date: 1991-01-21
Publication date: 1992-08-25
Anticipated expiration: 2015-04-24
Also published as: JP3036088B2

Abstract

PURPOSE: To display plural participants simultaneously in a video conference system and to make each talker easy to identify. CONSTITUTION: In a video conference system interconnecting points A, B, C, and D, a multi-window system is adopted for the monitor 21 used by participant 11 at point A. Windows 25, 26, and 27 are arranged side by side on the screen 22, and images 16, 19, and 28 of the participants at points B, C, and D are displayed in windows 25, 26, and 27, respectively. Speakers 42 and 43 are provided at the left and right of the monitor 21. The voice of the participant at point B is output with a speaker 42 : speaker 43 level ratio of 100:0, matching the horizontal position of participant image 16 on the screen 22, so that the voice seems to come from image 16. Likewise, the voice of the participant at point C is output with a level ratio of 50:50, matching the horizontal position of participant image 19 on the screen 22, so that the voice seems to come from image 19.

Description

[Detailed description of the invention]

[0001]

[Industrial Application Field] This invention relates to a method of outputting sound signals, such as voice or music, that accompany images when a plurality of windows are provided on one screen and a different image is displayed in each window, as in a video conference device or a parent-child (picture-in-picture) screen system.

[0002]

[Description of the Related Art] In a video conference in which several people converse simultaneously over a video conference system or multipoint-connected video telephones, the display has conventionally been produced by switching among sources on a single display device. For example, in a video conference system connecting four points A, B, C, and D with the display switched on a single monitor, participant 11 at point A is imaged by camera 12 as shown in FIG. 5, and the camera output is transmitted to the communication partners at points B, C, and D via control device 13 and communication line 14. Participant 11 at point A can use display monitor 15 to view, for example, image 16 of the participant at point B. To see the participants at points C and D, the image shown on display monitor 15 is switched, via control device 13, to the image from point C or D. The audio from points B, C, and D is mixed by control device 13 and output to speaker 17. Because the audio input from points A, B, C, and D is mixed before output, this method has the drawback that the listener cannot tell which participant's voice is being output.

[0003] There has also been a conventional method in which a plurality of monitors are provided at one point and each displays the image from a different point. FIG. 6 shows a method in which a video conference system connecting four points A, B, C, and D displays the communication partners' images on two monitors. At point A, display monitor 18 is placed beside display monitor 15 and displays the image signal from point C. In this way, images 16 and 19 of the participants at points B and C can be viewed.

[0004] To see the image of the participant at point D, the picture on display monitor 16 or 18 is switched to display the image of the participant at point D. The audio from points B, C, and D is mixed by control device 13 and output from speaker 17. As with switching on a single monitor, the audio input from points A, B, C, and D is mixed before output, so this conventional method has the same drawback: the listener cannot tell which participant's voice is being output.

[0005] Furthermore, in the conventional parent-child screen display method, the whole screen serves as one window in which an image is displayed, and a smaller window is provided within part of that screen (window) to display another image. For sound signals such as voice or music, either the signal accompanying one of the windows is selected by switching and output from the speaker, or the two signals accompanying both windows are simply mixed and output from the speaker, so the two could not be told apart.

[0006]

[Means for Solving the Problems] According to this invention, a plurality of speakers are distributed around the screen of a display device that provides a plurality of windows and displays a different image in each of them simultaneously on one screen. The sound signal accompanying each window is distributed to these speakers in inverse proportion to the distance from each speaker to that window, and is output.

[0007]

[Embodiment] FIG. 1 shows an embodiment in which this invention is applied to a video conference system connecting four points A, B, C, and D. FIG. 1 shows the equipment installed at point A. Display monitor 21 uses a so-called multi-window display method: a plurality of video windows 23, 24, and 25 are provided on its screen 22, and participant images 16, 19, and 28 from points B, C, and D are displayed in windows 23, 24, and 25, respectively.

[0008] In control device 13, video/audio input/output control section 31 is connected to communication line 14, television camera 12, multi-window enlargement/reduction section 32, and audio level detection section 33. Multi-window enlargement/reduction section 32, multi-window display control section 34, multi-window movement processing section 35, mouse input processing section 36, and sound image localization control section 37 are connected to central control section 38. The detection output of audio level detection section 33 is supplied to audio level control section 39, whose output is supplied to sound image localization control section 37. Mouse 41 is connected to mouse input processing section 36. Speakers 42 and 43 are arranged at the left and right sides of screen 22.

[0009] The image of participant 11 at point A captured by camera 12 is transmitted to the communication partners via video/audio input/output control section 31 and communication line 14. The multiplexed video and audio from the three points, input to control device 13 via communication line 14, are separated by video/audio input/output control section 31 into the video and audio of each point. The video is input to multi-window display control section 34 via multi-window enlargement/reduction section 32, and the audio is input to sound image localization control section 37 via audio level detection section 33 and audio level output control section 39. Here, the video from points B, C, and D is designated video channels 1, 2, and 3, and the audio accompanying video channels 1, 2, and 3 is designated audio channels 1, 2, and 3. Under the direction of central control section 38, multi-window display control section 34 displays participant image 16 from point B (video channel 1) in video window 25 on the left of screen 22, participant image 19 from point C (video channel 2) in video window 26 at the center of screen 22, and participant image 28 from point D (video channel 3) in video window 27 on the right of screen 22, as shown on screen 22 of multi-window display monitor 21.

[0010] In this invention, the audio accompanying window 25, that is, audio channel 1, is distributed to speakers 42 and 43 in inverse proportion to the distances from window 25 to speakers 42 and 43. The audio accompanying windows 26 and 27, that is, audio channels 2 and 3, is distributed to speakers 42 and 43 in the same way. To do this, central control section 38 supplies sound image localization control section 37 with the horizontal coordinate of the display center of each of windows 25, 26, and 27 and the number of the audio channel accompanying the video channel displayed in that window. Based on each window's display-center horizontal coordinate, sound image localization control section 37 controls the levels at which the audio channel accompanying the video channel in that window is distributed to speakers 42 and 43.
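The inverse-proportional distribution described above can be sketched for the two-speaker case as follows. This is a minimal illustration rather than the patent's circuit: the function name, the normalisation of the two gains to sum to 1, and the coordinate convention are all assumptions. Because the gains are proportional to 1/d, the ratio left : right equals d_right : d_left.

```python
def two_speaker_ratio(window_x, left_x, right_x):
    """Split one audio channel between a left and a right speaker.

    Gains are inversely proportional to the horizontal distance between the
    window centre and each speaker, then normalised so they sum to 1
    (the normalisation is an assumption, not stated in the source).
    """
    d_left = abs(window_x - left_x)
    d_right = abs(window_x - right_x)
    total = d_left + d_right
    if total == 0:
        raise ValueError("both speakers coincide with the window centre")
    # 1/d_left : 1/d_right is the same ratio as d_right : d_left.
    return d_right / total, d_left / total


# Window centre a quarter of the way from the left speaker (hypothetical units).
print(two_speaker_ratio(0.25, 0.0, 1.0))
```

With the speakers at the screen edges, this reduces to a linear pan across the screen width, which is consistent with the 1:0, 0.5:0.5, and 0:1 ratios given in the description for the left, center, and right windows.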

[0011] When the variable range of the audio level is set to five steps, screen 22 is divided horizontally into five regions ① to ⑤ as shown in FIG. 2A, with region ① on the side of left speaker 42. The ratio in which the audio accompanying an image (window) is distributed to speakers 42 and 43 is determined by which of regions ① to ⑤ contains that image (window). As shown in FIG. 2B, the ratio is chosen so that the nearer speaker receives the higher level and the farther speaker the lower level. For example, the audio accompanying a window located in region ② is distributed to speakers 42 and 43 in the ratio 0.75:0.25; the outputs of speakers 42 and 43 take this ratio, and the participant at point A hears the voice as if it were uttered from region ②. In screen 22 of FIG. 1, the display-center horizontal coordinates of video windows 25, 26, and 27 lie in regions ①, ③, and ⑤, respectively, so the audio level ratio for audio channel 1, which accompanies video channel 1 displayed in video window 25, is speaker 42 : speaker 43 = 1:0. Similarly, the audio level ratios for audio channels 2 and 3, which accompany video channels 2 and 3 displayed in video windows 26 and 27, are speaker 42 : speaker 43 = 0.5:0.5 and speaker 42 : speaker 43 = 0:1, respectively. Sound image localization control section 37 varies the audio levels output to speakers 42 and 43 according to these ratios. As a result, the audio of audio channel 1 accompanying the moving image in video window 25, that is, the voice of the participant at point B, sounds to participant 11 as if it were uttered from the left part of screen 22. Likewise, the audio of audio channels 2 and 3 accompanying the moving images in video windows 26 and 27, that is, the voices of the participants at points C and D, sound as if they were uttered from the center and the right part of screen 22, respectively. Because the displayed position of each conference participant in the multi-window display coincides with the apparent position of that participant's voice, it is easier than with the conventional technique to recognize which participant a voice comes from, and a good video conference becomes possible.
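The five-step scheme can be sketched as a lookup from a window's center coordinate to a region, and from the region to a level ratio. This is a hedged sketch: the function names and the pixel width are hypothetical, regions are assumed to be numbered 1 to 5 from the left and to have equal width, and the evenly spaced ratios are an assumption that agrees with the values quoted in the text (1:0, 0.75:0.25, 0.5:0.5, 0:1) but has not been checked against FIG. 2B.

```python
def region_index(center_x, screen_width, n_regions=5):
    """Map a window's centre x coordinate to a region number 1..n_regions.

    Region 1 is the leftmost region (left speaker side). Equal-width
    regions are an assumption.
    """
    if not 0 <= center_x <= screen_width:
        raise ValueError("window centre lies outside the screen")
    idx = int(center_x * n_regions / screen_width)
    # A centre exactly on the right edge belongs to the last region.
    return min(idx, n_regions - 1) + 1


def level_ratio(region, n_regions=5):
    """Return the (left, right) level ratio for a region, in even steps."""
    if not 1 <= region <= n_regions:
        raise ValueError("region out of range")
    right = (region - 1) / (n_regions - 1)
    return (1.0 - right, right)


# Hypothetical 1000-pixel-wide screen with windows centred left, middle, right.
for x in (100, 500, 900):
    print(x, region_index(x, 1000), level_ratio(region_index(x, 1000)))
```

Quantizing to a small number of regions means the ratio changes only when a window center crosses a region boundary, which matches the description of a level that is variable in a fixed number of steps.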

[0012] As shown in FIG. 3A, when speaker 44 is provided midway between speakers 42 and 43, so that three speakers are arranged horizontally below screen 22, and the variable range of the audio level is set to seven steps, screen 22 is divided horizontally into seven regions ① to ⑦, and the audio accompanying the window image in each region is output from speakers 42, 43, and 44 in the distribution ratios shown in FIG. 3B. In this case too, as in FIG. 2A, the audio levels output to speakers 42, 43, and 44 are varied according to the display-center horizontal coordinate. Using three speakers and seven audio level steps allows finer control, so the speaker of the output voice is recognized even more easily than in the case described above, and a good video conference becomes possible.
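The actual ratios for the three-speaker case are given in FIG. 3B, which is not reproduced in this text. The sketch below therefore uses an assumed scheme, linear panning between the two speakers adjacent to the window, only to illustrate how seven regions could map onto three speakers. The function name and the return order (left speaker 42, center speaker 44, right speaker 43) are hypothetical.

```python
def three_speaker_gains(region, n_regions=7):
    """Return (left_42, center_44, right_43) gains for a region 1..n_regions.

    Assumed scheme: the signal is shared linearly between the two speakers
    nearest the region; the third speaker receives nothing. This is NOT
    taken from FIG. 3B.
    """
    if not 1 <= region <= n_regions:
        raise ValueError("region out of range")
    p = (region - 1) / (n_regions - 1)  # 0.0 = far left, 1.0 = far right
    if p <= 0.5:
        center = 2.0 * p
        return (1.0 - center, center, 0.0)
    center = 2.0 * (1.0 - p)
    return (0.0, center, 1.0 - center)


for region in range(1, 8):
    print(region, three_speaker_gains(region))
```

Under this assumption the center speaker carries the whole signal for the middle region, and each outer speaker carries it for the outermost region on its side.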

[0013] Next, the case in which mouse 41 is used to move a video window displayed on multi-window display monitor 21 is described. To move a window with mouse 41, for example, the user operates a switch on mouse 41 to display a function menu window on part of screen 22, selects and clicks "Move" among its items (enlarge, reduce, move, delete, and so on) with the marker, then clicks the window to be moved in the same way, and finally moves the marker to the destination and clicks. Suppose mouse 41 is operated to move, for example, window 25 to near the center of screen 22. FIG. 2C shows video window 25 after it has been moved to near the center. First, the window to be moved is selected with mouse 41, and then the mouse is operated to move the marker to the desired position. Mouse input processing section 36 converts the data entered with mouse 41 into movement coordinate values and inputs them to central control section 38. Central control section 38 passes the movement coordinate values to multi-window movement processing section 35, which moves the video window according to them. The moved video window is input to multi-window display control section 34, which displays video window 25 at the marker position.

[0014] At the same time, central control section 38 inputs the movement coordinate values to sound image localization control section 37, which, based on them, changes the ratio in which the audio accompanying the moved video window 25 is supplied to speakers 42 and 43, in accordance with FIG. 2B. For example, when video window 25 is moved to the position shown in FIG. 2C, the audio of the audio channel accompanying video window 25, that is, the voice of the participant at point B, sounds as if it were uttered from the center of screen 22. In this way, a moving image displayed in the multi-window display can be moved to any position, and by changing the ratio of the audio levels output from speakers 42 and 43 according to the move, the displayed position of each conference participant is kept matched to the position from which that participant's voice appears to come. This corresponds to changing seats in an actual meeting; because a video conference can thus simulate a change of seats, a more realistic video conference than with the conventional technique becomes possible.
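The effect of a window move on the speaker outputs can be illustrated with a small mixdown sketch. It assumes each audio channel already has a (left, right) level ratio and that the speaker signals are simple weighted sums of the channel samples; the function name and the sample values are hypothetical and are not taken from the source.

```python
def mix_to_speakers(channel_samples, channel_ratios):
    """Mix one sample per audio channel into (left, right) speaker samples.

    channel_ratios holds one (left, right) level ratio per channel.
    A plain weighted sum is an assumption about how the levels are applied.
    """
    if len(channel_samples) != len(channel_ratios):
        raise ValueError("need exactly one ratio per channel")
    left = sum(s * l for s, (l, _) in zip(channel_samples, channel_ratios))
    right = sum(s * r for s, (_, r) in zip(channel_samples, channel_ratios))
    return left, right


# Only the participant at point B (audio channel 1) is speaking.
samples = [0.8, 0.0, 0.0]

# Windows 25, 26, 27 at left, center, right, as in FIG. 1.
before = [(1.0, 0.0), (0.5, 0.5), (0.0, 1.0)]
# Window 25 moved to the center, as in FIG. 2C; only its ratio changes.
after = [(0.5, 0.5), (0.5, 0.5), (0.0, 1.0)]

print(mix_to_speakers(samples, before))
print(mix_to_speakers(samples, after))
```

Only the moved window's ratio needs updating; the other channels keep their existing distribution.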

[0015] Next, enlarging and reducing a video window is described. When the user selects a desired video window with mouse 41 and resizes its frame to the desired size, central control section 38 transfers the size information for that video window, as determined with mouse 41, to multi-window enlargement/reduction section 32 and audio level control section 39. Multi-window enlargement/reduction section 32 enlarges or reduces the video window according to the size information and transfers it to multi-window display control section 34 for display. Audio level control section 39 raises or lowers the level of the audio accompanying the video window according to the transferred size information. As a result, the audio of a reduced video window is output at low volume and the audio of an enlarged video window at high volume. The audio of a video window that the user has enlarged in order to focus on it is automatically raised, so the user can focus on its audio as well; the audio of a video window that the user has reduced because its image seems to have little to do with the meeting is automatically lowered, so the user can concentrate on other, more important audio. A video conference with a good user interface thus becomes possible.
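The source does not state how window size maps to audio level, only that a larger window is louder and a smaller one quieter. The sketch below assumes, purely for illustration, a gain proportional to the window's width relative to a reference width, clamped to a range; the function name, the reference width, and the clamp limits are all hypothetical.

```python
def gain_for_window_size(width, reference_width, min_gain=0.25, max_gain=2.0):
    """Return an audio gain that grows with the window's linear scale.

    Proportional scaling and the clamp limits are assumptions; the source
    only says the level is raised or lowered with the window size.
    """
    if width <= 0 or reference_width <= 0:
        raise ValueError("widths must be positive")
    scale = width / reference_width
    return max(min_gain, min(max_gain, scale))


# Hypothetical 320-pixel reference window.
for w in (160, 320, 640):
    print(w, gain_for_window_size(w, 320))
```

The clamp keeps a heavily reduced window audible and prevents a full-screen window from overdriving the speakers; whether the patent's device does either is not stated.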

[0016] Conversely, the case in which the corresponding video window is enlarged or reduced according to the received audio level is described. Audio level detection section 33 detects the received audio level, and a video window whose audio level is at or above a certain threshold is displayed enlarged. A video window whose audio level is at or below a certain threshold is displayed reduced. This makes possible video conferences and video telephone calls with a good user interface. For example, when the participant at point B speaks loudly to attract the other participants' attention, audio level detection section 33 judges this voice to be at or above the threshold, and window 25, which displays the participant at point B, is enlarged. When, for example, the participant at point C has not joined the conversation for a while and has remained silent for longer than a set time, video window 26 is displayed reduced.
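The threshold behaviour described above can be sketched as a small state machine per window. The threshold values, the silence timeout, the class name, and the returned action strings are hypothetical; the source specifies only an upper threshold for enlarging, a lower threshold for reducing, and a set period of silence.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class WindowSizer:
    """Decide whether one window should be enlarged, reduced, or kept.

    All numeric defaults are illustrative assumptions.
    """
    loud_threshold: float = 0.6
    quiet_threshold: float = 0.05
    silence_timeout: float = 30.0  # seconds of quiet before reducing
    quiet_since: Optional[float] = None

    def update(self, level: float, now: float) -> str:
        if level >= self.loud_threshold:
            self.quiet_since = None
            return "enlarge"
        if level <= self.quiet_threshold:
            if self.quiet_since is None:
                self.quiet_since = now  # start timing the silence
            if now - self.quiet_since >= self.silence_timeout:
                return "reduce"
            return "keep"
        self.quiet_since = None  # normal speech resets the silence timer
        return "keep"


sizer = WindowSizer()
print(sizer.update(0.9, now=0.0))   # loud voice
print(sizer.update(0.0, now=1.0))   # silence begins
print(sizer.update(0.0, now=31.0))  # silence has lasted 30 s
```

Timing the silence, rather than reducing the window on the first quiet sample, reflects the example in the text of a participant who stays silent for longer than a set time.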

[0017] One system for displaying two moving images in windows is the parent-child screen display method. An example in which this invention is applied to a parent-child screen system is described with reference to FIG. 4. On screen 22 of parent-child screen display monitor 51, the image of one channel, for example, is displayed in window (parent screen) 52, which fills the screen, and window (child screen) 53 is provided in part of screen 22, that is, within part of window 52, to display the image of another channel. Parent-child screen display control section 54 for parent-child screen display monitor 51 contains video/audio input control section 55, parent-child screen display control section 56, child screen movement processing section 57, audio level output control section 58, remote controller input processing section 59, and central control section 61, which are interconnected. The video and audio input to video/audio input control section 55 are passed to parent-child screen display control section 56 and audio level output control section 58, respectively. The viewer specifies the display position of child screen 53 with remote controller 62, and the specified data is transferred to remote controller input processing section 59, which inputs the specified child screen display position information to central control section 61. Central control section 61 transfers this information to parent-child screen display control section 56, which embeds child screen 53 in parent screen 52 and displays the result on parent-child screen display monitor 51. At the same time, central control section 61 inputs the child screen display position information to audio level output control section 58, which controls the audio output levels of the parent and child screens based on it. The audio output levels are controlled in the same way as in the embodiment of FIG. 1. For example, when the parent and child screens are displayed as shown on screen 22 of FIG. 4, that is, with child screen 53 in the right part, the audio of parent screen 52 is made to sound from the center and the audio of child screen 53 from the right.

[0018] In this way, when two moving images are displayed simultaneously by the parent-child screen method, the audio levels of the parent and child screens output to the two speakers 42 and 43 are controlled according to the display position of child screen 53. The audio of the parent and child screens can therefore be heard at the same time, and the listener can easily tell which audio belongs to the parent screen and which to the child screen. In the parent-child screen method, the sound signal accompanying an image (window) is not limited to voice and may be, for example, music. Although moving images are displayed in the description above, this invention also applies to the display of still images. In the description above, a plurality of speakers are arranged horizontally and the level ratio of their outputs is controlled; when the windows are arranged one above another, the speakers may instead be arranged vertically and the level ratio of their outputs controlled so that the sound image is localized vertically in correspondence with the window. Furthermore, speakers may be distributed around the entire periphery of the screen so that the sound image is localized in correspondence with the two-dimensional position of the window on the screen.
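The variant with speakers distributed around the whole screen can be sketched by extending the inverse-distance rule to two dimensions. This is an illustrative sketch under stated assumptions: the function name, the use of Euclidean distance to the window center, the normalisation of gains to sum to 1, and the corner speaker layout in the example are not taken from the source.

```python
import math


def speaker_gains_2d(window_center, speaker_positions):
    """Return one gain per speaker, inversely proportional to its distance
    from the window centre and normalised to sum to 1 (assumed)."""
    wx, wy = window_center
    distances = [math.hypot(wx - sx, wy - sy) for sx, sy in speaker_positions]
    # A window centred exactly on a speaker sends the whole signal there.
    if any(d == 0.0 for d in distances):
        on_speaker = [1.0 if d == 0.0 else 0.0 for d in distances]
        return [g / sum(on_speaker) for g in on_speaker]
    weights = [1.0 / d for d in distances]
    total = sum(weights)
    return [w / total for w in weights]


# Hypothetical layout: one speaker at each corner of a unit-square screen.
corners = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
print(speaker_gains_2d((0.5, 0.5), corners))  # window at the screen centre
print(speaker_gains_2d((0.2, 0.3), corners))  # window toward one corner
```

A window at the screen center is shared equally among the four speakers, while a window near one corner is weighted toward that corner's speaker.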

[0019]

[Effects of the Invention] As described above, in a system that can display a different image in each of a plurality of windows, such as a multi-window display method or a parent-child screen method, the level of the sound accompanying each of the displayed images is varied according to that image's display position and output to n speakers. As a result, the sound is heard from the position where the image is displayed. Compared with the conventional technique, the speaker of an output voice is easier to recognize, which makes a good video conference possible, and in a parent-child screen display the sound of the child screen can be distinguished when listening.

[0020] When a window is enlarged or reduced in order to focus on one of several displayed windows, raising or lowering the level of the sound signal accompanying that window according to the enlargement or reduction ratio makes possible a video conference with a better user interface. Conversely, when a window is enlarged or reduced according to the received audio level, a participant who wants attention can call out loudly to have his or her window enlarged, and when a participant leaves his or her seat for a while during the conference, that window is reduced, so the display screen is used effectively and a video conference with a good user interface becomes possible.

[0021] When this invention is applied to the parent-child screen display method, the sounds of the parent and child screens can be heard at the same time, and the listener can easily tell which voice comes from the parent screen and which from the child screen.

[Brief description of the drawings]

[FIG. 1] A block diagram showing an embodiment in which this invention is applied to an audio output method in a video conference display device.

[FIG. 2] A shows the divided regions of the screen when the variable range of the audio level in FIG. 1 is set to five steps; B shows the output level ratios from speakers 42 and 43 for the audio accompanying each region; C shows window 25 moved from its position in FIG. 1 to the center.

[FIG. 3] A shows the divided regions of the screen when three speakers are used and the variable range of the audio level is set to seven steps; B shows the output level ratios from speakers 42, 43, and 44 for the audio accompanying each region.

[FIG. 4] A block diagram showing an embodiment in which this invention is applied to a sound output method in a parent-child screen system.

[FIG. 5] A block diagram showing a monitor device of a conventional video conference system.

[FIG. 6] A block diagram showing another monitor device of a conventional video conference system.

Claims

[Claims]

[Claim 1] A sound signal output method for a multi-image window display, for outputting the sound signals accompanying images in a display device that provides a plurality of windows on one screen and displays a different image in each of these windows, characterized in that a plurality of speakers are distributed around the screen, and the sound signal accompanying the image in each window is distributed to these speakers in inverse proportion to the distance from each speaker to that window and is output.

[Claim 2] The sound signal output method for a multi-image window display according to claim 1, characterized in that the level of the sound signal accompanying a window is controlled according to the enlargement or reduction of that window.