KR20000038684A

KR20000038684A - Volume control circuit for group video conference system

Info

Publication number: KR20000038684A
Application number: KR1019980053760A
Authority: KR
Inventors: 최상준; 유재하
Original assignee: 구자홍; 엘지전자 주식회사
Priority date: 1998-12-08
Filing date: 1998-12-08
Publication date: 2000-07-05
Also published as: KR100565184B1

Abstract

PURPOSE: A volume control circuit for group video conference system is provided to enable a user at the far-end to control the volume of participants by using a camera control signal. CONSTITUTION: A volume control circuit for group video conference system includes a beam former(221), a variation detector(222), a location information analyzer(223), a volume controller(224), and a sound transmitter/receiver(226). The beam former(221) receives outer sound signal and outputs voice of the required speaker selectively. The variation detector(222) detects the variation on the camera location and zoom parameters. The location information analyzer(223) receives the output signal from the variation detector(222) and detects the current location of the speaker. The volume controller(224) amplifies or reduces the output signal from the beam former by using a multiplier(225). The sound transmitter/receiver(226) transmits/receives the output signal from the multiplier(225) to/from a far-end terminal.

Description

Volume Control Circuit of Team Video Conferencing System

본 발명은 단체 화상 회의 시스템의 음량 제어 회로에 관한 것으로, 특히 단체 화상 회의 시스템(Group Video Conference System)에 있어서 원단(far-end) 사용자가 근단 회의 참가자의 위치에 따라 근단 카메라 각도와 줌 인자(Zoom Factor)를 제어하는 카메라 제어신호를 통해 상기 카메라의 최종 위치 변동 및 줌 크기에 따라 참가자 음성을 선택적으로 집중 및 증폭 또는 감소시키도록 한 단체 화상 회의 시스템의 음량 제어 회로에 관한 것이다.The present invention relates to a volume control circuit of a group video conferencing system. In particular, in a group video conference system, a far-end user uses a near-end camera angle and a zoom factor according to the position of a near-end conference participant. And a volume control circuit of a group video conferencing system for selectively concentrating and amplifying or reducing participant voices according to a final position change and a zoom size of the camera through a camera control signal controlling a zoom factor.

단체 화상 회의 시스템은 서로 다른 지역에 있는 집단과 집단이 지역적으로 이동이 없이 한 곳에 모여 회의나 세미나 등을 할 수 있게 하는 시스템으로, 같은 장소에 있지 않는 원격지의 사람과 음성과 영상 신호를 주고받음으로써 같은 장소에서 대화나 회의를 할 수 있다.The group video conferencing system is a system that allows groups and groups in different regions to gather together in one place without conferences and to carry out meetings and seminars. It exchanges audio and video signals with remote people who are not in the same place. This allows you to have a conversation or meeting in the same place.

또한, 회의 참가자 수에 따라 개인 대 개인, 개인 대 다수, 다수 대 다수의 환경으로 분류될 수 있고, 상대방이 다수인 경우에는 카메라를 통하여 모든 사람의 영상 및 음성을 수신함과 아울러, 일반적으로 모든 사람의 음성을 듣기보다는 특정 회의 참가자의 영상과 음성을 선택적으로 수신한다.In addition, according to the number of participants in the conference, it can be classified into individual to individual, individual to many, many to many environments, and in case of a large number of parties, all people can receive video and audio through the camera and generally everyone Rather than listening to your voice, you can selectively receive video and audio from specific conference participants.

이를 위해서는 마이크로폰 배열(Microphone Array)을 사용하여 화자의 좌우각, 상하각 및 거리를 알아내고 이에 적합한 카메라 각도와 줌을 조절하고 그 방향으로의 음성 신호만을 받아들이도록 알고리듬(Algorithm)을 구현한다.To do this, a microphone array is used to determine the speaker's left, right, top and bottom angles, adjust the camera angle and zoom accordingly, and implement an algorithm that accepts only voice signals in that direction.

일반적인 단체 화상 회의 시스템의 구성은 도 1에 도시한 바와 같이 각 지역에는 여러 명의 회의 참가자와 그들의 목소리 등을 전달하는 마이크가 중앙에 설치되어 있고, 비디오 이미지를 위해 셋톱박스 뒤에 카메라가 설치되어 된다.As shown in FIG. 1, a general group video conferencing system is composed of microphones for transmitting a plurality of conference participants and their voices in the center, and a camera is installed behind the set-top box for video images.

그리고, 상기 중앙에 설치된 마이크의 내부는 복수의 마이크로폰으로 구성된 마이크로폰 어레이 형태로 카메라가 전체 회의 참석자를 비추지 않고 말을 하고 있는 회의자만을 비추고자 할 때 그 방향을 알아내는데 사용된다.In addition, the inside of the centrally installed microphone is used to find out the direction when the camera wants to illuminate only the speaker who is speaking without the whole conference attendant in the form of a microphone array composed of a plurality of microphones.

따라서, 그 방향으로의 공간적 필터링에 해당되는 빔포밍(Beamforming)을 수행하여 그 방향의 목소리만 수음하는데 사용될 수 있고, 원래의 목적인 목소리를 채취하는 본래의 기능만으로도 사용될 수 있다.Thus, beamforming corresponding to spatial filtering in the direction may be performed to receive only the voice in the direction, and may be used only as an original function of collecting the voice which is the original purpose.

그리고, 셋톱박스 위에 설치된 카메라는 내부에 모터 등의 전동장치가 설치되어 좌우상하로 움직일 수 있으며, 줌기능이 있어 원거리의 피사체를 가까이 볼 수도 있다.In addition, the camera installed on the set-top box can be moved left and right by installing a motor, such as a motor therein, and the zoom function can also look closer to the subject in the distance.

이러한 카메라의 동적인 기능은 여러 사람의 모습이 하나의 카메라에 의해 비취지는 단체 화상 회의 시스템 환경에서는 필수적이다.The dynamic function of such a camera is essential in a group video conferencing system environment where several people's appearances are reflected by a single camera.

여기서, 실제 데이터 전송시 근거리 통신망(Local Area Network) 또는 인터넷망(Internet)에서 사용하는 프로토콜(Protocol)은 H.323이고, 종합 정보 통신망(Integrated Services Digital Network)에서 사용하는 프로토콜은 H.320이고, 공중 회선 교환 전화망(Public Switched Telephone Network)에서 사용하는 프로토콜은 H.324가 사용된다.Here, the protocol used in the local area network or the Internet when transmitting data is H.323, and the protocol used in the integrated services digital network is H.320. For example, H.324 is used as the protocol used in public switched telephone networks.

도 2는 종래 단체 화상 회의 시스템의 구성을 보인 블록도로서, 이에 도시된 바와 같이 화자의 실제 영상 및 음성을 수신함과 아울러 카메라(31)의 위치 제어 신호를 송신하는 원단 단말기(1)와; 상기 원단 단말기(1)에서 카메라(31)의 위치 제어 신호를 입력받아 상기 카메라(31)의 위치를 조정하고 실제 화자의 영상 및 음성을 송신하는 근단 단말기(2)로 구성되며, 상기 원단 단말기(1)와 근단 단말기(2)는 각각 음향부(10)(39)와 영상부(20)(40)로 구성된다.Fig. 2 is a block diagram showing the structure of a conventional group video conferencing system, which includes a far-end terminal 1 for receiving actual video and audio of a speaker and transmitting a position control signal of a camera 31; The far-end terminal 1 receives the position control signal of the camera 31, adjusts the position of the camera 31, and consists of a near-end terminal 2 for transmitting the video and audio of the actual speaker. 1) and the near-end terminal 2 are composed of sound units 10 and 39 and an image unit 20 and 40, respectively.

상기 원단 단말기(1)의 영상부(20)는 카메라의 위치 조정 신호를 입력받는 입력부(21)와; 상기 입력부(21)의 아날로그신호를 디지탈신호로 변환하는 제1 카메라 제어기(22)와; 상기 제1 카메라 제어기(22)의 카메라 제어 신호를 카메라 제어 프로토콜에 부합되도록 변환하는 제1 코드변환기(23)와; 상기 제1 코드변환기(23)의 출력 신호를 송수신하는 제1 송수신기(24)로 구성되며, 상기 음향부(10)는 음향 데이터를 송수신하는 제1 음향 송수신기(12)와; 상기 제1 음향 송수신기(12)의 출력 데이터를 외부로 송출하는 스피커(11)로 구성된다.The image unit 20 of the far-end terminal 1 includes an input unit 21 for receiving a camera position adjustment signal; A first camera controller 22 for converting an analog signal of the input unit 21 into a digital signal; A first code converter (23) for converting a camera control signal of the first camera controller (22) to conform to a camera control protocol; And a first transceiver (24) for transmitting and receiving the output signal of the first code converter (23), wherein the sound unit (10) comprises: a first sound transceiver (12) for transmitting and receiving sound data; It is composed of a speaker 11 for transmitting the output data of the first sound transceiver 12 to the outside.

그리고, 상기 근단 단말기(2)의 영상부는 원단 단말기(1)의 제1 송수신기(24)와 카메라 제어 신호를 송수신하는 제2 송수신기(34)와; 상기 제2 송수신기(34)를 통해 상기 카메라 제어 프로토콜에 부합되는 카메라 제어 신호를 출력하는 제2 코드변환기(33)와; 상기 제2 코드변환기(33)의 출력신호를 아날로그신호로 변환하여 카메라(31)를 제어하는 제2 카메라제어기(32)로 구성되며, 상기 음향부(40)는 복수의 마이크로폰(M1∼M3)을 통해 외부 소리 신호를 입력받아 이를 조합하여 원하는 화자의 음성을 선택적으로 출력하는 빔형성기(41)와; 상기 빔형성기(41)의 출력신호를 주변 환경 요소에 따라 곱셈기(43)를 통해 증폭 또는 감쇄시키는 주변환경측정기(42)와; 상기 곱셈기(43)의 출력신호를 상기 원단 단말기(1)로 송수신하는 제2 음향 송수신기(44)로 구성되며, 이와 같이 구성된 종래 기술에 따른 동작과정을 첨부한 도 3을 참조하여 상세히 설명한다.In addition, the video unit of the near-end terminal 2 and the second transceiver 34 for transmitting and receiving a camera control signal with the first transceiver 24 of the far-end terminal (1); A second code converter (33) for outputting a camera control signal conforming to the camera control protocol through the second transceiver (34); It is composed of a second camera controller 32 for controlling the camera 31 by converting the output signal of the second code converter 33 into an analog signal, the sound unit 40 is a plurality of microphone (M1 ~ M3) A beamformer 41 for receiving an external sound signal through the combination and selectively outputting a voice of a desired speaker; An environment measuring device 42 which amplifies or attenuates the output signal of the beam former 41 through a multiplier 43 according to the environment elements; It consists of a second acoustic transceiver 44 for transmitting and receiving the output signal of the multiplier 43 to the far-end terminal 1, it will be described in detail with reference to FIG.

우선, 두 지역간의 호 설정을 통해 서로 통신을 하고 있는 상황에서 원단 단말기(1) 측의 사용자가 근단 단말기(2) 측의 카메라(31)의 방향을 움직여 상기 근단 단말기(2) 측의 다른 피사체를 보려고 셋톱박스나 리모콘의 조작을 하게 되면, 제1 카메라제어기(22)는 사용자가 입력하는 아날로그 정보를 디지탈화하여 제1 코드변환기(23)로 출력하게 된다.First, the user of the far-end terminal 1 side moves the direction of the camera 31 of the near-end terminal 2 side in the situation of communicating with each other by setting up a call between two regions, and the other subject on the near-end terminal 2 side. When operating the set-top box or the remote control to see the first camera controller 22 digitalizes the analog information input by the user and outputs to the first code converter 23.

따라서, 상기 제1 코드변환기(23)는 상기 제1 카메라제어기(22)의 디지탈 출력신호를 도 3과 같이 화상회의 망 특성에 부합되는 통신 프로토콜로 변환하게 된다.Accordingly, the first code converter 23 converts the digital output signal of the first camera controller 22 into a communication protocol that conforms to the videoconferencing network characteristics as shown in FIG. 3.

상기 동작 메시지는 하나의 방향으로 일관성을 가지며 시작 메시지로 시작하여 계속 메시지를 통해 계속적인 움직임을 나타내고 정지 메시지로 움직임을 중단하게 된다.The operation message is consistent in one direction and starts with the start message, indicates continuous movement through the continuous message, and stops the movement with the stop message.

예를 들어 카메라(31)를 오른쪽 위로 3번 움직이는 경우, 우선 시작 메시지(00000001)가 들어오고, 오른쪽과 위로 움직이는 메시지(11110000)가 인가되어 카메라가 오른쪽 위로 1번 움직인 후, 400msec동안 정지대기하는 메시지가 도 3의 (a)와 같이 인가된다.For example, if the camera 31 is moved three times to the upper right, a start message (00000001) first comes in, the message moves to the right and up (11110000) is applied and the camera moves one time to the upper right, and then stops for 400 msec. Message is authorized as shown in FIG.

그 후, 도 3의 (b)와 같이 계속 동작을 유지하도록 계속 메시지(00000010) 및 오른쪽과 위로 움직이는 메시지(11110000)가 인가되어 상기 카메라(31)가 오른쪽 위로 2번째 움직이고, 다시 계속 메시지(00000010) 및 오른쪽과 위로 움직이는 메시지(11110000)가 인가되어 상기 카메라(31)가 오른쪽 위로 3번째 움직이게 된다.Thereafter, as shown in (b) of FIG. 3, a continuous message (00000010) and a right and up moving message (11110000) are applied to move the camera 31 for the second time to the upper right, and then to keep the message (00000010) again. ) And the right and up moving messages 11110000 are applied to move the camera 31 up and down the third time.

그리고, 도 3의 (c)와 같이 정지 메시지(00000011)가 인가되어 동작을 중지하게 된다.Then, as shown in FIG. 3C, a stop message (00000011) is applied to stop the operation.

위와 같은 동작 메시지들은 순차적으로 제1 송수신기(24)로 전달되고, 이에 상기 송수신기(24)는 데이터 링크(Data Link)층에 해당되는 프로토콜을 사용하여 근단 단말기(2) 측의 제2 송수신기(34)로 데이터를 전송하게 된다.The operation messages as described above are sequentially transmitted to the first transceiver 24. Accordingly, the transceiver 24 uses the protocol corresponding to the data link layer to the second transceiver 34 on the near-end terminal 2 side. Will send the data.

그리고, 상기 제2 송수신기(34)는 상기 제1 송수신기(24)에 수신되는 데이터를 제2 코드변환기(33)로 출력하게 되고, 상기 제2 코드변환기(33)에서 상기 제2 송수신기(34)의 출력신호를 상기 통신 프로토콜에 부합되는 카메라 제어 신호로 변환된다.In addition, the second transceiver 34 outputs data received by the first transceiver 24 to the second code converter 33, and the second transceiver 34 in the second code converter 33. Is converted into a camera control signal conforming to the communication protocol.

따라서, 제2 카메라제어기(32)는 상기 제2 코드변환기(33)의 출력신호를 입력받아 카메라(31)의 위치를 제어하게 된다.Therefore, the second camera controller 32 receives the output signal of the second code converter 33 to control the position of the camera 31.

이때, 복수의 마이크로폰(M1∼M3)으로부터 외부 소리를 입력받은 빔형성기(41)는 이를 조합하여 원하는 화자의 음성을 선택적으로 출력하게 되고, 주변환경측정기(42)는 주변 환경 요소에 따라 곱셈기(43)를 통해 상기 빔형성기(41)의 출력신호를 증폭 또는 감쇄시켜 출력하게 된다.At this time, the beamformer 41 receiving the external sound from the plurality of microphones M1 to M3 selectively outputs the desired speaker's voice by combining them, and the ambient environment measurer 42 is a multiplier according to the environment element. 43 amplifies or attenuates the output signal of the beamformer 41 and outputs the amplified signal.

따라서, 상기 곱셈기(43)의 출력신호를 입력받은 제2 음향 송수신기(44)는 이를 상기 데이터 링크 층에 해당되는 프로토콜을 이용하여 제1 음향 송수신기(12)로 송신하고, 이를 수신한 상기 제1 음향 송수신기(12)는 이를 스피커(11)를 통해 외부로 출력하게 된다.Accordingly, the second acoustic transceiver 44 which receives the output signal of the multiplier 43 transmits it to the first acoustic transceiver 12 using a protocol corresponding to the data link layer, and receives the first acoustic transceiver 12. The audio transceiver 12 outputs it to the outside through the speaker 11.

상기와 같이 종래 마이크로폰을 통해 입력되는 외부 신호를 빔포밍하여 원하는 화자의 음성을 선택적으로 출력함에 있어서 상기 화자의 출력을 외부 환경의 조건에 따라 스스로 계측함으로써, 오차가 심해 정확한 동작을 하기가 매우 어려워 오동작하고, 이를 해결하기 위한 알고리듬이 복합하여 구현이 어려운 문제점이 있었다.By beamforming an external signal input through a conventional microphone as described above, and selectively outputting a desired speaker's voice, the speaker's output is self-measured according to the conditions of the external environment. There was a problem that is difficult to implement because of a complex algorithm to solve the malfunction.

따라서, 본 발명은 상기와 같은 종래의 문제점을 해결하기 위하여 창안한 것으로, 단체 화상 회의 시스템에 있어서 원단 사용자가 근단 회의 참가자의 위치에 따라 근단 카메라 각도와 줌 인자를 제어하는 카메라 제어신호를 통해 상기 카메라의 최종 위치 변동 및 줌 크기에 따라 참가자 음성을 선택적으로 집중 및 증폭 또는 감소시키도록 한 단체 화상 회의 시스템의 음량 제어 회로를 제공함에 그 목적이 있다.Accordingly, the present invention has been made to solve the above-mentioned conventional problems. In the group video conferencing system, the far-end user uses the camera control signal to control the near-end camera angle and zoom factor according to the position of the near-end conference participant. It is an object of the present invention to provide a volume control circuit of a group video conferencing system for selectively concentrating and amplifying or reducing participant voice according to a final position change and a zoom size of a camera.

도 1은 일반적인 단체 화상 회의 시스템의 구성을 보인 개략도.1 is a schematic diagram showing the configuration of a general group video conferencing system;

도 2는 종래 단체 화상 회의 시스템의 구성을 보인 블록도.2 is a block diagram showing the structure of a conventional group video conferencing system.

도 3은 도 2에서 코드 변환기의 동작 메시지를 보인 도.3 is a view illustrating an operation message of a code converter in FIG. 2;

도 4는 본 발명 단체 화상 회의 시스템의 구성을 보인 블록도.4 is a block diagram showing the configuration of the present invention group video conferencing system.

도 5는 도 4에서 카메라를 중심으로 참가자의 위치 변화를 도시한 2차원 평면도.FIG. 5 is a two-dimensional plan view of a participant's position change around the camera in FIG. 4; FIG.

***도면의 주요 부분에 대한 부호의 설명****** Description of the symbols for the main parts of the drawings ***

100 : 원단 단말기 110,210 : 음향부100: fabric terminal 110,210: sound unit

120, 220 : 영상부 221 : 빔 형성기120, 220: image portion 221: beam former

222 : 변화량 측정기 223 : 위치정보 분석기222: change amount measuring instrument 223: location information analyzer

224 : 음량 조정기 225 : 곱셈기224 volume control 225 multiplier

226 : 음향 송수신기226: Acoustic Transceiver

상기와 같은 목적을 달성하기 위한 본 발명의 구성은 복수의 마이크로폰을 통해 외부 소리 신호를 입력받아 현재 화자의 위치에 따라 이를 조합하여 원하는 화자의 음성을 선택적으로 출력하는 빔형성기와; 카메라의 위치 변화 및 줌인자를 검출하는 변화량 측정기와; 상기 변화량 측정기의 출력신호를 입력받아 화자의 현재 위치를 검출하는 위치정보 분석기와; 상기 위치 정보 분석기의 출력신호에 따라 곱셈기를 통해 상기 빔형성기의 출력신호를 증폭 또는 감쇄시키는 음량 조절기와; 상기 곱셈기의 출력신호를 상기 원단 단말기로 송수신하는 음향 송수신기로 구성하여 된 것을 특징으로 한다.A configuration of the present invention for achieving the above object is a beamformer for receiving an external sound signal through a plurality of microphones and selectively outputs the desired speaker's voice by combining them according to the current speaker's position; A change amount measuring unit for detecting a position change and a zoom factor of the camera; A positional information analyzer which receives an output signal of the change amount measurer and detects a current position of the speaker; A volume controller for amplifying or attenuating the output signal of the beam former through a multiplier according to the output signal of the position information analyzer; Characterized in that it consists of an audio transceiver for transmitting and receiving the output signal of the multiplier to the far-end terminal.

이하, 본 발명에 따른 일실시예에 대한 동작과 작용효과를 첨부한 도면을 참조하여 상세히 설명하면 다음과 같다.Hereinafter, with reference to the accompanying drawings, the operation and effect of an embodiment of the present invention will be described in detail.

도 4는 본 발명 단체 화상 회의 시스템의 구성을 보인 블록도로서, 본 발명은 도 2에 도시한 종래 단체 화상 회의 시스템의 구성에서 근단 단말기(200)의 음향부(220)를 복수의 마이크로폰(M1∼M3)을 통해 외부 소리 신호를 입력받아 현재 화자의 위치에 따라 이를 조합하여 원하는 화자의 음성을 선택적으로 출력하는 빔형성기(221)와; 카메라(211)의 위치 변화 및 줌인자를 검출하는 변화량 측정기(222)와; 상기 변화량 측정기(222)의 출력신호를 입력받아 화자의 현재 위치를 검출하는 위치 정보 분석기(223)와; 상기 위치 정보 분석기(223)의 출력신호에 따라 곱셈기(225)를 통해 상기 빔형성기(221)의 출력신호를 증폭 또는 감쇄시키는 음량 조절기(224)와; 상기 곱셈기(225)의 출력신호를 상기 원단 단말기(100)로 송수신하는 음향 송수신기(226)로 구성하며, 이와 같이 구성된 본 발명에 따른 동작과정을 도 5의 2차원 평면도를 참조하여 설명한다.4 is a block diagram showing a configuration of a group video conferencing system of the present invention, and the present invention provides a plurality of microphones M1 in the sound unit 220 of the near-end terminal 200 in the configuration of the conventional group video conferencing system shown in FIG. A beamformer 221 for receiving an external sound signal through -M3) and selectively outputting the desired speaker's voice by combining the signals according to the current speaker's position; A change amount measurer 222 for detecting a position change and a zoom factor of the camera 211; A position information analyzer 223 which receives the output signal of the change amount measuring instrument 222 and detects the current position of the speaker; A volume controller 224 for amplifying or attenuating the output signal of the beamformer 221 through a multiplier 225 according to the output signal of the position information analyzer 223; It comprises a sound transceiver 226 for transmitting and receiving the output signal of the multiplier 225 to the far-end terminal 100, the operation process according to the invention configured as described above will be described with reference to the two-dimensional plan view of FIG.

우선, 최초 카메라(211)가 도 5에서의 제1 위치(P1)에 있는 화자를 화면의 특정 크기에 맞도록 줌 동작을 하며 비추고 있을 경우, 상기 제1 위치의 위치벡터(dP1)를 구할 수 있고, 또한, 상기 위치벡터(dP1)와 마이크의 위치벡터(dM)의 차신호로부터 상기 마이크로폰 어레이로부터의 제1 위치(P1)의 위치벡터(dMP1)를 구한다.First, when the first camera 211 is zooming the speaker at the first position P1 in FIG. 5 to fit a specific size of the screen, the position vector dP1 of the first position may be obtained. Further, the position vector dMP1 of the first position P1 from the microphone array is obtained from the difference signal between the position vector dP1 and the position vector dM of the microphone.

그러므로, 그에 따르는 각도에 따라 빔형성기(221)는 제1 위치(P1) 방향으로 빔을 형성하고, 거리에 따라 마이크로폰 이득을 조정한다.Therefore, the beamformer 221 forms the beam in the direction of the first position P1 according to the angle thereof, and adjusts the microphone gain according to the distance.

그리고, 두 지역간의 호 설정을 통해 서로 통신을 하고 있는 상황에서 원단 단말기(100) 측의 사용자가 근단 단말기(200) 측의 카메라(211)의 방향을 움직여 상기 근단 단말기(100)측의 다른 피사체를 보려고 셋톱박스나 리모콘을 조작하면, 제1 카메라제어기(122)는 사용자가 입력하는 아날로그 정보를 디지탈화하여 제1 코드변환기(123)로 출력한다.In addition, the user of the far-end terminal 100 moves the direction of the camera 211 of the near-end terminal 200 in a situation in which the two terminals communicate with each other through call setup. When operating the set-top box or the remote control to see, the first camera controller 122 digitalizes the analog information input by the user and outputs to the first code converter 123.

따라서, 상기 제1 코드변환기(123)는 상기 제1 카메라제어기(122)의 디지털 출력신호를 화상회의 망 특성에 부합되는 통신 프로토콜로 변환하고, 이를 제1 송수신기(124)를 통해 근단 단말기(200)의 제2 송수신기(214)로 전달한다.Accordingly, the first code converter 123 converts the digital output signal of the first camera controller 122 into a communication protocol that conforms to the videoconferencing network characteristics, and then converts the digital output signal from the first camera controller 122 to the near-end terminal 200 through the first transceiver 124. To a second transceiver 214.

그리고, 상기 제2 송수신기(214)는 상기 제1 송수신기(124)에 수신되는 데이터를 제2 코드변환기(213)로 출력하고, 상기 제2 코드변환기(213)에서 상기 제2 송수신기(214)의 출력신호를 상기 통신 프로토콜에 부합되는 카메라 제어 신호로 변환된다.The second transceiver 214 outputs the data received by the first transceiver 124 to the second code converter 213, and the second code converter 213 of the second transceiver 214. The output signal is converted into a camera control signal conforming to the communication protocol.

따라서, 제2 카메라제어기(212)는 상기 제2 코드변환기(213)의 출력신호에서 카메라 구동을 위한 아날로그 데이터를 추출하여 상기 카메라(211) 구동을 한다.Accordingly, the second camera controller 212 extracts analog data for driving the camera from the output signal of the second code converter 213 to drive the camera 211.

이때, 상기 제2 카메라제어기(212)의 다수의 동작 메시지들을 입력받은 변화량 측정기(222)는 동작 메시지의 방향성(Pan :좌우, Tilt:상하, Zoom:원근)을 저장하고 정지 메시지 전까지 같은 방향성을 갖는 계속 메시지의 개수를 조사하여 상기 동작 메시지의 방향성을 구한다.At this time, the change amount measuring unit 222 receiving the plurality of operation messages of the second camera controller 212 stores the direction of the operation message (Pan: left and right, Tilt: up and down, Zoom: perspective) and maintains the same direction until the stop message. The direction of the operation message is obtained by examining the number of continuous messages having the same.

그리고, 위치정보 분석기(223)는 상기 변화량 측정기(222)의 출력신호를 입력받아 갱신된 사용자의 위치(P2)를 알 수 있으며, 이로부터 카메라 각도를 조절하며 이에 따라 보여지는 피사체는 오토줌 기능에 의해 화면의 일정 크기를 갖는다.In addition, the location information analyzer 223 receives the output signal of the change amount measuring unit 222 to know the updated user's position P2, and adjusts the camera angle therefrom. By have a certain size of the screen.

따라서, 제2 위치벡터(dP2)와 마이크의 위치벡터(dM)의 차신호로부터 마이크로폰 어레이로부터의 제2 위치(P2)의 위치벡터(dMP2)를 구한다.Therefore, the position vector dMP2 of the second position P2 from the microphone array is obtained from the difference signal between the second position vector dP2 and the position vector dM of the microphone.

그러므로, 그에 따르는 각도에 따라 상기 빔형성기(221)는 복수의 마이크로폰에 들어온 신호들을 전자적인 신호처리를 통하여 제2 위치(P2)방향으로 빔을 형성하고, 거리에 따라 마이크로폰 이득을 조정한다.Therefore, according to the angle, the beamformer 221 forms a beam in the direction of the second position P2 through the electronic signal processing of the signals input to the plurality of microphones, and adjusts the microphone gain according to the distance.

여기서, 음량조절기(224)는 원단 사용자에 의해 근단 화자의 화상을 더 크게 혹은 작게 보기 위한 카메라(211)의 줌인자를 입력받아 오토줌 기능에 의해 이전 줌값과의 차이값(dz)을 검출하고, 상기 오토줌 기능을 통해 가까이 보는 경우 상기 차이값은 양의 값을 가지며 이득을 상향조정하고, 멀리 보는 경우 상기 차이값은 음의 값을 가지며 이득을 하향조정한다.Here, the volume controller 224 receives a zoom factor of the camera 211 to view the near end speaker's image larger or smaller by the far-end user, and detects the difference value dz from the previous zoom value by the auto zoom function. When looking closer through the auto-zoom function, the difference value has a positive value and increases the gain, and when viewed far away, the difference value has a negative value and lowers the gain.

즉, 상기 카메라(211)의 오토줌 기능은 이상값을 가지므로 각 레벨마다 3데시벨만큼 차이를 가지도록 하고, 1레벨 증가할 경우 음량 조절기는 +3데시벨에 해당하는 값(2^1/2)을 상기 곱셈기(225)를 통해 상기 빔형성기(221)의 출력신호에 곱하므로 제2 송수신기(226)를 통해 송신되는 음향신호는 3데시벨 크게 전송한다.That is, since the auto-zoom function of the camera 211 has an ideal value, the auto zoom function has a difference by 3 decibels for each level, and when the level is increased by one level, the volume controller corresponds to a value of +3 decibels (2 ^1/2 ). Multiply the output signal of the beamformer 221 by the multiplier 225, so that the acoustic signal transmitted through the second transceiver 226 is 3 decibels larger.

그러므로, 이를 수신한 상기 제1 음향 송수신기(112)는 이를 스피커(111)를 통해 외부로 출력한다.Therefore, the first acoustic transceiver 112 that receives it outputs it to the outside through the speaker 111.

상기에서 상세히 설명한 바와 같이, 본 발명은 단체 화상 회의 시스템에서 원단 사용자가 근단 회의 참가자의 위치에 따라 근단 카메라 각도와 줌 인자를 제어하는 카메라 제어신호를 통해 상기 카메라의 최종 위치 변동 및 줌 크기에 따라 참가자 음성을 선택적으로 집중 및 증폭 또는 감소시킴으로써, 원단 회의자에 의해 움직여진 근단 카메라의 변환 인자만으로 음성 신호를 집중 및 증폭하므로 효율이 좋아지고, 상기 카메라의 움직임으로 마이크로폰을 가상적으로 움직여 현실감 및 현장감있는 회의가 가능한 효과가 있다.As described in detail above, the present invention provides the user according to the final position change and the zoom size of the camera through a camera control signal for controlling the far-end camera angle and the zoom factor according to the position of the near-end conference participant in the group video conferencing system. By selectively concentrating and amplifying or reducing the participant's voice, the voice signal is concentrated and amplified only by the conversion factor of the near-end camera moved by the far-end conferee, and the efficiency is improved. Meetings are possible.

또한, 원단 회의자가 보내온 카메라의 줌인자의 변화를 통해 원단회의자가 근단회자의 목소리의 크기를 제어함으로써, 수신자의 주관적인 청각 특성에 따른 화자의 음량을 조절하여 쾌적한 회의 환경을 제공하는 효과가 있다.In addition, the far-end conference control the size of the voice of the near-end party through the change of the zoom factor of the camera sent by the far-end conference, it is effective to provide a comfortable meeting environment by adjusting the volume of the speaker according to the subjective auditory characteristics of the receiver.

Claims

A beamformer for receiving an external sound signal through a plurality of microphones and selectively outputting a desired speaker's voice by combining them according to the current speaker's position; A change amount measuring unit for detecting a position change and a zoom factor of the camera; A positional information analyzer which receives an output signal of the change amount measurer and detects a current position of the speaker; A volume controller for amplifying or attenuating the output signal of the beam former through a multiplier according to the output signal of the position information analyzer; And a sound transceiver for transmitting and receiving the output signal of the multiplier to the far-end terminal.

The group video conferencing system according to claim 1, wherein the change amount specifier stores the direction of the camera operation message, and detects the position change of the camera and the zoom factor by examining the number of continuous messages having the same direction until the stop message. Volume control circuit.

The volume control circuit of claim 1, wherein the position information analyzer calculates a position vector of the speaker according to a change in the position of the camera to detect a position vector between the current position of the speaker and the microphone.

The multiplier of claim 1, wherein the volume controller receives a zoom factor of a camera for viewing a near end speaker's image larger or smaller by a far-end user to detect a difference value from a previous zoom value to control an amplification gain level of the multiplier. A volume control circuit of a group video conference system.

The multiplier according to claim 4, wherein the volume controller transmits an output signal of the beamformer by greatly adjusting the amplification gain level of the multiplier when looking closer through the auto zoom function, and down-adjusting the amplification gain level of the multiplier when viewing the distance And transmits the output signal of the beamformer to a smaller level.