KR20170072783A

KR20170072783A - Channel adaptive audio mixing method for multi-point conference service

Info

Publication number: KR20170072783A
Application number: KR1020160116531A
Authority: KR
Inventors: 김도영; 장종현; 김현
Original assignee: 한국전자통신연구원
Priority date: 2015-12-16
Filing date: 2016-09-09
Publication date: 2017-06-27

Abstract

본 발명의 실시 예들은 다자간 회의 서비스에서의 오디오 믹싱 방법에 관한 것으로, 본 발명의 일 실시 예에 따른 다자간 회의 서비스에서의 채널 상태 적응형 오디오 믹싱 방법은, 복수의 회의 단말로부터 음성 데이터를 수신하는 단계; 상기 복수의 회의 단말 각각이 채널 상태 검출 기능 및 채널 상태 정보 교환 능력을 보유하는지 여부를 확인하는 단계; 채널 상태 검출 기능 및 채널 상태 정보 교환 능력을 보유하는 회의 단말로부터 채널 상태 정보를 수신하는 단계; 채널 상태 검출 기능 및 채널 상태 정보 교환 능력을 보유하지 않은 회의 단말로부터 수신된 음성 데이터로부터 채널 상태 정보를 획득하는 단계; 및 상기 채널 상태 정보들을 기반으로 하울링 또는 에코가 발생한 채널을 확인하고, 확인된 채널의 음성 데이터를 제외한 나머지 채널의 음성 데이터를 믹싱하여 회의 음성을 생성하는 단계를 포함한다. 본 발명의 실시 예들에 따르면, 하울링, 에코 또는 과도한 잡음이 발생한 채널의 음성 데이터를 오디오 믹싱 과정에서 원천적으로 제외시킴으로써 다자간 회의 서비스의 중단을 예방하고 음성 품질을 향상시킬 수 있다. The embodiments of the present invention relate to a method of mixing audio in a multi-party conference service, wherein a channel state adaptive audio mixing method in a multi-party conferencing service includes receiving audio data from a plurality of conference terminals step; Confirming whether each of the plurality of conference terminals has a channel state detection function and a channel state information exchange capability; Receiving channel state information from a conference terminal having a channel state detection function and a channel state information exchange capability; Obtaining channel state information from voice data received from a conference terminal having no channel state detection function and channel state information exchange capability; And a step of checking a channel where the howling or echo is generated based on the channel state information and mixing the voice data of the remaining channels except the voice data of the identified channel to generate a conference voice. According to embodiments of the present invention, audio data of a channel in which a howling, an echo, or excessive noise occurs is originally excluded from the audio mixing process, thereby preventing the interruption of the multi-party conference service and improving the voice quality.

Description

[0001] The present invention relates to a channel adaptive audio mixing method for a multi-party conference service,

본 발명의 실시 예들은, 다자간 회의 서비스에서의 오디오 믹싱 방법에 관한 것이다. Embodiments of the present invention relate to a method of audio mixing in a multipoint conference service.

통신망을 통하여 다자간 회의 서비스(다자간 영상 회의 서비스 또는 다자간 음성 회의 서비스)를 제공하는 다지점 접속 제어 장치(MCU; Multi-point Control Unit)에서, 음성 데이터를 믹싱하기 위하여 일반적으로 아래 세 가지 방식이 이용된다. 첫째는, 다자간 회의에 참여하는 회의 단말로부터 수신되는 모든 음성 데이터를 균일한 비율로 믹싱(단순 혼합)하는 단순 혼합 방식이 있다. 둘째는, 화자에 해당하는 회의 단말을 모두 검출하고, 화자에 해당하는 회의 단말의 음성 데이터는 볼륨을 증폭한 후 나머지 회의 단말의 음성 데이터와 믹싱(화자 혼합)하여 화자의 음성 크기를 강조하는 화자 혼합 방식이 있다. 셋째는, 화자 중 주화자(main speaker)를 찾아서 주화자에 해당하는 회의 단말의 음성 데이터는 볼륨을 증폭한 후 나머지 회의 단말의 음성 데이터와 믹싱하여 주화자의 음성 크기를 강조하는 주화자 강조 혼합 방식이 있다. In a multi-point control unit (MCU) that provides a multi-party conference service (multi-party video conferencing service or multi-party voice conferencing service) through a communication network, the following three methods are generally used for mixing voice data do. First, there is a simple mixing method in which all the voice data received from conference terminals participating in a multi-party conference are mixed (simply mixed) at a uniform rate. Second, it detects all the conference terminals corresponding to the speaker, amplifies the volume of the voice data of the conference terminal corresponding to the speaker, and mixes (mixes the speaker) with the voice data of the remaining conference terminals to emphasize the voice size of the speaker There is a mixing method. Third, the voice data of the conference terminal corresponding to the main speaker is searched for the main speaker among the speakers, and the voice data of the conference terminal is mixed with the voice data of the remaining conference terminals after amplifying the volume, .

다자간 회의 서비스의 경우, 다지점 접속 제어 장치에서 효과적인 회의 진행이 가능하도록 음성 데이터를 믹싱하여 회의에 참여한 모든 회의 단말들에게 믹싱된 음성 데이터를 전송한다. 이 때, 가장 큰 문제점은 불특정한 어느 하나의 회의 단말에서 발생한 하울링, 에코 또는 과도한 잡음의 영향이 다른 모든 회의 단말에게도 전달되어 서비스 품질이 저하된다는 것이다. In the case of the multi-party conference service, the multi-point access control apparatus mixes the voice data so that the conference can be effectively performed, and transmits the mixed voice data to all the conference terminals participating in the conference. At this time, the biggest problem is that the influence of howling, echo, or excessive noise generated in any one unspecified conferencing terminal is transmitted to all other conferencing terminals, thereby degrading the service quality.

이 문제를 해결하기 위한 현재의 대표적인 해결책은 회의 단말마다 에코 제거와 잡음 억제 기능, 그리고 하울링 검출과 제거 기능을 구비하게 하여 음성 데이터 품질의 저하를 막는 것이다. 하지만 회의 단말이 설치된 환경에서 마이크 또는 스피커의 위치가 바뀌면 에코의 경로도 변화하여 에코나 하울링이 발생하게 되며, 복수의 회의 단말 중 어느 하나의 회의 단말에서 발생한 소음이나 잡음도 다지점 접속 제어 장치에서 믹싱된 음성 신호에 포함되어, 다자간 회의에 참여한 모든 회의 단말에게 전파되는 문제점이 있다. A typical solution for solving this problem is to prevent degradation of voice data quality by providing echo cancellation, noise suppression function, and howling detection and cancellation function for each conference terminal. However, when the position of the microphone or the speaker is changed in the environment where the conference terminal is installed, the echo path changes and echo or howling occurs. Also, the noise or noise generated at any one of the conference terminals Is included in the mixed voice signal, and is spread to all the conference terminals participating in the multi-party conference.

일본 공개 특허 특개 2003-23499 (회의 서버 장치 및 회의 시스템)Japanese Unexamined Patent Application Publication No. 2003-23499 (conference server apparatus and conference system)

본 발명의 실시 예들은, 다자간 회의 서비스에 참여하는 회의 단말 중 하울링, 에코 또는 과도한 잡음이 발생한 회의 단말의 음성 데이터를 제외하고 오디오 믹싱을 수행하는 방안을 제공한다. Embodiments of the present invention provide a method for performing audio mixing except audio data of howling, echo, or excessive noise of a conference terminal participating in a multi-party conference service.

본 발명의 실시 예들은, 회의 단말의 채널 상태 정보 검출 기능의 유무에 따라 다지점 접속 제어 장치가 적응적으로 채널 상태 정보를 검출하는 방안을 제공하여 임의의 회의 단말에서 발생한 하울링, 에코 또는 과도한 잡음에 의한 다자간 회의 서비스의 중단을 예방한다. In embodiments of the present invention, a multi-point access control apparatus adaptively detects a channel state information according to presence or absence of a channel state information detection function of a conference terminal, and provides feedback on howling, echo or excessive noise To prevent the interruption of multi-party conferencing services.

본 발명의 일 실시 예에 따른 다자간 회의 서비스에서의 채널 상태 적응형 오디오 믹싱 방법은, 복수의 회의 단말로부터 음성 데이터를 수신하는 단계; 상기 복수의 회의 단말 각각이 채널 상태 검출 기능 및 채널 상태 정보 교환 능력을 보유하는지 여부를 확인하는 단계; 채널 상태 검출 기능 및 채널 상태 정보 교환 능력을 보유하는 회의 단말로부터 채널 상태 정보를 수신하는 단계; 채널 상태 검출 기능 및 채널 상태 정보 교환 능력을 보유하지 않은 회의 단말로부터 수신된 음성 데이터로부터 채널 상태 정보를 획득하는 단계; 및 상기 채널 상태 정보들을 기반으로 하울링 또는 에코가 발생한 채널을 확인하고, 확인된 채널의 음성 데이터를 제외한 나머지 채널의 음성 데이터를 믹싱하여 회의 음성을 생성하는 단계를 포함한다. A channel state adaptive audio mixing method in a multi-party conference service according to an embodiment of the present invention includes: receiving voice data from a plurality of conference terminals; Confirming whether each of the plurality of conference terminals has a channel state detection function and a channel state information exchange capability; Receiving channel state information from a conference terminal having a channel state detection function and a channel state information exchange capability; Obtaining channel state information from voice data received from a conference terminal having no channel state detection function and channel state information exchange capability; And a step of checking a channel where the howling or echo is generated based on the channel state information and mixing the voice data of the remaining channels except the voice data of the identified channel to generate a conference voice.

본 발명의 실시 예들에 따르면, 하울링, 에코 또는 과도한 잡음이 발생한 채널의 음성 데이터를 오디오 믹싱 과정에서 원천적으로 제외시킴으로써 다자간 회의 서비스의 중단을 방지하고 다자간 회의 서비스의 음성 품질을 향상시킬 수 있다. According to embodiments of the present invention, voice data of a channel in which a howling, an echo, or excessive noise occurs is originally excluded from the audio mixing process, thereby preventing the interruption of the multi-party conference service and improving the voice quality of the multi-party conference service.

본 발명의 실시 예들에 따르면, 회의 단말이 채널 상태 정보 검출 기능을 보유하였는지 여부에 따라 다지점 접속 제어 장치가 적응적으로 채널 상태 정보를 검출할 수 있고, 검출된 채널 상태 정보를 기반으로 향상된 음성 데이터 품질을 갖는 다자간 회의 서비스를 제공할 수 있다. According to embodiments of the present invention, a multi-point access control apparatus adaptively detects channel state information according to whether a conference terminal has a channel state information detection function, It is possible to provide a multi-party conference service having data quality.

도 1은 본 발명의 실시 예들이 적용되는 다자간 회의 시스템을 설명하기 위한 예시도,
도 2는 본 발명의 일 실시 예에 따른 다지점 접속 제어 장치를 설명하기 위한 예시도,
도 3은 본 발명의 일 실시 예에 따른 다지점 접속 제어 장치에서의 채널 상태 적응형 오디오 믹싱 방법을 설명하기 위한 흐름도. 1 is an exemplary view for explaining a multi-party conference system to which embodiments of the present invention are applied;
FIG. 2 is an exemplary diagram for explaining a multi-point connection control apparatus according to an embodiment of the present invention;
3 is a flowchart illustrating a channel state adaptive audio mixing method in a multi-point connection control apparatus according to an embodiment of the present invention.

이하에서, 본 발명의 실시 예들을 설명함에 있어, 관련된 공지 기능 또는 구성에 대한 구체적인 설명이 본 발명의 요지를 불필요하게 흐릴 수 있다고 판단되는 경우에는 그 상세한 설명을 생략한다. In the following description of the embodiments of the present invention, a detailed description of known functions and configurations incorporated herein will be omitted when it may make the subject matter of the present invention rather unclear.

이하, 첨부되는 도면을 참조하여 본 발명의 실시 예들을 설명한다. Hereinafter, embodiments of the present invention will be described with reference to the accompanying drawings.

도 1은 본 발명의 실시 예들이 적용되는 다자간 회의 시스템을 설명하기 위한 예시도이다. FIG. 1 is an exemplary diagram for explaining a multi-party conference system to which embodiments of the present invention are applied.

회의 단말(200)은, 카메라 등의 영상 촬영 모듈, 모니터 등의 영상 출력 모듈, 마이크 등의 음성 입력 모듈 및 스피커 등의 음성 출력 모듈을 포함할 수 있다. 회의 단말(200)은, 영상 촬영 모듈 및 음성 입력 모듈을 통하여 획득한 사용자의 영상 데이터 및 사용자의 음성 데이터를 인터넷 등의 네트워크를 통하여 다지점 접속 제어 장치(100)에게 전송할 수 있다. The conference terminal 200 may include a video photographing module such as a camera, a video output module such as a monitor, a voice input module such as a microphone, and a voice output module such as a speaker. The conference terminal 200 can transmit the user's image data and the user's voice data acquired through the image capturing module and the voice input module to the multi-point access control apparatus 100 through a network such as the Internet.

다지점 접속 제어 장치(100)는, 복수의 회의 단말(200)로부터 영상 데이터 및 음성 데이터를 수신할 수 있다. 다지점 접속 제어 장치(100)는, 수신된 영상 데이터 및 음성 데이터를 믹싱하고, 믹싱 결과 생성된 회의 영상 및 회의 음성을 회의 단말(200)로 전송할 수 있다. The multi-point connection control apparatus 100 can receive video data and audio data from a plurality of conference terminals 200. [ The multi-point access control apparatus 100 may mix the received video data and audio data, and may transmit the conference video and the conference voice generated as a result of the mixing to the conference terminal 200.

도 2는 본 발명의 일 실시 예에 따른 다지점 접속 제어 장치를 설명하기 위한 예시도이다. 2 is an exemplary diagram for explaining a multi-point connection control apparatus according to an embodiment of the present invention.

본 발명의 일 실시 예에 따른 다지점 접속 제어 장치는, 오디오 믹싱부(110), 하울링 검출부(120), 에코 검출부(130) 및 제어부(140)를 포함한다. The multi-point connection control apparatus according to an embodiment of the present invention includes an audio mixing unit 110, a howling detection unit 120, an echo detection unit 130, and a control unit 140.

오디오 믹싱부(110)는, 복수의 회의 단말로부터 채널별 음성 데이터를 수신할 수 있다. 오디오 믹싱부(110)는, 복수의 회의 단말로부터 수신한 음성 데이터 중 하울링 또는 에코가 발생한 음성 데이터를 제외하고 나머지 음성 데이터를 믹싱하여 회의 음성을 생성할 수 있다. 이 때, 오디오 믹싱부(110)는, 회의 음성을 수신할 회의 단말로부터 수신된 음성 데이터를 제외하고 믹싱을 수행함으로써 회의 음성을 생성할 수 있다. 오디오 믹싱부(110)는, 하울링, 에코 또는 과도한 잡음이 발견된 회의 단말의 채널 번호와 해당 회의 단말의 음성 데이터가 회의 음성에서 제외되었음을 나타내는 정보를 포함하는 믹싱 상태 정보를 생성하고, 생성된 믹싱 상태 정보를 복수의 회의 단말에게 제공할 수 있다. 이를 위하여, 오디오 믹싱부(110)는, 과도한 잡음이 발생된 채널을 확인할 수 있는 잡음 검출 기능을 보유할 수 있다. The audio mixing unit 110 can receive channel-specific audio data from a plurality of conference terminals. The audio mixing unit 110 may generate the conference voice by mixing the remaining voice data except the howling or echo generated voice data among the voice data received from the plurality of conference terminals. At this time, the audio mixing unit 110 can generate the conference voice by performing the mixing by excluding the voice data received from the conference terminal to receive the conference voice. The audio mixing unit 110 generates mixing state information including a channel number of a conference terminal in which howling, echo, or excessive noise is found and information indicating that speech data of the conference terminal is excluded from the conference speech, Status information can be provided to a plurality of conference terminals. To this end, the audio mixing unit 110 may have a noise detection function that can confirm a channel in which excessive noise is generated.

하울링 검출부(120)는, 복수의 회의 단말로부터 채널별 음성 데이터를 수신할 수 있다. 하울링 검출부(120)는, 복수의 회의 단말로부터 수신된 음성 데이터 중 제어부(140)가 채널 상태 정보를 검출할 것을 명령한 채널의 음성 데이터에서 하울링이 발생하였는지 여부를 판단하고, 하울링이 발생하였는지 여부를 알리는 채널 상태 정보를 제어부(140)에게 전달할 수 있다. 하울링 검출부(120)는, 예를 들어, 약 20msec 시간 단위로 패킷화된 음성 데이터를 버퍼링하고, 버퍼링된 음성 데이터를 주파수 영역의 신호로 변환한 후, 주파수 영역 신호의 에너지 크기를 기준으로 하울링 후보 주파수를 선택할 수 있다. 하울링 검출부(120)는, 하울링 후보 주파수의 파워 스펙트럼과 단기(short) 평균 파워 스펙트럼의 비(ratio)와 장기(long) 평균 파워 스펙트럼의 비(ratio)가 미리 설정한 임계값보다 크다면, 해당 하울링 후보 주파수에서 하울링이 발생하였다고 판단할 수 있다. The howling detection unit 120 can receive channel-specific voice data from a plurality of conference terminals. The howling detection unit 120 determines whether howling has occurred in the audio data of the channel in which the control unit 140 instructs the control unit 140 to detect the channel state information among the audio data received from the plurality of conference terminals, To the control unit 140. The control unit 140 may be configured to transmit the channel state information to the control unit 140. [ For example, the howling detection unit 120 buffers voice data packetized in units of about 20 msec time, converts the buffered voice data into a frequency domain signal, and then, based on the energy size of the frequency domain signal, The frequency can be selected. If the ratio of the power spectrum of the Howling candidate frequency to the short average power spectrum and the ratio of the long mean power spectrum are larger than a preset threshold value, It can be determined that howling occurs at the howling candidate frequency.

에코 검출부(130)는, 복수의 회의 단말로부터 채널별 음성 데이터를 수신할 수 있다. 에코 검출부(130)는, 복수의 회의 단말로부터 수신된 음성 데이터 중 제어부(140)가 채널 상태 정보를 검출할 것을 명령한 채널의 음성 데이터에서 에코가 발생하였는지 여부를 판단하고, 에코가 발생하였는지 여부를 알리는 채널 상태 정보를 제어부(140)에게 전달할 수 있다. The echo detecting unit 130 can receive channel-specific audio data from a plurality of conference terminals. The echo detecting unit 130 determines whether echo has occurred in the audio data of the channel in which the control unit 140 instructs the control unit 140 to detect the channel state information among the audio data received from the plurality of conference terminals, To the control unit 140. The control unit 140 may be configured to transmit the channel state information to the control unit 140. [

제어부(140)는, 회의 단말과의 통신을 수행하여, 각각의 회의 단말이 채널 상태 검출 기능 및 채널 상태 정보 교환 능력을 보유하였는지 여부를 확인할 수 있다. The control unit 140 can perform communication with the conference terminal and confirm whether or not each conference terminal has the channel state detection function and the channel state information exchange capability.

회의 단말이 채널 상태 검출 기능 및 채널 상태 정보 교환 능력을 보유한 경우, 제어부(140)는 해당 회의 단말로부터 수신된 음성 데이터에서 채널 상태 정보를 검출하지 않을 것을 하울링 검출부(120) 및 에코 검출부(130)에게 명령할 수 있다. 그리고, 제어부(140)는, 해당 회의 단말로부터 채널 상태 정보를 수신할 수 있다. If the conference terminal has the channel state detection function and the channel state information exchange capability, the control unit 140 notifies the howling detection unit 120 and the echo detection unit 130 that channel state information is not detected in the voice data received from the corresponding conference terminal, . Then, the control unit 140 can receive channel state information from the conference terminal.

회의 단말이 채널 상태 검출 기능 및 채널 상태 정보 교환 능력을 보유하지 않은 경우, 제어부(140)는 해당 회의 단말로부터 수신된 음성 데이터에서 채널 상태 정보를 검출할 것을 하울링 검출부(120) 및 에코 검출부(130)에게 명령할 수 있다. 이에 따라, 하울링 검출부(120) 및 에코 검출부(130)는 해당 회의 단말로부터 수신된 음성 데이터에서 채널 정보를 검출하고, 검출된 채널 정보를 제어부(140)에게 제공할 수 있다. If the conference terminal does not have the channel state detection function and the channel state information exchange capability, the control unit 140 controls the wayl detection unit 120 and the echo detection unit 130 to detect the channel state information in the voice data received from the conference terminal ). Accordingly, the howling detection unit 120 and the echo detection unit 130 can detect channel information in the voice data received from the corresponding conference terminal, and provide the detected channel information to the control unit 140.

제어부(140)는, 채널 상태 정보를 기반으로 하울링 및 에코가 발생한 채널이 있는지 여부를 확인하고, 하울링 또는 에코가 발생한 채널을 오디오 믹싱부(110)에게 통지할 수 있다. The control unit 140 can check whether there is a channel where the howling and echo are generated based on the channel state information, and notify the audio mixing unit 110 of the channel where the howling or echo occurs.

도 3은 본 발명의 일 실시 예에 따른 다지점 접속 제어 장치에서의 채널 상태 적응형 오디오 믹싱 방법을 설명하기 위한 흐름도이다. 3 is a flowchart illustrating a channel state adaptive audio mixing method in a multi-point access control apparatus according to an embodiment of the present invention.

단계(301)에서, 오디오 믹싱부(110)는, 제어부(140)로부터 영상 회의에 참여한 회의 단말의 개수와 채널 번호를 수신할 수 있다. 오디오 믹싱부(110)는, 영상 회의에 참여한 회의 단말의 개수에 해당하는 음성 데이터를 수신할 수 있다. 즉, 오디오 믹싱부(110)는, 영상 회의에 참여한 복수의 회의 단말로부터 채널별 음성 데이터를 수신할 수 있다. In operation 301, the audio mixing unit 110 may receive the number of conference terminals participating in the video conference and the channel number from the control unit 140. [ The audio mixing unit 110 may receive voice data corresponding to the number of conference terminals participating in the video conference. That is, the audio mixing unit 110 can receive channel-specific audio data from a plurality of conference terminals participating in the video conference.

단계(303)에서, 제어부(140)는, 영상 회의에 참여한 복수의 회의 단말과 통신을 수행하여 해당 회의 단말들이 채널 상태 검출 기능(하울링 검출 기능 및 에코 검출 기능)을 보유하였는지 여부와, 해당 검출 기능들을 이용하여 획득된 채널 상태 정보를 다지점 접속 제어 장치와 교환할 수 있는 능력이 있는지 여부를 확인할 수 있다. In step 303, the control unit 140 communicates with a plurality of conference terminals participating in the video conference to determine whether or not the conference terminals have a channel state detection function (howling detection function and echo detection function) Functions can be used to verify whether or not it is capable of exchanging acquired channel state information with a multi-point access control device.

단계(305)에서, 제어부(140)는, 채널 상태 검출 기능 및 채널 상태 정보 교환 능력을 보유한 회의 단말로부터 수신된 음성 데이터에서 채널 상태 정보 검출을 수행하지 않을 것을 하울링 검출부(120) 및 에코 검출부(130)에게 명령할 수 있다. 그리고, 제어부(140)는, 해당 회의 단말로부터 해당 회의 단말의 채널 상태 정보를 수신할 수 있다. In step 305, the controller 140 controls the howling detection unit 120 and the echo detection unit 120 to not perform the channel state information detection on the voice data received from the conference terminal having the channel state detection function and the channel state information exchange capability 130). Then, the control unit 140 can receive channel state information of the conference terminal from the conference terminal.

단계(307)에서, 제어부(140)는, 채널 상태 검출 기능 및 채널 상태 정보 교환 능력을 보유한 회의 단말로부터 수신된 음성 데이터에서 채널 상태 정보 검출을 수행할 것을 하울링 검출부(120) 및 에코 검출부(130)에게 명령할 수 있다. 이에 따라, 하울링 검출부(120) 및 에코 검출부(130)는 해당 회의 단말로부터 수신된 음성 데이터로부터 채널 상태 정보를 검출하고, 검출된 채널 상태 정보를 제어부(140)에게 제공할 수 있다. In step 307, the control unit 140 controls the howling detection unit 120 and the echo detection unit 130 to perform channel state information detection on the voice data received from the conference terminal having the channel state detection function and the channel state information exchange capability ). Accordingly, the howling detecting unit 120 and the echo detecting unit 130 may detect the channel state information from the voice data received from the conference terminal, and provide the detected channel state information to the controller 140.

단계(309)에서, 제어부(140)는, 에코 및 하울링 중 적어도 하나가 발생한 채널이 있는지 여부를 확인할 수 있다. 만약, 에코 및 하울링 중 적어도 하나가 발생한 채널이 있는 경우, 제어부(140)는, 해당 채널이 어떤 채널인지를 오디오 믹싱부(110)에게 통지할 수 있다. In step 309, the control unit 140 can check whether there is a channel in which at least one of echo and howling occurs. If there is a channel where at least one of echo and howling occurs, the control unit 140 can notify the audio mixing unit 110 of what channel the corresponding channel is.

단계(311)에서, 오디오 믹싱부(110)는, 에코 및 하울링 중 적어도 하나가 발생한 채널의 음성 데이터를 제외하고 나머지 채널의 음성 데이터들을 믹싱함으로써 회의 음성을 생성할 수 있다. 이 때, 오디오 믹싱부(110)는, 회의 음성을 수신할 채널(회의 단말)의 음성 데이터를 더 제외하고 나머지 채널의 음성 데이터들을 믹싱함으로써 회의 음성을 생성할 수 있다. 만약, 에코 또는 하울링이 발생한 채널이 없다면, 오디오 믹싱부(110)는, 회의 음성을 수신할 채널의 음성 데이터만을 제외하고 나머지 채널을 음성 데이터를 믹싱함으로써 회의 음성을 생성할 수 있다. In step 311, the audio mixing unit 110 may generate the conference voice by mixing the voice data of the remaining channels except for the voice data of the channel where at least one of echo and howling occurs. At this time, the audio mixing unit 110 may generate the conference voice by mixing the voice data of the remaining channels except the voice data of the channel (conference terminal) to receive the conference voice. If there is no channel in which the echo or howling occurs, the audio mixing unit 110 may generate a conference voice by mixing only voice data of a channel to receive the conference voice and voice data of the remaining channels.

단계(313)에서, 오디오 믹싱부(110)는, 하울링, 에코 또는 과도한 잡음이 발견된 회의 단말의 채널 번호와 해당 회의 단말의 음성 데이터가 회의 음성에서 제외되었음을 나타내는 정보를 포함하는 믹싱 상태 정보를 생성하고, 생성된 믹싱 상태 정보를 복수의 회의 단말에게 제공할 수 있다. In step 313, the audio mixing unit 110 receives mixing status information including information indicating that the channel number of the conference terminal in which howling, echo, or excessive noise is found and the voice data of the conference terminal are excluded from the conference voice And provide the generated mixing state information to a plurality of conference terminals.

이상에서 설명된 본 발명의 실시 예들은 임의의 다양한 방법으로 구현될 수 있다. 예를 들어, 본 발명의 실시 예들은 하드웨어, 소프트웨어 또는 그 조합을 이용하여 구현될 수 있다. 소프트웨어로 구현되는 경우에, 다양한 운영 체제 또는 플랫폼을 이용하는 하나 이상의 프로세서 상에서 실행되는 소프트웨어로서 구현될 수 있다. 추가적으로, 그러한 소프트웨어는 다수의 적합한 프로그래밍 언어들 중에서 임의의 것을 사용하여 작성될 수 있고, 또한 프레임워크 또는 가상 머신에서 실행 가능한 기계어 코드 또는 중간 코드로 컴파일 될 수 있다. The embodiments of the invention described above may be implemented in any of a variety of ways. For example, embodiments of the present invention may be implemented using hardware, software, or a combination thereof. When implemented in software, it may be implemented as software running on one or more processors using various operating systems or platforms. Additionally, such software may be written using any of a number of suitable programming languages, and may also be compiled into machine code or intermediate code executable in a framework or virtual machine.

또한, 본 발명의 실시 예들이 하나 이상의 프로세서 상에서 실행되는 경우 이상에서 논의된 본 발명의 다양한 실시 예들을 구현하는 방법을 수행하기 위한 하나 이상의 프로그램이 기록된 프로세서 판독 가능 매체(예를 들어, 메모리, 플로피 디스크, 하드 디스크, 콤팩트 디스크, 광학 디스크 또는 자기 테이프 등)로 구현될 수 있다. Also, when embodiments of the present invention are implemented on one or more processors, one or more programs for carrying out the methods of implementing the various embodiments of the invention discussed above may be stored on a processor readable medium (e.g., memory, A floppy disk, a hard disk, a compact disk, an optical disk, a magnetic tape, or the like).

Claims

Receiving voice data from a plurality of conference terminals;
Confirming whether each of the plurality of conference terminals has a channel state detection function and a channel state information exchange capability;
Receiving channel state information from a conference terminal having a channel state detection function and a channel state information exchange capability;
Obtaining channel state information from voice data received from a conference terminal having no channel state detection function and channel state information exchange capability; And
Checking a channel in which howling or echo occurs based on the channel state information, and generating a conference voice by mixing voice data of the remaining channels except for the voice data of the confirmed channel
And a channel state adaptive audio mixing method in a multi-party conference service.