KR20090056597A

KR20090056597A - Method and apparatus for calibrating the sound source signal acquired through the microphone array

Info

Publication number: KR20090056597A
Application number: KR1020070123818A
Authority: KR
Inventors: 정소영; 오광철; 정재훈; 김규홍
Original assignee: Samsung Electronics Co., Ltd. (삼성전자주식회사)
Priority date: 2007-11-30
Filing date: 2007-11-30
Publication date: 2009-06-03
Also published as: KR101459317B1

Abstract

A method and an apparatus for correcting sound source signals are provided that prevent distortion of the sound source signals caused by characteristic mismatch between individual microphones, by correcting the signals according to a calculated reference probability distribution. A probability distribution calculator calculates probability distributions that express, for each magnitude interval of the sound source signals, the number of sound source signals falling in that interval. The probability distribution calculator includes an accumulator, which calculates and accumulates the probability distributions of the signal magnitudes over the intervals. A reference probability distribution calculator (211) calculates, from the accumulated probability distributions, a reference probability distribution that represents them. A signal corrector (212) corrects the sound source signals according to the calculated reference probability distribution.

Description

Method and apparatus for calibrating the sound source signal acquired through the microphone array

The present invention relates to a method and apparatus for correcting sound source signals acquired through a microphone array. More specifically, it relates to a method and apparatus for correcting differences in the characteristics of the acquired sound source signals when sound is captured for voice calls, recording, or voice recognition in portable sound acquisition devices equipped with a microphone array, such as mobile phones, UMPCs (ultra mobile personal computers), and camcorders, and in digital information devices such as DTVs (digital televisions) and remote video conferencing systems.

Making phone calls with a mobile phone and capturing sound with digital camcorders and recorders have become part of everyday life. Various digital devices, such as CE (consumer electronics) devices and mobile phones, use a microphone as the means of acquiring sound. To implement stereo sound that uses two or more channels, rather than single-channel mono sound, a microphone array containing a plurality of microphones is generally used.

By combining a plurality of microphones, a microphone array can capture not only the sound itself but also additional properties related to directivity, such as the direction or position of the sound to be acquired. Directivity refers to increasing the sensitivity to a sound source signal emitted from a source located in a specific direction, by exploiting the differences in the time at which the signal arrives at each of the microphones in the array. Acquiring sound source signals with such a microphone array therefore makes it possible to emphasize or suppress a sound source signal arriving from a specific direction.

Adjusting such a microphone array means adjusting directivity control parameters, such as the delay value of each microphone in the array or the spacing between microphones. For the microphone array to be adjusted as the user intends, a basic precondition must hold: the array must behave in accordance with the physical characteristics of the individual microphones (that is, signal magnitude, phase, frequency response, and so on).

The technical problem to be solved by the present invention is to provide a sound source signal correction method and apparatus that, in a digital device capable of sound acquisition, resolve the distortion of sound source signals input through a microphone array that is caused by characteristic mismatch between the individual microphones constituting the array.

To achieve the above technical problem, the sound source signal correction method according to the present invention comprises: calculating probability distributions that express, as probabilities, the number of sound source signals present in each magnitude interval of the sound source signals acquired through a microphone array; calculating a reference probability distribution representative of the calculated probability distributions; and correcting the sound source signals according to the calculated reference probability distribution.

To solve another technical problem, the present invention provides a computer-readable recording medium on which a program for executing the above sound source signal correction method on a computer is recorded.

To achieve the above technical problem, the sound source signal correction apparatus according to the present invention comprises: a probability distribution calculator that calculates probability distributions expressing, as probabilities, the number of sound source signals present in each magnitude interval of the sound source signals acquired through a microphone array; a reference probability distribution calculator that calculates a reference probability distribution representative of the calculated probability distributions; and a signal corrector that corrects the sound source signals according to the calculated reference probability distribution.

Hereinafter, various embodiments of the present invention are described in detail with reference to the drawings. In describing the embodiments, the term sound source means a source from which sound is emitted. Sound pressure expresses the force exerted by acoustic energy as a physical quantity of pressure, and a sound pressure field is a conceptual representation of the region, centered on the sound source, over which the sound pressure acts.

FIG. 1 is a block diagram illustrating the general idea of correcting a sound source signal in the problem situation addressed by the present invention, and includes a microphone array 100, a sound source signal corrector 110, and a signal processor 120. Before each component is described, the problem situation addressed by the present invention is explained first.

As described above, a microphone array is used to exploit directional properties of sound such as directivity. To receive a target signal mixed with background noise with high sensitivity, a microphone array generally applies an appropriate weight to each received signal to boost its amplitude. When the desired target signal and the interfering noise arrive from different directions, the array thereby acts as a filter that reduces the noise spatially. This kind of spatial filter is called a beamformer.
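For illustration only, the weighting idea behind a beamformer can be sketched as a weighted delay-and-sum over the channels. This is a generic textbook form and not a structure disclosed in this document; the function name delay_and_sum, the integer sample delays, and the gain values are all hypothetical.

```python
def delay_and_sum(channels, delays, weights):
    """Weighted delay-and-sum over equally sampled channels.

    channels: list of per-microphone sample lists.
    delays:   non-negative integer sample delays (hypothetical steering values).
    weights:  per-channel gains (hypothetical).
    """
    # Only time indices that exist in every delayed channel are used.
    usable = min(len(ch) - d for ch, d in zip(channels, delays))
    return [
        sum(w * ch[t + d] for ch, d, w in zip(channels, delays, weights))
        for t in range(usable)
    ]


# The same pulse reaches microphone B one sample later than microphone A.
mic_a = [0.0, 1.0, 0.0, 0.0]
mic_b = [0.0, 0.0, 1.0, 0.0]
aligned = delay_and_sum([mic_a, mic_b], delays=[0, 1], weights=[0.5, 0.5])

# If microphone B has half the expected sensitivity, the same steering
# parameters no longer reproduce the pulse at full amplitude.
mic_b_low_gain = [0.5 * x for x in mic_b]
mismatched = delay_and_sum([mic_a, mic_b_low_gain], delays=[0, 1], weights=[0.5, 0.5])
```

In this toy case the matched microphones reconstruct the pulse at full amplitude, while the gain-mismatched pair yields a smaller peak, which is the kind of deviation the correction described in this document is meant to remove before beamforming.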

To process sound source signals with such a beamformer, however, the physical characteristics of each individual microphone in the array must match what is known to the user. That is, the magnitudes, phases, and frequency responses of the signals acquired through the individual microphones must all match, because the parameters used to adjust the microphone array are based on these physical characteristics of the individual microphones.

If a sound source signal is delayed by a certain time or its magnitude is changed in order to adjust the microphone array, but the individual microphones in the array do not respond as the user intends, the result can only be a distorted sound source signal that differs from the user's intention. For example, GSC (generalized side-lobe canceller), a representative adaptive beamformer algorithm, attempts to remove side lobes, which are unwanted surrounding interference sound pressure fields. If there is a characteristic mismatch between the individual microphones of the array, the mismatch causes signal leakage, and the sound source signal is distorted as a result.

The characteristic mismatch between microphones described above arises mainly from two causes. First, it may result from errors introduced when the microphones are manufactured. Second, the physical characteristics of a microphone may change through aging as time passes during use. Both causes originate in the manufacture or use of real products and are in many cases unavoidable. Accordingly, when these problems occur in a sound acquisition device equipped with a microphone array, the various embodiments of the present invention described below aim to correct the sound source signals so that the device is insensitive to characteristic mismatch between the microphones. With this problem situation in mind, the schematic configuration of FIG. 1 is as follows.

The microphone array 100 acquires sound source signals from the outside. The way the microphone array 100 is adjusted, for example with respect to the direction of the sound source or the magnitude of the sound source signal, may be designed in various ways depending on the situation and purpose for which an embodiment of the present invention is implemented.

The sound source signal corrector 110 corrects the characteristic mismatch of the sound source signals acquired through the microphone array 100. Broadly, two approaches can resolve the distortion of sound source signals caused by characteristic mismatch between individual microphones. First, each sound source signal can be corrected immediately after it is input through the microphone array. Second, the distortion can be corrected while the sound source signals input through the microphone array are being processed for a specific purpose. The various embodiments of the present invention follow the former approach, that is, correcting the input signals themselves directly. This is described in detail from FIG. 2 onward.

The signal processor 120 processes the sound source signals corrected by the sound source signal corrector 110 according to the user's purpose. It may be designed freely according to the embodiments and environments in which the present invention is implemented, for example as conventional sound source signal processing, a beamformer, or a sound source position tracker.

FIG. 2 is a block diagram illustrating a sound source signal correction apparatus according to an embodiment of the present invention, and includes a microphone array 200, a reference probability distribution calculator 211, a signal corrector 212, and a signal processor 220. The microphone array 200 and the signal processor 220 are the same as the microphone array 100 and the signal processor 120 described with reference to FIG. 1, and correspond to the configuration of a conventional sound source signal processing apparatus. In a stricter sense, therefore, the sound source signal correction apparatus according to this embodiment consists of the reference probability distribution calculator 211 and the signal corrector 212 contained in the region 210 drawn with a dotted line. The apparatus is described in detail below with a focus on these two components.

The reference probability distribution calculator 211 calculates probability distributions that express, as probabilities, the number of sound source signals present in each magnitude interval of the sound source signals acquired through the microphone array 200. In general, a probability distribution describes how a random variable is distributed and assigns a probability value to each event that can occur in a trial. The distribution of the random variable can be obtained by counting the number of values that fall within each interval when the variate is divided into intervals of constant width.

In the embodiments of the present invention, a probability distribution (PDF; probability distribution function) means the result of dividing the magnitudes of the sound source signals into fixed intervals, counting the number of sound source signals that fall in each interval, and expressing the counts as probabilities. A graph that divides a variate into intervals and shows the number of values in each interval (also called the frequency) is usually called a histogram. In the following, the term histogram is used to mean a graph that visually displays a probability distribution.

The magnitudes of the sound source signals on which the probability distribution is based can be expressed in various ways. Typically, they can be expressed as the magnitude of the signal in the time domain or as the magnitude of the signal in the frequency domain.

In the time domain, the magnitude of an input signal is the amplitude of the signal itself. The probability distribution can therefore be calculated by dividing the amplitudes of the input signals into fixed magnitude intervals and counting the number of signal values that fall in each interval. In the frequency domain, the magnitudes of the input signals are obtained as follows.

First, for digital signal processing of the sound source signals input through the microphone array 200, the signals are converted to the frequency domain by a fast Fourier transform for computational convenience. Digital signal processing generally uses convolution to express the output signal that a system produces for a given input signal, and the target signal is divided into frames so that it is processed in finite pieces. A frame is a unit obtained by dividing the sound source signal into fixed intervals over time. Once the sound source signals have been converted to the frequency domain frame by frame, the probability distribution is calculated in much the same way as in the time domain. That is, the magnitudes of the frequency-domain signal are divided into fixed magnitude intervals, and the number of values that fall in each interval is counted.
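The two ways of obtaining the probability distribution can be sketched as follows. This is a minimal illustration under several assumptions that are not stated in the source: equal-width intervals, the absolute value used as the time-domain magnitude, non-overlapping frames whose length is a power of two, and a small recursive radix-2 FFT in place of an optimized library routine. The names magnitude_pdf, fft, and frame_magnitudes and all parameter values are hypothetical.

```python
import cmath


def magnitude_pdf(magnitudes, num_bins, max_magnitude):
    """Count magnitudes per equal-width interval and return relative frequencies."""
    if not magnitudes:
        raise ValueError("magnitudes must be non-empty")
    bin_width = max_magnitude / num_bins
    counts = [0] * num_bins
    for m in magnitudes:
        # Values at or above max_magnitude are clamped into the last interval.
        counts[min(int(abs(m) / bin_width), num_bins - 1)] += 1
    return [c / len(magnitudes) for c in counts]


def fft(x):
    """Recursive radix-2 FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return list(x)
    even, odd = fft(x[0::2]), fft(x[1::2])
    out = [0j] * n
    for k in range(n // 2):
        twiddle = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + twiddle
        out[k + n // 2] = even[k] - twiddle
    return out


def frame_magnitudes(samples, frame_len):
    """Split one channel into non-overlapping frames and collect spectral magnitudes."""
    mags = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        mags.extend(abs(c) for c in fft(samples[start:start + frame_len]))
    return mags


# Time domain: the sample amplitudes themselves are binned.
time_pdf = magnitude_pdf([0.1, -0.2, 0.05, 0.7, -0.9, 0.3, 0.31, -0.15], 4, 1.0)

# Frequency domain: frame, transform, then bin the spectral magnitudes.
signal = [1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0]
freq_pdf = magnitude_pdf(frame_magnitudes(signal, frame_len=4), 4, 4.0)
```

The same binning routine serves both domains; only the values passed to it differ. In a multi-channel setting it would be applied once per microphone channel.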

The reference probability distribution calculator 211 first calculates, as probabilities, how each sound source signal is distributed over the intervals described above, and then calculates from these results a reference value that represents the individual sound source signals. This reference value is the reference probability distribution, which serves as the yardstick for correcting each sound source signal in the subsequent steps.

The process by which the reference probability distribution calculator 211 calculates the reference probability distribution from the input sound source signals is described in more detail below with reference to FIGS. 4, 5A, and 5B. In the following embodiments, the reference probability distribution is a cumulative probability distribution, obtained by accumulating the probability distributions from small signal magnitudes toward large ones, and more specifically a normalized cumulative probability distribution, obtained by normalizing the cumulative probability distribution to a specific value. Other ways of representing a probability distribution may of course be used in place of the cumulative and normalized cumulative probability distributions. The specific meaning and configuration of each are described below.

FIG. 4 illustrates a method of calculating a cumulative probability distribution in a sound source signal correction apparatus according to an embodiment of the present invention. The first graph is an example histogram of the probability distributions of the sound source signals input through the microphone array, and the second graph is an example histogram in which those probability distributions have been accumulated and normalized. The horizontal axis of each graph represents the magnitude intervals of the sound source signal, and the vertical axis represents the number of sound source signals in each interval, expressed as a probability distribution. FIG. 4 assumes that the microphone array consists of four microphones, and the sound source signals input through the individual microphones are shown as four channels.

In the left graph of FIG. 4, the probability distributions differ from channel to channel even though the input sound source signal is the same. In other words, the characteristic mismatch between the microphones is visible. When these probability distributions are accumulated along the horizontal axis from small signal magnitudes toward large ones and drawn as histograms, the result is a curve that rises as the signal magnitude increases, as shown in FIG. 4. Because of the characteristic mismatch between the microphones described above, however, the accumulated maximum values of these cumulative probability distributions do not coincide. That is, the accumulated results of the probability distributions differ from one another. A normalization step is therefore applied to bring the accumulated results into agreement. Here, normalization means matching the maximum value to a specific reference value so that calculation is easier. In this embodiment, the maximum value of the accumulated probability distribution is set to 1 and the minimum value to 0. The histogram produced by this process is shown in the second graph of FIG. 4, which depicts the normalized cumulative probability distribution function: the vertical axis is normalized to 1 for every channel, and the probability distributions of the four channels increase gradually toward the right.
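The accumulation and normalization described above can be sketched as follows. This is a minimal illustration that assumes each channel is given as a list of per-interval counts ordered from small to large magnitude. The function name normalized_cdf and the sample counts are hypothetical.

```python
def normalized_cdf(counts):
    """Accumulate per-interval counts from small to large magnitude,
    then scale so that the accumulated maximum equals 1."""
    if not counts:
        raise ValueError("counts must be non-empty")
    cumulative, running = [], 0
    for c in counts:
        running += c
        cumulative.append(running)
    peak = cumulative[-1]
    if peak == 0:
        raise ValueError("at least one interval must be non-empty")
    return [v / peak for v in cumulative]


# Two hypothetical channels whose accumulated maxima differ (10 versus 20).
channel_counts = [
    [2, 4, 3, 1],
    [2, 6, 8, 4],
]
cdfs = [normalized_cdf(c) for c in channel_counts]
```

After normalization every channel's curve ends at 1, so the curves can be compared interval by interval even though their raw accumulated totals differ.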

FIGS. 5A and 5B are detailed block diagrams illustrating the configuration that calculates the reference cumulative probability distribution in a sound source signal correction apparatus according to an embodiment of the present invention. They show the specific configuration used to generate the histograms of FIG. 4.

In FIG. 5A, the reference cumulative probability distribution calculator includes a normalized cumulative probability distribution calculator 510 and a representative value calculator 520.

As described above, the normalized cumulative probability distribution calculator 510 calculates the normalized cumulative probability distributions corresponding to the sound source signals from the per-interval signal distributions of the sound source signals received through the microphone array. Referring to FIG. 5B, the normalized cumulative probability distribution calculator 510 in turn consists of an accumulator 511 and a normalizer 512. The accumulator 511 calculates and accumulates the probability distributions of the signal magnitudes for each magnitude interval of the sound source signals. The normalizer 512 normalizes the probability distributions accumulated by the accumulator 511 so that their accumulated maximum values coincide.

The representative value calculator 520 calculates a representative value from the normalized cumulative probability distributions produced by the normalized cumulative probability distribution calculator 510 and sets it as the reference cumulative probability distribution. One sound source signal is generated for each individual microphone in the array. Once the normalized cumulative probability distribution for each of these signals has been calculated, a representative value for them is calculated and set as the reference for the subsequent sound source signal correction. For example, with four individual microphones, four normalized cumulative probability distributions are calculated, and a single reference cumulative probability distribution is derived from these four according to a specific rule. This rule may be implemented as any function that produces a representative value.

Any of the various methods commonly used to select a value that represents a set of values may be used to calculate the representative value, and such methods can readily be derived by a person of ordinary skill in the art to which the present invention pertains. Some typical methods are briefly illustrated below.

First, the average may be used as the representative value. That is, the average of the signal magnitudes of the plurality of normalized cumulative probability distributions is calculated and set as the reference cumulative probability distribution. Second, the median may be used as the representative value. In this method, the distribution of the sound source signals is taken into account, and the normalized cumulative probability distribution corresponding to the median signal magnitude is set as the reference cumulative probability distribution. In addition to these methods, it is also possible to select one or more of the sound source signals, assign greater weight to the selected signals, and calculate the representative value accordingly.
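The three rules above can be sketched as follows. The source does not specify exactly how the average or median is taken, so this sketch adopts one possible reading as an assumption: the normalized cumulative distributions share the same magnitude intervals, and the representative value is computed interval by interval across channels. The function name reference_cdf, the method labels, and the weights are hypothetical.

```python
from statistics import mean, median


def reference_cdf(cdfs, method="mean", weights=None):
    """Combine per-channel normalized cumulative distributions into one
    reference distribution, interval by interval (assumed interpretation)."""
    columns = list(zip(*cdfs))  # one tuple of channel values per interval
    if method == "mean":
        return [mean(col) for col in columns]
    if method == "median":
        return [median(col) for col in columns]
    if method == "weighted":
        if weights is None or len(weights) != len(cdfs):
            raise ValueError("one weight per channel is required")
        total = sum(weights)
        return [sum(w * v for w, v in zip(weights, col)) / total for col in columns]
    raise ValueError(f"unknown method: {method}")


# Three hypothetical channels over three magnitude intervals.
cdfs = [
    [0.25, 0.5, 1.0],
    [0.5, 0.75, 1.0],
    [0.75, 1.0, 1.0],
]
ref_mean = reference_cdf(cdfs, "mean")
ref_median = reference_cdf(cdfs, "median")
ref_weighted = reference_cdf(cdfs, "weighted", weights=[2, 1, 1])
```

Because each input curve is non-decreasing and ends at 1, an interval-wise mean, median, or positively weighted mean is also non-decreasing and ends at 1, so the result remains usable as a cumulative distribution.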

The process by which the reference probability distribution calculator 211 of FIG. 2 calculates the reference cumulative probability distribution has been described above. Through this process, when characteristic mismatch between the microphones causes differences among the input sound source signals, a reference cumulative probability distribution is obtained that serves as the reference for correcting those signals.

Next, the signal corrector 212 corrects the sound source signals according to the reference cumulative probability distribution calculated by the reference probability distribution calculator 211. There may accordingly be as many signal correctors 212 as there are individual microphones in the array, and each sound source signal (that is, each channel) is corrected with reference to the reference cumulative probability distribution. The method by which the signal corrector 212 corrects the sound source signals is described in more detail below with reference to FIGS. 6 and 7.

FIG. 6 is a detailed block diagram illustrating the configuration that corrects a sound source signal using a conversion function in a sound source signal correction apparatus according to an embodiment of the present invention. The signal corrector 620 in turn includes a probability distribution converter 621.

First, the normalized cumulative probability distribution calculator 610 calculates normalized cumulative probability distributions from the sound source signals. This process is as described above with reference to FIG. 2. Next, the probability distribution converter 621 uses a function that converts sound source signals into the reference cumulative probability distribution to generate corrected sound source signals from the normalized cumulative probability distributions calculated by the normalized cumulative probability distribution calculator 610. In other words, the normalized cumulative probability distribution is corrected according to the reference. This correction is performed by a sample-by-sample comparison against the reference cumulative probability distribution. A sample-by-sample comparison means that a given sample of the uncorrected sound source signal is compared with its corresponding reference sample (the value given by the reference cumulative probability distribution) and converted accordingly. FIG. 7 is provided to help explain this conversion.

FIG. 7 is a diagram illustrating a method of generating a corrected sound source signal in the probability distribution converter of the sound source signal correction apparatus according to an embodiment of the present invention. In FIG. 7, the graph drawn with a solid line is the histogram of the reference cumulative probability distribution, and the graph drawn with a dotted line is the histogram of the normalized cumulative probability distribution of a sound source signal that has not yet been corrected. The vertical axis of the graph is assumed to be normalized to values from 0 to 1. Correcting a sound source signal means matching the normalized cumulative probability distribution value of a given sample to the reference cumulative probability distribution value.

For example, suppose the value A on the horizontal axis of the graph is a particular value (that is, a sample) of the uncorrected sound source signal. The cumulative probability corresponding to A is then the value C on the vertical axis. If the input value that yields this cumulative probability C is estimated in reverse on the histogram of the reference cumulative probability distribution, the corrected input signal is the value B on the horizontal axis of that histogram. In other words, B is the corrected version of the uncorrected microphone input signal A.
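The A to C to B lookup described for FIG. 7 can be sketched as a forward lookup on the channel's own cumulative distribution followed by an inverse lookup on the reference distribution. This is an illustrative sketch only: the linear interpolation between tabulated points, the use of signal magnitude with the sign restored afterwards, and all names (`interp`, `correct_sample`, `levels`) are assumptions that the source does not specify.

```python
import bisect
import math


def interp(x, xs, ys):
    """Piecewise-linear interpolation of y at x; xs must be non-decreasing.

    Values outside the tabulated range are clamped to the end points.
    """
    if x <= xs[0]:
        return ys[0]
    if x >= xs[-1]:
        return ys[-1]
    j = bisect.bisect_right(xs, x)  # xs[j-1] <= x < xs[j], so xs[j] > xs[j-1]
    x0, x1 = xs[j - 1], xs[j]
    y0, y1 = ys[j - 1], ys[j]
    return y0 + (y1 - y0) * (x - x0) / (x1 - x0)


def correct_sample(a, levels, cdf_in, cdf_ref):
    """Map an uncorrected sample A to its corrected value B.

    levels  : magnitude grid on which both CDFs are tabulated (assumption)
    cdf_in  : normalized CDF of the uncorrected channel on `levels`
    cdf_ref : reference CDF on `levels`
    """
    c = interp(abs(a), levels, cdf_in)   # A -> C: cumulative probability of A
    b = interp(c, cdf_ref, levels)       # C -> B: inverse lookup on the reference
    return math.copysign(b, a)           # restore the sign of the input sample


if __name__ == "__main__":
    levels = [0.0, 0.25, 0.5, 0.75, 1.0]
    cdf_in = [0.0, 0.4, 0.7, 0.9, 1.0]
    cdf_ref = [0.0, 0.2, 0.5, 0.8, 1.0]
    print(correct_sample(0.25, levels, cdf_in, cdf_ref))
```

In this example the channel reaches a cumulative probability of 0.4 at a magnitude of 0.25, whereas the reference reaches 0.4 only at a larger magnitude, so the sample is scaled up. When the channel distribution equals the reference, the mapping leaves the sample unchanged.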

The process by which the signal corrector 212 of FIG. 2 corrects a sound source signal has been described above with reference to FIGS. 6 and 7. The following supplements that description with equations, covering both the calculation of the reference probability distribution in the sound source signal correction apparatus 210 of FIG. 2 and the correction of the sound source signal according to the calculated reference probability distribution.

First, assume that the microphone array consists of k individual microphones. If the k microphone input signals are denoted

[expression not reproduced]

then the normalized cumulative probability distributions corresponding to these microphone input signals can be written as

[expression not reproduced]

Here the normalized cumulative probability distributions satisfy the condition

[expression not reproduced]

Calculating the reference cumulative probability distribution from the cumulative probability distribution of each microphone input signal is then expressed by Equation 1.

[Equation 1]

Here, the left-hand side of Equation 1 is the function given by the reference cumulative probability distribution, and f(·) is a function that calculates a representative value. As described above with reference to FIG. 5A, f(·) may be a function that computes a mean or a median.

Next, the process of correcting a microphone input signal so that it follows the reference cumulative probability distribution is expressed by Equation 2.

[Equation 2]

Here, Equation 2 relates four quantities: the uncorrected microphone input signal, the corrected microphone input signal, the function given by the normalized cumulative probability distribution, and the function given by the reference cumulative probability distribution. That is, as described above with reference to FIG. 7, Equation 2 represents the input signal correction process of matching the normalized cumulative probability distribution value of a given input signal to the reference cumulative probability distribution value.

Rearranging Equation 2 for the corrected microphone input signal gives Equation 3.

[Equation 3]

The functions given by the normalized and reference cumulative probability distributions above are nonlinear transformation functions. Using Equation 3, the signal corrector 212 of FIG. 2 generates the final corrected sound source signal and supplies it to the signal processor 220.
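The equation images did not survive in this text, so the following is a hedged reconstruction from the surrounding definitions rather than the patent's own notation. The symbols are assumptions: $x_i$ is the uncorrected input of microphone $i$, $\hat{x}_i$ its corrected value, $C_i$ the normalized cumulative probability distribution of channel $i$, and $C_{\mathrm{ref}}$ the reference cumulative probability distribution.

```latex
\begin{align}
C_{\mathrm{ref}}(x) &= f\bigl(C_1(x),\, C_2(x),\, \dots,\, C_k(x)\bigr) \tag{1} \\
C_i(x_i) &= C_{\mathrm{ref}}(\hat{x}_i), \qquad i = 1, \dots, k \tag{2} \\
\hat{x}_i &= C_{\mathrm{ref}}^{-1}\bigl(C_i(x_i)\bigr) \tag{3}
\end{align}
```

Under this reading, (3) follows from (2) by applying the inverse of $C_{\mathrm{ref}}$ to both sides, which matches the A to C to B lookup of FIG. 7: $C = C_i(A)$ and $B = C_{\mathrm{ref}}^{-1}(C)$.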

The signal processor 220 is a component ordinarily found in digital signal processing devices equipped with a microphone array. Because it can be designed freely according to the environment in which the embodiments of the present invention are implemented, a detailed description is omitted here.

The configuration and role of the sound source signal correction apparatus have been described in detail above with reference to various embodiments of the present invention. According to these embodiments, a reference probability distribution is calculated from the probability distributions of the sound source signals input through the microphone array, and the sound source signals are corrected accordingly. This resolves the problem of sound source signals being distorted by characteristic mismatch among the individual microphones of the array in a digital device capable of sound acquisition. In addition, once the reference probability distribution has been calculated, the input sound source signals can be corrected in real time, which enables fast error compensation.

FIG. 3 is a block diagram illustrating a sound source signal correction apparatus with an added storage unit according to another embodiment of the present invention. It adds a storage unit 315 to the sound source signal correction apparatus 210 of FIG. 2.

Like the apparatus of FIG. 2, the sound source signal correction apparatus 310 of FIG. 3 includes a reference probability distribution calculator 311 and a signal corrector 312, so the description below focuses on the storage unit 315.

The storage unit 315 stores the reference probability distribution calculated by the reference probability distribution calculator 311. As described above, characteristic mismatch among microphones is caused by manufacturing tolerances or by aging of the microphones through use. Once such a mismatch has arisen, it is therefore unlikely to change significantly within a short time. In other words, the mismatch itself varies slowly over time. Consequently, once the reference cumulative probability distribution has been calculated, there is no need to calculate it repeatedly within a short period. On the contrary, recalculating the reference probability distribution every time a sound source signal is corrected would waste system resources.

For these reasons, the storage unit 315 stores the reference probability distribution so that it need not be calculated again. The storage unit 315 may be a recording device capable of recording data or a particular storage device connected over a network. The signal corrector 312 then reads the reference probability distribution stored in the storage unit 315 and corrects the sound source signals according to it. If, while the sound source signal correction apparatus 310 of FIG. 3 is performing correction, a certain time has elapsed or the characteristics of the microphones are judged to have changed, the reference probability distribution calculator 311 can calculate a new reference probability distribution, and updating the storage unit 315 with it resolves the changed mismatch among the microphones.
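The store-then-refresh behaviour described for the storage unit can be sketched as a small cache. This is an illustration under stated assumptions: the class name `ReferenceStore`, the `max_age_s` refresh interval, and the `force` flag (standing in for "the microphone characteristics are judged to have changed") are all hypothetical, and the source does not say how elapsed time or a change in characteristics is detected.

```python
import time


class ReferenceStore:
    """Keeps the last computed reference distribution and recomputes it only
    when it is older than max_age_s or a refresh is explicitly requested."""

    def __init__(self, compute, max_age_s, clock=time.monotonic):
        self._compute = compute      # callable returning a fresh reference distribution
        self._max_age_s = max_age_s  # assumed refresh interval in seconds
        self._clock = clock          # injectable clock, convenient for testing
        self._value = None
        self._stamp = None

    def get(self, force=False):
        now = self._clock()
        stale = self._stamp is None or (now - self._stamp) >= self._max_age_s
        if force or stale:
            # Recompute and overwrite the stored reference distribution.
            self._value = self._compute()
            self._stamp = now
        return self._value


if __name__ == "__main__":
    store = ReferenceStore(lambda: [0.2, 0.5, 0.8, 1.0], max_age_s=3600.0)
    reference = store.get()            # first call computes and stores
    reference_again = store.get()      # later calls reuse the stored value
    print(reference is reference_again)
```

The per-sample correction path then only reads the stored distribution, which is what lets correction run without repeating the reference calculation.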

In this embodiment, the reference probability distribution is not calculated repeatedly. Instead, the calculated reference probability distribution is stored and reused when sound source signals are corrected, which reduces unnecessary computation. Because the calculation can be skipped once the reference probability distribution exists, sound source signals can be corrected quickly.

FIG. 8 is a flowchart illustrating a sound source signal correction method according to yet another embodiment of the present invention. The method includes the following steps.

In step 810, probability distributions are calculated, each expressing as a probability the number of sound source signals present in each magnitude interval of the sound source signals acquired through the microphone array. The probability distributions may be calculated by accumulating the probability distributions of the sound source signals and then normalizing them so that their maximum values coincide.
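Step 810 can be sketched as counting samples per magnitude interval, accumulating the counts, and normalizing so that the maximum is 1. This is a minimal sketch under assumptions the source does not fix: the bin edges are supplied by the caller, magnitudes outside the edges are clamped into the end bins, and the predetermined maximum is taken to be 1. The name `normalized_cdf` is hypothetical.

```python
import bisect


def normalized_cdf(samples, edges):
    """Normalized cumulative histogram of sample magnitudes.

    samples : iterable of signal samples for one microphone channel
    edges   : increasing bin edges for the magnitude intervals (assumption)
    Returns one cumulative value per interval, ending at 1.0.
    """
    counts = [0] * (len(edges) - 1)
    for s in samples:
        magnitude = abs(s)
        # Index of the interval [edges[i], edges[i+1]) containing the magnitude.
        i = bisect.bisect_right(edges, magnitude) - 1
        i = min(max(i, 0), len(counts) - 1)  # clamp out-of-range magnitudes
        counts[i] += 1

    # Accumulate the per-interval counts.
    cumulative, running = [], 0
    for c in counts:
        running += c
        cumulative.append(running)

    total = cumulative[-1]
    if total == 0:
        raise ValueError("no samples to build a distribution from")
    # Normalize so every channel's maximum coincides at 1.0.
    return [v / total for v in cumulative]


if __name__ == "__main__":
    edges = [0.0, 0.25, 0.5, 0.75, 1.0]
    print(normalized_cdf([0.1, -0.2, 0.25, 0.6, -0.9], edges))
```

Normalizing every channel to the same maximum makes the distributions comparable even when the channels contributed different numbers of samples.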

In step 820, a reference probability distribution that represents the individual probability distributions is calculated from the probability distributions obtained in step 810. The reference probability distribution can be obtained by calculating a representative value, such as a mean or a median, from the calculated probability distributions and setting it as the reference probability distribution.

In step 830, the sound source signals acquired in step 810 are corrected according to the reference probability distribution calculated in step 820. The corrected sound source signals can be obtained by inputting the probability distributions into the inverse of the function that converts sound source signals into the reference probability distribution calculated in step 820.

According to this embodiment, the problem of sound source signals being distorted by characteristic mismatch among the individual microphones of the microphone array is resolved, and because the sound source signals are corrected in real time according to the calculated reference cumulative probability distribution, fast error compensation is possible.

Meanwhile, the present invention can be embodied as computer-readable code on a computer-readable recording medium. The computer-readable recording medium includes every kind of recording device that stores data readable by a computer system.

Examples of computer-readable recording media include ROM, RAM, CD-ROM, magnetic tape, floppy disks, and optical data storage devices, as well as implementations in the form of carrier waves (for example, transmission over the Internet). The computer-readable recording medium can also be distributed over network-connected computer systems so that the computer-readable code is stored and executed in a distributed fashion. Functional programs, code, and code segments for implementing the present invention can be readily inferred by programmers skilled in the art to which the present invention pertains.

The present invention has been described above with a focus on its various embodiments. Those of ordinary skill in the art to which the present invention pertains will understand that it can be implemented in modified forms without departing from its essential characteristics. The disclosed embodiments should therefore be considered in a descriptive sense and not for purposes of limitation. The scope of the present invention is set forth in the claims rather than in the foregoing description, and all differences within the equivalent scope should be construed as being included in the present invention.

FIG. 1 is a block diagram illustrating the general idea of correcting a sound source signal in the problem situation that the present invention addresses.

FIG. 2 is a block diagram illustrating a sound source signal correction apparatus according to an embodiment of the present invention.

FIG. 3 is a block diagram illustrating a sound source signal correction apparatus with an added storage unit according to another embodiment of the present invention.

FIG. 4 is a diagram illustrating a method of calculating a cumulative probability distribution in the sound source signal correction apparatus according to an embodiment of the present invention.

FIGS. 5A and 5B are detailed block diagrams illustrating a configuration for calculating a reference cumulative probability distribution in the sound source signal correction apparatus according to an embodiment of the present invention.

FIG. 6 is a detailed block diagram illustrating a configuration for correcting a sound source signal using a conversion function in the sound source signal correction apparatus according to an embodiment of the present invention.

FIG. 7 is a diagram illustrating a method of generating a corrected sound source signal in the probability distribution converter of the sound source signal correction apparatus according to an embodiment of the present invention.

FIG. 8 is a flowchart illustrating a sound source signal correction method according to yet another embodiment of the present invention.

Claims

A method of correcting sound source signals, the method comprising:

calculating probability distributions, each expressing as a probability the number of sound source signals present in each magnitude interval of the sound source signals acquired through a microphone array;

calculating, based on the calculated probability distributions, a reference probability distribution that represents the probability distributions; and

correcting the sound source signals according to the calculated reference probability distribution.

The method of claim 1,

wherein the calculating of the probability distributions comprises calculating and accumulating, for each interval, the probability distributions of the magnitudes of the sound source signals, and

the calculating of the reference probability distribution comprises calculating the reference probability distribution based on the accumulated probability distributions.

The method of claim 2,

wherein the calculating of the probability distributions further comprises normalizing the accumulated probability distributions so that their maximum values coincide with a predetermined value, and

the calculating of the reference probability distribution comprises calculating the reference probability distribution based on the normalized cumulative probability distributions.

The method of claim 1,

wherein the calculating of the reference probability distribution comprises calculating a mean or a median from the calculated probability distributions and setting it as the reference probability distribution.

The method of claim 1,

wherein the correcting of the sound source signals comprises generating corrected sound source signals by inputting the probability distributions into an inverse function of a function that converts the sound source signals into the reference probability distribution.

The method of claim 1,

further comprising storing the calculated reference probability distribution in advance,

wherein the correcting of the sound source signals comprises reading the prestored reference probability distribution and correcting the sound source signals according to the read reference probability distribution.

A non-transitory computer-readable recording medium having recorded thereon a program for executing the method of claim 1.

An apparatus for correcting sound source signals, the apparatus comprising:

a probability distribution calculator configured to calculate probability distributions, each expressing as a probability the number of sound source signals present in each magnitude interval of the sound source signals acquired through a microphone array;

a reference probability distribution calculator configured to calculate, based on the calculated probability distributions, a reference probability distribution that represents the probability distributions; and

a signal corrector configured to correct the sound source signals according to the calculated reference probability distribution.

The apparatus of claim 8,

wherein the probability distribution calculator comprises an accumulator that calculates and accumulates, for each interval, the probability distributions of the magnitudes of the sound source signals, and

the reference probability distribution calculator calculates the reference probability distribution based on the accumulated probability distributions.

The apparatus of claim 9,

wherein the probability distribution calculator further comprises a normalization unit that normalizes the accumulated probability distributions so that their maximum values coincide with a predetermined value, and

the reference probability distribution calculator calculates the reference probability distribution based on the normalized cumulative probability distributions.

The apparatus of claim 8,

wherein the reference probability distribution calculator calculates a mean or a median from the calculated probability distributions and sets it as the reference probability distribution.

The apparatus of claim 8,

wherein the signal corrector comprises a probability distribution converter configured to generate corrected sound source signals by inputting the probability distributions into an inverse function of a function that converts the sound source signals into the reference probability distribution.

The apparatus of claim 8,

further comprising a storage unit configured to store the calculated reference probability distribution in advance,

wherein the signal corrector reads the prestored reference probability distribution and corrects the sound source signals according to the read reference probability distribution.