KR101459317B1

KR101459317B1 - Method and apparatus for calibrating the sound source signal acquired through the microphone array

Info

Publication number: KR101459317B1
Application number: KR1020070123818A
Authority: KR
Inventors: 정소영; 오광철; 정재훈; 김규홍
Original assignee: 삼성전자주식회사
Priority date: 2007-11-30
Filing date: 2007-11-30
Publication date: 2014-11-07
Also published as: KR20090056597A

Abstract

본 발명은 마이크로폰 어레이를 통해 획득한 음원 신호를 보정하는 방법 및 장치에 관한 것으로, 본 발명에 따른 음원 신호 보정 방법은 마이크로폰 어레이를 통해 획득한 음원 신호들의 크기 구간별로 각 구간에 존재하는 음원 신호들의 수를 확률로서 표현한 확률 분포들을 산출하고, 산출된 확률 분포들을 대표하는 기준 확률 분포를 산출하며, 산출된 기준 확률 분포에 따라 음원 신호들을 보정함으로써, 음원 획득이 가능한 디지털 기기에서 마이크로폰 어레이를 구성하는 개별 마이크로폰들 간의 특성 불일치로 인해 음원 신호가 왜곡되는 문제점이 해소된다.The present invention relates to a method and an apparatus for correcting a sound source signal obtained through a microphone array, and a method for correcting a sound source signal according to the present invention is a method for correcting a sound source signal obtained through a microphone array, A microphone array is constructed in a digital device capable of acquiring a sound source by calculating probability distributions expressing numbers as probabilities and calculating reference probability distributions representing the calculated probability distributions and correcting the sound source signals according to the calculated reference probability distributions The problem that the sound source signal is distorted due to the characteristic mismatch between the individual microphones is solved.

마이크로폰 어레이, 보정, 확률 분포, 히스토그램, 비선형 변환 Microphone array, calibration, probability distribution, histogram, nonlinear transformation

Description

[0001] The present invention relates to a method and apparatus for calibrating a sound source signal acquired through a microphone array,

본 발명은 마이크로폰 어레이를 통해 획득한 음원 신호를 보정하는 방법 및 장치에 관한 발명으로서, 마이크로폰 어레이가 구비된 휴대 전화, UMPC(ultra mobile personal computer), 캠코더 등의 휴대용 사운드 획득 기기와 DTV(digital television), 원격 화상 회의 시스템 등 디지털 정보 기기 등에서 음성 통화, 녹음 및 음성 인식과 같이 사운드를 획득함에 있어서 획득된 음원 신호의 특성 차이를 보정하는 방법 및 장치에 관한 것이다.The present invention relates to a method and an apparatus for correcting a sound source signal acquired through a microphone array, and relates to a portable sound acquiring device such as a mobile phone equipped with a microphone array, an ultra mobile personal computer (UMPC) And a method and apparatus for correcting a difference in characteristics of a sound source signal obtained in acquiring sound such as voice call, recording, and voice recognition in a digital information device such as a remote video conference system.

휴대 전화를 이용하여 전화 통화를 하거나 디지털 캠코더 및 녹음기를 통해 사운드를 취득하는 것이 일상화되는 시대가 도래하였다. CE(consumer electronics) 기기 및 휴대 전화 등 다양한 디지털 기기에서는 사운드를 취득하기 위한 수단으로서 마이크로폰(microphone)이 사용되는데, 단일 채널의 모노(mono) 사운드가 아닌 2 이상의 채널을 활용하는 스테레오(stereo) 사운드를 구현하기 위해서는 일반적으로 다수의 마이크로폰들이 포함된 마이크로폰 어레이(microphone array)가 사용된 다.The time has come to become commonplace to make a phone call using a mobile phone or to acquire sound through a digital camcorder and a sound recorder. In a variety of digital devices, such as consumer electronics (CE) devices and mobile phones, a microphone is used as a means of acquiring sound, which is a stereo sound utilizing two or more channels rather than a single channel mono sound A microphone array including a plurality of microphones is generally used.

마이크로폰 어레이는 다수의 마이크로폰들을 조합하여 사운드 자체뿐만 아니라 취득하려는 사운드의 방향이나 위치와 같은 지향성(directivity)에 관한 부가적인 성질을 얻을 수 있다. 지향성이라 함은 음원 신호가 어레이를 구성하는 다수의 마이크로폰들 각각에 도달하는 시간 차이를 이용하여 특정 방향에 위치한 음원으로부터 방사되는 음원 신호에 대한 감도를 크게 하는 것을 말한다. 따라서, 이러한 마이크로폰 어레이를 이용하여 음원 신호들을 취득함으로써 특정 방향으로부터 입력되는 음원 신호를 강조하거나 억제할 수 있다.The microphone array can combine multiple microphones to obtain additional properties relating to the directivity as well as the sound itself as well as the direction or position of the sound to be acquired. The directivity means that the sensitivity of a sound source signal emitted from a sound source located in a specific direction is increased by using a time difference in which the sound source signal reaches each of a plurality of microphones constituting the array. Therefore, by acquiring sound source signals using such a microphone array, it is possible to emphasize or suppress the sound source signals inputted from a specific direction.

이러한 마이크로폰 어레이를 조절한다는 것은 마이크로폰 어레이를 구성하는 복수 개의 마이크로폰들 각각의 지연값이나 마이크로폰들 간의 간격 등 지향성 조절 인자(parameter)들을 조절한다는 것을 의미한다. 또한, 사용자가 의도한 대로 마이크로폰 어레이를 조절하기 위해서는 마이크로폰 어레이가 개별 마이크로폰들의 물리적 특성(신호의 크기, 위상 및 주파수 응답 등을 의미한다.)에 따라 동작하여야 한다는 기본적인 전제 조건이 성립하여야 한다.Adjusting the microphone array means adjusting the directivity adjustment parameters such as the delay value of each of the plurality of microphones constituting the microphone array or the interval between the microphones. Also, in order to control the microphone array as intended by the user, a basic precondition must be established that the microphone array must operate according to the physical characteristics of the individual microphones (meaning signal size, phase and frequency response).

본 발명이 해결하고자 하는 기술적 과제는 음원 획득이 가능한 디지털 기기에서 마이크로폰 어레이를 통해 입력된 음원 신호들이 마이크로폰 어레이를 구성하는 개별 마이크로폰들 간의 특성 불일치로 인해 음원 신호가 왜곡되는 문제점을 해결하는 음원 신호 보정 방법 및 장치를 제공하는데 있다.SUMMARY OF THE INVENTION The present invention has been made in view of the above problems, and it is an object of the present invention to provide a sound source signal correction method and a sound source signal correction method for solving the problem that a sound source signal inputted through a microphone array in a digital device capable of acquiring a sound source is distorted due to a characteristic mismatch between individual microphones constituting a microphone array Method and apparatus.

상기 기술적 과제를 달성하기 위하여, 본 발명에 따른 음원 신호 보정 방법은 마이크로폰 어레이를 통해 획득한 음원 신호들의 크기 구간별로 각 구간에 존재하는 음원 신호들의 수를 확률로서 표현한 확률 분포들을 산출하는 단계; 상기 산출된 확률 분포들을 대표하는 기준 확률 분포를 산출하는 단계; 및 상기 산출된 기준 확률 분포에 따라 상기 음원 신호들을 보정하는 단계를 포함하는 것을 특징으로 한다.According to another aspect of the present invention, there is provided a method for correcting a sound source signal, the method comprising the steps of: calculating probability distributions representing the number of sound source signals present in each interval as a probability, according to magnitude intervals of sound source signals acquired through a microphone array; Calculating a reference probability distribution representing the calculated probability distributions; And correcting the sound source signals according to the calculated reference probability distribution.

상기 다른 기술적 과제를 해결하기 위하여, 본 발명은 상기 기재된 음원 신호 보정 방법을 컴퓨터에서 실행시키기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체를 제공한다.According to another aspect of the present invention, there is provided a computer-readable recording medium having recorded thereon a program for causing a computer to execute the sound source signal correction method described above.

상기 기술적 과제를 달성하기 위하여, 본 발명에 따른 음원 신호 보정 장치는 마이크로폰 어레이를 통해 획득한 음원 신호들의 크기 구간별로 각 구간에 존재하는 음원 신호들의 수를 확률로서 표현한 확률 분포들을 산출하는 확률 분포 산출부; 상기 산출된 확률 분포들을 대표하는 기준 확률 분포를 산출하는 기준 확률 분포 산출부; 및 상기 산출된 기준 확률 분포에 따라 상기 음원 신호들을 보정하는 신호 보정부를 포함하는 것을 특징으로 한다.According to an aspect of the present invention, there is provided an apparatus for correcting a sound source signal, the apparatus comprising: a probability distribution calculating unit operable to calculate probability distributions representing the number of sound source signals existing in each interval, part; A reference probability distribution calculating unit for calculating a reference probability distribution representing the calculated probability distributions; And a signal correction unit for correcting the sound source signals according to the calculated reference probability distribution.

이하에서는 도면을 참조하여 본 발명의 다양한 실시예들을 상세히 설명한다. 실시예들을 설명함에 있어서, 음원(sound source)이란 사운드가 방사되어 나오는 소스(source)를 의미하는 용어로서 사용될 것이다. 또한, 음압(sound pressure)이란, 음향 에너지가 미치는 힘을 압력의 물리량을 사용하여 표현한 것이고, 음압장(sound pressure field)이란 음원을 중심으로 음압이 미치는 영역을 개념적으로 표현한 것이다.Hereinafter, various embodiments of the present invention will be described in detail with reference to the drawings. In describing the embodiments, a sound source will be used as a term meaning a source from which sound is emitted. In addition, the sound pressure is the expression of the force of the acoustic energy using the physical quantity of the pressure, and the sound pressure field is a conceptual representation of the area of the sound pressure around the sound source.

도 1은 본 발명이 해결하고자 하는 문제 상황에서 음원 신호를 보정하는 개략적인 아이디어를 도시한 블럭도로서, 마이크로폰 어레이(100), 음원 신호 보정부(110) 및 신호 처리부(120)를 포함한다. 각각의 구성 요소를 설명하기에 앞서 먼저 본 발명이 해결하고자 하는 문제 상황을 설명한다.FIG. 1 is a block diagram illustrating a schematic idea of correcting a sound source signal in a problem situation to be solved by the present invention, and includes a microphone array 100, a sound source signal correcting section 110, and a signal processing section 120. Before describing each component, the problem situation to be solved by the present invention will be described first.

앞서 설명한 바와 같이 마이크로폰 어레이는 지향성과 같은 사운드의 방향 특성을 활용하기 위해 사용된다. 일반적으로 마이크로폰 어레이는 배경 잡음과 혼합된 목표 신호를 고감도로 수신하기 위해 마이크로폰 어레이에 수신된 각각의 신호에 적절한 가중치를 주어 진폭을 향상시킴으로써 원하는 목표 신호와 간섭 잡음 신호의 방향이 다를 경우의 잡음을 공간적으로 줄일 수 있는 필터 역할을 하는데, 이러한 일종의 공간적 필터(spatial filter)를 빔 형성기(beamformer)라고 한다.As described above, the microphone array is used to exploit the directional characteristics of the sound, such as directivity. In general, the microphone array improves the amplitude by giving a proper weight to each signal received by the microphone array to receive the target signal mixed with the background noise with a high sensitivity, thereby improving the noise when the desired target signal and the direction of the interference noise signal are different This kind of spatial filter is called a beamformer.

그런데, 이러한 빔 형성기를 이용하여 음원 신호를 처리하기 위해서는 마이크로폰 어레이를 구성하는 각각의 개별 마이크로폰들의 물리적 특성이 사용자에게 알려진 바와 일치하여야 한다. 즉, 개별 마이크로폰들을 통해 획득한 신호들의 크 기, 신호의 위상 및 주파수 응답들이 모두 일치하여야 한다. 왜냐하면, 마이크로폰 어레이를 조절하기 위한 인자들이 이러한 개별 마이크로폰들의 물리적인 특성에 기초하기 때문이다.However, in order to process a sound source signal using such a beam shaper, the physical characteristics of each of the individual microphones constituting the microphone array must match those known to the user. In other words, the size, the phase, and the frequency response of the signals acquired through the individual microphones must match. This is because the factors for adjusting the microphone array are based on the physical characteristics of these individual microphones.

만약, 마이크로폰 어레이를 조절하기 위해 일정한 시간만큼 음원 신호를 지연시키거나 음원 신호의 크기를 변화시켰는데 마이크로폰 어레이를 구성하는 개별 마이크로폰들이 사용자의 의도에 따라 조절되지 않는다면, 결과적으로 사용자의 의도와는 다른 왜곡된 음원 신호를 얻을 수 밖에 없을 것이다. 예를 들어, 대표적인 적응적 빔 형성기(adaptive beamformer) 알고리즘인 GSC(generalized side-lobe canceller)에서 불필요한 주변 간섭 음압장인 사이드 로브(side-lobe)를 제거하려 할 때, 마이크로폰 어레이를 구성하는 개별 마이크로폰들 간의 특성 불일치(mismatch)가 존재한다면, 이러한 특성 불일치가 신호 누출(leakage)을 야기하게 되고, 결과적으로 음원 신호가 왜곡되는 문제가 발생한다.If the microphone source is delayed or the size of the source signal is changed for a certain period of time in order to adjust the microphone array, if the individual microphones constituting the microphone array are not adjusted according to the user's intention, It will be impossible to obtain a distorted sound source signal. For example, when attempting to eliminate unnecessary side-lobe artifacts in a generalized side-lobe canceller (GSC), which is an exemplary adaptive beamformer algorithm, the individual microphones constituting the microphone array If there is a characteristic mismatch between these characteristics, this characteristic mismatch causes signal leakage, resulting in a distortion of the sound source signal.

이상과 같은 마이크로폰들 간의 특성 불일치는 크게 다음의 2 가지 원인으로부터 기인한다. 첫째, 마이크로폰의 제조 과정에서 발생한 오차에 의한 경우가 있으며, 둘째, 마이크로폰의 사용 과정에서 시간이 경과함에 따라 노화에 의해 마이크로폰의 물리적 특성이 변화한 경우가 있다. 이러한 2 가지 원인들은 모두 실제 제품의 제조 과정이나 사용 과정에서 비롯된 것으로서, 필연적으로 발생할 수 밖에 없는 경우가 많다. 따라서, 이하에서 기술할 본 발명의 다양한 실시예들은 이러한 문제점들이 마이크로폰 어레이를 구비한 사운드 획득 기기에서 발생한 경우, 사운드 획득 기기가 마이크로폰들 간의 특성 불일치에 둔감하도록 음원 신호를 보정하 고자 한다. 이러한 문제 상황 하에서 도 1의 개략적인 구성을 살펴보면 다음과 같다.The above-mentioned characteristic inconsistencies between the microphones are mainly caused by the following two reasons. First, there are cases where the error occurs in the manufacturing process of the microphone. Secondly, the physical characteristics of the microphone may be changed by aging with the lapse of time in the process of using the microphone. Both of these causes are caused by the manufacturing process or the use process of the actual product, and inevitably occur in many cases. Therefore, various embodiments of the present invention to be described below are intended to correct the sound source signal so that the sound acquisition device is insensitive to the characteristic mismatch between the microphones when such problems occur in the sound acquisition device having the microphone array. Under such a problem, the schematic configuration of FIG. 1 will be described as follows.

마이크로폰 어레이(100)는 외부로부터 음원 신호를 획득한다. 음원의 방향이나 음원 신호의 크기 등 마이크로폰 어레이(100)를 조절하는 방법은 본 발명의 실시예가 구현되는 상황 및 목적에 따라 다양하게 설계될 수 있을 것이다.The microphone array 100 acquires a sound source signal from the outside. The method of adjusting the microphone array 100 such as the direction of a sound source or the size of a sound source signal may be variously designed according to the situation and purpose in which the embodiment of the present invention is implemented.

음원 신호 보정부(110)는 마이크로폰 어레이(100)를 통해 획득한 음원 신호들의 특성 불일치를 보정한다. 개별 마이크로폰들 간의 특성 불일치로 인해 발생하는 음원 신호의 왜곡 문제를 해결하는 방법으로는 크게 다음의 2 가지 방법이 가능하다. 첫째, 마이크로폰 어레이를 통해 음원 신호가 입력되면 즉시 입력된 각각의 음원 신호들을 보정하는 방법이 있을 수 있고, 둘째, 마이크로폰 어레이를 통해 입력된 음원 신호를 특정 목적에 따라 가공하는 과정에서 왜곡된 정도는 보정하는 방법이 있을 수 있다. 본 발명의 다양한 실시예들에서는 이상의 2 가지 해결 방법 중 전자, 즉, 입력된 신호들 자체를 바로 보정하는 방법에 따른다. 이후의 도 2에서부터 자세히 설명한다.The sound source signal correcting unit 110 corrects characteristic discrepancies of the sound source signals acquired through the microphone array 100. The following two methods are available as a method for solving the distortion problem of the sound source signal caused by the characteristic mismatch between the individual microphones. First, there may be a method of correcting each input sound source signal immediately when a sound source signal is input through the microphone array. Second, in a process of processing a sound source signal input through the microphone array according to a specific purpose, There may be a way to calibrate. The various embodiments of the present invention follow a method of directly correcting the electrons, i.e., the input signals themselves, of the above two solutions. This will be described in detail later with reference to FIG.

신호 처리부(120)는 음원 신호 보정부(110)를 통해 보정된 음원 신호들을 사용자의 목적에 따라 처리한다. 이는 통상적인 음원 신호 처리, 빔 형성기 및 음원 위치 추적기 등 본 발명이 구현될 다양한 실시예들 및 환경에 따라 자유롭게 설계될 수 있을 것이다.The signal processing unit 120 processes the corrected sound source signals through the sound source signal correcting unit 110 according to the purpose of the user. This may be freely designed according to various embodiments and environments in which the present invention is to be implemented, such as conventional sound source signal processing, beamformer, and sound source location tracker.

도 2는 본 발명의 일 실시예에 따른 음원 신호 보정 장치를 도시한 블럭도로서, 마이크로폰 어레이(200), 기준 확률 분포 산출부(211), 신호 보정부(212) 및 신호 처리부(220)를 포함한다. 마이크로폰 어레이(200)와 신호 처리부(220)는 도 1에서 설명한 마이크로폰 어레이(100) 및 신호 처리부(120)와 동일한 구성으로서, 통상적인 음원 신호 처리 장치의 구성에 해당한다. 따라서, 보다 엄밀한 의미에서 본 실시예에 따른 음원 신호 보정 장치는 점선으로 도시한 영역(210)에 포함되는 기준 확률 분포 산출부(211) 및 신호 보정부(212)로 구성된다. 이하에서는 이들 2 가지 구성 요소를 중심으로 음원 신호 보정 장치를 자세히 설명한다.2 is a block diagram illustrating an apparatus for correcting a sound source signal according to an embodiment of the present invention. The microphone array 200, the reference probability distribution calculating unit 211, the signal correcting unit 212, and the signal processing unit 220 . The microphone array 200 and the signal processing unit 220 have the same configuration as the microphone array 100 and the signal processing unit 120 described in FIG. 1, and correspond to the configuration of a typical sound source signal processing apparatus. Therefore, in a more strict sense, the sound source signal correction apparatus according to the present embodiment includes a reference probability distribution calculating section 211 and a signal correction section 212 included in a region 210 indicated by a dotted line. Hereinafter, the sound source signal correction apparatus will be described in detail with reference to these two components.

기준 확률 분포 산출부(211)는 마이크로폰 어레이(200)를 통해 획득한 음원 신호들의 크기 구간별로 각 구간에 존재하는 음원 신호들의 수를 확률로서 표현한 확률 분포들을 산출한다. 일반적으로 확률 분포라 함은 확률 변수의 분포 상태를 의미하는 것으로서, 어떤 시행에서 일어날 수 있는 사건마다 그 확률 값을 대응시킨 것이다. 이 때, 확률 변수의 분포 상태는 변량을 일정한 폭으로 나눈 구간에 포함된 변수의 개수를 계수함으로써 산출될 수 있다.The reference probability distribution calculating unit 211 calculates probability distributions representing the number of sound source signals present in each interval as probabilities for each magnitude interval of the sound source signals acquired through the microphone array 200. Generally, the probability distribution means the distribution of the probability variable, which corresponds to the probability value for each event that can occur in an attempt. In this case, the distribution state of the random variable can be calculated by counting the number of variables included in the interval obtained by dividing the variable by a predetermined width.

본 발명의 실시예들에서 확률 분포(PDF; probability distribution function)란 음원 신호들의 크기를 일정한 구간으로 나누고, 각 구간별로 해당하는 음원 신호들의 개수를 계수하여 확률로서 표현한 것을 의미한다. 통상적으로 변량을 임의의 구간으로 나누고, 각 구간에 포함되는 값의 개수(도수라고도 한다.)를 표시한 그래프를 히스토그램(histogram)이라고 하는데, 이하에서는 확률 분포를 시각적으로 표시한 그래프라는 의미로서 사용될 것이다.In the embodiments of the present invention, the probability distribution function (PDF) means that the size of the sound source signals is divided into a predetermined section and the number of corresponding sound source signals is counted for each section to express the probability. In general, a graph dividing a variable into arbitrary intervals and displaying the number of values included in each interval (also referred to as a frequency) is referred to as a histogram. Hereinafter, a graph indicating a graphical representation of a probability distribution will be.

확률 분포를 산출하는 기초가 되는 음원 신호들의 크기는 다양한 방법으로 표현될 수 있는데, 대표적으로 시간 영역(time domain)에서의 신호의 크기 또는 주 파수 영역(frequency domain)에서의 신호의 크기로 표현될 수 있을 것이다.The magnitude of the sound source signals from which the probability distribution is calculated can be expressed in various ways. Typically, the size of the signal in the time domain or the size of the signal in the frequency domain It will be possible.

시간 영역에서 입력 신호들의 크기는 신호들 자체의 진폭(amplitude)이 될 것이다. 따라서, 입력 신호들의 진폭을 일정한 크기 구간으로 나누어 각 구간에 포함되는 신호들의 개수를 계수하면 확률 분포를 산출할 수 있다. 한편, 주파수 영역에서 입력 신호들의 크기를 구하기 위해서는 다음과 같은 과정에 따른다.The magnitude of the input signals in the time domain will be the amplitudes of the signals themselves. Therefore, the probability distribution can be calculated by dividing the amplitude of the input signals by a constant magnitude interval and counting the number of signals included in each interval. In order to obtain the magnitudes of the input signals in the frequency domain, the following procedure is followed.

우선, 마이크로폰 어레이(200)를 통해 입력된 음원 신호들에 대한 디지털 신호 처리를 위해서는 연산의 편의를 위해 고속 푸리에 변환(fast Fourier transform)을 통해 주파수 영역으로 변환하게 된다. 일반적으로 디지털 신호 처리에서는 해당 시스템에 신호를 입력하고 그 결과로서 생성되는 출력 신호를 표현하기 위해 컨벌루션(convolution)을 사용하는데, 주어진 대상 신호를 유한하게 제한하기 위해 프레임(frame)으로 나누어 처리하게 된다. 프레임이란 시간의 변화에 따라 음원 신호를 일정한 구간으로 분리한 유닛(unit)을 의미한다. 일단, 음원 신호들이 프레임별로 주파수 영역으로 변환되면, 확률 분포를 산출하는 과정은 시간 영역에서와 유사하다. 즉, 주파주 영역의 신호의 크기를 일정한 크기 구간으로 나누어 각 구간에 포함되는 신호들의 개수를 계수함으로써 확률 분포를 산출할 수 있다.First, for the digital signal processing of the excitation signals inputted through the microphone array 200, the frequency domain is transformed through a fast Fourier transform for convenience of operation. Generally, in digital signal processing, a convolution is used to input a signal to a corresponding system and express a resulting output signal, which is divided into frames in order to limit a given object signal finitely . A frame is a unit in which a sound source signal is separated into a predetermined section according to a change in time. Once the sound source signals are transformed into the frequency domain on a frame-by-frame basis, the process of calculating the probability distribution is similar to that in the time domain. That is, the probability distribution can be calculated by dividing the signal size of the main pseudo range by a constant magnitude interval and counting the number of signals included in each interval.

기준 확률 분포 산출부(211)는 우선 이상의 구간별로 각각 음원 신호의 분포 정도를 확률로서 산출하고, 산출된 결과들로부터 각각의 음원 신호들을 대표하는 기준값을 산출한다. 이러한 기준값이 바로 기준 확률 분포로서, 이후의 단계에서 각각의 음원 신호들을 보정하기 위한 척도가 된다.The reference probability distribution calculating unit 211 first calculates the degree of distribution of the sound source signals for each of the above intervals as a probability, and calculates a reference value representative of the respective sound source signals from the calculated results. This reference value is a reference probability distribution, and it is a measure for correcting the respective sound source signals in a subsequent step.

이하에서는 도 4, 도 5a 및 도 5b를 참조하여 기준 확률 분포 산출부(211)가 입력된 음원 신호들로부터 기준 확률 분포를 산출하는 과정을 보다 상세하게 설명한다. 이하의 실시예들에서는 기분 확률 분포로서 확률 분포들을 신호의 크기가 작은 쪽에서 큰 쪽으로 누적시킨 누적 확률 분포를 사용할 것이고, 또한 이러한 누적 확률 분포를 특정 값으로 정규화시킨 정규 누적 확률 분포를 사용할 것이다. 상기된 확률 분포로서 누적 확률 분포 및 정규 누적 확률 분포 이외에 다양한 확률 분포를 나타내는 수단들을 사용할 수 있음은 물론이다. 이하에서 각각의 구체적인 의미와 구성을 설명한다.Hereinafter, the process of calculating the reference probability distribution from the sound source signals input by the reference probability distribution calculating unit 211 will be described in more detail with reference to FIGS. 4, 5A and 5B. In the following embodiments, the probability distributions as the mood probability distributions will use cumulative probability distributions in which the signal magnitudes are smaller to the larger ones, and the normal cumulative probability distributions obtained by normalizing the cumulative probability distributions to specific values will be used. As a matter of course, it is possible to use means for representing various probability distributions in addition to the cumulative probability distribution and the normal cumulative probability distribution. Hereinafter, specific meanings and configurations will be described.

도 4는 본 발명의 일 실시예에 따른 음원 신호 보정 장치에서 누적 확률 분포를 산출하는 방법을 도시한 도면으로서, 첫 번째 그래프는 마이크로폰 어레이를 통해 입력된 음원 신호들의 확률 분포에 대한 히스토그램을 예시한 것이고, 두 번째 그래프는 음원 신호들의 확률 분포를 누적시키고 정규화시킨 히스토그램을 예시한 것이다. 그래프의 가로축은 음원 신호의 크기 구간을 나타내고, 세로축은 각 구간에 해당하는 음원 신호들의 개수를 확률 분포로서 표시한 것이다. 도 4에서는 마이크로폰 어레이가 4 개의 마이크로폰들로 구성되어 있다고 가정하고 있으며, 각각의 마이크로폰들을 통해 입력된 음원 신호들을 4 개의 채널로 표시하였다.FIG. 4 is a diagram illustrating a method of calculating a cumulative probability distribution in a sound source signal correction apparatus according to an embodiment of the present invention. The first graph illustrates a histogram of a probability distribution of sound source signals input through a microphone array And the second graph is an example of a histogram that accumulates and normalizes the probability distribution of the sound source signals. The horizontal axis of the graph represents a magnitude interval of the sound source signal and the vertical axis represents the number of sound source signals corresponding to each interval as a probability distribution. In FIG. 4, it is assumed that the microphone array is composed of four microphones, and the sound source signals inputted through the respective microphones are represented by four channels.

도 4의 좌측 그래프에서 입력 음원 신호가 동일한 것임에도 불구하고 각각의 채널별로 확률 분포들 간의 차이가 나타나는 것을 볼 수 있다. 즉, 마이크로폰들 간의 특성 불일치가 나타나는 것을 볼 수 있다. 이러한 확률 분포들을 가로축을 기준으로 신호의 크기가 작은 쪽에서 큰 쪽으로 누적시켜 히스토그램으로 도시하면 도 4에 도시된 바와 같이 신호 크기가 증가하는 형태의 그래프가 그려질 것이다. 그런데, 이렇게 그려진 누적 확률 분포들은 앞서 설명한 마이크로폰들 간의 특성 불일치로 인해 누적된 최대값이 서로 일치하지 않는다. 즉, 확률 분포들의 누적 결과가 서로 달라지게 된다. 따라서, 이러한 누적 결과를 일치시키기 위한 정규화(normalization) 과정을 거치게 된다. 여기서, 정규화는 계산이 용이하도록 기준이 되는 특정 값으로 최대값을 일치시키는 것을 의미하며, 본 실시예에서는 누적된 확률 분포의 최대값을 1로, 최소값을 0으로 설정하였다. 이상의 과정을 통해 생성된 히스토그램은 도 4의 두 번째 그래프에 도시되어 있다. 도 4의 두 번째 그래프는 정규화된 누적 확률 분포(normalized cumulative probability distribution function)를 도시한 히스토그램으로서, 세로축이 모두 1로 정규화되어 있으며, 4 개 채널의 확률 분포들이 오른쪽으로 진행할수록 점진적으로 증가하는 형태의 그래프를 형성하고 있다.In the left graph of FIG. 4, although the input sound source signals are the same, a difference between probability distributions appears for each channel. That is, we can see that there is a characteristic mismatch between the microphones. If these probability distributions are accumulated in a larger scale from the smaller side to the larger side with respect to the horizontal axis and a histogram is shown, a graph in which the signal size increases as shown in FIG. 4 will be drawn. However, the accumulated cumulative probability distributions do not match the accumulated maximum values due to the characteristic mismatch between the microphones described above. That is, the cumulative results of the probability distributions are different from each other. Therefore, a normalization process for matching the cumulative results is performed. Here, the normalization means that the maximum value is matched with a specific value as a reference for easy calculation. In this embodiment, the maximum value of the accumulated probability distribution is set to 1 and the minimum value is set to 0. The histogram generated through the above process is shown in the second graph of FIG. The second graph in FIG. 4 is a histogram showing a normalized cumulative probability distribution function. The vertical axis is normalized to 1, and the probability distributions of the four channels gradually increase as they progress to the right As shown in FIG.

도 5a 및 도 5b는 본 발명의 일 실시예에 따른 음원 신호 보정 장치에서 기준 누적 확률 분포를 산출하는 구성을 상세하게 도시한 블럭도로서, 도 4의 히스토그램을 생성하기 위한 구체적인 구성을 도시하고 있다.5A and 5B are block diagrams illustrating a detailed configuration of calculating a reference cumulative probability distribution in a sound source signal correction apparatus according to an embodiment of the present invention, and show a specific configuration for generating the histogram of FIG. 4 .

도 5a에서 기준 누적 확률 분포 산출부는 정규 누적 확률 분포 산출부(510)와 대표값 산출부(520)를 포함한다.5A, the reference cumulative probability distribution calculating unit includes a normal cumulative probability distribution calculating unit 510 and a representative value calculating unit 520. [

앞서 설명한 바와 같이, 정규 누적 확률 분포 산출부(510)는 마이크로폰 어레이를 통해 입력받은 음원 신호들의 크기 구간별 신호 분포들로부터 음원 신호들에 대응하는 정규 누적 확률 분포들을 산출한다. 도 5b를 참조하면, 정규 누적 확 률 분포 산출부(510)는 다시 누적부(511)와 정규화부(512)로 구성된다. 누적부(511)는 음원 신호들의 크기 구간별로 신호들의 크기에 대한 확률 분포들을 산출하여 누적한다. 정규화부(512)는 누적부(511)를 통해 누적된 확률 분포들에 대하여 누적된 최대값들이 서로 일치하도록 정규화한다. As described above, the normal cumulative probability distribution calculating unit 510 calculates the normal cumulative probability distributions corresponding to the sound source signals from the signal distributions of the magnitude intervals of the sound source signals received through the microphone array. Referring to FIG. 5B, the normal cumulative probability distribution calculating unit 510 includes an accumulating unit 511 and a normalizing unit 512. The accumulation unit 511 calculates and accumulates probability distributions of the magnitudes of the signals according to magnitude intervals of the excitation signals. The normalization unit 512 normalizes the accumulated maximum values for the probability distributions accumulated through the accumulation unit 511 to coincide with each other.

대표값 산출부(520)는 정규 누적 확률 분포 산출부(510)를 통해 산출된 정규 누적 확률 분포들로부터 대표값을 산출하여 기준 누적 확률 분포로 설정한다. 마이크로폰 어레이를 구성하는 개별 마이크로폰들의 개수만큼 음원 신호가 생성되고, 생성된 음원 신호에 해당하는 정규 누적 확률 분포가 산출되면, 이들을 대표하는 대표값을 산출하여 이후의 음원 신호 보정을 위한 기준으로 설정한다. 예를 들어, 개별 마이크로폰들이 4 개인 경우를 가정하면, 정규 누적 확률 분포 역시 4 개가 산출될 것이며, 이들 4 개의 정규 누적 확률 분포로부터 특정 규칙에 따라 1 개의 기준 누적 확률 분포를 산출하게 된다. 여기서, 특정 규칙은 대표값을 산출하는 임의의 함수의 형태로 구현이 가능할 것이다.The representative value calculating unit 520 calculates a representative value from the normal cumulative probability distributions calculated through the normal cumulative probability distribution calculating unit 510 and sets the representative cumulative probability distribution. When a sound source signal is generated by the number of individual microphones constituting the microphone array, and a normal cumulative probability distribution corresponding to the generated sound source signal is calculated, a representative value representative of the representative cumulative probability distribution is calculated and set as a criterion for subsequent sound source signal correction . For example, assuming that there are four individual microphones, four regular cumulative probability distributions will be calculated, and one standard cumulative probability distribution is calculated from these four regular cumulative probability distributions according to a specific rule. Here, the specific rule may be implemented in the form of an arbitrary function that calculates a representative value.

대표값을 산출하는 방법으로는 통상적으로 임의의 값들을 대표할 수 있는 값을 선택하는 다양한 방법들이 사용 가능하며, 이러한 방법들은 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자가 용이하게 도출할 수 있는 것이다. 대표값을 산출하는 방법들 중 대표적인 것들을 간단히 예시하면 다음과 같다.As a method for calculating the representative value, various methods can be used, which typically select values that can represent arbitrary values, and these methods can be easily obtained by those having ordinary skill in the art to which the present invention belongs. It is. Representative methods of calculating representative values will be briefly described as follows.

첫째, 평균값(average)을 이용해 대표값으로 설정할 수 있다. 즉, 복수 개의 정규 누적 확률 분포들의 신호의 크기의 평균값을 구해 기준 누적 확률 분포로 설정한다. 둘째, 중앙값(median)을 이용해 대표값으로 설정할 수 있다. 이 방법은 음 원 신호들의 분포 정도를 고려하여 신호의 크기의 중앙값에 해당하는 정규 누적 확률 분포를 기준 누적 확률 분포로 설정하는 방법이다. 이상의 방법들 이외에 다수의 음원 신호들 중 1 이상의 음원 신호들을 선택하여, 선택된 음원 신호들에 더 많은 가중치를 부여하고 대표값을 산출하는 방법도 사용 가능할 것이다.First, it can be set to the representative value by using the average value (average). That is, an average value of signal magnitudes of a plurality of normal cumulative probability distributions is obtained and set as a reference cumulative probability distribution. Second, it can be set as a representative value using the median. This method is a method of setting the normal cumulative probability distribution corresponding to the median value of the signal size to the reference cumulative probability distribution considering the degree of distribution of the sound signals. In addition to the above methods, a method may be used in which one or more sound source signals among a plurality of sound source signals are selected, more weight is given to the selected sound source signals, and a representative value is calculated.

이상에서 도 2의 기준 확률 분포 산출부(211)에서 기준 누적 확률 분포를 산출하는 과정을 설명하였다. 이러한 과정을 통해, 복수 개의 마이크로폰들 간의 특성 불일치로 인해 입력된 음원 신호들 간에 차이가 발생한 경우, 이러한 음원 신호들을 보정하기 위한 기준이 되는 기준 누적 확률 분포를 얻을 수 있다.The process of calculating the reference cumulative probability distribution in the reference probability distribution calculating unit 211 of FIG. 2 has been described. Through this process, when there is a difference between the input sound signals due to the characteristic mismatch between the plurality of microphones, a reference cumulative probability distribution serving as a reference for correcting the sound source signals can be obtained.

다음으로, 신호 보정부(212)는 기준 확률 분포 산출부(211)를 통해 산출된 기준 누적 확률 분포에 따라 음원 신호들을 보정한다. 따라서, 신호 보정부(212)는 마이크로폰 어레이를 구성하는 개별 마이크로폰들의 개수만큼 존재할 수 있으며, 각각의 음원 신호(채널을 의미한다.)들을 기준 누적 확률 분포를 참조하여 보정한다. 이하에서는 도 6 및 도 7을 참조하여 신호 보정부(212)에서 음원 신호들을 보정하는 방법을 보다 상세하게 설명한다.Next, the signal correcting unit 212 corrects the sound source signals according to the reference cumulative probability distribution calculated through the reference probability distribution calculating unit 211. [ Accordingly, the signal corrector 212 may exist as many as the number of individual microphones constituting the microphone array, and corrects the respective sound source signals (meaning channels) by referring to the reference cumulative probability distribution. Hereinafter, a method of correcting the excitation signals in the signal corrector 212 will be described in more detail with reference to FIGS. 6 and 7. FIG.

도 6은 본 발명의 일 실시예에 따른 음원 신호 보정 장치에서 변환 함수를 이용하여 음원 신호를 보정하는 구성을 상세하게 도시한 블럭도로서, 신호 보정부(620)는 다시 확률 분포 변환부(621)를 포함한다.6 is a detailed block diagram illustrating a configuration of a sound source signal correction apparatus according to an exemplary embodiment of the present invention that corrects a sound source signal using a conversion function. The signal correction unit 620 further includes a probability distribution conversion unit 621 ).

우선, 정규 누적 확률 분포 산출부(610)를 통해 음원 신호들로부터 정규 누적 확률 분포를 산출한다. 이 과정은 앞서 도 2에서 설명한 바와 같다. 다음으로, 확률 분포 변환부(621)는 음원 신호들로부터 기준 누적 확률 분포로 변환하는 함수 를 이용하여 정규 누적 확률 분포 산출부(610)를 통해 산출된 정규 누적 확률 분포들로부터 보정된 음원 신호들을 생성한다. 즉, 정규 누적 확률 분포를 기준값에 따라 보정한다. 이러한 보정 과정은 기준 누적 확률 분포에 따라 샘플 대 샘플(sample-by-sample) 비교 과정을 통해 수행된다. 샘플 대 샘플 비교란 보정 전의 음원 신호의 특정 샘플을 이에 대응하는 기준 샘플(기준 누적 확률 분포에 따른 값을 의미한다.)과 비교하여 변환하는 것을 말한다. 이상의 변환 과정의 이해를 돕기 위해 도 7을 참조한다.First, the normal cumulative probability distribution is calculated from the sound source signals through the normal cumulative probability distribution calculating unit 610. This process is as described in FIG. 2 above. Next, the probability distribution transforming unit 621 transforms the corrected sound source signals from the normal cumulative probability distributions calculated through the normal cumulative probability distribution calculating unit 610 using a function of converting the sound source signals into the reference cumulative probability distribution . That is, the normal cumulative probability distribution is corrected according to the reference value. This correction process is performed through a sample-by-sample comparison process according to the reference cumulative probability distribution. The sample-to-sample comparison refers to a comparison of a specific sample of a sound source signal before correction with a corresponding reference sample (which means a value according to a reference cumulative probability distribution). To help understand the above conversion process, refer to FIG.

도 7은 본 발명의 일 실시예에 따른 음원 신호 보정 장치의 확률 분포 변환부에서 보정된 음원 신호를 생성하는 방법을 도시한 도면이다. 도 7에서 실선으로 도시된 그래프는 기준 누적 확률 분포의 히스토그램이고, 점선으로 도시된 그래프는 현재 보정되지 않은 음원 신호에 해당하는 정규 누적 확률 분포의 히스토그램이다. 그래프의 세로축은 0에서 1까지의 값으로 정규화한 것을 가정한다. 음원 신호를 보정한다는 것은 특정 샘플에 대한 정규 누적 확률 분포 값을 기준 누적 확률 분포 값에 일치시킨다는 것을 의미한다.7 is a diagram illustrating a method of generating a corrected sound source signal in a probability distribution conversion unit of a sound source signal correction apparatus according to an embodiment of the present invention. The graph shown by the solid line in FIG. 7 is the histogram of the reference cumulative probability distribution, and the graph shown by the dotted line is the histogram of the normal cumulative probability distribution corresponding to the currently uncorrected sound source signal. It is assumed that the vertical axis of the graph is normalized to a value from 0 to 1. Correction of the sound source signal means that the normal cumulative probability distribution value for a specific sample is matched to the reference cumulative probability distribution value.

예를 들어, 그래프의 가로축에서 A 값이 보정되지 않은 음원 신호의 특정 값(샘플을 의미한다.)을 나타날 때, A 값에 해당하는 누적 확률 분포는 세로축의 C 값이 될 것이다. 이 때, C 값을 기준으로 이러한 누적 확률 분포에 해당하는 입력값을 역으로 추정하면, 보정된 입력 신호는 기준 누적 확률 분포의 히스토그램의 가로축에서 B 값이 될 것이다. 즉, B 값은 보정되지 않은 마이크로폰 입력 신호 A 값을 보정한 신호가 된다.For example, when the A value on the horizontal axis of the graph shows a specific value (meaning a sample) of the sound source signal that is not corrected, the cumulative probability distribution corresponding to the A value will be the C value on the vertical axis. In this case, if the input value corresponding to this cumulative probability distribution is inversely estimated based on the C value, the corrected input signal will be the B value in the horizontal axis of the histogram of the reference cumulative probability distribution. That is, the B value is a signal obtained by correcting the uncorrected microphone input signal A value.

이상에서 도 6 및 도 7을 참조하여 도 2의 신호 보정부(212)가 음원 신호를 보정하는 과정을 설명하였다. 이하에서는 앞서 설명한 도 2의 음원 신호 보정 장치(210)에서 기준 확률 분포를 산출하는 과정과 산출된 기준 확률 분포에 따라 음원 신호를 보정하는 과정을 수학식을 통해 보충 설명하겠다.The process of correcting the tone signal by the signal corrector 212 of FIG. 2 has been described with reference to FIGS. 6 and 7. FIG. Hereinafter, a process of calculating the reference probability distribution in the sound source signal correction apparatus 210 of FIG. 2 and a process of correcting the sound source signal in accordance with the calculated reference probability distribution will be described below with reference to the mathematical expression.

우선, 마이크로폰 어레이를 구성하는 개별 마이크로폰들의 개수를 k 개라고 가정할 때, k 개의 마이크로폰 입력 신호들을

이라고 하면, 이들 마이크로폰 입력 신호들에 대응하는 정규 누적 확률 분포들은

와 같이 표현할 수 있다. 이 때, 정규 누적 확률 분포들은

이라는 조건을 만족한다. 이어서, 마이크로폰 입력 신호 각각의 누적 확률 분포로부터 기준 누적 확률 분포를 산출하면 다음의 수학식 1과 같이 표현된다.First, assuming that the number of individual microphones constituting the microphone array is k, k microphone input signals

, Normal cumulative probability distributions corresponding to these microphone input signals < RTI ID = 0.0 >

Can be expressed as At this time, the normal cumulative probability distributions

Is satisfied. Next, the reference cumulative probability distribution is calculated from the cumulative probability distribution of each of the microphone input signals, as expressed by the following Equation (1).

여기서,

은 기준 누적 확률 분포에 따른 함수이고, f(·)는 대표값을 산출하는 함수를 의미한다. 앞서 도 5a에서 설명한 바와 같이 f(·)는 평균값이나 중앙값을 구하는 함수가 될 수 있을 것이다.here,

Is a function according to the reference cumulative probability distribution, and f (·) is a function that calculates a representative value. As described above with reference to FIG. 5A, f (.) May be a function for obtaining an average value or a median value.

다음으로, 마이크로폰 입력 신호가 기준 누적 확률 분포를 따르도록 보정하 는 과정은 다음의 수학식 2과 같이 표현된다.Next, the process of correcting the microphone input signal to follow the reference cumulative probability distribution is expressed as Equation 2 below.

여기서,

는 보정되지 않은 마이크로폰 입력 신호이고,

는 보정된 마이크로폰 입력 신호를 의미한다. 또한,

는 정규 누적 확률 분포에 따른 함수이고,

은 기준 누적 확률 분포에 따른 함수를 의미한다. 즉, 수학식 2는 앞서 도 7을 통해 설명한 바와 같이, 특정 입력 신호에 대한 정규 누적 확률 분포 값을 기준 누적 확률 분포 값에 일치시키는 입력 신호 보정 과정을 나타낸다.here,

Is an uncorrected microphone input signal,

Means a corrected microphone input signal. Also,

Is a function according to the normal cumulative probability distribution,

Denotes a function according to the reference cumulative probability distribution. Equation (2) represents an input signal correction process for matching the normal cumulative probability distribution value of a specific input signal with a reference cumulative probability distribution value, as described above with reference to FIG.

수학식 2를 보정된 마이크로폰 입력 신호를 중심으로 정리하면 다음의 수학식 3과 같다.Equation (2) can be summarized with the corrected microphone input signal as a center.

이상의

및

과 같은 확률 분포에 따른 함수들은 일종의 비선형 변환 함수들로서, 수학식 3을 통해 도 2의 신호 보정부(212)는 최종적으로 보정된 음원 신호를 생성하여 신호 처리부(220)에 공급한다.ideal

And

The signal corrector 212 of FIG. 2 generates a final corrected sound source signal and supplies the corrected sound source signal to the signal processing unit 220 through Equation (3).

신호 처리부(220)는 마이크로폰 어레이를 구비한 디지털 신호 처리 기기에서 통상적으로 구비하는 구성으로서, 본 발명의 실시예들이 구현되는 환경에 따라 자 유롭게 설계될 수 있으므로 여기에서는 자세한 설명을 생략한다.The signal processing unit 220 is typically provided in a digital signal processing apparatus having a microphone array. Since the signal processing unit 220 can be freely designed according to the environment in which the embodiments of the present invention are implemented, detailed description is omitted here.

이상에서 본 발명의 다양한 실시예들을 참조하여 음원 신호 보정 장치의 구성 및 역할을 상세하게 설명하였다. 본 실시예들에 따르면 마이크로폰 어레이를 통해 입력된 음원 신호들의 확률 분포들에 기초하여 기준 확률 분포를 산출하고, 이에 따라 음원 신호들을 보정함으로써, 음원 획득이 가능한 디지털 기기에서 마이크로폰 어레이를 구성하는 개별 마이크로폰들 간의 특성 불일치로 인해 음원 신호가 왜곡되는 문제점이 해소된다. 또한, 일단 기준 확률 분포가 산출되면, 입력된 음원 신호들을 실시간으로 보정할 수 있으므로 빠른 오류 보완이 가능해진다.The configuration and the role of the sound source signal correction apparatus have been described in detail with reference to the various embodiments of the present invention. According to the present embodiments, the reference probability distribution is calculated based on the probability distributions of the sound source signals input through the microphone array, and the sound source signals are corrected accordingly, whereby the individual microphones The problem that the sound source signal is distorted due to the characteristic mismatch between the signals is eliminated. In addition, once the reference probability distribution is calculated, input sound source signals can be corrected in real time, so that quick error compensation becomes possible.

도 3은 본 발명의 다른 실시예에 따른 저장부가 추가된 음원 신호 보정 장치를 도시한 블럭도로서, 도 2의 음원 신호 보정 장치(210)에 저장부(315)를 추가한 것이다.FIG. 3 is a block diagram illustrating a sound source signal correction apparatus to which a storage unit according to another embodiment of the present invention is added, in which a storage unit 315 is added to the sound source signal correction apparatus 210 of FIG.

도 3의 음원 신호 보정 장치(310)는 도 2와 마찬가지로 기준 확률 분포 산출부(311) 및 신호 보정부(312)를 포함하므로 이하에서는 저장부(315)를 중심으로 구성상의 특징을 설명하겠다.3 includes a reference probability distribution calculating unit 311 and a signal correcting unit 312 as in FIG. 2, and therefore features of the configuration will be described with the storage unit 315 as a center.

저장부(315)는 기준 확률 분포 산출부(311)를 통해 산출된 기준 확률 분포를 저장한다. 앞서 설명한 바와 같이 마이크로폰들 간의 특성 불일치는 마이크로폰의 제조 과정에서의 오차나 마이크로폰들의 사용에 따른 노화로 인해 야기된다. 따라서, 일단 특성 불일치가 발생하였다면 이러한 문제점이 단시간 내에 크게 변화하지는 않을 가능성이 많다. 즉, 특성 불일치 자체는 시간에 따라 천천히 변화한다. 그러므로, 일단 기준 누적 확률 분포가 산출되었다면 단시간 내에 여러번 산출할 필 요는 없는 것이다. 오히려, 음원 신호를 보정할 때마다 매번 기준 확률 분포를 산출한다면 시스템 자원의 낭비가 될 것이다.The storage unit 315 stores the reference probability distribution calculated through the reference probability distribution calculating unit 311. [ As described above, the property mismatch between microphones is caused by errors in the manufacturing process of the microphones and aging due to use of the microphones. Therefore, once a characteristic mismatch occurs, there is a high possibility that such a problem will not change greatly within a short time. That is, the property mismatch itself slowly changes with time. Therefore, once the baseline cumulative probability distribution is calculated, it is not necessary to calculate it several times in a short time. Rather, if the reference probability distribution is calculated each time the sound source signal is corrected, it will be a waste of system resources.

이상과 같은 이유로 저장부(315)는 이미 산출된 기준 확률 분포를 다시 산출할 필요가 없도록 저장하는 역할을 수행한다. 이러한 저장부(315)는 데이터를 기록할 수 있는 기록 장치나 네트워크를 통해 연결된 특정 저장 장치가 될 수 있을 것이다. 이어서, 신호 보정부(312)는 저장부(315)에 저장된 기준 확률 분포를 읽어들이고, 읽어들인 기준 확률 분포에 따라 음원 신호들을 보정한다. 도 3에서 음원 신호 보정 장치(310)가 보정을 수행하던 중 일정 시간이 경과하거나, 마이크로폰들의 특성이 변화하였다고 판단될 경우, 다시 기준 확률 분포 산출부(311)를 통해 새롭게 기준 확률 분포를 산출할 수 있을 것이고, 새롭게 산출된 기준 확률 분포를 저장부(315)에 갱신함으로써, 변화된 마이크로폰들 간의 특성 불일치를 해소할 수 있을 것이다.For this reason, the storage unit 315 stores the calculated reference probability distribution so that it does not have to be calculated again. The storage unit 315 may be a recording device capable of recording data or a specific storage device connected through a network. Then, the signal correction unit 312 reads the reference probability distribution stored in the storage unit 315, and corrects the sound source signals according to the read reference probability distribution. 3, if it is determined that the sound source signal correction apparatus 310 has performed a predetermined period of time or that the characteristics of the microphones have changed, the reference signal distribution unit 311 newly calculates the reference probability distribution again through the reference probability distribution calculating unit 311 And updating the newly calculated reference probability distribution to the storage unit 315, thereby solving the characteristic mismatch between the changed microphones.

본 실시예를 통해 반복적으로 기준 확률 분포를 산출하지 않고, 일단 산출된 기준 확률 분포를 저장하였다가 음원 신호의 보정시에 활용함으로써 불필요한 연산을 줄이고, 일단 기준 확률 분포가 산출된 이후에는 해당 과정의 생략이 가능하므로 빠른 음원 신호 보정이 가능해진다.In this embodiment, the reference probability distribution is not repeatedly calculated, and the reference probability distribution once calculated is stored. However, the unnecessary calculation is reduced by utilizing the reference probability distribution for correction of the sound source signal. Once the reference probability distribution is calculated, It is possible to omit it, and it is possible to correct the sound source signal quickly.

도 8은 본 발명의 또 다른 실시예에 따른 음원 신호 보정 방법을 도시한 순서도로서, 다음과 같은 단계들을 포함한다.FIG. 8 is a flowchart illustrating a sound source signal correction method according to another embodiment of the present invention, which includes the following steps.

810 단계에서 마이크로폰 어레이를 통해 획득한 음원 신호들의 크기 구간별로 각 구간에 존재하는 음원 신호들의 수를 확률로서 표현한 확률 분포들을 산출한 다. 확률 분포들은 음원 신호들의 확률 분포들을 누적시킨 후 최대값을 일치시키는 정규화 과정을 통해 산출될 수 있다.In step 810, probability distributions representing the number of sound source signals present in each interval as probability are calculated for each magnitude interval of the sound source signals acquired through the microphone array. Probability distributions can be calculated through a normalization process of accumulating probability distributions of sound source signals and then matching the maximum values.

820 단계에서 810 단계를 통해 산출된 확률 분포들에 기초하여 각각의 확률 분포들을 대표하는 기준 확률 분포를 산출한다. 기준 확률 분포는 산출된 확률 분포들로부터 평균값 또는 중앙값과 같은 대표값을 산출하여 기준 확률 분포로 설정함으로써 얻을 수 있다.The reference probability distribution representing each of the probability distributions is calculated based on the probability distributions calculated in step 820 through step 810. [ The reference probability distribution can be obtained by calculating a representative value such as an average value or a median value from the calculated probability distributions and setting the reference probability distribution.

830 단계에서 820 단계를 통해 산출된 기준 확률 분포에 따라 810 단계에서 획득한 음원 신호들을 보정한다. 보정된 음원 신호들은 음원 신호들로부터 810 단계에서 산출한 기준 확률 분포로 변환하는 함수의 역함수를 이용하여 확률 분포들을 입력함으로써 얻을 수 있다.In step 830, the sound source signals acquired in step 810 are corrected according to the reference probability distribution calculated in step 820. [ The corrected sound source signals can be obtained from the sound source signals by inputting probability distributions using an inverse function of a function for converting to a reference probability distribution calculated in step 810. [

본 실시예에 따르면 마이크로폰 어레이를 구성하는 개별 마이크로폰들 간의 특성 불일치로 인해 음원 신호가 왜곡되는 문제점이 해소되며, 산출된 기준 누적 확률 분포에 따라 음원 신호의 보정이 실시간으로 이루어지므로 빠른 오류 보완이 가능하다.According to the present embodiment, the problem that the sound source signal is distorted due to the characteristic mismatch between the individual microphones constituting the microphone array is solved, and the correction of the sound source signal is performed in real time according to the calculated reference cumulative probability distribution. Do.

한편, 본 발명은 컴퓨터로 읽을 수 있는 기록 매체에 컴퓨터가 읽을 수 있는 코드로 구현하는 것이 가능하다. 컴퓨터가 읽을 수 있는 기록 매체는 컴퓨터 시스템에 의하여 읽혀질 수 있는 데이터가 저장되는 모든 종류의 기록 장치를 포함한다.Meanwhile, the present invention can be embodied in computer readable code on a computer readable recording medium. A computer-readable recording medium includes all kinds of recording apparatuses in which data that can be read by a computer system is stored.

컴퓨터가 읽을 수 있는 기록 매체의 예로는 ROM, RAM, CD-ROM, 자기 테이프, 플로피디스크, 광 데이터 저장장치 등이 있으며, 또한 캐리어 웨이브(예를 들어 인 터넷을 통한 전송)의 형태로 구현하는 것을 포함한다. 또한, 컴퓨터가 읽을 수 있는 기록 매체는 네트워크로 연결된 컴퓨터 시스템에 분산되어, 분산 방식으로 컴퓨터가 읽을 수 있는 코드가 저장되고 실행될 수 있다. 그리고 본 발명을 구현하기 위한 기능적인(functional) 프로그램, 코드 및 코드 세그먼트들은 본 발명이 속하는 기술 분야의 프로그래머들에 의하여 용이하게 추론될 수 있다.Examples of the computer-readable recording medium include a ROM, a RAM, a CD-ROM, a magnetic tape, a floppy disk, an optical data storage device, and the like in the form of a carrier wave (for example, transmission via the Internet) . In addition, the computer-readable recording medium may be distributed over network-connected computer systems so that computer readable codes can be stored and executed in a distributed manner. In addition, functional programs, codes, and code segments for implementing the present invention can be easily deduced by programmers skilled in the art to which the present invention belongs.

이상에서 본 발명에 대한 다양한 실시예들을 중심으로 살펴보았다. 본 발명에 속하는 기술 분야에서 통상의 지식을 가진 자는 본 발명이 본 발명의 본질적인 특성에서 벗어나지 않는 범위에서 변형된 형태로 구현될 수 있음을 이해할 수 있을 것이다. 그러므로 개시된 실시예들은 한정적인 관점이 아니라 설명적인 관점에서 고려되어야 한다. 본 발명의 범위는 전술한 설명이 아니라 특허청구범위에 나타나 있으며, 그와 동등한 범위 내에 있는 모든 차이점은 본 발명에 포함된 것으로 해석되어야 할 것이다.Various embodiments of the present invention have been described above. It will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims. Therefore, the disclosed embodiments should be considered in an illustrative rather than a restrictive sense. The scope of the present invention is defined by the appended claims rather than by the foregoing description, and all differences within the scope of equivalents thereof should be construed as being included in the present invention.

도 1은 본 발명이 해결하고자 하는 문제 상황에서 음원 신호를 보정하는 개략적인 아이디어를 도시한 블럭도이다.1 is a block diagram illustrating a schematic idea of correcting a sound source signal in a problem situation to be solved by the present invention.

도 2는 본 발명의 일 실시예에 따른 음원 신호 보정 장치를 도시한 블럭도이다.2 is a block diagram illustrating a sound source signal correction apparatus according to an embodiment of the present invention.

도 3은 본 발명의 다른 실시예에 따른 저장부가 추가된 음원 신호 보정 장치를 도시한 블럭도이다.3 is a block diagram illustrating a sound source signal correction apparatus to which a storage unit according to another embodiment of the present invention is added.

도 4는 본 발명의 일 실시예에 따른 음원 신호 보정 장치에서 누적 확률 분포를 산출하는 방법을 예시한 도면이다.4 is a diagram illustrating a method of calculating a cumulative probability distribution in a sound source signal correction apparatus according to an embodiment of the present invention.

도 5a 및 도 5b는 본 발명의 일 실시예에 따른 음원 신호 보정 장치에서 기준 누적 확률 분포를 산출하는 구성을 상세하게 도시한 블럭도이다.5A and 5B are block diagrams illustrating a detailed configuration of calculating a reference cumulative probability distribution in a sound source signal correction apparatus according to an embodiment of the present invention.

도 6은 본 발명의 일 실시예에 따른 음원 신호 보정 장치에서 변환 함수를 이용하여 음원 신호를 보정하는 구성을 상세하게 도시한 블럭도이다.6 is a block diagram illustrating in detail a configuration for correcting a sound source signal using a conversion function in a sound source signal correction apparatus according to an embodiment of the present invention.

도 7은 본 발명의 일 실시예에 따른 음원 신호 보정 장치의 확률 분포 변환부에서 보정된 음원 신호를 생성하는 방법을 도시한 도면이다.7 is a diagram illustrating a method of generating a corrected sound source signal in a probability distribution conversion unit of a sound source signal correction apparatus according to an embodiment of the present invention.

도 8은 본 발명의 또 다른 실시예에 따른 음원 신호 보정 방법을 도시한 순서도이다.8 is a flowchart illustrating a sound source signal correction method according to another embodiment of the present invention.

Claims

Calculating probability distributions representing the number of sound source signals present in each interval as probabilities for each magnitude interval of the same sound source signals acquired through a microphone array;

Calculating a reference probability distribution representing the probability distributions based on the calculated probability distributions; And

And correcting the sound source signals according to the calculated reference probability distribution.

The method according to claim 1,

Wherein the calculating of the probability distributions comprises calculating and accumulating probability distributions of the magnitudes of the sound source signals for each of the intervals,

Wherein the step of calculating the reference probability distribution calculates the reference probability distribution based on the accumulated probability distributions.

3. The method of claim 2,

Wherein the step of calculating the probability distributions further comprises the step of normalizing the cumulative probability distributions such that the maximum values of the cumulative probability distributions coincide with a predetermined value,

Wherein the step of calculating the reference probability distribution calculates the reference probability distribution based on the normalized cumulative probability distributions.

The method according to claim 1,

Wherein the step of calculating the reference probability distribution calculates an average value or a median value from the calculated probability distributions and sets the reference probability distribution as the reference probability distribution.

The method according to claim 1,

Wherein the step of correcting the sound source signals comprises generating the corrected sound source signals by inputting the probability distributions into an inverse function of a function of converting the sound source signals into the reference probability distribution.

The method according to claim 1,

Further comprising the step of previously storing the calculated reference probability distribution,

Wherein the step of correcting the sound source signals comprises reading the pre-stored reference probability distribution and correcting the sound source signals according to the read reference probability distribution.

A computer-readable recording medium storing a program for causing a computer to execute the method according to any one of claims 1 to 6.

A probability distribution calculating unit for calculating probability distributions representing the number of the sound source signals present in each interval as probabilities according to magnitude intervals of the same sound source signals acquired through the microphone array;

A reference probability distribution calculating unit for calculating a reference probability distribution representing the probability distributions based on the calculated probability distributions; And

And a signal correction unit for correcting the sound source signals according to the calculated reference probability distribution.

9. The method of claim 8,

Wherein the probability distribution calculating unit includes an accumulating unit for calculating and accumulating probability distributions of magnitudes of the sound source signals for each of the intervals,

Wherein the reference probability distribution calculating unit calculates the reference probability distribution based on the accumulated probability distributions.

10. The method of claim 9,

Wherein the probability distribution calculating unit further includes a normalizing unit for normalizing the accumulated probability distributions such that maximum values of the accumulated probability distributions are equal to a predetermined value,

Wherein the reference probability distribution calculating unit calculates the reference probability distribution based on the normalized cumulative probability distributions.

9. The method of claim 8,

Wherein the reference probability distribution calculating unit calculates an average value or a median value from the calculated probability distributions and sets the reference probability distribution as the reference probability distribution.

9. The method of claim 8,

Wherein the signal corrector comprises a probability distribution transformer for generating corrected sound source signals by inputting the probability distributions into an inverse function of a function for converting from the sound source signals to the reference probability distribution.

9. The method of claim 8,

And a storage unit for storing the calculated reference probability distribution in advance,

Wherein the signal correction unit reads the pre-stored reference probability distribution and corrects the sound source signals according to the read reference probability distribution.