KR100653173B1

KR100653173B1 - Multi-channel blind source separation mechanism for solving the permutation ambiguity

Info

Publication number: KR100653173B1
Application number: KR1020050114409A
Authority: KR
Inventors: 장인선; 김민제; 백승권; 강경옥; 최승진; 홍진우
Original assignee: 한국전자통신연구원; 학교법인 포항공과대학교
Priority date: 2005-11-01
Filing date: 2005-11-28
Publication date: 2006-12-05

Abstract

A method and an apparatus for solving the permutation ambiguity of a separation coefficient of a multi-channel mixed signal are provided to prevent the mixing of different sound source signals, thereby accurately separating an original signal from the mixed signal. An ICA(Independent Component Analysis) input matrix is created in a row vector type of a back-mixing matrix corresponding to a separation coefficient(401). A base vector matrix and a deflection coefficient matrix are extracted using the created ICA method(402). A base vector index value regarding a column vector of a mixing vector is selected by comparing the extracted base vector matrix and deflection coefficient matrix with each other(403). A corresponding clustering including the column vector of the mixing vector is determined based on the selected base vector index value(404).

Description

Method for solving exchange ambiguity of multipath mixed signal separation coefficient and its apparatus {Multi-channel blind source separation mechanism for solving the permutation ambiguity}

도 1은 본 발명이 적용되는 다중경로 혼합신호 분리 시스템에 대한 일실시예 구성도,1 is a configuration diagram of an embodiment of a multipath mixed signal separation system to which the present invention is applied;

도 2는 반향성이 있는 실제환경에서 수집된 혼합신호로부터 원신호를 분리한 경우에 분리계수의 교환 모호성이 발생된 것을 보여주기 위한 일실시예 설명도,FIG. 2 is a diagram illustrating an embodiment in which an exchange ambiguity of a separation coefficient is generated when an original signal is separated from a mixed signal collected in an echoic real environment; FIG.

도 3은 본 발명에 따른 분리계수 교환 모호성 해소 장치에 대한 일실시예 구성도,Figure 3 is an embodiment configuration diagram for the separation coefficient exchange ambiguity resolution device according to the present invention,

도 4는 본 발명에 따른 분리계수 교환 모호성 해소 방법에 대한 일실시예 순서도,4 is a flow chart of an embodiment of a separation coefficient exchange ambiguity resolution method according to the present invention;

도 5는 본 발명에서 독립성분분석법을 사용하여 도출한 주파수 응답의 열벡터 분포의 방향성을 보여주기 위한 일실시예 그래프,5 is an embodiment graph for showing the direction of the column vector distribution of the frequency response derived using the independent component analysis method in the present invention,

도 6a 및 도 6b는 본 발명이 적용되어져 분리계수의 교환 모호성이 해소된 것을 보여주기 위한 일실시예 그래프이다.6A and 6B are graphs of one embodiment for showing that the exchange ambiguity of the separation coefficient is solved by applying the present invention.

* 도면의 주요 부분에 대한 부호의 설명* Explanation of symbols for the main parts of the drawings

41 : 행렬 생성부 42 : 독립성분분석 수행부41: matrix generator 42: independent component analysis unit

43 : 기초벡터 색인값 선정부 44 : 행렬 군집화 결정부43: base vector index value selection unit 44: matrix clustering determination unit

본 발명은 다중경로 혼합신호 분리계수의 교환 모호성을 해소하는 방법 및 그 장치에 관한 것으로, 더욱 상세하게는 반향성이 있는 실제환경에서 수집된 혼합신호에 대해 음원과 녹음환경에 대한 사전정보 없이 고품질의 원신호를 분리하는 경우에 있어, 분리계수 교환 모호성(Permutation Ambiguity)을 해소하기 위해 독립성분분석법(ICA; Independent Component Analysis method)을 사용하여 주파수 영역에서 분리계수가 갖는 내재적인 방향성을 추정하고서 이 방향성을 토대로 각 분리계수를 군집화(Clustering)시켜 각각의 원신호가 정렬되도록 하는, 다중경로 혼합신호 분리계수의 교환 모호성을 해소하는 방법 및 그 장치에 관한 것이다.The present invention relates to a method and a device for solving the exchange ambiguity of a multipath mixed signal separation coefficient, and more particularly, to a mixed signal collected in a reflective real environment without any prior information on a sound source and a recording environment. In the case of separating the original signal of, the independent component analysis method (ICA) is used to estimate the inherent direction of the separation coefficient in the frequency domain in order to solve the permutation ambiguity. A method and apparatus for solving the exchange ambiguity of a multipath mixed signal separation coefficient for clustering each separation coefficient on the basis of directionality so that each original signal is aligned.

블라인드 신호 분리(Blind Signal Separation)란 혼합된 신호로부터 원신호를 추출하는 기술로서, 블라인드가 의미하는 바와 같이 혼합환경 및 원신호에 대한 정보 없이 혼합신호로부터 원신호를 추출하는 것을 목표로 한다. 이 블라인드 신호 분리는 최근 들어서 음향/영상 처리, 생체신호 분석 및 통신 등 다양한 분야에서 각광받고 있다.Blind signal separation is a technique of extracting an original signal from a mixed signal, and aims to extract the original signal from the mixed signal without information on the mixed environment and the original signal as blinds mean. Recently, blind signal separation has been spotlighted in various fields such as acoustic / image processing, biosignal analysis, and communication.

이러한 블라인드 신호 분리는 전통적으로 "상호 신호들은 서로 독립적이다"라는 가정을 바탕으로 문제의 해에 접근, 예컨대 신호의 독립성에 대한 가정은 상당한 제약성이 뒤따르기 때문에 "원신호가 공간적으로 상관관계가 없고, 시간적으로 상관관계가 있다"라는 가정 하에 혼합신호의 이차 통계량(second order statistics)을 이용하여 블라인드 신호 분리의 해를 찾는 방법이 연구되고 있다. 즉, 블라인드 신호 분리의 해는 혼합회로를 추정하여 분리회로를 구하는 것이고, 이 분리회로를 통해 원신호를 복원하는 것을 그 목적으로 한다.This blind signal separation has traditionally been based on the assumption that "mutual signals are independent of each other" and thus the solution to the problem, such as the assumption of signal independence, is subject to significant constraints. A method for finding blind signal separation using a second order statistics of mixed signals is being studied under the assumption that "they are temporally correlated". That is, the solution of blind signal separation is to estimate the mixed circuit to obtain the separated circuit, and to recover the original signal through the separated circuit.

일반적으로, 실제환경에서 수집되는 신호들은 복적분 혼합회로에 의해 모델링되는데, 이러한 복적분 혼합회로 방식에서는 모델링하고자 하는 시스템의 긴 임펄스 응답으로 인해 분리회로 계수(이하, "분리계수"란 용어와 병행 표기하기로 함)의 수렴이 시간적 지연을 초래하고 불안정한 수렴을 갖게 되는 문제점이 있기 때문에, 복적분 혼합회로의 계수를 주파수 영역으로 전환하여 추정하는 방식이 주로 사용되고 있다.In general, the signals collected in the real environment are modeled by a double mixing circuit. In this double mixing circuit, due to the long impulse response of the system to be modeled, the parallel circuit coefficient (hereinafter referred to as "separation coefficient") is used. Since there is a problem that the convergence of the above) causes a time delay and an unstable convergence, a method of converting and estimating coefficients of the multiplication integration circuit into the frequency domain is mainly used.

상기와 같이 주파수 영역 상에서 분리계수를 추정하는 방식은 주파수 대역별로 각각의 음원 분리를 수행할 경우에, 선형 혼합 모델이 갖고 있는 근본적인 모호성, 즉 순서(이하, "교환"이란 용어와 병행 표기하기로 함)와 크기에 대한 모호성(Permutation and Scale Ambiguity)이 분리 성능에 상당한 영향을 미친다. 그에 따라, 이러한 기술에서는 주파수 대역별로 얻어진 선형 혼합 모델에 기반하여 분리한 음원에 대해 순서 및 크기가 원신호에 대응된다고 보장할 수 없기 때문에 하나의 음원을 분리하는데 있어 다른 음원과 섞일 가능성이 매우 높다.As described above, the method of estimating the separation coefficient in the frequency domain is to describe the fundamental ambiguity of the linear mixed model, that is, the order (hereinafter, referred to as “exchange”) in parallel when each sound source separation is performed for each frequency band. Permutation and scale ambiguity have a significant impact on separation performance. Therefore, this technique is very likely to mix with other sound sources in separating one sound source because it cannot guarantee that the order and magnitude correspond to the original signal for the separated sound sources based on the linear mixing model obtained for each frequency band. .

이에, 위와 같은 문제점을 극복하기 위해 최근에는 다음과 같은 방식들이 제안되었다.Thus, the following methods have recently been proposed to overcome the above problems.

첫 번째로, [L. Parra and C. Spence, Convolutive blind source separation of nonstationary sources, IEEE Trans. on SAP, pp320-327, May 2000] 논문에 개시된 방식으로서, 혼합회로 계수의 길이에 제한을 두어서 혼합회로 계수의 최신화를 점진적으로 하는 것이다. 그런데, 이러한 종래 방식에서는 복적분 혼합회로 시스템의 임펄스 응답이 긴 경우에 분리회로 계수의 길이에 상당한 제한을 두어야 되기 때문에 그로 인해 분리 성능이 현저히 떨어지는 문제점이 있다[예컨대 분리회로 계수의 수렴이 시간적 지연을 초래하고 불안정한 수렴을 갖는 점 등].First, [L. Parra and C. Spence, Convolutive blind source separation of nonstationary sources, IEEE Trans. on SAP, pp320-327, May 2000]. The method described in the paper is to gradually update the mixed circuit coefficient by limiting the length of the mixed circuit coefficient. However, in this conventional scheme, when the impulse response of the multi-integral mixed circuit system is long, a significant limit must be placed on the length of the separation circuit coefficient, which causes a problem that the separation performance is remarkably degraded. Resulting in unstable convergence).

두 번째로, [N.Murata, S.Ikeda and A.Ziehe, "An Approach to Blind Source Separation Based on Temporal Structure of Speech Signals", Neurocomputing, Vol. 41, Issue 1-4, pp.1-24, October 2001] 논문에 개시된 방식으로서, "신호가 갖고 있는 특정한 구조가 주파수가 변함에 따라 점진적으로 변한다"라는 사실에 의거해 인접 주파수 대역의 음원 분리 결과를 참조하여 신호를 재배열하는 것이다. 그런데, 이러한 종래 방식에서는 신호 배열을 한 번이라도 잘못하게 되면 이 잘못된 배열이 연속되는 주파수 영역으로 전파되어져 잘못된 신호 분리 결과를 초래하게 되는 문제점이 있다.Second, N. Murata, S. Ikeda and A. Ziehe, "An Approach to Blind Source Separation Based on Temporal Structure of Speech Signals", Neurocomputing, Vol. 41, Issue 1-4, pp.1-24, October 2001] As described in the paper, the separation of sound sources in adjacent frequency bands is based on the fact that "a particular structure of a signal changes gradually as the frequency changes." Refer to the result to rearrange the signals. However, in this conventional method, if the signal arrangement is made at least once, the wrong arrangement is propagated to a continuous frequency domain, resulting in a wrong signal separation result.

세 번째로, [L. Parra and C. Alvino, Geometric source separation: merging convolutive source separation with geometric beamforming, IEEE Trans. on SAP, Vol.10, No.6, pp352-361 September 2002] 논문에 개시된 방식으로서, 빔 포밍(Beamforming)을 사용하여 신호의 도착 방향성을 추정하고서 이 방향성을 토대로 신호를 배열하는 것이다. 그런데, 이러한 종래 방식에서는 샘플링 비율이 높은 고품질 신호에 빔포밍을 적용할 수 없는 문제점이 있다.Third, [L. Parra and C. Alvino, Geometric source separation: merging convolutive source separation with geometric beamforming, IEEE Trans. on SAP, Vol. 10, No. 6, pp352-361 September 2002] In the method disclosed in the paper, beam arrival is used to estimate the direction of arrival of the signal and to arrange the signal based on the direction. However, this conventional method has a problem in that beamforming cannot be applied to a high quality signal having a high sampling rate.

네 번째로, [H. Sawada, S. Araki, R. Mukai, S. Makino, Blind extraction of a dominant source signal from mixtures of many sources using ICA and time-frequency masking, IEEE International Symposium on Circuits and Systems (ISCAS 2005), pp. 5882-5885, May 2005] 논문에 개시된 방식으로서, 혼합계수로부터 혼합환경의 기하학적인 정보를 도출하고서 이 정보를 기반으로 하여 분리계수를 군집화시켜서 교환 모호성이 발생한 결과를 재정렬하는 것이다. 즉, 이러한 종래 방식에서는 혼합회로 계수에 대해 음원과 센서간의 최대거리 정보로 특정화된 정규화를 수행한 다음에 이 정규화 결과로서 남은 혼합환경의 기하학적인 정보만을 이용하여 군집화를 수행하는 것을 그 특징으로 한다. 그런데, 이러한 종래 방식은 음원과 센서간의 거리 정보만을 이용한다는 점에서 완화된 사전 지식을 요구하고 있으나, 정규화 수행 과정에서 선택되는 참조 센서에 따라 분리 성능이 일관되게 보장되지 못하는 문제점이 있다.Fourth, [H. Sawada, S. Araki, R. Mukai, S. Makino, Blind extraction of a dominant source signal from mixtures of many sources using ICA and time-frequency masking, IEEE International Symposium on Circuits and Systems (ISCAS 2005), pp. 5882-5885, May 2005], which derives geometric information of the mixed environment from the mixing coefficients and clusters the separation coefficients based on this information to rearrange the results of exchange ambiguity. That is, in this conventional method, the normalization is performed by specifying the maximum distance information between the sound source and the sensor for the mixed circuit coefficients, and then clustering is performed using only geometric information of the remaining mixed environment as a result of the normalization. . However, this conventional method requires a relaxed prior knowledge in that only the distance information between the sound source and the sensor is used, but there is a problem that the separation performance is not consistently guaranteed according to the reference sensor selected in the normalization process.

본 발명은 상기와 같은 문제점을 해결하기 위하여 제안된 것으로, 반향성이 있는 실제환경에서 수집된 혼합신호에 대해 음원과 녹음환경에 대한 사전정보 없이 고품질의 원신호를 분리하는 경우에 있어, 분리계수 교환 모호성(Permutation Ambiguity)을 해소하기 위해 독립성분분석법(ICA; Independent Component Analysis method)을 사용하여 주파수 영역에서 분리계수가 갖는 내재적인 방향성을 추정하고서 이 방향성을 토대로 각 분리계수를 군집화(Clustering)시켜 각각의 원신호가 정렬되도록 하는, 다중경로 혼합신호 분리계수의 교환 모호성을 해소하는 방법 및 그 장치를 제공하는데 그 목적이 있다.The present invention has been proposed to solve the above problems, in the case of separating the high quality original signal without prior information on the sound source and the recording environment for the mixed signal collected in the real environment with echo, separation coefficient In order to solve the permutation ambiguity, Independent Component Analysis method (ICA) is used to estimate the inherent directionality of the separation coefficient in the frequency domain and cluster each separation coefficient based on this direction. It is an object of the present invention to provide a method and apparatus for solving the exchange ambiguity of a multipath mixed signal separation coefficient, in which each original signal is aligned.

본 발명의 다른 목적 및 장점들은 하기의 설명에 의해서 이해될 수 있으며, 본 발명의 실시예에 의해 보다 분명하게 알게 될 것이다. 또한, 본 발명의 목적 및 장점들은 특허 청구 범위에 나타낸 수단 및 그 조합에 의해 실현될 수 있음을 쉽게 알 수 있을 것이다.Other objects and advantages of the present invention can be understood by the following description, and will be more clearly understood by the embodiments of the present invention. Also, it will be readily appreciated that the objects and advantages of the present invention may be realized by the means and combinations thereof indicated in the claims.

상기 목적을 달성하기 위한 본 발명의 방법은, 다중경로 혼합신호 분리계수의 교환 모호성을 해소하는 방법에 있어서, 혼합신호에 대해 추정된 분리계수를 입력받음에 따라 이 분리계수에 상응되는 역혼합 행렬의 행벡터로 독립성분분석 입력 행렬을 생성하는 행렬 생성 단계; 상기 생성한 행렬에 대해 독립성분분석법을 사용하여 기초벡터 행렬 및 편향계수 행렬을 추출하는 독립성분분석 수행 단계; 상기 추출한 기초벡터 행렬 및 편향계수 행렬을 비교하여 상기 역혼합 행렬 행벡터의 전치에 해당되는 혼합행렬의 열벡터에 대한 기초벡터 색인값을 선정하는 기초벡터 색인값 선정 단계; 및 상기 선정한 기초벡터 색인값을 토대로 상기 혼합행렬의 열벡터가 속하는 해당 군집을 결정하는 행렬 군집화 결정 단계를 포함한다.The method of the present invention for achieving the above object, in the method of solving the exchange ambiguity of the multipath mixed signal separation coefficient, the inverse mixing matrix corresponding to this separation coefficient in response to receiving the estimated separation coefficient for the mixed signal A matrix generation step of generating an independent component analysis input matrix with a row vector of; An independent component analysis step of extracting a base vector matrix and a deflection coefficient matrix from the generated matrix using an independent component analysis method; A base vector index value selection step of comparing the extracted base vector matrix and the deflection coefficient matrix and selecting a base vector index value of a column vector of a mixed matrix corresponding to a transpose of the inverse mixed matrix row vector; And a matrix clustering determining step of determining a corresponding cluster to which the column vector of the mixed matrix belongs based on the selected base vector index value.

한편, 본 발명의 장치는, 다중경로 혼합신호 분리계수의 교환 모호성을 해소하는 장치에 있어서, 혼합신호로부터 추정된 분리계수에 상응되는 역혼합 행렬의 행벡터로 독립성분분석 입력 행렬을 생성하기 위한 행렬 생성부; 상기 행렬 생성부에서 생성한 행렬에 대해 독립성분분석법을 사용하여 기초벡터 행렬 및 편향계수 행렬을 추출하기 위한 독립성분분석 수행부; 상기 독립성분분석 수행부에서 추출한 기초벡터 행렬 및 편향계수 행렬을 비교하여 상기 역혼합 행렬 행벡터의 전치에 해당되는 혼합행렬의 열벡터에 대한 기초벡터 색인값을 선정하기 위한 기초벡터 색인값 선정부; 및 상기 기초벡터 색인값 선정부에서 선정한 기초벡터 색인값을 토대로 상기 혼합행렬의 열벡터가 속하는 해당 군집을 결정하기 위한 행렬 군집화 결정부를 포함한다.On the other hand, the apparatus of the present invention, in the apparatus for solving the exchange ambiguity of the multipath mixed signal separation coefficient, the matrix for generating the independent component analysis input matrix as a row vector of the inverse mixed matrix corresponding to the separation coefficient estimated from the mixed signal Generation unit; An independent component analysis unit for extracting a base vector matrix and a deflection coefficient matrix from the matrix generated by the matrix generator using an independent component analysis method; A base vector index value selecting unit for comparing a base vector matrix and a deflection coefficient matrix extracted by the independent component analysis unit and selecting a base vector index value of a column vector of a mixed matrix corresponding to a transpose of the inverse mixed matrix row vector; And a matrix clustering determining unit for determining a corresponding cluster to which the column vector of the mixed matrix belongs based on the base vector index value selected by the base vector index value selecting unit.

상술한 목적, 특징 및 장점은 첨부된 도면과 관련한 다음의 상세한 설명을 통하여 보다 분명해 질 것이며, 그에 따라 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자가 본 발명의 기술적 사상을 용이하게 실시할 수 있을 것이다. 또한, 본 발명을 설명함에 있어서 본 발명과 관련된 공지 기술에 대한 구체적인 설명이 본 발명의 요지를 불필요하게 흐릴 수 있다고 판단되는 경우에 그 상세한 설명을 생략하기로 한다. 이하, 첨부된 도면을 참조하여 본 발명에 따른 바람직한 일실시예를 상세히 설명하기로 한다.The above objects, features and advantages will become more apparent from the following detailed description taken in conjunction with the accompanying drawings, whereby those skilled in the art may easily implement the technical idea of the present invention. There will be. In addition, in describing the present invention, when it is determined that the detailed description of the known technology related to the present invention may unnecessarily obscure the gist of the present invention, the detailed description thereof will be omitted. Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings.

본 발명의 특징은, 반향성이 있는 실제환경에서 수집된 혼합신호, 특히 음향 및 음성신호에 대하여 음원과 녹음환경에 대한 사전정보 없이 분리회로를 통해 고품질의 원신호를 분리하는 경우에 있어, 혼합신호에 대해서 추정한 분리계수의 모 호성, 예컨대 역혼합 행렬의 행벡터간[다시 말하면 역혼합 행렬의 가역인 혼합 행렬의 열벡터간]의 위치를 서로 조정하여 교환 모호성[순서 모호성]을 해소한다. 여기서, 주지 기술을 통해 부연 설명하면, 반향성이 있는 실제환경(예; 녹음 스튜디오와 같은 폐쇄된 곳 등)에서 수집된 신호는 주파수별로 긴 임펄스 응답을 가지며, 고품질의 신호는 음악 등과 같이 고품질의 샘플링이 요구되는 데이터를 말한다.A characteristic of the present invention is that in the case of separating the high quality original signal through the separation circuit without mixing information of the sound source and the recording environment, the mixed signal collected in the real environment with the echo, especially the sound and the audio signal, Exchange ambiguity (order ambiguity) is solved by adjusting the ambiguity of the separation coefficient estimated for the signal, for example, between the row vectors of the inverse mixing matrix (that is, between the column vectors of the mixing matrix which is the invertible mixture of the inverse mixing matrix). Here, the description through the well-known technology, the signal collected in a real echo environment (e.g. closed places such as recording studios) has a long impulse response for each frequency, high quality signals such as music Refers to data that requires sampling.

이를 위해, 본 발명에서는 분리계수 교환 모호성을 해소하기 위해 독립성분분석법(ICA; Independent Component Analysis method)을 사용하여 주파수 영역에서 분리계수가 갖는 내재적인 방향성을 추정하고서 이 방향성을 토대로 각 분리계수를 군집화(Clustering)시켜 각각의 원신호가 정렬되도록 한다.To this end, the present invention estimates the intrinsic directionality of separation coefficients in the frequency domain using an independent component analysis method (ICA) to resolve the separation coefficient exchange ambiguity and clusters each separation coefficient based on this direction. (Clustering) so that each original signal is aligned.

즉, 본 발명에서는 독립성분분석법을 사용하여 아무런 사전정보 없이도 분리계수에 상응되는 역혼합 행렬의 행벡터가 갖고 있는 내재적인 방향성을 추정하고서, 이 역혼합 행렬의 행벡터에 대해서 기존과 같이 정규화를 수행하지 않고서 군집화를 수행하는 것을 그 특징으로 한다. 특히, 본 발명은 독립성분분석법을 사용하여 선형적으로 비직교(non-orthogonal) 교차하면서 분포한 신호로부터도 그 방향성을 찾아낼 수 있다.In other words, the present invention estimates the inherent direction of the row vectors of the inverse mixing matrix corresponding to the separation coefficient without any prior information by using independent component analysis, and normalizes the row vectors of the inverse mixing matrix as before. It is characterized by performing clustering without. In particular, the present invention can find its direction even from a signal distributed linearly and non-orthogonal crossing using independent component analysis.

위와 같은 본 발명은 다중경로 혼합신호 분리 시스템에 있어 후처리 프로세스로 수행, 즉 콘볼루티브 블라인드 소오스 분리 문제(convolutive blind source separation problem)를 시간 영역에서 주파수 영역으로 전환하여 수행하는 과정 중 분리계수를 추정하는 프로세스와 이 분리계수를 혼합신호의 필터링에 사용하는 프로세스 사이에 수행된다.The present invention as described above is performed as a post-processing process in a multipath mixed signal separation system, that is, a separation coefficient during the process of performing a convolutive blind source separation problem from the time domain to the frequency domain. It is performed between the estimating process and the process of using this separation coefficient for filtering the mixed signal.

예컨대, 본 발명에 의해 교환 모호성이 해소된 분리계수(이하, "분리 필터계수 Ｗ"와 병행 표기하기로 함)를 혼합신호 필터링 파라미터로 사용함으로써, 원신호를 분리하는데 있어 각각의 원신호를 원래의 순서에 상응되게 정렬시켜 서로 다른 원신호간 섞이는 것을 방지해 분리 성능을 향상시킨다.For example, by using the separation coefficient (hereinafter, referred to in parallel with the "separation filter coefficient Ｗ") in which the exchange ambiguity is eliminated according to the present invention as a mixed signal filtering parameter, each original signal is separated from the original signal. By sorting in the order of, avoiding mixing between different original signals improves separation performance.

이하, 본 발명의 이해를 돕고자, 도 1에 도시된 일반적인 다중경로 혼합신호 분리 시스템을 참조하여 본 발명의 "분리계수 교환 모호성 해소 장치"에 대해 간략히 설명하기로 하고, 도 1을 설명하는데 있어 도 2를 함께 참조하여 "분리 계수 교환 모호성"에 대해 설명하기로 한다.Hereinafter, to facilitate the understanding of the present invention, the "separation coefficient exchange ambiguity resolution device" of the present invention will be briefly described with reference to the general multipath mixed signal separation system shown in FIG. With reference to FIG. 2, the "separation coefficient exchange ambiguity" will be described.

도 1은 본 발명이 적용되는 다중경로 혼합신호 분리 시스템에 대한 일실시예 구성도이다.1 is a diagram illustrating an embodiment of a multipath mixed signal separation system to which the present invention is applied.

본 발명이 적용되는 다중경로 혼합신호 분리 시스템은 신호 혼합부(100), 이산 푸리에 변환부(200), 분리계수 추정부(300), 역이산 푸리에 변환부(500) 및 신호 분리부(600)로 이루어지는데, 본 발명에서 제시하는 분리계수 교환 모호성 해소 장치(400)가 분리계수 추정부(300)에서 추정한 분리계수를 입력받아 교환 모호성을 해소하여 역이산 푸리에 변환부(500)로 출력한다.The multipath mixed signal separation system to which the present invention is applied includes a signal mixer 100, a discrete Fourier transform unit 200, a separation coefficient estimation unit 300, an inverse discrete Fourier transform unit 500, and a signal separator 600. The separation coefficient exchange ambiguity eliminating apparatus 400 according to the present invention receives the separation coefficient estimated by the separation coefficient estimating unit 300, solves the exchange ambiguity, and outputs it to the inverse discrete Fourier transform unit 500. .

도 1에 도시된 공지의 다중경로 혼합신호 분리 시스템의 구성 요소에 대해 간략히 살펴보면 다음과 같다.The components of the known multipath mixed signal separation system shown in FIG. 1 will be briefly described as follows.

상기 신호 혼합부(100)는 마이크로폰 등과 같은 센서를 통해 외부로부터 다중경로로 입력되는 원신호를 혼합하며, 상기 이산 푸리에 변환부(200)는 이 혼합신호를 이산 푸리에 변환하여 시간 영역을 주파수 영역으로 변환하며, 상기 분리계수 추정부(300)는 주파수 영역의 혼합신호에 대해 공지의 음원분리알고리즘의 학습 루틴을 사용해 분리계수를 최신화(업데이트)시키면서 분리계수를 추정하며, 상기 역이산 푸리에 변환부(500)는 이 주파수 영역의 분리계수를 역 이산 푸리에 변환하여 주파수 영역을 시간 영역으로 변환하며, 상기 신호 분리부(600)는 시간 영역의 분리계수를 사용하여 상기 혼합신호를 필터링하여 원신호를 분리한다.The signal mixer 100 mixes original signals input in the multipath from the outside through a sensor such as a microphone, and the discrete Fourier transform unit 200 converts the mixed signals into discrete Fourier transforms to convert the time domain into a frequency domain. The separation coefficient estimator 300 estimates the separation coefficient by updating (updating) the separation coefficient using a learning routine of a known sound source separation algorithm for the mixed signal in the frequency domain, and the inverse discrete Fourier transform unit. In operation 500, the discrete coefficients of the frequency domain are inverse discrete Fourier transform to convert the frequency domain to the time domain, and the signal separation unit 600 filters the mixed signal using the separation coefficient of the time domain to filter the original signal. Separate.

일반적으로, 공지의 다중경로 혼합신호 분리 시스템에 있어 주요한 구성 요소는 분리계수 추정부(300)라 할 수 있는데, 본 발명에서는 이러한 분리계수 추정부(300)에서 사용되는 공지의 음원분리알고리즘의 학습 루틴에 대해서는 어떠한 알고리즘[예; 상대 최적화(relative optimization), 경사법(gradient method), 신뢰영역법(trust-region method), 최소제곱근사법(least square estimation method), 상대 경사법(relative gradient method), 상대 신뢰영역법(relative trust-region method) 등]이 사용되어도 무방함을 밝혀 둔다.In general, a major component in a known multipath mixed signal separation system may be referred to as a separation coefficient estimator 300. In the present invention, learning of a known sound source separation algorithm used in such a separation coefficient estimator 300 is performed. Any algorithm [eg; Relative optimization, gradient method, trust-region method, least square estimation method, relative gradient method, relative trust domain method -region method, etc.] may be used.

한편, 본 발명의 분리계수 교환 모호성 해소 장치(400)를 설명하기에 앞서, 블라인드 음원 분리에 정의된 신호, 즉 주파수 영역별 순간적인(instantaneous) 선형 혼합 모델로서 모델링되는 실세계 음원 분리 문제를 해결하는데 사용되는 신호에 대해 소개하면 다음과 같다.On the other hand, prior to explaining the separation coefficient exchange ambiguity eliminating apparatus 400 of the present invention, to solve the real-world sound source separation problem modeled as a signal defined in the blind source separation, that is, instantaneous linear mixing model for each frequency domain Introducing the signals used is as follows.

Ｎ개의 원신호[소스 음원]

을 Ｍ개의 마이크로폰을 사용하여 녹음한 혼합신호를

이라고 정의한 상태에서, 시간 지연과 다중경로 효과를 고려하 면 ｉ번째 마이크로폰의 출력신호

를 다음의 [수학식 1]과 같이 나타낼 수 있다[모델링].N original signals [source sound source]

Mixed signals recorded using M microphones

Is defined as, the output signal of the i-th microphone, considering the time delay and the multipath effect

Can be expressed as Equation 1 below [modeling].

여기서,

는 ｊ번째 원신호와 i번째 마이크로폰간의 녹음환경 특성을 나타내는 룸 임펄스 응답(room impulse response)이고, P는 채널의 길이를 나타낸다.here,

Is a room impulse response representing the recording environment characteristic between the i-th original signal and the i-th microphone, and P is the length of the channel.

하기의 [수학식 2]를 가정한 상태에서, 상기 [수학식 1]을 [수학식 2]를 참조하여 정리하면 다음의 [수학식 3]과 같다.In the state in which Equation 2 is assumed below, Equation 1 can be summarized as Equation 3 below with reference to Equation 2.

여기서,

은 시간 이동 연산자이다.here,

Is a time shift operator.

위와 같은 [수학식 3]을 행렬 형태로 정리하면 다음의 [수학식 4]와 같으며, 도 1에 도시된 신호 혼합부(100)의 출력신호임을 알 수 있다.When the above Equation 3 is arranged in a matrix form, it is as shown in Equation 4 below, and it can be seen that it is an output signal of the signal mixer 100 shown in FIG. 1.

앞서 언급했던 바와 같이, 블라인드 신호 분리 문제의 해를 구하는 접근법은 상기 [수학식 4]로 표현되는 시간 영역의 혼합신호를 주파수 영역의 혼합신호로 변환하여 각 주파수 영역에서의 순간적인 혼합신호(instantaneous mixture) 분리 문제로 간주하여 해를 구하는 것이다.As mentioned above, an approach to solve the blind signal separation problem is to convert the mixed signal in the time domain represented by Equation 4 into the mixed signal in the frequency domain, thereby providing an instantaneous mixed signal in each frequency domain. mixture is a solution to the problem of separation.

즉, 상기 이산 푸리에 변환부(200)는 [수학식 4]와 같은 혼합신호를 이산 푸리에 변환하여 시간 영역을 주파수 영역으로 변환한다. 예컨대, 이산 푸리에 변환(DFT; Discrete Fourier Transform)의 프레임 크기 Ｔ가 채널의 길이 Ｐ보다 매우 크다면 선형 콘볼루션(linear convolution)은 다음의 [수학식 5]와 같이 순환 콘볼루션(circular convolution)으로 근사화된다. 다음의 [수학식 5]는 위 [수학식 4]를 주파수 영역으로 변환한 것이다.That is, the discrete Fourier transform unit 200 transforms the time domain into the frequency domain by performing discrete Fourier transform on the mixed signal as shown in [Equation 4]. For example, if the frame size T of the Discrete Fourier Transform (DFT) is much larger than the length P of the channel, the linear convolution is a circular convolution as shown in Equation 5 below. Approximated. Equation 5 below converts Equation 4 to the frequency domain.

또한, 위 [수학식 5]를 통해 언급한 주파수 영역을 기반으로 하여 특정 블리인드 신호 분리 알고리즘이 적용된 상기 분리계수 추정부(300)는 각 주파수 빈(frequency bin, 주; 특정 폭을 갖는 주파수 대역)마다 혼합 행렬

(mixing matrix)의 가역인 역혼합 행렬(demixing matrix)의 추정치, 즉 분리계수

를 도출한다. 이를 통해, 혼합 행렬

의 열벡터

는 역혼합 행렬

의 행벡터

와 서로 대응됨을 알 수 있다. 다음의 [수학식 6]은 혼합 행렬과 역혼합 행렬간의 관계를 나타낸다.In addition, the separation coefficient estimator 300 to which a specific blind signal separation algorithm is applied based on the frequency domain mentioned through Equation 5 above may be used for each frequency bin. Per matrix

estimate of the demixing matrix, the reversible of the mixing matrix

To derive This allows the mixing matrix

Column vector

Is the inverse mixing matrix

Row of

And correspond to each other. Equation 6 below shows a relationship between a mixing matrix and an inverse mixing matrix.

한편, 종래 기술을 통해 설명했던 바와 같이 분리계수 추정부(300)에서 추정한 [수학식 6]과 같은 역혼합 행렬

에는 크기 모호성(scaling ambiguities) 및 교환 모호성(permutation ambiguities)이 존재한다. 여기서, 역혼합 행렬의 크기 모호성은 주지 기술인 음원분리알고리즘으로도 쉽게 해소되는 바, 이하 본 발명 에서는 분리계수 크기 모호성에 대해서는 언급하지 않기로 한다.On the other hand, as described through the prior art inverse mixing matrix as shown in [Equation 6] estimated by the separation coefficient estimation unit 300

There are scaling ambiguities and permutation ambiguities. Here, the size ambiguity of the inverse mixing matrix is easily solved by a sound source separation algorithm, which is a well-known technique. Hereinafter, the separation coefficient size ambiguity will not be described.

즉, 다중경로를 통해 수집된 혼합신호

는 이산 푸리에 변환(또는 단시간 푸리에 변환; short time fourier transform 등)을 거쳐 주파수 빈별 혼합신호

를 생성한다. 이에, 분리계수 추정부(300)에서는 이 주파수 빈별 혼합신호를 입력받아 각 주파수 빈에 대응되는 역혼합 행렬의 추정치

를 도출한다.That is, mixed signals collected through multipath

Is a mixed signal by frequency bin through a discrete Fourier transform (or short time fourier transform, etc.)

Create Therefore, the separation coefficient estimator 300 receives the mixed signal for each frequency bin and estimates an inverse mixing matrix corresponding to each frequency bin.

To derive

그런데, 역혼합 행렬

의 행벡터

는 다음의 [수학식 7]과 같이 교환 행렬(permutation matrix)에 의해 모호성이 발생될 수 있다. 예컨대, "동일한 색인값 i를 갖는 두 주파수 영역의 행벡터

가 실제적으로 서로 다른 음원에 해당되는 역혼합 벡터일 수도 있다"라는 것이 분리계수 교환 모호성에 대한 정의이다.By the way, inverse mixing matrix

Row of

Ambiguity may be generated by a permutation matrix as shown in Equation 7 below. For example, "row vectors of two frequency domains with the same index value i.

May actually be an inverse mixing vector corresponding to different sound sources ".

여기서, Ｐ는 교환 행렬을 나타낸다.Here, P represents an exchange matrix.

이하, 도 1 및 전술한 바와 같이

는 교환 모호성이 해소되지 않은 분리계수를 의미하며[공지의 음원분리알고리즘으로 추정한 분리계수],

는 본 발명에 의해 교환 모호성이 해소된 분리계수를 의미하는 것으로 설명한다. 이와 같은 분리계수 교환 모호성에 대해서 도 2를 참조하여 좀 더 상세히 살펴보기로 한다.1 and as described above

Means the separation coefficient whose exchange ambiguity has not been resolved [separation coefficient estimated by the known sound source separation algorithm],

Explains that the separation coefficient solved the exchange ambiguity by the present invention. This separation coefficient exchange ambiguity will be described in more detail with reference to FIG. 2.

도 2는 반향성이 있는 실제환경에서 수집된 혼합신호로부터 원신호를 분리한 경우에 분리계수의 교환 모호성이 발생된 것을 보여주기 위한 일실시예 설명도이다.FIG. 2 is a diagram illustrating an embodiment in which an exchange ambiguity of a separation coefficient is generated when an original signal is separated from a mixed signal collected in an echoic real environment.

도 2에 도시된 바와 같이, 2개의 음원신호

가 2개의 센서를 통해 수집된 경우에, 2개의 혼합신호

는 각각 이산 푸리에 변환을 거쳐 L개의 주파수 빈별 혼합신호

와

로 변환된다.As shown in FIG. 2, two sound source signals

Is mixed through two sensors, two mixed signals

Is a mixed signal for each of L frequency bins through a discrete Fourier transform.

Wow

Is converted to.

이에, 음원분리알고리즘을 거쳐 L회의 반복을 통해 주파수 빈별로 순간적인 음원 분리 문제의 해를 구하게 되며, 그 결과로서 주파수 빈별로 2개씩의 주파수 영역에서의 음원신호 추정치인 분리계수를 생성한다.Therefore, through the L source repetition algorithm through the L source repetition algorithm, the instantaneous sound source separation problem is solved for each frequency bin. As a result, a separation coefficient which is an estimate of the sound source signal in two frequency domains is generated.

그러나, 최종 결과물인 주파수 영역에서의 음원신호는 어떠한 순서로 생성될지 알 수 없기 때문에, 이러한 결과물을 단순하게 통합하여 하나의 음원신호를 추정하게 된다면 다른 음원신호의 특정 주파수 영역의 신호가 서로 섞이는 경우가 발생되는 것이다.However, since it is impossible to know in what order the sound source signals in the frequency domain, which are the final products, will be generated, if one sound source signal is estimated by simply integrating these results, signals in specific frequency domains of different sound source signals may be mixed with each other. Will be generated.

따라서, 본 발명에서는 공지의 분리계수 추정부(300)에서 추정한 역혼합 행렬

의 교환 모호성을 해소, 즉 역혼합 행렬을 이루는 행벡터[다시 말하면 역혼합 행렬의 가역인 혼합 행렬을 이루는 열벡터]간의 위치를 서로 조정하여 분리계수 교환 모호성을 해소한다. 이러한 본 발명에서 제시하는 분리계수 교환 모호성 해소 매커니즘에 대해서는 도 3, 도 4, 도 5, 도 6a 및 도 6b를 참조하여 보다 상세히 후술하기로 한다.Therefore, in the present invention, the inverse mixing matrix estimated by the known separation coefficient estimator 300 is known.

The separation ambiguity is solved by adjusting the position between the row vectors constituting the inverse mixing matrix (that is, the column vectors constituting the mixing matrix that is the invertible mixture of the inverse mixing matrix). The separation coefficient exchange ambiguity-resolving mechanism proposed in the present invention will be described later in more detail with reference to FIGS. 3, 4, 5, 6A, and 6B.

위와 같이 본 발명에 의해 교환 모호성이 해소된 역혼합 행렬의 추정치

를 공지의 역이산 푸리에 변환부(500) 및 공지의 신호 분리부(600)에서 사용하여 다음의 [수학식 8]과 같이 혼합신호로부터 원신호

를 분리할 수 있다.Estimation of Inverse Mixing Matrix with Exchange Ambiguity Eliminated by the Present Invention

Is used in the known inverse discrete Fourier transform unit 500 and the known signal separation unit 600, the original signal from the mixed signal as shown in Equation 8 below.

Can be separated.

도 3은 본 발명에 따른 분리계수 교환 모호성 해소 장치에 대한 일실시예 구성도이고, 도 4는 본 발명에 따른 분리계수 교환 모호성 해소 방법에 대한 일실시예 순서도이다. 이하, 본 발명에 대한 설명의 편의를 도모하고자 도 3 및 도 4를 함께 참조하여 설명하기로 한다.Figure 3 is a block diagram of an embodiment of the separation coefficient exchange ambiguity solving apparatus according to the present invention, Figure 4 is a flow chart of an embodiment of the separation coefficient exchange ambiguity solving method according to the present invention. Hereinafter, the present invention will be described with reference to FIGS. 3 and 4 for the convenience of description.

본 발명에 따른 분리계수 교환 모호성 해소 장치는 행렬 생성부(41), 독립성분분석 수행부(42), 기초벡터 색인값 선정부(43) 및 행렬 군집화 결정부(44)로 이루어진다.The separation coefficient exchange ambiguity elimination apparatus according to the present invention includes a matrix generator 41, an independent component analyzer 42, a base vector index value selector 43, and a matrix clustering determiner 44.

상기 행렬 생성부(41)는 분리계수 추정부(300)에서 추정한 분리계수, 예컨대 교환 모호성이 발생된 모든 주파수 영역에서의 역혼합 행렬의 행벡터

로 독립성분분석법의 입력 파라미터로 사용될 행렬 X를 생성한다(401). 여기서, 행렬 X는 역혼합 행렬의 행벡터

에 대한 전치 행렬(transpose matrix)로 구성된다.The matrix generator 41 is a row vector of an inverse mixing matrix in all frequency domains in which the separation coefficient estimated by the separation coefficient estimator 300, for example, the exchange ambiguity is generated.

In operation 401, a matrix X to be used as an input parameter of the independent component analysis method is generated. Where matrix X is the row vector of the inverse mixing matrix

It consists of a transpose matrix for.

상기 독립성분분석 수행부(42)는 행렬 생성부(41)로부터 입력받은 행렬 X에 대해 독립성분분석 알고리즘을 사용하여 기초벡터(basis vector)로 이루어진 행렬 Ａ와 편향계수로 이루어진 행렬 Ｓ를 추출한다(402). 여기서, 본 발명에서 사용하는 독립성분분석 알고리즘 수행 결과에 대해서는 도 5를 참조하여 후술하기로 한다.The independent component analyzing unit 42 extracts the matrix A of the basis vector and the matrix S of the deflection coefficient from the matrix X received from the matrix generator 41 using an independent component analysis algorithm. (402). Here, the results of performing the independent component analysis algorithm used in the present invention will be described later with reference to FIG. 5.

앞서 언급한 행렬 생성부(41)에서 생성한 행렬 X와, 독립성분분석 수행부(42)에서 추출한 기초벡터 행렬 Ａ 및 편향계수 행렬 Ｓ에 대한 수학식을 통해 "401" 과정 및 "402" 과정에 대해 보다 상세히 설명하면 다음과 같다.Processes "401" and "402" through the equations for the matrix X generated by the aforementioned matrix generator 41 and the base vector matrix A and the deflection coefficient matrix S extracted by the independent component analyzer 42. When described in more detail as follows.

하기의 [수학식 9]는 행렬 X와 기초벡터 행렬 Ａ 및 편향계수 행렬 Ｓ에 대한 관계를 나타낸다.Equation 9 below shows the relationship between the matrix X, the base vector matrix A, and the deflection coefficient matrix S.

여기서, 행렬 X는 역혼합 행렬의 모든 행벡터들의 전치로 이루어진 행렬

이며, 독립성분분석법이 사용되어져 추출된 기초벡터

는 선형적으로 분포한 열벡터의 방향성을 나타내고, 편향계수 S의 요소들은 주어진 열벡터의 포인트가 해당되는 기초벡터에 어느 정도로 편향되어 있는지를 나타내는 계수이다.Where matrix X is the transpose of all the row vectors of the inverse mixture matrix

Basic vector extracted by independent component analysis

Denotes the direction of linearly distributed column vectors, and the elements of the deflection coefficient S are coefficients indicating how much a point of a given column vector is deflected to the corresponding base vector.

상기 기초벡터 색인값 선정부(43)는 독립성분분석 수행부(42)에서 추출한 기초벡터 행렬 A 및 편향계수 행렬 S를 파라미터로 하여 모든 주파수 영역에서의 역혼합 행렬의 행벡터

에 대응되는, 예컨대 역혼합 행렬의 행벡터의 전치에 해당되는 혼합 행렬의 열벡터

에 대해 편향계수값이 가장 큰 기초벡터 색인값 j를 선정한다(403).The base vector index value selecting unit 43 uses the base vector matrix A and the deflection coefficient matrix S extracted by the independent component analyzing unit 42 as parameters, and the row vectors of the inverse mixing matrix in all frequency domains.

Column vectors of the mixing matrix that correspond to, for example, transpose of the row vectors of the inverse mixing matrix.

The base vector index value j having the largest deflection coefficient value is selected with respect to (403).

즉, 이때에 혼합 행렬의 열벡터

가 실제로 어떠한 색인값을 가지는 열로 교환되어야 되는지 알기 위해서

의 해당 계수값들을 조사한다. 이는, 해당 편향계수 행렬 Ｓ의 열벡터

에 대해

의 각 요소들

중 가장 큰 절대값을 가지는 계수값에 대해서 해당 색인값을 추출하는 것으로서, 계수값이 더 큰 기초벡터에 편향되어 있다고 판정하여 이를 기반으로 군집화를 수행한다. 다시 말하면, 여기서 선정된 기초벡터 색인값 j는 이후의 행렬 군집화 결정 과정에서 열벡터

간의 위치 조정 파라미터로 사용된다.That is, the column vector of the mixing matrix

To know which index value should actually be exchanged for

Examine the corresponding coefficients for. This is the column vector of the deflection coefficient matrix S

About

Elements of

The index value is extracted with respect to the coefficient value having the largest absolute value, and it is determined that the coefficient value is biased to the larger base vector, and clustering is performed based on the index value. In other words, the base vector index value j selected here is a column vector in a subsequent matrix clustering decision process.

Used as a parameter for adjusting the position of the liver.

예를 들어, 혼합 행렬의 열벡터

의 해당 계수값이

와 같은 부등식을 만족하는 경우에, 상기

는 첫번째[1st] 기초벡터에 편향되어 있다고 판정할 수 있다.For example, the column vector of a mixing matrix

The corresponding count of

If the following inequality is satisfied,

Can be determined to be biased against the first [1st] basis vector.

상기 행렬 군집화 결정부(44)는 기초벡터 색인값 선정부(43)에서 선정한 기초벡터 색인값 j를 파라미터로 하여 혼합행렬의 열벡터

가 속하는 해당 군집을 결정한다(404).The matrix clustering determining unit 44 uses the base vector index value j selected by the base vector index value selecting unit 43 as a parameter to obtain a column vector of the mixed matrix.

Determine the corresponding cluster to which it belongs (404).

앞서 기초벡터 색인값 선정 과정을 통해 설명했듯이 기초벡터 색인값 j는 열벡터

가 행렬 내에서 실제로 위치해야 되는 자리를 나타내는 색인값이며, 이에 행렬 군집화 결정 과정에서 열벡터

가 행렬 내에서 다른 열벡터간에 서로 교환되어 있다면 기초벡터 색인값 j를 토대로 원래의 자리로 서로 위치 조정을 해 주어 교환 모호성을 해소시킨다.As explained earlier through the process of selecting the base vector index value, the base vector index value j is a column vector.

Is an index value that represents the position of the actual position in the matrix, and thus the column vector during the matrix clustering decision.

If is exchanged between different column vectors in a matrix, the positional ambiguity is eliminated by adjusting the position of each other based on the base vector index value j.

위와 같은 "403" 과정 및 "404" 과정을 행렬 X의 모든 열벡터

[i>L*N]에 대해서 수행하여 교환 모호성이 해소된

를 출력한다.All column vectors in matrix X for "403" and "404"

exchange ambiguity is resolved by performing [i> L * N].

Outputs

도 5는 본 발명에서 독립성분분석법을 사용하여 도출한 주파수 응답의 열벡터 분포의 방향성을 보여주기 위한 일실시예 그래프이다.FIG. 5 is an exemplary graph for showing the direction of a column vector distribution of a frequency response derived using the independent component analysis method of the present invention.

도 5에 도시된 "□" 모양 및 "○" 모양은 반향성이 있는 녹음 스튜디오에서 녹음된 음원, 예컨대 마이크로폰 등과 같은 센서로 수집한 음원에 대해 독립성분분석법을 수행한 결과인 독립성분벡터, 즉 주파수 응답의 열벡터

를 나타낸다[N=2인 경우].5 and 2 are independent component vectors which are the result of performing independent component analysis on a sound source collected by a reverberant recording studio, such as a microphone. Column vector of frequency response

[In the case of N = 2].

도 5를 통해 알 수 있듯이, 주파수 응답의 열벡터들은 교차하는 선형 분포를 이루고 있으며, 이러한 각 주파수 응답의 열벡터에 대한 선형 분포의 방향성은 센서와 음원간의 기하학적인 정보에 기반한 것이다. 부가적으로, 이러한 주파수 응답의 열벡터는 센서와 음원간의 기하학적인 분포 정보뿐만 아니라 주파수에 의한 변화량에 따라서도 선형적인 분포를 이룬다. 이처럼, 본 발명에서는 녹음환경에 대한 사전정보 없이도 독립성분분석법을 사용하여 데이터, 즉 주파수 응답의 열벡터에 대한 내재적인 방향성, 특히 선형적으로 비직교 교차하면서 분포하는 열벡터에 대한 방향성을 찾아내며, 이러한 열벡터의 선형적인 분포 방향성은 열벡터 군집화 파라미터로 사용된다.As can be seen from FIG. 5, the column vectors of the frequency response form an intersecting linear distribution, and the direction of the linear distribution of the column vectors of each frequency response is based on geometric information between the sensor and the sound source. In addition, the column vector of the frequency response is linearly distributed not only by the geometric distribution information between the sensor and the sound source but also by the amount of change by frequency. As such, the present invention finds the intrinsic directionality of the data, ie, the column vector of the frequency response, in particular the direction of the column vector distributed linearly and non-orthogonally, using independent component analysis without prior information on the recording environment. The linear distribution directionality of these column vectors is used as the column vector clustering parameter.

도 6a는 분리계수 교환 모호성이 발생된 주파수 응답의 열벡터 분포를 나타내는 그래프로서, 주파수 응답간이 서로 교환되어져 동일한 색인값을 갖는 주파수 응답의 열벡터들이 하나의 선형 분포를 이루지 않고서 두 개로 교차하는 선형 분포 모두에 속하고 있다.FIG. 6A is a graph showing a column vector distribution of a frequency response in which a separation coefficient exchange ambiguity is generated, in which column vectors of frequency responses having the same index value intersect with each other without forming a linear distribution are exchanged between frequency responses. It belongs to all distributions.

도 6b는 본 발명에 의해 분리계수 교환 모호성이 해소된 주파수 응답의 열벡터 분포를 나타내는 그래프로서, 주파수 응답의 열벡터간의 위치를 서로 조정하여 각 주파수 응답의 열벡터들이 하나의 선형 분포에 속하고 있다.FIG. 6B is a graph showing a column vector distribution of a frequency response in which the separation coefficient exchange ambiguity is solved according to the present invention, and the column vectors of each frequency response belong to one linear distribution by adjusting positions of the column vectors of the frequency response. have.

상술한 바와 같은 본 발명의 방법은 프로그램으로 구현되어 컴퓨터로 읽을 수 있는 형태로 기록매체(씨디롬, 롬, 램, 플로피 디스크, 하드 디스크, 광자기 디스크 등)에 저장될 수 있다. 이러한 과정은 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자가 용이하게 실시할 수 있으므로 더 이상 상세히 설명하지 않기로 한다.As described above, the method of the present invention may be implemented as a program and stored in a recording medium (CD-ROM, ROM, RAM, floppy disk, hard disk, magneto-optical disk, etc.) in a computer-readable form. Since this process can be easily implemented by those skilled in the art will not be described in more detail.

이상에서 설명한 본 발명은, 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자에게 있어 본 발명의 기술적 사상을 벗어나지 않는 범위 내에서 여러 가지 치환, 변형 및 변경이 가능하므로 전술한 실시예 및 첨부된 도면에 의해 한정되는 것이 아니다.The present invention described above is capable of various substitutions, modifications, and changes without departing from the technical spirit of the present invention for those skilled in the art to which the present invention pertains. It is not limited by the drawings.

상기와 같은 본 발명은 블라인드 신호 분리에 있어 분리 성능의 병목점이 되는 분리계수 교환 모호성을 해소하여 서로 다른 음원신호간에 섞이는 것을 방지함으로써, 혼합신호로부터 원신호를 정확하게 분리할 수 있는 효과가 있다.The present invention as described above has the effect of accurately separating the original signal from the mixed signal by eliminating the separation coefficient exchange ambiguity, which is a bottleneck of separation performance in blind signal separation, to prevent mixing between different sound source signals.

또한, 본 발명은 폐쇄된 녹음환경 특성 상 반향성으로 인해 긴 임펄스 응답을 갖는 경우, 음악 등과 같이 고품질의 샘플링 비율이 요구되는 신호가 사용되는 경우, 실제환경에서 수집된 혼합신호에 대해 음원과 녹음환경에 대한 사전정보가 없는 경우 등에 있어 분리 성능 저하없이 제대로 신호 분리를 안정적으로 수행할 수 있도록 하는 효과가 있다.In addition, the present invention is a sound source and recording for the mixed signal collected in the real environment when a signal that requires a high quality sampling rate, such as music, when a long impulse response due to the reverberation characteristics of the closed recording environment is used In the case where there is no prior information on the environment, there is an effect that the signal separation can be performed stably without degrading performance.

Claims

In the method of solving the exchange ambiguity of the multipath mixed signal separation coefficient,

A matrix generation step of generating an independent component analysis input matrix based on a row vector of an inverse mixing matrix corresponding to the separation coefficient according to the input of the separation coefficient estimated for the mixed signal;

An independent component analysis step of extracting a base vector matrix and a deflection coefficient matrix from the generated matrix using an independent component analysis method;

A base vector index value selection step of comparing the extracted base vector matrix and the deflection coefficient matrix and selecting a base vector index value of a column vector of a mixed matrix corresponding to a transpose of the inverse mixed matrix row vector; And

A matrix clustering determination step of determining a corresponding cluster to which the column vector of the mixed matrix belongs based on the selected base vector index value.

Method of solving the exchange ambiguity of the multipath mixed signal separation coefficient comprising a.

The method of claim 1,

The independent component analysis input matrix,

12. A method for solving the exchange ambiguity of a multipath mixed signal separation coefficient, characterized in that it consists of a transpose matrix for the row vectors of the inverse mixing matrix corresponding to the estimated separation coefficient.

The method of claim 1,

The independent component analysis input matrix, the base vector matrix, and the deflection coefficient matrix are

A method for solving the exchange ambiguity of a multipath mixed signal separation coefficient, characterized by having a relationship as in the following equation.

[Equation]

Where X is the independent component analysis input matrix, A is the base vector matrix, and S is the deflection coefficient matrix.

The method of claim 3, wherein

The base vector matrix A is

A method for solving the exchange ambiguity of a multipath mixed signal separation coefficient, characterized by the directionality of the column vector distribution with respect to the linearly distributed frequency response.

The method of claim 3, wherein

The deflection coefficient matrix S is

And how the elements indicate how much the point of the column vector for the frequency response is biased in the corresponding base vector.

The method of claim 1,

The row vector of the inverse mixing matrix is

A method for solving the exchange ambiguity of a multipath mixed signal separation coefficient, characterized by being distributed linearly and non-orthogonally in the frequency domain.

The method of claim 1,

The row vector of the inverse mixing matrix is

A method for solving the exchange ambiguity of a multipath mixed signal separation coefficient, characterized in that it has a corresponding linear distribution direction in the frequency domain according to the geometric arrangement between the signal collection sensor and the original signal.

The method according to any one of claims 1 to 7,

The basic vector index value selection step,

The exchange ambiguity of multipath mixed signal separation coefficients is selected by selecting a base vector index value corresponding to a coefficient value having the largest absolute value among the elements of the deflection coefficient matrix column vector with respect to the column vector of the mixed matrix. How to solve.

The method according to any one of claims 1 to 7,

The matrix clustering determination step,

And resolving the interchange ambiguity of the multipath mixed signal separation coefficient according to the selected base vector index value.

The method of claim 9,

Separating the original signal by filtering the mixed signal using a separation coefficient corresponding to the column vector of the mixing matrix determined in the matrix clustering determination step

Method of eliminating the exchange ambiguity of the multipath mixed signal separation coefficient further comprising.

A device for solving the exchange ambiguity of a multipath mixed signal separation coefficient,

A matrix generator for generating an independent component analysis input matrix as a row vector of an inverse mixing matrix corresponding to the separation coefficient estimated from the mixed signal;

An independent component analysis unit for extracting a base vector matrix and a deflection coefficient matrix from the matrix generated by the matrix generator using an independent component analysis method;

A base vector index value selection unit for comparing a base vector matrix and a deflection coefficient matrix extracted by the independent component analysis unit and selecting a base vector index value of a column vector of a mixed matrix corresponding to a transpose of the inverse mixed matrix row vector; And

A matrix clustering determining unit for determining a corresponding cluster to which the column vector of the mixed matrix belongs based on the base vector index value selected by the base vector index value selecting unit.

Apparatus for solving the exchange ambiguity of the multipath mixed signal separation coefficient comprising a.

The method of claim 11,

Signal separation unit for separating the original signal by filtering the mixed signal using the separation coefficient corresponding to the column vector of the mixing matrix determined by the matrix clustering determination unit

Apparatus for solving the exchange ambiguity of the multipath mixed signal separation coefficient further comprising.