WO2014129878A1 - Apparatus and method for solving scale problem in blind signal extraction of frequency domain - Google Patents

Apparatus and method for solving scale problem in blind signal extraction of frequency domain Download PDF

Info

Publication number
WO2014129878A1
WO2014129878A1 PCT/KR2014/001536 KR2014001536W WO2014129878A1 WO 2014129878 A1 WO2014129878 A1 WO 2014129878A1 KR 2014001536 W KR2014001536 W KR 2014001536W WO 2014129878 A1 WO2014129878 A1 WO 2014129878A1
Authority
WO
WIPO (PCT)
Prior art keywords
frequency
frequency component
resolution
added
calculated
Prior art date
Application number
PCT/KR2014/001536
Other languages
French (fr)
Inventor
Soo Young Lee
Choong Hwan Choi
Byeong Yeol Kim
Original Assignee
Korea Advanced Institute Of Science And Technology
Sony Computer Entertainment America Llc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Korea Advanced Institute Of Science And Technology, Sony Computer Entertainment America Llc filed Critical Korea Advanced Institute Of Science And Technology
Publication of WO2014129878A1 publication Critical patent/WO2014129878A1/en

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272Voice signal separating

Definitions

  • the present invention relates to an apparatus and a method for solving a scale problem in blind signal extraction, and more particularly to an apparatus and a method for solving a scale problem in blind signal extraction of a frequency domain, which is capable of solving the scale problem occurring when the blind signal extraction of the frequency domain is performed.
  • the BSS is a technology of estimating an original signal source in the mixed signal.
  • FIG. 1 is a view illustrating a concept of the BSS.
  • Equation (1) As a modeling of the BSS, assuming that N unknown signal sources which are statistically and mutually independent, and of which an average is zero are received via M sensors (microphones) through mixed independent paths respectively, a relation formula expressed by Equation (1) is established between a microphone signal x and an original signals.
  • Equation (1)
  • Equation 1 h jk (l) means a transfer function from a k th original signal to a j th microphone.
  • a value of M may be used as N.
  • the purpose of the BSS is to separate the h jk (l) and an s k from only a signal x j which is a mixed signal, in an unknown state, as expressed by Equation (2).
  • Equation (2)
  • Equations (1) and (2) are formulas for processing data in a time domain, in which L indicates a magnitude of the filter. As the magnitude of the filter increases, an amount of calculation significantly increases. Accordingly, in order to reduce the amount of the calculation, the data in the time domain need to be converted in a signal in a frequency domain. At this time, a frequency domain BSS is performed by using a Short-Time Fourier Transform (STFT), and is expressed by Equation (3).
  • STFT Short-Time Fourier Transform
  • the STFT is a scheme in which data x j enough to correspond to a window size with a length of L is loaded and Fourier-transformed, and is repeatedly performed by shifting the data by a little.
  • a two dimensional data with information on a transverse time frame and a longitudinal frequency bin is made, and thus, in the Equations (1) and (2), convolution in the time domain is transformed into multiplication in the frequency domain. The amount of calculation can be significantly reduced. Therefore, a relation of the microphone signal and an output signal can be expressed by Equation (4).
  • Equation (4)
  • the present invention has been made to solve the above-mentioned problem in the conventional art, and an aspect of the present invention is to provide an apparatus and a method for solving a scale problem in blind signal extraction of a frequency domain, which are capable of solving the scale problem occurring when the blind signal extraction of the frequency domain is performed.
  • an apparatus for solving a scale problem in blind signal extraction includes: a frequency component adding unit for adding a (2n-1) th frequency component and a 2n th frequency component which are different frequency output signals and adjacent to each other; a frequency component re-adding unit for re-adding a (2m-1) th frequency component and a 2m th frequency component for the frequency component added by the frequency component adding unit: and a blind signal extracting unit for performing blind signal extraction, based on each of an original output signal, a signal added by the frequency element adding unit, and a signal re-added by the frequency component re-adding unit, wherein m and n are natural numbers.
  • the scale problem is solved based on a coefficient enabling a Euclidean distance to be minimum.
  • the blind signal extracting unit may perform the blind signal extraction from two different Short-Time Fourier Transform (STFT) data, based on a lower frequency resolution in which a relative size of the frequency resolution is small.
  • STFT Short-Time Fourier Transform
  • the frequency component adding unit repeatedly performs the adding until a signal of the added frequency resolution becomes one.
  • the blind signal extraction unit pairs the extracted signals in sequence from the lowest frequency resolution to the highest frequency resolution.
  • the values of all frequencies in various frequency resolutions are calculated by using a Euclidean distance and are averaged, and the frequency resolution which has the lowest average is set as an optimal frequency resolution.
  • a value E1 for an optimal solution calculated in all frequencies for various frequency resolutions may be calculated by using the Euclidean distance
  • a value E2 for a secondary optimal solution is calculated by using the Euclidean distance
  • the frequency resolution having a largest difference of the calculated values (E2-E1) may be set as an optimal value.
  • an apparatus for solving a scale problem in blind signal extraction includes: a first frequency component adding unit for adding a (2n-1) th frequency component and a 2n th frequency component which are different frequency output signals and adjacent to each other; a second frequency component adding unit for adding a 2m th frequency component and a (2m+1) th frequency component which are different frequency output signals and adjacent to each other; and a blind signal extracting unit for performing blind signal extraction, based on each of an original output signal, a signal added by the first frequency component adding unit, and a signal added by the second frequency component adding unit, wherein m and n are natural numbers.
  • the scale problem is solved based on a coefficient enabling a Euclidean distance to be minimum.
  • the blind signal extracting unit may perform the blind signal extraction from two different Short-Time Fourier Transform (STFT) data, based on a lower frequency resolution in which a relative size of the frequency resolution is small.
  • STFT Short-Time Fourier Transform
  • the extracted signals are arranged such that they overlap and intersect with each other.
  • the two frequency components are preferably to be normalized.
  • the values of all frequencies in various frequency resolutions are calculated by using a Euclidean distance and are averaged, and the frequency resolution which has the lowest average is set as an optimal frequency resolution.
  • a value E1 for an optimal solution calculated in all frequencies for various frequency resolutions may be calculated by using the Euclidean distance
  • a value E2 for a secondary optimal solution is calculated by using the Euclidean distance
  • the frequency resolution having a largest difference of the calculated values (E2-E1) may be set as an optimal value.
  • a method of solving a scale problem in blind signal extraction includes: adding a (2n-1) th frequency component and a 2n th frequency component which are different frequency output signals and adjacent to each other; re-adding a (2m-1) th frequency component and a 2m th frequency component for the frequency component added in the adding of the frequency component: performing blind signal extraction, based on each of an original output signal, a signal added by the frequency component adding unit, and a signal re-added by the frequency component re-adding unit; and pairing the extracted signals in sequence from a lowest frequency resolution to a highest frequency resolution, wherein m and n are natural numbers.
  • the scale problem is solved based on a coefficient enabling a Euclidean distance to be minimum.
  • the blind signal extraction from two different Short-Time Fourier Transform (STFT) data is performed, based on a lower frequency resolution in which a relative magnitude of the frequency resolution is small.
  • STFT Short-Time Fourier Transform
  • the adding of the frequency component is repeatedly performed until a signal of the added frequency resolution becomes one.
  • the values of all frequencies in various frequency resolutions are calculated by using a Euclidean distance and are averaged, and the frequency resolution which has the lowest average is set as an optimal frequency resolution.
  • a value E1 for an optimal solution calculated in all frequencies for various frequency resolutions may be calculated by using the Euclidean distance
  • a value E2 for a secondary optimal solution is calculated by using the Euclidean distance
  • the frequency resolution having a largest difference of the calculated values (E2-E1) may be set as an optimal value.
  • a method of solving a scale problem in blind signal extraction includes: first-adding a (2n-1) th frequency component and a 2n th frequency component which are different frequency output signals and adjacent to each other; second-adding a 2m th frequency component and a (2m+1) th frequency component which are different frequency output signals and adjacent to each other; performing blind signal extraction, based on each of an original output signal, a signal added in the first frequency component adding, and a signal added in the second frequency component adding, and arranging the extracted signals such that they overlap and intersect with each other, wherein m and n are natural numbers.
  • the scale problem is solved based on a coefficient enabling a Euclidean distance to be minimum.
  • the blind signal extraction from two different Short-Time Fourier Transform (STFT) data is performed, based on a lowerfrequency resolution in which a relative magnitude of the frequency resolution is small.
  • STFT Short-Time Fourier Transform
  • the two frequency components are preferably to be normalized.
  • the values of all frequencies in various frequency resolutions are calculated by using a Euclidean distance and are averaged, and the frequency resolution which has the lowest average is set as an optimal frequency resolution.
  • a value E1 for an optimal solution calculated in all frequencies for various frequency resolutions may be calculated by using the Euclidean distance
  • a value E2 for a secondary optimal solution is calculated by using the Euclidean distance
  • the frequency resolution having a largest difference of the calculated values (E2-E1) may be set as an optimal value.
  • the scale problem when the blind signal extraction of the frequency domain is performed, the scale problem can be solved because only one signal for different frequencies is extracted.
  • FIG. 1 is a view illustrating a concept of blind signal separation
  • FIG. 2 is a block diagram schematically illustrating a configuration of an apparatus for solving a scale problem in blind signal extraction of a frequency domain according to an embodiment of the present invention
  • FIG. 3 is a view illustrating a relation of data with different resolutions according to the embodiment of the present invention.
  • FIG. 4 is a view illustrating a hierarchical approach to solve the scale problem according to the embodiment of the present invention.
  • FIG. 5 is a block diagram schematically illustrating a configuration of an apparatus for solving a scale problem in blind signal extraction of a frequency domain according to another embodiment of the present invention
  • FIG. 6 is a view illustrating an example of an alternative approach to solve the scale problem according to the embodiment of the present invention.
  • FIG. 7 is a view illustrating another example of the alternative approach to solve the scale problem according to the embodiment of the present invention.
  • FIG. 8 is a flowchart illustrating a process of solving the scale problem in the blind signal extraction of the frequency domain according to an embodiment of the present invention.
  • FIG. 9 is a flowchart illustrating a process of solving the scale problem in the blind signal extraction of the frequency domain according to another embodiment of the present invention.
  • any structural element ‘is connected to’, ‘is joined to’, or ‘contacts’ another structural element it should be understood that the structural element is directly connected to or joined to another structural element, but a third structural element may be connected to, joined to, or contact them.
  • FIG. 2 is a block diagram schematically illustrating a configuration of an apparatus for solving a scale problem in blind signal extraction of a frequency domain according to an embodiment of the present invention.
  • the apparatus 100 for solving the scale problem in the blind signal extraction includes a frequency component adding unit 110, a frequency component re-adding unit 120, and a blind signal extracting unit 130.
  • the frequency component adding unit 110 adds a (2n-1) th frequency component and a 2n th frequency component which are different frequency output signals and adjacent to each other.
  • the frequency component adding unit 110 repeatedly performs the adding until a resolution signal of the added frequency becomes one (1).
  • the frequency component re-adding unit 120 re-adds a (2m-1) th frequency component and a 2m th frequency component for the frequency component added by the frequency component adding unit 110.
  • the two frequency components can benormalized.
  • the equalization of the two frequency components is not limited thereto, and may be unconditionally performed regardless of the scale.
  • the normalization of the two frequency components is not limited with relation to normalization of a specific factor such as power, magnitude, and the like, and may be performed relative to various factors.
  • the blind signal extracting unit 130 performs the blind signal extraction based on each of an original output signal, the signal added by the frequency component adding unit 110, and the signal re-added by the frequency component re-adding unit 120.
  • the blind signal extracting unit 130 may perform the blind signal extraction from two different Short-Time Fourier Transform (STFT) data, based on a low frequency resolution in which a relative size of the frequency resolution is small. Further, the blind signal extraction unit 130 pairs the extracted signals in sequence from the lowest frequency resolution to the highest frequency resolution.
  • STFT Short-Time Fourier Transform
  • FIG. 3 is a view illustrating a relation of data with different resolutions according to the embodiment of the present invention.
  • the data with the low frequency resolution is identical to a sum of two neighboring values of data with the high frequency resolution.
  • x_bar(i, i+1) may be a sum of two signals x(i) and x(i+1) which are data having the signal information of 100 ⁇ 300Hz and the low frequency resolution.
  • Equations can be expanded as described above.
  • the high frequency resolution and the low frequency resolution are determined by a relative magnitude of the frequency resolution of two different STFT data to be compared.
  • the data with the low frequency resolution may be utilized in order to solve the scale problem of the data with the high frequency resolution. That is, in order to solve the scale problem, basically, different frequency data are required as expressed by Equation 5.
  • Equation 5 is a formula related to the frequency resolution.
  • Equation (5) for convenience, a character i denotes a frequency instead of a character f, and information on a time frame is omitted. Further, a signal of a low frequency domain merging two frequency domains can be expressed by Equation (6).
  • Equation (6)
  • Equation (6) the importance is that blind signal separation is performed in order to predict different demixing filters with respect to three data x(i), x(i+1) and x_bar (i, i+1).
  • Equation 7 a distance measure indicated by Equation 7 is proposed.
  • Equation (7) is a quadratic equation
  • the Equation (7) is differentiated by coefficients a and b so as to obtain values of a and b and values of remaining y are loaded from the data, thereby calculating a value of E corresponding to a, b and y.
  • Equation (8) can be expressed as Equation (8).
  • Equation (8)
  • Equation (8) m is a sample index, and i is an index of a frequency component.
  • Equation (8) after two neighboring components of a high frequency resolution are multiplied by values of two complex numbers a and b, a Euclidean distance of the obtained value and a component of a low frequency resolution is expressed by a formula.
  • the Equation (8) can be differentiated for a n and b n as indicated by Equation (9), in order to seek a and b so that the Euclidean distance E is minimum.
  • Equation (10) n of a n and b n may be ignored because it corresponds to a number of an output signal. a n and b n such that the differential equation becomes zero (0) can be obtained by using the Equation (10).
  • Equation (10) * means a complex conjugate.
  • the scale problem between the two components of the high frequency resolution can be solved by using the obtained a n and b n .
  • y when y is estimated as a unit variance of a signal S, y may be expressed by Equation (11).
  • Equation (12) since it is predicted that following Equation (12) can be derived from Equation (8),
  • an and bn can be calculated by Equation (13) from these relative Equations, and the scale problem can be solved by appropriately using an and bn.
  • the scale problem for two different frequency output signals in the high frequency resolution signal can be solved.
  • the terminal aim of the present invention is to solve the scale problem for all frequencies, and the solution of the scale problem may be generally achieved in two manners.
  • the first manner is a hierarchical approach, and the other is an alternative approach.
  • a signal of a lower frequency resolution is structurally and continuously generated and an identical work is repeatedly performed.
  • the signal in order to solve the scale problem in a signal with five hundred twelve frequency resolutions (high frequency resolution), the signal must be compared to a signal with two hundred fifty six frequency resolutions (low frequency resolution).
  • the scale problems of (1, 2), (3, 4), , (511, 512) can be solved, but the scale problems in (1, 2) and (3, 4), (3, 4) and (5, 6), , (509, 510) and (511, 512) cannot be solved. Therefore, the scale problem is solved by generating data having a lower structure. That is, the scale problem is solved by generating one hundred twenty eight frequency resolution (low frequency resolution) for two hundred fifty six frequency resolution (high frequency resolution which a signal has at this time), in which this is repeatedly performed until one signal of the low frequency resolution remains.
  • FIG. 5 is a block diagram schematically illustrating a configuration of an apparatus for solving a scale problem in blind signal extraction of a frequency domain according to another embodiment of the present invention.
  • the apparatus 200 for solving the scale problem in the blind signal extraction includes a first frequency component adding unit 210, a second frequency component adding unit 220, and a blind signal extracting unit 230.
  • the first frequency component adding unit 210 adds (2n-1) th frequency component and 2n th frequency component which are different frequency output signals and adjacent to each other.
  • the second frequency component adding unit 220 adds 2m th frequency component and (2m+1) th frequency component which are different frequency output signals and adjacent to each other.
  • the two frequency components can be preferably to be normalized.However, the equalization of the two frequency components is not limited thereto, and may be unconditionally performed regardless of the scale. Further, the equalization of the two frequency components is not limited with relation to equalization of a specific factor such as power, magnitude, and the like, and may be performed relative to various factors.
  • the blind signal extracting unit 230 performs the blind signal extraction based on each of an original output signal, the signal added by the first frequency component adding unit 210, and the signal re-added by the second frequency component re-adding unit 220.
  • the blind signal extracting unit 230 may perform the blind signal extraction from two different Short-Time Fourier Transform (STFT) data, based on a lower frequency resolution in which a relative size of the frequency resolution is small. Further, the blind signal extracting unit 230 can arrange extracted signals such that they overlap and intersect with each other.
  • STFT Short-Time Fourier Transform
  • the alternative approach can solve the scale problem if two sorts of frequency resolutions are present, as shown in FIGS. 6 and 7. That is, it means that only two hundred fifty six frequency resolution signals are necessary when the scale problem is solved in five hundred twelve frequency resolution signals.
  • the frequency component is duplicated in a manner of (1, 2), (2, 3), (3, 4), (4, 5), , (510, 511), and (511, 512), thereby solving the scale problem.
  • FIG. 8 is a flowchart illustrating a process of solving the scale problem in the blind signal extraction according to an embodiment of the present invention.
  • the frequency component adding unit 110 adds (2n-1) th frequency component and 2n th frequency component which are different frequency output signals and adjacent to each other.
  • step S110 the frequency component adding unit 110 repeatedly performs the adding until one signal of the added frequency resolution remains.
  • step S120 the frequency component re-adding unit 120 re-adds a (2n-1) th frequency component and a 2n th frequency component for the frequency component added by the frequency component adding unit 110.
  • the two frequency components can be took an average.
  • the equalization of the two frequency components is not limited thereto, and may be unconditionally performed regardless of the scale.
  • the blind signal extracting unit 130 performs the blind signal extraction based on each of an original output signal, the signal added by the frequency component adding unit 110, and the signal re-added by the frequency component re-adding unit 120.
  • the blind signal extracting unit 130 may perform the blind signal extraction from two different Short-Time Fourier Transform (STFT) data, based on a low frequency resolution in which a relative size of the frequency resolution is small. Further, the blind signal extraction unit 130 pairs the extracted signals in sequence from the lowest frequency resolution to the highest frequency resolution.
  • STFT Short-Time Fourier Transform
  • FIG. 9 is a flowchart illustrating a process of solving the scale problem in the blind signal extraction of the frequency domain according to another embodiment of the present invention.
  • step S210 the first frequency component adding unit 210 adds (2n-1) th frequency component and 2n th frequency component which are different frequency output signals and adjacent to each other.
  • the second frequency component adding unit 220 adds 2m th frequency component and (2m+1) th frequency component which are different frequency output signals and adjacent to each other.
  • the two frequency components can be preferablynormalized.However, the equalization of the two frequency components is not limited thereto, and may be unconditionally performed regardless of the scale.
  • the blind signal extracting unit 230 performs the blind signal extraction based on each of an original output signal, the signal added by the first frequency component adding unit 210, and the signal re-added by the second frequency component re-adding unit 220.
  • the blind signal extracting unit 230 may perform the blind signal extraction from two different Short-Time Fourier Transform (STFT) data, based on a low frequency resolution in which a relative size of the frequency resolution is small. Further, the blind signal extracting unit 230 can arrange extracted signals such that overlap and intersect with each other.
  • STFT Short-Time Fourier Transform
  • an optimal frequency resolution value is changed by a little depending on an environment, and thus the frequency resolution value is not limited to a specific value.
  • the optimal frequency resolution may be determined with reference to a value of E determined by Equation (8).
  • the value of E means a value obtained by calculating a difference between the frequency signal of the low frequency resolution and the high frequency resolution in the scale and permutation by using the Euclidean distance after the scale and permutation of a signal of two neighboring frequency bins of the high frequency resolution are regulated. Accordingly, if the permutation and the scale problem are accurately solved, the value of E becomes zero (0). In the meantime, if they are inaccurate, the value of E increases.
  • the frequency resolution having the largest difference of the values (E2-E1) may be set as the optimal value.
  • all structural elements constituting the embodiment of the present invention are integrated with one piece, or integrally operated, it is noted that the present invention is not limited to the embodiment necessarily. At least two elements of all structural elements may be selectively joined and operate without departing from the scope of the present invention. Further, all structural elements may be implemented in independent hardware respectively, but some or all of the structural elements may be selectively combined and implemented in computer programs which have a program module performing functions of some elements or all elements which are combined in one or more pieces of hardware. Furthermore, such a computer program may be stored in a computer readable medium such as a USB memory, a CD, a flash memory and the like, and read and executed by a computer so as to implement the embodiment of the present invention.
  • a storage medium for the computer program may include a magnetic recording medium, an optical recording medium, a carrier wave medium and the like.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Complex Calculations (AREA)

Abstract

Disclosed are an apparatus and a method for solving a scale problem in blind signal extraction of a frequency domain. The apparatus for solving the scale problem in the blind signal extraction includes: a frequency component adding unit for adding a (2n-1)th frequency component and a 2nth frequency component which are different frequency output signals and adjacent to each other; a frequency component re-adding unit for re-adding a (2m-1)th frequency component and a 2mth frequency component for the frequency component added by the frequency component adding unit: and a blind signal extracting unit for performing blind signal extraction, based on each of an original output signal, a signal added by the frequency element adding unit, and a signal re-added by the frequency component re-adding unit, wherein the scale problem is solved based on a coefficient enabling a Euclidean distance to be minimum and m and n are natural numbers.

Description

APPARATUS AND METHOD FOR SOLVING SCALE PROBLEM IN BLIND SIGNAL EXTRACTION OF FREQUENCY DOMAIN
This application claims the priority under 35 U.S.C. 119(a) to Korean Patent Application Serial No. 10-2013-0019806, which was filed in the Korean Intellectual Property Office on February 25, 2013, the entire content of which is hereby incorporated by reference.
The present invention relates to an apparatus and a method for solving a scale problem in blind signal extraction, and more particularly to an apparatus and a method for solving a scale problem in blind signal extraction of a frequency domain, which is capable of solving the scale problem occurring when the blind signal extraction of the frequency domain is performed.
With development of a technology of processing a digital signal, various voice services using a voice communication and voice recognition system have been provided. However, improvement of capability of the voice communication and voice recognition system is required to effectively apply the various voice services to the system. Especially, in order to improve sound quality and a voice recognition rate in the voice communication, a technology of separating a desired signal from an undesired signal such as a surrounding noise is essentially required.
Recently, in application of a sound source separation in a communication system, a biomedical field and a real environment, application and research for a Blind Signal Separation (BSS) in which a sound/signal source has been estimated by using only a mixed signal observed through a multiple-sensor have actively progressed. Here, the BSS is a technology of estimating an original signal source in the mixed signal.
FIG. 1 is a view illustrating a concept of the BSS.
As a modeling of the BSS, assuming that N unknown signal sources which are statistically and mutually independent, and of which an average is zero are received via M sensors (microphones) through mixed independent paths respectively, a relation formula expressed by Equation (1) is established between a microphone signal x and an original signals.
Equation (1) :
Figure PCTKR2014001536-appb-I000001
In Equation 1, hjk(l) means a transfer function from a kth original signal to a jth microphone. Generally, a value of M may be used as N.
The purpose of the BSS is to separate the hjk(l) and an sk from only a signal xj which is a mixed signal, in an unknown state, as expressed by Equation (2).
Equation (2) :
Figure PCTKR2014001536-appb-I000002
That is, estimation of a value of an unmixing filter or demixing filter wij(l) is a final aim. The Equations (1) and (2) are formulas for processing data in a time domain, in which L indicates a magnitude of the filter. As the magnitude of the filter increases, an amount of calculation significantly increases. Accordingly, in order to reduce the amount of the calculation, the data in the time domain need to be converted in a signal in a frequency domain. At this time, a frequency domain BSS is performed by using a Short-Time Fourier Transform (STFT), and is expressed by Equation (3).
Equation (3)
:
Figure PCTKR2014001536-appb-I000003
That is, the STFT is a scheme in which data xj enough to correspond to a window size with a length of L is loaded and Fourier-transformed, and is repeatedly performed by shifting the data by a little. Through the above-mentioned scheme, a two dimensional data with information on a transverse time frame and a longitudinal frequency bin is made, and thus, in the Equations (1) and (2), convolution in the time domain is transformed into multiplication in the frequency domain. The amount of calculation can be significantly reduced. Therefore, a relation of the microphone signal and an output signal can be expressed by Equation (4).
Equation (4) :
Figure PCTKR2014001536-appb-I000004
In the case of the blind signal extraction, only one signal is extracted differently from the BSS. That is, in the case of the BSS, several signals are separated from each frequency, a permutation problem occurs across different frequency components. However, in the case of the BSE, since only one signal is extracted from each frequency, there is no permutation problem, but a scale problem still remains.
The present invention has been made to solve the above-mentioned problem in the conventional art, and an aspect of the present invention is to provide an apparatus and a method for solving a scale problem in blind signal extraction of a frequency domain, which are capable of solving the scale problem occurring when the blind signal extraction of the frequency domain is performed.
In accordance with an aspect of the present invention, an apparatus for solving a scale problem in blind signal extraction is provided. The apparatus includes: a frequency component adding unit for adding a (2n-1)th frequency component and a 2nth frequency component which are different frequency output signals and adjacent to each other; a frequency component re-adding unit for re-adding a (2m-1)th frequency component and a 2mth frequency component for the frequency component added by the frequency component adding unit: and a blind signal extracting unit for performing blind signal extraction, based on each of an original output signal, a signal added by the frequency element adding unit, and a signal re-added by the frequency component re-adding unit, wherein m and n are natural numbers. In this case, the scale problem is solved based on a coefficient enabling a Euclidean distance to be minimum.
The blind signal extracting unit may perform the blind signal extraction from two different Short-Time Fourier Transform (STFT) data, based on a lower frequency resolution in which a relative size of the frequency resolution is small.
The frequency component adding unit repeatedly performs the adding until a signal of the added frequency resolution becomes one.
The blind signal extraction unit pairs the extracted signals in sequence from the lowest frequency resolution to the highest frequency resolution.
When any one of the (2n-1)th frequency component and the 2nth frequency component added by the frequency component adding unit, and the (2m-1)th frequency component and the 2mth frequency component added by the frequency component re-adding unit is larger than a set scale as compared to the remaining one, it would be better to normalize the two frequency components.
In the apparatus for solving the scale problem in the blind signal extraction as described above, the values of all frequencies in various frequency resolutions are calculated by using a Euclidean distance and are averaged, and the frequency resolution which has the lowest average is set as an optimal frequency resolution.
In the apparatus for solving the scale problem in the blind signal extraction as described above, a value E1 for an optimal solution calculated in all frequencies for various frequency resolutions may be calculated by using the Euclidean distance, a value E2 for a secondary optimal solution is calculated by using the Euclidean distance, and the frequency resolution having a largest difference of the calculated values (E2-E1) may be set as an optimal value.
In accordance with another aspect of the present invention, an apparatus for solving a scale problem in blind signal extraction is provided. The apparatus includes: a first frequency component adding unit for adding a (2n-1)th frequency component and a 2nth frequency component which are different frequency output signals and adjacent to each other; a second frequency component adding unit for adding a 2mth frequency component and a (2m+1)th frequency component which are different frequency output signals and adjacent to each other; and a blind signal extracting unit for performing blind signal extraction, based on each of an original output signal, a signal added by the first frequency component adding unit, and a signal added by the second frequency component adding unit, wherein m and n are natural numbers. In this case, the scale problem is solved based on a coefficient enabling a Euclidean distance to be minimum.
The blind signal extracting unit may perform the blind signal extraction from two different Short-Time Fourier Transform (STFT) data, based on a lower frequency resolution in which a relative size of the frequency resolution is small.
The extracted signals are arranged such that they overlap and intersect with each other.
When any one of the (2n-1)th frequency component and the 2nth frequency component added by the first frequency component adding unit, and the (2m+1)th frequency component and the 2mth frequency component added by the frequency component adding unit is larger than a set scale as compared to the remaining one, the two frequency components are preferably to be normalized.
In the apparatus for solving the scale problem in the blind signal extraction as described above, the values of all frequencies in various frequency resolutions are calculated by using a Euclidean distance and are averaged, and the frequency resolution which has the lowest average is set as an optimal frequency resolution.
In the apparatus for solving the scale problem in the blind signal extraction as described above, a value E1 for an optimal solution calculated in all frequencies for various frequency resolutions may be calculated by using the Euclidean distance, a value E2 for a secondary optimal solution is calculated by using the Euclidean distance, and the frequency resolution having a largest difference of the calculated values (E2-E1) may be set as an optimal value.
In accordance with still another aspect of the present invention, a method of solving a scale problem in blind signal extraction is provided. The method includes: adding a (2n-1)th frequency component and a 2nth frequency component which are different frequency output signals and adjacent to each other; re-adding a (2m-1)th frequency component and a 2mth frequency component for the frequency component added in the adding of the frequency component: performing blind signal extraction, based on each of an original output signal, a signal added by the frequency component adding unit, and a signal re-added by the frequency component re-adding unit; and pairing the extracted signals in sequence from a lowest frequency resolution to a highest frequency resolution, wherein m and n are natural numbers. In this case, the scale problem is solved based on a coefficient enabling a Euclidean distance to be minimum.
In the performing of the blind signal extraction, the blind signal extraction from two different Short-Time Fourier Transform (STFT) data is performed, based on a lower frequency resolution in which a relative magnitude of the frequency resolution is small.
The adding of the frequency component is repeatedly performed until a signal of the added frequency resolution becomes one.
When any one of the (2n-1)th frequency component and the 2nth frequency component added in the first frequency component adding, and the (2m+1)th frequency component and the 2mth frequency component added in the second frequency component adding is larger than a set scale as compared to the remaining one, the two frequency components are normalized
In the method of solving the scale problem in the blind signal extraction as described above, the values of all frequencies in various frequency resolutions are calculated by using a Euclidean distance and are averaged, and the frequency resolution which has the lowest average is set as an optimal frequency resolution.
In the method of solving the scale problem in the blind signal extraction as described above, a value E1 for an optimal solution calculated in all frequencies for various frequency resolutions may be calculated by using the Euclidean distance, a value E2 for a secondary optimal solution is calculated by using the Euclidean distance, and the frequency resolution having a largest difference of the calculated values (E2-E1) may be set as an optimal value.
In accordance with still another aspect of the present invention, a method of solving a scale problem in blind signal extraction is provided. The method includes: first-adding a (2n-1)th frequency component and a 2nth frequency component which are different frequency output signals and adjacent to each other; second-adding a 2mth frequency component and a (2m+1)th frequency component which are different frequency output signals and adjacent to each other; performing blind signal extraction, based on each of an original output signal, a signal added in the first frequency component adding, and a signal added in the second frequency component adding, and arranging the extracted signals such that they overlap and intersect with each other, wherein m and n are natural numbers. In this case, the scale problem is solved based on a coefficient enabling a Euclidean distance to be minimum.
In the performing of the blind signal extraction, the blind signal extraction from two different Short-Time Fourier Transform (STFT) data is performed, based on a lowerfrequency resolution in which a relative magnitude of the frequency resolution is small.
When any one of the (2n-1)th frequency component and the 2nth frequency component added in the first frequency component adding, and the (2m+1)th frequency component and the 2mth frequency component added in the second frequency component adding is larger than a set scale as compared to the remaining one, the two frequency components are preferably to be normalized.
In the method of solving the scale problem in the blind signal extraction as described above, the values of all frequencies in various frequency resolutions are calculated by using a Euclidean distance and are averaged, and the frequency resolution which has the lowest average is set as an optimal frequency resolution.
In the method of solving the scale problem in the blind signal extraction as described above, a value E1 for an optimal solution calculated in all frequencies for various frequency resolutions may be calculated by using the Euclidean distance, a value E2 for a secondary optimal solution is calculated by using the Euclidean distance, and the frequency resolution having a largest difference of the calculated values (E2-E1) may be set as an optimal value.
According to the present invention, when the blind signal extraction of the frequency domain is performed, the scale problem can be solved because only one signal for different frequencies is extracted.
The above and other aspects, features, and advantages of the present invention will be more apparent from the following detailed description taken in conjunction with the accompanying drawings, in which:
FIG. 1 is a view illustrating a concept of blind signal separation;
FIG. 2 is a block diagram schematically illustrating a configuration of an apparatus for solving a scale problem in blind signal extraction of a frequency domain according to an embodiment of the present invention;
FIG. 3 is a view illustrating a relation of data with different resolutions according to the embodiment of the present invention;
FIG. 4 is a view illustrating a hierarchical approach to solve the scale problem according to the embodiment of the present invention;
FIG. 5 is a block diagram schematically illustrating a configuration of an apparatus for solving a scale problem in blind signal extraction of a frequency domain according to another embodiment of the present invention;
FIG. 6 is a view illustrating an example of an alternative approach to solve the scale problem according to the embodiment of the present invention;
FIG. 7 is a view illustrating another example of the alternative approach to solve the scale problem according to the embodiment of the present invention;
FIG. 8 is a flowchart illustrating a process of solving the scale problem in the blind signal extraction of the frequency domain according to an embodiment of the present invention; and
FIG. 9 is a flowchart illustrating a process of solving the scale problem in the blind signal extraction of the frequency domain according to another embodiment of the present invention.
Hereinafter, the embodiments of the present invention will be described in detail with reference to the accompanying drawings. In the following description, the description of a technology which is known to those skilled in the art will be omitted.
In the description of the structural elements of the present invention, further, a different reference numeral may denote structural elements with the same name in each drawing, and also an identical reference numeral may indicate the same structural element in the drawings. However, this case does not means that a corresponding structural element has a different function in each embodiment, or that the structural element has the same function in different embodiments. The function of each structural element should be determined based on the description of each structural element.
Further, in the description of the embodiments of the present invention, a detailed description of known structures or functions incorporated herein will be omitted as it may make the subject matter of the present invention rather unclear.
Furthermore, in the description of the structural element of the present invention, terms of ‘first’, ‘second’, ‘A’, ‘B’, (a)’, ‘(b)’ and the like may be used. This term is merely used to distinguish one structural element from other structural elements, and a property, an order, a sequence and the like of a corresponding structural element are not limited by the term. In the case that it is described that any structural element ‘is connected to’, ‘is joined to’, or ‘contacts’ another structural element, it should be understood that the structural element is directly connected to or joined to another structural element, but a third structural element may be connected to, joined to, or contact them.
FIG. 2 is a block diagram schematically illustrating a configuration of an apparatus for solving a scale problem in blind signal extraction of a frequency domain according to an embodiment of the present invention.
Referring to FIG. 2, the apparatus 100 for solving the scale problem in the blind signal extraction according to the embodiment of the present invention includes a frequency component adding unit 110, a frequency component re-adding unit 120, and a blind signal extracting unit 130.
The frequency component adding unit 110 adds a (2n-1)th frequency component and a 2nth frequency component which are different frequency output signals and adjacent to each other. The frequency component adding unit 110 repeatedly performs the adding until a resolution signal of the added frequency becomes one (1).
The frequency component re-adding unit 120 re-adds a (2m-1)th frequency component and a 2mth frequency component for the frequency component added by the frequency component adding unit 110.
When any one of the (2n-1)th frequency component and the 2nth frequency component added by the frequency component adding unit 110, and the (2m-1)th frequency component and the 2mth frequency component added by the frequency component adding unit 120 is larger than a set scale as compared to the remaining one, the two frequency components can benormalized. However, the equalization of the two frequency components is not limited thereto, and may be unconditionally performed regardless of the scale. Further, the normalization of the two frequency components is not limited with relation to normalization of a specific factor such as power, magnitude, and the like, and may be performed relative to various factors.
The blind signal extracting unit 130 performs the blind signal extraction based on each of an original output signal, the signal added by the frequency component adding unit 110, and the signal re-added by the frequency component re-adding unit 120. Here, the blind signal extracting unit 130 may perform the blind signal extraction from two different Short-Time Fourier Transform (STFT) data, based on a low frequency resolution in which a relative size of the frequency resolution is small. Further, the blind signal extraction unit 130 pairs the extracted signals in sequence from the lowest frequency resolution to the highest frequency resolution.
In the case of the blind signal extraction, assuming that only one sequence of the output is present, in Equation (4), Y(f, t) = Y1(f, t), X(f, t) = [X1(f, t) X2(f, t)]T, and W(f) = [W11(f) W12(f)]. That is, only a demixing filter for extracting only one signal and a corresponding extracted signal are present.
FIG. 3 is a view illustrating a relation of data with different resolutions according to the embodiment of the present invention. The data with the low frequency resolution is identical to a sum of two neighboring values of data with the high frequency resolution. For example, assuming that x(i) has signal information of 100~200Hz and x(i+1) has signal information of 200~300Hz, x_bar(i, i+1) may be a sum of two signals x(i) and x(i+1) which are data having the signal information of 100~300Hz and the low frequency resolution.
Equations can be expanded as described above. Here, the high frequency resolution and the low frequency resolution are determined by a relative magnitude of the frequency resolution of two different STFT data to be compared.
At this time, the data with the low frequency resolution may be utilized in order to solve the scale problem of the data with the high frequency resolution. That is, in order to solve the scale problem, basically, different frequency data are required as expressed by Equation 5. The Equation 5 is a formula related to the frequency resolution.
Equation (5):
Figure PCTKR2014001536-appb-I000005
In Equation (5), for convenience, a character i denotes a frequency instead of a character f, and information on a time frame is omitted. Further, a signal of a low frequency domain merging two frequency domains can be expressed by Equation (6).
Equation (6) :
Figure PCTKR2014001536-appb-I000006
In Equation (6), the importance is that blind signal separation is performed in order to predict different demixing filters with respect to three data x(i), x(i+1) and x_bar (i, i+1). A thing which should be kept in mind to progress onward is that separated signals y, y(i+1), and y_bar(i, i+1) must be established by y_bar(i, i+1)=y(i)+y(i+1) for an 'identical signal'. In the embodiment of the present invention, since it is already assumed that three signals come from the signal having all in the case of the blind signal extraction, it is applied such that equality is established if a proportional constant is satisfied.
Thus, in the embodiment of the present invention, a distance measure indicated by Equation 7 is proposed.
Equation (7)
:
Figure PCTKR2014001536-appb-I000007
Here, since the Equation (7) is a quadratic equation, the Equation (7) is differentiated by coefficients a and b so as to obtain values of a and b and values of remaining y are loaded from the data, thereby calculating a value of E corresponding to a, b and y. The Equation (7) can be expressed as Equation (8).
Equation (8) :
Figure PCTKR2014001536-appb-I000008
In Equation (8), m is a sample index, and i is an index of a frequency component. In the Equation (8), after two neighboring components
Figure PCTKR2014001536-appb-I000009
of a high frequency resolution are multiplied by values of two complex numbers a and b, a Euclidean distance of the obtained value and a component
Figure PCTKR2014001536-appb-I000010
of a low frequency resolution is expressed by a formula. The Equation (8) can be differentiated for an and bn as indicated by Equation (9), in order to seek a and b so that the Euclidean distance E is minimum.
Equation (9) :
Figure PCTKR2014001536-appb-I000011
In Equation (10), n of an and bn may be ignored because it corresponds to a number of an output signal. an and bn such that the differential equation becomes zero (0) can be obtained by using the Equation (10).
Equation (10) :
Figure PCTKR2014001536-appb-I000012
In Equation (10), * means a complex conjugate. The scale problem between the two components of the high frequency resolution can be solved by using the obtained an and bn. At this time, when y is estimated as a unit variance of a signal S, y may be expressed by Equation (11).
Equation (11):
Figure PCTKR2014001536-appb-I000013
Further, since it is predicted that following Equation (12) can be derived from Equation (8),
Equation (12) :
Figure PCTKR2014001536-appb-I000014
an and bn can be calculated by Equation (13) from these relative Equations, and the scale problem can be solved by appropriately using an and bn.
Equation (13) :
Figure PCTKR2014001536-appb-I000015
The scale problem for two different frequency output signals in the high frequency resolution signal can be solved. The terminal aim of the present invention is to solve the scale problem for all frequencies, and the solution of the scale problem may be generally achieved in two manners. The first manner is a hierarchical approach, and the other is an alternative approach.
In the hierarchical approach, as shown in FIG. 4, a signal of a lower frequency resolution is structurally and continuously generated and an identical work is repeatedly performed. For example, in order to solve the scale problem in a signal with five hundred twelve frequency resolutions (high frequency resolution), the signal must be compared to a signal with two hundred fifty six frequency resolutions (low frequency resolution). However, in the circumstance, the scale problems of (1, 2), (3, 4), , (511, 512) can be solved, but the scale problems in (1, 2) and (3, 4), (3, 4) and (5, 6), , (509, 510) and (511, 512) cannot be solved. Therefore, the scale problem is solved by generating data having a lower structure. That is, the scale problem is solved by generating one hundred twenty eight frequency resolution (low frequency resolution) for two hundred fifty six frequency resolution (high frequency resolution which a signal has at this time), in which this is repeatedly performed until one signal of the low frequency resolution remains.
FIG. 5 is a block diagram schematically illustrating a configuration of an apparatus for solving a scale problem in blind signal extraction of a frequency domain according to another embodiment of the present invention.
Referring to FIG. 5, the apparatus 200 for solving the scale problem in the blind signal extraction according to another embodiment of the present invention includes a first frequency component adding unit 210, a second frequency component adding unit 220, and a blind signal extracting unit 230.
The first frequency component adding unit 210 adds (2n-1)th frequency component and 2nth frequency component which are different frequency output signals and adjacent to each other.
The second frequency component adding unit 220 adds 2mth frequency component and (2m+1)th frequency component which are different frequency output signals and adjacent to each other.
When any one of the (2n-1)th frequency component and the 2nth frequency component added by the frequency component adding unit 210, and the (2m+1)th frequency component and the 2mth frequency component added by the second frequency component 220 is larger than a set scale as compared to the remaining one, the two frequency components can be preferably to be normalized.However, the equalization of the two frequency components is not limited thereto, and may be unconditionally performed regardless of the scale. Further, the equalization of the two frequency components is not limited with relation to equalization of a specific factor such as power, magnitude, and the like, and may be performed relative to various factors.
The blind signal extracting unit 230 performs the blind signal extraction based on each of an original output signal, the signal added by the first frequency component adding unit 210, and the signal re-added by the second frequency component re-adding unit 220. Here, the blind signal extracting unit 230 may perform the blind signal extraction from two different Short-Time Fourier Transform (STFT) data, based on a lower frequency resolution in which a relative size of the frequency resolution is small. Further, the blind signal extracting unit 230 can arrange extracted signals such that they overlap and intersect with each other.
The alternative approach can solve the scale problem if two sorts of frequency resolutions are present, as shown in FIGS. 6 and 7. That is, it means that only two hundred fifty six frequency resolution signals are necessary when the scale problem is solved in five hundred twelve frequency resolution signals. In this case, when the scale problem is solved in the five hundred twelve frequency resolution signals, the frequency component is duplicated in a manner of (1, 2), (2, 3), (3, 4), (4, 5), , (510, 511), and (511, 512), thereby solving the scale problem.
FIG. 8 is a flowchart illustrating a process of solving the scale problem in the blind signal extraction according to an embodiment of the present invention.
Referring FIG. 8, the frequency component adding unit 110 adds (2n-1)th frequency component and 2nth frequency component which are different frequency output signals and adjacent to each other. In step S110, the frequency component adding unit 110 repeatedly performs the adding until one signal of the added frequency resolution remains.
In step S120, the frequency component re-adding unit 120 re-adds a (2n-1)th frequency component and a 2nth frequency component for the frequency component added by the frequency component adding unit 110.
When any one of the (2n-1)th frequency component and the 2nth frequency component added by the frequency component adding unit 110, and the (2m-1)th frequency component and the 2mth frequency component added by the frequency component re-adding unit 120 is larger than a set scale as compared to the remaining one, the two frequency components can be took an average. However, the equalization of the two frequency components is not limited thereto, and may be unconditionally performed regardless of the scale.
In step S130, the blind signal extracting unit 130 performs the blind signal extraction based on each of an original output signal, the signal added by the frequency component adding unit 110, and the signal re-added by the frequency component re-adding unit 120. Here, the blind signal extracting unit 130 may perform the blind signal extraction from two different Short-Time Fourier Transform (STFT) data, based on a low frequency resolution in which a relative size of the frequency resolution is small. Further, the blind signal extraction unit 130 pairs the extracted signals in sequence from the lowest frequency resolution to the highest frequency resolution.
FIG. 9 is a flowchart illustrating a process of solving the scale problem in the blind signal extraction of the frequency domain according to another embodiment of the present invention.
Referring to FIG. 9, in step S210, the first frequency component adding unit 210 adds (2n-1)th frequency component and 2nth frequency component which are different frequency output signals and adjacent to each other.
The second frequency component adding unit 220 adds 2mth frequency component and (2m+1)th frequency component which are different frequency output signals and adjacent to each other.
When any one of the (2n-1)th frequency component and the 2nth frequency component added by the frequency component adding unit 210, and the (2m+1)th frequency component and the 2mth frequency component added by the second frequency component 220 is larger than a set scale as compared to the remaining one, the two frequency components can be preferablynormalized.However, the equalization of the two frequency components is not limited thereto, and may be unconditionally performed regardless of the scale.
In step S230, the blind signal extracting unit 230 performs the blind signal extraction based on each of an original output signal, the signal added by the first frequency component adding unit 210, and the signal re-added by the second frequency component re-adding unit 220. Here, the blind signal extracting unit 230 may perform the blind signal extraction from two different Short-Time Fourier Transform (STFT) data, based on a low frequency resolution in which a relative size of the frequency resolution is small. Further, the blind signal extracting unit 230 can arrange extracted signals such that overlap and intersect with each other.
According to the apparatus and the method for solving the scale problem in the above-mentioned blind signal extraction, an optimal frequency resolution value is changed by a little depending on an environment, and thus the frequency resolution value is not limited to a specific value. At this time, the optimal frequency resolution may be determined with reference to a value of E determined by Equation (8). The value of E means a value obtained by calculating a difference between the frequency signal of the low frequency resolution and the high frequency resolution in the scale and permutation by using the Euclidean distance after the scale and permutation of a signal of two neighboring frequency bins of the high frequency resolution are regulated. Accordingly, if the permutation and the scale problem are accurately solved, the value of E becomes zero (0). In the meantime, if they are inaccurate, the value of E increases.
Two methods may be proposed considering this point. In the first method, an average of values of E for all frequencies of the various frequency resolutions is calculated, and the frequency resolution having the lowest average value of E is regarded as an optimal frequency resolution. In the second method, not only a value E1 of E for the optimal solution (in which the permutation and the scale are solved) but also a value E2 of E for a secondary optimal solution in each frequency of the various frequency resolutions are calculated, and then a difference of two values (E2-E1) is observed. The more the difference of the values (E2-E1) increases, the more clear a distinction for two solutions is, so that reliability for the value of E1 is more improved. Meanwhile, the less the difference of the values (E2-E1) decreases, the more unclear the distinction for two solutions is, so that the reliability for the value of E1 is degraded. That is, the frequency resolution having the largest difference of the values (E2-E1) may be set as the optimal value.
After the optimal frequency resolution is obtained, an area of the frequency resolution in which a signal is desired to be essentially processed is calculated by using a principle of the above-mentioned hierarchical approach. For example, an aim is to calculate NFFT=2048 in order to currently process a voice. Assuming that an optimal frequency resolution for solving the scale problem in the proposed manner is NFFT=512, the scale problem for NFFT=512 and NFFT=1024 is solved by a reference of NFFT=512, and then the scale problem for NFFT=104 and NFFT=2048 is solved again, thereby solving the scale problem step by step for NFFT=2048 which is the frequency resolution to be essentially solved.
Although it is described that all structural elements constituting the embodiment of the present invention are integrated with one piece, or integrally operated, it is noted that the present invention is not limited to the embodiment necessarily. At least two elements of all structural elements may be selectively joined and operate without departing from the scope of the present invention. Further, all structural elements may be implemented in independent hardware respectively, but some or all of the structural elements may be selectively combined and implemented in computer programs which have a program module performing functions of some elements or all elements which are combined in one or more pieces of hardware. Furthermore, such a computer program may be stored in a computer readable medium such as a USB memory, a CD, a flash memory and the like, and read and executed by a computer so as to implement the embodiment of the present invention. A storage medium for the computer program may include a magnetic recording medium, an optical recording medium, a carrier wave medium and the like.
Further, all terminologies including technical or scientific terms have the same meanings as those generally understood by persons skilled in the art to which the present invention belongs, respectively, unless they are differently defined in the detailed description. It should be interpreted that general terms including the predefined terms are identical to meanings in a context of the related technology. Also, the terms should not be ideally or excessively interpreted as formal meanings unless they are not clearly defined in the present invention.
Although the technical spirit of the present invention is simply and exemplary described in the above-mentioned description, it will be appreciated by those skilled in the art to which the present invention pertains that various changes and modifications may be implemented without departing from the natural scope of the present invention. Further, the embodiments of the present invention disclosed in the description are to not limit but describe the technical spirits of the present invention, and of course the technical scope and spirits of the present invention is not limited by these embodiments. Thus, the scope of the present invention should be interpreted through the claims appended to the description, and it should be understood that all technical spirits within equivalents thereto pertain to the scope of the present invention.

Claims (24)

  1. An apparatus for solving a scale problem in blind signal extraction of a frequency domain, the apparatus comprising:
    a frequency component adding unit for adding a (2n-1)th frequency component and a 2nth frequency component which are different frequency output signals and adjacent to each other;
    a frequency component re-adding unit for re-adding a (2m-1)th frequency component and a 2mth frequency component for the frequency component added by the frequency component adding unit: and
    a blind signal extracting unit for performing blind signal extraction, based on each of an original output signal, a signal added by the frequency element adding unit, and a signal re-added by the frequency component re-adding unit,
    wherein the scale problem is solved based on a coefficient enabling a Euclidean distance to be minimum and m and n are natural numbers.
  2. The apparatus as claimed in claim 1, wherein the blind signal extracting unit performs the blind signal extraction from two different Short-Time Fourier Transform (STFT) data, based on a lower frequency resolution in which a relative magnitude of the frequency resolution is small.
  3. he apparatus as claimed in claim 1, wherein the frequency component adding unit repeatedly performs the adding until one signal of the added frequency resolution remains.
  4. The apparatus as claimed in claim 3, wherein the blind signal extracting unit pairs the extracted signals in sequence from a lowest frequency resolution to a highest frequency resolution.
  5. The apparatus as claimed in claim 1, wherein when any one of the (2n-1)th frequency component and the 2nth frequency component added by the frequency component adding unit, and the (2m-1)th frequency component and the 2mth frequency component re-added by the frequency element re-adding unit is larger than a set scale as compared to the remaining one, the two frequency components arenormalized.
  6. The apparatus as claimed in claim 1, wherein the values of all frequencies in various frequency resolutions are calculated by using a Euclidean distance and are averaged, and the frequency resolution which has the lowest average is set as an optimal frequency resolution.
  7. The apparatus as claimed in claim 6, wherein a value E1 for an optimal solution calculated in all frequencies in various frequency resolutions is calculated by using the Euclidean distance, a value E2 for a secondary optimal solution is calculated by using the Euclidean distance, and the frequency resolution having a largest difference of the calculated values (E2-E1) is set as an optimal value.
  8. An apparatus for solving a scale problem in blind signal extraction, the apparatus comprising:
    a first frequency component adding unit for adding a (2n-1)th frequency component and a 2nth frequency component which are different frequency output signals and adjacent to each other;
    a second frequency component adding unit for adding a 2mth frequency component and a (2m+1)th frequency component which are different frequency output signals and adjacent to each other; and
    a blind signal extracting unit for performing blind signal extraction, based on each of an original output signal, a signal added by the first frequency component adding unit, and a signal added by the second frequency component adding unit,
    wherein the scale problem is solved based on a coefficient enabling a Euclidean distance to be minimum, and m and n are natural numbers.
  9. The apparatus as claimed in claim 8, wherein the blind signal extracting unit performs the blind signal extraction from two different Short-Time Fourier Transform (STFT) data, based on a lower frequency resolution in which a relative magnitude of the frequency resolution is small.
  10. The apparatus as claimed in claim 9, wherein the extracted signals are arranged such that they overlap and intersect with each other.
  11. The apparatus as claimed in claim 8, wherein when any one of the (2n-1)th frequency component and the 2nth frequency component added by the first frequency component adding unit, and the (2m+1)th frequency component and the 2mth frequency component added by the frequency component adding unit is larger than a set scale as compared to the remaining one, the two frequency components arenormalized.
  12. The apparatus as claimed in claim 8, wherein the values of all frequencies in various frequency resolutions are calculated by using a Euclidean distance and are averaged, and the frequency resolution which has the lowest average is set as an optimal frequency resolution.
  13. The apparatus as claimed in claim 12, wherein a value E1 for a calculated optimal solution in all frequencies of various frequency resolutions is calculated by using the Euclidean distance, a value E2 for a secondary optimal solution is calculated by using the Euclidean distance, and the frequency resolution having a largest difference of the calculated values (E2-E1) is set as an optimal value.
  14. A method of solving a scale problem in blind signal extraction, the method comprising:
    adding a (2n-1)th frequency component and a 2nth frequency component which are different frequency output signals and adjacent to each other;
    re-adding a (2m-1)th frequency component and a 2mth frequency component for the frequency component added in the frequency component adding:
    performing blind signal extraction, based on each of an original output signal, a signal added by a frequency component adding unit, and a signal re-added by a frequency component re-adding unit; and
    pairing the extracted signals in sequence from a lowest frequency resolution to a highest frequency resolution,
    wherein the scale problem is solved based on a coefficient enabling a Euclidean distance to be minimum, and m and n are natural numbers.
  15. The method as claimed in claim 14, wherein in the performing of the blind signal extraction, the blind signal extraction from two different Short-Time Fourier Transform (STFT) data is performed, based on a lower frequency resolution in which a relative magnitude of the frequency resolution is small.
  16. The method as claimed in claim 14, wherein the adding of the frequency component is repeatedly performed until a signal of the added frequency resolution becomes one.
  17. The method as claimed in claim 14, wherein when any one of the (2n-1)th frequency component and the 2nth frequency component added in the first frequency component adding, and the (2m+1)th frequency component and the 2mth frequency component added in the second frequency component adding is larger than a set scale as compared to the remaining one, the two frequency components arenormalized.
  18. The method as claimed in claim 14, wherein the values of all frequencies for various frequency resolutions are calculated by using a Euclidean distance and are averaged, and the frequency resolution having the lowest average is set as an optimal frequency resolution.
  19. The method as claimed in claim 18, wherein a value E1 for an optimal solution calculated in all frequencies in various frequency resolutions is calculated by using the Euclidean distance, a value E2 for a secondary optimal solution is calculated by using the Euclidean distance, and the frequency resolution having a largest difference of the calculated values (E2-E1) is set as an optimal value.
  20. A method of solving a scale problem in blind signal extraction, the method comprising:
    first-adding a (2n-1)th frequency component and a 2nth frequency component which are different frequency output signals and adjacent to each other;
    second-adding a 2mth frequency component and a (2m+1)th frequency component which are different frequency output signals and adjacent to each other;
    performing blind signal extraction, based on each of an original output signal, a signal added in the first frequency component adding, and a signal added in the second frequency component adding, and
    arranging the extracted signals such that they overlap and intersect with each other,
    wherein the scale problem is solved based on a coefficient enabling a Euclidean distance to be minimum, and m and n are natural numbers.
  21. The method as claimed in claim 20, wherein in the performing of the blind signal extraction, the blind signal extraction from two different Short-Time Fourier Transform (STFT) data is performed, based on a low frequency resolution in which a relative magnitude of the frequency resolution is small.
  22. The method as claimed in claim 20, wherein when any one of the (2n-1)th frequency component and the 2nth frequency component added in the first frequency component adding, and the (2m+1)th frequency component and the 2mth frequency component added in the second frequency component adding is larger than a set scale as compared to the remaining one, the two frequency components arenormalized.
  23. The method as claimed in claim 20, wherein the values of all frequencies for various frequency resolutions are calculated by using a Euclidean distance and are averaged, and the frequency resolution having the lowest average is set as an optimal frequency resolution.
  24. The method as claimed in claim 23, wherein a value E1 for an optimal solution calculated in all frequencies for various frequency resolutions is calculated by using the Euclidean distance, a value E2 for a secondary optimal solution is calculated by using the Euclidean distance, and the frequency resolution having a largest difference of the calculated values (E2-E1) is set as an optimal value.
PCT/KR2014/001536 2013-02-25 2014-02-25 Apparatus and method for solving scale problem in blind signal extraction of frequency domain WO2014129878A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR1020130019806A KR101474633B1 (en) 2013-02-25 2013-02-25 Scale Problem Solving Apparatous and Method for Blind Signal Sampling in Frequency Domain
KR10-2013-0019806 2013-02-25

Publications (1)

Publication Number Publication Date
WO2014129878A1 true WO2014129878A1 (en) 2014-08-28

Family

ID=51391582

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2014/001536 WO2014129878A1 (en) 2013-02-25 2014-02-25 Apparatus and method for solving scale problem in blind signal extraction of frequency domain

Country Status (2)

Country Link
KR (1) KR101474633B1 (en)
WO (1) WO2014129878A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20110070764A (en) * 2009-12-18 2011-06-24 한국전자통신연구원 Method for separating blind signal and apparatus for performing the same
KR20110121955A (en) * 2010-05-03 2011-11-09 한국과학기술원 Method and apparatus for blind source extraction
US20130031152A1 (en) * 2011-07-29 2013-01-31 Dolby Laboratories Licensing Corporation Methods and apparatuses for convolutive blind source separation

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20110070764A (en) * 2009-12-18 2011-06-24 한국전자통신연구원 Method for separating blind signal and apparatus for performing the same
KR20110121955A (en) * 2010-05-03 2011-11-09 한국과학기술원 Method and apparatus for blind source extraction
US20130031152A1 (en) * 2011-07-29 2013-01-31 Dolby Laboratories Licensing Corporation Methods and apparatuses for convolutive blind source separation

Also Published As

Publication number Publication date
KR101474633B1 (en) 2014-12-24
KR20140105963A (en) 2014-09-03

Similar Documents

Publication Publication Date Title
WO2017213398A1 (en) Learning model for salient facial region detection
WO2016171341A1 (en) Cloud-based pathology analysis system and method
WO2017144007A1 (en) Method and system for audio recognition based on empirical mode decomposition
WO2010038941A2 (en) Apparatus and method of obtaining high resolution image
WO2016163755A1 (en) Quality measurement-based face recognition method and apparatus
WO2019164074A1 (en) Fine dust analysis method, and apparatus for performing same
WO2018151503A2 (en) Method and apparatus for gesture recognition
WO2020027372A1 (en) Voice interpretation device
WO2012050252A1 (en) System and method for automatically generating a mass classifier using a dynamic combination of classifiers
WO2022146050A1 (en) Federated artificial intelligence training method and system for depression diagnosis
WO2016165298A1 (en) Image sieving method and image sieving system
WO2011162446A1 (en) Module and method for deciding named entity of term using named entity dictionary combined with ontology schema and mining rule
WO2021225296A1 (en) Method for explainable active learning, to be used for object detector, by using deep encoder and active learning device using the same
WO2021112273A1 (en) Method and device for de-identifying personal information consecutively in deep learning-based video
WO2015119326A1 (en) Display device and method for controlling the same
WO2013066095A1 (en) Method and device for detecting face, and non-transitory computer-readable recording medium for executing the method
WO2022216109A1 (en) Method and electronic device for quantizing dnn model
WO2022085846A1 (en) Method for improving quality of voice data, and apparatus using same
WO2022220354A1 (en) Fish shoal ecosystem monitoring system device for detecting abnormality in fish shoal ecosystem, and method for operation same
WO2014129878A1 (en) Apparatus and method for solving scale problem in blind signal extraction of frequency domain
WO2011053060A2 (en) Information-processing device, information-processing method and program
WO2024204919A1 (en) Deep learning-based shale gas production prediction system and method
WO2017043945A1 (en) Method and apparatus for recognizing detailed facial expressions
WO2023158068A1 (en) Learning system and method for improving object detection rate
WO2018199367A1 (en) Stereo channel noise removing device and method

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14754024

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 14754024

Country of ref document: EP

Kind code of ref document: A1