JP2016504622A

JP2016504622A - Method for calculating at least two individual signals from at least two output signals

Info

Publication number: JP2016504622A
Application number: JP2015548194A
Authority: JP
Inventors: コトゥッラ・ダーニエール
Original assignee: エイエスケイ・インダストリーズ・ゲゼルシャフト・ミト・ベシュレンクテル・ハフツング
Priority date: 2012-12-20
Filing date: 2013-12-10
Publication date: 2016-02-12
Anticipated expiration: 2033-12-10
Also published as: JP6163211B2; WO2014094709A3; WO2014094709A2; DE102012025016B3

Abstract

本発明は、少なくとも二つの源信号から形成されている少なくとも二つの出力信号から少なくとも二つの個別信号を算出する方法に関し、少なくとも二つの源信号の出力信号上での混合比率を特定し、混合比率から形成された因子を出力信号に掛け合わせ、出力信号から少なくとも二つの減算信号を算出することにより、各減算信号において一の源信号を消去し、この減算信号をフーリエ変換し、これにより変換信号を生ぜしめ、この変換信号から剰余信号を定め、変換信号と剰余信号とに基づいて少なくとも二つの個別信号を計算し、この個別信号を逆フーリエ変換する。The present invention relates to a method of calculating at least two individual signals from at least two output signals formed from at least two source signals, identifying a mixing ratio on the output signal of at least two source signals, and mixing ratio By multiplying the output signal by the factor formed from the output signal and calculating at least two subtraction signals from the output signal, one source signal is eliminated in each subtraction signal, and this subtraction signal is Fourier transformed, thereby converting the subtraction signal Then, a remainder signal is determined from the converted signal, at least two individual signals are calculated based on the converted signal and the remainder signal, and the individual signal is subjected to inverse Fourier transform.

Description

本発明は、請求項１の前提部分に記載の少なくとも二つの個別信号を算出する方法に関する。 The invention relates to a method for calculating at least two individual signals according to the preamble of claim 1.

二つ以上の音源を録音することでステレオ信号を生成できることが公知である。これには、レベル差や到達時間差が利用される。ステレオ信号を用いることで、立体的な音の感じが生まれることになる。 It is known that a stereo signal can be generated by recording two or more sound sources. For this, a level difference or arrival time difference is used. By using a stereo signal, a three-dimensional sound is born.

公知のステレオ技術は、ＡＢ−法、ＸＹ−法やＭＳ−法の形式のインテンシティ・ステレオ並びにＯＲＴＦ−法やＯＳＳ−法のような折衷型の技術である。 Known stereo techniques are intensity stereo in the form of AB-method, XY-method and MS-method, as well as a compromise technique such as ORTF-method and OSS-method.

近年、二つよりも多くのスピーカを備えた音響システムが、市場において徐々にその数を増してきている。特に、このような音響システムは、例えば自動車の中には既に何年も前から在って、そのような自動車は、側方ドアの中に少なくとも二つのスピーカ、さらに、自動車の車内に二つのスピーカを有している。 In recent years, the number of acoustic systems with more than two speakers has gradually increased in the market. In particular, such an acoustic system has already existed, for example, in cars for many years, and such cars have at least two speakers in the side door and two in the car. Has a speaker.

こういうわけで、ステレオの場合がそうであるように、二つよりも多くの個別信号を使えるようにするためにいくつかの方法が開発されている。この場合、概ね二つの異なるアプローチがある。 This is why several methods have been developed to allow more than two separate signals, as is the case with stereo. In this case, there are generally two different approaches.

先ず、多数の出力信号からそれぞれの源信号を算出できる方法が存在する。この場合、それぞれの楽器、それぞれの歌声や話し声が、源信号と解される。コーラスの場合や、複数人の歌手もまた、一つの源信号として捉えることができ、例えば２０人の女性歌手がいても、「ソプラノ」は例えば一つの歌声として把握することができる。 First, there is a method capable of calculating each source signal from a large number of output signals. In this case, each instrument, each singing voice and speaking voice is interpreted as a source signal. In the case of chorus, a plurality of singers can also be regarded as one source signal. For example, even if there are 20 female singers, “soprano” can be recognized as one singing voice.

総じて見れば、問題は、ステレオ信号において存在するような、二つの出力信号から、あるいはまた、それより多くの出力信号から、任意の数の源信号を抽出することにある。 Overall, the problem is to extract an arbitrary number of source signals from two output signals, such as present in a stereo signal, or from more output signals.

信号源分離のためのこのような方法は、いわゆる「ブラインド分離」のキャッチフレーズのもとに行われる。この方法には、大きく二種類の方法がある。その方法は、一方では、独立成分分析、主成分分析、最大事後確率法、最尤法等の統計学的な方法を用いて行われる。他方では、再帰最適化法を利用することもまた公知である。公知の方法は、その際、任意の源信号を、より少ない数の出力信号から高い計算コストを払って算出可能にするか、実時間計算のために源信号の数より多いか又は同じ数の出力信号を必要とする。 Such a method for signal source separation is carried out under the so-called “blind separation” catchphrase. There are two main types of this method. On the one hand, the method is performed using statistical methods such as independent component analysis, principal component analysis, maximum posterior probability method, maximum likelihood method and the like. On the other hand, it is also known to use recursive optimization methods. The known methods then allow any source signal to be calculated from a smaller number of output signals at a high computational cost, or more than or equal to the number of source signals for real-time calculations. Requires an output signal.

したがって、本発明の課題は、請求項１の前提部分の特徴を有する方法を発展させ、源信号の数より少ない数の出力信号の場合でも実時間内で信号源分離を可能とすることにある。 It is therefore an object of the present invention to develop a method having the features of the premise of claim 1 and to enable signal source separation in real time even in the case of fewer output signals than the number of source signals. .

この課題は、請求項１の特徴部分によって解決される。本発明の有利な発展態様は、下位請求項から与えられる。 This problem is solved by the characterizing part of claim 1. Advantageous developments of the invention are given in the subclaims.

本発明の核心は、出力信号から減算信号が算出され、当該減算信号がフーリエ変換され、このようにして得られた変換信号において個別信号が算出される点である。これにより、統計学的な方法及び再帰法は不要とされ、実時間内の個別信号の算出が達成される。ここで、実時間というのは、入って来るデータストリーム（Ｄａｔｅｎｓｔｒｏｍ）が、信号の最初のブロックを計算する際に生じる限界的な時間のずれを有しつつ再生すべき信号が中断なく提供され得ることを意味する。従って、例えば、録音から、或る特定の歌声と例えばピアノとを引き出して、選択すべき混合比率で特定のスピーカに割り当て、その際に出力信号が事前に処理されていなくてもよいようにできる。こうして、例えば、自動車の中でＣＤをセットし、歌および場合によっては引き出すべき信号の配分や引き出すべき信号そのものを決め、遅れを気付かせることなく歌やＣＤの再生を開始できるようになるはずである。 The core of the present invention is that a subtraction signal is calculated from the output signal, the subtraction signal is Fourier-transformed, and an individual signal is calculated in the converted signal thus obtained. This eliminates the need for statistical and recursive methods and achieves the calculation of individual signals within real time. Here, real time means that the signal to be played can be provided without interruption while the incoming data stream has a marginal time lag that occurs when calculating the first block of the signal. Means that. Thus, for example, a specific singing voice and, for example, a piano can be extracted from a recording and assigned to a specific speaker at a mixing ratio to be selected, so that the output signal does not have to be processed in advance. . Thus, for example, you should be able to set a CD in a car, decide the song and, in some cases, the distribution of the signal to be extracted and the signal to be extracted, and start playing the song or CD without noticing the delay. is there.

出力信号上での源信号の混合比率は、方向を特定することで算出することができる。そのために、出力信号はフーリエ変換され、振幅値が計算される。これら振幅値は、一つのヒストグラムにまとめられる。そのために、フーリエ変換された出力信号からポイント対が作られ、その際に、ヒストグラムにおけるポイント対は、得られる角度によってまとめることができる。ポイント対の角度は、このとき、０°から９０°の間の値を取り、次のように求められる。 The mixing ratio of the source signal on the output signal can be calculated by specifying the direction. For this purpose, the output signal is Fourier transformed and an amplitude value is calculated. These amplitude values are collected in one histogram. For this purpose, point pairs are created from the Fourier-transformed output signal, and the point pairs in the histogram can be combined according to the obtained angles. At this time, the angle of the point pair takes a value between 0 ° and 90 °, and is obtained as follows.

ここで、Ｘ^〜 _{１，ｏｒｇ}は、フーリエ変換された第１の出力信号Ｘ_１を、Ｘ^〜 _{２，ｏｒｇ}は、フーリエ変換された第２の出力信号Ｘ_２を示し、ａ_ｘ及びａ_ｙは、ベクトルを示し、このベクトルには大きさが含まれる。綴りのｌは、出力信号Ｘ_１，Ｘ_２ないしフーリエ変換された出力信号Ｘ^〜 _{１，ｏｒｇ}，Ｘ^〜 _{２，ｏｒｇ}の項数インデックスであり、これに基づいてポイント対が突き止められる。フーリエ変換された出力信号Ｘ^〜 _{１，ｏｒｇ}の第１の数値とフーリエ変換された出力信号Ｘ^〜 _{２，ｏｒｇ}の第１の数値はベクトルに他ならないが、これらの第１の数値は、したがってｌ＝１の第１のポイント対を形成する。もちろん、項数インデックスｌは０から始まってもよく、その場合、項数インデックスはＬ−１まで続く。ここで、Ｌは、各出力信号Ｘ_１，Ｘ_２のフーリエ変換されたポイントの数（例えば４０９６点）を示す。θは、０°から９０°の間の角度であり、Ψ_ｌは、インデックスｌを持つポイント対の大きさである。 ^Here, _{X ~ 1, org} is the first output signal _{X 1} which is the Fourier ^transform, _{X ~ 2, org,} the second shows the output signal _{X 2,} which is the Fourier transform, _{a x} and _{a y} are , A vector, which contains the magnitude. L of spelling, the output signal _X 1, _{X 2} or Fourier transformed output signals ^X _~ 1, ^_org, a term index number of _{X ~ 2, org,} point pairs are ascertained on the basis of this. Although the first value of the Fourier transformed output signals X ^{~ _1,} first numerical Fourier transformed output signals X ^{~ _2,} _org the _org is nothing but the vector, these first number, thus l Form a first pair of points = 1. Of course, the term index l may start from 0, in which case the term index continues to L-1. Here, L indicates the number of points (for example, 4096 points) obtained by Fourier transform of the output signals X ₁ and X ₂ . θ is an angle between 0 ° and 90 °, and ψ _l is the size of the point pair with index l.

ベクトルΨ_ｌは、ベクトルａ_ｘ及びａ_ｙの大きさの和を内容とする。 The vector Ψ _l contains the sum of the magnitudes of the vectors a _x and a _y .

フーリエ変換はブロックで実行される。入力信号として、それぞれ２のべき乗、つまり、２を累乗したものが用いられる。１０乗、１１乗又は１２乗、つまり、１０２４、２０４８又は４０９６のデータポイントが特に効率的であることが判明した。特に好ましくは、４０９６のデータポイントであり、その理由は、計算コストを考慮すると計算時間が最適であるからである。 The Fourier transform is performed in blocks. As the input signals, powers of 2, that is, powers of 2 are used. The 10th, 11th, or 12th power, ie, 1024, 2048, or 4096 data points have been found to be particularly efficient. Particularly preferred are 4096 data points because the calculation time is optimal considering the calculation cost.

ヒストグラムの区分けは、好ましくは１°単位で行われる。すなわち、ヒストグラムは９０の区間を含む。角度θの値はこのとき、ヒストグラムの数値範囲を区切るために整数値に丸められる（ｇｅｒｕｎｄｅｔ）。 Histogram segmentation is preferably performed in units of 1 °. That is, the histogram includes 90 intervals. The value of the angle θ is then rounded to an integer value to delimit the numerical range of the histogram.

最後に、ヒストグラムは、最大（複数の最大）が分かりやすくなるように平坦化される。平坦化関数として、次の関数が用いられる。 Finally, the histogram is flattened so that the maximum (maximums) is clear. The following function is used as the flattening function.

パラメータＴは、整数であり、隣接ポイントが平坦化に幾つ組み込まれるかを与える。したがって、複数のデータポイントに亘る平均が行われる。Ｓ隣接ポイントが存在しないようなヒストグラムの端では、補足すべき相応の位置に０の値が用いられる。ヒストグラムの一番目の度数（Ｚａｈｌｅｎｗｅｒｔ）を計算する際には、したがって、片側では０の値を８回適用されるべきで、その一方、他側では既存の度数が用いられる。平坦化されたヒストグラムにおいて、全ての局所的な極値が特定され、度数の高さ、つまりは頻度により仕分けされる。ヒストグラムにおける各所は、上述のとおり一つの角度に対応しており、その結果、各極値に一つの角度が対応する。見出された最大（複数の最大）に対応する角度が特定され、予め設定された数の最大が用いられるか、又は、予め設定された閾値を頻度が超えるもの全ての最大が用いられる。 The parameter T is an integer and gives how many adjacent points are incorporated into the flattening. Thus, an average over multiple data points is performed. At the edge of the histogram where there are no S adjacent points, a value of 0 is used for the corresponding position to be supplemented. When calculating the first frequency (Zahlenwert) of the histogram, therefore, the value of 0 should be applied 8 times on one side, while the existing frequency is used on the other side. In the flattened histogram, all local extrema are identified and sorted by frequency, ie frequency. Each location in the histogram corresponds to one angle as described above, and as a result, one angle corresponds to each extreme value. The angle corresponding to the found maximum (s) is identified and a preset number of maxima is used, or the maxima of all that exceed a preset threshold are used.

平坦化されたヒストグラム‐ベクトルは、こうして次のようになる。 The flattened histogram-vector thus becomes:

ここで、ｉ＝０，…，９０である。ここで、生じ得る問題は、源信号は、出力信号における全ての期間に亘っては現れないということである。例えば、特定の楽器や歌声は休止する。この休止が、角度を特定する際のエラーを引き起こさないように、二つないしそれより多くのヒストグラムを求めた後、並びに、それに対応する角度を特定した後に、次のヒストグラムに、例えばローパスといった重み関数を掛け合わせることができる。ローパスは、次式で表すことができる。 Here, i = 0,..., 90. Here, a possible problem is that the source signal does not appear over the entire period of the output signal. For example, a specific instrument or singing voice is paused. After determining two or more histograms and identifying the corresponding angles, this pause will give the next histogram a weight, for example a low pass, so that this pause does not cause an error in identifying the angles. You can multiply functions. The low pass can be expressed by the following equation.

ここで、ａ＝０．１及びｂ＝０．９である。インデックスｎは、ヒストグラムのインデックスであり、具体的には、最初のヒストグラムないし最初の４０９６データポイントではｎ＝１である。ローパスを用いることで、ヒストグラムｈ_{ｇｌ＿ＴＰ}が得られる。 Here, a = 0.1 and b = 0.9. The index n is the index of the histogram, specifically n = 1 for the first histogram or the first 4096 data points. By using a low-pass, a histogram h _{gl_TP} is obtained.

この重み付けにより、角度の識別が安定化される。 This weighting stabilizes the angle identification.

算出された角度をもとにして二つの源信号の混合比率が決まるが、その場合、角度として、 The mixing ratio of the two source signals is determined based on the calculated angle.

が次式に代入される。 Is substituted into the following equation.

ヒストグラムの最大が、例えば１８°のところにあるとすると、混合比率としてＶ＝０．３２５が得られる。 If the maximum of the histogram is at, for example, 18 °, V = 0.325 is obtained as the mixing ratio.

混合比率をもとに、減算信号が計算され、二つの出力信号の場合には次のように与えられる。 Based on the mixing ratio, a subtraction signal is calculated, and in the case of two output signals, it is given as follows.

ここで、Ｎ＝１，２，…である。Ｎは、フィルターで取り去るべき（ａｕｓｚｕｆｉｌｔｅｒｎｄｅｎ）源信号のインデックスである。 Here, N = 1, 2,. N is the index of the source signal to be filtered out.

混合比率が１より大きいか小さいかに応じて、それに応じた減算信号が計算される。このとき、二つよりも多くの出力信号を用いることもできるが、しかしそれをすると単に計算コストが高くなるという結果になる。ステレオ信号の場合のように二つの出力信号だけでなく、それより多い出力信号が存在する場合、それゆえ好適には、源信号を抽出するために、源信号が最も強く表れているような二つの出力信号が選択される。 Depending on whether the mixing ratio is larger or smaller than 1, a subtraction signal is calculated accordingly. At this time, more than two output signals can be used, but doing so simply results in higher computational costs. If there are not only two output signals, as in the case of a stereo signal, but more output signals, it is therefore preferable that the source signal appears most strongly to extract the source signal. One output signal is selected.

このように算出された減算信号では、いずれにおいても一つの源信号が遮蔽されている。 In any of the subtraction signals calculated in this way, one source signal is shielded.

これらの減算信号は、次にフーリエ変換される。これはブロック式に行われ、その際に、後続するブロックは、常に、先に変換しているデータポイントの半分の大きさに相当する幅を有して始まる。このことはすなわち、一つのブロックの前半部分は、先行するブロックの第二部分として既にフーリエ変換済みであることを意味する。 These subtracted signals are then Fourier transformed. This is done in a block fashion, where subsequent blocks always start with a width corresponding to half the size of the data point being converted earlier. This means that the first half of one block has already been Fourier transformed as the second part of the preceding block.

この方法は、連続的な信号処理を可能とし、重畳加算法の名で知られている。 This method enables continuous signal processing and is known by the name of the superposition addition method.

リーケージ効果を最小化するために、入力ブロックに窓（例えばハニング窓（Ｈａｎｎｉｎｇ−Ｆｅｎｓｔｅｒ））が掛け合わされる。Ｎのデータポイントのための窓関数ｆ（ｎ）は、以下のものである。 In order to minimize the leakage effect, the input block is multiplied by a window (eg, Hanning-Fister). The window function f (n) for N data points is:

フーリエ変換された減算信号Ｘ^〜（以下、「変換信号」と称する。）から、剰余信号が算出される。二つの変換信号Ｘ^〜に対し、剰余信号は、第１の変換信号から第２の変換信号を単純に減算したものとして与えられる。すなわち、 Fourier transformed subtraction signal X ^{~ (hereinafter} referred to as "conversion signal".) From the remainder signal is calculated. To two converted signals X ^~, remainder signal is provided as from the first converted signal simply by subtracting the second converted signal. That is,

以下においては、引き続き二つの変換信号Ｘ^〜 _１，Ｘ^〜 _２と一つの剰余信号Ｘ^〜 _３からスタートして、これが全部で三つの信号を生じさせる。そのようなわけで、一つの個別信号を抽出するために、これら三つの信号が互いに比較される。 In the following, starting from two conversion signals X ^~ ₁ , X ^~ ₂ and one remainder signal X ^~ ₃ , this gives a total of three signals. That is why these three signals are compared with each other to extract one individual signal.

信号Ｘ^〜 _１，Ｘ^〜 _２，Ｘ^〜 _３の各データポイントについては、通常は、各信号が４０９６のデータポイントを有し、三つの信号の振幅から極値が計算される。 For each data point of the signal ^{_{^{_{^{X ~ 1, X ~ 2,}}}}} X ~ 3, usually, each signal has a data point 4096, extrema amplitude of the three signals is calculated.

３ｘ４０９６のデータポイントを持つアレイから開始される。数字の３が変換信号と剰余信号の数を示し、４０９６が一つのブロック内のフーリエ変換されたデータポイントの数を示している。ベクトルＸ^〜 _１，Ｘ^〜 _２，Ｘ^〜 _３の最初の周波数ビンないし最初のデータポイントに注目すると、比較するための三つの度数が決まる。これら三つの値の最小の値のところで、他の二つの値のうち最大の方の値が設定されるとともに、他の値はゼロに設定される。 Start with an array with 3x4096 data points. The numeral 3 indicates the number of converted signals and remainder signals, and 4096 indicates the number of data points that have undergone Fourier transform in one block. Focusing on the vector _X ^{~ _{_1,}} X ^~ _^2, the first frequency bin to the first data point of the X ^~ _3, determines the three power for comparison. At the minimum of these three values, the maximum value of the other two values is set, and the other values are set to zero.

このことをはっきり説明するために、数値例を示す。 To illustrate this clearly, a numerical example is given.

位置Ｘ^〜 _１の第１の値が５、Ｘ^〜 _２の第１の値が１０、そしてＸ^〜 _３の第１の値が１５とする。すると、５のところに数値１５が設定されるとともに、Ｘ^〜 _２及びＸ^〜 _３の第１の数値が０に設定される。かくして、数値は、列ごとに検討される。４０９６の列と三つの行からなるアレイが得られ、このアレイでは、３分の２の値がゼロである。ゼロに等しくない値は、個別ベクトルに亘って不規則に配分されている。Ｘ^〜 _１及びＸ^〜 _２に対する個別ベクトルは、Ｓ^〜 _１及びＳ^〜 _２と称し、計算処理後の源信号Ｓ_１，Ｓ_２がフーリエ変換されたものである。出来た剰余値信号Ｘ^〜 _３のベクトルは、それ以上は重要でない。 The first value of the positions X ^to ₁ is 5, the first value of X ^to ₂ is 10, and the first value of X ^to ₃ is 15. Then, the numerical value 15 is set at 5 and the first numerical values of X ^to ₂ and X ^to ₃ are set to 0. Thus, numerical values are considered for each column. An array of 4096 columns and three rows is obtained, in which two-thirds of the values are zero. Values that are not equal to zero are randomly distributed across the individual vectors. Individual vectors for X ^~ ₁ and ^X _{~ 2} ^is referred to as ^S _{~ 1} and ^S _{~ 2,} the source signals _S 1 after the computation, _{S 2} is what is Fourier transform. The resulting vector of remainder value signals X ^- ₃ is not important any more.

Ｓ^〜 _１及びＳ^〜 _２の算出は、次の式により与えられている。 The calculation of S ^~ ₁ and S ^~ ₂ is given by the following equation.

ここで、ｋは、データポイントないし周波数ビンの項数インデックスであり、データポイントの場合では値が１から２０４８に及ぶ。周波数ビンの半分しか及ばないはずであるのは、対称性の理由からフーリエ変換に伴ってデータポイントが二重に発生しているからである。上記の例ではｋ＝１であった。 Here, k is the number index of data points or frequency bins, and the value ranges from 1 to 2048 in the case of data points. The reason why it should be only half of the frequency bin is that, due to symmetry, the data points are doubled with the Fourier transform. In the above example, k = 1.

このように転換された行（これらは信号Ｘ^〜 _１，Ｘ^〜 _２に対応していたものである。）、つまりは個別信号Ｓ^〜 _１，Ｓ^〜 _２を用いると、これをもとに、計算処理された源信号Ｓ１，Ｓ２を逆フーリエ変換により算出することができる。位相として、信号Ｘ^〜 _１，Ｘ^〜 _２の位相もまたそのまま考慮することができる。Ｓ^〜 _１及びＳ^〜 _２にはしたがって、ベクトルＸ^〜 _１，Ｘ^〜 _２からの各位相が割り当てられる。この割り当ては、当然のことながら項数インデックスｋに基づいて行われる。 Thus transformed row (These were not correspond to the signal ^{_{^{_{X ~ 1, X ~ 2.}}}} ), The original that is, the use of individual signal ^{^S} _{_~} ^1, ^S _~ ^_2, which, The calculated source signals S1, S2 can be calculated by inverse Fourier transform. As the phase, signal ^{^X} _{_~} ^1, ^X _~ ₂ phase can also be considered as it is. The S ^~ ₁ and ^S _{~ 2} therefore assigned each phase from the vector ^{^X} _{_~} ^1, ^X _~ _2. This assignment is naturally performed based on the term index k.

個別信号Ｓ^〜 _１，Ｓ^〜 _２ないしフーリエ変換された個別信号Ｓ_１，Ｓ_２は、計算精度や計算誤差のために源信号とは少し異なったものになり得る。すなわち、源信号を完全に再生することは確かに達成されないものの、しかしその違いは僅かなもので、通常はその違いに気づくことはない程である。 Individual signal S ^{^~ _1,} S ^~ ₂ or Fourier transformed individual signal S _1, S ₂ may look something a little different from the source signal for the calculation accuracy and calculation errors. That is, complete reproduction of the source signal is certainly not achieved, but the difference is subtle and usually not noticeable.

個別信号の分離を改善するために、以下の工程が可能である。 In order to improve the separation of the individual signals, the following steps are possible.

最小を見分けるのに飛びがないようにするために、先に行った最小の見分けに応じて信号Ｘ^〜 _１，Ｘ^〜 _２，Ｘ^〜 _３の最小を確定することが行われてもよい。例えば、条件付きのローパスフィルタ（ｂｅｄｉｎｇｔｅｒＴｉｅｆｐａｓｓｆｉｌｔｅｒ）が用いられてもよい。 In order to avoid jumps in recognizing the minimum, the minimum of the signals X ^to ₁ , X ^to ₂ , and X ^to ₃ may be determined according to the minimum recognizing performed previously. For example, a conditional low-pass filter may be used.

Ｐ^ｔ _ｈｏｌｄ（ｋ）は、周波数ビンｋの注目対象の値であり、このｋはここでもまた項数インデックスである。パラメータｂは、０と１の間で自由に設定可能であり、ｂ＝０のときに周波数の感知は遮断されている。 P ^t _hold (k) is the value of interest of frequency bin k, which is again the term index. The parameter b can be freely set between 0 and 1, and frequency sensing is cut off when b = 0.

Ｓ^〜 _１及びＳ^〜 _２の算出は、次に以下の式により行われる。 Calculation of S ^~ ₁ and S ^~ ₂ is then performed by the following equation.

パラメータη（０≦η≦１）は、ローパスフィルタを通された信号がいかなる強度で個別信号Ｓ^〜 _ｍ，ｍ＝１，２，３，…に入り込むかを与える。 The parameter η (0 ≦ η ≦ 1) gives the intensity at which the signal passed through the low-pass filter enters the individual signals S ^to _m , m = 1, 2, 3,.

さらに、変換信号Ｘ^〜 _ｍ，ｍ＝１，２，４，５，…の最小は、別々に剰余値信号Ｘ^〜 _３と比較することができ、つまりは、最小の特定Ｅ_ｍｉｎ（ｋ）、０又は列の最大値Ｅ_ｍａｘ（ｋ）をあてがあうこと等は、変換信号Ｘ^〜 _ｍと剰余値信号の間でそれぞれ実行される。このようにして、ベクトルＥ_ｍｉｎ１，Ｅ_ｍｉｎ２，Ｅ_ｍａｘ１，Ｅ_ｍａｘ２が算出される。最小値には、因子β（０≦β≦２）が掛け合わされる。因子の選択次第で、異なる効果が生じる。β＜１に関しては、望ましくない周波数は抑えられ、β＞１に関しては、調和した音調形成が得られる。 Further, the minimum of the converted signals X ^to _m , m = 1, 2, 4, 5,... Can be separately compared with the remainder value signals X ^to ₃ , that is, the minimum specific E _min (k), The assignment of 0 or the maximum value E _max (k) of the column is performed between the conversion signals X ^to _m and the remainder value signal, respectively. In this way, the vectors E _min1 , E _min2 , E _max1 , E _max2 are calculated. The minimum value is multiplied by a factor β (0 ≦ β ≦ 2). Depending on the choice of factors, different effects occur. For β <1, undesirable frequencies are suppressed, and for β> 1, harmonious tone formation is obtained.

個別信号は、以下のものとなる。 The individual signals are as follows.

加えて、信号は、位相状態をもとに分離することができる。変換信号Ｘ^〜 _ｍ（ｋ）の振幅が略同じでかつ剰余値信号Ｘ^〜 _３（ｋ）に最小値があるときには、位相が考慮される。これらの位相が同じであると、最大値がＳ^〜 _３（ｋ）に割り当てられ、そうでなければＳ^〜 _１（ｋ）及びＳ^〜 _２（ｋ）に割り当てられる。この考え方は、もちろん、各周波数ビンｋに対し別々に実行可能である。 In addition, the signals can be separated based on the phase state. When the amplitudes of the converted signals X ^to _m (k) are substantially the same and the remainder value signals X ^to ₃ (k) have a minimum value, the phase is considered. If these phases are the same, the maximum value is assigned to S ^~ ₃ (k), otherwise it is assigned to S ^~ ₁ (k) and S ^~ ₂ (k). This concept can of course be implemented separately for each frequency bin k.

以下が適宜成り立つ。 The following holds as appropriate.

加えて、フーリエ変換された出力信号Ｘ^〜 _{ｍ，ｏｒｇ}をさらに考慮することができる。最小値がＸ^〜 _{１，ｏｒｇ}（ｋ）にあるとき、これらが、同じようにＳ^〜 _１（ｋ）に割り振られる。 In addition, the Fourier transformed output signals X ^to _{m, org} can be further taken into account. When the minimum value is in X ^~ _{1, org} (k), these are assigned to S ^~ ₁ (k) in the same way.

本発明は、図面に記載された実施形態に基づいてより詳細に説明される。 The invention is explained in more detail on the basis of the embodiments described in the drawings.

ステレオ信号に関する本願発明による方法のブロック図である。Fig. 2 is a block diagram of a method according to the present invention for a stereo signal. 第二の実施形態における本願発明による方法のブロック図である。It is a block diagram of the method by this invention in 2nd embodiment. 混合比率を特定するためのフロー図である。It is a flowchart for specifying a mixing ratio. 信号源分離のためのフロー図である。It is a flowchart for signal source separation. ステレオ信号を収録するための構成を示す図である。It is a figure which shows the structure for recording a stereo signal. データ対の散布図である。It is a scatter diagram of data pairs. ＦＴデータ対の散布図である。It is a scatter diagram of FT data pairs. ヒストグラムを示す図である。It is a figure which shows a histogram. 重畳加算法を説明するための図である。It is a figure for demonstrating the superposition addition method. 二つの変換信号と一つの剰余信号の３Ｄ振幅スペクトルを示す図である。It is a figure which shows the 3D amplitude spectrum of two conversion signals and one remainder signal. 個別信号の３Ｄ振幅スペクトルを示す図である。It is a figure which shows the 3D amplitude spectrum of an individual signal. 変換信号と個別信号をベクトル形式で示す図である。It is a figure which shows a conversion signal and an individual signal in a vector format.

図５は、ステレオ信号を録音するための構成を示す。単なる一例として、ＸＹ−ステレオ法により源信号を収録するための構成が示されているのであって、基本的には、本発明による方法はしかし他の全てのステレオ技術において使用できるだけでなく、二つよりも多くの出力信号が生成されるような方法においても適用できる。 FIG. 5 shows a configuration for recording a stereo signal. By way of example only, a configuration for recording a source signal by the XY-stereo method is shown, and basically the method according to the invention can be used not only in all other stereo technologies, The present invention can also be applied to a method in which more than two output signals are generated.

図示されているのは四つの信号源であり、これら信号源がそれぞれ一つの源信号１，２，３，４を生成する。源信号１は、例えば単一の歌声、つまり一人の男性ないし女性歌手についてのものであり、源信号２は、大勢の男性ないし女性の歌い手からなるバックコーラスについてのもので、ただしコーラスでは同じ歌詞と同じ楽譜が演じられるものであり、源信号３は、楽器、例えばピアノについてのものであり、源信号４は、一群の楽器についてのもの、とはいえコーラス同様に同じ楽譜が演じられるものである。この場合の一例は、同じ旋律を奏でるバイオリンのユニットでもよい。この例が示しているのは、一つの源信号が、一人の単独の男性歌手や一人の単独の女性歌手から構成できたり、あるいは、単独の楽器から構成できたりするのではなく、歌手や楽器の多数から構成され得るということである。マイクロフォン５，６の広がりのある指向性により、源信号１，２，３，４は異なったレベルで収録され、そのために源信号１，２，３，４の混合比率は常に異なっている。 Shown are four signal sources, which generate one source signal 1, 2, 3, 4 respectively. Source signal 1 is for a single singing voice, for example a single male or female singer, and source signal 2 is for a back chorus consisting of a large number of male or female singers, but with the same lyrics in the chorus. The source signal 3 is for a musical instrument, for example a piano, and the source signal 4 is for a group of instruments, although the same score is played as in the chorus. is there. An example of this case may be a violin unit playing the same melody. This example shows that one source signal can not consist of a single male singer, a single female singer, or a single instrument, but a singer or instrument. It can be composed of a large number of. Due to the wide directivity of the microphones 5, 6, the source signals 1, 2, 3, 4 are recorded at different levels, so that the mixing ratio of the source signals 1, 2, 3, 4 is always different.

源信号１，２，３，４から、かくしてステレオ信号の二つの出力信号が得られる。これらの出力信号が、まさに本願発明による方法の出発点となる信号であり、したがって元の源信号はそれ以上は使用されない。 From the source signals 1, 2, 3, 4 there are thus obtained two output signals of a stereo signal. These output signals are just the starting points for the method according to the invention, so that the original source signal is not used any further.

もちろん、もっと多くの源信号も出力信号Ｘ_ｍを生成するのに用いることができ、本方法を実行するには、有意義には、しかし少なくとも二つの源信号は必要である。 Of course, can be used to generate more source signal also output signals X _m, in carrying out this method, the meaningful, but at least two source signals is required.

以下において、二つの出力信号Ｘ_１，Ｘ_２が何度も引き合いに出される。しかし、本発明による方法は、ステレオ信号に限定されるものではなく、基本的には任意の多数の出力信号Ｘ_ｍ，ｍ＝１，２，３，…に使用できる。 In the following, the two output signals X ₁ and X ₂ are referred to many times. However, the method according to the present invention is not limited to a stereo signal, and can basically be used for any number of output signals X _m , m = 1, 2, 3,.

図１は、本願発明による方法の第一の実施形態によるブロック図を示す。本実施形態においては、二つの出力信号Ｘ_１，Ｘ_２が考慮されて、二つの源信号（例えば源信号１，２）に対する混合比率が特定される。混合比率を特定するための一つの可能な方法は、後に詳述する。源信号１について例えば混合比率Ｖ_１が生じ、源信号２について混合比率Ｖ_２が生じる。出力信号Ｘ_１，Ｘ_２から次に減算信号Ｘ＾_１，Ｘ＾_２が計算され、その際には、出力信号Ｘ_１，Ｘ_２が以下のようにして混合比率に基づき減算される： FIG. 1 shows a block diagram according to a first embodiment of the method according to the invention. In the present embodiment, the two output signals X ₁ and X ₂ are considered, and the mixing ratio for the two source signals (for example, the source signals 1 and 2) is specified. One possible way to specify the mixing ratio is detailed later. For example, a mixing ratio V ₁ occurs for the source signal _{1 and a} mixing ratio V ₂ occurs for the source signal 2. Output signal _X 1, then subtracting the signal from the _{_{_{X 2 X ^ 1, X ^}}} 2 is calculated, at that time, the output signal _X 1, _{X 2} is subtracted based on the to mixing ratio as follows:

ここで、Ｎは源信号の各指標を表す。 Here, N represents each index of the source signal.

このように計算された減算信号Ｘ＾_１，Ｘ＾_２は、重畳加算法によりフーリエ変換される。 The subtraction signals X ^ ₁ and X ^ ₂ calculated in this way are Fourier transformed by the superposition addition method.

出力信号Ｘ_１，Ｘ_２は、もちろんデジタルデータで、相応に数値やデータポイントが単に並んだものに対応する。出力信号は、数値を極めて多数（通常、数万個）持つベクトルとしても表現することができよう。このことは、減算信号に対しても同じように該当する。計算に対する要求を低く抑えるため、そして、とりわけ最初の個別信号が得られるまでの時間を短く保つため、出力信号Ｘ_１，Ｘ_２ないし減算信号は、ブロックにより後続の処理がなされる。フーリエ変換は、例えば減算信号Ｘ＾_１ないしＸ＾_２の最初の４０９６個のデータポイントに対して適用される。好適には、２のべき乗のデータポイントがフーリエ変換される。それは、本例では、高速フーリエ変換（ＦＦＴ）が適用できるからである。４０９６という数は、このとき、消費される計算時間と計算資源を考慮すると、理想的な数である。もっと小さいデータブロック又はもっと大きいデータブロック（例えば、１０２４ないし２０４８個のデータポイントのデータブロック）を採用することもできる。この場合、もちろん連続するデータポイントのことを常に意味する。減算信号Ｘ＾_１，Ｘ＾_２をフーリエ変換することにより、変換信号Ｘ^〜 _１，Ｘ^〜 _２が得られる。変換信号Ｘ^〜 _１から変換信号Ｘ^〜 _２を減算することにより、剰余信号Ｘ^〜 _３が得られる。 The output signals X ₁ and X ₂ are, of course, digital data and correspond to a simple sequence of numerical values and data points. The output signal can also be expressed as a vector having a very large number (usually tens of thousands). The same applies to the subtraction signal. The output signals X ₁ , X ₂ or the subtraction signal are processed further by the block in order to keep the demands on the calculation low and in particular to keep the time until the first individual signal is obtained short. The Fourier transform is applied to the first 4096 data points of the subtraction signals X ₁ to X ₂ , for example. Preferably, a power of 2 data points are Fourier transformed. This is because fast Fourier transform (FFT) can be applied in this example. At this time, the number 4096 is an ideal number in consideration of the calculation time and calculation resources consumed. Smaller data blocks or larger data blocks (eg, data blocks of 1024 to 2048 data points) may be employed. In this case, of course, it always means consecutive data points. By converting the subtracted signals X ^ ₁ , X ^ ₂ by Fourier transform, converted signals X ^- ₁ and X ^- ₂ are obtained. By subtracting the conversion signals X ^to ₂ from the conversion signals X ^to ₁ , the remainder signals X ^to ₃ are obtained.

剰余信号Ｘ^〜 _３は、変換信号Ｘ^〜 _１，Ｘ^〜 _２と同程度に多くのデータポイント（例えば、４０９６個のデータポイント）を含んでいる。これらのデータポイントは周波数ドメインにあるので、通常これらは周波数ビンとも呼ばれる。周波数ビンは、したがって変換信号ないし剰余信号のデータポイントである。具体的な場合では、各変換信号・剰余信号は、したがって４０９６個の周波数ビンを有している。 The remainder signals X ^- ₃ include as many data points (for example, 4096 data points) as the converted signals X ^- ₁ and X ^- ₂ . Since these data points are in the frequency domain, they are usually also called frequency bins. The frequency bin is thus the data point of the transformed signal or residue signal. In a specific case, each converted signal / residue signal thus has 4096 frequency bins.

次に、信号Ｘ^〜 _１，Ｘ^〜 _２，Ｘ^〜 _３の中で最小が探されるが、これは、これらが遮蔽された信号と同じ意味を持つからである。そのために、各周波数ビンに関して、信号Ｘ^〜 _１，Ｘ^〜 _２，Ｘ^〜 _３の振幅から最小が計算され、その最小のところで、それぞれの周波数ビンのうちの最大の値が設定され、これらの周波数ビンについての他の値がゼロに設定される。ゼロとは異なる値に対する位相を得て、かくして個別信号Ｓ^〜 _１，Ｓ^〜 _２を得る。これらの個別信号は、計算処理された源信号Ｓ_１，Ｓ_２を得るために、時間領域に逆変換されなければならない。 Next, the minimum is sought in the signal ^{_{^{_{X ~ 1, X ~ 2,}}}} X ~ 3, This is because with the same meaning as they have been shielded signal. Therefore, for each frequency bin, the minimum calculated from the amplitude of the signal ^{_{^{_{X ~ 1, X ~ 2,}}}} X ~ 3, at its minimum, maximum value among the respective frequency bins are set, these frequencies The other value for the bin is set to zero. The phase for a value different from zero is obtained, thus obtaining the individual signals S ^~ ₁ , S ^~ ₂ . These individual signals must be converted back to the time domain in order to obtain the calculated source signals S ₁ , S ₂ .

図２は、計算された二つの源信号Ｓ_１，Ｓ_２よりも多くの源信号を算出するための実施形態を示す。本実施形態に関しては、出力信号Ｘ_１，Ｘ_２，Ｘ_３，Ｘ_４から二つよりも多くの混合比率が特定され、これにより、より多くの減算信号，それに伴いより多くの変換信号が得られる。このようにして得られた変換信号Ｘ^〜 _１，Ｘ^〜 _２，Ｘ^〜 _４，Ｘ^〜 _５及びこれらから算出される剰余信号Ｘ^〜 _３においてもまた、最小を特定することが行なわれ、それにより個別信号Ｓ^〜 _１，Ｓ^〜 _２，Ｓ^〜 _３，Ｓ^〜 _４が算出され、これらから、計算された源信号Ｓ_１，Ｓ_２，Ｓ_３，Ｓ_４の計算が行なわれる。 FIG. 2 shows an embodiment for calculating more source signals than the two calculated source signals S ₁ , S ₂ . For this embodiment, more than two mixing ratios are specified from the output signals X ₁ , X ₂ , X ₃ , X ₄ , thereby obtaining more subtraction signals and more conversion signals accordingly. It is done. In the converted signals X ₁ ^to ₁ , X ^to ₂ , X ^to ₄ , X ^to ₅ obtained in this way and the remainder signals X ^to ₃ calculated therefrom, the minimum is also specified, thereby individual signal ^{_{^{_{^{_{S ~ 1, S ~ 2,}}}}}} S ~ 3, S ~ 4 are calculated from these, is calculated source signals _{_{_{S 1, S 2, S 3}}} , the calculation of _{S 4} is performed.

以下に、混合比率を特定するための一つの可能な方法を示す。基本的には、混合比率を特定するためにあらゆる方法を用いることができる。 In the following, one possible method for specifying the mixing ratio is shown. Basically, any method can be used to specify the mixing ratio.

図３は、方向の特定に基づいて混合比率を特定するためのフロー図を示す。ここで、ステップＳ１では、出力信号Ｘ_１，Ｘ_２がフーリエ変換され、これにより変換された出力信号Ｘ^〜 _{１，ｏｒｇ}，Ｘ^〜 _{２，ｏｒｇ}が得られる。ステップＳ２では、フーリエ変換された出力信号Ｘ^〜 _{１，ｏｒｇ}，Ｘ^〜 _{２，ｏｒｇ}のそれぞれのデータポイントないし周波数ビンに対して以下の式に基づき振幅値が算出される。 FIG. 3 shows a flow diagram for specifying the mixing ratio based on specifying the direction. Here, in step S1, the output signal _X 1, _{X 2} is the Fourier transform, which output signal is converted by the ^X _~ 1, ^_org, the _{X ~ 2, org} obtained. In step S2, the amplitude value based on the following equation Fourier transformed output signals ^X _~ 1, ^_org, against _{X ~ 2,} each data point or frequency bins _org is calculated.

次のステップＳ３では、先に計算された値から角度θと大きさΨ_ｌの計算が行なわれる。このとき、フーリエ変換された出力信号Ｘ^〜 _{１，ｏｒｇ}，Ｘ^〜 _{２，ｏｒｇ}の大きさの計算は、データポイントにより、あるいは、ベクトルとして行なわれるが、いずれにしても、これから角度θと大きさΨ_ｌの値の対を算出することができる。 In the next step S3, the angle θ and the magnitude Ψ _l are calculated from the previously calculated values. At this time, the Fourier transformed output signals X ^{~ _1,} _org, calculation of _{X ~} ^2, _org of magnitude, by the data points or is carried out as a vector, in any event, and now the angle θ size A pair of values of Ψ _l can be calculated.

ステップＳ４では、θ及びΨ_ｌの値の対からヒストグラムが計算され、このヒストグラムがステップＳ５では平坦化される。 In step S4, a histogram is calculated from the pair of values of θ and Ψ _l and this histogram is flattened in step S5.

ステップＳ６では、予め決めておいた数だけ、あるいは閾値を超える全ての最大を取り上げることで、ヒストグラムの最大をいくつか特定することが行なわれる。 In step S6, several maximum histograms are identified by taking up a maximum number that exceeds a predetermined number or a threshold value.

これらの最大のそれぞれには、上の方で与えられた式により最大の値からそれぞれ計算される一つの角度が割り当てられている。ステップＳ７では、次に、算出された角度に基づいて、求められる混合比率の特定が行なわれる。 Each of these maximums is assigned an angle calculated from the maximum value according to the formula given above. Next, in step S7, the required mixing ratio is specified based on the calculated angle.

図４は、信号源分離のためのフロー図を示す。変換信号Ｘ^〜 _１，Ｘ^〜 _２及び場合によってはさらにＸ^〜 _４及びＸ^〜 _５並びに剰余値信号Ｘ^〜 _３の場合に、それぞれの周波数ビンに対して最小と最大が計算される（ステップＳ８）。他の実施形態においては、さらにフーリエ変換された出力信号Ｘ^〜 _{１，ｏｒｇ}，Ｘ^〜 _{２，ｏｒｇ}及び場合によってはＸ^〜 _{３，ｏｒｇ}及びＸ^〜 _{４，ｏｒｇ}が考慮される。 FIG. 4 shows a flow diagram for signal source separation. When converted signal ^{^X} _{_~} ^1, ^X _~ ₂ and further ^X _{~ 4} and ^X _{~ 5} and the remainder value signals ^X _{~ 3} is optionally, minimum and maximum are computed for each frequency bin (step S8) . In other embodiments, further Fourier transformed output signals ^X _~ 1, ^_org, by _{X ~ 2, org} and optionally ^X _{~ 3, org,} and ^X _{~ 4, org} is considered.

ステップＳ９では、上述したように、各周波数ビンにおいて、最小となるところに、周波数ビンないし各列の最大の値が設定され、他の全ての値には、ゼロ又はホールド値Ｐ^ｔ _ｈｏｌｄが割り当てられる。 In step S9, as described above, the frequency bin or the maximum value of each column is set at the minimum in each frequency bin, and zero or a hold value P ^t _hold is assigned to all other values. It is done.

ステップ１０によれば、個別信号Ｓ^〜 _１，Ｓ^〜 _２，Ｓ^〜 _４，Ｓ^〜 _５のゼロと異なるそれぞれの値に対して、変換信号Ｘ^〜 _１，Ｘ^〜 _２，Ｘ^〜 _４，Ｘ^〜 _５のそれぞれの位相が割り当てられる。 According to step 10, the converted signals X ^to ₁ , X ^to ₂ , X ^to ₄ , and X ^{to the} respective values different from zero of the individual signals S ^to ₁ , S ^to ₂ , S ^to ₄ , and S ^to _5. Each of the _five phases is assigned.

このとき、割り当ては、周波数ビンの番号付けに基づいて行なわれる。個別信号Ｓ^〜 _１の周波数ビン２８がゼロとは異なる値を有するときには、個別信号は変換信号Ｘ^〜 _１の周波数ビン２８の位相を得ることになる。各周波数ビンにおいて最小の値のところに最大の値を移して残りの値をゼロにするため、個別信号のどの値も、割り当てられた変換信号の値と同じにならない。それでも位相は引き継ぐことができる。 At this time, allocation is performed based on frequency bin numbering. When the frequency bin 28 of the individual signal S ^~ ₁ has a value different from zero, the individual signal obtains the phase of the frequency bin 28 of the transformed signal X ^~ ₁ . Since the maximum value is moved to the minimum value in each frequency bin and the remaining values are made zero, no value of the individual signal is the same as the value of the assigned conversion signal. Still the phase can be taken over.

最小ではないところでのゼロへの設定に代えて、ホールド値ηＰ_ｈｏｌｄを設定することもできる。これにより、個別信号に飛んだ箇所がないようにする。 Instead of setting to zero where it is not the minimum, the hold value ηP _hold can also be set. As a result, there is no place where the individual signal is skipped.

図６は、混合状態にある二つの出力信号Ｘ_１，Ｘ_２の振幅の散布図を示し、この例では、出力信号Ｘ_１，Ｘ_２は、互いにずれた正弦波信号である。散布図は、各々が値の対（２項目の値）を表し、このとき値の対は、データポイントの番号、いわばデータ−ビンの番号をもとに形成される。出力信号Ｘ_１のデータポイントの第１のポイントは、第１の振幅を有し、出力信号Ｘ_２の第１のデータポイントは、同じか又は別の振幅を有する。これら二つの振幅は、散布図９に一つの点をプロットするのに用いられる。出力信号Ｘ_１と出力信号Ｘ_２の各データポイントについて複数用いることで、同じだけの数のデータポイント対が生じる。このとき、軸１０に出力信号Ｘ_１の振幅が表示され、軸１１に出力信号Ｘ_２の振幅が表示されている。例えば出力信号Ｘ_１出力信号Ｘ_２上の４０９６のデータポイントを用いることで、４０９６のデータポイント対が得られる。これらは、散布図内にポイントクラウド１２を形成し、このポイントクラウドは、個々のデータポイント対から求められた複数の点１３から形成される。 FIG. 6 shows a scatter diagram of the amplitudes of the two output signals X ₁ and X ₂ in a mixed state. In this example, the output signals X ₁ and X ₂ are sine wave signals that are shifted from each other. Each scatter diagram represents a pair of values (two items of values). At this time, the pair of values is formed based on the number of data points, that is, the number of data-bins. The first point of the data points of the output signal X ₁ has the first amplitude, the first data point of the output signal X ₂ have the same or different amplitudes. These two amplitudes are used to plot a single point in scatter plot 9. By using a plurality of data points for the output signal X ₁ and the output signal X ₂ , the same number of data point pairs are generated. At this time, it displays the amplitude of the output signal X ₁ on shaft 10, the amplitude of the output signal X ₂ on the shaft 11 are displayed. For example, using 4096 data points on output signal X ₁ output signal X ₂ yields 4096 data point pairs. These form a point cloud 12 in the scatter plot, which is formed from a plurality of points 13 determined from individual data point pairs.

図７は、対応して形成された散布図１４を示すが、ただし出力信号Ｘ_１，Ｘ_２は、フーリエ変換された上に、信号の振幅は以下のように計算された。 FIG. 7 shows a correspondingly formed scatter diagram 14 except that the output signals X ₁ and X ₂ were Fourier transformed and the signal amplitudes were calculated as follows:

これに応じて、軸１５には、Ｘ^〜 _{１，ｏｒｇ}の、つまりは出力信号Ｘ１がフーリエ変換されたものの振幅の大きさが表示され、軸１６には、Ｘ^〜 _{２，ｏｒｇ}の振幅の大きさが表示されている。直線１７ないし１８の傾きを計算することにより、それぞれについて混合比率が得られる。 Accordingly, the shaft ^15, the _{X ~ 1, org,} that is, displays the magnitude of the amplitude but the output signal X1 is Fourier transform, the axis ^16, the amplitude of the _{X ~ 2, org} size Is displayed. By calculating the slope of the straight lines 17 to 18, the mixing ratio is obtained for each.

このように単純なやり方で、実際の収録においてこうは完全でない正弦波信号のときだけはしかし混合比率が算出できる。散布図を作成して得られた直線の傾きを求める代わりに、出力信号Ｘ１，Ｘ２をフーリエ変換した後で、フーリエ変換された出力信号Ｘ^〜 _{１，ｏｒｇ}，Ｘ^〜 _{２，ｏｒｇ}のポイント対から、それぞれ角度θとベクトルΨが計算される。散布図の代わりに、角度θとベクトルΨからなる対からヒストグラムが求められ、そのヒストグラムで角度は整数値に縮められ、そこから角度における対応する頻度が算出される。各ポイント対から、ヒストグラムに関する個々の頻度が生じる。 In this simple manner, the mixing ratio can be calculated only when this is not a perfect sine wave signal in actual recording. Instead of obtaining the inclination of the obtained straight line to create a scatter plot, the output signals X1, X2 after Fourier transform, the Fourier transformed output signals X ^{~ _1,} _org, from _{X ~} ^2, _org points pairs , Respectively, an angle θ and a vector ψ are calculated. Instead of a scatter diagram, a histogram is obtained from a pair consisting of an angle θ and a vector Ψ, and the angle is reduced to an integer value in the histogram, and a corresponding frequency in the angle is calculated therefrom. From each point pair, an individual frequency for the histogram results.

このヒストグラムは、平坦化関数を用いて平坦化されることで、局所的な最小の値及び最大の値の数が減らされる。平坦化関数としては、例えば、既に上述した以下の関数を当てることができる。 The histogram is flattened using a flattening function to reduce the number of local minimum and maximum values. As the flattening function, for example, the following function described above can be applied.

このように平坦化されたヒストグラムが図８に示されている。軸１９には角度が度により示され、つまりは０°から９０°まで表示され、他方、軸２０には、それぞれ頻度が示されている。ヒストグラム２１は、絶対的な真の最大２２と、複数の局所的な最大２３，２４及び２５を有する。 A histogram flattened in this way is shown in FIG. The angle is shown on the axis 19 in degrees, ie from 0 ° to 90 °, while the frequency is shown on the axis 20 respectively. The histogram 21 has an absolute true maximum 22 and a plurality of local maximums 23, 24 and 25.

頻度の数にしたがって並べると、最大２２が最も大きな値を有し、その後に最大２５，２４及び２３が続く。これらの最大の中から、最も頻度の大きい最大について頭の方から予め決めた数だけ選択され、例えば二つの最大、最大２２及び２５が選択されるか、あるいはまた、頻度か閾値を超えている限り最大を選択し続けることもできる。この閾値を例えば１０に設定すると、その後の計算で最大２２，２５及び２４が該当するが、閾値を下回る最大２３は該当しない。 When ordered according to the number of frequencies, a maximum of 22 has the largest value, followed by a maximum of 25, 24 and 23. Of these maxima, a pre-determined number is selected from the head for the most frequent maxima, for example, two maxima, maxima 22 and 25 are selected, or the frequency or threshold is exceeded. You can continue to select the maximum as long as possible. If this threshold value is set to 10, for example, the maximum 22, 25, and 24 are applicable in the subsequent calculation, but the maximum 23 below the threshold is not applicable.

それぞれの最大には一つの角度がある。最大２２は、例えば１８°であるし、最大２５は、７２°である。 Each maximum has one angle. The maximum 22 is, for example, 18 °, and the maximum 25 is 72 °.

このとき、角度は以下の式により与えられる。 At this time, the angle is given by the following equation.

このようにして特定された角度から、それぞれの角度に対してそれぞれ一つの混合比率が以下の式により得られる。 From the angles thus identified, one mixing ratio for each angle is obtained by the following equation.

Ｎは、引き抜くべき源信号のインデックスである。これにより、方向の情報に基づいた混合比率の特定は終了する。 N is the index of the source signal to be extracted. Thereby, the specification of the mixing ratio based on the direction information is completed.

減算信号Ｘ＾_１，Ｘ＾_２の算出は、ずっと上の方で述べた式により行われる。特に、減算信号Ｘ＾_１，Ｘ＾_２は、ブロックで算出することもできる。したがってデータポイントについて予め決められたセットごとに行なうことができ、例えば出力信号Ｘ_１，Ｘ_２の４０９６のデータポイントのそれぞれが混合比率を用いることで減算信号Ｘ＾_１，Ｘ＾_２に計算される。減算信号Ｘ＾_１，Ｘ＾_２は、図９に示されているような重畳加算法によりフーリエ変換される。 The calculation of the subtraction signals X ^ ₁ and X ^ ₂ is performed by the above-described equation. In particular, the subtraction signals X ^ ₁ and X ^ ₂ can be calculated in blocks. Thus can be performed for each set of predetermined for the data points, for example, each of 4096 data points of the output signals X _1, X ₂ is calculated in the subtraction signal _X ^ _1, X ^ ₂ by using the mixing ratio The The subtraction signals X ^ ₁ and X ^ ₂ are Fourier-transformed by a superposition addition method as shown in FIG.

図示されているのは、それぞれ減算信号Ｘ＾_１ないしは減算信号Ｘ＾_２の２０４８のデータポイントからなるデータブロック２６，２７，２８，２９である。ここで、２０４８という数値は、フーリエ変換に使用されたデータポイントの半分であるところから出てくる。データブロック２６〜２９のデータポイントの数は、したがって、データブロック３０〜３３のデータポイントの数から出てくる。 Shown are data blocks 26, 27, 28, 29 consisting of 2048 data points of subtraction signal X ₁ or subtraction signal X ₂ , respectively. Here, the number 2048 comes from where it is half of the data points used for the Fourier transform. The number of data points in the data blocks 26-29 thus comes from the number of data points in the data blocks 30-33.

データブロック３０は、データブロック２６，２７から構成され、それに対応してデータブロック２６又は２７のデータポイントの２倍の数を有している。このデータブロック３０は、フーリエ変換されるデータブロックである。データブロック３１は、データブロック２７，２８からなり、データブロック３２は、データブロック２８とそれに続くデータブロックからなる。データブロック２７は、データブロック３０，３１に含まれ、データブロック２８は、データブロック３１，３２に含まれる。 The data block 30 is composed of data blocks 26 and 27 and has a number corresponding to twice the number of data points of the data block 26 or 27. This data block 30 is a data block to be Fourier transformed. The data block 31 includes data blocks 27 and 28, and the data block 32 includes a data block 28 and a data block subsequent thereto. The data block 27 is included in the data blocks 30 and 31, and the data block 28 is included in the data blocks 31 and 32.

フーリエ変換に先立ち、データブロック３０，３１，３２，３３は、さらにハニング窓が掛け合わされる。これにより、フーリエ変換の際のリーケージ効果を最小限にすることができる。 Prior to the Fourier transform, the data blocks 30, 31, 32 and 33 are further multiplied by a Hanning window. Thereby, the leakage effect in the Fourier transform can be minimized.

図１０は、変換信号Ｘ^〜 _１，Ｘ^〜 _２剰余値信号Ｘ^〜 _３の３Ｄ−振幅スペクトルを示す。軸３５上には、周波数ビンの各番号、つまり信号のベクトルにおける通し番号が付されたポイントが示されている一方、軸３６には大きさが表示されている。軸３７には、それぞれの大きさが表示されている。位置３８には信号｜Ｘ^〜 _２｜が、位置３９には信号｜Ｘ^〜 _３｜が、そして位置４０には信号｜Ｘ^〜 _１｜がある。 FIG. 10 shows a 3D-amplitude spectrum of the converted signals X ^- ₁ and X ^- ₂ residue value signals X ^- ₃ . On the axis 35, each frequency bin number, that is, a point assigned a serial number in the signal vector, is shown, while the axis 36 shows the size. Each size is displayed on the axis 37. Position 38 has signals | X ^to ₂ |, position 39 has signals | X ^to ₃ |, and position 40 has signals | X ^to ₁ |.

図１０に示された信号は、二つの純粋な正弦波信号から求められた。実際の信号では、多かれ少なかれ全ての周波数ビンが一杯になっているが、ただ見やすくするには、正弦波信号が特に適している。図１０に示されていることから直ぐに分かるように、信号｜Ｘ^〜 _１｜，｜Ｘ^〜 _２｜，｜Ｘ^〜 _３｜はそれぞれたったの三つの異なる高さのピークしか持たず、他方それ以外ではただのゼロの状態しか持たない。 The signal shown in FIG. 10 was derived from two pure sine wave signals. In an actual signal, more or less all frequency bins are full, but a sine wave signal is particularly suitable for easy viewing. As can be seen immediately from what is shown in FIG. 10, the signals | X ^to ₁ |, | X ^to ₂ |, | X ^to ₃ | each have only three different height peaks, while the others Then it has only a zero state.

図１１は、個別信号Ｓ^〜 _１，Ｓ^〜 _２剰余信号混合Ｓ^〜 _３の３Ｄ−振幅スペクトルを示す。軸３５は、再び周波数ビンを表示し、軸３６は大きさを表示する。軸４２上には、これに対し、含まれる個別信号が与えられている。位置４３には、個別信号｜Ｓ^〜 _１｜があり、位置４４には、剰余信号｜Ｓ^〜 _３｜があり、そして位置４５には、個別信号｜Ｓ^〜 _２｜がある。個別信号｜Ｓ^〜 _１｜，｜Ｓ^〜 _２｜を逆フーリエ変換できるようにするため、個別信号｜Ｓ^〜 _１｜，｜Ｓ^〜 _２｜のゼロとは異なる位置のそれぞれに、変換信号Ｘ^〜 _１，Ｘ^〜 _２からさらにそれぞれ位相を当てることができる。 Figure 11 shows a 3D- amplitude spectrum of the individual signal ^{^S} _{_~} ^1, ^S _~ ₂ residue signal mixing ^S _{~ 3.} The axis 35 again displays frequency bins and the axis 36 displays size. On the axis 42, the individual signals contained therein are given. The position 43 has an individual signal | S ₁ ^to ₁ |, the position 44 has a remainder signal | S ^to ₃ |, and the position 45 has an individual signal | S ^to ₂ |. Individual signal ^{^|} _S ~ 1 ^{|, |} order to be able to reverse Fourier transform, discrete signal ^{_{^{_{| | S ~ 2 S ~ 1}}}} |, | S ~ 2 | to each different position from the zero, converted signal ^{X ~} Further, a phase can be applied from ₁ and X ^to ₂ , respectively.

図１２は、変換信号Ｘ^〜 _１，Ｘ^〜 _２及び剰余値信号Ｘ^〜 _３から個別信号を取得する点を概略的に示す。ベクトル４６は、この例では、変換信号Ｘ^〜 _１のものをここでも部分的に示しており、これに対応してベクトル４７は、変換信号Ｘ^〜 _２の数値を、ベクトル４８は、剰余信号Ｘ^〜 _３の値を、そしてベクトル４９は、対応する周波数ビンを示している。 FIG. 12 schematically shows that the individual signals are acquired from the converted signals X ^- ₁ and X ^- ₂ and the remainder value signals X ^- ₃ . Vector 46, in this example, converted signal X ^~ is partially shown here ones _1, vector 47 in response to this, the numerical value of the converted signal X ^~ _2, vector 48, the remainder signal X a ^value between ₃ and vector 49, shows the corresponding frequency bins.

これらから、ベクトル５０に示された個別信号Ｓ^〜 _１、ベクトル５１に示された個別信号Ｓ^〜 _２、そしてベクトル５２に示された剰余値信号Ｓ^〜 _３が以下のようにして出てくる。 From these, the individual signals S ₁ ^to ₁ shown in the vector 50, the individual signals S ₂ ^to ₂ shown in the vector 51, and the remainder value signals S ₃ ^to ₃ shown in the vector 52 come out as follows.

周波数ビン１に関して、５及び７の数は１より大きいから最小はベクトル４８にある。周波数ビン１では、対応するベクトル５０，５１，５２は、したがって次の値が設定される。すなわち、最小値１の位置では、最大値７が設定される。この値は、したがってベクトル５２の周波数ビン１に設定されなければならない。周波数ビン１のその余の値、つまりベクトル５０，５１における値は、ゼロに設定される。最小を見ていくのは、こうして列ごとに、データポイントごとないし周波数ビンごとに、行なわれる。これらの表現は同じものである。 For frequency bin 1, the number of 5 and 7 is greater than 1, so the minimum is in vector 48. In frequency bin 1, the corresponding vectors 50, 51, 52 are thus set to the following values: That is, at the position of the minimum value 1, the maximum value 7 is set. This value must therefore be set to frequency bin 1 of vector 52. The remainder of frequency bin 1, ie the values in vectors 50 and 51, are set to zero. Looking at the minimum is thus done column by column, data point or frequency bin. These expressions are the same.

相応して、周波数ビン２，３，４，５，等々に対する列の値が計算される結果、図１２に示された値がそれぞれ出てくる。 Correspondingly, the column values for frequency bins 2, 3, 4, 5, etc. are calculated, resulting in the values shown in FIG.

ベクトル５０には、ベクトル４６に対する値ないし最小値が集められ、ベクトル５１には、ベクトル４７に対するものが、そして、ベクトル５２には、ベクトル４８に対するものが集められる。 Vector 50 collects values or minimums for vector 46, vector 51 collects for vector 47, and vector 52 collects for vector 48.

個別信号Ｓ^〜 _１，Ｓ^〜 _２のベクトル５０，５１に対しては、さらに続けてまだ変換信号Ｘ^〜 _１，Ｘ^〜 _２の位相が割り当てられる。この点を単に一例としてベクトル５０の周波数ビン４について述べる。ベクトル５０の周波数ビン４については、ベクトル４６から周波数ビン４の位相が取得される。これは、ベクトル５０がベクトル４６の最小を形成し、それゆえにそこに数値が存在しているからである。相応して、同じ周波数ビンの位相が受け渡される。 The phases of the converted signals X ^- ₁ and X ^- ₂ are still assigned to the vectors 50 and 51 of the individual signals S ^- ₁ and S ^- ₂ . The frequency bin 4 of the vector 50 will be described by taking this point as an example only. For the frequency bin 4 of the vector 50, the phase of the frequency bin 4 is obtained from the vector 46. This is because vector 50 forms the minimum of vector 46, and therefore there is a numerical value there. Correspondingly, the same frequency bin phase is passed.

図の全ての実施形態において、明示された上述の二つの出力信号、二つの変換信号あるいは個別信号に代えて、二つよりも多くの信号を用いることができ、さらに、明細書冒頭で述べた全ての実施形態も、図に関する記載に鑑み用いることができ、それゆえにこの点は明確には排除されていない。 In all the embodiments of the figure, more than two signals can be used instead of the two output signals, the two converted signals or the individual signals explicitly mentioned above, and further as described at the beginning of the specification. All embodiments can also be used in view of the description with respect to the figures, and thus this point is not explicitly excluded.

１源信号
２源信号
３源信号
４源信号
５マイクロフォン
６マイクロフォン
７混合比率特定
８信号分離
９散布図
１０軸
１１軸
１２ポイントクラウド
１３点
１４散布図
１５軸
１６軸
１７直線
１８直線
１９軸
２０軸
２１ヒストグラム
２２最大
２３最大
２４最大
２５最大
２６データブロック
２７データブロック
２８データブロック
２９データブロック
３０データブロック
３１データブロック
３２データブロック
３３データブロック
３４３Ｄ−振幅スペクトル
３５軸
３６軸
３７軸
３８位置
３９位置
４０位置
４１３Ｄ−振幅スペクトル
４２軸
４３位置
４４位置
４５位置
４６ベクトル
４７ベクトル
４８ベクトル
４９周波数ビン
５０ベクトル
５１ベクトル
５２ベクトル 1 source signal 2 source signal 3 source signal 4 source signal 5 microphone 6 microphone 7 mixing ratio specification 8 signal separation 9 scatter diagram 10 axis 11 axis 12 point cloud 13 point 14 scatter diagram 15 axis 16 axis 17 straight line 18 straight line 19 axis 20 axis 21 Histogram 22 Maximum 23 Maximum 24 Maximum 25 Maximum 26 Data block 27 Data block 28 Data block 29 Data block 30 Data block 31 Data block 32 Data block 33 Data block 34 3D-Amplitude spectrum 35 Axis 36 Axis 37 Axis 38 Position 39 Position 40 Position 41 3D-Amplitude Spectrum 42 Axis 43 Position 44 Position 45 Position 46 Vector 47 Vector 48 Vector 49 Frequency Bin 50 Vector 51 Vector 52 Vector

Claims

At least at least two output signals are formed from two source signal _{_{(1,2,3,4) (X 1, X}} 2, X 3, X 4) at least two separate signals from ^{^_(S ~} 1, _S ~ ₂ , S ^to ₃ , S ^to ₄ )
a) Mixing ratios (V ₁ , V ₂ , V ₃ , V ₄ ) of at least two source signals ( ₁ , ₂ , ₃ , ₄ ) on the output signals (X ₁ , X ₂ , X ₃ , X ₄ ) ) Is identified,
b) The output signal (X ₁ , X ₂ , X ₃ , X ₄ ) is multiplied by the factor formed from the mixing ratio (V ₁ , V ₂ , V ₃ , V ₄ ), and the subtraction signal (X The output signals (X ₁ , X ₂ , X ₃ ) so that one source signal (1, ₂ , ₃ , 4) is eliminated in each of {circumflex over ()} ₁ , X ^ ₂ , X ^ ₃ , X ^ ₄ , X ₄ ), at least two subtraction signals (X ^ ₁ , X ^ ₂ , X ^ ₃ , X ^ ₄ ) are calculated,
c) the subtraction signal _{_{_{(X ^ 1, X ^ 2}}} , X ^ 3, X ^ 4) is converted signals by being Fourier transform ^{_{^{_{^{_{(X ~ 1, X ~ 2}}}}}} , X ~ 4, X ~ 5) occurs ,
d) A remainder signal (X ^- ₃ ) is determined from the converted signals (X ^- ₁ , X ^- ₂ , X ^- ₄ , X ^- ₅ ),
e) the converted signal ^{_{^{_{^{_{(X ~ 1, X ~ 2}}}}}} , X ~ 4, X ~ 5) and the at least two separate signals on the basis of a remainder signal ^{_{^{_{(X ~ 3) (S ~}}}} 1, S ~ 2, S ~ ₃ , S ^~ ₄ ) are calculated,
f) said individual signals ^{_{^{_{^{_{(S ~ 1, S ~ 2}}}}}} , S ~ 3, S ~ 4) is converted by an inverse Fourier transform, to calculate the processed source signal _{_{_{(S 1, S 2, S}}} 3, S 4) A method characterized by being made.

The method of claim 1, wherein
The method is characterized in that the mixing ratio (V ₁ , V ₂ , V ₃ , V ₄ ) is specified by specifying a direction.

The method according to claim 1 or 2,
The specification of the mixing ratio (V ₁ , V ₂ , V ₃ , V ₄ )
Fourier transforming the output signals (X ₁ , X ₂ , X ₃ , X ₄ );
Calculating an amplitude value of the Fourier transformed output signals ^{_{^{_{^{_{(X ~ 1, org, X}}}}}} ~ 2, org, X ~ 3, org, X ~ 4, org),
Obtaining a histogram (21) from the amplitude value, or specifying a slope of a straight line (17, 18) of the amplitude value in the scatter diagram (14);
A method characterized by comprising:

The method of claim 3, wherein
The histogram (21) is multiplied by a flattening function.

The method of claim 4, wherein
As a flattening function, in particular

A hanning window is used.

A method according to any one of claims 3 to 5,
Method according to claim 1, characterized in that the angular interval in the histogram (21) is 1 °.

The method according to any one of claims 3 to 6, wherein
Method according to claim 1, characterized in that a local extreme value of the histogram (21), in particular a local maximum (22, 23, 24, 25) is specified.

The method of claim 7, wherein
A method characterized in that one angle is assigned to each maximum (22, 23, 24, 25) starting from an absolute true maximum (22).

The method of claim 8, wherein
The method according to claim 1, wherein the angle assignment is terminated when a predetermined number is reached or when the angle is less than a threshold value.

The method according to any one of claims 3 to 9,
Method characterized in that the histogram (21) is weighted by a low pass.

The method of claim 10, wherein
The low pass is identified from a histogram previously determined in time.

12. The method according to any one of claims 1 to 11, wherein
Calculated two output signals _(X 1, _{X 2)} the subtraction signal when the _{_{_{(X ^ 1, X ^ 2}}} , X ^ 3, X ^ 4) is, when the mixing ratio is smaller than 1, the mixing ratio The second output signal (X ₂ ) multiplied by (V _N ) is subtracted from the _first output signal (X ₁ ), and otherwise, the first output signal (X ₁ ) A method characterized in that the reciprocal of the mixing ratio (1 / V _N ) is multiplied and subtracted from the second output signal (X ₂ ).

The method according to any one of claims 1 to 12,
Method for the Fourier transform, characterized in that a power of 2 is used for the data points, in particular 1024, 2048 or 4096 data points.

14. A method according to any one of claims 1 to 13,
The method, characterized in that the data points for the Fourier transform are taken into account according to a quasi-continuous and superposition method.

15. A method according to any one of claims 1 to 14,
A remainder value signal (X ^- ₃ ) is defined as a subtraction signal of the conversion signal.

The method according to any one of claims 1 to 15,
At least one transformed signal ^{_{^{_{^{_{(X ~ 1, X ~ 2}}}}}} , X ~ 4, X ~ 5) and / or the individual signals by the specific extreme remainder value signal ^{_{^{_{(X ~ 3) (S ~}}}} 1, S ~ 2, S ^~ ₃ , S ^~ ₄ ) are defined.

17. A method according to any one of claims 1 to 16,
The frequency of the individual signals (S ^~ ₁ , S ^~ ₂ , S ^~ ₃ , S ^~ ₄ ) is identified by specifying a minimum.

The method according to claim 16 or 17, wherein
Within the converted signals (X ^to ₁ , X ^to ₂ , X ^to ₄ , X ^to ₅ ) and the remainder signal (X ^to ₃ ), the individual signals (S ^to ₁ , S ^to ₂ , S ^to ₃ , S) ^~ ₄ ) to generate, for each frequency bin (49), the minimum is found, the maximum value of each frequency bin (49) is set at that position, and the remaining value is set to zero A method characterized by being made.

The method of claim 18, wherein
Phases of the converted signals (X ^- ₁ , X ^- ₂ , X ^- ₄ , X ^- ₅ ) with respect to the amplitude values of the individual signals (S ^- ₁ , S ^- ₂ , S ^- ₃ , S ^- ₄ ) A method characterized in that a value is determined.

20. A method according to any one of claims 1 to 19,
The number of source signals (S ₁ , S ₂ , S ₃ , S ₄ ) to be calculated and / or one or more source signals ( ₁ , ₂ , ₃ , ₄ ) themselves are given in advance. A method characterized by that.