JP2014155075A

JP2014155075A - Sound source separation method, device and program

Info

Publication number: JP2014155075A
Application number: JP2013023869A
Authority: JP
Inventors: Yasushi Honda; 寧本多; Akira Goto; 後藤　　晃; Yoshitaka Murayama; 好孝村山
Original assignee: KYOEI ENGINEERING CORP
Current assignee: KYOEI ENGINEERING CORP
Priority date: 2013-02-08
Filing date: 2013-02-08
Publication date: 2014-08-25
Anticipated expiration: 2033-02-08
Also published as: JP6128547B2

Abstract

PROBLEM TO BE SOLVED: To provide a sound source separation method, device and program capable of emphasizing or suppressing and outputting, with a little computational complexity, sounds coming from any arbitrary direction while using a pair of sound wave signals of stereo microphones which are disposed proximately and without requiring complicated and high-grade analysis.SOLUTION: A composite signal InA(k) and a composite signal InB(k) having an inherent amplitude ratio for each direction are synthesized from a pair of sound wave signals InM1(k), InM2(k) inputted to microphones M1, M2. By emphasizing the amplitude ratio of the pair of composite signals in a specific direction, a filtering process is applied for emphasizing sound waves in the specific direction relatively to sound waves coming from other directions. After the filtering process, a gain is determined in accordance with a differential between waveforms of the pair of sound wave signals, and the sound wave signals are outputted after being multiplied by the gain.

Description

本発明は、音波信号に基づき、任意の方向にある音源に向けて指向性を形成する音源分離方法、装置及びプログラムに関する。 The present invention relates to a sound source separation method, apparatus, and program for forming directivity toward a sound source in an arbitrary direction based on a sound wave signal.

目的音源の音波を精度よく分離し、雑音等の目的外音を抑制するには、一般的に、指向性マイクを用いたり、一定以上の間隔でそれらマイクロフォンを複数並べる必要があった。しかしながら、ＩＣレコーダーのような小型の集音機器では、指向性マイクの搭載や間隔を広くとった複数マイクを用いた集音技術の適用は困難である。また、複数の音源から人工的にダウンミックス処理しておいた収録済みの音声について、これら集音技術を適用することによる精度の良い音源分離は困難である。 In order to accurately separate the sound waves of the target sound source and suppress undesired sounds such as noise, it is generally necessary to use a directional microphone or to arrange a plurality of these microphones at regular intervals. However, in a small sound collecting device such as an IC recorder, it is difficult to apply a sound collecting technique using a directional microphone and a plurality of microphones with wide intervals. In addition, it is difficult to accurately separate sound sources by applying these sound collection techniques to recorded sound that has been artificially downmixed from a plurality of sound sources.

そこで、音波の収録後、各マイクロフォンから出力される信号間の振幅差や位相差を解析し、解析結果に応じた信号処理を施すことで、目的音源を分離抽出する技術が多数提案されている。近年では、統計的解析、周波数解析、複素解析等が持ち出され、入力信号の波形構造の違いを検出し、その検出結果を音源分離処理に生かしている。 Therefore, many techniques have been proposed for separating and extracting the target sound source by analyzing the amplitude difference and phase difference between the signals output from each microphone after sound wave recording and applying signal processing according to the analysis result. . In recent years, statistical analysis, frequency analysis, complex analysis, and the like have been brought out to detect differences in the waveform structure of input signals, and use the detection results for sound source separation processing.

例えば、入力信号を時間軸から周波数軸に変換し、周波数ごとに位相の差分を算出し、この差分を基に目的音源から入力された音波周波数帯を特定し、その周波数帯の音波を強調するように信号処理している（特許文献１参照）。 For example, the input signal is converted from the time axis to the frequency axis, the phase difference is calculated for each frequency, the sound wave frequency band input from the target sound source is identified based on this difference, and the sound wave in that frequency band is emphasized Signal processing is performed as described above (see Patent Document 1).

また、信号処理では、近接配置された２個のマイクロフォンの入力信号に基づいて入力された音波が目的方向にあるかを判断し、２個の入力信号の位相差の差分を補正し、目的方向に存在する音を強調している（特許文献２参照）。２つの入力信号同士を互いに参照させ、得られた信号を利用して逐次フィルタを更新している（特許文献３参照）。 Further, in the signal processing, it is determined whether or not the sound wave input is in the target direction based on the input signals of the two microphones arranged close to each other, the difference in phase difference between the two input signals is corrected, and the target direction is corrected. Is emphasized (see Patent Document 2). Two input signals are referred to each other, and the filter is sequentially updated using the obtained signals (see Patent Document 3).

特開２００７−３１８５２８号公報JP 2007-318528 A 特表２００９−１３５５９３号公報Special table 2009-135593 特開２００９−０２７３８８号公報JP 2009-027388 A

集音機器または集音機器を搭載する機器の小型化は、マイクロフォンの配置間隔の一層の近接化を伴い、そのため、信号間の振幅差や位相差は極小となり、これら振幅差や位相差の明瞭な特定に多大な労力が必要となっている。特に、遠方の音源からの信号においては、振幅差は極小となるため、振幅差や位相差の明確な特定は困難となっている。 The downsizing of sound collecting devices or devices equipped with sound collecting devices is accompanied by a further closer arrangement of microphones, and therefore the amplitude difference and phase difference between signals are minimized, and these amplitude differences and phase differences are clearly defined. A great deal of effort is required to identify these. In particular, in a signal from a distant sound source, the amplitude difference is minimal, so it is difficult to clearly identify the amplitude difference and the phase difference.

近年では、特許文献１乃至３のように、波形構造の周波数解析、複素解析、又は統計的解析を複雑高度化することで、マイクロフォンの近接化に対応している。しかしながら、これら解析の複雑高度化は、周波数領域に変換する場合のフレーム長の長大化、遅延器の多数配置、長いフィルタ長、長いフィルタ係数等を招いてしまう結果となり、演算処理能力の都合上、リアルタイムに指向性を形成することが困難となってきた。演算処理の負荷軽減を図るためにはマイクロフォンの数を増加させればよいが、機器の限られたスペースにより、マイクロフォンの離間距離は更に短くなってしまう。 In recent years, as disclosed in Patent Documents 1 to 3, the frequency analysis, complex analysis, or statistical analysis of the waveform structure is complicated and advanced to cope with the proximity of microphones. However, these complicated sophistication results in an increase in the frame length when converting to the frequency domain, a large number of delay devices, a long filter length, a long filter coefficient, etc. It has become difficult to form directivity in real time. The number of microphones may be increased in order to reduce the processing load, but the distance between the microphones is further shortened due to the limited space of the device.

本発明は、上記のような従来技術の問題点を解決するために成されたものであり、その目的は、任意の方向から到来する音を、複雑高度な解析を必要とせず、少ない演算量で強調又は抑圧して出力することができる音源分離方法、装置及びプログラムを提供することにある。 The present invention has been made to solve the above-described problems of the prior art, and its purpose is to reduce the amount of computation of a sound coming from an arbitrary direction without requiring complex and sophisticated analysis. It is an object to provide a sound source separation method, apparatus, and program that can be emphasized or suppressed with a sound source.

上記の目的を達成するために、実施形態の音源分離方法は、一対の音波信号に対して、特定方向に指向性を形成する音源分離方法であって、任意の数のフィルタと、その後段に設けられた任意の数の加算器とを用い、前記フィルタと加算器とにより、得られた前記一対の音波信号の振幅比を方向毎に固有な値とする振幅比設定ステップと、前記一対の音波信号についてその振幅を一定の割合で変更させるフィルタ処理を施し、当該一定の割合とは、特定の振幅比を同一に変更する振幅変更ステップと、前記振幅変更ステップを経た一対の音波信号の波形間の差分に応じて利得を決定する利得決定ステップと、前記利得を音波信号に乗じて出力する出力ステップと、を備えること、を特徴とする。 In order to achieve the above object, the sound source separation method of the embodiment is a sound source separation method that forms directivity in a specific direction with respect to a pair of sound wave signals, and includes an arbitrary number of filters and subsequent stages. An arbitrary number of adders provided, an amplitude ratio setting step in which the amplitude ratio of the pair of sound wave signals obtained by the filter and the adder is a unique value for each direction; and the pair of pairs Filtering is performed to change the amplitude of the sound wave signal at a constant ratio, and the fixed ratio is an amplitude change step for changing the specific amplitude ratio to be the same and a waveform of a pair of sound wave signals that have undergone the amplitude change step And a gain determining step for determining a gain according to the difference between them, and an output step for multiplying the sound wave signal by the gain and outputting it.

振幅比設定ステップでは、前記一対の音波信号の振幅を、方向によって固有の変化率で変化させると共に、同一方向に由来する振幅を、異なる変化率で変化させても良い。 In the amplitude ratio setting step, the amplitude of the pair of sound wave signals may be changed at a specific change rate depending on the direction, and the amplitude derived from the same direction may be changed at a different change rate.

前記変化率の比を示すポーラパターンは、前記一対の音波信号毎に異なるものであっても良い。 The polar pattern indicating the ratio of the change rates may be different for each pair of sound wave signals.

前記振幅の変化率は、強調または減衰を行う特定方向において、その変化率が、他の方向の振幅率と比較して大きいものであっても良い。 The change rate of the amplitude may be larger in a specific direction in which emphasis or attenuation is performed than the amplitude rate in other directions.

前記利得決定ステップは、前記振幅変更ステップを経た合成信号を、交換回路によって前記一対の合成信号を１サンプル毎に交互に入れ替えることで、一対の交換信号を生成する交換ステップと、前記交換信号の片方に係数ｍを乗じた上で、前記交換信号の誤差信号を生成する生成ステップと、前記誤差信号を含む係数ｍの漸化式を演算して係数ｍを１サンプル毎に更新する更新ステップと、逐次更新された係数ｍを前記一対の合成信号に乗じて出力する出力ステップと、を備えることもできる。 The gain determination step includes: an exchange step for generating a pair of exchange signals by alternately exchanging the pair of synthesized signals for each sample by an exchange circuit using the synthesized signal that has undergone the amplitude changing step; A generation step of generating an error signal of the exchange signal after multiplying one of them by a coefficient m, an update step of calculating a recurrence formula of the coefficient m including the error signal and updating the coefficient m for each sample; An output step of multiplying the pair of combined signals by the sequentially updated coefficient m and outputting the resultant signal.

前記一対の音波信号は、一対のマイクロフォンから入力された一対の音波信号であっても良い。 The pair of sound wave signals may be a pair of sound wave signals input from a pair of microphones.

前記一対の音波信号は、音源からマイクロフォンまでの系を模擬した場合の伝達係数により表わされた系が畳み込まれたものであっても良い。 The pair of sound wave signals may be a convolution of a system represented by a transfer coefficient when a system from a sound source to a microphone is simulated.

また、上記の目的を達成するために、実施形態の音源分離装置は、一対の音波信号に対して、特定方向に指向性を形成する音源分離装置であって、任意の数のフィルタと、その後段に設けられた任意の数の加算器と、前記一対の音波信号についてその振幅を一定の割合で変更させるフィルタ処理を施し、当該一定の割合とは、特定の振幅比を同一に変更する振幅変更部と、前記一対の音波信号の振幅比のうち特定振幅比を同一振幅に変更する振幅変更部と、前記振幅変更ステップを経た一対の音波信号の波形間の差分に応じて利得を決定する利得決定部と、前記利得を音波信号に乗じて出力する出力部と、を備えること、を特徴とする。 In order to achieve the above object, the sound source separation device according to the embodiment is a sound source separation device that forms directivity in a specific direction with respect to a pair of sound wave signals, and includes an arbitrary number of filters, and thereafter An arbitrary number of adders provided in a stage and a filtering process for changing the amplitude of the pair of sound wave signals at a certain ratio, and the certain ratio is an amplitude for changing a specific amplitude ratio to be the same. The gain is determined according to the difference between the changing unit, the amplitude changing unit that changes the specific amplitude ratio among the amplitude ratios of the pair of sound wave signals to the same amplitude, and the waveform of the pair of sound wave signals that have undergone the amplitude changing step. A gain determination unit; and an output unit that multiplies the sound wave signal by the gain and outputs the signal.

さらに、上記の目的を達成するために、実施形態の音源分離プログラムは、一対の音波信号に対して、特定方向に指向性を形成する音源分離プログラムであって、任意の数のフィルタと、その後段に設けられた任意の数の加算器と、を備えるコンピュータを、前記フィルタと加算器とにより、得られた前記一対の音波信号の振幅比を方向毎に固有な値とする振幅比設定手段と、前記一対の音波信号についてその振幅を一定の割合で変更させるフィルタ処理を施し、当該一定の割合とは、特定の振幅比を同一に変更する振幅変更手段と、前記振幅変更ステップを経た一対の音波信号の波形間の差分に応じて利得を決定する利得決定手段と、前記利得を音波信号に乗じて出力する出力手段と、して機能させることを特徴とする。 Furthermore, in order to achieve the above-described object, the sound source separation program according to the embodiment is a sound source separation program that forms directivity in a specific direction with respect to a pair of sound wave signals. An amplitude ratio setting means for setting a computer having an arbitrary number of adders provided in a stage to a unique value for each direction by using the filter and the adder, And applying a filtering process for changing the amplitude of the pair of sound wave signals at a constant ratio, the fixed ratio being an amplitude changing means for changing a specific amplitude ratio to be the same, and a pair having undergone the amplitude changing step. And a gain determining means for determining a gain according to a difference between waveforms of the sound wave signal and an output means for multiplying the sound wave signal by the gain and outputting the gain.

本発明によれば、信号間の振幅や差分を特定する煩雑な解析を必要とせずに、演算数を大幅に削減しながらも、任意の方向に対する指向性を精度よく強調できる。 According to the present invention, the directivity in an arbitrary direction can be accurately emphasized while greatly reducing the number of operations without requiring a complicated analysis for specifying the amplitude and difference between signals.

本発明の実施形態に係る音源分離装置の構成を示すブロック図である。It is a block diagram which shows the structure of the sound source separation apparatus which concerns on embodiment of this invention. 係数決定回路の一例を示すブロック図である。It is a block diagram which shows an example of a coefficient determination circuit. 係数更新回路の一例を示すブロック図である。It is a block diagram which shows an example of a coefficient update circuit. 音源とマイクロフォンＭ１、Ｍ２との関係を示すモデルである。It is a model which shows the relationship between a sound source and microphones M1 and M2. 音波信号ＩｎＭ１（ｋ）、ＩｎＭ２（ｋ）の指向性を示すポーラプロットである。It is a polar plot which shows the directivity of acoustic signal InM1 (k) and InM2 (k). 音波信号ＩｎＭ１（ｋ）、ＩｎＭ２（ｋ）の２７０ｄｅｇ〜９０ｄｅｇの間の振幅比を示すグラフである。It is a graph which shows the amplitude ratio between 270 deg-90 deg of sound wave signal InM1 (k) and InM2 (k). 合成信号ＩｎＡ（ｋ）、ＩｎＢ（ｋ）のポーラプロットである。It is a polar plot of synthetic | combination signal InA (k) and InB (k). 合成信号ＩｎＡ（ｋ）、ＩｎＢ（ｋ）の２７０ｄｅｇ〜９０ｄｅｇの間の振幅比を示すグラフである。It is a graph which shows the amplitude ratio between 270deg-90deg of synthetic | combination signal InA (k) and InB (k). 合成信号ＩｎＡ（ｋ）、ＩｎＢ（ｋ）の３０ｄｅｇの振幅比を１にシフトさせたグラフである。It is the graph which shifted the amplitude ratio of 30deg of synthetic | combination signal InA (k) and InB (k) to 1. FIG. 音波信号ＩｎＭ１（ｋ）、ＩｎＭ２（ｋ）の振幅を５０ｄｅｇ方向で差分を無くすように調整したポーラプロットである。It is a polar plot in which the amplitudes of the sound wave signals InM1 (k) and InM2 (k) are adjusted to eliminate the difference in the 50 deg direction. 合成信号ＩｎＡ（ｋ）、ＩｎＢ（ｋ）の振幅を５０ｄｅｇ方向で差分を無くすように調整したポーラプロットである。It is a polar plot in which the amplitudes of the combined signals InA (k) and InB (k) are adjusted to eliminate the difference in the 50 deg direction. 振幅比設定部がない場合の２７０ｄｅｇ〜９０ｄｅｇに応じた係数ｍ（ｋ）を示すグラフである。It is a graph which shows coefficient m (k) according to 270deg-90deg when there is no amplitude ratio setting part. 振幅比設定部がある場合の２７０ｄｅｇ〜９０ｄｅｇに応じた係数ｍ（ｋ）について、０から１となるよう正規化した（ｍ（ｋ）＋１）／２を示すグラフである。It is a graph which shows (m (k) +1) / 2 normalized so that it may become 0 to 1 about coefficient m (k) according to 270deg-90deg when there is an amplitude ratio setting part. 交換回路の有無に応じた係数ｍ（ｋ）の収束速度を示すグラフである。It is a graph which shows the convergence speed of the coefficient m (k) according to the presence or absence of an exchange circuit. その他の実施形態に係る音源分離装置の構成を示すブロック図である。It is a block diagram which shows the structure of the sound source separation apparatus which concerns on other embodiment. その他の実施形態に係る音源分離装置の構成を示すブロック図である。It is a block diagram which shows the structure of the sound source separation apparatus which concerns on other embodiment.

以下、本発明に係る音源分離方法及び装置の実施形態について図面を参照しつつ詳細に説明する。 Hereinafter, embodiments of a sound source separation method and apparatus according to the present invention will be described in detail with reference to the drawings.

（第１の実施形態）
（構成） (First embodiment)
(Constitution)

図１は、音源分離装置の構成を示すブロック図である。図１に示すように、音源分離装置は、離間配置される一対のマイクロフォンＭ１、Ｍ２に接続されている。音源分離装置には、マイクロフォンＭ１、Ｍ２から音波信号ＩｎＭ１（ｋ）と音波信号ＩｎＭ２（ｋ）が入力され、音波信号ＩｎＭ１（ｋ）と音波信号ＩｎＭ２（ｋ）を信号処理して方向毎に固有の振幅比を有する一対の合成信号に合成される。音源分離装置は、一対の合成信号の特定方向の振幅比を強調することで、特定方向の音波を他方向から到来する音波と比して相対的に強調する。方向とは、マイクロフォンＭ１、Ｍ２との中点を基準とし、その中点に向かう音波の到来方向、言い換えると音源が配置される方向である。特定方向とは、目的音源の方向である。 FIG. 1 is a block diagram illustrating a configuration of a sound source separation device. As shown in FIG. 1, the sound source separation device is connected to a pair of microphones M1 and M2 that are spaced apart. The sound source separation device receives the sound wave signal InM1 (k) and the sound wave signal InM2 (k) from the microphones M1 and M2, and performs signal processing on the sound wave signal InM1 (k) and the sound wave signal InM2 (k) to be specific to each direction. Are combined into a pair of combined signals having an amplitude ratio of The sound source separation device emphasizes the relative amplitude of the sound wave in the specific direction as compared with the sound wave coming from the other direction by enhancing the amplitude ratio of the pair of synthesized signals in the specific direction. The direction is the arrival direction of the sound wave toward the midpoint with respect to the midpoint of the microphones M1 and M2, in other words, the direction in which the sound source is arranged. The specific direction is the direction of the target sound source.

この音源分離装置には、マイクロフォンＭ１、Ｍ２の後段に、振幅比設定部１が設けられる。この振幅比設定部１は、音波信号ＩｎＭ１（ｋ）と音波信号ＩｎＭ２（ｋ）とから合成信号ＩｎＡ（ｋ）と合成信号ＩｎＢ（ｋ）とを生成する。合成信号ＩｎＡ（ｋ）は、音波信号ＩｎＭ１（ｋ）の振幅を変化させた信号であり、合成信号ＩｎＢ（ｋ）は、音波信号ＩｎＭ２（ｋ）の振幅を変化させた信号である。この振幅比設定部１は、合成信号ＩｎＡ（ｋ）と合成信号ＩｎＢ（ｋ）が方向によって固有の振幅比を持つように、音波信号ＩｎＭ１（ｋ）と音波信号ＩｎＭ２（ｋ）とをそれぞれ信号処理する。
すなわち、振幅比設定部１は、音波信号ＩｎＭ１（ｋ）と音波信号ＩｎＭ２（ｋ）の振幅を、方向によって固有の変化率で変化させる。さらに、振幅比設定部１は、同一方向に由来する音波信号ＩｎＭ１（ｋ）と音波信号ＩｎＭ２（ｋ）の振幅を、異なる変化率で変化させる。換言すると、振幅比設定部１は、音波信号ＩｎＭ１（ｋ）の振幅変化率を規定するポーラパターンと、音波信号ＩｎＭ２（ｋ）の振幅変化率を規定するポーラパターンを有する。両ポーラパターンは、同一方向において規定する振幅変化率が異なる。さらに、両ポーラパターンが規定する振幅変化率の比は、方向毎に固有となっている。
このポーラパターンは、音波信号ＩｎＭ１（ｋ）と音波信号ＩｎＭ２（ｋ）を、フィルタＨ１〜Ｈ４と加算器１ａ、１ｂを通すことにより実現する。 In the sound source separation device, an amplitude ratio setting unit 1 is provided at the subsequent stage of the microphones M1 and M2. The amplitude ratio setting unit 1 generates a combined signal InA (k) and a combined signal InB (k) from the sound wave signal InM1 (k) and the sound wave signal InM2 (k). The synthesized signal InA (k) is a signal obtained by changing the amplitude of the sound wave signal InM1 (k), and the synthesized signal InB (k) is a signal obtained by changing the amplitude of the sound wave signal InM2 (k). The amplitude ratio setting unit 1 outputs the sound wave signal InM1 (k) and the sound wave signal InM2 (k) so that the combined signal InA (k) and the combined signal InB (k) have specific amplitude ratios depending on directions. To process.
That is, the amplitude ratio setting unit 1 changes the amplitudes of the sound wave signal InM1 (k) and the sound wave signal InM2 (k) at a specific change rate depending on the direction. Furthermore, the amplitude ratio setting unit 1 changes the amplitudes of the sound wave signal InM1 (k) and the sound wave signal InM2 (k) derived from the same direction at different change rates. In other words, the amplitude ratio setting unit 1 has a polar pattern that defines the amplitude change rate of the sound wave signal InM1 (k) and a polar pattern that defines the amplitude change rate of the sound wave signal InM2 (k). Both polar patterns differ in amplitude change rate defined in the same direction. Furthermore, the ratio of the amplitude change rate defined by both polar patterns is unique for each direction.
This polar pattern is realized by passing the sound wave signal InM1 (k) and the sound wave signal InM2 (k) through the filters H1 to H4 and the adders 1a and 1b.

振幅比設定部１は、合成信号ＩｎＡ（ｋ）と合成信号ＩｎＢ（ｋ）とを合成するために、フィルタＨ１〜Ｈ４と加算器１ａ、１ｂとを備える。フィルタＨ１〜Ｈ４は、伝達関数Ｈ１〜Ｈ４により、音波信号ＩｎＭ１（ｋ）と音波信号ＩｎＭ２（ｋ）をフィルタ処理するフィルタである。フィルタＨ１〜Ｈ４は、例えばＦＩＲフィルタやＩＩＲフィルタ等である。 The amplitude ratio setting unit 1 includes filters H1 to H4 and adders 1a and 1b in order to synthesize the synthesized signal InA (k) and the synthesized signal InB (k). The filters H1 to H4 are filters that filter the sound wave signal InM1 (k) and the sound wave signal InM2 (k) with the transfer functions H1 to H4. The filters H1 to H4 are, for example, FIR filters and IIR filters.

フィルタＨ１〜Ｈ４の伝達関数は、例えば、以下式（１）〜（４）で表される。
Ｈ１＝X ・・・・（１）
Ｈ２＝-(X*C195_2 ) / C195_1・・・・（２）
Ｈ３＝-(X*C105_1 ) / C105_2・・・・（３）
Ｈ４＝X ・・・・（４）
式中Ｘは、任意の一定遅延と振幅特性をもつ信号で、変換の基準となる伝達関数である。
C195_1は、195[deg]方向の音源からＭ１までの伝達関数。
C195_2は、195[deg]方向の音源からＭ２までの伝達関数。
C105_1は、105[deg]方向の音源からＭ１までの伝達関数。
C105_2は、105[deg]方向の音源からＭ２までの伝達関数。 The transfer functions of the filters H1 to H4 are represented by the following expressions (1) to (4), for example.
H1 = X (1)
H2 =-(X * C195_2) / C195_1 (2)
H3 =-(X * C105_1) / C105_2 (3)
H4 = X (4)
In the equation, X is a signal having an arbitrary constant delay and amplitude characteristic, and is a transfer function serving as a reference for conversion.
C195_1 is a transfer function from the sound source in the 195 [deg] direction to M1.
C195_2 is the transfer function from the sound source in the 195 [deg] direction to M2.
C105_1 is a transfer function from the sound source in the 105 [deg] direction to M1.
C105_2 is the transfer function from the sound source in the 105 [deg] direction to M2.

式（１）に示すように、音波信号ＩｎＭ１（ｋ）は、フィルタＨ１を経て、変換の基準となる関数Ｘによりフィルタ処理される。
また、式（３）に示すように、音波信号ＩｎＭ２（ｋ）は、フィルタＨ３を経ることで、音波信号ＩｎＭ１（ｋ）とは、同位相で振幅の正負が反転した関係となる。フィルタＨ３は、１０５ｄｅｇの音源からのマイクＭ１までの伝達関数と、１０５ｄｅｇの音源からのマイクＭ２までの伝達関数の逆数を、変換の基準となる関数Ｘに乗じた値に、さらにマイナスをかけるものである。すなわち、このフィルタＨ３は、１０５ｄｅｇ方向の音源から発信された音波が、マイクロフォンＭ１、Ｍ２に到達したときに生じる信号の相違をマイクロフォンＭ２を基準とした割合を表している。このフィルタＨ３は、この割合をいずれの方向由来の音波信号ＩｎＭ２（ｋ）に対しても乗じる。
そうすると、このフィルタＨ３を通った音波信号ＩｎＭ２（ｋ）は、この音波信号ＩｎＭ２（ｋ）が１０５ｄｅｇ方向の音源由来である場合、音波信号ＩｎＭ１（ｋ）とは、同位相で同振幅を持ちつつも、正負が反転することになる。その他の方向の音源由来である場合には、マイクロフォンＭ１、Ｍ２に達した時に生じる信号の相違とは、ズレた割合が乗じられるため、音波信号ＩｎＭ１（ｋ）と音波信号ＩｎＭ２（ｋ）の振幅が一致しなくなり、そのズレは、１０５ｄｅｇから離れるに従って大きくなっていく。 As shown in the equation (1), the sound wave signal InM1 (k) is filtered by the function X serving as a reference for conversion through the filter H1.
Further, as shown in Expression (3), the sound wave signal InM2 (k) passes through the filter H3, so that the sound wave signal InM1 (k) has a relationship in which the positive and negative amplitudes are inverted in the same phase. The filter H3 multiplies the value obtained by multiplying the transfer function from the 105 deg sound source to the microphone M1 and the reciprocal of the transfer function from the 105 deg sound source to the microphone M2 by the function X serving as a conversion reference. It is. That is, this filter H3 represents the ratio of the difference in the signal generated when the sound wave transmitted from the sound source in the 105 deg direction reaches the microphones M1 and M2, based on the microphone M2. This filter H3 multiplies this ratio to the sound wave signal InM2 (k) derived from any direction.
Then, the sound wave signal InM2 (k) passed through the filter H3 has the same phase and the same amplitude as the sound wave signal InM1 (k) when the sound wave signal InM2 (k) is derived from a sound source in the 105 deg direction. However, the sign is reversed. When the sound source is derived from another direction, the difference between the signals generated when the microphones M1 and M2 are reached is multiplied by the ratio of deviation, so that the amplitude of the sound wave signal InM1 (k) and the sound wave signal InM2 (k) Are not matched, and the deviation increases as the distance from 105 deg increases.

振幅比設定部１では、音波信号ＩｎＭ１（ｋ）をフィルタＨ１でフィルタ処理した信号と、音波信号ＩｎＭ２（ｋ）をフィルタＨ３でフィルタ処理した信号とを加算器１ａで合成し、合成信号ＩｎＡ（ｋ）として出力するように構成される。合成信号ＩｎＡ（ｋ）は、以下の（５）で表される。
合成信号ＩｎＡ（ｋ） = M1*H1 + M2*H3・・・・（５）
音波信号ＩｎＭ１と、音波信号ＩｎＭ２は、フィルタＨ１とフィルタＨ３とにより、同位相で振幅の正負が反転した関係となる。式（５）においては、フィルタＨ１を経た音波信号ＩｎＭ１（ｋ）から、１０５ｄｅｇからのズレ量に応じた割合で振幅を変更させた音波信号ＩｎＭ２（ｋ）を減じる。そのため、１０５ｄｅｇにおいては近づけば近づくほど、相殺の程度が大きく、１０５ｄｅｇ方向から離れると、相殺の程度は小さくなる。つまり、１０５ｄｅｇを最低振幅として、１０５ｄｅｇから離れると、振幅を大きくした信号を出力する。 In the amplitude ratio setting unit 1, the signal obtained by filtering the sound wave signal InM1 (k) with the filter H1 and the signal obtained by filtering the sound wave signal InM2 (k) with the filter H3 are combined by the adder 1a, and the combined signal InA ( k). The combined signal InA (k) is expressed by the following (5).
Composite signal InA (k) = M1 * H1 + M2 * H3 (5)
The sound wave signal InM1 and the sound wave signal InM2 have a relationship in which the positive and negative amplitudes are inverted in the same phase by the filter H1 and the filter H3. In Expression (5), the sound wave signal InM2 (k) whose amplitude is changed is subtracted from the sound wave signal InM1 (k) that has passed through the filter H1 at a rate corresponding to the amount of deviation from 105 degrees. For this reason, the closer to 105 deg, the greater the degree of cancellation, and the further away from the 105 deg direction, the smaller the degree of cancellation. In other words, a signal having a larger amplitude is output when the distance from 105 deg is 105 deg as the minimum amplitude.

同様に、式（４）に示すように、音波信号ＩｎＭ２（ｋ）は、フィルタＨ４を経て、変換の基準となる関数Ｘによりフィルタ処理される。また、式（２）に示すように、音波信号ＩｎＭ１（ｋ）は、フィルタＨ２を経ることで、音波信号ＩｎＭ２（ｋ）とは、同位相で振幅の正負が反転した関係となる。フィルタＨ２は、１９５ｄｅｇの音源からのマイクＭ２までの伝達関数と、１９５ｄｅｇの音源からのマイクＭ１までの伝達関数の逆数を、変換の基準となる関数Ｘに乗じた値に、さらにマイナスをかけるものである。すなわち、このフィルタＨ２は、１９５ｄｅｇ方向の音源から発信された音波が、マイクロフォンＭ１、Ｍ２に到達したときに生じる信号の相違をマイクロフォンＭ１を基準とした割合を表している。このフィルタＨ２は、この割合をいずれの方向由来の音波信号ＩｎＭ１（ｋ）に対しても乗じる。 Similarly, as shown in Expression (4), the sound wave signal InM2 (k) passes through the filter H4 and is filtered by the function X serving as a reference for conversion. Further, as shown in Expression (2), the sound wave signal InM1 (k) passes through the filter H2, so that the sound wave signal InM2 (k) has a relationship in which the positive and negative amplitudes are inverted in the same phase. The filter H2 further multiplies the value obtained by multiplying the transfer function from the 195 deg sound source to the microphone M2 and the reciprocal of the transfer function from the 195 deg sound source to the microphone M1 by the function X serving as a conversion reference. It is. That is, this filter H2 represents the ratio of the difference in the signal generated when the sound wave transmitted from the sound source in the 195 deg direction reaches the microphones M1 and M2, based on the microphone M1. The filter H2 multiplies this ratio to the sound wave signal InM1 (k) derived from any direction.

振幅比設定部１では、音波信号ＩｎＭ１（ｋ）をフィルタＨ２でフィルタ処理した信号と、音波信号ＩｎＭ２（ｋ）をフィルタＨ４でフィルタ処理した信号とを加算器１ｂで合成し、合成信号ＩｎＢ（ｋ）として出力するように構成される。合成信号ＩｎＢ（ｋ）は、以下の（６）で表される。
合成信号ＩｎＢ（ｋ） = M1*H2 + M2*H4・・・・（６）
音波信号ＩｎＭ１（ｋ）と、音波信号ＩｎＭ２（ｋ）は、フィルタＨ２とフィルタＨ４とにより、同位相で振幅の正負が反転した関係となる。式（６）においては、フィルタＨ４を経た音波信号ＩｎＭ２（ｋ）から、１９５ｄｅｇからのズレ量に応じた割合で振幅を変更させた音波信号ＩｎＭ１（ｋ）を減じる。そのため、１９５ｄｅｇにおいては近づけば近づくほど、相殺の程度が大きく、１９５ｄｅｇ方向から離れると、相殺の程度は小さくなる。つまり、１９５ｄｅｇを最低振幅として、１９５ｄｅｇから離れると、振幅を大きくした信号を出力する。 In the amplitude ratio setting unit 1, the signal obtained by filtering the sound wave signal InM1 (k) with the filter H2 and the signal obtained by filtering the sound wave signal InM2 (k) with the filter H4 are synthesized by the adder 1b, and the synthesized signal InB ( k). The combined signal InB (k) is expressed by the following (6).
Composite signal InB (k) = M1 * H2 + M2 * H4 (6)
The sound wave signal InM1 (k) and the sound wave signal InM2 (k) have a relationship in which the positive and negative amplitudes are inverted in the same phase by the filter H2 and the filter H4. In Expression (6), the sound wave signal InM1 (k) whose amplitude is changed is subtracted from the sound wave signal InM2 (k) that has passed through the filter H4 at a rate corresponding to the amount of deviation from 195 deg. For this reason, the closer to 195 deg, the greater the degree of cancellation, and the further away from the 195 deg direction, the smaller the degree of cancellation. In other words, a signal with an increased amplitude is output when the amplitude is 195 deg and the distance is away from 195 deg.

振幅比設定部１は、フィルタＨ１〜Ｈ４、及び加算器１ａ、１ｂにより、方向毎に固有な振幅比を有する合成信号ＩｎＡ（ｋ）及び合成信号ＩｎＢ（ｋ）を生成する。 The amplitude ratio setting unit 1 uses the filters H1 to H4 and the adders 1a and 1b to generate a combined signal InA (k) and a combined signal InB (k) having a specific amplitude ratio for each direction.

音源分離装置は、振幅比設定部１の後段に、合成信号ＩｎＡ（ｋ）と合成信号ＩｎＢ（ｋ）における特定方向の振幅比の信号を同一振幅に変更する振幅変更部２を備える。振幅変更部２では、合成信号ＩｎＡ（ｋ）と合成信号ＩｎＢ（ｋ）における特定方向の振幅比を同一振幅に揃える一方、特定方向以外の振幅比については、特定方向から逸れるほど振幅差を付けていく。振幅変更部２は、フィルタＤ１、フィルタＴ１を備える。 The sound source separation device includes an amplitude changing unit 2 that changes the signal of the amplitude ratio in a specific direction in the combined signal InA (k) and the combined signal InB (k) to the same amplitude, following the amplitude ratio setting unit 1. In the amplitude changing unit 2, the amplitude ratios in the specific direction in the combined signal InA (k) and the combined signal InB (k) are set to the same amplitude, while the amplitude ratio other than the specific direction is given an amplitude difference that deviates from the specific direction. To go. The amplitude changing unit 2 includes a filter D1 and a filter T1.

振幅変更部２では、合成信号ＩｎＡ（ｋ）と合成信号ＩｎＢ（ｋ）とをフィルタ処理することにより、合成信号ＩｎＡ（ｋ）と合成信号ＩｎＢ（ｋ）との特定方向の振幅比を同一にする。振幅変更部２は、合成信号ＩｎＡ（ｋ）にフィルタ処理を施すフィルタＤ１と、合成信号ＩｎＢ（ｋ）にフィルタ処理を施すフィルタＴ１とを備える。 The amplitude changing unit 2 filters the combined signal InA (k) and the combined signal InB (k), thereby making the amplitude ratio in a specific direction of the combined signal InA (k) and the combined signal InB (k) the same. To do. The amplitude changing unit 2 includes a filter D1 that performs filter processing on the combined signal InA (k), and a filter T1 that performs filter processing on the combined signal InB (k).

フィルタＤ１、Ｔ１の伝達関数Ｄ１、Ｔ１の一例としては、以下式（７）（８）で表される。
Ｄ１＝ 1 ・・・・（７）
Ｔ１＝ (c11*h2 +
c12*h4)/(c11*h1 + c12*h3)・・・・（８）
ｃ１１は、音源Ｓ１からマイクロフォンＭ１までの伝達関数。
ｃ１２は、音源Ｓ１からマイクロフォンＭ２までの伝達関数。 An example of the transfer functions D1 and T1 of the filters D1 and T1 is expressed by the following equations (7) and (8).
D1 = 1 (7)
T1 = (c11 * h2 +
c12 * h4) / (c11 * h1 + c12 * h3) (8)
c11 is a transfer function from the sound source S1 to the microphone M1.
c12 is a transfer function from the sound source S1 to the microphone M2.

この式（７）（８）を満たす伝達関数Ｄ１、Ｔ１により、合成信号ＩｎＡ（ｋ）と合成信号ＩｎＢ（ｋ）とにおける特定方向の振幅比を同一に変更する。 With the transfer functions D1 and T1 satisfying the expressions (7) and (8), the amplitude ratio in a specific direction in the combined signal InA (k) and the combined signal InB (k) is changed to be the same.

振幅変更部２を経た合成信号ＩｎＡ（ｋ）及び合成信号ＩｎＢ（ｋ）は、係数決定回路３と、合成回路４への経路とに分配される。本実施形態では、係数決定回路３で決定した係数ｍ（ｋ）に応じた利得ｇを合成回路４内で決定する。合成回路４では、その利得により、出力信号ＯｕｔＲ（ｋ）、出力信号ＯｕｔＬ（ｋ）を出力する。 The combined signal InA (k) and combined signal InB (k) that have passed through the amplitude changing unit 2 are distributed to the coefficient determination circuit 3 and the path to the combining circuit 4. In the present embodiment, the gain g corresponding to the coefficient m (k) determined by the coefficient determination circuit 3 is determined in the synthesis circuit 4. The synthesis circuit 4 outputs an output signal OutR (k) and an output signal OutL (k) based on the gain.

係数決定回路３は、図２に示すように、特性補正回路３ａ、交換回路３ｂ、及びに係数更新回路３ｃが直列に接続された回路である。 As shown in FIG. 2, the coefficient determination circuit 3 is a circuit in which a characteristic correction circuit 3a, an exchange circuit 3b, and a coefficient update circuit 3c are connected in series.

特性補正回路３ａは、周波数特性補正フィルタと位相特性補正回路とを有する。周波数特性補正フィルタは、所望周波数帯の音波信号を抽出する。位相特性補正回路は、音波信号ＩｎＭ１（ｋ）と音波信号ＩｎＭ２（ｋ）に対するマイクロフォンＭ１、Ｍ２の音響特性が与える影響を減少させる。 The characteristic correction circuit 3a includes a frequency characteristic correction filter and a phase characteristic correction circuit. The frequency characteristic correction filter extracts a sound wave signal in a desired frequency band. The phase characteristic correction circuit reduces the influence of the acoustic characteristics of the microphones M1 and M2 on the sound wave signal InM1 (k) and the sound wave signal InM2 (k).

交換回路３ｂは、合成信号ＩｎＡ（ｋ）と合成信号ＩｎＢ（ｋ）を１サンプルおきに交互に入れ替えて出力する。すなわち、交換信号ＩｎＣ（ｋ）及び交換信号ＩｎＤ（ｋ）のデータ列は、ｋ＝１、２、３、４・・・において、以下のようになる。
ＩｎＣ（ｋ）＝｛ＩｎＡ（１）ＩｎＢ（２）ＩｎＡ（３）ＩｎＢ（４）・・・｝
ＩｎＤ（ｋ）＝｛ＩｎＢ（１）ＩｎＡ（２）ＩｎＢ（３）ＩｎＡ（４）・・・｝ The exchange circuit 3b alternately outputs the synthesized signal InA (k) and the synthesized signal InB (k) every other sample. That is, the data strings of the exchange signal InC (k) and the exchange signal InD (k) are as follows when k = 1, 2, 3, 4,.
InC (k) = {InA (1) InB (2) InA (3) InB (4) ...}
InD (k) = {InB (1) InA (2) InB (3) InA (4) ...}

交換信号ＩｎＣ（ｋ）及び交換信号ＩｎＤ（ｋ）は、係数更新回路３ｃに入力される。この係数更新回路３ｃは、交換信号ＩｎＣ（ｋ）と交換信号ＩｎＤ（ｋ）との誤差を計算し、誤差に応じた係数ｍ（ｋ）を決定する。また、係数更新回路３ｃは、過去の係数ｍ（ｋ−１）を参照して逐次的に係数ｍ（ｋ）を更新する。 The exchange signal InC (k) and the exchange signal InD (k) are input to the coefficient update circuit 3c. The coefficient update circuit 3c calculates an error between the exchange signal InC (k) and the exchange signal InD (k) and determines a coefficient m (k) corresponding to the error. The coefficient updating circuit 3c sequentially updates the coefficient m (k) with reference to the past coefficient m (k-1).

同着の交換信号ＩｎＡ（ｋ）と交換信号ＩｎＢ（ｋ）の誤差信号ｅ（ｋ）を以下式（９）のように定義する。
An error signal e (k) between the exchange signal InA (k) and the exchange signal InB (k) is defined as the following equation (9).

係数更新回路３ｃは、誤差信号ｅ（ｋ）を係数ｍ（ｋ−１）の関数とし、誤差信号ｅ（ｋ）を含む係数ｍ（ｋ）の隣接二項間漸化式を演算することで、誤差信号ｅ（ｋ）が最小となる係数ｍ（ｋ）を探索する。係数更新回路３ｃは、この演算処理により、合成信号ＩｎＡ（ｋ）と合成信号ＩｎＢ（ｋ）とに特定方向の振幅差が生じていればいるほど、係数ｍ（ｋ）を減少させる方向で更新し、振幅差がなければ係数ｍ（ｋ）を１に近づけて出力する。 The coefficient update circuit 3c uses the error signal e (k) as a function of the coefficient m (k-1) and calculates a recurrence formula between adjacent binomials of the coefficient m (k) including the error signal e (k). The coefficient m (k) that minimizes the error signal e (k) is searched. The coefficient update circuit 3c is updated in a direction to decrease the coefficient m (k) as the amplitude difference in a specific direction is generated between the combined signal InA (k) and the combined signal InB (k) by this arithmetic processing. If there is no amplitude difference, the coefficient m (k) is output close to 1.

係数ｍ（ｋ）は、合成信号ＩｎＡ（ｋ）及び合成信号ＩｎＢ（ｋ）と共に、合成回路４に入力される。合成回路４は、合成信号ＩｎＡ（ｋ）と合成信号ＩｎＢ（ｋ）に対して、任意の比率で係数ｍ（ｋ）を乗じ、任意の比率で足し合わせて、その結果として出力信号ＯｕｔＬ（ｋ）と出力信号ＯｕｔＲ（ｋ）を出力する。 The coefficient m (k) is input to the synthesis circuit 4 together with the synthesis signal InA (k) and the synthesis signal InB (k). The synthesis circuit 4 multiplies the composite signal InA (k) and the composite signal InB (k) by a coefficient m (k) at an arbitrary ratio, adds them at an arbitrary ratio, and as a result, outputs the output signal OutL (k ) And an output signal OutR (k).

係数更新回路３ｃの一例を更に説明する。図３は、係数更新回路３の一例を示すブロック図である。図３に示すように、係数更新回路３ｃは、複数の積算器と加算器から構成され、隣接二項間漸化式を体現した回路であり、過去の係数ｍ（ｋ−１）を参照して係数ｍ（ｋ）を漸次更新するものである。長いタップ数を有する適応フィルタは排除されている。 An example of the coefficient update circuit 3c will be further described. FIG. 3 is a block diagram illustrating an example of the coefficient update circuit 3. As shown in FIG. 3, the coefficient update circuit 3c is composed of a plurality of accumulators and adders, and is a circuit that embodies a recurrence formula between adjacent binomials, and refers to a past coefficient m (k−1). The coefficient m (k) is gradually updated. Adaptive filters with long tap numbers are eliminated.

この係数更新回路３ｃにおいて、交換信号ＩｎＤ（ｋ）を参照信号として用いて誤差信号ｅ（ｋ）を生成する。すなわち、交換信号ＩｎＣ（ｋ）は、積算器５に入力される。積算器５は、交換信号ＩｎＣ（ｋ）に対して１サンプル前の係数ｍ（ｋ−１）の−１倍を掛け合わせる。積算器５の出力側には、加算器６が接続されている。この加算器６には、積算器５から出力された信号と交換信号ＩｎＤ（ｋ）とが入力され、これら信号を加算することで、瞬時誤差信号ｅ（ｋ）を得る。この演算処理による誤差信号ｅ（ｋ）は以下式（１０）の通りである。
In the coefficient update circuit 3c, an error signal e (k) is generated using the exchange signal InD (k) as a reference signal. That is, the exchange signal InC (k) is input to the integrator 5. The accumulator 5 multiplies the exchange signal InC (k) by −1 times the coefficient m (k−1) one sample before. An adder 6 is connected to the output side of the integrator 5. The adder 6 receives the signal output from the integrator 5 and the exchange signal InD (k), and adds these signals to obtain an instantaneous error signal e (k). The error signal e (k) resulting from this arithmetic processing is as shown in the following equation (10).

誤差信号ｅ（ｋ）は、音波信号をμ倍する積算器７に入力される。係数μは、１未満のステップサイズパラメータである。積算器７の出力側には、積算器８が接続される。積算器８には、交換信号ＩｎＣ（ｋ）と積算器を経た信号μｅ（ｋ）とが入力される。この積算器８は、交換信号ＩｎＣ（ｋ）と信号μｅ（ｋ）とを乗じ、以下式（１１）で表される瞬時二乗誤差の微分信号∂Ｅ（ｍ）^２／∂ｍを得る。
The error signal e (k) is input to the integrator 7 that multiplies the sound wave signal by μ. The coefficient μ is a step size parameter of less than 1. An integrator 8 is connected to the output side of the integrator 7. The accumulator 8 receives the exchange signal InC (k) and the signal μe (k) that has passed through the accumulator. The accumulator 8 multiplies the exchange signal InC (k) and the signal μe (k) to obtain a differential signal ∂E (m) ² / ∂m of an instantaneous square error expressed by the following equation (11).

積算器８には加算器９が接続されている。加算器９は、以下式（１２）を演算することで係数ｍ（ｋ）を完成させ、合成信号ＩｎＡ（ｋ）と合成信号ＩｎＢ（ｋ）とから出力信号ＯｕｔＬ（ｋ）とＯｕｔＲ（ｋ）を生成する合成回路４に係数ｍ（ｋ）をセットする。
すなわち、加算器９は微分信号∂Ｅ（ｍ）^２／∂ｍに対して信号β・ｍ（ｋ−１）を加算することで係数ｍ（ｋ）を完成させる。 An adder 9 is connected to the integrator 8. The adder 9 completes the coefficient m (k) by calculating the following equation (12), and outputs the output signals OutL (k) and OutR (k) from the combined signal InA (k) and the combined signal InB (k). The coefficient m (k) is set in the synthesis circuit 4 that generates
That is, the adder 9 completes the coefficient m (k) by adding the signal β · m (k−1) to the differential signal ∂E (m) ² / ∂m.

信号β・ｍ（ｋ−１）は、加算器９の出力側に１サンプル分だけ信号を遅延させる遅延器１０と定数βを積算する積算器１１とが接続されており、１サンプル前の信号処理により更新された係数ｍ（ｋ−１）に対して積算器１１で定数βを乗じることにより生成される。 The signal β · m (k−1) is connected to the output side of the adder 9 by a delay unit 10 that delays the signal by one sample and an accumulator 11 that accumulates a constant β. The coefficient m (k−1) updated by the process is generated by multiplying the constant 11 by the integrator 11.

これにより、係数更新回路３では、以下の漸化式（１３）の演算処理が実現し、係数ｍ（ｋ）を生成され、サンプリング毎に漸次更新していく。
As a result, the coefficient update circuit 3 realizes the calculation process of the following recurrence formula (13), generates the coefficient m (k), and updates it gradually for each sampling.

（作用）
（振幅比設定部の作用）
図４に、音源とマイクロフォンＭ１、Ｍ２との位置関係を示す。この位置関係を示すモデルは、ｘ軸上に原点を中心にして４ｃｍ離したマイクロフォンＭ１、Ｍ２を設置している。原点から１．０ｍの距離に音源Ｓ１を配置している。音源Ｓ１は、ｙ軸正方向を０ｄｅｇ、ｘ軸正方向を９０ｄｅｇとして角度で特定し、本実施形態では、５０ｄｅｇの方向に配置するものとする。 (Function)
(Operation of amplitude ratio setting unit)
FIG. 4 shows the positional relationship between the sound source and the microphones M1 and M2. In the model indicating the positional relationship, microphones M1 and M2 separated by 4 cm from the origin are set on the x-axis. The sound source S1 is arranged at a distance of 1.0 m from the origin. The sound source S1 is specified by an angle with the y-axis positive direction being 0 deg and the x-axis positive direction being 90 deg. In this embodiment, the sound source S1 is arranged in the direction of 50 deg.

音源Ｓ１からの音波信号は、マイクロフォンＭ１、Ｍ２に入力する。このマイクロフォンＭ１、Ｍ２は、無指向性のマイクロフォンである。つまり、図５に示すように、マイクロフォンＭ１、Ｍ２は、円形のポーラパターンを有し、特定方向に対して指向性を有しない。そのため、３６０゜のいずれに存在する音源から、同一の音圧の音波信号を入力した際には、音波信号ＩｎＭ１（ｋ）、ＩｎＭ２（ｋ）の振幅には差が存在しない。 The sound wave signal from the sound source S1 is input to the microphones M1 and M2. The microphones M1 and M2 are omnidirectional microphones. That is, as shown in FIG. 5, the microphones M1 and M2 have a circular polar pattern and do not have directivity with respect to a specific direction. For this reason, when a sound wave signal having the same sound pressure is input from a sound source existing at 360 °, there is no difference in the amplitudes of the sound wave signals InM1 (k) and InM2 (k).

図６は、音波信号ＩｎＭ１（ｋ）、ＩｎＭ２（ｋ）の振幅比を示した図である。音波信号ＩｎＭ１（ｋ）、ＩｎＭ２（ｋ）の振幅にほとんど差がないため、音波信号ＩｎＭ１（ｋ）、ＩｎＭ２（ｋ）の振幅比は、２７０ｄｅｇ〜９０ｄｅｇの範囲において、音波信号ＩｎＭ１（ｋ）と音波信号ＩｎＭ２（ｋ）の振幅の差は、かなり小さくなる。すなわち、音波信号ＩｎＭ１（ｋ）と音波信号ＩｎＭ２（ｋ）の振幅比は、１：１に近づく。 FIG. 6 is a diagram showing the amplitude ratio of the sound wave signals InM1 (k) and InM2 (k). Since there is almost no difference between the amplitudes of the sound wave signals InM1 (k) and InM2 (k), the amplitude ratio of the sound wave signals InM1 (k) and InM2 (k) is in the range of 270 deg to 90 deg with the sound wave signal InM1 (k). The difference in amplitude of the sound wave signal InM2 (k) is considerably small. That is, the amplitude ratio between the sound wave signal InM1 (k) and the sound wave signal InM2 (k) approaches 1: 1.

振幅比設定部１に入力した音波信号ＩｎＭ１（ｋ）、ＩｎＭ２（ｋ）は、フィルタＨ１〜Ｈ４及び加算器により、フィルタリング処理される。図７は、フィルタリング処理され、加算器１ａから出力した合成信号ＩｎＡ（ｋ）、加算器１ｂから出力した合成信号ＩｎＢ（ｋ）とを示した図である。図７に示すように、フィルタリング処理された合成信号ＩｎＡ（ｋ）、合成信号ＩｎＢ（ｋ）は、それぞれ異なるポーラパターンを有する。そのため、合成信号ＩｎＡ（ｋ）、合成信号ＩｎＢ（ｋ）の振幅は、２７０ｄｅｇ〜９０ｄｅｇにおいて異なる。 The sound wave signals InM1 (k) and InM2 (k) input to the amplitude ratio setting unit 1 are filtered by the filters H1 to H4 and the adder. FIG. 7 is a diagram showing the combined signal InA (k) output from the adder 1a after being filtered and the combined signal InB (k) output from the adder 1b. As shown in FIG. 7, the filtered combined signal InA (k) and combined signal InB (k) have different polar patterns. For this reason, the amplitudes of the combined signal InA (k) and the combined signal InB (k) differ between 270 deg and 90 deg.

図８は、合成信号ＩｎＡ（ｋ）と合成信号ＩｎＢ（ｋ）との振幅比を示した図である。図８に示すように、合成信号ＩｎＡ（ｋ）と合成信号ＩｎＢ（ｋ）との振幅比は、２７０ｄｅｇ〜１５ｄｅｇの範囲で１．１〜８．１となる。この範囲において、振幅比は重複することはなく、振幅比は角度毎に固有の値となっている。
同様に、合成信号ＩｎＡ（ｋ）と合成信号ＩｎＢ（ｋ）との振幅比は、１５ｄｅｇ〜９０ｄｅｇの範囲で０．１〜８．１となる。この範囲において、振幅比は重複することはなく、振幅比は角度毎に固有の値となっている。 FIG. 8 is a diagram illustrating the amplitude ratio between the combined signal InA (k) and the combined signal InB (k). As shown in FIG. 8, the amplitude ratio between the combined signal InA (k) and the combined signal InB (k) is 1.1 to 8.1 in the range of 270 deg to 15 deg. In this range, the amplitude ratio does not overlap, and the amplitude ratio is a unique value for each angle.
Similarly, the amplitude ratio between the combined signal InA (k) and the combined signal InB (k) is 0.1 to 8.1 in the range of 15 deg to 90 deg. In this range, the amplitude ratio does not overlap, and the amplitude ratio is a unique value for each angle.

（振幅変更部の作用）
振幅変更部２では、合成信号ＩｎＡ（ｋ）及び合成信号ＩｎＢをフィルタＴ１，Ｄ１でフィルタ処理することにより振幅比を一定の割合で変更させ、特定方向の振幅比の信号を同一にする。例えば、特定方向である５０ｄｅｇ方向の振幅比を１となるように、合成信号ＩｎＢ（ｋ）を処理するフィルタＴ１の値を設定する。
この場合、フィルタＴ１により、５０ｄｅｇ方向の振幅比は１となる。一方、５０ｄｅｇ方向以外の振幅比は、５０ｄｅｇ方向の振幅比と異なるため１にはならず、５０ｄｅｇ方向の振幅比からの差に応じて、１から離れた値となる。図９は、図８の合成信号ＩｎＡ（ｋ）と、合成信号ＩｎＢ（ｋ）の振幅比において、特定方向３０ｄｅｇの振幅比が１となるようにシフトしたグラフである。図９に示すように、１５ｄｅｇ〜９０ｄｅｇの範囲では、特定方向３０ｄｅｇの振幅比を１にすることにより、他の方向の振幅比は、３０ｄｅｇから離れるごとに、１より離れる。 (Operation of amplitude changer)
The amplitude changing unit 2 filters the combined signal InA (k) and the combined signal InB with the filters T1 and D1, thereby changing the amplitude ratio at a constant rate and making the signals of the amplitude ratio in a specific direction the same. For example, the value of the filter T1 for processing the combined signal InB (k) is set so that the amplitude ratio in the 50 deg direction, which is a specific direction, becomes 1.
In this case, the amplitude ratio in the 50 deg direction becomes 1 by the filter T1. On the other hand, the amplitude ratio in the direction other than the 50 deg direction is different from the amplitude ratio in the 50 deg direction and thus does not become 1, and becomes a value away from 1 depending on the difference from the amplitude ratio in the 50 deg direction. FIG. 9 is a graph in which the amplitude ratio of the synthesized signal InA (k) and the synthesized signal InB (k) in FIG. As shown in FIG. 9, in the range of 15 deg to 90 deg, by setting the amplitude ratio in the specific direction 30 deg to 1, the amplitude ratio in the other direction deviates from 1 every time it departs from 30 deg.

図１０は、音波信号ＩｎＭ１（ｋ）、ＩｎＭ２（ｋ）の５０ｄｅｇ方向の振幅差が無くなるように調整した場合の音波信号ＩｎＭ１（ｋ）、ＩｎＭ２（ｋ）のポーラパターンを示した図である。音波信号ＩｎＭ１（ｋ）、ＩｎＭ２（ｋ）のポーラパターンは、５０ｄｅｇの方向で重なり合う。この場合、図１２に示すように係数ｍ（ｋ）は、特定方向である５０ｄｅｇに近ければ近いほど１に近い係数ｍ（ｋ）となる。 FIG. 10 is a diagram illustrating polar patterns of the sound wave signals InM1 (k) and InM2 (k) when the sound wave signals InM1 (k) and InM2 (k) are adjusted so that the amplitude difference in the 50 deg direction is eliminated. The polar patterns of the sound wave signals InM1 (k) and InM2 (k) overlap in the direction of 50 deg. In this case, as shown in FIG. 12, the coefficient m (k) becomes a coefficient m (k) closer to 1 as it is closer to 50 deg which is the specific direction.

一方、図１１は、合成信号ＩｎＡ（ｋ）、ＩｎＢ（ｋ）の５０ｄｅｇ方向の振幅差が無くなるように調整した場合の合成信号ＩｎＡ（ｋ）、ＩｎＢ（ｋ）のポーラパターンを示した図である。合成信号ＩｎＡ（ｋ）、ＩｎＢ（ｋ）のポーラパターンは、５０ｄｅｇの方向で重なり合う。この場合、図１３に示すように係数ｍ（ｋ）は、特定方向である５０ｄｅｇに近ければ近いほど１に近い係数ｍ（ｋ）となる。特に、振幅比設定部１において、各方向の振幅差を強調することにより、５０ｄｅｇ付近と、それ以外で係数ｍ（ｋ）の値に大きな差ができる。つまり、鋭い指向性が得られている。 On the other hand, FIG. 11 is a diagram showing polar patterns of the combined signals InA (k) and InB (k) when adjustment is made so that the amplitude difference in the 50 deg direction of the combined signals InA (k) and InB (k) is eliminated. is there. The polar patterns of the combined signals InA (k) and InB (k) overlap in the direction of 50 deg. In this case, as shown in FIG. 13, the coefficient m (k) becomes a coefficient m (k) that is closer to 1 as it is closer to 50 deg that is the specific direction. In particular, by enhancing the amplitude difference in each direction in the amplitude ratio setting unit 1, a large difference can be made in the value of the coefficient m (k) in the vicinity of 50 deg. That is, sharp directivity is obtained.

これにより、出力信号ＯｕｔＬ（ｋ）と出力信号ＯｕｔＲ（ｋ）には、音源の存在位置が５０ｄｅｇの方向に近ければ近いほど、１に近い係数ｍ（ｋ）によって相対的に強調される利得が与えられる。一方、音源の存在位置が５０ｄｅｇの方向から離れれば離れるほど、１未満の係数ｍ（ｋ）によって相対的に抑制される利得が与えられる。 As a result, the output signal OutL (k) and the output signal OutR (k) have a gain that is relatively emphasized by a coefficient m (k) closer to 1 as the sound source location is closer to the direction of 50 deg. Given. On the other hand, the further away the sound source location is from the direction of 50 deg, the more gain that is relatively suppressed by the coefficient m (k) less than 1.

（交換回路の作用）
次に、交換回路の意義について説明する。交換回路を経ることによって、係数更新回路は、以下の数式（１４）を交互に演算する。
(Operation of exchange circuit)
Next, the significance of the exchange circuit will be described. By passing through the exchange circuit, the coefficient update circuit alternately calculates the following formula (14).

数式（１４）において、信号の二乗の項は、ホワイトノイズ等の無相関成分を時間の経過とともに小さくなるように作用する。一方、その隣接項は、相関係数を逐次的に算出する以下の数式（１５）の分子部分と同等であり、相関成分の影響を係数ｍに反映させていくこととなる。
In Equation (14), the square term of the signal acts to reduce the uncorrelated component such as white noise as time passes. On the other hand, the adjacent term is equivalent to the numerator part of the following formula (15) for sequentially calculating the correlation coefficient, and the influence of the correlation component is reflected on the coefficient m.

つまり、係数更新回路３ｃが合成信号ＩｎＡ（ｋ）に対して合成信号ＩｎＢ（ｋ）を近似させようとしたときには、合成信号ＩｎＡ（ｋ）の無相関成分は増幅方向となり、合成信号ＩｎＢ（ｋ）の無相関成分は抑制方向となる。また、合成信号ＩｎＢ（ｋ）に対して合成信号ＩｎＡ（ｋ）を近似させようとしたときには、合成信号ＩｎＢ（ｋ）の無相関成分は増幅方向となり、合成信号ＩｎＡ（ｋ）の無相関成分は抑制方向となる。 That is, when the coefficient update circuit 3c attempts to approximate the composite signal InB (k) to the composite signal InA (k), the uncorrelated component of the composite signal InA (k) becomes the amplification direction, and the composite signal InB (k ) Of the uncorrelated component is in the suppression direction. Further, when the synthesized signal InA (k) is approximated to the synthesized signal InB (k), the uncorrelated component of the synthesized signal InB (k) becomes the amplification direction, and the uncorrelated component of the synthesized signal InA (k). Is the direction of suppression.

そこで、係数更新回路３ｃの前に交換回路２を設置すると、合成信号ＩｎＡ（ｋ）に対して合成信号ＩｎＢ（ｋ）を近似させて同期加算しようとする働きと、合成信号ＩｎＢ（ｋ）に対して合成信号ＩｎＡ（ｋ）を近似させて同期加算しようとする働きとを交互に繰り返すこととなる。そのため、無相関成分を増幅及び抑制しようとする働きは、交互に打ち消し合うことになり、係数ｍ（ｋ）には相関成分の影響を濃く反映させていくことになる。 Therefore, when the switching circuit 2 is installed in front of the coefficient update circuit 3c, the synthesized signal InA (k) is approximated to the synthesized signal InB (k) to perform synchronous addition, and the synthesized signal InB (k) is added. On the other hand, the function of approximating the synthesized signal InA (k) and attempting to add synchronously is repeated alternately. Therefore, the function of amplifying and suppressing the uncorrelated component cancels out alternately, and the coefficient m (k) reflects the influence of the correlated component deeply.

尚、図１４は、交換回路３ｂがある場合とない場合での係数ｍ（ｋ）の収束状態を示している。両収束状態は、共にセンター位置に音源を置き、マイクロフォンＭ１、Ｍ２で集音したものである。図１４の曲線Ｆが示すように、交換回路３ｂがある場合には約１０００回目に係数ｍ（ｋ）が１に収束したが、曲線Ｇが示すように、交換回路２がない場合には、係数ｍ（ｋ）を１００００回更新しても未だ１に収束することはなく、その開きは１０倍であった。すなわち、交換回路３ｂが存在する場合には、音源分離が速やかに完了することを示している。 FIG. 14 shows the convergence state of the coefficient m (k) with and without the exchange circuit 3b. In both the convergence states, a sound source is placed at the center position, and sounds are collected by the microphones M1 and M2. As shown by the curve F in FIG. 14, the coefficient m (k) converges to 1 about 1000 times when there is the exchange circuit 3b, but when there is no exchange circuit 2 as shown by the curve G, Even if the coefficient m (k) was updated 10,000 times, it still did not converge to 1, and the opening was 10 times. That is, when the exchange circuit 3b exists, it indicates that the sound source separation is completed quickly.

（効果）
以上のように、本実施形態に係る音源分離装置では、マイクロフォンＭ１、Ｍ２に入力される一対の音波信号ＩｎＭ１（ｋ）、ＩｎＭ２（ｋ）から、方向毎に固有な振幅比を有する合成信号ＩｎＡ（ｋ）、及び合成信号ＩｎＢ（ｋ）を合成する。そして、
合成信号ＩｎＡ（ｋ）と合成信号ＩｎＢ（ｋ）における特定方向の振幅比の信号を同一振幅に変更する。この同一振幅比を強調することで、特定方向の音波を他方向から到来する音波と比して相対的に強調するフィルタ処理を施す。そして、フィルタ処理の後、交換回路２に合成信号ＩｎＡ（ｋ）、と合成信号ＩｎＢ（ｋ）を１サンプル毎に交互に入れ替えることで、一対の交換信号ＩｎＣ（ｋ）とＩｎＤ（ｋ）を生成し、この交換信号ＩｎＣ（ｋ）とＩｎＤ（ｋ）の片方に係数ｍを乗じた上で、交換信号ＩｎＣ（ｋ）とＩｎＤ（ｋ）の誤差信号を生成する。更に、誤差信号を含む係数ｍの漸化式を演算して係数ｍを１サンプル毎に更新する。最後に、逐次更新された係数ｍを一対の音波信号に乗じて出力する。 (effect)
As described above, in the sound source separation device according to the present embodiment, the combined signal InA having a unique amplitude ratio for each direction from the pair of sound wave signals InM1 (k) and InM2 (k) input to the microphones M1 and M2. (K) and the synthesized signal InB (k) are synthesized. And
A signal having an amplitude ratio in a specific direction in the combined signal InA (k) and the combined signal InB (k) is changed to the same amplitude. By emphasizing this same amplitude ratio, a filtering process is performed to emphasize the sound waves in a specific direction relative to the sound waves coming from other directions. Then, after the filtering process, the exchange signal 2 is alternately exchanged with the synthesized signal InA (k) and the synthesized signal InB (k) for each sample, whereby a pair of exchange signals InC (k) and InD (k) are obtained. Then, after multiplying one of the exchange signals InC (k) and InD (k) by a coefficient m, an error signal of the exchange signals InC (k) and InD (k) is produced. Further, the recurrence formula of the coefficient m including the error signal is calculated to update the coefficient m for each sample. Finally, the sequentially updated coefficient m is multiplied by a pair of sound wave signals and output.

例えば、１サンプル前に算出された過去の係数ｍの−１倍がセットされた積算器５に交換信号の片方を通し、積算器５を経た後に、一対の交換信号を加算する加算器６を通し、加算器６を経た後に、定数μがセットされた積算器７を通し、積算器７を経た後に、過去の係数ｍが乗算される前の片方の交換信号がセットされた積算器８を通し、積算器８を経た後に、１サンプル前に算出された過去の係数ｍがセットされた加算器９を通すことで、係数ｍを１サンプル毎に更新する。 For example, an adder 6 that adds one pair of exchange signals after passing one of the exchange signals through the integrator 5 in which −1 times the past coefficient m calculated one sample before is set. After passing through the adder 6, it passes through the integrator 7 in which the constant μ is set. After passing through the integrator 7, the integrator 8 in which one exchange signal before being multiplied by the past coefficient m is set. Then, after passing through the integrator 8, the coefficient m is updated for each sample by passing through the adder 9 in which the past coefficient m calculated one sample before is passed.

これにより、本実施形態の音源分離装置は、音源Ｓ１からの音波信号の振幅比に焦点をあて、煩雑な演算を回避でき、音波信号ＩｎＭ１（ｋ）と音波信号ＩｎＭ２（ｋ）との振幅差が極微小であっても、特定方向に指向性を形成することができる。 Thereby, the sound source separation device of the present embodiment focuses on the amplitude ratio of the sound wave signal from the sound source S1, can avoid complicated calculation, and the amplitude difference between the sound wave signal InM1 (k) and the sound wave signal InM2 (k). Even if is very small, directivity can be formed in a specific direction.

更に、その指向性を形成するためにタップ数の多いフィルタ等に依ることはなく、交換回路と漸化式を演算する一つの係数更新回路によって実現できる。従って、演算数を大幅に削減でき、最終的な遅延は数十マイクロ秒〜数ミリ秒以内におさめることが可能となる。 Furthermore, it does not depend on a filter having a large number of taps in order to form the directivity, and can be realized by an exchange circuit and a single coefficient update circuit that calculates a recurrence formula. Therefore, the number of operations can be greatly reduced, and the final delay can be suppressed within several tens of microseconds to several milliseconds.

合成信号ＩｎＡ（ｋ）及び合成信号ＩｎＢ（ｋ）は、方向毎の振幅比を個別な値となるような相関性を有していれば良く、それぞれの合成信号は、任意のポーラパターンを有している。ポーラパターンの一例としては、カーディオイド型、双指向性型、鋭指向性型のポーラパターンが挙げられる。さらに、合成信号ＩｎＡ（ｋ）と合成信号ＩｎＢ（ｋ）のポーラパターンが線対称であっても良い。また、合成信号ＩｎＡ（ｋ）と合成信号ＩｎＢ（ｋ）のポーラパターンが、それぞれ異なる形状を有するものでも良い。 The combined signal InA (k) and the combined signal InB (k) only need to have a correlation such that the amplitude ratio for each direction becomes an individual value, and each combined signal has an arbitrary polar pattern. doing. As an example of the polar pattern, a cardioid type, a bi-directional type, and an acute-directional type polar pattern can be cited. Furthermore, the polar pattern of the combined signal InA (k) and the combined signal InB (k) may be line symmetric. The polar patterns of the synthesized signal InA (k) and the synthesized signal InB (k) may have different shapes.

本実施形態では、図８に示すように、１５ｄｅｇの振幅比が最大となるように、合成信号ＩｎＡ（ｋ）と合成信号ＩｎＢ（ｋ）の振幅を設定したが、振幅比が最大となる角度は任意の方向とすることができる。例えば、特定方向の指向性を高めたい場合は、その方向の振幅比が拡大するように設定することもできる。これにより、特定方向については、特に振幅比が拡大するため、後の利得決定に際して、更に、相対的に強調される利得が与えられる。さらに、振幅比を大きくするだけでなく、特定の角度の振幅比を強調するよう振幅比を設定しても良い。言い換えると、特定方向の振幅比を他の方向の振幅比と区別しやすいものとする。例えば、角度毎の振幅比をグラフにした場合には、特定方向の振幅比をピークとしたグラフとなり、そのピークは急峻となる。このように、特定方向とその他の方向の振幅比の差が大きければ、大きいほど指向性を強くすることができる。 In the present embodiment, as shown in FIG. 8, the amplitudes of the combined signal InA (k) and the combined signal InB (k) are set so that the amplitude ratio of 15 deg is maximized. Can be in any direction. For example, when it is desired to increase the directivity in a specific direction, the amplitude ratio in that direction can be set to increase. As a result, the amplitude ratio is particularly increased in the specific direction, so that a relatively emphasized gain is given when the gain is determined later. Furthermore, the amplitude ratio may be set not only to increase the amplitude ratio but also to emphasize the amplitude ratio at a specific angle. In other words, it is easy to distinguish the amplitude ratio in a specific direction from the amplitude ratio in other directions. For example, when the amplitude ratio for each angle is graphed, the graph has the amplitude ratio in a specific direction as a peak, and the peak is steep. Thus, the greater the difference in amplitude ratio between the specific direction and the other direction, the stronger the directivity can be.

一方、特定方向に対してマイナスの指向性、つまり特定方向の音を減衰させたい場合には、減衰させたい方向の振幅比を、他の振幅比と比較して小さいものとする。これにより、減衰させたい方向の振幅比と、他の方向の振幅比との差は、大きくなるため、後の利得決定回路において、他の方向の音を強調される利得ｇが与えられ、特定方向に対しては相対的に抑制される利得が与えられる。 On the other hand, when it is desired to attenuate a directivity that is negative with respect to a specific direction, that is, a sound in a specific direction, the amplitude ratio in the direction to be attenuated is set to be smaller than other amplitude ratios. As a result, the difference between the amplitude ratio in the direction to be attenuated and the amplitude ratio in the other direction becomes large, so that a gain g for emphasizing the sound in the other direction is given in a later gain determination circuit. A relatively suppressed gain is provided for the direction.

また、本実施形態では、２７０ｄｅｇ〜９０ｄｅｇの範囲において、２７０ｄｅｇ〜１５ｄｅｇの範囲と、１５ｄｅｇ〜９０ｄｅｇの範囲でそれぞれ方向毎に固有の振幅比として設定した。しかしながら、２７０ｄｅｇ〜９０ｄｅｇの範囲で、方向毎に固有の振幅比として設定することもできる。さらに、０ｄｅｇ〜３６０ｄｅｇの範囲で、方向毎に固有の振幅比として設定しても良い。 In this embodiment, in the range of 270 deg to 90 deg, the specific amplitude ratio is set for each direction in the range of 270 deg to 15 deg and 15 deg to 90 deg. However, it can be set as a specific amplitude ratio for each direction in the range of 270 deg to 90 deg. Further, it may be set as a specific amplitude ratio for each direction in the range of 0 deg to 360 deg.

本実施形態では、係数決定回路３として、特性補正回路３ａ、交換回路３ｂ、及びに係数更新回路３ｃが直列に接続された回路を使用したが、合成信号ＩｎＡ（ｋ）と合成信号ＩｎＢ（ｋ）の波形間の差分に応じて利得を決定する回路であれば、上記実施形態に限定することなく、他の回路や方法で実現可能である。 In the present embodiment, a circuit in which the characteristic correction circuit 3a, the exchange circuit 3b, and the coefficient update circuit 3c are connected in series is used as the coefficient determination circuit 3, but the combined signal InA (k) and the combined signal InB (k As long as it is a circuit that determines the gain according to the difference between the waveforms of (), it can be realized by other circuits and methods without being limited to the above embodiment.

本実施形態では、合成回路４において、合成信号ＩｎＡ（ｋ）と合成信号ＩｎＢ（ｋ）に対して、任意の比率で係数ｍ（ｋ）を乗じ、任意の比率で足し合わせて利得ｇを決定する。利得ｇの一例としては、以下の（１）〜（５）が挙げられる。
（１）そのままの係数ｍ（ｋ）。
（２）ｇ＝ｍ（ｋ）＾２、ｇ＝ｍ（ｋ）＾３のように、係数ｍ（ｋ）を指定数乗した値。
（３）ｇ＝ｍ（ｋ）＋ａ、ｇ＝ｍ（ｋ）−ａのように、係数ｍ（ｋ）に指定数を足した値。
（４）ｇ＝ｍ（ｋ）＊ｂのように、係数ｍ（ｋ）に一定数ｂを掛け合わせた値。
（５）ｇ＝（（ｍ（ｋ）＋ａ）＊ｂ）＾２のように、（２）〜（４）を組み合わせた値。 In the present embodiment, the combining circuit 4 multiplies the combined signal InA (k) and the combined signal InB (k) by a coefficient m (k) at an arbitrary ratio, and adds the arbitrary ratio to determine the gain g. To do. Examples of the gain g include the following (1) to (5).
(1) The coefficient m (k) as it is.
(2) A value obtained by multiplying a coefficient m (k) by a specified number, such as g = m (k) ^ 2 and g = m (k) ^ 3.
(3) A value obtained by adding a specified number to the coefficient m (k) such as g = m (k) + a and g = m (k) −a.
(4) A value obtained by multiplying the coefficient m (k) by a certain number b, such as g = m (k) * b.
(5) A value obtained by combining (2) to (4) as in g = ((m (k) + a) * b) ^ 2.

また、出力信号ＯｕｔＬ（ｋ）と出力信号ＯｕｔＲ（ｋ）は、以下の式（１６）（１７）のように合成することもできる。
出力信号ｏｕｔＬ＝（ＩｎＭ１（ｋ）＋ｏｕｔＬ（ｋ））＊ｇ
出力信号ｏｕｔＲ＝（ＩｎＭ２（ｋ）＋ｏｕｔＲ（ｋ））＊ｇ・・・・（１６）

出力信号ｏｕｔＬ＝ｏｕｔＲ＝（ｏｕｔＬ＋ｏｕｔＲ）＊ｇ＋ＩｎＭ１（ｋ）＋ＩｎＭ２（ｋ）・・・・（１７） Further, the output signal OutL (k) and the output signal OutR (k) can be combined as in the following equations (16) and (17).
Output signal outL = (InM1 (k) + outL (k)) * g
Output signal outR = (InM2 (k) + outR (k)) * g (16)

Output signal outL = outR = (outL + outR) * g + InM1 (k) + InM2 (k) (17)

（その他の実施形態）
以上のように、本発明の実施形態を説明したが、この実施形態は、例として提示したものであり、発明の範囲を限定することを意図していない。これら新規な実施形態は、そのほかの様々な形態で実施されることが可能であり、発明の要旨を逸脱しない範囲で、種々の省略、置き換え、変更を行うことができる。これら実施形態やその変形は、発明の範囲や要旨に含まれるとともに、特許請求の範囲に記載された発明とその均等の範囲に含まれる。 (Other embodiments)
As mentioned above, although embodiment of this invention was described, this embodiment is shown as an example and is not intending limiting the range of invention. These novel embodiments can be implemented in various other forms, and various omissions, replacements, and changes can be made without departing from the scope of the invention. These embodiments and modifications thereof are included in the scope and gist of the invention, and are included in the invention described in the claims and the equivalents thereof.

例えば、実施形態では、音源分離装置がＩＣレコーダやＰＣや携帯端末等の収録機能を有する機器に搭載されることを前提に説明したが、その他のあらゆる音響機器に搭載することもでき、マイクロフォンに代えて、音波データを記憶したメモリから音波信号ＩｎＭ１及びＩｎＭ２の提供を受けることができる。例えば、マイクロフォンからリアルタイムに入力された音波信号の他、音源分離装置に接続された一対のマイクロフォンで予め収録されて得られた音波信号、全く別の一対のマイクロフォンで予め収録されて得られた音波信号、コンピュータ等を用いて一対のマイクロフォンで収録された音波に見立てて擬似的に生成された音波信号に対して、特定方向に指向性を形成することを含む。 For example, in the embodiment, the sound source separation device has been described on the assumption that the sound source separation device is mounted on a device having a recording function such as an IC recorder, a PC, or a portable terminal. However, the sound source separation device can be mounted on any other audio device, Instead, the sound wave signals InM1 and InM2 can be received from the memory storing the sound wave data. For example, in addition to a sound wave signal input in real time from a microphone, a sound wave signal recorded in advance by a pair of microphones connected to a sound source separation device, a sound wave obtained in advance by a completely different pair of microphones This includes forming directivity in a specific direction with respect to a sound wave signal that is artificially generated by using a signal, a computer, or the like as a sound wave recorded by a pair of microphones.

また、図１５に示すように、振幅比設定部１の後段に、合成信号の振幅比を測定する振幅比測定部１２を設けることもできる。振幅比測定部１２では、予め振幅比設定部１における方向毎の振幅率の比のデータを保持し、新たに入力した一対の音波信号の振幅比により、その音波信号の方向を測定する。
すなわち、予め音波信号ＩｎＭ１（ｋ）及び音波信号ＩｎＭ２（ｋ）が入力した際の振幅比設定部１における方向毎の振幅率の比のデータをメモリ内に記録する。その上で、入力する音波信号とＩｎＭ１’（ｋ）と音波信号ＩｎＭ２’（ｋ）との合成信号ＩｎＡ’（ｋ）と合成信号ＩｎＢ’（ｋ）との振幅比を測定する。そして、測定した振幅比を、振幅比測定部１２内のメモリ内に記憶する振幅比設定部１での音波信号ＩｎＭ１（ｋ）及び音波信号ＩｎＭ２（ｋ）の振幅変化率の比と比較する。
音波信号ＩｎＭ１（ｋ）及び音波信号ＩｎＭ２（ｋ）の振幅変化率の比は、方向毎に固有の値であるために、測定した合成信号ＩｎＡ’（ｋ）と合成信号ＩｎＢ’（ｋ）との振幅比と比較することで、音波信号とＩｎＭ１’（ｋ）と音波信号ＩｎＭ２’（ｋ）を発生する音源の方向を知ることもできる。 As shown in FIG. 15, an amplitude ratio measuring unit 12 that measures the amplitude ratio of the combined signal can be provided at the subsequent stage of the amplitude ratio setting unit 1. The amplitude ratio measurement unit 12 stores in advance data of the ratio of the amplitude rate for each direction in the amplitude ratio setting unit 1 and measures the direction of the sound wave signal based on the amplitude ratio of a pair of newly input sound wave signals.
That is, the ratio data of the amplitude ratio for each direction in the amplitude ratio setting unit 1 when the sound wave signal InM1 (k) and the sound wave signal InM2 (k) are input in advance is recorded in the memory. Then, the amplitude ratio of the combined signal InA ′ (k) and the combined signal InB ′ (k) of the input sound wave signal, InM1 ′ (k) and the sound wave signal InM2 ′ (k) is measured. Then, the measured amplitude ratio is compared with the ratio of the amplitude change rate of the sound wave signal InM1 (k) and the sound wave signal InM2 (k) in the amplitude ratio setting unit 1 stored in the memory in the amplitude ratio measurement unit 12.
Since the ratio of the amplitude change rate of the sound wave signal InM1 (k) and the sound wave signal InM2 (k) is a unique value for each direction, the measured synthesized signal InA ′ (k) and synthesized signal InB ′ (k) , The direction of the sound source that generates the sound wave signal, InM1 ′ (k), and sound wave signal InM2 ′ (k) can also be known.

また、図１６に示すように、係数更新回路は、交換信号の片方に係数ｍを乗じた上で、交換信号の誤差信号を生成し、この誤差信号を含む係数ｍの漸化式を演算して係数ｍを１サンプル毎に更新するようにすれば、上記実施形態に限定することなく、その他の態様で実現可能である。 Further, as shown in FIG. 16, the coefficient updating circuit multiplies one of the exchange signals by a coefficient m, generates an error signal of the exchange signal, and calculates a recurrence formula of the coefficient m including the error signal. If the coefficient m is updated for each sample, the present invention is not limited to the above embodiment and can be realized in other modes.

また、この音源分離装置は、ＣＰＵやＤＳＰのソフトウェア処理として実現してもよいし、専用のデジタル回路で構成するようにしてもよい。ソフトウェア処理として実現する場合には、ＣＰＵ、外部メモリ、ＲＡＭを備えるコンピュータにおいて、振幅比設定部１、振幅変更部２、係数決定回路３、合成回路４と同一の処理内容を記述したプログラムをＲＯＭやハードディスクやフラッシュメモリ等の外部メモリに記憶させ、ＲＡＭに適宜展開し、ＣＰＵで其のプログラムに従って演算を行うようにすればよい。 The sound source separation device may be realized as software processing of a CPU or DSP, or may be configured by a dedicated digital circuit. When implemented as software processing, in a computer having a CPU, external memory, and RAM, a program describing the same processing contents as the amplitude ratio setting unit 1, amplitude changing unit 2, coefficient determination circuit 3, and synthesis circuit 4 is read into ROM Or stored in an external memory such as a hard disk or a flash memory, appropriately expanded in a RAM, and the CPU may perform calculations according to the program.

１振幅比設定部
１ａ加算器
１ｂ加算器
２振幅変更部
３係数決定回路
３ａ特性補正回路
３ｂ交換回路
３ｃ係数更新回路
４合成回路
５積算器
６加算器
７積算器
８積算器
９加算器
１０遅延器
１１積算器
１２振幅比測定部

DESCRIPTION OF SYMBOLS 1 Amplitude ratio setting part 1a Adder 1b Adder 2 Amplitude change part 3 Coefficient determination circuit 3a Characteristic correction circuit 3b Exchange circuit 3c Coefficient update circuit 4 Synthesis circuit 5 Accumulator 6 Adder 7 Accumulator 8 Accumulator 9 Adder 10 Delay 11 Accumulator 12 Amplitude ratio measurement unit

Claims

A sound source separation method for forming directivity in a specific direction with respect to a pair of sound wave signals,
Any number of filters,
Using an arbitrary number of adders provided in the subsequent stage,
An amplitude ratio setting step in which the amplitude ratio of the pair of sound wave signals obtained by the filter and the adder is a unique value for each direction;
Applying a filtering process for changing the amplitude of the pair of sound wave signals at a constant ratio, the fixed ratio is an amplitude changing step for changing the amplitude ratio in a specific direction to be the same,
A gain determining step for determining a gain according to a difference between waveforms of a pair of sound wave signals that have undergone the amplitude changing step;
An output step of multiplying the sound wave signal by the gain and outputting;
Providing
A sound source separation method characterized by the above.

In the amplitude ratio setting step,
The amplitude of the pair of sound wave signals is changed at a specific rate of change depending on the direction, and the amplitude derived from the same direction is changed at a different rate of change.
The sound source separation method according to claim 1.

The polar pattern indicating the ratio of the change rate is
The sound source separation method according to claim 2, wherein the sound source separation method is different for each pair of sound wave signals.

The rate of change of the amplitude is in a specific direction for emphasis or attenuation.
4. The sound source separation method according to claim 3, wherein the rate of change is larger than the amplitude rate in other directions.

The gain determining step includes:
An exchange step for generating a pair of exchange signals by alternately exchanging the pair of synthesis signals for each sample by an exchange circuit for the synthesized signal that has undergone the amplitude changing step;
A step of generating an error signal of the exchange signal after multiplying one of the exchange signals by a coefficient m;
An update step of calculating a recurrence formula of the coefficient m including the error signal and updating the coefficient m every sample;
An output step of multiplying the pair of combined signals by the sequentially updated coefficient m and outputting the result;
Providing
The sound source separation method according to claim 4.

The sound source separation method according to claim 1, wherein the pair of sound wave signals is a pair of sound wave signals input from a pair of microphones.

6. The pair of sound wave signals according to any one of claims 1 to 5, wherein a system represented by a transmission coefficient when a system from a sound source to a microphone is simulated is convoluted. Sound source separation method.

A sound source separation device that forms directivity in a specific direction with respect to a pair of sound wave signals,
Any number of filters,
Any number of adders provided in the subsequent stage;
Applying a filtering process that changes the amplitude of the pair of sound wave signals at a constant rate, the fixed rate is an amplitude changing unit that changes the amplitude ratio in a specific direction to the same value,
An amplitude changing unit that changes a specific amplitude ratio to the same amplitude among the amplitude ratios of the pair of sound wave signals;
A gain determining unit that determines a gain according to a difference between waveforms of a pair of sound wave signals that have undergone the amplitude changing step;
An output unit that multiplies the sound wave signal by the gain and outputs it;
Providing
A sound source separation device characterized by the above.

In the amplitude ratio setting unit,
9. The sound source separation device according to claim 8, wherein the amplitude of the pair of sound wave signals is changed at a specific rate of change depending on the direction, and the amplitude derived from the same direction is changed at a different rate of change.

The polar pattern indicating the ratio of the change rate is
The sound source separation device according to claim 9, wherein the sound source separation device is different for each pair of sound wave signals.

The rate of change of the amplitude is in a specific direction for emphasis or attenuation.
The sound source separation device according to claim 10, wherein the rate of change is larger than the amplitude rate in the other direction.

The gain determining unit
An exchange unit that generates a pair of exchange signals by alternately exchanging the pair of synthesis signals for each sample by an exchange circuit with the synthesized signal that has passed through the amplitude changing unit;
A generator for generating an error signal of the exchange signal after multiplying one of the exchange signals by a coefficient m;
An updating unit that calculates a recurrence formula of the coefficient m including the error signal and updates the coefficient m for each sample;
An output unit for multiplying and outputting the pair of sound wave signals by the sequentially updated coefficient m;
Providing
The sound source separation device according to claim 11.

The sound source separation device according to claim 8, wherein the pair of sound wave signals is a pair of sound wave signals input from a pair of microphones.

13. The pair of sound wave signals is a system in which a system represented by a transfer coefficient when a system from a sound source to a microphone is simulated is convoluted. Sound source separation device.

A sound source separation program that forms directivity in a specific direction with respect to a pair of sound wave signals,
Any number of filters,
A computer comprising any number of adders provided in the subsequent stage;
An amplitude ratio setting means for setting the amplitude ratio of the pair of sound wave signals obtained by the filter and the adder to a unique value for each direction;
Applying a filtering process to change the amplitude of the pair of sound wave signals at a constant ratio, the fixed ratio is an amplitude changing means for changing the amplitude ratio in a specific direction to be the same,
Gain determining means for determining a gain according to a difference between waveforms of a pair of sound wave signals that have undergone the amplitude changing step;
Output means for multiplying the sound wave signal by the gain and outputting;
A sound source separation program characterized by being made to function.

The sound source separation program according to claim 15, wherein the amplitude of the pair of sound wave signals is changed at a specific rate of change depending on the direction, and the amplitude derived from the same direction is changed at a different rate of change.

The polar pattern indicating the ratio of the change rate is
The sound source separation program according to claim 16, wherein the sound source separation program is different for each pair of sound wave signals.

The rate of change of the amplitude is in a specific direction for emphasis or attenuation.
The sound source separation program according to claim 17, wherein the rate of change is larger than the amplitude rate in the other direction.

The synthesized signal that has undergone the amplitude changing step is generated by alternately exchanging the pair of synthesized signals for each sample by an exchange circuit,
After multiplying one of the exchange signals by a coefficient m, an error signal of the exchange signal is generated,
Calculating a recurrence formula of the coefficient m including the error signal to update the coefficient m every sample;
Multiplying the pair of synthesized signals by the sequentially updated coefficient m and outputting the result.
The sound source separation program according to claim 18, which realizes a function.