JP5454157B2

JP5454157B2 - Sound processor

Info

Publication number: JP5454157B2
Application number: JP2010006434A
Authority: JP
Inventors: 多伸近藤
Original assignee: Yamaha Corp
Current assignee: Yamaha Corp
Priority date: 2010-01-15
Filing date: 2010-01-15
Publication date: 2014-03-26
Anticipated expiration: 2030-01-15
Also published as: JP2011146948A

Description

本発明は、ステレオ形式の音響信号のうち目的の位置（方向）に音像が定位する成分（例えば、右チャネルおよび左チャネルの各々の信号に含まれる成分である。以下では「定位成分」という）を強調または抑圧する技術に関する。 The present invention is a component in which a sound image is localized at a target position (direction) in a stereo format acoustic signal (for example, a component included in each signal of a right channel and a left channel. Hereinafter, referred to as a “localization component”). It relates to technology that emphasizes or suppresses.

ＣＤ等の記録媒体に収録された音響信号から所定の定位成分を抑圧する技術が従来から提案されている。例えば特許文献１には、ステレオ形式の音響信号の一方から他方を減算（逆相加算）することで、音像が中央方向に定位する成分（典型的には歌唱音）を抑圧する技術が開示されている。 Conventionally, a technique for suppressing a predetermined localization component from an acoustic signal recorded on a recording medium such as a CD has been proposed. For example, Patent Document 1 discloses a technique for suppressing a component (typically a singing sound) in which a sound image is localized in the center direction by subtracting one of stereo audio signals from the other (reverse phase addition). ing.

特開２００７−０７９４１３号公報JP 2007-079413 A

しかし、特許文献１の技術では、所定の成分の抑圧後の音響信号がモノラル形式になるという問題がある。同様の問題は、定位成分を強調する場合にも発生し得る。以上の事情を考慮して、本発明は、定位成分を強調または抑圧したステレオ形式の音響信号を生成することを目的とする。 However, the technique of Patent Document 1 has a problem that the acoustic signal after suppression of a predetermined component is in a monaural format. Similar problems can occur when emphasizing the localization component. In view of the above circumstances, an object of the present invention is to generate a stereo-type acoustic signal in which a localization component is emphasized or suppressed.

以上の課題を解決するために、本発明の音響処理装置は、周波数毎の係数値で構成されてステレオ形式の第１音響信号および第２音響信号における特定位置の定位成分を強調または抑圧する第１数値列を生成する第１生成手段と、第１音響信号と第２音響信号との周波数毎の位相差に応じた係数値で構成される第２数値列を、第１音響信号の振幅と第２音響信号の振幅と両者の和成分の振幅とから生成する第２生成手段と、周波数毎の係数値で構成されて定位成分を強調または抑圧する第１処理係数列を第１数値列と第２数値列とから生成する合成手段と、第１音響信号および第２音響信号の各々の各周波数成分に第１処理係数列の各係数値を作用させる信号処理手段とを具備する。以上の構成においては、第１音響信号および第２音響信号の各々の各周波数成分に第１処理係数列を作用させるから、定位成分が強調または抑圧されたステレオ形式の音響信号を生成することが可能である。また、第１音響信号と第２音響信号との位相差が第１処理係数列に反映されるから、第１音響信号と第２音響信号との間に顕著な位相差がある場合でも、定位成分の強調または抑圧の効果を充分に維持できるという利点がある。 In order to solve the above-described problems, the sound processing apparatus according to the present invention is configured to emphasize or suppress a localization component at a specific position in the stereo first sound signal and the second sound signal, which is configured with coefficient values for each frequency. A first generating means for generating one numerical sequence, and a second numerical sequence composed of coefficient values corresponding to the phase difference for each frequency between the first acoustic signal and the second acoustic signal, and the amplitude of the first acoustic signal, Second generation means for generating from the amplitude of the second acoustic signal and the amplitude of the sum component of both, and a first processing coefficient sequence that is composed of coefficient values for each frequency and emphasizes or suppresses the localization component as a first numerical value sequence Synthesizing means generated from the second numerical sequence, and signal processing means for causing each coefficient value of the first processing coefficient sequence to act on each frequency component of the first acoustic signal and the second acoustic signal. In the above configuration, since the first processing coefficient sequence is applied to each frequency component of the first acoustic signal and the second acoustic signal, it is possible to generate a stereo format acoustic signal in which the localization component is emphasized or suppressed. Is possible. In addition, since the phase difference between the first acoustic signal and the second acoustic signal is reflected in the first processing coefficient sequence, localization is possible even when there is a significant phase difference between the first acoustic signal and the second acoustic signal. There is an advantage that the effect of emphasis or suppression of components can be sufficiently maintained.

本発明の好適な態様において、第２生成手段は、第１音響信号および第２音響信号の各々の振幅を加算した振幅和と和成分の振幅とを周波数毎に算定する振幅算定手段と、振幅和と和成分の振幅との相違に応じて第２数値列の各係数値を設定する設定手段とを含んで構成される。以上の態様においては、第１音響信号と第２音響信号との位相差に応じて第１音響信号および第２音響信号の振幅和と和成分の振幅との相違が変化するという傾向を利用して、第１音響信号と第２音響信号との位相差を高精度に反映した第２数値列を生成することが可能である。 In a preferred aspect of the present invention, the second generation means includes an amplitude calculation means for calculating, for each frequency, an amplitude sum obtained by adding the amplitudes of the first acoustic signal and the second acoustic signal, and an amplitude of the sum component. Setting means for setting each coefficient value of the second numerical sequence according to the difference between the sum and the amplitude of the sum component. In the above aspect, the tendency that the difference between the sum of the amplitudes of the first acoustic signal and the second acoustic signal and the amplitude of the sum component changes according to the phase difference between the first acoustic signal and the second acoustic signal is utilized. Thus, it is possible to generate the second numerical sequence that reflects the phase difference between the first acoustic signal and the second acoustic signal with high accuracy.

本発明の好適な態様に係る音響処理装置は、第１音響信号と第２音響信号との加算で和成分を生成する和成分生成手段と、定位成分を抑圧した差成分を第１音響信号と第２音響信号との間の減算で生成する差成分生成手段とを具備し、第１生成手段は、和成分および差成分から第１数値列を生成し、第２生成手段は、和成分生成手段が生成した和成分の振幅を第２数値列の生成に適用する。以上の態様においては、第１数値列の生成に利用される和成分が第２数値列の生成にも流用されるから、第２数値列の生成の負荷が軽減されるという利点がある。 The sound processing apparatus according to a preferred aspect of the present invention includes a sum component generating means for generating a sum component by adding the first sound signal and the second sound signal, and a difference component obtained by suppressing the localization component as the first sound signal. Difference component generating means generated by subtraction with the second acoustic signal, the first generating means generates a first numerical sequence from the sum component and the difference component, and the second generating means generates the sum component. The amplitude of the sum component generated by the means is applied to the generation of the second numerical sequence. In the above aspect, since the sum component used for generating the first numerical sequence is also used for generating the second numerical sequence, there is an advantage that the load of generating the second numerical sequence is reduced.

本発明の好適な態様において、第１生成手段は、和成分および差成分の一方のスペクトルから他方のスペクトルを減算した結果に応じて第１数値列を生成する。以上の態様においては、和成分および差成分の一方のスペクトルから他方のスペクトルを減算することで定位成分を高精度に強調または抑圧したスペクトルが生成されるから、定位成分を有効に強調または抑圧し得る第１処理係数列を生成できるという利点がある。 In a preferred aspect of the present invention, the first generation means generates a first numerical sequence according to a result of subtracting the other spectrum from one spectrum of the sum component and the difference component. In the above aspect, a spectrum in which the localization component is emphasized or suppressed with high accuracy is generated by subtracting the other spectrum from one spectrum of the sum component and difference component, so that the localization component is effectively emphasized or suppressed. There is an advantage that the obtained first processing coefficient sequence can be generated.

差成分生成手段を具備する態様の好適例に係る音響処理装置は、特定位置を示す定位変数を可変に設定する変数設定手段を具備し、差成分生成手段は、変数設定手段が設定した定位変数に応じた比率で第１音響信号および第２音響信号の一方から他方を減算する。以上の態様においては、差成分生成手段による減算時の第１音響信号および第２音響信号の比率が定位変数に応じて可変に設定される。したがって、強調または抑圧の対象となる定位成分の位置（方向）を可変に制御できる（中央方向に限定されない）という利点がある。 The sound processing apparatus according to a preferred example of the aspect including the difference component generation unit includes a variable setting unit that variably sets a localization variable indicating the specific position, and the difference component generation unit includes the localization variable set by the variable setting unit. The other is subtracted from one of the first acoustic signal and the second acoustic signal at a ratio according to. In the above aspect, the ratio of the first acoustic signal and the second acoustic signal at the time of subtraction by the difference component generation means is variably set according to the localization variable. Therefore, there is an advantage that the position (direction) of the localization component to be emphasized or suppressed can be variably controlled (not limited to the central direction).

本発明の好適な態様に係る音響処理装置は、定位成分の強調および抑圧の一方に対応する第１処理係数列の各係数値を所定値から減算することで定位成分の強調および抑圧の他方に対応する第２処理係数列を生成する第３生成手段を具備する。以上の態様においては、定位成分の強調用および抑圧用の双方の処理係数列を生成することが可能である。また、第１処理係数列の各係数値を所定値から減算することで第２処理係数列が生成されるから、第１処理係数列および第２処理係数列の各々の個別に生成する場合と比較して、第１処理係数列および第２処理係数列の生成の負荷が軽減されるという利点がある。 The acoustic processing device according to a preferred aspect of the present invention subtracts each coefficient value of the first processing coefficient sequence corresponding to one of the localization component enhancement and suppression from a predetermined value to the other of the localization component enhancement and suppression. Third generation means for generating a corresponding second processing coefficient sequence is provided. In the above aspect, it is possible to generate both processing coefficient sequences for emphasizing and suppressing localization components. In addition, since the second processing coefficient sequence is generated by subtracting each coefficient value of the first processing coefficient sequence from the predetermined value, each of the first processing coefficient sequence and the second processing coefficient sequence is generated individually. In comparison, there is an advantage that the load of generating the first processing coefficient sequence and the second processing coefficient sequence is reduced.

以上の各態様に係る音響処理装置は、音響信号の処理に専用されるＤＳＰ（Digital Signal Processor）などのハードウェア（電子回路）によって実現されるほか、ＣＰＵ（Central Processing Unit）などの汎用の演算処理装置とプログラム（ソフトウェア）との協働によっても実現される。本発明のプログラムは、周波数毎の係数値で構成されてステレオ形式の第１音響信号および第２音響信号における特定位置の定位成分を強調または抑圧する第１数値列を生成する第１生成処理と、第１音響信号と第２音響信号との周波数毎の位相差に応じた係数値で構成される第２数値列を、第１音響信号の振幅と第２音響信号の振幅と両者の和成分の振幅とから生成する第２生成処理と、周波数毎の係数値で構成されて定位成分を強調または抑圧する第１処理係数列を第１数値列と第２数値列とから生成する合成処理と、第１音響信号および第２音響信号の各々の各周波数成分に第１処理係数列の各係数値を作用させる信号処理とをコンピュータに実行させる。以上のプログラムによれば、本発明の音響処理装置と同様の作用および効果が実現される。本発明のプログラムは、コンピュータが読取可能な記録媒体に格納された形態で利用者に提供されてコンピュータにインストールされるほか、通信網を介した配信の形態でサーバ装置から提供されてコンピュータにインストールされる。 The acoustic processing device according to each of the above aspects is realized by hardware (electronic circuit) such as a DSP (Digital Signal Processor) dedicated to processing of an acoustic signal, or a general-purpose calculation such as a CPU (Central Processing Unit). It is also realized by cooperation between the processing device and a program (software). The program of the present invention includes a first generation process that generates a first numerical sequence that is composed of coefficient values for each frequency and emphasizes or suppresses a localization component at a specific position in the first acoustic signal and the second acoustic signal in stereo format. The second numerical sequence composed of coefficient values corresponding to the phase difference for each frequency between the first acoustic signal and the second acoustic signal is represented by the sum component of the amplitude of the first acoustic signal and the amplitude of the second acoustic signal. A second generation process generated from the amplitude of the first, a synthesis process for generating a first processing coefficient sequence composed of coefficient values for each frequency and emphasizing or suppressing a localization component from the first numerical value sequence and the second numerical value sequence; And causing the computer to execute signal processing in which each coefficient value of the first processing coefficient sequence is applied to each frequency component of the first acoustic signal and the second acoustic signal. According to the above program, the same operation and effect as the sound processing apparatus of the present invention are realized. The program of the present invention is provided to a user in a form stored in a computer-readable recording medium and installed in the computer, or provided from a server device in a form of distribution via a communication network and installed in the computer. Is done.

第１実施形態に係る音響処理装置のブロック図である。1 is a block diagram of a sound processing apparatus according to a first embodiment. 係数設定部のブロック図である。It is a block diagram of a coefficient setting part. 各周波数成分の関係を示す模式図である。It is a schematic diagram which shows the relationship of each frequency component. 第２生成部のブロック図である。It is a block diagram of the 2nd generating part. 各周波数成分の関係を示す模式図である。It is a schematic diagram which shows the relationship of each frequency component. 変形例における第３生成部のブロック図である。It is a block diagram of the 3rd generating part in a modification. 定位変数と係数値ＣA[k]との関係を示す模式図である。It is a schematic diagram which shows the relationship between a localization variable and coefficient value CA [k]. 定位変数と係数値ＣD[k]との関係を示す模式図である。It is a schematic diagram which shows the relationship between a localization variable and coefficient value CD [k]. 定位変数と位相差との関係を示す模式図である。It is a schematic diagram which shows the relationship between a localization variable and a phase difference.

＜Ａ：第１実施形態＞
図１は、本発明の第１実施形態に係る音響処理装置１００のブロック図である。音響処理装置１００には信号供給装置１２と放音装置１４と入力装置１６とが接続される。信号供給装置１２は、音響（音声や楽音）の波形を表す時間領域の音響信号ＳIN（ＳIN_L，ＳIN_R）を音響処理装置１００に供給する。左チャネルの音響信号ＳIN_Lおよび右チャネルの音響信号ＳIN_Rは、音響を発生する複数の音源の音像が相異なる位置に定位する（すなわち、音響の振幅や位相が各音源の位置に応じて相違する）ように収音または加工されたステレオ形式の信号である。周囲の音響を収音して音響信号ＳINを生成する収音機器（ステレオマイク）や、可搬型または内蔵型の記録媒体から音響信号ＳINを取得して音響処理装置１００に出力する再生装置や、通信網から音響信号ＳINを受信して音響処理装置１００に出力する通信装置が信号供給装置１２として採用され得る。 <A: First Embodiment>
FIG. 1 is a block diagram of a sound processing apparatus 100 according to the first embodiment of the present invention. A signal supply device 12, a sound emission device 14, and an input device 16 are connected to the sound processing device 100. The signal supply device 12 supplies the sound processing device 100 with time-domain sound signals SIN (SIN_L, SIN_R) representing the waveform of sound (speech and music). The sound signal SIN_L of the left channel and the sound signal SIN_R of the right channel are localized at positions where the sound images of a plurality of sound sources generating sound are different (that is, the sound amplitude and phase differ depending on the position of each sound source). In this way, a stereo signal is collected or processed. A sound collection device (stereo microphone) that collects ambient sound to generate an acoustic signal SIN, a playback device that acquires the acoustic signal SIN from a portable or built-in recording medium, and outputs it to the acoustic processing device 100; A communication device that receives the acoustic signal SIN from the communication network and outputs the acoustic signal SIN to the acoustic processing device 100 may be employed as the signal supply device 12.

音響処理装置１００は、信号供給装置１２が供給する音響信号ＳINから音響信号ＳOUT（ＳOUT_L，ＳOUT_R）を生成する。左チャネルの音響信号ＳOUT_Lおよび右チャネルの音響信号ＳOUT_Rは、音響信号ＳINが表す音響のうち目的の位置に音像が定位する定位成分を強調または抑圧したステレオ形式の信号である。放音装置１４（例えばステレオスピーカやステレオヘッドホン）は、音響処理装置１００が生成した音響信号ＳOUT（ＳOUT_L，ＳOUT_R）に応じた音波を放射する。 The acoustic processing device 100 generates an acoustic signal SOUT (SOUT_L, SOUT_R) from the acoustic signal SIN supplied by the signal supply device 12. The left-channel acoustic signal SOUT_L and the right-channel acoustic signal SOUT_R are stereo signals in which a localization component in which a sound image is localized at a target position is emphasized or suppressed among the sounds represented by the acoustic signal SIN. The sound emitting device 14 (for example, a stereo speaker or a stereo headphone) emits a sound wave corresponding to the acoustic signal SOUT (SOUT_L, SOUT_R) generated by the acoustic processing device 100.

入力装置１６は、音響処理装置１００に対する指示を利用者が入力するための機器（例えばマウスやキーボード）である。利用者は、入力装置１６を適宜に操作することで、定位成分の位置（方向）と定位成分の強調／抑圧の何れかの処理とを音響処理装置１００に対して任意に指示することが可能である。 The input device 16 is a device (for example, a mouse or a keyboard) for a user to input an instruction to the sound processing device 100. By appropriately operating the input device 16, the user can arbitrarily instruct the acoustic processing device 100 to perform either the localization component position (direction) or the localization component enhancement / suppression processing. It is.

図１に示すように、音響処理装置１００は、演算処理装置２２と記憶装置２４とを具備するコンピュータシステムで実現される。記憶装置２４は、演算処理装置２２が実行するプログラムＰGや演算処理装置２２が使用するデータを記憶する。半導体記録媒体や磁気記録媒体などの公知の記録媒体や複数種の記録媒体の組合せが記憶装置２４として任意に採用される。音響信号ＳIN（ＳIN_L，ＳIN_R）を記憶装置２４に記憶した構成（したがって信号供給装置１２は省略され得る）も好適である。 As shown in FIG. 1, the sound processing device 100 is realized by a computer system including an arithmetic processing device 22 and a storage device 24. The storage device 24 stores a program PG executed by the arithmetic processing device 22 and data used by the arithmetic processing device 22. A known recording medium such as a semiconductor recording medium or a magnetic recording medium or a combination of a plurality of types of recording media is arbitrarily adopted as the storage device 24. A configuration in which the acoustic signal SIN (SIN_L, SIN_R) is stored in the storage device 24 (therefore, the signal supply device 12 can be omitted) is also suitable.

演算処理装置２２は、記憶装置２４に格納されたプログラムＰGを実行することで、音響信号ＳINから音響信号ＳOUTを生成するための複数の機能（変数設定部３２，周波数分析部３４，係数設定部３６，信号処理部３８，波形合成部４０）を実現する。なお、演算処理装置２２の各機能を複数の集積回路に分散した構成や、専用の電子回路（ＤＳＰ）が各機能を実現する構成も採用され得る。 The arithmetic processing unit 22 executes a program PG stored in the storage device 24 to thereby generate a plurality of functions (variable setting unit 32, frequency analysis unit 34, coefficient setting unit) for generating the acoustic signal SOUT from the acoustic signal SIN. 36, a signal processing unit 38, and a waveform synthesis unit 40). A configuration in which each function of the arithmetic processing unit 22 is distributed over a plurality of integrated circuits, or a configuration in which a dedicated electronic circuit (DSP) realizes each function may be employed.

変数設定部３２は、定位変数αを可変に設定する。定位変数αは、定位成分の音像の位置（方向）を指定する変数である。本形態の変数設定部３２は、入力装置１６に対する利用者からの指示に応じて定位変数α（０≦α≦１）を可変に設定する。例えば、定位変数αが０．５（中央値）である場合には中央（正面）方向が指示される。また、定位変数αが１に近いほど右寄りの方向が指示され、定位変数αが０に近いほど左寄りの方向が指示される。利用者は、放音装置１４からの再生音を聴取しながら随時に入力装置１６の操作で定位変数αを変更することが可能である。 The variable setting unit 32 sets the localization variable α to be variable. The localization variable α is a variable that designates the position (direction) of the sound image of the localization component. The variable setting unit 32 of the present embodiment variably sets the localization variable α (0 ≦ α ≦ 1) in accordance with an instruction from the user to the input device 16. For example, when the localization variable α is 0.5 (median value), the center (front) direction is indicated. Further, as the localization variable α is closer to 1, a rightward direction is indicated, and as the localization variable α is closer to 0, a leftward direction is indicated. The user can change the localization variable α by operating the input device 16 at any time while listening to the reproduced sound from the sound emitting device 14.

周波数分析部３４は、音響信号ＳIN_Lの周波数スペクトル（複素スペクトル）ＬAと音響信号ＳIN_Rの周波数スペクトル（複素スペクトル）ＲAとを時間軸上の単位区間（フレーム）毎に順次に生成する。周波数スペクトルＬAは、Ｋ個の周波数（周波数帯域）ｆ1〜ｆKの各々に対応する周波数成分ＬAk(e^jω)の系列（ＬA1(e^jω)〜ＬAK(e^jω)）である（ｋ＝１〜Ｋ）。記号ｊは虚数単位を意味する。同様に、周波数スペクトルＲAは、周波数ｆ1〜ｆKの各々に対応する周波数成分ＲAk(e^jω)の系列（ＲA1(e^jω)〜ＲAK(e^jω)）である。周波数スペクトルＬAおよび周波数スペクトルＲAの生成には、短時間フーリエ変換などの公知の周波数分析が任意に採用され得る。なお、通過帯域が相異なるＫ個の帯域通過フィルタで構成されるフィルタバンクも周波数分析部３４として採用され得る。 The frequency analysis unit 34 sequentially generates a frequency spectrum (complex spectrum) LA of the acoustic signal SIN_L and a frequency spectrum (complex spectrum) RA of the acoustic signal SIN_R for each unit section (frame) on the time axis. The frequency spectrum LA is a series of frequency components LAk (e ^jω ) (LA1 (e ^jω ) to LAK (e ^jω )) corresponding to each of K frequencies (frequency bands) f1 to fK (k = 1 to 1). K). The symbol j means an imaginary unit. Similarly, the frequency spectrum RA is a series of frequency components RAk (e ^jω ) (RA1 (e ^jω ) to RAK (e ^jω )) corresponding to each of the frequencies f1 to fK. For the generation of the frequency spectrum LA and the frequency spectrum RA, a known frequency analysis such as a short-time Fourier transform may be arbitrarily employed. A filter bank composed of K band pass filters having different pass bands can also be adopted as the frequency analysis unit 34.

図１の係数設定部３６は、音響信号ＳIN（ＳIN_L，ＳIN_R）の定位成分を強調（enhance）するための処理係数列Ｇeと音響信号ＳINの定位成分を抑圧（suppress）するための処理係数列Ｇsとを生成する。処理係数列Ｇeおよび処理係数列Ｇsは、周波数スペクトルＬAと周波数スペクトルＲAとを利用して単位区間毎に順次に生成される。処理係数列Ｇeは、周波数ｆ1〜ｆKの各々に対応する係数値Ｇe[k]の系列（Ｇe[1]〜Ｇe[K]）であり、処理係数列Ｇsは、周波数ｆ1〜ｆKの各々に対応する係数値Ｇs[k]の系列（Ｇs[1]〜Ｇs[K]）である。 The coefficient setting unit 36 in FIG. 1 includes a processing coefficient sequence Ge for enhancing the localization component of the acoustic signal SIN (SIN_L, SIN_R) and a processing coefficient sequence for suppressing the localization component of the acoustic signal SIN. Gs. The processing coefficient sequence Ge and the processing coefficient sequence Gs are sequentially generated for each unit section using the frequency spectrum LA and the frequency spectrum RA. The processing coefficient sequence Ge is a series of coefficient values Ge [k] (Ge [1] to Ge [K]) corresponding to each of the frequencies f1 to fK, and the processing coefficient sequence Gs is assigned to each of the frequencies f1 to fK. This is a series of corresponding coefficient values Gs [k] (Gs [1] to Gs [K]).

係数値Ｇe[k]および係数値Ｇs[k]は、音響信号ＳIN_Lの周波数成分ＬAk(e^jω)や音響信号ＳIN_Rの周波数成分ＲAk(e^jω)に対するゲイン（スペクトルゲイン）に相当し、音響信号ＳIN（ＳIN_L，ＳIN_R）の特性に応じて０以上かつ１以下の範囲内で可変に設定される（０≦Ｇe[k]≦１，０≦Ｇs[k]≦１）。具体的には、定位成分の強調用の処理係数列Ｇeの係数値Ｇe[1]〜Ｇe[K]は、定位成分のパワー（振幅）が大きい周波数ｆkの係数値Ｇe[k]ほど１に近い数値に設定される。他方、定位成分の抑圧用の処理係数列Ｇsの係数値Ｇs[1]〜Ｇs[K]は、定位成分のパワーが大きい周波数ｆkの係数値Ｇs[k]ほど０に近い数値に設定される。 The coefficient value Ge [k] and the coefficient value Gs [k] correspond to a gain (spectral gain) with respect to the frequency component LAk (e ^jω ) of the acoustic signal SIN_L and the frequency component RAk (e ^jω ) of the acoustic signal SIN_R. It is variably set within the range of 0 or more and 1 or less according to the characteristics of SIN (SIN_L, SIN_R) (0 ≦ Ge [k] ≦ 1, 0 ≦ Gs [k] ≦ 1). Specifically, the coefficient values Ge [1] to Ge [K] of the processing coefficient sequence Ge for emphasizing the localization component are set to 1 as the coefficient value Ge [k] of the frequency fk having a large localization component power (amplitude). Set to a close number. On the other hand, the coefficient values Gs [1] to Gs [K] of the processing coefficient sequence Gs for suppressing the localization component are set to values closer to 0 as the coefficient value Gs [k] of the frequency fk having a larger localization component power. .

図１の信号処理部３８は、係数設定部３６が生成した処理係数列Ｇeまたは処理係数列Ｇsを周波数スペクトルＬAおよび周波数スペクトルＲAの各々に個別に作用させる（典型的には乗算する）ことで周波数スペクトルＬBと周波数スペクトルＲBとを単位区間毎に順次に生成する。周波数スペクトルＬBは、周波数ｆ1〜ｆKの各々に対応する周波数成分ＬBk(e^jω)の系列（ＬB1(e^jω)〜ＬBK(e^jω)）であり、周波数スペクトルＲBは、周波数ｆ1〜ｆKの各々に対応する周波数成分ＲBk(e^jω)の系列（ＲB1(e^jω)〜ＲBK(e^jω)）である。各単位区間の周波数スペクトルＬAおよび周波数スペクトルＲAの処理には、その単位区間について係数設定部３６が生成した処理係数列Ｇeまたは処理係数列Ｇsが適用される。 The signal processing unit 38 of FIG. 1 causes the processing coefficient sequence Ge or the processing coefficient sequence Gs generated by the coefficient setting unit 36 to individually act (typically multiply) each of the frequency spectrum LA and the frequency spectrum RA. The frequency spectrum LB and the frequency spectrum RB are sequentially generated for each unit section. The frequency spectrum LB is a series of frequency components LBk (e ^jω ) (LB1 (e ^jω ) to LBK (e ^jω )) corresponding to each of the frequencies f1 to fK, and the frequency spectrum RB is each of the frequencies f1 to fK. Is a series of frequency components RBk (e ^jω ) corresponding to (RB1 (e ^jω ) to RBK (e ^jω )). For the processing of the frequency spectrum LA and the frequency spectrum RA of each unit section, the processing coefficient sequence Ge or the processing coefficient sequence Gs generated by the coefficient setting unit 36 for the unit section is applied.

入力装置１６に対して利用者から定位成分の強調が指示された場合、信号処理部３８は、以下の数式(1a)および数式(1b)に示すように、周波数スペクトルＬAおよび周波数スペクトルＲAに対する処理係数列Ｇeの乗算で周波数スペクトルＬBおよび周波数スペクトルＲBを生成する。すなわち、周波数スペクトルＬBの各周波数ｆkの周波数成分ＬBk(e^jω)は、その周波数ｆkの周波数成分ＬAk(e^jω)と係数値Ｇe[k]との乗算値に設定され（数式(1a)）、周波数スペクトルＲBの各周波数ｆkの周波数成分ＲBk(e^jω)は、その周波数ｆkの周波数成分ＲAk(e^jω)と係数値Ｇe[k]との乗算値に設定される（数式(1b)）。したがって、音響信号ＳIN（ＳIN_L，ＳIN_R）の定位成分を強調した周波数スペクトルＬBおよび周波数スペクトルＲBが生成される。

When the user instructs the input device 16 to emphasize the localization component, the signal processing unit 38 processes the frequency spectrum LA and the frequency spectrum RA as shown in the following formulas (1a) and (1b). A frequency spectrum LB and a frequency spectrum RB are generated by multiplication of the coefficient sequence Ge. That is, the frequency components LBk of each frequency fk in the frequency spectrum ^LB (e jω) is set to the multiplication value between the frequency component of the frequency fk LAk ^(e jω) and the coefficient value Ge [k] (Formula (1a)) , the frequency components RBk of each frequency fk in the frequency spectrum ^RB (e jω) is set to the multiplication value between the frequency component of the frequency fk RAk ^(e jω) and the coefficient value Ge [k] (equation (1b)) . Therefore, the frequency spectrum LB and the frequency spectrum RB in which the localization component of the acoustic signal SIN (SIN_L, SIN_R) is emphasized are generated.

他方、入力装置１６に対して利用者から定位成分の抑圧が指示された場合、信号処理部３８は、以下の数式(2a)および数式(2b)に示すように、周波数スペクトルＬBおよび周波数スペクトルＲBの生成に定位成分の抑圧用の処理係数列Ｇsを適用する。すなわち、周波数スペクトルＬAの各周波数成分ＬAk(e^jω)と処理係数列Ｇsの各係数値Ｇs[k]との乗算で周波数スペクトルＬBの各周波数成分ＬBk(e^jω)を算定し（数式(2a)）、周波数スペクトルＲAの各周波数成分ＲAk(e^jω)と各係数値Ｇs[k]との乗算で周波数スペクトルＲBの各周波数成分ＲBk(e^jω)を算定する（数式(2b)）。したがって、音響信号ＳIN（ＳIN_L，ＳIN_R）の定位成分を抑圧した周波数スペクトルＬBおよび周波数スペクトルＲBが生成される。

On the other hand, when the user instructs the input device 16 to suppress the localization component, the signal processing unit 38, as shown in the following equations (2a) and (2b), the frequency spectrum LB and the frequency spectrum RB The processing coefficient sequence Gs for suppressing the localization component is applied to the generation of. That is, each frequency component LBk (e ^jω ) of the frequency spectrum LB is calculated by multiplying each frequency component LAk (e ^jω ) of the frequency spectrum LA by each coefficient value Gs [k] of the processing coefficient sequence Gs (formula (2a )), calculates the respective frequency components of the frequency spectrum RB RBk ^(e jω) by multiplying the each frequency component RAk of the frequency spectrum RA (e ^{j [omega])} and the coefficient value Gs [k] (formula (2b)). Therefore, the frequency spectrum LB and the frequency spectrum RB in which the localization component of the acoustic signal SIN (SIN_L, SIN_R) is suppressed are generated.

図１の波形合成部４０は、信号処理部３８による処理後の周波数スペクトルＬBおよび周波数スペクトルＲBからステレオ形式の音響信号ＳOUT_Lおよび音響信号ＳOUT_Rを生成する。具体的には、波形合成部４０は、単位区間毎の周波数スペクトルＬBを逆フーリエ変換で時間領域の信号に変換するとともに前後の単位区間について相互に連結することで音響信号ＳOUT_Lを生成する。同様に、波形合成部４０は、各周波数スペクトルＲBから音響信号ＳOUT_Rを生成する。波形合成部４０が生成した音響信号ＳOUT（ＳOUT_L，ＳOUT_R）が放音装置１４に供給されて音波として再生される。 The waveform synthesizer 40 in FIG. 1 generates a stereo acoustic signal SOUT_L and an acoustic signal SOUT_R from the frequency spectrum LB and the frequency spectrum RB processed by the signal processor 38. Specifically, the waveform synthesizer 40 generates the acoustic signal SOUT_L by converting the frequency spectrum LB for each unit section into a signal in the time domain by inverse Fourier transform and mutually connecting the preceding and following unit sections. Similarly, the waveform synthesizer 40 generates an acoustic signal SOUT_R from each frequency spectrum RB. The acoustic signals SOUT (SOUT_L, SOUT_R) generated by the waveform synthesis unit 40 are supplied to the sound emitting device 14 and reproduced as sound waves.

以上に説明したように、第１実施形態では、音響信号ＳIN_L（周波数スペクトルＬA）および音響信号ＳIN_R（周波数スペクトルＲA）の各々に対して個別に処理係数列Ｇeまたは処理係数列Ｇsが乗算されるから、定位成分を強調または抑圧した音響信号ＳOUT（ＳOUT_L，ＳOUT_R）をステレオ形式のまま生成することが可能である。 As described above, in the first embodiment, each of the acoustic signal SIN_L (frequency spectrum LA) and the acoustic signal SIN_R (frequency spectrum RA) is individually multiplied by the processing coefficient sequence Ge or the processing coefficient sequence Gs. Therefore, the acoustic signal SOUT (SOUT_L, SOUT_R) in which the localization component is emphasized or suppressed can be generated in the stereo format.

次に、係数設定部３６の詳細を説明する。図２は、係数設定部３６のブロック図である。図２に示すように、係数設定部３６は、和成分生成部５２と差成分生成部５４と係数列生成部６０とを含んで構成される。和成分生成部５２は、音響信号ＳIN_Lの周波数成分ＬA1(e^jω)〜ＬAK(e^jω)と音響信号ＳIN_Rの周波数成分ＲA1(e^jω)〜ＲAK(e^jω)との加算で単位区間毎に順次に和成分（複素スペクトル）Ｍを生成する。和成分Ｍは、周波数ｆ1〜ｆKの各々に対応する周波数成分Ｍk(e^jω)の系列（Ｍ1(e^jω)〜ＭK(e^jω)）である。数式(3)に示すように、和成分Ｍの周波数ｆkの周波数成分Ｍk(e^jω)は、その周波数ｆkの周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)との加算（複素数）に相当する。したがって、和成分Ｍは、全部の音源からの音響を混合したモノラル形式の信号に相当する。なお、周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)との加重和や平均を和成分Ｍとして算定する構成も採用され得る。

Next, details of the coefficient setting unit 36 will be described. FIG. 2 is a block diagram of the coefficient setting unit 36. As shown in FIG. 2, the coefficient setting unit 36 includes a sum component generation unit 52, a difference component generation unit 54, and a coefficient sequence generation unit 60. The sum component generation unit 52 adds the frequency components LA1 (e ^jω ) to LAK (e ^jω ) of the acoustic signal SIN_L and the frequency components RA1 (e ^jω ) to RAK (e ^jω ) of the acoustic signal SIN_R for each unit interval. A sum component (complex spectrum) M is sequentially generated. The sum component M is a series of frequency components Mk (e ^jω ) (M1 (e ^jω ) to MK (e ^jω )) corresponding to each of the frequencies f1 to fK. As shown in Equation (3), the frequency component Mk (e ^jω ) of the frequency fk of the sum component M is the addition (complex number) of the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ) of the frequency fk. It corresponds to. Therefore, the sum component M corresponds to a monaural signal in which sounds from all sound sources are mixed. A configuration in which a weighted sum or average of the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ) is calculated as the sum component M may be employed.

図２の差成分生成部５４は、音響信号ＳIN_Lの周波数成分ＬA1(e^jω)〜ＬAK(e^jω)と音響信号ＳIN_Rの周波数成分ＲA1(e^jω)〜ＲAK(e^jω)との間の減算で単位区間毎に順次に差成分（複素スペクトル）Ｓを生成する。差成分Ｓは、周波数ｆ1〜ｆKの各々に対応する周波数成分Ｓk(e^jω)の系列（Ｓ1(e^jω)〜ＳK(e^jω)）である。差成分生成部５４は、変数設定部３２が設定した定位変数αを利用した数式(4)の演算（加重減算）で周波数成分Ｓ1(e^jω)〜ＳK(e^jω)を算定する。

2 performs subtraction between the frequency components LA1 (e ^jω ) to LAK (e ^jω ) of the acoustic signal SIN_L and the frequency components RA1 (e ^jω ) to RAK (e ^jω ) of the acoustic signal SIN_R. Thus, a difference component (complex spectrum) S is sequentially generated for each unit interval. The difference component S is a series of frequency components Sk (e ^jω ) (S1 (e ^jω ) to SK (e ^jω )) corresponding to each of the frequencies f1 to fK. The difference component generator 54 calculates the frequency components S1 (e ^jω ) to SK (e ^jω ) by the calculation (weighted subtraction) of Expression (4) using the localization variable α set by the variable setting unit 32.

数式(4)から理解されるように、定位変数αに応じた可変の比率（重み値）にて周波数成分ＬAk(e^jω)から周波数成分ＲAk(e^jω)を減算（逆相加算）することで差成分Ｓの各周波数成分Ｓk(e^jω)が生成される。したがって、差成分Ｓは、音響信号ＳIN（ＳIN_L，ＳIN_R）のうち定位変数αに応じた位置（方向）の定位成分を他の成分に対して相対的に抑圧した信号（すなわち、定位成分以外の成分を相対的に強調した信号）となる。例えば、定位変数αが０．５（中央値）である場合には、中央方向の定位成分（すなわち、振幅および位相が略同等の成分）を抑圧した差成分Ｓが生成される。また、定位変数αが０．５を上回るほど、中央方向に対して右寄りの定位成分が差成分Ｓでは抑圧され、定位変数αが０．５を下回るほど、中央方向に対して左寄りの定位成分が差成分Ｓでは抑圧される。 As it is understood from the formula (4), varying proportions frequency components at (weight value) LAk (e ^{j [omega])} from a frequency component RAk (e ^{j [omega])} subtraction (reverse phase addition) according to the localization variable α Thus, each frequency component Sk (e ^jω ) of the difference component S is generated. Therefore, the difference component S is a signal in which the localization component in the position (direction) corresponding to the localization variable α in the acoustic signal SIN (SIN_L, SIN_R) is relatively suppressed with respect to other components (that is, other than the localization component). Signal with relatively emphasized components). For example, when the localization variable α is 0.5 (median value), a difference component S in which a localization component in the center direction (that is, a component having substantially the same amplitude and phase) is suppressed is generated. Further, as the localization variable α exceeds 0.5, the localization component closer to the center direction is suppressed by the difference component S, and as the localization variable α is less than 0.5, the localization component toward the left side relative to the center direction is suppressed. However, the difference component S is suppressed.

数式(4)の記号ｍａｘ（α，１−α）は、定位変数αまたは変数（１−α）のうちの最大値を意味する。定位変数αは１以下の数値に設定されるから、数式(4)の分子の演算のみでは周波数成分Ｓk(e^jω)のパワー（振幅）が不足する可能性がある。数式(4)のように最大値ｍａｘ（α，１−α）で除算するのは、周波数成分Ｓk(e^jω)のパワーを周波数成分ＬAk(e^jω)や周波数成分ＲAk(e^jω)と同等に維持するためである。 The symbol max (α, 1-α) in Equation (4) means the maximum value of the localization variable α or the variable (1-α). Since the localization variable α is set to a numerical value of 1 or less, there is a possibility that the power (amplitude) of the frequency component Sk (e ^jω ) is insufficient only by the calculation of the numerator of Equation (4). Dividing by the maximum value max (α, 1-α) as in Equation (4) is equivalent to the power of the frequency component Sk (e ^jω ) and the frequency component LAk (e ^jω ) or frequency component RAk (e ^jω ). It is for maintaining.

図２の係数列生成部６０は、和成分生成部５２が生成した和成分Ｍと差成分生成部５４が生成した差成分Ｓとを利用して定位成分の強調用の処理係数列Ｇe（Ｇe[1]〜Ｇe[K]）と定位成分の抑圧用の処理係数列Ｇs（Ｇs[1]〜Ｇs[K]）とを生成する。図２に示すように、係数列生成部６０は、第１生成部６２と第２生成部６４と第３生成部６６と合成部６８とを含んで構成される。 2 uses the sum component M generated by the sum component generation unit 52 and the difference component S generated by the difference component generation unit 54 to process the localization component enhancement processing coefficient sequence Ge (Ge [1] to Ge [K]) and a processing coefficient sequence Gs (Gs [1] to Gs [K]) for suppressing the localization component are generated. As shown in FIG. 2, the coefficient sequence generation unit 60 includes a first generation unit 62, a second generation unit 64, a third generation unit 66, and a synthesis unit 68.

第１生成部６２は、定位成分の強調用の数値列ＣAを和成分Ｍと差成分Ｓとから生成する。数値列ＣAは、周波数ｆ1〜ｆKの各々に対応する係数値（ゲイン）ＣA[k]の系列（ＣA[1]〜ＣA[K]）である。具体的には、第１生成部６２は、以下の数式(5)の演算で数値列ＣAの各係数値ＣA[k]を算定する。

The first generation unit 62 generates a numerical sequence CA for emphasizing the localization component from the sum component M and the difference component S. The numerical sequence CA is a series (CA [1] to CA [K]) of coefficient values (gains) CA [k] corresponding to the frequencies f1 to fK. Specifically, the first generation unit 62 calculates each coefficient value CA [k] of the numerical sequence CA by the calculation of the following mathematical formula (5).

数式(5)の記号Ｐ[k]は、定位成分を強調したパワースペクトルＰのうち周波数ｆkでのパワーを意味する。パワーＰ[k]は、例えば以下の数式(6a)および数式(6b)で算定される。

The symbol P [k] in Equation (5) means the power at the frequency fk in the power spectrum P in which the localization component is emphasized. The power P [k] is calculated by, for example, the following formula (6a) and formula (6b).

数式(6a)から理解されるように、周波数成分Ｍk(e^jω)のパワー|Ｍk(e^jω)|^２が周波数成分Ｓk(e^jω)のパワー|Ｓk(e^jω)|^２を上回る周波数ｆkのパワーＰ[k]は、和成分Ｍのパワー|Ｍk(e^jω)|^２から差成分Ｓのパワー|Ｓk(e^jω)|^２を減算した数値に設定される。すなわち、パワースペクトルＰは、和成分Ｍのパワースペクトルと差成分Ｓのパワースペクトルとの間の減算（スペクトル減算）で生成される。他方、パワー|Ｍk(e^jω)|^２がパワー|Ｓk(e^jω)|^２以下となる周波数ｆkのパワーＰ[k]は、和成分Ｍのパワー|Ｍk(e^jω)|^２と所定の係数（フロアリング係数）βとの乗算値に設定される。以上の説明から理解されるように、数式(6a)および数式(6b)の演算は、和成分Ｍを信号成分（目的音）と仮定するとともに差成分Ｓを雑音成分と仮定した場合に雑音成分を抑圧するためのスペクトル減算（ＳＳ：Spectral Subtraction）に相当する。なお、数式(5)にて振幅Ｐ[k]^1/2を周波数成分Ｍk(e^jω)の振幅|Ｍk(e^jω)|で除算するのは、係数値ＣA[k]を１以下の数値（０≦ＣA[k]≦１）に正規化するためである。 As is understood from the formula (6a), the power of the frequency component ^{Mk (e jω) | Mk (} e jω) | 2 is the power of the frequency component ^{Sk (e jω) | Sk (} e jω) | 2 over the frequency fk The power P [k] is set to a value obtained by subtracting the power | Sk (e ^jω ) | ² of the difference component S from the power | Mk (e ^jω ) | ² of the sum component M. That is, the power spectrum P is generated by subtraction (spectrum subtraction) between the power spectrum of the sum component M and the power spectrum of the difference component S. On the other hand, the power ^| Mk (e jω) | ² is the power ^| Sk (e jω) | Power P [k] of ² or less and comprising frequency fk is the sum component M Power ^| Mk (e jω) | ² a predetermined It is set to the product of the coefficient (flooring coefficient) β. As can be understood from the above description, the calculations of Equation (6a) and Equation (6b) are performed when the sum component M is assumed to be a signal component (target sound) and the difference component S is assumed to be a noise component. This corresponds to spectral subtraction (SS). Note that dividing the amplitude P [k] ^1/2 by the amplitude | Mk (e ^jω ) | of the frequency component Mk (e ^jω ) | in equation (5) is a numerical value of 1 or less. This is for normalization to (0 ≦ CA [k] ≦ 1).

周波数成分Ｓk(e^jω)は定位成分を抑圧した成分であるから、数式(6a)および数式(6b)で算定されるパワーＰ[1]〜Ｐ[K]の系列は、音響信号ＳIN（ＳIN_L，ＳIN_R）の定位成分を強調した成分のパワースペクトルＰとなる。したがって、数式(5)で算定される数値列ＣAの係数値ＣA[1]〜ＣA[K]は、定位変数αに応じた位置の定位成分のパワー（振幅）が大きい周波数ｆkの係数値ＣA[k]ほど１に近い数値となり、定位成分のパワーが小さい周波数ｆkの係数値ＣA[k]ほど０に近い数値となる。 Since the frequency component Sk (e ^jω ) is a component in which the localization component is suppressed, the series of powers P [1] to P [K] calculated by the equations (6a) and (6b) is the acoustic signal SIN (SIN_L , SIN_R), the power spectrum P of the component emphasizing the localization component. Therefore, the coefficient values CA [1] to CA [K] of the numerical value sequence CA calculated by the equation (5) are the coefficient values CA of the frequency fk at which the power (amplitude) of the localization component at the position corresponding to the localization variable α is large. [k] is a numerical value closer to 1, and the coefficient value CA [k] of the frequency fk having a smaller localization component power is a numerical value closer to 0.

以上の説明から理解されるように、第１生成部６２が生成した数値列ＣAを処理係数列Ｇeとして（係数値ＣA[k]を数式(1a)および数式(1b)の係数値Ｇe[k]として）、音響信号ＳIN_Lの周波数成分ＬAk(e^jω)および音響信号ＳIN_Rの周波数成分ＲAk(e^jω)の各々に乗算する構成（以下「対比例」という）でも、定位成分が強調されたステレオ形式の音響信号ＳOUT（ＳOUT_L，ＳOUT_R）を生成することが可能である。 As understood from the above description, the numerical value sequence CA generated by the first generation unit 62 is used as the processing coefficient sequence Ge (the coefficient value CA [k] is expressed as the coefficient value Ge [k] in the equations (1a) and (1b)). ] as), but the acoustic signal SIN_L frequency components LAk (e ^{j [omega])} and respectively multiplying the configuration of an audio signal SIN_R frequency components RAk ^(e jω) (hereinafter referred to as "comparison example"), stereo localization component is emphasized It is possible to generate an acoustic signal SOUT (SOUT_L, SOUT_R) in the form.

しかし、音響信号ＳIN_L（周波数成分ＬAk(e^jω)）と音響信号ＳIN_R（周波数成分ＲAk(e^jω)）とで位相が相違する場合、数式(4)の演算では定位成分が充分に抑圧されず、差成分Ｓに有意なパワーで定位成分が残留する。以上のように差成分Ｓに残留した定位成分は数式(6a)の演算で和成分Ｍから減算される（すなわち、パワースペクトルＰにおける定位成分のパワーが低減される）。したがって、音響信号ＳIN_Lの周波数成分ＬAk(e^jω)と音響信号ＳIN_Rの周波数成分ＲAk(e^jω)との位相差が大きいほど、第１生成部６２が生成する数値列ＣAの係数値ＣA[k]は小さい数値になるという傾向がある。つまり、数値列ＣAを処理係数列Ｇeとして利用する対比例の構成では、定位成分の強調の効果が不足する結果となる。相互に離れた位置に設置された２個の収音機器を利用した収音で音響空間内での反射後の音響（残響音）の音響信号ＳIN（ＳIN_L，ＳIN_R）を生成した場合や、音響信号ＳINに残響効果を付与した場合には、音響信号ＳIN_Lと音響信号ＳIN_Rとの位相差が顕著となるから、対比例における定位成分の強調の不足は特に顕在化する。 However, when the acoustic signal SIN_L (frequency component LAk (e ^jω )) and the acoustic signal SIN_R (frequency component RAk (e ^jω )) are different in phase, the localization component is not sufficiently suppressed in the calculation of Equation (4). The localization component remains with a significant power in the difference component S. As described above, the localization component remaining in the difference component S is subtracted from the sum component M by the calculation of Equation (6a) (that is, the power of the localization component in the power spectrum P is reduced). Thus, as the phase difference of the frequency components LAk and (e ^{j [omega])} and the frequency component of the acoustic signal SIN_R RAk ^(e jω) of the acoustic signal SIN_L large coefficient values CA [k numerical sequence CA that first generation unit 62 generates ] Tends to be a small number. That is, in the comparative configuration using the numerical value sequence CA as the processing coefficient sequence Ge, the effect of enhancing the localization component is insufficient. When the sound signal SIN (SIN_L, SIN_R) of the sound after reflection in the acoustic space (reverberation sound) is generated by sound collection using two sound collection devices installed at positions separated from each other, When the reverberation effect is given to the signal SIN, the phase difference between the acoustic signal SIN_L and the acoustic signal SIN_R becomes significant, and thus the lack of emphasis on the localization component in the comparison becomes particularly obvious.

そこで、第２生成部６４は、音響信号ＳIN_L（周波数成分ＬAk(e^jω)）と音響信号ＳIN_R（周波数成分ＲAk(e^jω)）との位相差に応じた数値列ＣBを生成する。数値列ＣBは、周波数ｆ1〜ｆKの各々に対応する係数値ＣB[k]の系列（ＣB[1]〜ＣB[K]）である。周波数ｆkの周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)との位相差が大きいほど、数値列ＣBのうち周波数ｆkの係数値ＣB[k]は大きい数値に設定される。 Therefore, the second generation unit 64 generates a numerical sequence CB corresponding to the phase difference between the acoustic signal SIN_L (frequency component LAk (e ^jω )) and the acoustic signal SIN_R (frequency component RAk (e ^jω )). The numerical sequence CB is a series (CB [1] to CB [K]) of coefficient values CB [k] corresponding to the frequencies f1 to fK. As the phase difference between the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ) of the frequency fk increases, the coefficient value CB [k] of the frequency fk in the numerical sequence CB is set to a larger value.

図３は、音響信号ＳIN_Lの周波数成分ＬAk(e^jω)と音響信号ＳIN_Rの周波数成分ＲAk(e^jω)と和成分Ｍの周波数成分Ｍk(e^jω)とを複素平面内のベクトル（ただし、各成分の符号はＬA，ＲA，Ｍと略記されている）として表記した模式図である。周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)との角度θが両者の位相差に相当する。なお、図３では、周波数成分ＬAk(e^jω)の振幅|ＬAk(e^jω)|と周波数成分ＲAk(e^jω)の振幅|ＲAk(e^jω)|とが相等しい場合を便宜的に例示した。 3, the frequency component of the acoustic signal SIN_L LAk ^(e jω) and the frequency component Mk (e ^{j [omega])} of the frequency component RAk (e ^{j [omega])} and sum component M of the acoustic signal SIN_R and a vector in the complex plane (where each The component codes are abbreviated as LA, RA, M). The angle θ between the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ) corresponds to the phase difference between the two. In FIG. 3, the amplitude of the frequency component ^LAk (e jω) | exemplifying a case and are equal conveniently ^| LAk (e jω) | and the amplitude of the frequency component ^{RAk (e jω) | RAk (} e jω) .

図３の部分(A)のように周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)とが同位相である場合（θ＝０）、和成分Ｍの振幅|Ｍk(e^jω)|は、周波数成分ＬAk(e^jω)の振幅|ＬAk(e^jω)|と周波数成分ＲAk(e^jω)の振幅|ＲAk(e^jω)|との加算値（以下「振幅和」という）Ａkに一致する（Ａk＝|ＬAk(e^jω)|＋|ＲAk(e^jω)|）。他方、図３の部分(B)および部分(C)に示すように、周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)との位相差（角度θ）が大きいほど、和成分Ｍの振幅|Ｍk(e^jω)|は振幅和Ａkと比較して小さい数値となる。すなわち、振幅和Ａkと和成分Ｍの振幅|Ｍk(e^jω)|との相違が大きいほど（振幅|Ｍk(e^jω)|が振幅和Ａkと比較して小さいほど）、周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)との位相差が大きいという傾向がある。以上の傾向を利用して、第１実施形態の第２生成部６４は、周波数成分ＬAk(e^jω)および周波数成分ＲAk(e^jω)の振幅和Ａkと和成分Ｍの振幅|Ｍk(e^jω)|とを比較した結果に応じて数値列ＣBの各係数値ＣB[k]（ＣB[1]〜ＣB[K]）を決定する。 When the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ) are in phase (θ = 0) as shown in part (A) of FIG. 3, the amplitude | Mk (e ^jω ) | of the sum component M the amplitude of the frequency component ^LAk (e jω) | matches the sum of the (hereinafter referred to as "amplitude ^{sum") Ak | LAk (e jω)} | and the amplitude of the frequency component ^{RAk (e jω) | RAk (} e jω) (Ak = | LAk (e ^jω ) | + | RAk (e ^jω ) |). On the other hand, as shown in part (B) and part (C) of FIG. 3, the larger the phase difference (angle θ) between the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ), The amplitude | Mk (e ^jω ) | is smaller than the amplitude sum Ak. That is, the greater the difference between the amplitude sum Ak and the amplitude | Mk (e ^jω ) | of the sum component M (the smaller the amplitude | Mk (e ^jω ) | is smaller than the amplitude sum Ak), the more the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ) tend to be large. Using the above tendency, the second generation unit 64 of the first embodiment uses the amplitude sum Ak of the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ) and the amplitude | Mk (e ^jω ) | Is determined according to the result of comparison with each coefficient value CB [k] (CB [1] to CB [K]) of the numerical sequence CB.

図４に示すように、第２生成部６４は、振幅算定部６４２と設定部６４４とを含んで構成される。振幅算定部６４２は、周波数成分ＬA1(e^jω)〜ＬAK(e^jω)の各々の振幅|ＬAk(e^jω)｜と、周波数成分ＲA1(e^jω)〜ＲAK(e^jω)の各々の振幅|ＲAk(e^jω)｜と、和成分生成部５２が生成した和成分Ｍの周波数成分Ｍ1(e^jω)〜ＭK(e^jω)の各々の振幅|Ｍk(e^jω)|とを単位区間毎に順次に算定する。設定部６４４は、振幅算定部６４２が算定した各振幅を利用して数値列ＣBの各係数値ＣB[k]を１以上の範囲内で可変に設定する（ＣB[k]≧１）。 As shown in FIG. 4, the second generation unit 64 includes an amplitude calculation unit 642 and a setting unit 644. Amplitude calculating unit 642, each of the amplitudes of the frequency components ^{LA1 (e jω) ~LAK (e} jω) | LAk (e jω) | and, each of the amplitudes of the frequency components ^{RA1 (e jω) ~RAK (e} jω) | RAk (e ^jω ) | and the amplitude | Mk (e ^jω ) | of each frequency component M1 (e ^jω ) to MK (e ^jω ) of the sum component M generated by the sum component generation unit 52 for each unit section Calculate sequentially. The setting unit 644 uses the amplitudes calculated by the amplitude calculation unit 642 to variably set the coefficient values CB [k] of the numerical value sequence CB within a range of 1 or more (CB [k] ≧ 1).

具体的には、設定部６４４は、周波数ｆ1〜ｆKの各々について、振幅|ＬAk(e^jω)｜および振幅|ＲAk(e^jω)｜の振幅和Ａkと和成分Ｍの振幅|Ｍk(e^jω)|との差分ΔＡk（ΔＡk＝|ＬAk(e^jω)｜＋|ＲAk(e^jω)｜−|Ｍk(e^jω)|）を算定し、差分ΔＡkが大きいほど係数値ＣB[k]を大きい数値に設定する。差分ΔＡkが０である場合、設定部６４４は係数値ＣB[k]を例えば１に設定する。設定部６４４が差分ΔＡkから係数値ＣB[k]を決定する具体的な方法は任意であるが、例えば、差分ΔＡkの各数値と係数値ＣB[k]の各数値とを対応させたテーブルから係数値ＣB[k]を検索する方法や、差分ΔＡkを変数として係数値ＣB[k]を定義する演算式（例えば差分ΔＡkに対する増加関数）の演算で係数値ＣB[k]を算定する方法が好適である。以上の説明のように、周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)との位相差が大きい（差分ΔＡkが大きい）ほど、各数値列ＣBの係数値ＣB[k]は大きい数値に設定される。 Specifically, for each of frequencies f1 to fK, setting unit 644 sets amplitude sum Ak of amplitude | LAk (e ^jω ) | and amplitude | RAk (e ^jω ) | and amplitude | Mk (e ^{jω of} sum component M). ) | Is calculated as ΔAk (ΔAk = | LAk (e ^jω ) | + | RAk (e ^jω ) | − | Mk (e ^jω ) |), and the coefficient value CB [k] increases as the difference ΔAk increases. Set to a numeric value. When the difference ΔAk is 0, the setting unit 644 sets the coefficient value CB [k] to 1, for example. A specific method for the setting unit 644 to determine the coefficient value CB [k] from the difference ΔAk is arbitrary. For example, from the table in which each numerical value of the difference ΔAk and each numerical value of the coefficient value CB [k] are associated with each other. There are a method of searching for the coefficient value CB [k] and a method of calculating the coefficient value CB [k] by calculation of an arithmetic expression that defines the coefficient value CB [k] using the difference ΔAk as a variable (for example, an increasing function with respect to the difference ΔAk). Is preferred. As described above, as the phase difference between the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ) increases (the difference ΔAk increases), the coefficient value CB [k] of each numerical value sequence CB increases. Set to

図２の合成部６８は、第１生成部６２が生成した数値列ＣAと第２生成部６４が生成した数値列ＣBとから処理係数列Ｇe（Ｇe[1]〜Ｇe[K]）を生成する。処理係数列Ｇeの周波数ｆkの係数値Ｇe[k]は、例えば、数値列ＣAの周波数ｆkの係数値ＣA[k]と数値列ＣBの周波数ｆkの係数値ＣB[k]との乗算値（Ｇe[k]＝ＣA[k]×ＣB[k]）に設定される。ただし、係数値ＣA[k]と係数値ＣB[k]との乗算値が１を上回る場合、合成部６８は係数値Ｇe[k]を１に設定する。 2 generates a processing coefficient sequence Ge (Ge [1] to Ge [K]) from the numerical sequence CA generated by the first generating unit 62 and the numerical sequence CB generated by the second generating unit 64. To do. The coefficient value Ge [k] of the frequency fk of the processing coefficient sequence Ge is, for example, a multiplication value of the coefficient value CA [k] of the frequency fk of the numerical sequence CA and the coefficient value CB [k] of the frequency fk of the numerical sequence CB ( Ge [k] = CA [k] × CB [k]). However, when the multiplication value of the coefficient value CA [k] and the coefficient value CB [k] exceeds 1, the synthesis unit 68 sets the coefficient value Ge [k] to 1.

以上の説明から理解されるように、定位変数αに応じた位置の定位成分において周波数ｆkの成分が優勢となるほど、または、周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)との位相差が大きいほど、処理係数列Ｇeの係数値Ｇe[k]は大きい数値に設定される。したがって、周波数スペクトルＬAおよび周波数スペクトルＲAの各々に信号処理部３８が処理係数列Ｇeを乗算することで、周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)との位相差が大きい場合でも、定位成分を充分に強調したステレオ形式の音響信号ＳOUT（ＳOUT_L，ＳOUT_R）を生成することが可能である。 As can be understood from the above description, the position of the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ) increases as the component of the frequency fk becomes dominant in the localization component at the position corresponding to the localization variable α. As the phase difference is larger, the coefficient value Ge [k] of the processing coefficient sequence Ge is set to a larger numerical value. Therefore, even when the phase difference between the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ) is large, the signal processing unit 38 multiplies each of the frequency spectrum LA and the frequency spectrum RA by the processing coefficient sequence Ge. It is possible to generate stereo-type acoustic signals SOUT (SOUT_L, SOUT_R) in which the localization component is sufficiently emphasized.

図２の第３生成部６６は、合成部６８が生成した処理係数列Ｇeを利用して定位成分の抑圧用の処理係数列Ｇs（係数値Ｇs[1]〜Ｇs[K]の系列）を生成する。具体的には、第３生成部６６は、数式(7)に示すように、処理係数列Ｇeの各係数値Ｇe[k]を所定値（本形態では１）から減算することで処理係数列Ｇsの各係数値Ｇs[k]を算定する。

The third generation unit 66 in FIG. 2 uses the processing coefficient sequence Ge generated by the synthesis unit 68 to generate a processing coefficient sequence Gs (a sequence of coefficient values Gs [1] to Gs [K]) for suppressing the localization component. Generate. Specifically, as shown in Equation (7), the third generation unit 66 subtracts each coefficient value Ge [k] of the processing coefficient sequence Ge from a predetermined value (1 in this embodiment) to thereby process the processing coefficient sequence. Each coefficient value Gs [k] of Gs is calculated.

数式(7)から理解されるように、第３生成部６６が生成する処理係数列Ｇsの係数値Ｇs[1]〜Ｇs[K]は、定位変数αに応じた位置の定位成分のパワー（振幅）が大きい周波数ｆkの係数値Ｇs[k]ほど０に近い数値となり、定位成分のパワーが小さい周波数ｆkの係数値Ｇs[k]ほど１に近い数値となる。したがって、周波数スペクトルＬAおよび周波数スペクトルＲAの各々に信号処理部３８が処理係数列Ｇsを乗算することで、前述の通り、定位成分を抑圧したステレオ形式の音響信号ＳOUT（ＳOUT_L，ＳOUT_R）が生成される。 As understood from the equation (7), the coefficient values Gs [1] to Gs [K] of the processing coefficient sequence Gs generated by the third generation unit 66 are the powers of the localization components at positions corresponding to the localization variable α ( The coefficient value Gs [k] of the frequency fk having a larger amplitude) is closer to 0, and the coefficient value Gs [k] of the frequency fk having a smaller localization component power is closer to 1. Accordingly, the signal processing unit 38 multiplies each of the frequency spectrum LA and the frequency spectrum RA by the processing coefficient sequence Gs, thereby generating the stereo-type acoustic signal SOUT (SOUT_L, SOUT_R) in which the localization component is suppressed as described above. The

ところで、前述の対比例では、第１生成部６２が生成した数値列ＣAを数式(7)の処理係数列Ｇeとして抑圧用の処理係数列Ｇsを生成する構成が採用され得る（Ｇs[k]＝１−ＣA[k]）。しかし、周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)との位相差が大きいほど数値列ＣAの係数値ＣA[k]は小さい数値になるから、対比例の構成では、周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)との位相差が大きいほど処理係数列Ｇsの係数値Ｇs[k]は大きい数値となる。したがって、定位成分の抑圧が不足するという問題が発生する。他方、第１実施形態では、周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)との位相差が大きいほど係数値Ｇe[k]は大きい数値となるから、周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)との位相差が大きいほど処理係数列Ｇsの係数値Ｇs[k]は小さい数値となる。したがって、周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)との位相差が大きい場合でも、定位成分を充分に抑圧したステレオ形式の音響信号ＳOUT（ＳOUT_L，ＳOUT_R）を生成することが可能である。 By the way, in the above-described comparison, a configuration may be adopted in which the processing coefficient sequence Gs for suppression is generated using the numerical sequence CA generated by the first generation unit 62 as the processing coefficient sequence Ge of Equation (7) (Gs [k]). = 1-CA [k]). However, the larger the phase difference between the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ) is, the smaller the coefficient value CA [k] of the numerical sequence CA becomes. Therefore, in the proportional configuration, the frequency component LAk The larger the phase difference between (e ^jω ) and the frequency component RAk (e ^jω ), the larger the coefficient value Gs [k] of the processing coefficient sequence Gs. Therefore, there arises a problem that the localization component is not sufficiently suppressed. On the other hand, in the first embodiment, the larger the phase difference between the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ), the larger the coefficient value Ge [k] becomes, so the frequency component LAk (e ^jω ) The coefficient value Gs [k] of the processing coefficient sequence Gs becomes a smaller numerical value as the phase difference between the frequency component RAk (e ^jω ) increases. Therefore, even when the phase difference between the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ) is large, it is possible to generate a stereo-type acoustic signal SOUT (SOUT_L, SOUT_R) in which the localization component is sufficiently suppressed. It is.

以上に説明したように、第１実施形態においては、周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)との位相差が処理係数列Ｇeおよび処理係数列Ｇsに反映されるから、周波数成分ＬAk(e^jω)および周波数成分ＲAk(e^jω)の間に位相差がある場合でも、定位成分の強調または抑圧の効果を充分に維持できるという利点がある。 As described above, in the first embodiment, the phase difference between the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ) is reflected in the processing coefficient sequence Ge and the processing coefficient sequence Gs. Even when there is a phase difference between the component LAk (e ^jω ) and the frequency component RAk (e ^jω ), there is an advantage that the effect of emphasizing or suppressing the localization component can be sufficiently maintained.

ところで、特許第３６７０５６２号公報には、ステレオ信号のチャネル間の振幅比に応じた減衰係数ｇi(k)と位相差に応じた減衰係数ｇp(k)との何れかを選択して各チャネルの音響信号に乗算する構成が開示されている。しかし、以上の技術では、振幅比に応じた減衰係数ｇi(k)と位相差に応じた減衰係数ｇp(k)とが択一的に適用されるから、振幅および位相の双方がチャネル間で相違する場合には、定位成分の適切な強調または抑圧が困難であるという問題がある。他方、第１実施形態においては、音響信号ＳIN_L（周波数スペクトルＬA）および音響信号ＳIN_R（周波数スペクトルＲA）から複素スペクトルとして算定された和成分Ｍおよび差成分Ｓが処理係数列Ｇeおよび処理係数列Ｇsの生成に利用されるから、音響信号ＳIN_Lと音響信号ＳIN_Rとの間の振幅差および位相差の双方を反映した処理係数列Ｇeおよび処理係数列Ｇsが生成される。したがって、音響信号ＳIN_Lと音響信号ＳIN_Rとの間で振幅および位相の一方のみが相違する場合に加えて、振幅および位相の双方が相違する場合にも、定位成分を有効に強調または抑圧できるという格別の効果が実現される。 By the way, in Japanese Patent No. 3670562, either an attenuation coefficient gi (k) corresponding to the amplitude ratio between channels of a stereo signal or an attenuation coefficient gp (k) corresponding to a phase difference is selected and each channel is selected. A configuration for multiplying an acoustic signal is disclosed. However, in the above technique, the attenuation coefficient gi (k) according to the amplitude ratio and the attenuation coefficient gp (k) according to the phase difference are alternatively applied. If they are different, there is a problem that it is difficult to properly emphasize or suppress the localization component. On the other hand, in the first embodiment, the sum component M and the difference component S calculated as a complex spectrum from the acoustic signal SIN_L (frequency spectrum LA) and the acoustic signal SIN_R (frequency spectrum RA) are processed coefficient sequence Ge and processing coefficient sequence Gs. Therefore, the processing coefficient sequence Ge and the processing coefficient sequence Gs reflecting both the amplitude difference and the phase difference between the acoustic signal SIN_L and the acoustic signal SIN_R are generated. Therefore, in addition to the case where only one of the amplitude and the phase is different between the acoustic signal SIN_L and the acoustic signal SIN_R, the localization component can be effectively enhanced or suppressed when both the amplitude and the phase are different. The effect of is realized.

＜Ｂ：第２実施形態＞
本発明の第２実施形態を説明する。第１実施形態では、周波数成分ＬAk(e^jω)および周波数成分ＲAk(e^jω)の振幅和Ａkと和成分Ｍの周波数成分Ｍk(e^jω)の振幅|Ｍk(e^jω)|との比較の結果に応じて第２生成部６４が数値列ＣBの各係数値ＣB[k]を設定した。第２実施形態では、周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)との角度θ（すなわち両者の位相差）を算定して係数値ＣB[k]を設定する。なお、以下の各例示において作用や機能が第１実施形態と同等である要素については、第１実施形態と同様の符号を付して各々の詳細な説明を適宜に省略する。 <B: Second Embodiment>
A second embodiment of the present invention will be described. In the first embodiment, the comparison between the amplitude sum Ak of the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ) and the amplitude | Mk (e ^jω ) | of the frequency component Mk (e ^jω ) of the sum component M is performed. The second generation unit 64 sets each coefficient value CB [k] of the numerical sequence CB according to the result. In the second embodiment, the angle θ between the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ) (that is, the phase difference between them) is calculated to set the coefficient value CB [k]. In the following examples, elements having the same functions and functions as those of the first embodiment are denoted by the same reference numerals as those of the first embodiment, and detailed descriptions thereof are appropriately omitted.

図５は、周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)と和成分Ｍの周波数成分Ｍk(e^jω)との関係を示す模式図である。図５の三角形ＴMLは、周波数成分ＬAk(e^jω)および周波数成分Ｍk(e^jω)を２辺とする三角形であり、三角形ＴMRは、周波数成分ＲAk(e^jω)および周波数成分Ｍk(e^jω)を２辺とする三角形である。三角形ＴMLおよび三角形ＴMRの面積Ｓは相等しく、以下の数式(8)で表現される。数式(8)の角度θ1は、周波数成分ＬAk(e^jω)と周波数成分Ｍk(e^jω)との角度であり、角度θ2は、周波数成分ＲAk(e^jω)と周波数成分Ｍk(e^jω)との角度である。

FIG. 5 is a schematic diagram showing the relationship among the frequency component LAk (e ^jω ), the frequency component RAk (e ^jω ), and the frequency component Mk (e ^jω ) of the sum component M. The triangle TML in FIG. 5 is a triangle having a frequency component LAk (e ^jω ) and a frequency component Mk (e ^jω ) as two sides, and the triangle TMR has a frequency component RAk (e ^jω ) and a frequency component Mk (e ^jω ). Is a triangle with two sides. The areas S of the triangle TML and the triangle TMR are equal to each other and are expressed by the following formula (8). The angle θ1 in the equation (8) is an angle between the frequency component LAk (e ^jω ) and the frequency component Mk (e ^jω ), and the angle θ2 is the frequency component RAk (e ^jω ) and the frequency component Mk (e ^jω ) Is the angle.

数式(8)から以下の数式(9a)および数式(9b)が導出される。

From the formula (8), the following formula (9a) and formula (9b) are derived.

他方、周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)と周波数成分Ｍk(e^jω)とを３辺とする三角形の面積Ｓは、三角形ＴMLや三角形ＴMRの面積と相等しく、以下の数式(10)で表現される（ヘロンの公式）。

On the other hand, the area S of the triangle having the frequency component LAk (e ^jω ), the frequency component RAk (e ^jω ), and the frequency component Mk (e ^jω ) as three sides is equal to the areas of the triangle TML and the triangle TMR. It is expressed by formula (10) (Heron formula).

第２生成部６４は、数式(10)の演算で算定される面積Ｓを数式(9a)および数式(9b)に適用することで角度θ1および角度θ2を算定する。そして、第２生成部６４は、角度θ1および角度θ2を利用した以下の数式(11)の演算で数値列ＣBの各係数値ＣB[k]を算定する（θ1＋θ2＜π/２）。

The second generator 64 calculates the angle θ1 and the angle θ2 by applying the area S calculated by the calculation of the formula (10) to the formula (9a) and the formula (9b). Then, the second generation unit 64 calculates each coefficient value CB [k] of the numerical sequence CB by the following equation (11) using the angle θ1 and the angle θ2 (θ1 + θ2 <π / 2).

数式(11)から理解されるように、周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)との位相差（角度(θ1＋θ2)が大きいほど数値列ＣBの係数値ＣB[k]は大きい数値に設定される。なお、数値列ＣAおよび数値列ＣBを利用して処理係数列Ｇeや処理係数列Ｇsを生成する方法は第１実施形態と同様である。 As understood from the equation (11), the phase difference of the frequency components LAk and (e ^{j [omega])} and the frequency component RAk (e ^{j [omega])} (angle (.theta.1 + .theta.2) coefficient of numerical sequence CB larger the value CB [k] is greater The method for generating the processing coefficient string Ge and the processing coefficient string Gs using the numerical value string CA and the numerical value string CB is the same as in the first embodiment.

第２実施形態でも第１実施形態と同様の作用および効果が実現される。第１実施形態や第２実施形態の例示から理解されるように、第２生成部６４は、周波数成分ＬAk(e^jω)の振幅|ＬAk(e^jω)|と周波数成分ＲAk(e^jω)の振幅|ＲAk(e^jω)|と周波数成分Ｍk(e^jω)の振幅|Ｍk(e^jω)|とから数値列ＣBの各係数値ＣB[k]を生成する要素として包括され、各振幅から係数値ＣB[k]を算定する具体的な方法は本発明において任意である。 In the second embodiment, the same operation and effect as in the first embodiment are realized. As it will be understood from the example of the first embodiment and the second embodiment, the second generator 64, the frequency component LAk of (e ^{j [omega])} the amplitude | LAk ^(e jω) | of the frequency component RAk ^(e jω) It is included as an element for generating each coefficient value CB [k] of the numerical sequence CB from the amplitude | RAk (e ^jω ) | and the amplitude | Mk (e ^jω ) | of the frequency component Mk (e ^jω ) | A specific method for calculating the numerical value CB [k] is arbitrary in the present invention.

＜Ｃ：変形例＞
以上の各形態は多様に変形され得る。具体的な変形の態様を以下に例示する。以下の例示から任意に選択された２以上の態様は適宜に併合され得る。 <C: Modification>
Each of the above forms can be variously modified. Specific modifications are exemplified below. Two or more aspects arbitrarily selected from the following examples can be appropriately combined.

（１）変形例１
数値列ＣBの各係数値ＣB[k]を算定する方法は以上の例示に限定されない。例えば、図３に例示した関係から理解されるように、周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)との位相差が大きいほど、和成分Ｍの振幅|Ｍk(e^jω)|に対する振幅和Ａkの相対比Ｑk（Ｑk＝Ａk／|Ｍk(e^jω)|）は大きい数値となるから、相対比Ｑkを数値列ＣBの各係数値ＣB[k]として算定する構成が採用され得る。また、差分ΔＡk（ΔＡk＝Ａk−|Ｍk(e^jω)|）や相対比Ｑkを変数とする所定の関数の演算（写像）で各係数値ＣB[k]を算定する構成も好適である。 (1) Modification 1
The method of calculating each coefficient value CB [k] of the numerical value sequence CB is not limited to the above example. For example, as can be understood from the relationship illustrated in FIG. 3, the larger the phase difference between the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ), the larger the amplitude | Mk (e ^jω ) | Since the relative ratio Qk (Qk = Ak / | Mk (e ^jω ) |) of the amplitude sum Ak with respect to is a large numerical value, a configuration is adopted in which the relative ratio Qk is calculated as each coefficient value CB [k] of the numerical sequence CB. obtain. A configuration in which each coefficient value CB [k] is calculated by calculation (mapping) of a predetermined function using the difference ΔAk (ΔAk = Ak− | Mk (e ^jω ) |) or the relative ratio Qk as a variable is also suitable.

（２）変形例２
以上の各形態では、数値列ＣAと数値列ＣBとを合成した強調用の処理係数列Ｇeから第３生成部６６が抑圧用の処理係数列Ｇsを生成したが、第３生成部６６の代わりに図６の第３生成部６７が処理係数列Ｇsを生成する構成も採用され得る。 (2) Modification 2
In each of the above embodiments, the third generation unit 66 generates the processing coefficient sequence Gs for suppression from the processing coefficient sequence Ge for emphasis obtained by synthesizing the numerical value sequence CA and the numerical value sequence CB. In addition, a configuration in which the third generation unit 67 of FIG. 6 generates the processing coefficient sequence Gs may be employed.

図６の第３生成部６７は、第１演算部６７２と第２演算部６７４とを含んで構成される。第１演算部６７２は、第１生成部６２が生成した数値列ＣAから定位成分の抑圧用の数値列ＣD（係数値ＣD[1]〜ＣD[K]の系列）を算定する。例えば、第１演算部６７２は、前述の数式(7)と同様に、数値列ＣAの係数値ＣA[k]を所定位置（例えば１）から減算することで数値列ＣDの各係数値ＣD[k]（ＣD[k]＝１−ＣA[k]）を算定する。したがって、抑圧用の処理係数列Ｇsと同様に、定位変数αに応じた位置の定位成分のパワーが大きい周波数ｆkの係数値ＣD[k]ほど０に近い数値となる。 The third generation unit 67 of FIG. 6 includes a first calculation unit 672 and a second calculation unit 674. The first calculation unit 672 calculates a numerical sequence CD (sequence of coefficient values CD [1] to CD [K]) for suppressing the localization component from the numerical sequence CA generated by the first generation unit 62. For example, the first calculation unit 672 subtracts the coefficient value CA [k] of the numerical value sequence CA from a predetermined position (for example, 1), similarly to the above-described equation (7), to thereby obtain each coefficient value CD [ k] (CD [k] = 1-CA [k]). Accordingly, like the processing coefficient sequence Gs for suppression, the coefficient value CD [k] of the frequency fk at which the power of the localization component at the position corresponding to the localization variable α is large becomes a numerical value closer to 0.

図６の第２演算部６７４は、周波数成分ＬAk(e^jω)および周波数成分ＲAk(e^jω)の位相差に応じた数値列ＣB'を、第１演算部６７２が生成した数値列ＣDに作用させることで抑圧用の処理係数列Ｇsを生成する。数値列ＣB'は、以上の各形態の数値列ＣBとは逆に、周波数成分ＬAk(e^jω)および周波数成分ＲAk(e^jω)の位相差が大きいほど小さい数値となるように設定された係数値ＣB'[1]〜ＣB'[K]の系列である。例えば、振幅和Ａkに対する和成分Ｍの振幅|Ｍk(e^jω)|の相対比（|Ｍk(e^jω)|／Ａk）が係数値ＣB'[k]として算定され得る。第２演算部６７４は、例えば、数値列ＣDの係数値ＣD[k]と数値列ＣB'の係数値ＣB'[k]との乗算値を処理係数列Ｇsの係数値Ｇs[k]として算定する．図６の構成でも第１実施形態と同様の効果が実現される。 6 operates a numerical sequence CB ′ corresponding to the phase difference between the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ) on the numerical sequence CD generated by the first arithmetic unit 672. By doing so, a processing coefficient sequence Gs for suppression is generated. The numerical value sequence CB ′, contrary to the numerical value sequence CB in each of the above forms, is set so that the smaller the phase difference between the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ), the smaller the value. This is a series of numerical values CB ′ [1] to CB ′ [K]. For example, the relative ratio (| Mk (e ^jω ) | / Ak) of the amplitude | Mk (e ^jω ) | of the sum component M with respect to the amplitude sum Ak can be calculated as the coefficient value CB ′ [k]. For example, the second calculation unit 674 calculates a product value of the coefficient value CD [k] of the numerical value sequence CD and the coefficient value CB '[k] of the numerical value sequence CB ′ as the coefficient value Gs [k] of the processing coefficient sequence Gs. Do it. The same effects as those of the first embodiment are realized even with the configuration of FIG.

（３）変形例３
以上の各形態では、数値列ＣBの全部の係数値ＣB[1]〜ＣB[K]を係数列ＣAの全部の係数値ＣA[1]〜ＣA[K]に作用させて処理係数列Ｇeを生成したが、所定の周波数帯域内の成分に限定して定位成分の強調／抑圧を実行する構成も好適である。例えば、音響信号ＳINのうち人間の音声のパワーが集中する周波数帯域（例えば１００ｋＨｚ〜８ｋＨｚ）についてのみ以上の各形態の処理が実行される（他の帯域については処理せずに再生する）。 (3) Modification 3
In each of the above embodiments, all the coefficient values CB [1] to CB [K] of the numerical value sequence CB are applied to all the coefficient values CA [1] to CA [K] of the coefficient sequence CA, so that the processing coefficient sequence Ge is obtained. However, a configuration in which localization component enhancement / suppression is performed only for components within a predetermined frequency band is also suitable. For example, the processing in each of the above modes is executed only for the frequency band (for example, 100 kHz to 8 kHz) in which the power of human voice is concentrated in the acoustic signal SIN (reproduction is performed without processing for other bands).

また、以下に例示するように、周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)との位相差に起因した定位成分の強調または抑圧の不足が問題となる係数値ＣA[k]に限定して係数値ＣB[k]を作用させる構成も採用され得る。 Further, as illustrated below, the coefficient value CA [k], which is a problem of insufficient emphasis or suppression of the localization component due to the phase difference between the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ), is used. A configuration in which the coefficient value CB [k] is applied in a limited manner may be employed.

図７は、第１生成部６２が生成する数値列ＣAのうち中央方向（α＝０．５）に音像が定位する成分の周波数ｆkに対応する係数値ＣA[k]（縦軸）と定位変数α（横軸）との関係を示す模式図である。図７に示すように、利用者から指示される定位変数αが０．５に近いほど数値列ＣAの各係数列ＣA[k]は１に近い数値に設定される。以上の関係から理解されるように、係数値ＣA[k]が所定の閾値ＣA_THを上回る周波数ｆkの成分は、強調の対象となる定位成分である可能性が高い。 FIG. 7 shows the coefficient value CA [k] (vertical axis) and localization corresponding to the frequency fk of the component in which the sound image is localized in the central direction (α = 0.5) in the numerical sequence CA generated by the first generator 62. It is a schematic diagram which shows the relationship with the variable (alpha) (horizontal axis). As shown in FIG. 7, as the localization variable α instructed by the user is closer to 0.5, each coefficient sequence CA [k] of the numerical sequence CA is set to a value closer to 1. As understood from the above relationship, the component of the frequency fk whose coefficient value CA [k] exceeds the predetermined threshold value CA_TH is highly likely to be a localization component to be emphasized.

そこで、数値列ＣAの係数値ＣA[1]〜ＣA[K]のうち閾値ＣA_THを上回る係数値ＣA[k]に対して限定的に合成部６８が係数値ＣB[k]を乗算して係数値Ｇe[k]を算定する構成が採用され得る。閾値ＣA_THを下回る係数値ＣA[k]は、係数値ＣB[k]が乗算されることなく係数値Ｇe[k]として採択される。以上の構成によれば、Ｋ個の係数値ＣA[1]〜ＣA[K]にＫ個の係数値ＣB[1]〜ＣB[K]を作用させる第１実施形態の構成と比較して、合成部６８による処理の負荷（乗算の回数）が軽減されるという利点がある。 Therefore, the combining unit 68 multiplies the coefficient value CA [k] exceeding the threshold value CA_TH among the coefficient values CA [1] to CA [K] of the numerical sequence CA by multiplying the coefficient value CB [k]. A configuration for calculating the numerical value Ge [k] may be employed. The coefficient value CA [k] below the threshold value CA_TH is adopted as the coefficient value Ge [k] without being multiplied by the coefficient value CB [k]. According to the above configuration, compared with the configuration of the first embodiment in which K coefficient values CB [1] to CB [K] are applied to K coefficient values CA [1] to CA [K], There is an advantage that the processing load (the number of multiplications) by the combining unit 68 is reduced.

また、係数値ＣD[k]（ＣD[k]＝１−ＣA[k]）に係数値ＣB'[k]を作用させて抑圧用の処理係数列Ｇsを生成する図６の構成でも同様に、特定の係数値ＣD[k]に対して限定的に係数値ＣB'[k]を作用させる構成が採用され得る。 Similarly, in the configuration of FIG. 6 in which the coefficient value CB ′ [k] is applied to the coefficient value CD [k] (CD [k] = 1−CA [k]) to generate the processing coefficient sequence Gs for suppression. A configuration in which the coefficient value CB ′ [k] is limitedly applied to the specific coefficient value CD [k] may be employed.

図８は、図６の構成のもとで第３生成部６７の第１演算部６７２が生成する係数列ＣDのうち中央方向（α＝０．５）に音像が定位する成分の周波数ｆkに対応する係数値ＣD[k]（縦軸）と定位変数α（横軸）との関係を示す模式図である。図８に示すように、利用者から指示される定位変数αが０．５に近いほど数値列ＣDの各係数値ＣD[k]は０に近い数値に設定される。以上の関係から理解されるように、係数値ＣD[k]が所定の閾値ＣD_THを下回る周波数ｆkの成分は、抑圧の対象となる定位成分である可能性が高い。 8 shows the frequency fk of the component in which the sound image is localized in the central direction (α = 0.5) in the coefficient sequence CD generated by the first calculation unit 672 of the third generation unit 67 under the configuration of FIG. It is a schematic diagram which shows the relationship between corresponding coefficient value CD [k] (vertical axis) and localization variable (alpha) (horizontal axis). As shown in FIG. 8, the coefficient value CD [k] of the numerical sequence CD is set to a value closer to 0 as the localization variable α instructed by the user is closer to 0.5. As understood from the above relationship, the component of the frequency fk whose coefficient value CD [k] is lower than the predetermined threshold value CD_TH is highly likely to be a localization component to be suppressed.

そこで、数値列ＣDの係数値ＣD[1]〜ＣD[K]のうち閾値ＣD_THを下回る係数値ＣD[k]に対して限定的に図６の第２演算部６７４が係数値ＣB'[k]を乗算して係数値Ｇs[k]を算定する構成が採用され得る。閾値ＣD_THを上回る係数値ＣD[k]は、係数値ＣB'[k]が乗算されることなく係数値Ｇs[k]として採択される。以上の構成によれば、Ｋ個の係数値ＣD[1]〜ＣD[K]にＫ個の係数値ＣB'[1]〜ＣB'[K]を作用させる構成と比較して、第２演算部６７４による処理の負荷（乗算の回数）が軽減されるという利点がある。 Therefore, the second arithmetic unit 674 in FIG. 6 restricts the coefficient value CB ′ [k] for the coefficient value CD [k] below the threshold value CD_TH among the coefficient values CD [1] to CD [K] of the numerical sequence CD. ] To calculate the coefficient value Gs [k]. The coefficient value CD [k] exceeding the threshold value CD_TH is adopted as the coefficient value Gs [k] without being multiplied by the coefficient value CB ′ [k]. According to the above configuration, the second calculation is performed as compared with the configuration in which the K coefficient values CB '[1] to CB' [K] are applied to the K coefficient values CD [1] to CD [K]. There is an advantage that the processing load (number of multiplications) by the unit 674 is reduced.

（４）変形例４
定位成分の周波数ｆkに対応する周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)との位相差φkは、定位変数αに応じた定位成分の方向に応じて変化する。例えば、中央方向（α＝０．５）の定位成分の周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)との位相差φkは０に近い数値となり、定位成分の方向が中央方向から離れる（定位変数αが中央値（０．５）から離れる）ほど位相差φkは増加する。したがって、図９に示すように、定位変数αに対応する位相差φaを含む所定幅の範囲Ａを想定すると、周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)との位相差φkが範囲Ａ内にある周波数ｆkの成分は定位成分に該当する可能性が高い。 (4) Modification 4
The phase difference φk between the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ) corresponding to the localization component frequency fk changes according to the direction of the localization component according to the localization variable α. For example, the phase difference φk between the frequency component LAk (e ^jω ) of the localization component in the central direction (α = 0.5) and the frequency component RAk (e ^jω ) is a value close to 0, and the direction of the localization component is changed from the central direction. The phase difference φk increases as the distance increases (the localization variable α increases from the median (0.5)). Therefore, as shown in FIG. 9, assuming a range A having a predetermined width including the phase difference φa corresponding to the localization variable α, the phase difference φk between the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ) is There is a high possibility that the component of the frequency fk within the range A corresponds to the localization component.

そこで、係数列ＣAの係数値ＣA[1]〜ＣA[K]のうち周波数成分ＬAk(e^jω)と周波数成分ＲAk(e^jω)との位相差φkが範囲Ａ内にある各周波数ｆk（すなわち定位成分の周波数ｆk）の各係数値ＣA[k]に限定して、合成部６８が数値列ＣBの係数値ＣB[k]を作用させる構成も採用され得る。位相差φkが範囲Ａ外にある周波数ｆkの各係数値ＣA[k]は、係数値ＣB[k]が乗算されることなく係数値Ｇe[k]として採択される。図６の構成でも同様であり、位相差φkが範囲Ａ内にある周波数kの各係数値ＣD[k]に対して限定的に係数値ＣB'[k]を乗算する構成が採用され得る。 Therefore, among the coefficient values CA [1] to CA [K] of the coefficient sequence CA, each frequency fk (that is, the phase difference φk between the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ) within the range A (that is, A configuration in which the synthesis unit 68 operates the coefficient value CB [k] of the numerical value sequence CB can be adopted by limiting to each coefficient value CA [k] of the frequency fk) of the localization component. Each coefficient value CA [k] of the frequency fk where the phase difference φk is outside the range A is adopted as the coefficient value Ge [k] without being multiplied by the coefficient value CB [k]. The same applies to the configuration of FIG. 6, and a configuration in which each coefficient value CD [k] of the frequency k having the phase difference φk within the range A is multiplied by the coefficient value CB ′ [k] in a limited manner can be adopted.

（５）変形例５
数値列ＣAと数値列ＣBとから処理係数列Ｇeや処理係数列Ｇsを生成する方法は適宜に変更され得る。例えば、数値列ＣAの各係数値ＣA[k]と数値列ＣBの各係数値ＣB[k]との加算値を処理係数列Ｇeの係数値Ｇe[k]として算定する構成が採用され得る。図６の構成でも同様に、各係数値ＣD[k]と各係数値ＣB'[k]との加算で係数値Ｇs[k]を算定する構成が採用され得る。 (5) Modification 5
The method of generating the processing coefficient sequence Ge and the processing coefficient sequence Gs from the numerical value sequence CA and the numerical value sequence CB can be appropriately changed. For example, a configuration may be adopted in which an addition value of each coefficient value CA [k] of the numerical value sequence CA and each coefficient value CB [k] of the numerical value sequence CB is calculated as the coefficient value Ge [k] of the processing coefficient sequence Ge. Similarly, the configuration of FIG. 6 may employ a configuration in which the coefficient value Gs [k] is calculated by adding each coefficient value CD [k] and each coefficient value CB ′ [k].

（６）変形例６
以上の各形態では、数式(4)から理解されるように、音響信号ＳIN_L（周波数成分ＬAk(e^jω)）と音響信号ＳIN_R（周波数成分ＲAk(e^jω)）との振幅の比率が定位変数αに応じて線形に変化するように差成分Ｓを算定したが、以下の各態様にて例示するように、和成分Ｍおよび差成分Ｓの算定の方法や定位変数αとの関係は適宜に変更される。 (6) Modification 6
In each of the above embodiments, as understood from the equation (4), the ratio of the amplitude of the acoustic signal SIN_L (frequency component LAk (e ^jω )) and the acoustic signal SIN_R (frequency component RAk (e ^jω )) is a localization variable. Although the difference component S is calculated so as to change linearly according to α, as illustrated in the following embodiments, the calculation method of the sum component M and the difference component S and the relationship with the localization variable α are appropriately determined. Be changed.

＜第１態様＞
音響信号ＳIN_Lのパワー|ＬAk(e^jω)|^２および音響信号ＳIN_Rのパワー|ＲAk(e^jω)|^２から和成分Ｍおよび差成分Ｓを算定する構成が採用され得る。具体的には、和成分生成部５２は、以下の数式(12a)の演算で和成分Ｍ（周波数成分Ｍ1(e^jω)〜ＭK(e^jω)で構成される複素スペクトル）を生成し、差成分生成部５４は、以下の数式(12b)の演算で差成分Ｓ（周波数成分Ｓ1(e^jω)〜ＳK(e^jω)で構成される複素スペクトル）を生成する。したがって、差成分Ｓにおいては、音響信号ＳIN_Lおよび音響信号ＳIN_Rの各々のパワーの比率が定位変数αに応じて線形に変化する。なお、数式(12a)および数式(12b)における記号ｅ^jLは周波数スペクトルＬAの位相スペクトルを意味し、記号ｅ^jRは周波数スペクトルＲAの位相スペクトルを意味する。

<First aspect>
^LAk (e jω) | | ² and the power of the acoustic signal ^SIN_R | RAk (e jω) | power of the acoustic signal SIN_L configured to calculate the ² from the sum component M and the difference component S may be employed. Specifically, the sum component generation unit 52 generates a sum component M (complex spectrum composed of frequency components M1 (e ^jω ) to MK (e ^jω )) by the calculation of the following formula (12a), and the difference The component generation unit 54 generates a difference component S (a complex spectrum composed of frequency components S1 (e ^jω ) to SK (e ^jω )) by calculation of the following formula (12b). Therefore, in the difference component S, the power ratio between the acoustic signal SIN_L and the acoustic signal SIN_R changes linearly according to the localization variable α. Symbols e ^jL in equation (12a) and Equation (12b) means a phase spectrum of the frequency spectrum LA, symbol e ^jR means the phase spectrum of the frequency spectrum RA.

以上の方法で和成分Ｍおよび差成分Ｓが生成されると、係数列生成部６０の第１生成部６２は、定位成分を強調したパワースペクトルＰ（パワーＰ[k]）を以下の数式(13a)および数式(13b)の演算で生成し、パワースペクトルＰを利用した数式(14)の演算で処理係数列Ｇeの各係数値Ｇe[k]を算定する。第３生成部６６が処理係数列Ｇeから処理係数列Ｇsを生成する方法は第１実施形態（数式(7)）と同様である。以上の構成でも第１実施形態と同様の効果が実現される。

When the sum component M and the difference component S are generated by the above method, the first generation unit 62 of the coefficient sequence generation unit 60 generates a power spectrum P (power P [k]) in which the localization component is emphasized by the following formula ( The coefficient values Ge [k] of the processing coefficient sequence Ge are calculated by the calculation of the formula (14) using the power spectrum P, which is generated by the calculations of 13a) and (13b). The method by which the third generation unit 66 generates the processing coefficient sequence Gs from the processing coefficient sequence Ge is the same as in the first embodiment (Formula (7)). With the above configuration, the same effect as that of the first embodiment is realized.

＜第２態様＞
定位成分の位置（方向）を定位変数αの関数ｆ(α)に応じて制御する構成が採用され得る。具体的には、和成分生成部５２は、以下の数式(15a)の演算で和成分Ｍ（周波数成分Ｍ1(e^jω)〜ＭK(e^jω)）を生成し、差成分生成部５４は、以下の数式(15b)の演算で差成分Ｓ（周波数成分Ｓ1(e^jω)〜ＳK(e^jω)）を生成する。したがって、差成分Ｓにおける音響信号ＳIN_Lおよび音響信号ＳIN_Rの各々の振幅の比率が定位変数αの関数ｆ(α)に応じて変化する。係数列生成部６０は、以上の各形態と同様の方法で、数式(15a)の和成分Ｍと数式(15b)の差成分Ｓとから処理係数列Ｇeおよび処理係数列Ｇsを算定する。

<Second aspect>
A configuration in which the position (direction) of the localization component is controlled according to the function f (α) of the localization variable α can be adopted. Specifically, the sum component generation unit 52 generates the sum component M (frequency components M1 (e ^jω ) to MK (e ^jω )) by the calculation of the following formula (15a), and the difference component generation unit 54 The difference component S (frequency components S1 (e ^jω ) to SK (e ^jω )) is generated by the calculation of the following formula (15b). Therefore, the ratio of the amplitudes of the acoustic signal SIN_L and the acoustic signal SIN_R in the difference component S changes according to the function f (α) of the localization variable α. The coefficient sequence generation unit 60 calculates the processing coefficient sequence Ge and the processing coefficient sequence Gs from the sum component M of the formula (15a) and the difference component S of the formula (15b) by the same method as each of the above embodiments.

In Equation (6a), the difference between the power of the sum component M | Mk (e ^jω ) | ² and the power of the difference component S | Sk (e ^jω ) | ² is calculated as the power P [k]. As shown in the equation (6c), a configuration is also employed in which the difference between the amplitude | Mk (e ^jω ) | of the sum component M and the amplitude | Sk (e ^jω ) | of the difference component S is calculated as the amplitude P [k]. obtain. In the configuration in which the amplitude P [k] is calculated using the equation (6c), the first generation unit 62 calculates the coefficient value Ge [k] of the processing coefficient sequence Ge by the calculation of the equation (5a).

（７）変形例７
以上の各形態では、パワースペクトルの減算（数式(6a)，数式(13a)）や振幅スペクトルの減算（数式(6c)）で処理係数列Ｇeを算定したが、処理係数列Ｇeを算定する方法は任意である。例えば、和成分Ｍに含まれる定位成分を信号成分と仮定するとともに差成分Ｓを雑音成分と仮定すると、和成分Ｍおよび差成分Ｓを利用した処理係数列Ｇeの生成には、雑音成分（差成分Ｓ）を抑圧して信号成分（定位成分）を強調するための数値列（処理係数列Ｇe）を生成する公知の音声強調の技術を同様に適用することが可能である。 (7) Modification 7
In each of the above embodiments, the processing coefficient sequence Ge is calculated by subtraction of the power spectrum (Formula (6a), Formula (13a)) or subtraction of the amplitude spectrum (Formula (6c)), but a method of calculating the processing coefficient sequence Ge Is optional. For example, if the localization component included in the sum component M is assumed to be a signal component and the difference component S is assumed to be a noise component, the generation of the processing coefficient sequence Ge using the sum component M and the difference component S may include a noise component (difference). A known speech enhancement technique for generating a numerical sequence (processing coefficient sequence Ge) for emphasizing a signal component (localization component) by suppressing the component S) can be similarly applied.

処理係数列Ｇeの生成に適用され得る技術としては、ウィナーフィルタ（Wiener filter）を利用した音声強調や、ＭＭＳＥ-ＳＴＳＡ法またはＭＡＰ（maximum a posteriori estimation）推定法を利用した音声強調の技術が例示され得る。ＭＭＳＥ-ＳＴＳＡ法については、Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator", IEEE ASSP, vol.ASSP-32, no.6, p.1109-1121, Dec. 1984に開示され、ＭＡＰ推定法については、T. Lotter and P. Vary, "Speech enhancement by MAP spectral amplitude estimation using a Super-Gaussian speech model", EURASIP Journal on Applied Signal Processing, vol.2005, no,7, p.1110-1126, July 2005に開示されている。 Examples of technologies that can be applied to the generation of the processing coefficient sequence Ge include speech enhancement using a Wiener filter and speech enhancement using an MMSE-STSA method or a MAP (maximum a posteriori estimation) estimation method. Can be done. For the MMSE-STSA method, see Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator", IEEE ASSP, vol.ASSP-32, no.6, p.1109- 1121, Dec. 1984, MAP estimation method is described in T. Lotter and P. Vary, "Speech enhancement by MAP spectral amplitude estimation using a Super-Gaussian speech model", EURASIP Journal on Applied Signal Processing, vol.2005 , no, 7, p.1110-1126, July 2005.

（８）変形例８
以上の各形態では、処理係数列Ｇeを利用した演算（数式(7)）で処理係数列Ｇsを生成したが、係数列生成部６０が処理係数列Ｇeの生成と同様の方法で和成分Ｍおよび差成分Ｓから直接的に処理係数列Ｇsを生成する構成も採用され得る。具体的には、処理係数列Ｇeの生成について以上の例示した各数式における和成分Ｍと差成分Ｓとを相互に置換すれば、定位成分の抑圧用の処理係数列Ｇsを生成することが可能である。すなわち、以上の各態様における処理係数列Ｇeと処理係数列Ｇsとは相互に置換され得る。 (8) Modification 8
In each of the above embodiments, the processing coefficient sequence Gs is generated by the calculation (Formula (7)) using the processing coefficient sequence Ge. However, the coefficient sequence generation unit 60 performs the sum component M in the same manner as the generation of the processing coefficient sequence Ge. A configuration in which the processing coefficient sequence Gs is directly generated from the difference component S can also be adopted. Specifically, when the sum component M and the difference component S in each of the above-exemplified formulas are replaced with each other to generate the processing coefficient sequence Ge, the processing coefficient sequence Gs for suppressing the localization component can be generated. It is. That is, the processing coefficient sequence Ge and the processing coefficient sequence Gs in each of the above aspects can be replaced with each other.

ただし、処理係数列Ｇeを利用して処理係数列Ｇsを算定する第１実施形態の構成によれば、和成分Ｍや差成分Ｓから直接的に処理係数列Ｇsを算定する処理が不要である。したがって、係数列生成部６０による処理の負荷が軽減されるという利点がある。なお、和成分Ｍおよび差成分Ｓから直接的に処理係数列Ｇsを生成し、処理係数列Ｇsの各係数値Ｇs[k]を数式(7)と同様に所定値（例えば１）から減算することで処理係数列Ｇe（係数値Ｇe[k]）を生成する構成も採用され得る。 However, according to the configuration of the first embodiment in which the processing coefficient sequence Gs is calculated using the processing coefficient sequence Ge, processing for directly calculating the processing coefficient sequence Gs from the sum component M and the difference component S is not necessary. . Therefore, there is an advantage that the processing load by the coefficient sequence generator 60 is reduced. A processing coefficient sequence Gs is directly generated from the sum component M and the difference component S, and each coefficient value Gs [k] of the processing coefficient sequence Gs is subtracted from a predetermined value (for example, 1) in the same manner as the equation (7). Thus, a configuration for generating the processing coefficient sequence Ge (coefficient value Ge [k]) may be employed.

（９）変形例９
以上の各形態では、処理係数列Ｇ（Ｇe，Ｇs）を単位区間毎に生成したが、処理係数列Ｇの生成の周期は任意である。例えば、所定個の単位区間の集合を周期として係数列生成部６０が順次に生成した処理係数列Ｇを、信号処理部３８が当該周期内の複数の単位区間について適用する構成も採用され得る。また、以上の各形態では、各単位区間について生成された処理係数列Ｇをその単位区間の周波数スペクトル（ＬA，ＲA）の処理に適用したが、各単位区間の処理係数列Ｇをその単位区間の経過後の各単位区間の周波数スペクトル（ＬA，ＲA）の処理に適用する構成も採用され得る。 (9) Modification 9
In each of the above embodiments, the processing coefficient sequence G (Ge, Gs) is generated for each unit section, but the generation cycle of the processing coefficient sequence G is arbitrary. For example, a configuration in which the signal processing unit 38 applies the processing coefficient sequence G sequentially generated by the coefficient sequence generation unit 60 with a set of a predetermined number of unit intervals as a cycle may be adopted for a plurality of unit intervals in the cycle. In each of the above embodiments, the processing coefficient sequence G generated for each unit section is applied to the processing of the frequency spectrum (LA, RA) of the unit section. However, the processing coefficient sequence G of each unit section is used as the unit section. A configuration applied to the processing of the frequency spectrum (LA, RA) of each unit section after elapse of time can also be adopted.

（１０）変形例１０
以上の各形態では、処理係数列Ｇ（Ｇe，Ｇs）を音響信号ＳINの周波数スペクトル（ＬA，ＲA）に乗算したが、信号処理部３８による処理の内容は適宜に変更される。例えば周波数成分ＬAk(e^jω)および周波数成分ＲAk(e^jω)から処理係数列Ｇ（Ｇe，Ｇs）を減算することで音響信号ＳOUT（ＳOUT_L，ＳOUT_R）を生成する構成が採用され得る。以上の構成における強調用の処理係数列Ｇeは、第１実施形態の処理係数列Ｇeとは逆に、定位成分のパワーが大きい周波数ｆkの係数値Ｇe[k]ほど０に近い数値に設定される。抑圧用の処理係数列Ｇsについても同様であり、第１実施形態の処理係数列Ｇsとは逆に、定位成分のパワーが大きい周波数ｆkの係数値Ｇs[k]ほど１に近い数値に設定される。 (10) Modification 10
In each of the above embodiments, the processing coefficient sequence G (Ge, Gs) is multiplied by the frequency spectrum (LA, RA) of the acoustic signal SIN, but the content of processing by the signal processing unit 38 is appropriately changed. For example, a configuration in which the acoustic signal SOUT (SOUT_L, SOUT_R) is generated by subtracting the processing coefficient sequence G (Ge, Gs) from the frequency component LAk (e ^jω ) and the frequency component RAk (e ^jω ) may be employed. In contrast to the processing coefficient sequence Ge of the first embodiment, the processing coefficient sequence Ge for emphasis in the above configuration is set to a value closer to 0 as the coefficient value Ge [k] of the frequency fk where the power of the localization component is large. The The same applies to the processing coefficient sequence Gs for suppression. Contrary to the processing coefficient sequence Gs of the first embodiment, the coefficient value Gs [k] of the frequency fk having a large localization component power is set to a value closer to 1. The

（１１）変形例１１
以上の各形態においては定位成分の強調用の処理係数列Ｇeおよび抑圧用の処理係数列Ｇsの双方を係数列生成部６０が生成したが、係数列生成部６０が処理係数列Ｇeおよび処理係数列Ｇsの一方のみを生成する構成も採用され得る。したがって、例えば、以上の各形態の第３生成部６６は省略され得る。 (11) Modification 11
In each of the above embodiments, the coefficient sequence generation unit 60 generates both the processing coefficient sequence Ge for emphasizing the localization component and the processing coefficient sequence Gs for suppression, but the coefficient sequence generation unit 60 generates the processing coefficient sequence Ge and the processing coefficient. A configuration that generates only one of the columns Gs may also be employed. Therefore, for example, the third generation unit 66 of each of the above forms can be omitted.

（１２）変形例１２
以上の各形態における係数β（フロアリング係数）を、例えば入力装置１６に対する利用者からの指示に応じて可変に設定する構成も好適である。係数βが小さい（０に近い）ほど定位成分の抑圧の度合が強化されるとともに定位成分の強調の度合が低下する。なお、係数βが大きい場合には、抑圧用の処理係数列Gsのうち定位成分以外の大部分の周波数ｆkの係数値Ｇs[k]が充分に小さい数値となるから、音響信号ＳOUT（ＳOUT_L，ＳOUT_R）の音量が不足する可能性がある。そこで、係数βを大きい数値に設定して定位成分の抑圧を実行する場合、音響信号ＳOUTの音量を増加させる構成が好適である。同様に、係数βが小さい場合には、強調用の処理係数列Ｇeの大部分の周波数ｆkの係数値Ｇe[k]が充分に小さい数値となり得るから、係数βを小さい数値に設定して定位成分の強調を実行する場合、音響信号ＳOUTの音量を増加させる構成が好適に採用される。 (12) Modification 12
A configuration in which the coefficient β (flooring coefficient) in each of the above forms is variably set in accordance with an instruction from the user to the input device 16, for example, is also suitable. As the coefficient β is smaller (closer to 0), the degree of suppression of the localization component is strengthened and the degree of enhancement of the localization component is reduced. When the coefficient β is large, the coefficient value Gs [k] of most of the frequencies fk other than the localization component in the processing coefficient sequence Gs for suppression becomes a sufficiently small value, so that the acoustic signal SOUT (SOUT_L, SOUT_R) may be insufficient. Therefore, when the localization component is suppressed by setting the coefficient β to a large numerical value, a configuration in which the volume of the acoustic signal SOUT is increased is preferable. Similarly, when the coefficient β is small, the coefficient value Ge [k] of most of the frequencies fk in the emphasis processing coefficient string Ge can be a sufficiently small numerical value. When emphasizing the component, a configuration that increases the volume of the acoustic signal SOUT is preferably employed.

１００……音響処理装置、１２……信号供給装置、１４……放音装置、１６……入力装置、２２……演算処理装置、２４……記憶装置、３２……変数設定部、３４……周波数分析部、３６……係数設定部、３８……信号処理部、４０……波形合成部、５２……和成分生成部、５４……差成分生成部、６０……係数列生成部、６２……第１生成部、６４……第２生成部、６６……第３生成部、６８……合成部。
DESCRIPTION OF SYMBOLS 100 ... Sound processing device, 12 ... Signal supply device, 14 ... Sound emission device, 16 ... Input device, 22 ... Arithmetic processing device, 24 ... Memory | storage device, 32 ... Variable setting part, 34 ... Frequency analysis unit 36... Coefficient setting unit 38... Signal processing unit 40... Waveform synthesis unit 52 52 Sum component generation unit 54. Difference component generation unit 60 60 Coefficient sequence generation unit 62 ... First generation unit 64. Second generation unit 66. Third generation unit 68.

Claims

First generation means configured to generate a first numerical sequence that is composed of coefficient values for each frequency and emphasizes or suppresses a localization component at a specific position in the stereo first acoustic signal and the second acoustic signal;
A second numerical sequence composed of coefficient values corresponding to a phase difference for each frequency between the first acoustic signal and the second acoustic signal is represented by both the amplitude of the first acoustic signal and the amplitude of the second acoustic signal. Second generation means for generating from the amplitude of the sum component of
Synthesis means for generating a first processing coefficient sequence composed of coefficient values for each frequency and emphasizing or suppressing the localization component from the first numerical value sequence and the second numerical value sequence;
An acoustic processing apparatus comprising: signal processing means for causing each coefficient value of the first processing coefficient sequence to act on each frequency component of the first acoustic signal and the second acoustic signal.

The second generation means includes
Amplitude calculating means for calculating, for each frequency, an amplitude sum obtained by adding the amplitudes of the first acoustic signal and the second acoustic signal and the amplitude of the sum component;
The sound processing apparatus according to claim 1, further comprising: a setting unit that sets each coefficient value of the second numerical sequence according to a difference between the amplitude sum and the amplitude of the sum component.

Sum component generating means for generating a sum component by adding the first acoustic signal and the second acoustic signal;
Difference component generation means for generating a difference component in which the localization component is suppressed by subtraction between the first acoustic signal and the second acoustic signal;
The first generation means generates the first numerical sequence from the sum component and the difference component,
The sound processing apparatus according to claim 1, wherein the second generation unit applies the amplitude of the sum component generated by the sum component generation unit to generation of the second numerical value sequence.

The acoustic processing apparatus according to claim 3, wherein the first generation unit generates the first numerical value sequence according to a result of subtracting the other spectrum from one spectrum of the sum component and the difference component.

A second processing coefficient sequence corresponding to the other of the localization component enhancement and suppression is generated by subtracting each coefficient value of the first processing coefficient sequence corresponding to one of the localization component enhancement and suppression from a predetermined value. The sound processing apparatus according to claim 1, further comprising third generation means.