JP2006324786A

JP2006324786A - Acoustic signal processing apparatus and method

Info

Publication number: JP2006324786A
Application number: JP2005144331A
Authority: JP
Inventors: Naoyuki Kato; 直行加藤
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2005-05-17
Filing date: 2005-05-17
Publication date: 2006-11-30

Abstract

<P>PROBLEM TO BE SOLVED: To provide an acoustic signal processing apparatus and method that perform reproduction without variance in sense of a low-pitched sound obtained by adding harmonics. <P>SOLUTION: A fundamental wave extracting filter 12 extracts a fundamental wave signal from an input acoustic signal inputted to an input terminal 10. A harmonic generating means 13 generates harmonic signals of the extracted fundamental wave signal. The respective generated harmonic signals have gains controlled by a gain control means 14 and are put together. A masking quantity detecting means 15 detects masking quantities by specified bands based upon the composite signal generated by the gain control means 14 and the input acoustic signal. A correcting means 16 sets a correction quantity based upon the detected masking quantities by the specified bands so that a masking phenomenon is suppressed. Then frequency amplitude characteristics of the composite signal are corrected in accordance with the correction quantity. <P>COPYRIGHT: (C)2007,JPO&INPIT

Description

本発明は、音響信号処理装置およびその方法に関し、より特定的には、低音域に属する基本波の倍音を音響信号に付加することによって、再生音における低音感の向上を図る音響信号処理装置およびその方法に関する。 The present invention relates to an acoustic signal processing apparatus and method thereof, and more specifically, an acoustic signal processing apparatus and method for improving bass feeling in reproduced sound by adding harmonics of a fundamental wave belonging to a low frequency range to an acoustic signal, and It relates to that method.

一般的に小型スピーカでは、低音の再生が困難である。そこで従来において、再生困難な低音に代えて、その倍音を再生させる方法が提案されている（例えば特許文献１参照１）。以下、図１２を参照して、従来の音響信号処理装置９について説明する。図１２は、従来の音響信号処理装置９の構成を示すブロック図である。 In general, it is difficult to reproduce low sounds with a small speaker. In view of this, conventionally, a method of reproducing harmonics in place of bass that is difficult to reproduce has been proposed (see, for example, Patent Document 1). Hereinafter, a conventional acoustic signal processing device 9 will be described with reference to FIG. FIG. 12 is a block diagram showing a configuration of a conventional acoustic signal processing device 9.

図１２において、音響信号処理装置９は、入力端子９０、加算器９１、基本波抽出フィルタ９２、倍音生成手段９３、ゲイン調整手段９４および出力端子９５を備える。入力端子９０には、音響信号が入力される。入力端子９０に入力された音響信号は、２系統に分岐する。１系統目においては、音響信号の全成分が加算器９１の一方の入力部に入力される。２系統目においては、音響信号の全成分が基本波抽出フィルタ９２に入力される。 In FIG. 12, the acoustic signal processing device 9 includes an input terminal 90, an adder 91, a fundamental wave extraction filter 92, a harmonic overtone generation unit 93, a gain adjustment unit 94, and an output terminal 95. An acoustic signal is input to the input terminal 90. The acoustic signal input to the input terminal 90 is branched into two systems. In the first system, all components of the acoustic signal are input to one input unit of the adder 91. In the second system, all components of the acoustic signal are input to the fundamental wave extraction filter 92.

基本波抽出フィルタ９２は、予め設定された特性をもつローパスフィルタおよびバンドパスフィルタなどで構成される。基本波抽出フィルタ９２は、その予め設定された特性に基づいて、入力信号の全成分の中から基本波を成分とする信号（以下、基本波信号という）を抽出する。抽出された基本波信号は、倍音生成手段９３に入力される。 The fundamental wave extraction filter 92 includes a low-pass filter and a band-pass filter having preset characteristics. The fundamental wave extraction filter 92 extracts a signal having a fundamental wave as a component (hereinafter referred to as a fundamental wave signal) from all components of the input signal based on the preset characteristics. The extracted fundamental wave signal is input to the overtone generation means 93.

倍音生成手段９３は、第２倍音生成手段９３１、第３倍音生成手段９３２、…、第ｎ（ｎは２以上の自然数）倍音生成手段９３３を備える。第２倍音生成手段９３１は、入力された基本波信号の第２倍音信号を生成する。第３倍音生成手段９３２は、入力された基本波信号の第３倍音信号を生成する。つまり、ｉ（ｉは２からｎまでの任意の自然数）を用いて表現すれば、第ｉ倍音生成手段は、入力された基本波信号の第ｉ倍音信号を生成することとなる。生成された各倍音信号のゲインは、基本波信号と同じゲインである。そして、各倍音信号はゲイン調整手段９４に入力される。なお、基本波信号の周波数（以下、基本周波数という）のｎ倍の周波数である倍音信号を、第ｎ倍音信号と呼ぶとする。 The harmonic overtone generation unit 93 includes a second overtone generation unit 931, a third overtone generation unit 932,..., An nth (n is a natural number of 2 or more) overtone generation unit 933. The second overtone generation unit 931 generates a second overtone signal of the input fundamental wave signal. The third harmonic generation unit 932 generates a third harmonic signal of the input fundamental wave signal. In other words, if expressed using i (i is an arbitrary natural number from 2 to n), the i th harmonic generation means generates the i th harmonic signal of the input fundamental wave signal. The gain of each generated harmonic signal is the same as that of the fundamental wave signal. Each harmonic signal is input to the gain adjusting means 94. A harmonic signal having a frequency n times the frequency of the fundamental wave signal (hereinafter referred to as a fundamental frequency) is referred to as an nth harmonic signal.

ここで、倍音信号を生成する方法はいくつか存在する。そのうち、ゼロクロス法について図１３を参照して説明する。図１３は、ゼロクロス法による倍音信号の生成方法を模式的に示す図である。以下、基本波信号を図１３（ａ）に示す正弦波として考える。 Here, there are several methods for generating a harmonic signal. Of these, the zero cross method will be described with reference to FIG. FIG. 13 is a diagram schematically showing a method of generating a harmonic signal by the zero cross method. Hereinafter, the fundamental wave signal is considered as a sine wave shown in FIG.

図１３（ａ）に示す正弦波の波形において、信号レベルが正から負へ、あるいは負から正へ変化する点をゼロクロス点とする。ここでは、負から正へのゼロクロス点を点Ｐ１、点Ｐ２および点Ｐ３とする。第２倍音信号を生成する場合には、ゼロクロス点区間（区間Ｐ１−Ｐ２、区間Ｐ２−Ｐ３）において、時間軸方向について元の波形（基本波信号の波形）を１／２に圧縮する。そして、その１／２に圧縮した波形を２回繰り返して再生する。その結果、図１３（ｂ）に示すように、生成される信号は周波数が２倍の信号となる。他次数の倍音（第３倍音、第４倍音、…、第ｎ倍音）信号についても上記方法で生成される。つまり、第ｎ倍音信号は、上記ゼロクロス点区間において、時間軸方向について基本波信号の波形を１／ｎに圧縮し、ｎ回繰り返して再生することで生成される。 In the sine wave waveform shown in FIG. 13A, a point at which the signal level changes from positive to negative or from negative to positive is defined as a zero cross point. Here, zero cross points from negative to positive are defined as point P1, point P2, and point P3. In the case of generating the second overtone signal, the original waveform (waveform of the fundamental wave signal) is compressed by half in the time axis direction in the zero cross point section (section P1-P2, section P2-P3). Then, the waveform compressed to ½ is reproduced twice. As a result, as shown in FIG. 13B, the generated signal is a signal having a double frequency. Other order harmonics (third harmonic, fourth harmonic,..., Nth harmonic) signals are also generated by the above method. That is, the nth harmonic signal is generated by compressing the waveform of the fundamental wave signal to 1 / n in the time axis direction and reproducing it repeatedly n times in the zero cross point section.

ゲイン調整手段９４は、第２倍音ゲイン調整手段９４１、第３倍音ゲイン調整手段９４２、…、第ｎ倍音ゲイン調整手段９４３および加算器９４４を備える。ゲイン調整手段９４は、入力された各倍音信号が予め設定された倍音構成「第２倍音、第３倍音、…、第ｎ倍音＝ｘ２（ｄＢ）、ｘ３（ｄＢ）、…、ｘｎ（ｄＢ）」となるように、当該各倍音信号のゲインを調整する。なお、ｘ２〜ｘｎは整数とする。加算器９４４は、ゲインが調整された各倍音信号を１つの信号に加算合成する。加算合成された信号は、加算器９１の他方の入力部に入力される。加算器９１は、加算器９４４において加算合成された信号と１系統目の音響信号とを加算合成して、出力端子９５へ出力する。出力端子９５から出力される信号は、最終的にスピーカで再生される。 The gain adjusting unit 94 includes a second overtone gain adjusting unit 941, a third overtone gain adjusting unit 942,..., An nth overtone gain adjusting unit 943, and an adder 944. The gain adjusting means 94 has a harmonic structure in which each input harmonic signal is set in advance “second harmonic, third harmonic,..., N th harmonic = x2 (dB), x3 (dB),..., Xn (dB) The gain of each overtone signal is adjusted so that X2 to xn are integers. The adder 944 adds and synthesizes each overtone signal whose gain is adjusted to one signal. The added and synthesized signal is input to the other input unit of the adder 91. The adder 91 adds and synthesizes the signal added and synthesized by the adder 944 and the first system acoustic signal, and outputs the resultant signal to the output terminal 95. The signal output from the output terminal 95 is finally reproduced by a speaker.

このように、音響信号に倍音を付加した再生を行うと、バーチャル・ピッチ効果と呼ばれる聴覚現象が生じる。この聴覚現象によって、基本波信号があたかも再生されているように聞こえる。つまり、従来の音響信号処理装置では、使用するスピーカが低音再生の困難な小型スピーカであっても、スピーカの再生帯域を変化させることなく低音感の向上を図ることができる。
特開２００４−１０１７９７号公報 As described above, when reproduction is performed by adding overtones to an acoustic signal, an auditory phenomenon called a virtual pitch effect occurs. This auditory phenomenon makes it sound as if the fundamental signal is being reproduced. That is, in the conventional acoustic signal processing apparatus, even if the speaker to be used is a small speaker that is difficult to reproduce with low sound, it is possible to improve the low sound feeling without changing the reproduction band of the speaker.
JP 2004-101797 A

ここで、原音（音響信号の再生音）と倍音の聞こえ方について考える。上述の聴覚現象は、倍音がユーザに聞こえることで生じる。しかしながら、例えば音量の観点から見ると、実際には、原音の音量が大きいとき、倍音は聞き取りにくくなる。逆に、原音の音量が小さければ、倍音が聞き取りやすくなる。これは、原音が倍音をマスクしているために起こる現象である。つまり、原音をマスカー、倍音をマスキーとしたマスキング現象（周波数的な現象および時間的な現象を含む）が起こっている。周波数的なマスキング現象においては、例えば、マスキーの周波数がマスカーの周波数と近いほど、マスカーによってマスクされる量（以下、マスキング量という）が大きいという定性的な性質がある。また、マスカーの音量が大きいほど、マスカーによるマスキング量が大きいという性質もある。したがって、例えば原音の音量が大きいときには、倍音は原音によって大きくマスクされ、聞き取りにくくなる。このように、上記従来の音響信号処理装置では、マスキング現象によって倍音の聞き取りやすさが変化し、上記低音感にばらつきが生じるという問題があった。 Here, how to hear the original sound (reproduced sound of the acoustic signal) and overtones is considered. The above-mentioned auditory phenomenon occurs when the user can hear overtones. However, from the viewpoint of volume, for example, when the volume of the original sound is high, it is difficult to hear overtones. Conversely, if the volume of the original sound is low, overtones are easy to hear. This is a phenomenon that occurs because the original sound masks overtones. That is, a masking phenomenon (including a frequency phenomenon and a temporal phenomenon) occurs in which the original sound is a masker and the overtone is a maskee. In the frequency masking phenomenon, for example, there is a qualitative property that the amount masked by the masker (hereinafter referred to as masking amount) is larger as the masky frequency is closer to the masker frequency. In addition, there is a property that the masking amount by the masker increases as the volume of the masker increases. Therefore, for example, when the volume of the original sound is high, the overtone is greatly masked by the original sound, making it difficult to hear. As described above, the conventional acoustic signal processing apparatus has a problem in that the ease of overtones changes due to the masking phenomenon, and the low-pitched feeling varies.

それ故に、本発明の目的は、倍音を付加して得られる低音感にばらつきがない再生が可能な音響信号処理装置およびその方法を提供することである。 SUMMARY OF THE INVENTION Therefore, an object of the present invention is to provide an acoustic signal processing apparatus and method capable of reproducing the bass sound obtained by adding overtones without variation.

第１の発明は、入力音響信号から基本波信号を抽出する抽出手段と、抽出手段において抽出された基本波信号の倍音信号を複数種類生成し、当該複数種類の倍音信号が合成された信号を供給する供給手段と、入力音響信号が各倍音信号をマスクするマスキング量を検出する検出手段と、合成信号に含まれる各倍音信号のゲインが検出されたマスキング量に応じて大きくなるように、当該合成信号を補正する補正手段と、補正手段において補正された合成信号と入力音響信号とを加算する加算手段とを備える、音響信号処理装置である。 According to a first aspect of the present invention, there is provided an extraction means for extracting a fundamental wave signal from an input acoustic signal, a plurality of types of harmonic signals of the fundamental wave signal extracted by the extraction means, and a signal obtained by synthesizing the plurality of types of harmonic signals. Supply means for supplying, detection means for detecting a masking amount for masking each overtone signal by the input acoustic signal, and the gain of each overtone signal included in the synthesized signal so as to increase according to the detected masking amount An acoustic signal processing apparatus comprising: a correcting unit that corrects a combined signal; and an adding unit that adds the combined signal corrected by the correcting unit and an input acoustic signal.

第２の発明は、上記第１の発明において、検出手段は、供給手段において供給された合成信号と入力音響信号とに基づいて、マスキング量を所定周波数帯域毎に検出し、補正手段は、前記合成信号に含まれる倍音信号毎に、倍音信号の周波数を含む所定周波数帯域について検出されたマスキング量に応じて当該倍音信号のゲインを補正することを特徴とする。 In a second aspect based on the first aspect, the detecting means detects a masking amount for each predetermined frequency band based on the synthesized signal and the input acoustic signal supplied by the supplying means, and the correcting means For each overtone signal included in the synthesized signal, the gain of the overtone signal is corrected according to the masking amount detected for a predetermined frequency band including the frequency of the overtone signal.

第３の発明は、上記第１の発明において、検出手段は、入力音響信号のレベルをマスキング量として検出し、補正手段は、検出手段において検出された入力音響信号のレベルが大きくなるにつれて合成信号のゲインが大きくなるように当該合成信号のゲインを補正することを特徴とする。 In a third aspect based on the first aspect, the detecting means detects the level of the input acoustic signal as a masking amount, and the correcting means is a synthesized signal as the level of the input acoustic signal detected by the detecting means increases. The gain of the synthesized signal is corrected so that the gain of the signal becomes larger.

第４の発明は、上記第３の発明において、補正手段は、入力音響信号のレベルが所定レベル以上となる範囲内において、当該入力音響信号のレベルが大きくなるにつれて合成信号のゲインが大きくなるように当該合成信号のゲインを補正することを特徴とする。 In a fourth aspect based on the third aspect, the correction means increases the gain of the composite signal as the level of the input acoustic signal increases within a range where the level of the input acoustic signal is equal to or higher than a predetermined level. And correcting the gain of the combined signal.

第５の発明は、上記第３の発明において、補正手段は、入力音響信号のレベルが所定レベル以下となる範囲内において、当該入力音響信号のレベルが大きくなるにつれて合成信号のゲインが大きくなるように当該合成信号のゲインを補正することを特徴とする。 In a fifth aspect based on the third aspect, the correction means increases the gain of the composite signal as the level of the input acoustic signal increases within a range where the level of the input acoustic signal is equal to or lower than a predetermined level. And correcting the gain of the combined signal.

第６の発明は、上記第３の発明において、検出手段は、所定周波数帯域における入力音響信号のレベルを検出することを特徴とする。 In a sixth aspect based on the third aspect, the detection means detects the level of the input acoustic signal in a predetermined frequency band.

第７の発明は、上記第６の発明において、所定周波数帯域が各倍音信号の周波数を含む周波数帯域であることを特徴とする。 According to a seventh aspect, in the sixth aspect, the predetermined frequency band is a frequency band including the frequency of each overtone signal.

第８の発明は、入力音響信号から基本波信号を抽出する抽出ステップと、抽出ステップにおいて抽出された基本波信号の倍音信号を複数種類生成し、当該複数種類の倍音信号が合成された信号を供給する供給ステップと、入力音響信号が各倍音信号をマスクするマスキング量を検出する検出ステップと、合成信号に含まれる各倍音信号のゲインが検出されたマスキング量に応じて大きくなるように、当該合成信号を補正する補正ステップと、補正ステップにおいて補正された合成信号と入力音響信号とを加算する加算ステップとを含む、音響信号処理方法である。 According to an eighth aspect of the present invention, there is provided an extraction step for extracting a fundamental wave signal from an input acoustic signal, a plurality of types of overtone signals of the fundamental wave signal extracted in the extraction step, and a signal obtained by synthesizing the plurality of types of overtone signals. A supply step for supplying, a detection step for detecting a masking amount by which the input acoustic signal masks each harmonic signal, and a gain for each harmonic signal included in the synthesized signal so as to increase according to the detected masking amount. An acoustic signal processing method including a correction step of correcting a combined signal and an adding step of adding the combined signal corrected in the correction step and an input acoustic signal.

上記第１の発明によれば、検出されたマスキング量に応じて各倍音信号のゲインが大きくなるように合成信号を補正することで、入力音響信号が各倍音信号をマスクする現象（マスキング現象）を抑制することができる。これにより、倍音を付加することによって得られる低音感にばらつきがない再生を実現することができる。 According to the first aspect of the invention, the phenomenon that the input acoustic signal masks each harmonic signal (masking phenomenon) by correcting the synthesized signal so that the gain of each harmonic signal is increased according to the detected masking amount. Can be suppressed. Thereby, the reproduction | regeneration which does not have dispersion | variation in the low pitch feeling obtained by adding a harmonic can be implement | achieved.

上記第２の発明によれば、各倍音信号のゲインを個別に補正することが可能となる。つまり、複数種類ある倍音信号のうち、例えば１種類の倍音信号の周波数を含む所定周波数帯域のみマスキング量が大きいとき、その１種類の倍音信号のゲインのみを大きくする補正が可能となる。これにより、実際のマスキング現象にしたがった、より自然な、低音感にばらつきがない再生を実現することができる。 According to the second aspect, it is possible to individually correct the gain of each harmonic signal. In other words, among a plurality of types of overtone signals, for example, when the masking amount is large only in a predetermined frequency band including the frequency of one type of overtone signal, it is possible to correct only the gain of that one type of overtone signal. As a result, it is possible to realize reproduction that is more natural and has no variation in the low-pitched sound according to the actual masking phenomenon.

上記第３の発明によれば、入力音響信号のレベルを検出し、当該入力音響信号のレベルに応じて合成信号のゲインを補正する。すなわち、入力音響信号のレベルをマスキング量として等価的に扱って合成信号のゲインを補正する。これにより、実際のマスキング量を検出するための複雑な回路が不要となり、より簡易で、より小規模な装置を提供することができる。 According to the third aspect, the level of the input sound signal is detected, and the gain of the synthesized signal is corrected according to the level of the input sound signal. That is, the gain of the synthesized signal is corrected by equivalently treating the level of the input acoustic signal as a masking amount. This eliminates the need for a complicated circuit for detecting the actual masking amount, and can provide a simpler and smaller apparatus.

上記第４の発明によれば、入力音響信号のレベルが所定レベル以上となる範囲内において、合成信号の補正が行われる。これにより、例えば所定レベルを装置の安定的な動作電圧などに設定することで、より安定的な補正を行うことができる。 According to the fourth aspect of the invention, the composite signal is corrected within a range where the level of the input acoustic signal is equal to or higher than the predetermined level. Thereby, for example, a more stable correction can be performed by setting a predetermined level to a stable operating voltage of the apparatus.

上記第５の発明によれば、入力音響信号のレベルが所定レベルより大きな範囲内において、合成信号のゲインが過大になることを防ぐことができる。つまり、各倍音信号が過大出力されることを防ぐことができる。 According to the fifth aspect of the invention, it is possible to prevent the gain of the synthesized signal from becoming excessive when the level of the input acoustic signal is larger than the predetermined level. That is, it is possible to prevent each harmonic signal from being excessively output.

上記第６の発明によれば、合成信号の補正に対して所定周波数帯域の重み付けが可能となり、より効率的にマスキング現象を抑制する補正を行うことができる。 According to the sixth aspect, weighting of a predetermined frequency band is possible for correction of the composite signal, and correction for suppressing the masking phenomenon can be performed more efficiently.

上記第７の発明によれば、マスカー（入力音響信号）とマスキー（各倍音信号）の周波数が近いほどマスキング量が大きいというマスキング現象の性質に即した補正を行うことができる。その結果、より効率的にマスキング現象を抑制することができる。 According to the seventh aspect of the invention, it is possible to perform correction in accordance with the property of the masking phenomenon that the masking amount increases as the frequencies of the masker (input acoustic signal) and the maskee (each harmonic signal) are closer. As a result, the masking phenomenon can be suppressed more efficiently.

上記第第８の発明は、上記第１の発明と同様の効果を有する。 The eighth invention has the same effect as the first invention.

（第１の実施形態）
以下、図１を参照して、第１の実施形態に係る音響信号処理装置について説明する。図１は、第１の実施形態に係る音響信号処理装置１の構成を示すブロック図である。図１において、音響信号処理装置１は、入力端子１０、加算器１１、基本波抽出フィルタ１２、倍音生成手段１３、ゲイン調整手段１４、マスキング量検出手段１５、補正手段１６、および出力端子１７を備える。 (First embodiment)
Hereinafter, the acoustic signal processing device according to the first embodiment will be described with reference to FIG. FIG. 1 is a block diagram illustrating a configuration of an acoustic signal processing device 1 according to the first embodiment. In FIG. 1, the acoustic signal processing apparatus 1 includes an input terminal 10, an adder 11, a fundamental wave extraction filter 12, a harmonic overtone generation unit 13, a gain adjustment unit 14, a masking amount detection unit 15, a correction unit 16, and an output terminal 17. Prepare.

入力端子１０には、音響信号が入力される。入力された音響信号は、加算器１１の一方の入力部、基本波抽出フィルタ１２、およびマスキング量検出手段１５にそれぞれ入力される。 An acoustic signal is input to the input terminal 10. The input acoustic signal is input to one input unit of the adder 11, the fundamental wave extraction filter 12, and the masking amount detection means 15.

基本波抽出フィルタ１２は、予め設定された特性をもつローパスフィルタまたはバンドパスフィルタなどで構成される。基本波抽出フィルタ１２は、その予め設定された特性に基づいて、音響信号の全成分の中から低音域に属する基本波信号を抽出する。抽出された基本波信号は、倍音生成手段１３に入力される。なお、抽出される基本波信号の周波数帯域は、所望の低音が含まれる帯域である。つまり、倍音信号によって疑似再生させる低音の周波数帯域である。 The fundamental wave extraction filter 12 is configured by a low-pass filter or a band-pass filter having preset characteristics. The fundamental wave extraction filter 12 extracts a fundamental wave signal belonging to the low frequency range from all the components of the acoustic signal based on the preset characteristics. The extracted fundamental wave signal is input to the overtone generation means 13. Note that the frequency band of the extracted fundamental wave signal is a band including a desired bass. That is, it is a low frequency band to be reproduced in a pseudo manner by a harmonic signal.

倍音生成手段１３は、第２倍音生成手段１３１、第３倍音生成手段１３２、…、および第ｎ（ｎは２以上の自然数）倍音生成手段１３３を備える。第２倍音生成手段１３１は、入力された基本波信号の第２倍音信号を生成する。第３倍音生成手段１３２は、入力された基本波信号の第３倍音信号を生成する。つまり、ｉ（ｉは２からｎまでの任意の自然数）を用いて表現すれば、第ｉ倍音生成手段は、入力された基本波信号の第ｉ倍音信号を生成することとなる。第２〜第ｎ倍音信号のゲインは、基本波信号のゲインと同じである。そして、第２〜第ｎ倍音信号は、ゲイン調整手段１４に入力される。なお、倍音信号の生成方法としては、例えば上述したゼロクロス法を用いる。 The harmonic overtone generation unit 13 includes a second overtone generation unit 131, a third overtone generation unit 132,..., And an nth (n is a natural number of 2 or more) overtone generation unit 133. The second overtone generating means 131 generates a second overtone signal of the input fundamental wave signal. The third harmonic generation unit 132 generates a third harmonic signal of the input fundamental wave signal. In other words, if expressed using i (i is an arbitrary natural number from 2 to n), the i th harmonic generation means generates the i th harmonic signal of the input fundamental wave signal. The gains of the second to nth overtone signals are the same as the gain of the fundamental wave signal. Then, the second to nth overtone signals are input to the gain adjusting means 14. For example, the above-described zero cross method is used as a method for generating a harmonic signal.

ゲイン調整手段１４は、第２倍音ゲイン調整手段１４１、第３倍音ゲイン調整手段１４２、…、第ｎ倍音ゲイン調整手段１４３、および加算器１４４を備える。ゲイン調整手段１４は、入力された各倍音信号が予め設定された倍音構成「第２倍音、第３倍音、…、第ｎ倍音＝ｘ２（ｄＢ）、ｘ３（ｄＢ）、…、ｘｎ（ｄＢ）」となるように、当該各倍音信号のゲインを調整する。なお、ｘ２〜ｘｎは整数とする。具体的には、ゲイン調整手段１４は、所定の調整量で各倍音信号のゲインを調整する。所定の調整量とは、倍音信号毎に予め設定されたゲインの調整量である。各倍音信号は、ゲイン調整手段１４において所定の調整量でゲインを調整されることで、上記倍音構成となる。例えば、倍音生成手段１３において第２〜第４倍音信号が生成されるとする。また、基本波信号のゲインを「０ｄＢ」とし、予め設定された倍音構成を「第２倍音、第３倍音、第４倍音＝０（ｄＢ）、−３（ｄＢ）、−６（ｄＢ）」とする。この場合、ゲイン調整手段１４に入力される第２〜第４倍音信号のゲインは、基本波信号のゲインと同じである。つまり、第２〜第４倍音信号のゲインは「０（ｄＢ）」である。したがって、第２〜第４倍音信号が予め設定された倍音構成となるためには、各倍音信号を所定の調整量「第２倍音、第３倍音、第４倍音＝０（ｄＢ）、−３（ｄＢ）、−６（ｄＢ）」でゲイン調整すればよい。ゲインが調整された各倍音信号は、加算器１４４において１つの信号に加算合成される。加算合成された各倍音信号の合成信号は、マスキング量検出手段１５および補正手段１６に入力される。 The gain adjusting unit 14 includes a second overtone gain adjusting unit 141, a third overtone gain adjusting unit 142,..., An nth overtone gain adjusting unit 143, and an adder 144. The gain adjusting means 14 is configured to set a harmonic overtone in which each harmonic overtone signal is set in advance “second overtone, third overtone,..., Nth overtone = x2 (dB), x3 (dB),..., Xn (dB) The gain of each overtone signal is adjusted so that X2 to xn are integers. Specifically, the gain adjusting unit 14 adjusts the gain of each overtone signal by a predetermined adjustment amount. The predetermined adjustment amount is a gain adjustment amount set in advance for each harmonic signal. Each harmonic signal is adjusted in gain by a predetermined adjustment amount by the gain adjusting unit 14 to have the above-described harmonic structure. For example, it is assumed that the second to fourth overtone signals are generated in the overtone generation means 13. Further, the gain of the fundamental wave signal is “0 dB”, and the preset harmonic structure is “second harmonic, third harmonic, fourth harmonic = 0 (dB), −3 (dB), −6 (dB)”. And In this case, the gains of the second to fourth harmonic signals input to the gain adjusting unit 14 are the same as the gain of the fundamental wave signal. That is, the gain of the second to fourth overtone signals is “0 (dB)”. Therefore, in order for the second to fourth overtone signals to have a preset overtone structure, each harmonic overtone signal has a predetermined adjustment amount “second overtone, third overtone, fourth overtone = 0 (dB), −3 (DB), −6 (dB) ”may be used for gain adjustment. Each overtone signal whose gain has been adjusted is added and synthesized by the adder 144 into one signal. The synthesized signal of each overtone signal added and synthesized is input to the masking amount detection means 15 and the correction means 16.

マスキング量検出手段１５は、入力端子１０に入力された音響信号と各倍音信号の合成信号とに基づいて、音響信号によるマスキング量（音響信号が倍音信号をマスクする量）を検出する。なお、音響信号がマスカーであり、倍音信号がマスキーである。マスキング量検出手段１５は、図２に示すように、音響信号分析手段１５１、倍音信号分析手段１５２、および帯域別マスキング量検出手段１５３を備える。図２は、マスキング量検出手段１５の内部構成例を示すブロック図である。 The masking amount detection means 15 detects the masking amount by the acoustic signal (the amount by which the acoustic signal masks the harmonic signal) based on the acoustic signal input to the input terminal 10 and the synthesized signal of each harmonic signal. The acoustic signal is a masker, and the harmonic signal is a maskee. As shown in FIG. 2, the masking amount detection means 15 includes an acoustic signal analysis means 151, a harmonic signal analysis means 152, and a band-specific masking amount detection means 153. FIG. 2 is a block diagram showing an example of the internal configuration of the masking amount detection means 15.

図２において、音響信号分析手段１５１は、入力端子１０から入力される音響信号のレベルを所定の周波数帯域（以下、所定帯域という）毎に分析する。音響信号分析手段１５１による分析の結果を示す情報（分析情報）は、帯域別マスキング量検出手段１５３に入力される。なお、所定帯域は、例えば臨界帯域とする。臨界帯域とは、人間の聴覚特性に基づいた周波数帯域である。具体的に言えば、個々の信号音が聴感上識別できず、個々の信号音の大きさが加算されて聞こえる周波数帯域である。倍音信号分析手段１５２には、ゲイン調整手段１４から合成信号が入力される。倍音信号分析手段１５２は、合成信号に含まれる各倍音信号のレベルを所定帯域毎に分析する。倍音信号分析手段１５２による分析結果を示す分析情報は、帯域別マスキング量検出手段１５３に入力される。なお、音響信号分析手段１５１および倍音信号分析手段１５２における分析方法としては、例えばＦＦＴ（高速フーリエ変換）により周波数振幅特性を求めた後に、信号レベルとして、上記所定帯域毎にその帯域内の信号のパワーを算出する方法がある。また例えば、上記所定帯域を通過帯域とするバンドパスフィルタを通過させた後、信号レベルとして、その出力された信号のパワーを測定する方法などもある。 In FIG. 2, the acoustic signal analysis means 151 analyzes the level of the acoustic signal input from the input terminal 10 for each predetermined frequency band (hereinafter referred to as a predetermined band). Information (analysis information) indicating the result of analysis by the acoustic signal analysis means 151 is input to the band-specific masking amount detection means 153. The predetermined band is, for example, a critical band. The critical band is a frequency band based on human auditory characteristics. Specifically, it is a frequency band in which individual signal sounds cannot be discerned from the sense of hearing and are audible by adding the magnitudes of the individual signal sounds. The synthesized signal is input from the gain adjusting unit 14 to the harmonic signal analyzing unit 152. The harmonic signal analyzing means 152 analyzes the level of each harmonic signal included in the synthesized signal for each predetermined band. The analysis information indicating the analysis result by the harmonic signal analysis unit 152 is input to the band-specific masking amount detection unit 153. As an analysis method in the acoustic signal analysis means 151 and the harmonic signal analysis means 152, for example, after obtaining the frequency amplitude characteristic by FFT (Fast Fourier Transform), the signal level is obtained as a signal level for each predetermined band. There is a way to calculate power. Further, for example, there is a method of measuring the power of the output signal as a signal level after passing through a band pass filter having the predetermined band as a pass band.

帯域別マスキング量検出手段１５３は、音響信号分析手段１５１および倍音信号分析手段１５２において分析された各分析情報に基づいて、所定帯域毎にマスキング量を検出する。なお、所定帯域のマスキング量は、当該所定帯域と同じ帯域において分析された各分析情報に基づいて検出される。検出された情報は、補正手段１６に入力される。マスキング量を検出する方法としては、２つの代表的な方法が知られている。第１の検出方法としては、帯域性雑音と純音とによるマスキング量を検出する方法が挙げられる。この場合には、音響信号を帯域性雑音と、倍音信号を純音とそれぞれ仮定する。第２の検出方法としては、純音同士によるマスキング量を検出する方法が挙げられる。この場合には、音響信号および倍音信号をともに純音と仮定する。このように、上記各検出方法によって、マスキング量が検出される。なお、マスキング量は、上記検出方法以外の他の検出方法を用いて検出されてもよい。 The band-specific masking amount detection means 153 detects the masking amount for each predetermined band based on the analysis information analyzed by the acoustic signal analysis means 151 and the harmonic signal analysis means 152. Note that the masking amount of the predetermined band is detected based on each analysis information analyzed in the same band as the predetermined band. The detected information is input to the correction means 16. As a method for detecting the masking amount, two typical methods are known. As a first detection method, there is a method for detecting a masking amount due to band noise and pure tone. In this case, it is assumed that the acoustic signal is band noise and the harmonic signal is a pure tone. As a second detection method, there is a method of detecting a masking amount between pure tones. In this case, both the acoustic signal and the harmonic signal are assumed to be pure tones. In this way, the masking amount is detected by the above detection methods. The masking amount may be detected using a detection method other than the above detection method.

補正手段１６は、マスキング量検出手段１５において検出された所定帯域毎のマスキング量に基づいて、各倍音信号のゲインが大きくなるように、ゲイン調整手段１４から入力された合成信号の周波数振幅特性を補正する。なお、当該合成信号は、上述したように各倍音信号が加算合成された信号である。したがって、合成信号の周波数振幅特性を補正することで、各倍音信号のゲインをそれぞれ個別に補正することができる。補正手段１６は、図３に示すように、補正量設定手段１６１および周波数振幅特性補正手段１６２を備える。図３は、補正手段１６の内部構成例を示すブロック図である。 Based on the masking amount for each predetermined band detected by the masking amount detection unit 15, the correction unit 16 adjusts the frequency amplitude characteristics of the composite signal input from the gain adjustment unit 14 so that the gain of each harmonic signal increases. to correct. Note that the synthesized signal is a signal obtained by adding and synthesizing each overtone signal as described above. Accordingly, the gain of each harmonic signal can be individually corrected by correcting the frequency amplitude characteristic of the synthesized signal. As shown in FIG. 3, the correction unit 16 includes a correction amount setting unit 161 and a frequency amplitude characteristic correction unit 162. FIG. 3 is a block diagram showing an example of the internal configuration of the correction means 16.

図３において、補正量設定手段１６１は、検出された所定帯域毎のマスキング量に基づいて、音響信号によるマスキング現象を抑制するように補正量を設定する。なお、当該補正量は、所定帯域毎に設定される。ここで、補正量の設定例について説明する。例えば所定帯域「１００Ｈｚ〜２００Ｈｚ」においてマスキング量が「＋６ｄＢ」と検出されたとする。マスキング量が「＋６ｄＢ」ということは、再生段階において音響信号が倍音信号を「＋６ｄＢ」マスクするということである。したがって、この音響信号によるマスキング現象を抑制するには、所定帯域「１００Ｈｚ〜２００Ｈｚ」内にある倍音信号のゲインを例えば「＋６ｄＢ」増幅させればよい。つまり、当該所定帯域においては、補正量を「＋６ｄＢ」と設定すればよい。このように、補正量設定手段１６１は、所定帯域毎のマスキング量に基づいて、所定帯域毎に補正量を設定する。なお、上記数値例では、マスキング量「＋６ｄＢ」に対して補正量「＋６ｄＢ」として、マスキング量を完全に打ち消すように設定したがこれに限定されない。上記補正量がマスキング量を完全に打ち消さない値に設定されても一定の効果を有する。 In FIG. 3, the correction amount setting unit 161 sets the correction amount so as to suppress the masking phenomenon due to the acoustic signal based on the detected masking amount for each predetermined band. The correction amount is set for each predetermined band. Here, an example of setting the correction amount will be described. For example, it is assumed that the masking amount is detected as “+6 dB” in a predetermined band “100 Hz to 200 Hz”. The masking amount “+6 dB” means that the acoustic signal masks the harmonic signal “+6 dB” in the reproduction stage. Therefore, in order to suppress the masking phenomenon due to the acoustic signal, the gain of the harmonic signal in the predetermined band “100 Hz to 200 Hz” may be amplified by “+6 dB”, for example. That is, the correction amount may be set to “+6 dB” in the predetermined band. As described above, the correction amount setting means 161 sets the correction amount for each predetermined band based on the masking amount for each predetermined band. In the above numerical example, the masking amount is set to be completely canceled as the correction amount “+6 dB” with respect to the masking amount “+6 dB”, but the present invention is not limited to this. Even if the correction amount is set to a value that does not completely cancel the masking amount, there is a certain effect.

周波数振幅特性補正手段１６２は、補正量設定手段１６１において設定された所定帯域毎の補正量にしたがって、倍音信号生成手段１３から入力された各倍音信号の合成信号の周波数振幅特性を補正する。ここで、合成信号の周波数振幅特性を補正する方法についての一例を説明する。周波数振幅特性補正手段１６２に倍音信号生成手段１３から合成信号が入力される。入力された合成信号は、ＦＦＴによって所定時間幅で切り出され、周波数振幅特性に変換される。そして、合成信号の周波数振幅特性は、所定帯域毎の補正量にしたがって、重み付けされる。つまり、各倍音信号は、それぞれに対応するマスキング量に基づいたゲインで増幅される。重み付けされた合成信号は、逆ＦＦＴによって、再び時間特性に変換される。このように、周波数振幅特性補正手段１６２は、所定帯域毎の補正量にしたがって、合成信号の周波数振幅特性を補正する。なお、周波数振幅特性を補正する方法としては、上記以外の方法であってもよい。例えばＩＩＲフィルタ、ＦＩＲフィルタを多段備えたものに合成信号を入力する。そして、設定された補正量に応じてフィルタ係数を切り替えることで、合成信号の周波数振幅特性を補正する方法もある。 The frequency amplitude characteristic correcting unit 162 corrects the frequency amplitude characteristic of the synthesized signal of each harmonic signal input from the harmonic signal generating unit 13 according to the correction amount for each predetermined band set by the correction amount setting unit 161. Here, an example of a method for correcting the frequency-amplitude characteristic of the synthesized signal will be described. The synthesized signal is input from the harmonic signal generating unit 13 to the frequency amplitude characteristic correcting unit 162. The input composite signal is cut out with a predetermined time width by FFT and converted into a frequency amplitude characteristic. The frequency amplitude characteristic of the synthesized signal is weighted according to the correction amount for each predetermined band. That is, each overtone signal is amplified with a gain based on the corresponding masking amount. The weighted synthesized signal is converted again into a time characteristic by inverse FFT. As described above, the frequency amplitude characteristic correcting unit 162 corrects the frequency amplitude characteristic of the synthesized signal according to the correction amount for each predetermined band. Note that a method other than the above may be used as a method of correcting the frequency amplitude characteristic. For example, the composite signal is input to a multi-stage IIR filter and FIR filter. There is also a method of correcting the frequency amplitude characteristic of the synthesized signal by switching the filter coefficient in accordance with the set correction amount.

補正手段１６において補正された合成信号は、加算器１１の他方の入力部に入力される。加算器１１は、合成信号と入力端子１０から入力された音響信号とを加算合成して、出力端子１７へ出力する。出力端子１７から出力された信号は、最終的にスピーカで再生される。 The combined signal corrected by the correcting unit 16 is input to the other input unit of the adder 11. The adder 11 adds and synthesizes the synthesized signal and the acoustic signal input from the input terminal 10 and outputs the resultant signal to the output terminal 17. The signal output from the output terminal 17 is finally reproduced by a speaker.

次に、図４に示すフローチャートを参照して、本実施形態に係る音響信号処理装置１の動作の流れについて説明する。図４は、本実施形態に係る音響信号処理装置１の動作の流れを示すフローチャートである。 Next, with reference to the flowchart shown in FIG. 4, the operation | movement flow of the acoustic signal processing apparatus 1 which concerns on this embodiment is demonstrated. FIG. 4 is a flowchart showing an operation flow of the acoustic signal processing apparatus 1 according to the present embodiment.

まず、入力端子１０に音響信号が入力される（ステップＳ１）。基本波抽出フィルタ１２において入力音響信号から基本波信号が抽出される（ステップＳ２）。以下、ステップＳ３〜Ｓ７の処理説明のために、例えば基本波抽出フィルタ１２のフィルタを通過した全信号のうち、周波数が基本周波数である信号の１波分を基本波信号と呼ぶ。倍音生成手段１３において、ステップＳ２で抽出された基本波信号の第２〜第ｎ倍音信号が生成される（ステップＳ３）。ゲイン調整手段１４において、ステップＳ３で生成された各倍音信号がゲイン調整され、その後、合成される（ステップＳ４）。マスキング量検出手段１５において、ステップＳ１で入力された入力音響信号とステップＳ４で合成された合成信号とに基づいて、所定帯域毎にマスキング量が検出される（ステップＳ５）。補正手段１６において、ステップＳ５で検出されたマスキング量に基づいて、入力音響信号によるマスキング現象を抑制するように、所定帯域毎に補正量が設定される（ステップＳ６）。そして、ステップＳ６において設定された補正量にしたがって、各倍音信号の合成信号における周波数振幅特性が補正される（ステップＳ７）。ステップＳ７の次にステップＳ８において、ステップＳ２において抽出された全ての基本波信号について、ステップＳ３〜Ｓ７の処理が完了していれば、図４に示す処理を終了する。全ての基本波信号についてステップＳ３〜Ｓ７の処理をしていなければ、ステップＳ３〜Ｓ７の処理へ進む。以上、音響信号処理装置１の処理の流れについての説明を終了する。 First, an acoustic signal is input to the input terminal 10 (step S1). The fundamental wave extraction filter 12 extracts a fundamental wave signal from the input acoustic signal (step S2). Hereinafter, for explanation of the processing in steps S3 to S7, for example, one wave of a signal having a fundamental frequency among all the signals that have passed through the filter of the fundamental wave extraction filter 12 is referred to as a fundamental wave signal. The harmonic overtone generating means 13 generates the second to nth overtone signals of the fundamental wave signal extracted in step S2 (step S3). The gain adjusting means 14 adjusts the gain of each harmonic sound signal generated in step S3, and then synthesizes it (step S4). The masking amount detection means 15 detects the masking amount for each predetermined band based on the input acoustic signal input in step S1 and the synthesized signal synthesized in step S4 (step S5). Based on the masking amount detected in step S5, the correction unit 16 sets a correction amount for each predetermined band so as to suppress the masking phenomenon caused by the input acoustic signal (step S6). Then, according to the correction amount set in step S6, the frequency amplitude characteristic in the synthesized signal of each harmonic signal is corrected (step S7). In step S8 after step S7, if the processes in steps S3 to S7 are completed for all the fundamental wave signals extracted in step S2, the process shown in FIG. 4 is terminated. If all the fundamental wave signals are not processed in steps S3 to S7, the process proceeds to steps S3 to S7. This is the end of the description of the processing flow of the acoustic signal processing device 1.

ここで、ステップＳ５〜Ｓ７の処理について、数値例を挙げて、より具体的に説明する。ステップＳ５において検出されたマスキング量が、所定帯域「１００Ｈｚ〜２００Ｈｚ」において「＋６ｄＢ」であるとする。また、ステップＳ６において、当該所定帯域における補正量が「＋６ｄＢ」と設定されるとする。以下、周波数が「１２０Ｈｚ」である第２倍音信号について考えるとする。第２倍音信号の周波数「１２０Ｈｚ」は、所定帯域「１００Ｈｚ〜２００Ｈｚ」内の周波数である。したがって、第２倍音信号は、音響信号によって、「＋６ｄＢ」マスクされる。しかしながら、ステップＳ６において、上記所定帯域の補正量が「＋６ｄＢ」と設定されている。したがって、ステップＳ７において、合成信号の周波数振幅特性が補正されて、第２倍音信号のゲインが補正量「＋６ｄＢ」の分だけ増幅される。これにより、音響信号によって「＋６ｄＢ」マスクされても、第２倍音信号が聴感上聞こえにくくなることを回避できる。つまり、音響信号によるマスキング現象を抑制することができる。 Here, the processing of steps S5 to S7 will be described more specifically with a numerical example. It is assumed that the masking amount detected in step S5 is “+6 dB” in the predetermined band “100 Hz to 200 Hz”. In step S6, it is assumed that the correction amount in the predetermined band is set to “+6 dB”. Hereinafter, a second overtone signal having a frequency of “120 Hz” is considered. The frequency “120 Hz” of the second overtone signal is a frequency within a predetermined band “100 Hz to 200 Hz”. Therefore, the second overtone signal is masked by “+6 dB” by the acoustic signal. However, in step S6, the correction amount of the predetermined band is set to “+6 dB”. Therefore, in step S7, the frequency amplitude characteristic of the synthesized signal is corrected, and the gain of the second overtone signal is amplified by the correction amount “+6 dB”. As a result, even if the “+6 dB” is masked by the acoustic signal, it is possible to avoid the second harmonic signal from becoming difficult to hear due to hearing. That is, the masking phenomenon due to the acoustic signal can be suppressed.

以上のように、本実施形態では、音響信号をマスカーとし、倍音信号をマスキーとして所定帯域毎のマスキング量を検出する。そして、当該マスキング量に基づいて、合成信号の周波数振幅特性を補正する。これにより、音響信号の再生音によるマスキング現象を抑制することができる。その結果、倍音を付加することによって得られる低音感にばらつきがない再生を実現することができる。また、本実施形態では、合成信号の周波数振幅特性を補正することで、当該合成信号に含まれる各倍音信号のゲインを個別に補正することができる。つまり、周波数変化するマスキング量に対応した補正を行うことができる。これにより、より自然に低音感にばらつきがない再生を実現することができる。 As described above, in this embodiment, the masking amount for each predetermined band is detected using the acoustic signal as a masker and the overtone signal as a mask. Then, based on the masking amount, the frequency amplitude characteristic of the synthesized signal is corrected. Thereby, the masking phenomenon by the reproduction | regeneration sound of an acoustic signal can be suppressed. As a result, it is possible to realize reproduction in which there is no variation in the low-frequency feeling obtained by adding overtones. Further, in the present embodiment, by correcting the frequency amplitude characteristic of the synthesized signal, the gain of each overtone signal included in the synthesized signal can be individually corrected. That is, the correction corresponding to the masking amount changing in frequency can be performed. As a result, it is possible to realize reproduction with a more natural bass feeling variation.

なお、上述の倍音生成手段１３では、ゼロクロス法を用いて倍音を生成するとしたが、これ以外の他の方法を用いてもよい。それらの方法は、適切な変形によって、本発明に適用することが可能である。 In the above-described harmonic overtone generation unit 13, harmonics are generated using the zero cross method, but other methods may be used. Those methods can be applied to the present invention by appropriate modifications.

（第２の実施形態）
以下、図５を参照して、第２の実施形態に係る音響信号処理装置について説明する。図５は、第２の実施形態に係る音響信号処理装置２の構成を示すブロック図である。マスキング現象においては、上述したようにマスカーのレベルが大きいほどマスキング量も大きいという定性的な性質がある。第２の実施形態では、この定性的性質を利用して、マスカーである音響信号のレベルをマスキング量として等価的に扱い、各倍音信号を補正する。つまり、第２の実施形態では、等価的なマスキング量として、音響信号のレベルを検出する点で第１の実施形態と大きく異なる。以下、異なる点を中心に説明する。 (Second Embodiment)
Hereinafter, the acoustic signal processing device according to the second embodiment will be described with reference to FIG. FIG. 5 is a block diagram illustrating a configuration of the acoustic signal processing device 2 according to the second embodiment. As described above, the masking phenomenon has a qualitative property that the larger the masker level, the larger the masking amount. In the second embodiment, using this qualitative property, the level of the acoustic signal that is a masker is equivalently treated as a masking amount, and each harmonic signal is corrected. That is, the second embodiment is greatly different from the first embodiment in that the level of the acoustic signal is detected as an equivalent masking amount. Hereinafter, different points will be mainly described.

図５において、音響信号処理装置２は、入力端子１０、加算器１１、基本波抽出フィルタ１２、倍音生成手段１３、ゲイン調整手段１４、マスキング量検出手段２５、補正手段２６、および出力端子１７を備える。なお、図５において、入力端子１０、加算器１１、基本波抽出フィルタ１２、倍音生成手段１３、ゲイン調整手段１４、および出力端子１７は、上述した第１の実施形態と同様であるため、同一の符号を付し、説明を省略する。 In FIG. 5, the acoustic signal processing apparatus 2 includes an input terminal 10, an adder 11, a fundamental wave extraction filter 12, a harmonic overtone generation unit 13, a gain adjustment unit 14, a masking amount detection unit 25, a correction unit 26, and an output terminal 17. Prepare. In FIG. 5, the input terminal 10, the adder 11, the fundamental wave extraction filter 12, the harmonic overtone generation unit 13, the gain adjustment unit 14, and the output terminal 17 are the same as those in the first embodiment described above, and thus are the same. The description will be omitted.

図５において、マスキング量検出手段２５には、入力端子１０に入力された音響信号が入力される。マスキング量検出手段２５は、当該音響信号に基づいて、等価的なマスキング量、つまり音響信号のレベルを検出する。マスキング量検出手段２５は、図６に示すように、絶対値化回路２５１、ピークホールド回路２５２、時定数回路２５３、およびレベル検出回路２５４を備える。図６は、マスキング量検出手段２５の内部構成例を示すブロック図である。 In FIG. 5, the acoustic signal input to the input terminal 10 is input to the masking amount detection means 25. The masking amount detection means 25 detects an equivalent masking amount, that is, the level of the acoustic signal based on the acoustic signal. As shown in FIG. 6, the masking amount detection means 25 includes an absolute value conversion circuit 251, a peak hold circuit 252, a time constant circuit 253, and a level detection circuit 254. FIG. 6 is a block diagram showing an example of the internal configuration of the masking amount detection means 25.

図６において、絶対値化回路２５１は、例えば全波整流回路で構成される。そして、入力された音響信号のレベルを絶対値化する。ピークホールド回路２５２は、絶対値化された音響信号レベルのピークを検出する。時定数回路２５３は、例えばＣＲ回路で構成される。そして、ピークホールド回路２５２において検出された音響信号レベルのピークを平滑化する。レベル検出回路２５４は、時定数回路２５３によって平滑化された音響信号のレベルを検出する。このように、本実施形態では、等価的なマスキング量として音響信号のレベルを検出している。 In FIG. 6, the absolute value conversion circuit 251 is constituted by a full-wave rectification circuit, for example. Then, the level of the input acoustic signal is converted to an absolute value. The peak hold circuit 252 detects the peak of the acoustic signal level converted into an absolute value. The time constant circuit 253 is composed of, for example, a CR circuit. Then, the peak of the acoustic signal level detected by the peak hold circuit 252 is smoothed. The level detection circuit 254 detects the level of the acoustic signal smoothed by the time constant circuit 253. Thus, in this embodiment, the level of the acoustic signal is detected as an equivalent masking amount.

図５において、補正手段２６は、所定の補正カーブを有するイコライザで構成される。また、補正手段２６は、マスキング量検出手段２５において検出された音響信号のレベル情報を入力とする。補正手段２６は、所定の補正カーブにしたがって、ゲイン調整手段１４から入力された合成信号のゲインを増幅する補正を行う。この増幅分のゲインは、所定の補正カーブと音響信号レベルとで決まるゲインである。なお、第１の実施形態においては、合成信号の周波数振幅特性を補正したが、本実施形態においては、合成信号自体のゲインを補正する。つまり、合成信号に含まれる各倍音信号は、一律のゲインで補正される。補正カーブの一例を図７に示す。図７は、補正カーブの一例を示す図である。図７において、実線が補正カーブ（補正後の合成信号のゲインカーブ）を示し、点線が補正前の合成信号のゲインを示す。図７に示す補正カーブによれば、合成信号のゲインは、音響信号レベルが大きくなるにつれて大きくなる。このような補正カーブを有するイコライザによって、音響信号レベルが大きいときには、合成信号のゲインが大きくなる。これにより、マスカーのレベルが大きいほどマスキング量も大きいという定性的な性質に即した補正を行うことが可能となり、音響信号レベルをマスキング量として等価的に扱うことができる。 In FIG. 5, the correction means 26 is configured by an equalizer having a predetermined correction curve. The correction means 26 receives the level information of the acoustic signal detected by the masking amount detection means 25 as input. The correction unit 26 performs correction for amplifying the gain of the combined signal input from the gain adjustment unit 14 according to a predetermined correction curve. This gain for amplification is a gain determined by a predetermined correction curve and an acoustic signal level. In the first embodiment, the frequency amplitude characteristic of the synthesized signal is corrected. However, in the present embodiment, the gain of the synthesized signal itself is corrected. That is, each overtone signal included in the synthesized signal is corrected with a uniform gain. An example of the correction curve is shown in FIG. FIG. 7 is a diagram illustrating an example of the correction curve. In FIG. 7, the solid line indicates a correction curve (the gain curve of the combined signal after correction), and the dotted line indicates the gain of the combined signal before correction. According to the correction curve shown in FIG. 7, the gain of the synthesized signal increases as the acoustic signal level increases. The equalizer having such a correction curve increases the gain of the synthesized signal when the acoustic signal level is high. Accordingly, it becomes possible to perform correction in accordance with the qualitative property that the masking amount is larger as the masker level is larger, and the acoustic signal level can be equivalently handled as the masking amount.

以上のように、本実施形態では、音響信号レベルを等価的なマスキング量として補正を行っている。これにより、本実施形態では、第１の実施形態と比べて、実際のマスキング量を検出するための複雑な回路が不要である。そのため、本実施形態に係る音響信号処理装置は、より簡易で、より小規模な構成で実現することができる。 As described above, in this embodiment, the acoustic signal level is corrected using an equivalent masking amount. As a result, the present embodiment does not require a complicated circuit for detecting the actual masking amount, as compared with the first embodiment. Therefore, the acoustic signal processing device according to the present embodiment can be realized with a simpler and smaller configuration.

なお、上述したマスキング量検出手段２５は、図８に示すように、フィルタ２５０をさらに備えていてもよい。図８は、フィルタ２５０をさらに備えたマスキング量検出手段２５の内部構成例を示すブロック図である。フィルタ２５０は、例えばバンドパスフィルタなどで構成される。そして、フィルタ２５０は音響信号の全周波数帯域のうち、所定周波数帯域を選択的に通過させる。マスキング量検出手段２５においては、フィルタ２５０によって選択された所定周波数帯域の音響信号レベルが検出される。これにより、合成信号の補正に対して、所定周波数帯域の重み付けをすることができる。例えば、マスキング現象においては、上述したように、マスカーとマスキーの周波数が近いほど、マスキング量が大きいという性質がある。したがって、例えばフィルタ２５０において、生成された各倍音信号が含まれる周波数帯域の音響信号を通過させると設定してもよい。これにより、上記定性的性質に即した、より効果的な補正を行うことができる。また、音声信号を含む信号（例えば音楽や映画など）では、一般的に音声信号レベルが他の信号より大きい。つまり、音声信号によるマスキング量が大きい場合が多い。したがって、フィルタ２５０において、音声信号が含まれる周波数帯域（例えば、２００Ｈｚ〜４ｋＨｚの周波数帯域）の音響信号を通過させると設定してもよい。これにより、音声信号を含む信号に対して、より効果的な補正を行うことができる。このように、所定周波数帯域の重み付けを行うことで、実際のマスキング量が周波数変化する場合であっても、マスキング量が大きい周波数帯域を選択して、補正することができる。その結果、より効率的にマスキング現象を抑制することができる。 The masking amount detection means 25 described above may further include a filter 250 as shown in FIG. FIG. 8 is a block diagram showing an example of the internal configuration of the masking amount detection means 25 further provided with a filter 250. The filter 250 is composed of, for example, a band pass filter. Then, the filter 250 selectively passes a predetermined frequency band out of the entire frequency band of the acoustic signal. In the masking amount detection means 25, the acoustic signal level in the predetermined frequency band selected by the filter 250 is detected. Thereby, weighting of a predetermined frequency band can be performed for correction of the composite signal. For example, in the masking phenomenon, as described above, there is a property that the masking amount is larger as the frequency of the masker and the maskee is closer. Therefore, for example, the filter 250 may be set to pass an acoustic signal in a frequency band including each generated harmonic signal. Thus, more effective correction can be performed in accordance with the qualitative property. In addition, a signal including an audio signal (for example, music or a movie) generally has an audio signal level higher than that of other signals. That is, the masking amount by the audio signal is often large. Therefore, the filter 250 may be set to pass an acoustic signal in a frequency band (for example, a frequency band of 200 Hz to 4 kHz) including the audio signal. Thereby, more effective correction can be performed on a signal including an audio signal. In this way, by weighting the predetermined frequency band, even if the actual masking amount changes in frequency, it is possible to select and correct a frequency band having a large masking amount. As a result, the masking phenomenon can be suppressed more efficiently.

また、上述した補正手段２６は、図９に示すように、基本波抽出フィルタ１２の出力と倍音生成手段１３の入力との間に設けられてもよい。図９は、第２の実施形態に係る音響信号処理装置２の他の構成例を示すブロック図である。このとき、補正手段２６は、上述した合成信号と同様の補正方法で基本波信号のゲインを補正する。補正手段２６は、合成信号のゲインを基本波信号の段階で補正している。また、上述した補正手段２６において、図７に示す補正カーブを有するとしたが、図１０および図１１に示す補正カーブを有していてもよい。図１０および図１１は、補正カーブの他の例を示す図である。図１０に示す補正カーブにおいて、Ｐ１のレベルは、装置を安定的に動作させることが可能なスレッショルド・レベル（閾値）である。つまり、図１０に示す補正カーブを用いれば、スレッショルド・レベル以上において、合成信号のゲインが補正される。これにより、当該補正動作をより安定的に行うことができる。また、図１１に示す補正カーブは、Ｐ２のレベル以上では、補正するゲイン量は増加せずに一定となる。このように、補正するゲイン量に上限（Ｐ２）を設けることで、合成信号、つまり各倍音信号のレベルが過大になることを防ぐことができる。 Further, the correction means 26 described above may be provided between the output of the fundamental wave extraction filter 12 and the input of the harmonic overtone generation means 13 as shown in FIG. FIG. 9 is a block diagram illustrating another configuration example of the acoustic signal processing device 2 according to the second embodiment. At this time, the correction means 26 corrects the gain of the fundamental wave signal by a correction method similar to that of the above-described composite signal. The correcting means 26 corrects the gain of the composite signal at the stage of the fundamental wave signal. Further, although the correction means 26 described above has the correction curve shown in FIG. 7, it may have the correction curves shown in FIGS. 10 and 11. 10 and 11 are diagrams showing another example of the correction curve. In the correction curve shown in FIG. 10, the level of P1 is a threshold level (threshold) at which the apparatus can be stably operated. That is, if the correction curve shown in FIG. 10 is used, the gain of the composite signal is corrected above the threshold level. Thereby, the correction operation can be performed more stably. In addition, the correction curve shown in FIG. 11 is constant without increasing the amount of gain to be corrected above the level of P2. Thus, by providing the upper limit (P2) to the gain amount to be corrected, it is possible to prevent the level of the synthesized signal, that is, each harmonic signal from becoming excessive.

本発明は、音響信号処理装置およびその方法に関し、音響信号に倍音信号を付加することで低音感の向上を図ることが可能な擬似低音再生装置、オーディオ機器等にも適用される。 The present invention relates to an acoustic signal processing apparatus and method thereof, and is also applied to a pseudo bass reproduction apparatus, an audio device, and the like that can improve bass feeling by adding a harmonic signal to the acoustic signal.

第１の実施形態に係る音響信号処理装置１の構成を示すブロック図The block diagram which shows the structure of the acoustic signal processing apparatus 1 which concerns on 1st Embodiment. マスキング量検出手段１５の内部構成例を示すブロック図The block diagram which shows the example of an internal structure of the masking amount detection means 15 補正手段１６の内部構成例を示すブロック図Block diagram showing an example of the internal configuration of the correction means 16 本実施形態に係る音響信号処理装置１の動作の流れを示すフローチャートThe flowchart which shows the flow of operation | movement of the acoustic signal processing apparatus 1 which concerns on this embodiment. 第２の実施形態に係る音響信号処理装置２の構成の一例を示すブロック図The block diagram which shows an example of a structure of the acoustic signal processing apparatus 2 which concerns on 2nd Embodiment. マスキング量検出手段２５の内部構成例を示すブロック図The block diagram which shows the example of an internal structure of the masking amount detection means 25 補正カーブの一例を示す図Diagram showing an example of a correction curve フィルタ２５０をさらに備えたマスキング量検出手段２５の内部構成例を示すブロック図The block diagram which shows the internal structural example of the masking amount detection means 25 further provided with the filter 250 第２の実施形態に係る音響信号処理装置２の他の構成例を示すブロック図The block diagram which shows the other structural example of the acoustic signal processing apparatus 2 which concerns on 2nd Embodiment. 補正カーブの他の例を示す図Figure showing another example of correction curve 補正カーブの他の例を示す図Figure showing another example of correction curve 従来の音響信号処理装置９の構成を示すブロック図The block diagram which shows the structure of the conventional acoustic signal processing apparatus 9 ゼロクロス法による倍音信号の生成方法を模式的に示す図The figure which shows the method of generating the harmonic signal by the zero cross method typically

Explanation of symbols

１、２音響信号処理装置
１０入力端子
１１、１４４加算器
１２基本波抽出フィルタ
１３倍音生成手段
１４ゲイン調整手段
１５、２５マスキング量検出手段
１６、２６補正手段
１７出力端子
１３１第２倍音生成手段
１３２第３倍音生成手段
１３３第ｎ倍音生成手段
１４１第２倍音ゲイン調整手段
１４２第３倍音ゲイン調整手段
１４３第ｎ倍音ゲイン調整手段
１５１音響信号分析手段
１５２倍音信号分析手段
１５３帯域別マスキング量検出手段
１６１補正量設定手段
１６２周波数振幅特性補正手段
２５０フィルタ
２５１絶対値化回路
２５２ピークホールド回路
２５３時定数回路
２５４レベル検出回路

DESCRIPTION OF SYMBOLS 1, 2 Acoustic signal processing apparatus 10 Input terminal 11, 144 Adder 12 Fundamental wave extraction filter 13 Overtone production | generation means 14 Gain adjustment means 15, 25 Masking amount detection means 16, 26 Correction means 17 Output terminal 131 2nd overtone production means 132 3rd harmonic generation means 133 nth harmonic generation means 141 2nd harmonic gain adjustment means 142 3rd harmonic gain adjustment means 143 nth harmonic gain adjustment means 151 Acoustic signal analysis means 152 Overtone signal analysis means 153 Bandwidth masking amount detection means 161 Correction amount setting means 162 Frequency amplitude characteristic correction means 250 Filter 251 Absolute value conversion circuit 252 Peak hold circuit 253 Time constant circuit 254 Level detection circuit

Claims

Extracting means for extracting a fundamental wave signal from the input acoustic signal;
A supply means for generating a plurality of types of harmonic signals of the fundamental wave signal extracted by the extraction means, and supplying a signal obtained by synthesizing the plurality of types of harmonic signals;
Detecting means for detecting a masking amount by which the input acoustic signal masks each harmonic signal;
Correction means for correcting the synthesized signal so that the gain of each harmonic signal included in the synthesized signal is increased according to the detected masking amount;
An acoustic signal processing apparatus comprising: an adding unit that adds the combined signal corrected by the correcting unit and the input acoustic signal.

The detection means detects the masking amount for each predetermined frequency band based on the synthesized signal and the input acoustic signal supplied by the supply means,
The correction means corrects the gain of the harmonic signal in accordance with a masking amount detected for a predetermined frequency band including the frequency of the harmonic signal for each harmonic signal included in the synthesized signal. The acoustic signal processing device according to 1.

The detection means detects the level of the input acoustic signal as the masking amount,
The correction means corrects the gain of the composite signal so that the gain of the composite signal increases as the level of the input acoustic signal detected by the detection means increases. Acoustic signal processing device.

The correction means corrects the gain of the synthesized signal so that the gain of the synthesized signal increases as the level of the input acoustic signal increases within a range where the level of the input acoustic signal is equal to or higher than a predetermined level. The acoustic signal processing device according to claim 3, wherein:

The correction means corrects the gain of the synthesized signal so that the gain of the synthesized signal increases as the level of the input acoustic signal increases within a range where the level of the input acoustic signal is equal to or lower than a predetermined level. The acoustic signal processing device according to claim 3, wherein:

The acoustic signal processing apparatus according to claim 3, wherein the detection unit detects a level of an input acoustic signal in a predetermined frequency band.

The acoustic signal processing apparatus according to claim 6, wherein the predetermined frequency band is a frequency band including a frequency of each overtone signal.

An extraction step of extracting a fundamental signal from the input acoustic signal;
Supplying a plurality of types of overtone signals of the fundamental wave signal extracted in the extraction step, and supplying a signal in which the plurality of types of overtone signals are combined; and
A detection step of detecting a masking amount by which the input acoustic signal masks each harmonic signal;
A correction step of correcting the synthesized signal so that the gain of each harmonic signal included in the synthesized signal is increased according to the detected masking amount;
An acoustic signal processing method including an adding step of adding the combined signal corrected in the correcting step and the input acoustic signal.