JP2009542137A

JP2009542137A - Filter compressor and method for producing a compressed subband filter impulse response

Info

Publication number: JP2009542137A
Application number: JP2009517033A
Authority: JP
Inventors: ラルスヴィレモエス
Original assignee: Dolby International AB
Current assignee: Dolby International AB
Priority date: 2006-07-04
Filing date: 2007-07-03
Publication date: 2009-11-26
Anticipated expiration: 2027-07-03
Also published as: TR201902417T4; EP2337224A3; EP3739752A1; EP3447916B1; US20100017195A1; WO2008003467A1; ES2623226T3; EP2337224A2; EP3739752B1; EP2337224B1; US8255212B2; ES2638269T3; CN101512899B; HUE057855T2; TW200813981A; KR20090028755A; CN101512899A; KR101201167B1; PL2337224T3; EP3236587B1

Abstract

フィルタ・タップでフィルタ・インパルス応答値を含む、サブバンドに対応する入力サブバンド・フィルタ・インパルス応答から圧縮サブバンド・フィルタ・インパルス応答を生成するためのフィルタ・コンプレッサ（１０２）は、より高い値を有するフィルタ・インパルス応答値およびそのより高い値より低い値を有する少なくとも１つのフィルタ・インパルス応答値を見つけるために、少なくとも２つの入力サブバンド・フィルタ・インパルス応答からフィルタ・インパルス応答値を調べるためのプロセッサ（８２０）と、そのより高い値を有するフィルタ・インパルス応答値を用いて圧縮サブバンド・フィルタ・インパルス応答を構成するためのフィルタ・インパルス応答コンストラクタ（３０５）とを含み、その圧縮サブバンド・フィルタ・インパルス応答は、そのより低い値を有する少なくとも１つのフィルタ・インパルス応答値のフィルタ・タップに対応するフィルタ・インパルス応答値を含まず、または、そのより低い値を有する少なくとも１つのフィルタ・インパルス応答値のフィルタ・タップに対応するゼロ値を含む。 The filter compressor (102) for generating the compressed subband filter impulse response from the input subband filter impulse response corresponding to the subband, including the filter impulse response value at the filter tap, has a higher value. In order to find a filter impulse response value from at least two input subband filter impulse responses in order to find a filter impulse response value having a value and at least one filter impulse response value having a value lower than that Processor 820 and a filter impulse response constructor (305) for constructing a compressed subband filter impulse response using the filter impulse response value having the higher value thereof, the compressed subband・ F The filter impulse response does not include a filter impulse response value corresponding to a filter tap of at least one filter impulse response value having the lower value, or at least one filter impulse having the lower value. Contains a zero value corresponding to the filter tap of the response value.

Description

本発明は、例えばヘッドホン上のマルチチャンネル・サウンド体験のための頭部伝達関数（ＨＲＴＦ）のフィルタリングのような音声アプリケーションの分野において例えば使用することができる、ＱＭＦドメイン（ＱＭＦ＝直交ミラー・フィルタバンク）と呼ばれることもあるサブバンド・ドメインにおけるフィルタ・コンプレッサに関する。 The present invention provides a QMF domain (QMF = orthogonal mirror filter bank) that can be used, for example, in the field of speech applications such as filtering of the head related transfer function (HRTF) for multi-channel sound experiences on headphones. ) In the subband domain, sometimes referred to as).

フィルタ変換技術の最近の進展は、時間ドメイン・フィルタの非常に効率的なＱＭＦ表現を可能にしている。一般に、時間ドメインのどのＦＩＲフィルタ（ＦＩＲ＝有限インパルス応答）も、ＱＭＦにおいて特定のサブバンドにそれぞれ対応する１セットの複素フィルタに変換することができる。このため、フィルタリングは、フィルタリングがＦＦＴ（ＦＦＴ＝高速フーリエ変換）を用いて実行することができる方法と同様に、複素ＱＭＦドメインにおいて起こることができる。それでも、フィルタリングのＱＭＦドメイン表現および実施の計算の複雑性は、例えば時間ドメインにおいて長いインパルス応答を有するフィルタの場合にかなりなものとなり得る。 Recent developments in filter transformation technology have enabled a very efficient QMF representation of time domain filters. In general, any FIR filter in the time domain (FIR = finite impulse response) can be transformed into a set of complex filters, each corresponding to a particular subband in QMF. Thus, filtering can occur in the complex QMF domain, similar to the way that filtering can be performed using FFT (FFT = Fast Fourier Transform). Nonetheless, the QMF domain representation of filtering and the computational complexity of implementation can be substantial, for example for filters with long impulse responses in the time domain.

さらに、音声符号化における最近の進展は、ステレオ（またはモノラル）信号および対応する制御データに基づいて音声信号のマルチチャンネル表現を再現する能力を利用できるようになっている。これらの方法は、さらなる制御データが、送信されたモノラルまたはステレオ・チャンネルに基づくサラウンド・チャンネルの、アップミックスとも呼ばれる、再現を制御するために送信されるので、ドルビー社プロロジック（Ｐｒｏｌｏｇｉｃ（登録商標））のような古くからあるマトリクスに基づく解決策とは大幅に異なる。 Furthermore, recent developments in speech coding have made available the ability to reproduce multi-channel representations of speech signals based on stereo (or monaural) signals and corresponding control data. These methods transmit additional control data to control reproduction, also called upmix, of surround channels based on the transmitted mono or stereo channel, so Dolby Prologic® It is very different from the old matrix based solutions like)).

したがって、そのようなパラメトリック・マルチチャンネル・オーディオ・デコーダ、例えばＭＰＥＧサラウンド（ＭＰＥＧＳｕｒｒｏｕｎｄ（登録商標））は、Ｍ個の送信されたチャンネルおよびさらなる制御データに基づいてＮ個のチャンネルを再構成し、ＮおよびＭは可能な整数であり、Ｎ＞Ｍである。このさらなる制御データは、Ｎ個のチャンネルの全ての送信より著しく低いデータレートを表し、符号化を非常に効率的にすると同時に、Ｍチャンネル装置およびＮチャンネル装置の両方の互換性を確実にする。 Thus, such a parametric multi-channel audio decoder, eg MPEG Surround (MPEG Surround®), reconfigures N channels based on M transmitted channels and further control data, N and M are possible integers, N> M. This additional control data represents a significantly lower data rate than all transmissions of the N channels, making encoding very efficient while ensuring compatibility of both M-channel and N-channel devices.

これらのパラメトリック・サラウンド符号化方法は、通常、ＩＩＤ（チャンネル間強度差）およびＩＣＣ（チャンネル間コヒーレンス）に基づくサラウンド信号のパラメータ化を含む。これらのパラメータは、アップミックス・プロセスにおけるチャンネル対間のパワー比および相関関係を表す。先行技術でも用いられるさらなるパラメータは、アップミックス手順中に、中間または出力チャンネルを予測するために用いられる予測パラメータを含む。 These parametric surround coding methods typically involve the parameterization of surround signals based on IID (Interchannel Intensity Difference) and ICC (Interchannel Coherence). These parameters represent the power ratio and correlation between channel pairs in the upmix process. Additional parameters that are also used in the prior art include prediction parameters used to predict intermediate or output channels during the upmix procedure.

音声符号化における他の進展は、ステレオ・ヘッドホン上のマルチチャンネル信号印象を得る手段を提供している。これは、元のマルチチャンネル信号およびいわゆるＨＲＴＦ（頭部伝達関数）フィルタを用いて、マルチチャンネル信号をステレオにダウンミックスすることによって一般に行われる。パラメトリック・マルチチャンネル・オーディオ・デコーダが、送信されたダウンミックス信号からマルチチャンネル信号をまず再現し、その後にＨＲＴＦフィルタによって再びそれをダウンミックスする必要なく、ヘッドホン上のマルチチャネル信号をレンダリングすることを可能にするバイノーラル・ダウンミックス・アルゴリズムと組み合わせることができることは、先行技術に示されている。これは、パラメトリック・マルチチャネル表現の関数として、ＨＲＴＦフィルタを４つのフィルタに組み合わせることによって達成される。結果として、４つのフィルタは、マルチチャンネル表現のための入力として用いられるステレオ信号（２つのチャンネル）が結果として生じるバイノーラルまたはステレオ出力信号（２つのチャンネル）を達成するためにどのように組み合わされまたはミックスされるかについてパラメトリック・マルチチャンネル表現の関数として表される。それで、４つのフィルタのそれぞれは、２つの出力信号に関して２つの入力信号のうちの１つに関連する。しかしながら、ＨＲＴＦフィルタは、室内特性をうまくモデル化するために、非常に長くなり、そのため、ＱＭＦドメインにおいて４つのＨＲＴＦフィルタをフィルタリングするための計算の複雑性は、著しくなり得る。 Other advances in speech coding have provided a means to obtain a multi-channel signal impression on stereo headphones. This is generally done by downmixing the multichannel signal to stereo using the original multichannel signal and a so-called HRTF (head related transfer function) filter. A parametric multi-channel audio decoder that first reproduces the multi-channel signal from the transmitted down-mix signal and then renders the multi-channel signal on the headphones without having to down-mix it again with an HRTF filter. It has been shown in the prior art that it can be combined with the binaural downmix algorithm that enables it. This is accomplished by combining the HRTF filter into four filters as a function of the parametric multi-channel representation. As a result, how the four filters are combined to achieve a binaural or stereo output signal (2 channels) resulting in a stereo signal (2 channels) used as input for multi-channel representation or Expressed as a function of parametric multi-channel representation of what is mixed. Thus, each of the four filters is associated with one of the two input signals with respect to the two output signals. However, HRTF filters can be very long to model room characteristics well, so the computational complexity for filtering four HRTF filters in the QMF domain can be significant.

本発明の実施形態によれば、フィルタ・タップでフィルタ・インパルス応答値を含む、サブバンドに対応する入力サブバンド・フィルタ・インパルス応答から圧縮サブバンド・フィルタ・インパルス応答を生成するためのフィルタ・コンプレッサは、より高い値を有するフィルタ・インパルス応答値およびそのより高い値より低い値を有する少なくとも１つのフィルタ・インパルス応答値を見つけるために、少なくとも２つの入力サブバンド・フィルタ・インパルス応答からフィルタ・インパルス応答値を調べるためのプロセッサと、そのより高い値を有するフィルタ・インパルス応答値を用いて圧縮サブバンド・フィルタ・インパルス応答を構成するためのフィルタ・インパルス応答コンストラクタとを含み、その圧縮サブバンド・フィルタ・インパルス応答は、そのより低い値を有する少なくとも１つのフィルタ・インパルス応答値のフィルタ・タップに対応するフィルタ・インパルス応答値を含まず、または、そのより低い値を有する少なくとも１つのフィルタ・インパルス応答値のフィルタ・タップに対応するゼロ値の値を含む。 According to an embodiment of the present invention, a filter for generating a compressed subband filter impulse response from an input subband filter impulse response corresponding to a subband, including a filter impulse response value at a filter tap. The compressor filters from at least two input subband filter impulse responses to find a filter impulse response value having a higher value and at least one filter impulse response value having a value lower than the higher value. A processor for examining the impulse response value and a filter impulse response constructor for constructing a compressed subband filter impulse response using the filter impulse response value having the higher value, the compressed subband ·fill The impulse response does not include a filter impulse response value corresponding to a filter tap of the at least one filter impulse response value having the lower value, or at least one filter impulse response having the lower value Contains a zero value corresponding to the value filter tap.

本発明のさらなる実施形態は、フィルタ・タップでフィルタ・インパルス応答値を含む、サブバンドに対応する入力サブバンド・フィルタ・インパルス応答から圧縮サブバンド・フィルタ・インパルス応答を作るするための方法に関し、より高い値を有するフィルタ・インパルス応答値およびそのより高い値より低い値を有する少なくとも１つのフィルタ・インパルス応答値を見つけるために、少なくとも２つの入力サブバンド・フィルタ・インパルス応答からフィルタ・インパルス応答値を調べるステップと、そのより高い値を有するフィルタ・インパルス応答値を用いて圧縮サブバンド・フィルタ・インパルス応答を構成するステップとを含み、その圧縮サブバンド・フィルタ・インパルス応答は、そのより低い値を有する少なくとも１つのフィルタ・インパルス応答値のフィルタ・タップに対応するフィルタ・インパルス応答値を含まず、または、そのより低い値を有する少なくとも１つのフィルタ・インパルス応答値のフィルタ・タップに対応するゼロ値の値を含む。 A further embodiment of the present invention relates to a method for creating a compressed subband filter impulse response from an input subband filter impulse response corresponding to a subband, including a filter impulse response value at a filter tap, In order to find a filter impulse response value having a higher value and at least one filter impulse response value having a value lower than the higher value, the filter impulse response value from at least two input subband filter impulse responses And using the filter impulse response value having the higher value to construct a compressed subband filter impulse response, the compressed subband filter impulse response having the lower value Having at least one A zero value corresponding to the filter tap of at least one filter impulse response value that does not include or has a lower value than the filter impulse response value corresponding to the filter tap of the filter impulse response value of Including.

コンピュータ読み取り可能な記憶媒体の実施形態は、それに記憶された複数セットのサブバンド・フィルタ・インパルス応答を含み、それぞれのセットのサブバンド・フィルタ・インパルス応答は、ともに時間ドメイン頭部伝達関数関連フィルタに近似し、その時間ドメイン頭部伝達関数関連フィルタのフィルタ・インパルス応答は、それぞれのセットのサブバンド・フィルタ・インパルス応答のサブバンド・フィルタ・インパルス応答の長さの合計より大きいか、または、その時間ドメイン頭部伝達関数関連フィルタのフィルタ・インパルス応答は、フィルタ・インパルス応答値が複素数値である場合に、それぞれのセットのサブバンド・フィルタ・インパルス応答のサブバンド・フィルタ・インパルス応答の複素数値のフィルタ・インパルス応答値の長さの合計より大きい。 Embodiments of a computer readable storage medium include multiple sets of subband filter impulse responses stored thereon, each set of subband filter impulse responses together being a time domain head related transfer function related filter And the filter impulse response of the time domain head related transfer function related filter is greater than the sum of the lengths of the subband filter impulse responses of each set of subband filter impulse responses, or The filter impulse response of the time domain head related transfer function-related filter is the complex number of the subband filter impulse response of each set of subband filter impulse responses when the filter impulse response values are complex values. Value filter imper Greater than the sum of the length of the scan response value.

本発明のいくつかの実施形態は、一方では計算効率および他方では品質を両立させることに関して有利になる得る。実施形態は、計算の複雑性の著しい低減および入力サブバンド・フィルタ・インパルス応答によって表されるフィルタの優れた近似の両方を提供する。選択された（または決定された）フィルタ・インパルス応答値を用いて圧縮サブバンド・フィルタ・インパルス応答を調べること（結局、選択することまたは決定することを含む）および構成することは、いくつかの実施形態および／またはアプリケーションにおいて計算の複雑性の低減および優れた近似の両方を達成することができ、（ほとんど）音声的に見分けのつかないリスニング体験に導くことができる。いくつかの実施形態において、これは、より高い値を有する入力フィルタ・インパルス応答のフィルタ・インパルス応答値を見つけ、選択しまたは決定することによって達成することができ、その一方で、少なくとも１つのフィルタ・インパルス応答値は、選択されないかまたは決定されず、より高い値より低い値を有する。選択されまたは決定されたフィルタ・インパルス応答値またはより高い値を有するフィルタ・インパルス応答値を用いて、圧縮フィルタ・インパルス応答値を有する圧縮フィルタ・インパルス応答は、構成されまたは作られる。実施に応じて、選択されない若しくは決定されないフィルタ・インパルス応答値またはより高い値より低い値を有するフィルタ・インパルス応答値は、ゼロにセットされるかまたは無視される。換言すれば、フィルタ・インパルス応答値は、無視されるパターンを含んでも、ゼロに設定されてもまたはその他に修正されたフィルタ・インパルス応答値であってもよい。 Some embodiments of the invention may be advantageous with respect to balancing computational efficiency on the one hand and quality on the other hand. Embodiments provide both a significant reduction in computational complexity and a good approximation of the filter represented by the input subband filter impulse response. Examining (and eventually including selecting or determining) and configuring the compressed subband filter impulse response with the selected (or determined) filter impulse response value may include several Both reduction in computational complexity and good approximation can be achieved in embodiments and / or applications, leading to (almost) audio indistinguishable listening experiences. In some embodiments, this can be accomplished by finding, selecting or determining a filter impulse response value of an input filter impulse response having a higher value, while at least one filter The impulse response value is not selected or determined and has a value lower than the higher value. Using the selected or determined filter impulse response value or a filter impulse response value having a higher value, a compressed filter impulse response having a compressed filter impulse response value is constructed or created. Depending on the implementation, filter impulse response values that are not selected or not determined or that have a value lower than a higher value are set to zero or ignored. In other words, the filter impulse response value may include an ignored pattern, set to zero, or otherwise modified filter impulse response value.

さらに、いくつかの実施形態は、圧縮サブバンド・フィルタ・インパルス応答が構成されるフィルタ・インパルス応答値の選択に影響を与えることによって、計算の複雑性の広範囲にわたる達成可能な低減を提供することができる。結果として、本発明のいくつかの実施形態は、一方では計算の複雑性の達成可能な適合および他方では近似の品質を両立させることにおいて非常に大きい柔軟性を提供する。 Further, some embodiments provide a wide range of achievable reductions in computational complexity by influencing the selection of filter impulse response values from which compressed subband filter impulse responses are constructed. Can do. As a result, some embodiments of the present invention provide tremendous flexibility in balancing the achievable fit of computational complexity on the one hand and the approximate quality on the other hand.

したがって、本発明のいくつかの実施形態は、特に、時間ドメインにおいて比較的長い（有限の）インパルス応答を有するフィルタを含む音声または他のアプリケーションの分野において適用することができる。後に説明されるように、フィルタまたはフィルタ・エレメントを時間ドメインから（複素）サブバンド・ドメインに変換することによって、計算は、個々のサブバンド・フィルタのインパルス応答が時間ドメインにおいてフィルタのインパルス応答と比較して著しく短いように並行して行うことができる。 Thus, some embodiments of the invention can be applied particularly in the field of speech or other applications that include filters with relatively long (finite) impulse responses in the time domain. By transforming the filter or filter element from the time domain to the (complex) subband domain, as will be explained later, the calculation is such that the impulse response of the individual subband filter is the filter impulse response of the filter in the time domain. It can be performed in parallel so as to be significantly shorter than the comparison.

しかしながら、全体の計算の複雑性は、時間ドメインから（複素）サブバンド・ドメインへの単なる変換だけでは低減することができない。例えば、ＨＲＴＦフィルタのように比較的長いインパルス応答を有するフィルタに対して、個々のサブバンド・フィルタでさえ通常長い有限インパルス応答を有し、それは、個々のサブバンドの数で除算される時間ドメインおいて対応するフィルタの有限インパルス応答の順序について非常に大ざっぱに言っている。したがって、特定のアプリケーションで利用可能な計算する能力に応じて、全体の計算の複雑性または個々のサブバンド・フィルタに関する計算の複雑性さえ、かなりなものとなり得る。 However, the overall computational complexity cannot be reduced by a simple transformation from the time domain to the (complex) subband domain. For example, for a filter with a relatively long impulse response, such as an HRTF filter, even an individual subband filter usually has a long finite impulse response, which is divided by the number of individual subbands. The order of the finite impulse responses of the corresponding filters is very roughly described. Thus, depending on the computing capabilities available in a particular application, the overall computational complexity or even the computational complexity for individual subband filters can be substantial.

さらに、付加的にまたは代わりに、フィルタ・インパルス応答のレベルに基づく決定は、フィルタ・コンプレッサの実施形態において実施することができる。このような場合、フィルタ・コンプレッサは、フィルタ・インパルス応答の値（例えば絶対値）が閾値より下である場合に、少なくとも１つのフィルタ・インパルス応答値がゼロに設定されまたは無視されることができるように、構成することができる。アプリケーションのいくらかの分野において、１以上のフィルタ・インパルス応答値は、入力サブバンド・フィルタ・インパルス応答に対応するフィルタバンクのエイリアシング・レベルに近くてもよい。フィルタ・インパルス応答値の値がそのような対応するフィルタバンクのエイリアシング・レベルに近い場合、特定のタップは、対応するフィルタ係数またはフィルタ・インパルス応答値がゼロに問題なく設定されることができるように、ゼロに設定されてもよい。結果として、そのような圧縮フィルタ・インパルス応答に基づくフィルタの実施は、ゼロ値の係数またはインパルス応答値のための乗加算を実行する必要がない。 Furthermore, additionally or alternatively, a determination based on the level of the filter impulse response can be implemented in a filter compressor embodiment. In such a case, the filter compressor may have at least one filter impulse response value set to zero or ignored if the value of the filter impulse response (eg absolute value) is below a threshold. Can be configured. In some areas of the application, the one or more filter impulse response values may be close to the aliasing level of the filter bank corresponding to the input subband filter impulse response. If the value of the filter impulse response value is close to the aliasing level of such a corresponding filter bank, a particular tap can be set without any problem to the corresponding filter coefficient or filter impulse response value to zero. May be set to zero. As a result, implementations of filters based on such compressed filter impulse responses do not have to perform multiplication and addition for zero valued coefficients or impulse response values.

これに関連して、フィルタバンクのエイリアシング・レベルは、多くのフィルタバンクの固有特性である。フィルタバンクのそのようなエイリアシング・レベルは、例えばＳＢＲアプリケーションの枠組みにおいて、信号の単なる処理から生じる。それぞれのフィルタ・タップまたはフィルタ・インパルス応答値が信号を生じる出力に貢献するので、タブ（の例えば絶対値）がより小さく、それぞれのタップの結果または貢献がフィルタバンクの出力に関してより小さい。したがって、小さいタップは、それらの貢献がその範囲にあるフィルタバンクのまたはそれぞれのフィルタバンクのエイリアシング・レベルの順序の出力へのそのような小さい貢献を有することが起こる。この場合、対応するタップをゼロに設定することによって導入されるさらなるディストーションは、さらなる聞き取れるディストーションを導入しないように、多くの場合に許容することができる。多くの場合、エイリアシング・レベルの典型的な範囲は、ピーク信号と比較して−３０ｄＢ、−４０ｄＢ、−５０ｄＢ、−６０ｄＢおよび−７０ｄＢの範囲およびそれ以下である。 In this connection, the filter bank aliasing level is an inherent characteristic of many filter banks. Such an aliasing level of the filter bank arises from mere processing of the signal, for example in the framework of SBR applications. Since each filter tap or filter impulse response value contributes to the output producing the signal, the tab (eg absolute value thereof) is smaller and the result or contribution of each tap is smaller with respect to the output of the filter bank. Thus, small taps will have such a small contribution to the output of the order of the aliasing levels of the filter banks or the respective filter banks whose contributions are in that range. In this case, further distortion introduced by setting the corresponding tap to zero can often be tolerated so as not to introduce further audible distortion. In many cases, typical ranges of aliasing levels are in the range of −30 dB, −40 dB, −50 dB, −60 dB, and −70 dB and lower compared to the peak signal.

例えば、ＨＲＴＦフィルタの場合、時間ドメインＨＲＴＦフィルタを複素ＱＭＦ表現に変換した後に、複素ＱＭＦ表現において時間周波数タイルのいくつかは、（ＭＰＥＧサラウンド・フィルタバンクのエイリアシング・レベルで）低い絶対値を有し得る。ＨＲＴＦフィルタの複素ＱＭＦ表現のこれらのエントリーは、ゼロに設定することができる。これは、複素ＱＭＦ表現に含まれる室内応答を有する長いＨＲＴＦフィルタを実施するための煩雑性低減を可能にする。したがって、低減された煩雑性でバイノーラル化を達成するために、現実的な室内効果を維持するとともに、フィルタ・コンバータの後にフィルタ・コンプレッサの実施形態の形式においてフィルタ低減プロセスが続くことができる。フィルタ低減ステップは、サブバンドＨＲＴＦフィルタが少なくとも少ない数またはかなりの数のゼロさえも含むように、ＨＲＴＦフィルタを単純化することを目的とする。より少ない係数がアクティブであるので、計算の複雑性の著しい低減は、それによって達成することができる。 For example, in the case of an HRTF filter, after converting the time domain HRTF filter to a complex QMF representation, some of the time frequency tiles in the complex QMF representation have a low absolute value (at the aliasing level of the MPEG Surround filter bank). obtain. These entries in the complex QMF representation of the HRTF filter can be set to zero. This allows for reduced complexity for implementing a long HRTF filter with room response included in the complex QMF representation. Thus, to achieve binauralization with reduced complexity, realistic filter effects can be maintained and the filter converter can be followed by a filter reduction process in the form of a filter compressor embodiment. The filter reduction step aims to simplify the HRTF filter so that the subband HRTF filter contains at least a small number or even a significant number of zeros. Since fewer coefficients are active, a significant reduction in computational complexity can thereby be achieved.

したがって、フィルタ・コンプレッサの実施形態、それを作るための方法の実施形態によって提供される１セットの作られた圧縮サブバンド・フィルタ・インパルス応答および複数の圧縮サブバンド・フィルタ・インパルス応答を含むコンピュータ読み取り可能な記憶媒体の実施形態は、それぞれのサブバンド・フィルタに対して個々の計算の複雑性および全てのサブバンド・フィルタに関して全体の計算の複雑性を著しく低減することができる。 Thus, a computer comprising a set of compressed subband filter impulse responses and a plurality of compressed subband filter impulse responses provided by an embodiment of a filter compressor, an embodiment of a method for making it Embodiments of readable storage media can significantly reduce the individual computational complexity for each subband filter and the overall computational complexity for all subband filters.

本発明は、添付図面を参照して、本発明の範囲または精神を制限しない図示例によって記載される。 The present invention will now be described by way of example with reference to the accompanying drawings, which do not limit the scope or spirit of the invention.

図１は、フィルタ・コンバータおよび本発明によるコンプレッサの実施形態の相互作用を図示する。FIG. 1 illustrates the interaction of an embodiment of a filter converter and a compressor according to the present invention. 図２は、本発明のための使用例のシナリオを図示する。FIG. 2 illustrates an example scenario for the present invention. 図３は、本発明によるフィルタ・コンプレッサの実施形態を図示する。FIG. 3 illustrates an embodiment of a filter compressor according to the present invention. 図４は、本発明によるフィルタ・コンプレッサのさらなる実施形態を図示する。FIG. 4 illustrates a further embodiment of a filter compressor according to the present invention. 図５は、マルチプル・フィルタに同時に作動する本発明によるフィルタ・コンプレッサのさらなる実施形態を図示する。FIG. 5 illustrates a further embodiment of a filter compressor according to the present invention operating simultaneously on multiple filters. 図６は、ＨＲＴＦフィルタリングとの関連で用いられる本発明の実施形態を図示する。FIG. 6 illustrates an embodiment of the present invention used in the context of HRTF filtering. 図７は、適合性のあるフィルタのための可能な解決策を図示する。FIG. 7 illustrates a possible solution for a compatible filter. 図８は、フィルタ・コンバータの主要なコンポーネントのための可能な解決策を図示する。FIG. 8 illustrates a possible solution for the main components of the filter converter. 図９は、（複素）分析フィルタバンクの可能な解決策を図示する。FIG. 9 illustrates a possible solution for a (complex) analysis filter bank. 図１０は、適合性のあるサブバンド・フィルタバンクの可能な解決策を図示する。FIG. 10 illustrates a possible solution for a compatible subband filter bank. 図１１は、（複素）合成フィルタバンクの第１の可能な解決策を図示する。FIG. 11 illustrates a first possible solution of the (complex) synthesis filter bank. 図１２は、（複素）合成フィルタバンクのための第２の可能な解決策を図示する。FIG. 12 illustrates a second possible solution for a (complex) synthesis filter bank. 図１３は、本発明によるフィルタ・コンプレッサのさらなる実施形態を図示する。FIG. 13 illustrates a further embodiment of a filter compressor according to the present invention. 図１４ａ〜図１４ｃは、本発明によるフィルタ・コンプレッサの実施形態に使用されるようにスペクトル白色化を図示する。FIGS. 14a-14c illustrate spectral whitening as used in a filter compressor embodiment according to the present invention. 図１５は、マルチプル・フィルタに同時に作動する本発明によるフィルタ・コンプレッサの実施形態を図示する。FIG. 15 illustrates an embodiment of a filter compressor according to the present invention that operates simultaneously on multiple filters.

以下に記載されている実施形態は、効率的なフィルタ表現のための本発明の原理のために単に図示するだけである。本願明細書に記載されている構成および詳細の変更および変形が他の当業者にとって明らかであるものと理解される。したがって、本願明細書において実施形態の記載および説明として表される具体的な詳細によってではなく、間近に迫った特許請求の範囲だけによって制限されることを意図する。 The embodiments described below are merely illustrative for the principles of the present invention for efficient filter representation. It will be understood that changes and modifications in the arrangements and details described herein will be apparent to those skilled in the art. Accordingly, it is intended that the invention be limited not by the specific details presented as the description and description of the embodiments herein, but only by the forthcoming claims.

さらに詳細に本発明の実施形態、実施形態のさらなるコンポーネントおよびアプリケーションを記載する前に、同一または類似の機能特性を有する目的、構成およびコンポーネントが同じ引用符号によって示される点に留意すべきである。明示的にその他に注意さない限り、類似するまたは等しい機能特性および特徴を有する目的、構成およびコンポーネントに関する記載は、互いに関して交換することができる。さらに、以下において、特定の目的、構成およびコンポーネントの特性または特徴が述べられない限り、１つの実施形態において同一であるか類似する、または、図の１つに示される異なる構成に現れる、目的、構成およびコンポーネントの引用符号を要約することが用いられる。引用符号を要約することによって、本発明の実施形態のよりコンパクトでより明瞭な記載を可能にし、さらに異なる実施形態間の特徴および記載の交換の可能性を明確に示す。 Before describing embodiments of the present invention, further components and applications of the invention in more detail, it should be noted that objects, configurations and components having the same or similar functional characteristics are indicated by the same reference signs. Unless expressly noted otherwise, descriptions of objects, configurations and components having similar or equivalent functional characteristics and features may be interchanged with respect to one another. In addition, in the following, unless specific characteristics, configurations and characteristics or features of components are stated, the objectives that are the same or similar in one embodiment or appear in different configurations shown in one of the figures, Summarizing configuration and component citations is used. By summarizing the reference signs, a more compact and clearer description of the embodiments of the invention is possible, and the possibility of exchanging features and descriptions between the different embodiments is clearly shown.

さらに、以下において、図に示される実施形態が方法の対応する実施形態を等しく表す点に留意すべきである。したがって、図に示される実施形態は、例えばフィルタ・コンプレッサの対応する実施形態を図示するだけでなく、対応する方法の対応する実施形態のフローチャートも表す。以下に概説されるように、方法のそのような実施形態は、ハードウェアにおいてまたはソフトウェアにおいて実施することができる。 Furthermore, it should be noted that in the following, the embodiments shown in the figures equally represent corresponding embodiments of the method. Thus, the embodiment shown in the figure not only illustrates a corresponding embodiment of a filter compressor, for example, but also represents a flowchart of a corresponding embodiment of the corresponding method. As outlined below, such embodiments of the method can be implemented in hardware or in software.

図１において、可能なアプリケーションとともに本発明の１つの実施形態が概説される。具体的には、図１は、フィルタ・コンプレッサ１０２の実施形態に接続されるフィルタ・コンバータ１０１を示す。フィルタ・コンバータ１０１は、後にさらに詳細に記載される。フィルタ・コンバータ１０１の実施形態には、時間ドメインにおいてフィルタまたはフィルタ・エレメントの有限インパルス応答ｈ（ｎ）に関する情報を含む入力信号が提供される。インデックスｎは、これに関連して有限インパルス応答（ＦＩＲ）の異なる値またはサンプルを示す整数であり、ｈ（ｎ）は、実数値の数である。 In FIG. 1, one embodiment of the present invention is outlined along with possible applications. Specifically, FIG. 1 shows a filter converter 101 connected to an embodiment of the filter compressor 102. The filter converter 101 will be described in more detail later. An embodiment of the filter converter 101 is provided with an input signal that contains information about the finite impulse response h (n) of the filter or filter element in the time domain. The index n is an integer indicating a different value or sample of the finite impulse response (FIR) in this connection, and h (n) is a real-valued number.

時間ドメイン・フィルタｈ（ｎ）の有限インパルス応答は、所定の振幅を有する単一のインパルスの形式において励起上の時間ドメインにおけるフィルタまたはフィルタ・エレメントの応答である。原則として、時間ドメインにおけるフィルタ・エレメントの完全な挙動は、フィルタの有限インパルス応答に含まれる。デジタル・システムの場合、フィルタのインパルス応答は、時間において単一の例でゼロとは異なる値を有する入力信号を適用することによって決定されまたは測定され得る。この値は、例えば１に等しくあり得る。 The finite impulse response of the time domain filter h (n) is the filter or filter element response in the time domain on the excitation in the form of a single impulse having a predetermined amplitude. In principle, the complete behavior of the filter element in the time domain is included in the finite impulse response of the filter. For a digital system, the impulse response of the filter can be determined or measured by applying an input signal having a value different from zero in a single instance in time. This value may be equal to 1, for example.

フィルタ・コンバータ１０１は、図１０との関連で概説されるように、適合性のあるフィルタの枠組みにおいて用いることができる１セットの有限インパルス応答Ｈ（ｎ，ｋ）を提供することができる。複素分析フィルタバンクに基づく複素フィルタ・コンバータの場合、有限インパルス応答Ｈ（ｎ，ｋ）が複素数値の数を含む点に留意すべきであり、ｎは、前と同じように異なるサンプルを示し、ｋ＝０、・・・、（Ｌ−１）は、サブバンド・フィルタの有限インパルス応答が対応する対応のサブバンドを示す。ｌおよびｋの両方とも整数である。さらに、サブバンドの数Ｌは、正の整数でもある。デジタル・システムの場合、フィルタ・コンバータ１０１によって提供され、フィルタリング・デジタル音声入力信号のために後に用いられるサブバンドの数Ｌは、多くの場合に２の累乗、例えば１６、３２、６４、１２８、２５６、５１２である。以下の例では、サブバンドの数は、Ｌ＝６４であるように選択される。しかしながら、前に概説されるように、原則として、フィルタ・コンプレッサのアプリケーション、コンポーネントおよび実施形態においてサブバンドの数としてあらゆる正の整数Ｌが使用できる。 The filter converter 101 can provide a set of finite impulse responses H (n, k) that can be used in a compatible filter framework, as outlined in connection with FIG. Note that for complex filter converters based on complex analysis filter banks, the finite impulse response H (n, k) contains a number of complex values, where n represents a different sample as before, k = 0,..., (L−1) indicates the corresponding subband to which the finite impulse response of the subband filter corresponds. Both l and k are integers. Furthermore, the number L of subbands is also a positive integer. For digital systems, the number L of subbands provided by the filter converter 101 and used later for the filtered digital audio input signal is often a power of 2, eg 16, 32, 64, 128, 256, 512. In the following example, the number of subbands is selected such that L = 64. However, as outlined above, in principle, any positive integer L can be used as the number of subbands in filter compressor applications, components and embodiments.

説明されるように、時間ドメイン・フィルタｈ（ｎ）は、フィルタＨ（ｎ，ｋ）の複素ＱＭＦまたはサブバンド表現を作り出すフィルタ・コンバータ１０１に入力される。Ｌ＝６４のサブバンドＱＭＦが用いられるこの特定例において、フィルタの複素ＱＭＦ表現は、多数のＬ＝６４の有限インパルス応答の長さを有する長さＫの時間ドメイン・フィルタに対して、長さＫ／６４＋２のＬ＝６４の複素フィルタによって表される。 As described, the time domain filter h (n) is input to a filter converter 101 that produces a complex QMF or subband representation of the filter H (n, k). In this particular example where L = 64 subband QMFs are used, the complex QMF representation of the filter is the length for a length K time domain filter with a number of L = 64 finite impulse response lengths. It is represented by an L = 64 complex filter of K / 64 + 2.

本発明によれば、フィルタＨ（ｎ，ｋ）は、その後にフィルタ・コンプレッサ１０２に入力され、それは、圧縮サブバンド・フィルタ・インパルス応答としてＨ（ｎ，ｋ）を出力する。フィルタ・コンプレッサ１０２の実施形態は、元のフィルタＨ（ｎ，ｋ）より多い数のゼロ値の係数を有するフィルタＨ（ｎ，ｋ）出力し、したがって、低い計算の複雑性を可能する。 In accordance with the present invention, the filter H (n, k) is then input to the filter compressor 102, which outputs H (n, k) as a compressed subband filter impulse response. Embodiments of the filter compressor 102 output a filter H (n, k) having a greater number of zero-valued coefficients than the original filter H (n, k), thus allowing for low computational complexity.

実施形態およびアプリケーションに応じて、フィルタ・コンバータ１０１およびフィルタ・コンプレッサ１０２は、Ｌ接続を介して互いに結合され、それぞれに異なるサブバンド（インデックスｋ＝０、・・・、Ｌ−１またはｋ＝１、・・・、Ｌ）に対応するフィルタ・インパルス応答が送信される。このオプションは、フィルタ・コンバータ１０１およびフィルタ・コンプレッサ１０２の接続を交差するスラッシュ（／）によって図１に示される。しかしながら、２つのコンポーネントは、より少ない数の接続によってまたは単一の接続だけによって互いに結合されてもよく、それに対応する信号または情報が送信される。示される図および実施形態の単純化のために、それぞれのサブバンドに対して個々の接続を含むエレメントの可能な並列接続が必要に応じて示される。しかしながら、サブバンドに関する信号または情報が送信されるときはいつでも、例えばそれを示す変数（例えばＨ（ｎ，ｋ））によって示されるように、いかなる接続も実施することができる。 Depending on the embodiment and application, the filter converter 101 and the filter compressor 102 are coupled to each other via an L connection, each with a different subband (index k = 0,..., L−1 or k = 1). ,..., L), the filter impulse response is transmitted. This option is indicated in FIG. 1 by a slash (/) crossing the connection of the filter converter 101 and the filter compressor 102. However, the two components may be coupled to each other by a fewer number of connections or only by a single connection, and corresponding signals or information are transmitted. For simplicity of the figures and embodiments shown, possible parallel connections of elements including individual connections for each subband are shown as needed. However, whenever a signal or information about a subband is transmitted, any connection can be made, for example as indicated by a variable indicating it (eg H (n, k)).

後にさらに詳細に説明されるように、フィルタ・コンプレッサ１０２の実施形態も、例えばサブバンド・フィルタバンクに含まれるそれぞれの数のサブバンド・フィルタに対して１セットまたは複数のフィルタ・インパルス応答を出力する。入力サブバンド・フィルタ・インパルス応答Ｈ（ｎ，ｋ）および圧縮サブバンド・フィルタ・インパルス応答Ｈ（ｎ，ｋ）の両方とも、前に説明されるように、時間に関連したｎおよびサブバンドに関連したｋによって表示される２次元マトリクスに配列される両方の複素数値の数である。 As will be described in more detail later, embodiments of the filter compressor 102 also output one or more filter impulse responses for each number of subband filters included in the subband filter bank, for example. To do. Both the input subband filter impulse response H (n, k) and the compressed subband filter impulse response H (n, k) are in time-related n and subbands as previously described. The number of both complex values arranged in a two-dimensional matrix represented by the associated k.

しかしながら、フィルタ・コンプレッサ１０２の異なる実施形態に関するさらなる詳細は、後に概説される。さらに、圧縮サブバンド・フィルタ・インパルス応答Ｈ（ｎ，ｋ）および入力サブバンド・フィルタ応答Ｈ（ｎ，ｋ）間の関係は、フィルタ・コンプレッサ１０２の異なる実施形態のために後に説明される。原則として、２セットの複数のフィルタ・インパルス応答Ｈ（ｎ，ｋ）およびＨ（ｎ，ｋ）は、簡単に概説されるように、ゼロ値の係数の数だけより多くのやり方において異なり得る点に留意することが重要である。 However, further details regarding different embodiments of the filter compressor 102 will be outlined later. Further, the relationship between the compressed subband filter impulse response H (n, k) and the input subband filter response H (n, k) will be described later for different embodiments of the filter compressor 102. In principle, the two sets of multiple filter impulse responses H (n, k) and H (n, k) can differ in more ways by the number of zero-valued coefficients, as briefly outlined. It is important to keep in mind.

図２において、本発明のための一般の使用例のシナリオが概説される。ここで、時間ドメイン・フィルタｈ（ｎ）は、フィルタＨ（ｎ，ｋ）の複素ＱＭＦ表現を作り出すフィルタ・コンバータ１０１に入力され、複素ＱＭＦフィルタＨ（ｎ，ｋ）は、前に説明されるように、低減されまたは圧縮された複素ＱＭＦフィルタＨ（ｎ，ｋ）を出力するフィルタ・コンプレッサ１０２の実施形態に入力される。 In FIG. 2, a general use case scenario for the present invention is outlined. Here, the time domain filter h (n) is input to a filter converter 101 that produces a complex QMF representation of the filter H (n, k), which is described previously. As such, it is input to an embodiment of the filter compressor 102 that outputs a reduced or compressed complex QMF filter H (n, k).

両方が図１との関連で説明される時間ドメインｈ（ｎ）においてフィルタの実数値のインパルス応答を提供するフィルタ・コンバータ１０１およびフィルタ・コンプレッサ１０２の実施形態は別として、図２に示される使用例のシナリオは、複素分析フィルタバンクとも呼ばれるＱＭＦ分析フィルタバンク２０３をさらに含む。ＱＭＦ分析フィルタバンク２０３には、例えばデジタル音声信号であり得る入力信号ｘ（ｎ）が提供される。ＱＭＦ分析フィルタ・ドメイン２０３は、出力において入力信号ｘ（ｎ）の複素ＱＭＦ表現Ｘ（ｎ，ｋ）を提供する。図１との関連で説明されるように、整数ｎおよびｋは、サンプルまたは時間インデックスとサブバンド・インデックスとにそれぞれ関連する。ＱＭＦ分析フィルタバンク２０３のための可能な解決策は、図９との関連でさらに詳細に説明される。 Apart from the embodiment of the filter converter 101 and the filter compressor 102, which both provide the real-valued impulse response of the filter in the time domain h (n) described in connection with FIG. 1, the use shown in FIG. The example scenario further includes a QMF analysis filter bank 203, also referred to as a complex analysis filter bank. The QMF analysis filter bank 203 is provided with an input signal x (n), which can be, for example, a digital audio signal. The QMF analysis filter domain 203 provides a complex QMF representation X (n, k) of the input signal x (n) at the output. As described in connection with FIG. 1, integers n and k are associated with a sample or time index and a subband index, respectively. A possible solution for the QMF analysis filter bank 203 is described in more detail in connection with FIG.

入力信号ｘ（ｎ）の複素ＱＭＦ表現Ｘ（ｎ，ｋ）は、サブバンド・ドメインにおいて作動するフィルタリング・ステージ２０１にその後に提供される。フィルタリング・ステージまたはサブバンド・フィルタ２０１は、調整可能なサブバンド・フィルタバンクであり、それは、フィルタ・コンプレッサ１０２の実施形態の出力に結合される複数のＬ個の中間フィルタを含む。フィルタ・コンプレッサ１０２の実施形態を介して、サブバンド・フィルタバンク２０１の中間フィルタには、（複素数値の）ＱＭＦ表現Ｘ（ｎ，ｋ）をフィルタにかけるために用いられる圧縮サブバンド・フィルタ・インパルス応答Ｈ（ｎ，ｋ）が提供される。 The complex QMF representation X (n, k) of the input signal x (n) is then provided to a filtering stage 201 that operates in the subband domain. Filtering stage or subband filter 201 is an adjustable subband filter bank, which includes a plurality of L intermediate filters coupled to the output of an embodiment of filter compressor 102. Through an embodiment of the filter compressor 102, the intermediate filter of the subband filter bank 201 includes a compressed subband filter used to filter the (complex-valued) QMF representation X (n, k). An impulse response H (n, k) is provided.

原則として、後に説明されるように、複素ＱＭＦ表現Ｘ（ｎ，ｋ）は、サブバンド・インデックスｋによって識別されるそれぞれのサブバンドに対して複素ＱＭＦ表現Ｘ（ｎ，ｋ）およびフィルタ・コンプレッサ１０２の実施形態によって提供されるそれぞれのフィルタ・インパルス応答Ｈ（ｎ，ｋ）の畳み込みを計算することによって、フィルタにかけることができる。 In principle, as will be explained later, the complex QMF representation X (n, k) is a complex QMF representation X (n, k) and filter compressor for each subband identified by the subband index k. It can be filtered by calculating the convolution of each filter impulse response H (n, k) provided by the 102 embodiment.

それから、複素ＱＭＦドメインにおいてサブバンド・フィルタバンク２０１によって提供されるフィルタ信号は、ＱＭＦ合成フィルタバンクまたは複素合成フィルタバンクに提供され、それは、最後的に（実数値の）出力信号ｙ（ｎ）を合成する。ＱＭＦ合成フィルタバンク２０２または複素合成フィルタバンクのための可能な解決策は、図１１および図１２の枠組みにおいて述べられる。 Then, the filter signal provided by the subband filter bank 201 in the complex QMF domain is provided to the QMF synthesis filter bank or the complex synthesis filter bank, which finally outputs the (real-valued) output signal y (n). Synthesize. Possible solutions for the QMF synthesis filter bank 202 or the complex synthesis filter bank are described in the framework of FIGS.

換言すれば、図１に示すように、フィルタ・コンバータ１０１およびフィルタ・コンプレッサ１０２の実施形態と並行して、信号ｘ（ｎ）は、Ｘ（ｎ，ｋ）すなわち入力信号の合複素ＱＭＦ表現を出力するＱＭＦ分析２０３モジュールに入力される。その後、その信号は、フィルタ・コンプレッサ１０２によって出力される複素ＱＭＦフィルタを用いてＱＭＦドメインにおいてフィルタ２０１にかけられ、フィルタにかけられた信号は、フィルタにかけられた出力信号ｙ（ｎ）を作り出すＱＭＦ合成フィルタバンク２０２によって時間ドメインに最終的に合成される。 In other words, as shown in FIG. 1, in parallel with the embodiment of the filter converter 101 and the filter compressor 102, the signal x (n) is represented by X (n, k), ie the combined complex QMF representation of the input signal. It is input to the QMF analysis 203 module to be output. The signal is then applied to filter 201 in the QMF domain using a complex QMF filter output by filter compressor 102, and the filtered signal produces a filtered output signal y (n). It is finally synthesized into the time domain by the bank 202.

図３において、フィルタ・コンプレッサ１０２の実施形態のさらに詳細な図が与えられる。また、時間ドメインにおいて入力インパルス応答としての時間ドメイン・フィルタｈ（ｎ）は、フィルタ・コンバータ１０１に入力される。フィルタの時間ドメイン・インパルス応答は、３０１によって表示される。前に説明されるように、フィルタ・コンバータの後に、時間ドメイン・フィルタは、サブバンド・ドメインに転送され、Ｈ（ｎ，ｋ）によって表される。フィルタ応答の絶対値の時間／周波数プロットは、３０２によって与えられる。 In FIG. 3, a more detailed view of an embodiment of the filter compressor 102 is provided. Also, the time domain filter h (n) as an input impulse response in the time domain is input to the filter converter 101. The time domain impulse response of the filter is indicated by 301. As explained previously, after the filter converter, the time domain filter is forwarded to the subband domain and is represented by H (n, k). A time / frequency plot of the absolute value of the filter response is given by 302.

図３に示されるフィルタ・コンプレッサ１０２の実施形態は、絶対値表現モジュール３０３を含み、それは、フィルタ・コンプレッサ１０２の実施形態の入力に接続される。フィルタ・コンプレッサ１０２の実施形態は、さらにマスク・ジェネレータ３０４を含み、それは、絶対値表現モジュール３０３の出力に結合される。また、フィルタ計算機３０５は、フィルタ・コンプレッサ１０２の実施形態に含まれ、それは、フィルタ・コンプレッサ１０２の実施形態の入力およびマスク・ジェネレータ３０４の出力の両方に接続される。また、フィルタ計算機３０５は、出力を含み、それは、フィルタ・コンプレッサ１０２の実施形態の出力を表す。 The embodiment of the filter compressor 102 shown in FIG. 3 includes an absolute value representation module 303 that is connected to the input of the embodiment of the filter compressor 102. The filter compressor 102 embodiment further includes a mask generator 304, which is coupled to the output of the absolute value representation module 303. The filter calculator 305 is also included in the filter compressor 102 embodiment, which is connected to both the input of the filter compressor 102 embodiment and the output of the mask generator 304. The filter calculator 305 also includes an output, which represents the output of the filter compressor 102 embodiment.

複素ＱＭＦフィルタまたはサブバンド入力フィルタＨ（ｎ，ｋ）は、絶対値表現モジュール３０３、フィルタ・マスク・ジェネレータ３０４およびフィルタ計算機３０５を含むフィルタ・コンプレッサ１０２の実施形態に入力される。絶対値表現モジュール３０３は、部分図３０２によって例示されるように、フィルタの絶対値の時間／周波数プロットを作る。これは、後に概説されるように、例えば、ＱＭＦドメインにおいてフィルタ係数の絶対値の対数表現であり得る。フィルタ・マスク・ジェネレータ３０４は、１つの実施形態において、絶対値表現モジュール３０３によって提供される情報に基づいてＱＭＦドメインにおいてフィルタの絶対値の表現における最大値を有する係数（ｎ，ｋ）を選択しまたは決定する。フィルタ・マスク・ジェネレータ３０４は、要求されるフィルタ圧縮の量に依存する係数の調整可能な、プログラム可能な、一定のまたは所定の数を決定しまたは選択する。選択されたフィルタ係数のより低い数は、より高い煩雑性低減を与える。例およびさらなる詳細は、さらに明細書中に説明される。したがって、本願記載の枠組みにおいて多くの場合、ワードの決定、選択し、確定し、確立しおよび見つけることは、同義的に用いることができる。多くの場合、決定されまたは選択されているフィルタ・インパルス応答値は、そのようなフィルタ・インパルス応答値であり、それは、より高い値より低い値を有するフィルタ・インパルス応答値と比較して、高い値を有する（または含む）。これらの低い値のフィルタ・インパルス応答値は、選択されないまたは決定されないものとして見なされる。 The complex QMF filter or subband input filter H (n, k) is input to an embodiment of the filter compressor 102 that includes an absolute value representation module 303, a filter mask generator 304 and a filter calculator 305. The absolute value representation module 303 creates a time / frequency plot of the absolute value of the filter, as illustrated by the partial diagram 302. This can be, for example, a logarithmic representation of the absolute value of the filter coefficient in the QMF domain, as outlined later. The filter mask generator 304, in one embodiment, selects the coefficient (n, k) having the maximum value in the representation of the absolute value of the filter in the QMF domain based on the information provided by the absolute value representation module 303. Or decide. The filter mask generator 304 determines or selects an adjustable, programmable, constant or predetermined number of coefficients depending on the amount of filter compression required. A lower number of selected filter coefficients gives a higher complexity reduction. Examples and further details are further described in the specification. Thus, in many cases in the framework described herein, determining, selecting, establishing, establishing and finding words can be used interchangeably. In many cases, the determined or selected filter impulse response value is such a filter impulse response value, which is high compared to a filter impulse response value having a lower value than a higher value. Has (or contains) a value. These low value filter impulse response values are considered as not being selected or determined.

前に概説されるように、代わりにまたは付加的に、煩雑性の低減は、フィルタ・コンプレッサの実施形態に提供されるように、フィルタ・インパルス応答に対応するフィルタバンクのいわゆるエイリアシング・レベルと比較して、フィルタ・タップまたはフィルタ・インパルス応答値を調べることに基づいて達成することもできる。ＱＭＦドメインにおいてフィルタ・インパルス応答価値の特定のタップが、フィルタバンクのエイリアシング・レベルに近い場合、これらのフィルタ・タップは、計算の複雑性を低減するために問題なくゼロに設定されまたはその他に取り扱われ得る。これらのフィルタ・タップは、ゼロ値の係数がフィルタの実施において乗加算の枠組みに含まれることを必要としないように、フィルタの実施の場合において問題なく無視することができる。例えば、時間ドメインＨＲＴＦフィルタを複素ＱＭＦ表現に変換した後に、複素ＱＭＦ表現において時間周波数タイルのいくつかは、対応するＭＰＥＧサラウンド・フィルタバンクのエイリアシング・レベルで低い絶対値を有し得る。ＨＲＴＦフィルタの複素ＱＭＦ表現においてこれらのエントリーは、ゼロに設定することができ、それは、含まれる室内応答を有する長いＨＲＴＦフィルタを実施するための煩雑性低減を可能にする。 As outlined previously, alternatively or additionally, the complexity reduction is compared to the so-called aliasing level of the filter bank corresponding to the filter impulse response, as provided in the filter compressor embodiment. It can also be achieved based on examining the filter tap or filter impulse response value. If certain taps of the filter impulse response value in the QMF domain are close to the aliasing level of the filter bank, these filter taps are set to zero or otherwise handled without difficulty to reduce computational complexity Can be. These filter taps can be safely ignored in the filter implementation so that zero-valued coefficients do not need to be included in the multiply-add framework in the filter implementation. For example, after converting a time domain HRTF filter to a complex QMF representation, some of the time frequency tiles in the complex QMF representation may have a low absolute value at the aliasing level of the corresponding MPEG Surround filter bank. These entries in the complex QMF representation of the HRTF filter can be set to zero, which allows for reduced complexity for implementing long HRTF filters with an included room response.

フィルタ・マスク・ジェネレータは、絶対値表現モジュール３０３によって提供される情報に基づいて、フィルタ・マスクＭ（ｎ，ｋ）を作り、さらに、Ｈ（ｎ，ｋ）の選択されたフィルタ係数をフィルタ計算機３０５に示す選択されたフィルタ・マスクＭ（ｎ，ｋ）を出力する。フィルタ計算機３０５は、選択されたフィルタ係数を含むＱＭＦドメインにおいて元のフィルタＨ（ｎ，ｋ）から新しい圧縮フィルタＨ（ｎ，ｋ）を作り出す。実施に関する異なる可能性についてのさらなる詳細は、以下に与えられる。 The filter mask generator creates a filter mask M (n, k) based on the information provided by the absolute value representation module 303, and further selects the selected filter coefficient of H (n, k) as a filter calculator. The selected filter mask M (n, k) shown at 305 is output. The filter calculator 305 creates a new compressed filter H (n, k) from the original filter H (n, k) in the QMF domain that includes the selected filter coefficients. Further details on the different possibilities for implementation are given below.

図４は、フィルタ・コンプレッサ１０２のさらなる実施形態を示し、それは、図３に示されるフィルタ・コンプレッサ１０２の実施形態と同じ基本構成を有する。具体的には、図４に示されるフィルタ・コンプレッサ１０２の実施形態も、絶対値表現モジュール３０３を含み、それは、一方ではフィルタ・コンプレッサ１０２の実施形態の入力に接続され、他方では絶対値表現モジュール３０３の出力を介してマスク・ジェネレータ３０４に接続される。図４におけるフィルタ・コンプレッサ１０２の実施形態も、フィルタ計算機３０５を含み、それは、フィルタ・コンプレッサの入力およびマスク・ジェネレータ３０４の出力に接続される。フィルタ計算機３０５の出力は、前と同じように、図４に示されるフィルタ・コンプレッサ１０２の実施形態の出力を表す。 FIG. 4 shows a further embodiment of the filter compressor 102, which has the same basic configuration as the embodiment of the filter compressor 102 shown in FIG. Specifically, the embodiment of the filter compressor 102 shown in FIG. 4 also includes an absolute value representation module 303, which is connected on the one hand to the input of the embodiment of the filter compressor 102 and on the other hand the absolute value representation module. It is connected to the mask generator 304 via the output of 303. The embodiment of the filter compressor 102 in FIG. 4 also includes a filter calculator 305 that is connected to the input of the filter compressor and the output of the mask generator 304. The output of the filter calculator 305 represents the output of the embodiment of the filter compressor 102 shown in FIG. 4 as before.

しかしながら、図３に示されるフィルタ・コンプレッサ１０２の実施形態と比較して、絶対値表現モジュール３０３およびフィルタ計算機３０５は、図４に示される実施形態の場合にさらに詳細に示され、本願特許出願の以下のセクションにおいて代わりのまたはさらなる実施とともにさらに詳細に説明される。 However, compared to the embodiment of the filter compressor 102 shown in FIG. 3, the absolute value representation module 303 and the filter calculator 305 are shown in more detail in the case of the embodiment shown in FIG. Further details are given in the following sections along with alternative or further implementations.

絶対値表現モジュール３０３は、絶対値アルゴリズム関数モジュール４０１を含み、それは、絶対値表現モジュール３０３の入力および出力間に白色化モジュール４０２と直列に接続される。フィルタ計算機モジュール３０５は、フィルタ・デシメータ・モジュール４０３を含み、それは、利得計算機４０４と直列に接続される。フィルタ・デシメータ・モジュール４０３および利得計算機４０４の両方は、フィルタ計算機モジュール３０５の入力および出力間に直列に接続される。具体的な実施に応じて、マスク・ジェネレータ３０４によって提供されるようにマスクに関する情報は、図４に示されるように、フィルタ・デシメータ・モジュール４０３に提供されさらに任意に利得計算機モジュール４０４に提供される。しかしながら、フィルタ計算機モジュール３０５の具体的な実施に応じて、利得計算機モジュール４０４には、図４の破線によって示されるように、利得計算機モジュール４０４およびフィルタ計算機モジュール３０５の入力間の任意の接続を介して、フィルタ・コンプレッサ１０２の実施形態に提供されるように、入力サブバンド・フィルタ・インパルス応答Ｈ（ｎ，ｋ）が任意に提供されてもよい。 The absolute value representation module 303 includes an absolute value algorithm function module 401, which is connected in series with the whitening module 402 between the input and output of the absolute value representation module 303. The filter calculator module 305 includes a filter decimator module 403 that is connected in series with a gain calculator 404. Both the filter decimator module 403 and the gain calculator 404 are connected in series between the input and output of the filter calculator module 305. Depending on the specific implementation, information about the mask as provided by the mask generator 304 is provided to the filter decimator module 403 and optionally to the gain calculator module 404 as shown in FIG. The However, depending on the specific implementation of the filter calculator module 305, the gain calculator module 404 may be passed through any connection between the inputs of the gain calculator module 404 and the filter calculator module 305, as shown by the dashed lines in FIG. Thus, as provided in the embodiment of the filter compressor 102, an input subband filter impulse response H (n, k) may optionally be provided.

図４にさらに詳細に示されるフィルタ・コンプレッサ１０２の実施形態の個々のモジュールについて述べる前に、図４に示されるようにフィルタ・コンプレッサ１０２の実施形態の機能性の総括が与えられる。 Before discussing the individual modules of the filter compressor 102 embodiment shown in more detail in FIG. 4, a summary of the functionality of the embodiment of the filter compressor 102 is given as shown in FIG.

図４において、本発明によるフィルタ・コンプレッサ１０２の異なる実施形態が概説される。ここで、絶対値表現モジュール３０３は、絶対値および対数関数モジュール４０１と、絶対値および対数関数モジュール４０１によって供給される絶対値の表現のスペクトル白色化を実行する白色化モジュール４０２とを含む。フィルタ・マスク・ジェネレータ３０４は、前と同じであり、フィルタ・マスクＭ（ｎ，ｋ）をフィルタ計算機モジュール３０５に出力する。これは、フィルタＨ（ｎ，ｋ）の選択された係数を保持し、他の係数をこの実施形態においてゼロに設定するフィルタ・デシメータ・モジュール４０３と、圧縮フィルタＨ（ｎ，ｋ）の利得が元のフィルタＨ（ｎ，ｋ）のそれと同様であるように、フィルタの利得を調整する利得計算機モジュール４０４とを含む。 In FIG. 4, a different embodiment of the filter compressor 102 according to the present invention is outlined. Here, the absolute value representation module 303 includes an absolute value and logarithmic function module 401 and a whitening module 402 that performs spectral whitening of the absolute value representation supplied by the absolute value and logarithmic function module 401. The filter mask generator 304 is the same as before, and outputs the filter mask M (n, k) to the filter calculator module 305. This is because the filter decimator module 403 holds the selected coefficients of the filter H (n, k) and sets the other coefficients to zero in this embodiment, and the gain of the compression filter H (n, k) And a gain calculator module 404 that adjusts the gain of the filter to be similar to that of the original filter H (n, k).

方程式（１）によって示される評価表現Ａ（ｎ，ｋ）は、人間の耳の特定の音響特性を考慮することなく人間の耳に関して体積配分を反映する。 The evaluation expression A (n, k) shown by equation (1) reflects the volume allocation for the human ear without taking into account the specific acoustic properties of the human ear.

評価表現Ａ（ｎ，ｋ）を概略的に表す図３における部分図３０２は、図３に示される平面において２つのインデックスｋ、ｎの関数として評価表現Ａ（ｎ，ｋ）の３次元プロットとして理解することができ、さらに、評価表現値Ａ（ｎ，ｋ）は、部分図３０２のｎ−ｋ平面と直角にプロットされる。換言すれば、部分図３０２は、サンプル・インデックスまたは時間インデックスｎとサブバンド・インデックスｋとの関数としてフィルタＡ（ｎ，ｋ）絶対値の時間／周波数表現の評価表現の略図を示す。時間インデックスまたはサンプル・インデックスｎは、例えば、時間ドメイン・インパルス応答ｈ（ｎ）のインデックスｎからＬ（サブバンドの数）倍だけ異なってもよい。図９、図１１および図１２との関連で説明されるように、フィルタ・コンバータ１０１は、複素変調された分析フィルタバンクを含んでもよく、それは、順に１つ以上のダウンサンプラを含んでもよく、それは、サンプルの数を係数倍だけ低減し、それは、例えばサブバンドＬの数であり得る。しかしながら、これらのダウンサンプラが任意のコンポーネントであるように、インデックスｎは、時間ドメイン・インパルス応答Ｈ（ｎ）のインデックスに相当する時間インデックスまたはサンプル・インデックスを指すか、または、例えば、時間ドメイン・インパルス応答Ｈ（ｎ）の時間またはサンプル・インデックスｎからＬ倍だけ異なるダウンサンプルされた時間インデックスまたはサンプル・インデックスに対応してもよい。 The partial diagram 302 in FIG. 3 schematically representing the evaluation expression A (n, k) is a three-dimensional plot of the evaluation expression A (n, k) as a function of the two indices k, n in the plane shown in FIG. Further, the evaluation expression value A (n, k) is plotted at right angles to the nk plane of the partial view 302. In other words, the partial diagram 302 shows a schematic representation of the evaluation representation of the time / frequency representation of the absolute value of the filter A (n, k) as a function of the sample index or time index n and the subband index k. The time index or sample index n may differ, for example, by an index n to L (number of subbands) times the time domain impulse response h (n). As described in connection with FIGS. 9, 11 and 12, the filter converter 101 may include a complex modulated analysis filter bank, which in turn may include one or more downsamplers, It reduces the number of samples by a factor of times, which can be, for example, the number of subbands L. However, as these downsamplers are optional components, the index n refers to a time index or sample index corresponding to the index of the time domain impulse response H (n), or, for example, It may correspond to a down-sampled time index or sample index that differs by L times from the time or sample index n of the impulse response H (n).

以下には、白色化モジュール４０２についてのさらなる詳細が概説される。本発明によって教示される白色化モジュールの目的は、知覚的に重要なフィルタ・タップが廃棄される状況を回避するために、マスク生成の前にフィルタの知覚的な重み付けを可能にすることであり、その理由は、それらが、他の知覚的により重要でないフィルタ・タップのために小さい絶対値を有するからである。 In the following, further details about the whitening module 402 are outlined. The purpose of the whitening module taught by the present invention is to allow perceptual weighting of filters prior to mask generation to avoid situations where perceptually important filter taps are discarded. The reason is that they have small absolute values due to other perceptually less important filter taps.

スペクトル白色化は、図１４との関連でさらに方程式（４）によって表されるように特に白色化を考慮してさらに詳細に説明されるにもかかわらず、（スペクトル）白色化が、フィルタ圧縮の過程において作られるディストーションを防止しまたは最小化するためにエネルギーをスペクトル部分から異なるスペクトル部分に転送することが望ましいという知見に基づく点に留意すべきである。 Even though spectral whitening is described in more detail in particular with regard to whitening as represented by equation (4) in connection with FIG. 14, (spectral) whitening is It should be noted that it is based on the finding that it is desirable to transfer energy from one spectral part to another in order to prevent or minimize the distortion created in the process.

現実のフィルタおよびオーディオ・システムは、不規則に分散された時間／周波数分布を非常に多くの場合に有し、それは、より高い周波数に位置されているサブバンドよりはむしろより低い周波数に位置されているサブバンドと比較して著しく大きい長さを有するサブバンド・ドメインにおいてフィルタ・インパルス応答をもたらすことができる。さらに、現実のフィルタおよびオーディオ・システムの不規則に分散された振幅／周波数分布は、互いに関して個々のサブバンド・フィルタの異なる関連性を導くこともできる。換言すれば、例えば、より高い周波数で現実のフィルタおよびのオーディオ・システムより高い減衰化のため、より高い周波数に対応するサブバンド・フィルタは、より低い周波数に対応するサブバンド・フィルタと比較して、それほど重要でない。しかしながら、フィルタ圧縮がより高い周波数サブバンド・フィルタにおいて有する影響を防止しまたは少なくとも最小化するために、（スペクトル）白色化は、リスニング体験の激しいディストーションを導く、圧縮の過程において完全に抑制されることから、より高い周波数でサブバンド・フィルタを、上で概説されるシナリオにおいて防止するために有利に実施することができる。そのため、重み付けとも呼ばれる（スペクトル）白色化は、現実のフィルタおよびオーディオ・システムのための重要な点であり得る。 Real-world filters and audio systems very often have an irregularly distributed time / frequency distribution that is located at lower frequencies rather than subbands located at higher frequencies. A filter impulse response can be provided in the subband domain that has a significantly greater length compared to the subband being. Furthermore, the random distributed amplitude / frequency distribution of real filters and audio systems can also lead to different associations of individual subband filters with respect to each other. In other words, a subband filter corresponding to a higher frequency is compared to a subband filter corresponding to a lower frequency, for example due to higher attenuation than real filters and audio systems at higher frequencies. And not so important. However, to prevent or at least minimize the effect that filter compression has on higher frequency subband filters, (spectral) whitening is completely suppressed in the process of compression, leading to intense distortion of the listening experience. Thus, subband filters at higher frequencies can be advantageously implemented to prevent in the scenario outlined above. Thus, (spectral) whitening, also called weighting, can be an important point for real-world filters and audio systems.

したがって、図４に示される実施形態において絶対値表現モジュール３０３に含まれるように白色化モジュール４０２は、正規化効果が全体の周波数範囲を周波数バンドに分割することによって使用されるスペクトル白色化を適用する。複素変調された分析フィルタバンクとの関連でさらに詳細に説明されるように、それぞれのサブバンドは、特定の中心周波数を有する特定の周波数範囲に対応する。結果として、サブバンドは、中心周波数に従って配列することができる。自然な選択において、サブバンド・インデックスｋは、増加する順序において中心周波数に増加する順序に対応する。 Therefore, as included in the absolute value representation module 303 in the embodiment shown in FIG. 4, the whitening module 402 applies spectral whitening, where the normalization effect is used by dividing the entire frequency range into frequency bands. To do. Each subband corresponds to a specific frequency range having a specific center frequency, as will be described in more detail in the context of a complex modulated analysis filter bank. As a result, the subbands can be arranged according to the center frequency. In a natural choice, the subband index k corresponds to an increasing order to the center frequency in increasing order.

前述の周波数バンドに関して正規化効果の形式においてスペクトル白色化を実施するために、サブバンドまたはサブバンドのサブグループの知覚的に関連したインターバルが形成され、それはそれぞれ少なくとも１つのサブバンドを含む。さらに、多くの具体的な実施において、個々のサブバンドは、全体として正確に１つのサブグループに属する。しかしながら、サブバンドのそれぞれのサブグループは、複数のサブバンドを含んでもよい。この場合、サブグループは、典型的に隣接した中心周波数を有するサブバンドだけを含む。 To perform spectral whitening in the form of a normalization effect with respect to the aforementioned frequency bands, perceptually related intervals of subbands or subgroups of subbands are formed, each including at least one subband. Further, in many specific implementations, individual subbands belong to exactly one subgroup as a whole. However, each subgroup of subbands may include multiple subbands. In this case, the subgroup typically includes only subbands with adjacent center frequencies.

換言すれば、サブバンドが、増加する順序においてそれらの中心周波数に従ってさらに同時に増加するサブバンド・インデックスｋに従って配列される場合、隣接した周波数を有するサブバンドだけを含むサブグループは、サブバンド・インデックスｋを有するサブバンドに関連し、それは、２つの配列されたサブバンド・インデックス間の最大差が方程式（３）との関連で説明されるように＋／−ｌに等しいように、配列することができる。換言すれば、それぞれの周波数バンドは、サブバンドのサブグループまたはインターバルによって表すことができ、それは、サブバンドの上位集合である。しかしながら、サブバンドのサブグループが正確に１つのサブバンドを含んでもよい点に留意すべきである。 In other words, if the subbands are arranged according to a subband index k that increases further simultaneously according to their center frequencies in increasing order, the subgroup containing only subbands with adjacent frequencies is subband indexed. associated with subbands having k, which is arranged such that the maximum difference between two arranged subband indices is equal to +/− l as described in connection with equation (3) Can do. In other words, each frequency band can be represented by a subgroup or interval of subbands, which is a superset of subbands. However, it should be noted that a subgroup of subbands may contain exactly one subband.

前述のように、スペクトル白色化の枠組みにおいて、特定の数Ｐの周波数バンド、サブバンドのサブグループまたはインターバルは、区別される。原則として、サブバンドのサブグループの数ｐは、整数であり、それは、それぞれのサブグループが少なくとも１つのサブバンドを含みさらにそれぞれのサブバンドが正確にサブバンドの１つのサブグループに属する制約のためサブバンドの数Ｌより小さい。Ｌ＝６４のサブバンドにおいて作動するフィルタ・システムの場合に、サブバンドのサブグループの典型的な数Ｐは、２８になるように選択することができる。しかしながら、この数は、上述のように制限されない。サブバンドのサブグループの対応する数Ｐ（例えばＰ＝３２）は、周波数ドメインにおいて知覚的に関連したインターバルを表す心理音響モデルに基づいて選択することができる。 As mentioned above, in the spectral whitening framework, a certain number P of frequency bands, subband subgroups or intervals are distinguished. In principle, the number p of subgroups of subbands is an integer, which means that each subgroup contains at least one subband, and each subband belongs to exactly one subgroup of subbands. Therefore, it is smaller than the number L of subbands. For a filter system operating in L = 64 subbands, a typical number P of subband subgroups can be selected to be 28. However, this number is not limited as described above. A corresponding number P of subgroups of subbands (eg, P = 32) can be selected based on a psychoacoustic model representing perceptually relevant intervals in the frequency domain.

したがって、白色化は、多くの現実のフィルタおよびオーディオ・システムにおいて、任意に心理音響モデルに関する人間の耳の知覚特性に基づいて、エネルギーのより低いスペクトル部分からより高いスペクトル部分への転送を導く。 Thus, whitening leads to a transfer from a lower spectral portion of energy to a higher spectral portion, in many real filters and audio systems, optionally based on the perceptual characteristics of the human ear with respect to psychoacoustic models.

しかしながら、白色化モジュール４０２の異なる実施は、絶対値表現３０３の枠組みにおいて容易に実施することができる。具体的には、他の実施は、方程式（４）に従ってサブバンドのそれぞれのサブグループに含まれる全てのサブバンドに基づいて白色化を実行する代わりに、インデックスｋを有するそれぞれのサブバンドに対して評価表現Ａ（ｎ，ｋ）を個々に白色化する可能性を含む。さらに、方程式（４）に示されるように最大値を減算する代わりに、白色化は、評価表現Ａ（ｎ，ｋ）の全ての値を分割しさらにそれによりそれぞれのサブバンドの最大に関してまたはサブバンドのそれぞれのサブグループの最大値に関して評価表現の全ての値を正規化することによって、実行することができる。さらに、評価表現を分割することによる記載されている正規化は、それぞれの評価表現Ａ（ｎ，ｋ）の全ての値の合計が（それぞれの個々のサブバンドに関してまたはサブバンドのそれぞれのサブグループに関して）実行することができるように、実行することもできる。この場合、第１のステップにおいて、それぞれのサブバンドまたはサブバンドのそれぞれのサブグループに関して評価表現の全ての値の合計が決定され、その後に、方程式（４）に従って減算することまたはそれぞれの合計値を有する評価表現の値を分割することが続く。 However, different implementations of the whitening module 402 can be easily implemented in the framework of the absolute value representation 303. Specifically, other implementations may be performed for each subband with index k instead of performing whitening based on all subbands included in each subgroup of subbands according to equation (4). The evaluation expression A (n, k) is individually whitened. Further, instead of subtracting the maximum value as shown in equation (4), whitening divides all values of the evaluation expression A (n, k) and thereby with respect to the maximum of each subband or sub This can be done by normalizing all values of the evaluation expression with respect to the maximum value of each subgroup of the band. Furthermore, the normalization described by dividing the evaluation expression is such that the sum of all values of each evaluation expression A (n, k) is (for each individual subband or for each subgroup of subbands). It can also be performed just as it can. In this case, in a first step, the sum of all values of the evaluation expression is determined for each subband or for each subgroup of subbands, followed by subtraction according to equation (4) or the respective sum value. Followed by splitting the value of the evaluation expression having.

要約すると、上で概説される実施形態において、調べることおよび選択することは、フィルタ・タップでフィルタ・インパルス応答値の絶対値に基づく。そのため、この実施形態において、フィルタ・インパルス応答値は、より高い値を含む少なくとも１つを選択する場合、フィルタ・タップの絶対値に関する比較に基づいて選択されるかまたは選択されない。異なる実施形態において、必要に応じて、フィルタ・タップを比較することまたは調べることが、他の数学的手段の適用に基づいてもよい。フィルタ・タップが実数値である場合、原則として、数学的手段の適用は必要でないが、絶対値を計算しまたは決定することが実施されてもよい。 In summary, in the embodiment outlined above, examining and selecting is based on the absolute value of the filter impulse response value at the filter tap. Thus, in this embodiment, the filter impulse response value is selected or not selected based on a comparison on the absolute value of the filter tap when selecting at least one that includes a higher value. In different embodiments, comparing or examining the filter taps as needed may be based on the application of other mathematical means. If the filter tap is real-valued, in principle, no mathematical means need be applied, but calculating or determining the absolute value may be implemented.

複素数値のフィルタ・タップの場合、いくつかの数学的手段の適用が望ましい。例としては、絶対値を導き出すこと、または、複素数の平面において所定の若しくは明確な方向（例えば正の実数の方向）に関してフィルタ・タップの角度若しくは位相を導き出すことであり得る。さらに、実数部、実数部の絶対値、虚数部、虚数部の絶対値を決定すること、または、それぞれの複素数を（任意に正の）実数にマップするその他の関数も、原則として適用することができる。 In the case of complex-valued filter taps, it is desirable to apply some mathematical means. Examples may be deriving absolute values or deriving the filter tap angle or phase with respect to a predetermined or unambiguous direction (eg positive real direction) in a complex plane. In addition, determine the real part, the absolute value of the real part, the imaginary part, the absolute value of the imaginary part, or other functions that map each complex number to a (optionally positive) real number as a rule. Can do.

図４に示される実施形態において、白色化モジュール４０２による出力として白色化された評価表現Ａ_W（ｎ，ｋ）は、マスク・ジェネレータ３０４に提供され、それは、白色化された評価表現に基づいてフィルタ・マスクまたはマスクＭ（ｋ）を作る。評価表現についての実際の白色化モジュール４０２のため、マスク・ジェネレータ３０４は、最も多くの（知覚的に）関連したフィルタ係数を選択することができる。フィルタ・マスクは、図４に示される実施形態において、１セットの０および１であり、Ｍ（ｎ，ｋ）＝１は、対応するフィルタ・タップまたはフィルタ・インパルス応答値が用いられまたは保持されるように選択されることを示す。したがって、値Ｍ（ｎ，ｋ）＝０は、サンプル・インデックスまたは時間インデックスｎとサブバンド・インデックスｋとによって識別される対応するフィルタ・タップまたはフィルタ・インパルス応答値が選択されず、そのため、用いられないことを示す。換言すれば、特定のフィルタ・インパルス応答値は、無視されるかまたはゼロに設定される。 In the embodiment shown in FIG. 4, the whitened evaluation expression A _W (n, k) as output by the whitening module 402 is provided to the mask generator 304, which is based on the whitened evaluation expression. Create a filter mask or mask M (k). Due to the actual whitening module 402 for the evaluation representation, the mask generator 304 can select the most (perceptually) relevant filter coefficients. The filter mask is a set of 0s and 1s in the embodiment shown in FIG. 4, where M (n, k) = 1 is used or retained with the corresponding filter tap or filter impulse response value. To be selected. Thus, the value M (n, k) = 0 is not used because the corresponding filter tap or filter impulse response value identified by the sample index or time index n and the subband index k is not selected. Indicates that it is not possible. In other words, the particular filter impulse response value is ignored or set to zero.

マスク・ジェネレータ３０４の具体的な実施は、フィルタ・コンプレッサ１０２の実施形態によって大幅に異なり得る。図４に示される実施形態において、マスク・ジェネレータは、例えば、フィルタ・マスクＭ（ｎ，ｋ）＝１の対応する値を設定するがフィルタ・マスクの残りの値が０に設定されることによる白色化された評価表現Ａ_W（ｎ，ｋ）に基づいて、特定の数のインパルス応答値を選択することができる。特定の絶対数のインパルス応答値を選択することは別として、１セットのサブバンド・フィルタ応答Ｈ（ｎ，ｋ）によって与えられる全体の数のインパルス応答値に関する相対数も可能である。それぞれの入力サブバンド・フィルタ・インパルス応答が１６個の非ゼロの、ゼロになることのないまたは非自明なフィルタ・タップを含むＬ＝６４のＱＭＦサブバンド実施の場合における具体的な例において、入力サブバンド・フィルタ応答の全体のマトリクスは、１０２４個のインパルス応答値を含む６４・１６のマトリクスによって与えられる。この例において、マスク・ジェネレータ３０４は、例えば、特定の所定の数のインパルス応答値（例えば白色化された評価表現によって提供されるように最も大きい絶対値による２５６個のエレメント）を選択することができ、または、マスク・ジェネレータ３０４は、全体の数のフィルタ・インパルス応答に関して所定のまたは特定の比率（相対数）のフィルタ・インパルス応答（例えば全体の数の２５%のフィルタ応答値）を選択することができる。いずれの場合においても、残りのインパルス応答値は、フィルタ・マスクＭ（ｎ，ｋ）の対応する値をゼロ（Ｍ（ｎ，ｋ）＝０）に等しく設定することによって無視されるかまたは選択されない。 The specific implementation of the mask generator 304 may vary greatly depending on the embodiment of the filter compressor 102. In the embodiment shown in FIG. 4, the mask generator sets, for example, the corresponding value of the filter mask M (n, k) = 1, but the remaining value of the filter mask is set to 0 A particular number of impulse response values can be selected based on the whitened evaluation expression A _W (n, k). Apart from selecting a specific absolute number of impulse response values, relative numbers for the entire number of impulse response values given by a set of subband filter responses H (n, k) are also possible. In a specific example in the case of an L = 64 QMF subband implementation where each input subband filter impulse response includes 16 non-zero, non-zero or non-trivial filter taps, The entire matrix of input subband filter responses is given by a 64 · 16 matrix containing 1024 impulse response values. In this example, the mask generator 304 may select, for example, a particular predetermined number of impulse response values (eg, 256 elements with the largest absolute value as provided by the whitened evaluation representation). The mask generator 304 can select a predetermined or specific ratio (relative number) of filter impulse responses (eg, 25% of the total number of filter response values) with respect to the total number of filter impulse responses. be able to. In any case, the remaining impulse response values are ignored or selected by setting the corresponding value of the filter mask M (n, k) equal to zero (M (n, k) = 0). Not.

フィルタ・コンプレッサ１０２のさらなる実施形態において、マスク・ジェネレータ３０４は、選択される絶対数のインパルス応答値を示しまたは全体の数のインパルス応答値に関してインパルス応答値の比率を示す信号を受信するように構成されてもよい。フィルタ・コンプレッサ１０２のそのような実施形態において、圧縮比は、前に言及された図を調整することによって調整することができる。 In a further embodiment of the filter compressor 102, the mask generator 304 is configured to receive a signal indicating the selected number of impulse response values or indicating the ratio of the impulse response values with respect to the total number of impulse response values. May be. In such an embodiment of the filter compressor 102, the compression ratio can be adjusted by adjusting the previously mentioned figures.

さらに、マスク・ジェネレータ３０４は、代わりにまたは付加的に、異なる基準に基づいてそれぞれのフィルタ・インパルス応答値を選択するように構成されてもよい。例えば、マスク・ジェネレータ３０４は、サブバンドごとに所定の、一定の、プログラム可能なまたは適合性のある数のインパルス応答値（例えばそれぞれのサブバンドに対して評価表現に関する最大値を有する３つのインパルス応答値）を選択するように構成されてもよい。さらに、マスク・ジェネレータ３０４は、例えば、全てのインパルス応答値が選択されるように、閾値基準が所定の、一定の、調整可能なまたはプログラム可能な閾値より大きい対応する評価表現値を保持するように構成されてもよい。さらなる実施態様において、隣接したインパルス応答値で、それぞれの値との比較に基づいて、インパルス応答値を選択することができるように、マスク・ジェネレータ３０４を構成することが望ましい。例えば、マスク・ジェネレータ３０４は、それぞれの値が（任意に白色化された）評価表現を考慮して隣接した値と比較して一定の、所定の、プログラム可能なまたは調整可能な比率より小さい（例えば２５%より小さい）場合、フィルタ・インパルス応答値が選択されないように構成されてもよい。しかしながら、他の選択スキームも実施され得る。 Further, the mask generator 304 may alternatively or additionally be configured to select each filter impulse response value based on different criteria. For example, the mask generator 304 may determine a predetermined, constant, programmable or compatible number of impulse response values for each subband (eg, three impulses having a maximum value for the evaluation representation for each subband. Response value) may be selected. Further, the mask generator 304 holds a corresponding evaluation expression value that is greater than a predetermined, constant, adjustable or programmable threshold, for example, so that all impulse response values are selected. May be configured. In a further embodiment, it may be desirable to configure the mask generator 304 such that impulse response values can be selected based on a comparison with the respective values of adjacent impulse response values. For example, the mask generator 304 may have a value that is less than a predetermined, programmable, or adjustable ratio that is constant relative to adjacent values taking into account (optionally whitened) evaluation expressions ( If, for example, less than 25%, the filter impulse response value may not be selected. However, other selection schemes can be implemented.

しかしながら、白色化のため、方程式（４）との関連で記載されるように、サブバンドのそれぞれのサブグループまたは個々のサブバンドに基づいて、少なくとも１つのインパルス応答値は、選択されたインパルス応答値の数がサブバンドによってまたはサブグループによって大幅に異なるが、具体的な実施に応じて、サブバンドのそれぞれのサブグループにおいてまたはそれぞれのサブバンドにおいて選択される。例えば、評価表現値の対応するサブセットの最大値で評価表現Ａ（ｎ，ｋ）を除算することによって実施される白色化の場合、マスク・ジェネレータ３０４の上述の実施において、少なくとも１つのフィルタ・インパルス応答値は、図１４との関連で説明されるように、それぞれのサブバンドにおいてまたはサブバンドのそれぞれのサブグループにおいて選択される。 However, for whitening, as described in connection with equation (4), based on each subgroup of subbands or individual subbands, at least one impulse response value is selected from the selected impulse response. The number of values varies greatly from subband to subband or from subgroup to subband, but is selected in each subgroup of subbands or in each subband, depending on the specific implementation. For example, in the case of whitening implemented by dividing the evaluation expression A (n, k) by the maximum value of the corresponding subset of evaluation expression values, in the above implementation of the mask generator 304, at least one filter impulse. The response value is selected in each subband or in each subgroup of subbands, as described in connection with FIG.

結果として、絶対値表現モジュール３０３およびマスク・ジェネレータ３０４の相互作用は、ｎ−ｋ平面（参照図３における部分図３０２参照）においてフィルタ・インパルス応答価値の重要なエリアへの集合、および、ｎ−ｋ平面の知覚的に関連したエリア間において「減圧」または「空気」の「圧縮」を導く。したがって、関連したインパルス応答値は、マスクＭ（ｎ，ｋ）を設定することによって無視される。 As a result, the interaction between the absolute value representation module 303 and the mask generator 304 is the set of important values of the filter impulse response value in the nk plane (see the partial diagram 302 in the reference FIG. 3) and the n− It leads to “decompression” or “compression” of “air” between perceptually related areas of the k-plane. Therefore, the associated impulse response value is ignored by setting the mask M (n, k).

この実施形態において、マスクは、ゼロまたは１であるエントリーからなる。ゼロを有するエントリーは、どのフィルタ係数が放棄されるべきものであるかを表し、１を有するエントリーは、どのフィルタ係数が保持（選択）されるべきものであるかを表す。 In this embodiment, the mask consists of entries that are zero or one. An entry with a zero indicates which filter coefficients are to be discarded, and an entry with a 1 indicates which filter coefficients are to be retained (selected).

いずれの場合においても、Ｇ_maxは、利得補償についての上限であり、さらにεは、ゼロでの除算を回避するために含まれる小さい正の数である。したがって、Ｇ_maxおよびεの両方は、数であり、それらは、方程式（６）および（８）において括弧でくくられた括弧における２つの項の最小によるように、ゼロによる除算を防止するため（すなわちε＞０）に、さらに、利得計算機モジュール４０４によってサブバンドに適用される利得を最大利得Ｇ_maxによって定義されるような値に制限するために、利得計算機４０４の数値実施に役立ち、それぞれの利得Ｇ（ｋ）およびＧ（ｐ）は、Ｇ_maxの値に制限される。 In any case, G _max is an upper bound for gain compensation, and ε is a small positive number included to avoid division by zero. Thus, both G _max and ε are numbers, and they prevent division by zero, such as by the minimum of two terms in parentheses bracketed in equations (6) and (8) ( I.e., ε> 0), and further, to limit the gain applied to the subbands by the gain calculator module 404 to a value as defined by the maximum gain G _max , the numerical calculation of the gain calculator 404 is useful for each Gains G (k) and G (p) are limited to the value of G _max .

したがって、利得計算機モジュール４０４は、少なくともいくつかの入力サブバンド入力応答のマスキングの過程において失われるエネルギーを補償するためにエネルギーに関してマスクされたフィルタ・タップＨ_M（ｎ，ｋ）を正規化する。換言すれば、フィルタ・デシメータ４０３の枠組みにおいてマスキングのため、マスクされたサブバンド・フィルタ・インパルス応答Ｈ_M（ｎ，ｋ）に対応するサブバンド・フィルタ入力応答によってフィルタにかけられる信号は、サブバンド・フィルタ・インパルス応答Ｈ（ｎ，ｋ）を使用するサブバンド・フィルタと比較して、より小さいエネルギーを有する。 Accordingly, gain calculator module 404 normalizes the filter taps H _M (n, k) masked with respect to energy to compensate for energy lost in the process of masking at least some input subband input responses. In other words, for masking in the filter decimator 403 framework, the signal filtered by the subband filter input response corresponding to the masked subband filter impulse response H _M (n, k) is subband Has a smaller energy compared to a subband filter using a filter impulse response H (n, k).

しかしながら、利得計算機モジュール４０４は、異なる利得スキームを適用するように構成されてもよい。一例として、エネルギーではなくむしろサブバンド・フィルタ・インパルス応答の絶対値の直接比較は、利得係数を決定するために使用することができる。付加的にまたは代わりに、利得係数Ｇは、方程式（６）および（８）との関連で説明されるように、個々のサブバンドまたはサブバンドの個々のサブグループのインパルス応答値よりむしろ、全体の数のサブバンド・フィルタ・インパルス応答値に基づいて、決定することができる。さらに、利得計算モジュール４０４が、必要なコンポーネントでなく、むしろ任意のコンポーネントである点に留意すべきである。 However, the gain calculator module 404 may be configured to apply different gain schemes. As an example, a direct comparison of the absolute values of the subband filter impulse response rather than energy can be used to determine the gain factor. Additionally or alternatively, the gain factor G may be an overall rather than an impulse response value for individual subbands or individual subgroups of subbands, as described in connection with equations (6) and (8). Can be determined based on the number of subband filter impulse response values. Furthermore, it should be noted that the gain calculation module 404 is not a required component but rather an optional component.

フィルタ・インパルス応答コンストラクタまたはフィルタ計算機モジュール３０５は、本発明のさらなる実施形態において、上述のように非選択のサブバンド・フィルタ・インパルス応答値をゼロに設定することによってだけでなく、圧縮サブバンド・フィルタ・インパルス応答を構成することができる。具体的な実施に応じて、フィルタ・インパルス応答コンストラクタ３０５は、例えば、圧縮サブバンド・フィルタ・インパルス応答を構成するために、適切な選択されまたは決定されたサブバンド・フィルタ・インパルス応答値を重み付け、コピーし、または取得することによって、これを達成することができる。 The filter impulse response constructor or filter calculator module 305, in a further embodiment of the present invention, not only by setting the unselected subband filter impulse response value to zero as described above, A filter impulse response can be constructed. Depending on the specific implementation, the filter impulse response constructor 305 weights the appropriate selected or determined subband filter impulse response values, eg, to construct a compressed subband filter impulse response. This can be accomplished by copying, or obtaining.

これに関連して、決定されていないかまたは選択されていないフィルタ・インパルス応答値をたとえ無視するかまたは含まないでも、時間においてフィルタの圧縮を導かない点に留意すべきである。本願記載の枠組みにおいて、選択されていないかまたは決定されていないフィルタ・インパルス応答値を放置し、無視しまたは用いないことは、遅延演算子ｚ−１の係数への単なる修正は別として対応する（ＱＭＦフィルタバンク）多項式の多項式の個々の加数の順序の著しい変化を導かない。換言すれば、フィルタ・タップまたはフィルタ・インパルス応答を無視し、放置し、ゼロに設定しまたはその他にさらに注意しないことによって、遅延演算子ｚ−１の累乗に関して、フィルタ・タップの新しい分布を導かない。選択されずまたは決定されないようなフィルタ・インパルス応答値に続く、選択されまたは決定されるフィルタ・タップまたはフィルタ・インパルス応答値は、遅延演算子の累乗に関して変えられない。 In this connection, it should be noted that even if ignoring or not including a filter impulse response value that has not been determined or selected, does not lead to compression of the filter in time. In the framework of the present application, leaving a filter impulse response value that is not selected or not determined, neglected or not used corresponds to a simple modification to the coefficient of the delay operator z−1. (QMF filter bank) Does not lead to significant changes in the order of the individual addends of the polynomial. In other words, ignoring the filter tap or filter impulse response, leaving it, setting it to zero, or not paying extra attention to it, leads to a new distribution of filter taps with respect to the power of the delay operator z−1. No. The filter tap or filter impulse response value that is selected or determined following the filter impulse response value that is not selected or determined is not changed with respect to the power of the delay operator.

換言すれば、フィルタ・インパルス応答コンストラクタ３０５によって構成されるように圧縮サブバンド・フィルタ・インパルス応答は、非選択のフィルタ・インパルス応答値のフィルタにかけられたタップに対応するゼロ値の値を含むことができ、または、圧縮サブバンド・フィルタ・インパルス応答は、それぞれの非選択のフィルタ・インパルス応答値を全く含まないことができる。さらに換言すれば、フィルタ・インパルス応答コンストラクタ３０５は、例えば、増加した数のゼロ値を有する値を除いて、入力サブバンド・フィルタ・インパルス応答として同じ数のサブバンド・フィルタ・インパルス応答値を原則として有する圧縮サブバンド・フィルタ・インパルス応答を構成することができ、または、圧縮サブバンド・フィルタ・インパルス応答は、フィルタ・インパルス応答コンストラクタ３０５が選択された値をコピーしさらに非選択の値を無視するように、より短い全体の長さを有することができる。 In other words, the compressed subband filter impulse response as configured by the filter impulse response constructor 305 includes a zero value corresponding to the filtered tap of the unselected filter impulse response value. Or the compressed subband filter impulse response may not include any respective unselected filter impulse response values. In other words, the filter impulse response constructor 305, in principle, takes the same number of subband filter impulse response values as the input subband filter impulse response, except for values having an increased number of zero values. The compressed subband filter impulse response can be configured as, or the compressed subband filter impulse response is copied by the filter impulse response constructor 305 and ignores the unselected values As such, it can have a shorter overall length.

実数値のフィルタ・インパルス応答値が複素数値のフィルタ・インパルス応答値と比較して著しい煩雑性低減を導くので、フィルタ・インパルス応答コンストラクタ３０５は、選択されたフィルタ・インパルス応答値のいくつかの絶対値を有利に出力することもできる。この動作モードは、人間の聴覚が位相関係に対して感受性が低いより高い周波数に対応するサブバンドにおいて特に魅力的である。 Since the real-valued filter impulse response value leads to a significant complexity reduction compared to the complex-valued filter impulse response value, the filter-impulse response constructor 305 may provide some absolute value of the selected filter-impulse response value. The value can also be output advantageously. This mode of operation is particularly attractive in subbands corresponding to higher frequencies where human hearing is less sensitive to phase relationships.

結果として、境界周波数より上の中心周波数に対応するサブバンドのサブバンド・インパルス応答値は、上述のエレメントのうちの少なくとも１つの絶対値、虚数部、実数部、位相、線形結合、多項式組合せまたは実数値式と任意に置換することができる。また、複素数値の虚数部は、本願記載の枠組みにおいて実数値の数であると考慮される。具体的な実施に応じて、境界周波数は、１ｋＨｚ〜１０ｋＨｚの範囲にあり得るが、多くのアプリケーションにおいて、１ｋＨｚ〜５ｋＨｚまたは１ｋＨｚ〜３ｋＨｚの間の範囲において境界周波数の実施は、人間の典型的な聴覚特性を考慮して利用することができる。さらに、フィルタ・コンプレッサの具体的な実施に応じて、複素数値のフィルタ・インパルス応答値に基づく実数値の値による複素数値のフィルタ・インパルス応答値の記載されている置換は、選択され若しくは決定された、選択されていない、または、決定されていないフィルタ・インパルス応答値に応じて実施することができる。代わりにまたは付加的に、境界周波数の上の中心周波数に対応するサブバンドに属するフィルタ・インパルス応答値は、通常、複素数値のフィルタ・インパルス応答値に基づいて、対応する実数値の値と置換することができる。これに関連して、決定されまたは選択されたフィルタ・インパルス応答値を用いることが、対応するフィルタ・インパルス応答値を置換するために、そのようなフィルタ・インパルス応答値に基づいて、（例えば実数値の）値を用いることを含む点に留意すべきである。 As a result, the subband impulse response value of the subband corresponding to the center frequency above the boundary frequency is the absolute value, imaginary part, real part, phase, linear combination, polynomial combination or at least one of the above-mentioned elements Can be arbitrarily replaced with a real-valued expression. Also, the imaginary part of a complex value is considered to be a real number in the framework described herein. Depending on the specific implementation, the boundary frequency may be in the range of 1 kHz to 10 kHz, but in many applications the implementation of the boundary frequency in the range between 1 kHz to 5 kHz or 1 kHz to 3 kHz is typical of humans. It can be used in consideration of auditory characteristics. Further, depending on the specific implementation of the filter compressor, the described replacement of the complex-valued filter impulse response value with a real-valued value based on the complex-valued filter impulse response value is selected or determined. It can also be implemented in response to filter impulse response values that have not been selected or have been determined. Alternatively or additionally, filter impulse response values belonging to the subband corresponding to the center frequency above the boundary frequency are usually replaced with corresponding real-valued values based on complex-valued filter impulse response values can do. In this context, using a determined or selected filter impulse response value may be based on such a filter impulse response value (e.g., an actual value) to replace the corresponding filter impulse response value. It should be noted that this includes the use of numerical values.

図５は、マルチプル・フィルタに同時に作動する本発明によるフィルタ・コンプレッサ５０１のさらなる実施形態を示す。図５において、異なる実施形態は、概説される。ここで、マルチプル・フィルタ（ν＝０、・・・、（Ｎ−１）によって示されるＮ個のフィルタ）は、Ｎが正の整数であり、フィルタ・コンプレッサ５０１の実施形態に入力され、それぞれのフィルタは、個々の絶対値表現モジュール３０３に入力され、Ｎ個の表現は、フィルタ・マスク・ジェネレータ５０２に入力される。 FIG. 5 shows a further embodiment of a filter compressor 501 according to the present invention operating simultaneously on multiple filters. In FIG. 5, the different embodiments are outlined. Here, multiple filters (N filters denoted by ν = 0,..., (N−1)), where N is a positive integer, are input to the filter compressor 501 embodiment, Are input to the individual absolute value expression module 303, and the N expressions are input to the filter mask generator 502.

具体的には、図５に示されるフィルタ・コンプレッサ５０１の実施形態は、１セットのＮ個のフィルタ・コンバータ１０１に接続されまたは結合され、それに１セットの実数値の時間ドメイン・インパルス応答Ｈν（ｎ，ｋ）が供給され、前に説明されるように、ν＝０、・・・、（Ｎ−１）は、時間ドメインにおいて対応するフィルタのインデックスである。例えば、ＨＲＴＦのようなシステムの枠組みにおいて５チャンネル入力信号の場合において、５つの入力チャンネルのそれぞれに対してさらに２つのヘッドホン・チャンネル（左右）のそれぞれに対して、個々の時間ドメイン・フィルタは、Ｎ＝１０の総数の時間ドメイン・フィルタを導くように使用される。 Specifically, the embodiment of the filter compressor 501 shown in FIG. 5 is connected or coupled to a set of N filter converters 101 to which a set of real-valued time domain impulse responses Hν ( n, k) is supplied and, as explained previously, ν = 0,..., (N−1) is the index of the corresponding filter in the time domain. For example, in the case of a five channel input signal in the framework of a system like HRTF, for each of the five input channels, for each of the two headphone channels (left and right), the individual time domain filters are: Used to derive a total time domain filter of N = 10.

換言すれば、図５に示されるフィルタ・コンプレッサ５０１には、複数セットのインパルス応答が提供され、複数セットのフィルタ・インパルス応答のそれぞれのセットは、図５に図示される場合、異なるフィルタ・コンバータ１０１によって提供される。しかしながら、個々のフィルタ・コンバータ１０１によって提供されるように１セットのフィルタ・インパルス応答に関して、１セットのフィルタ・インパルス応答は、Ｌ個の個々のフィルタ・インパルス応答を含み、それぞれは、特定の数のフィルタ・タップまたはフィルタ・インパルス応答値を有する。中心周波数との関連で前に説明されるように、個々のサブバンドに対応するそれぞれのインパルス応答は、中心周波数に関連し、それによって、中心周波数は、複数の中心周波数を形成する。 In other words, the filter compressor 501 shown in FIG. 5 is provided with multiple sets of impulse responses, each set of multiple sets of filter impulse responses being different filter converters as illustrated in FIG. 101. However, with respect to a set of filter impulse responses as provided by an individual filter converter 101, a set of filter impulse responses includes L individual filter impulse responses, each of which is a specific number. Filter tap or filter impulse response value. As previously described in connection with the center frequency, each impulse response corresponding to an individual subband is associated with a center frequency, whereby the center frequency forms a plurality of center frequencies.

また、同じサブバンド・インデックスｋに対応するが、インデックスνによって示されるように異なるセットのフィルタ・インパルス応答に属するフィルタ・インパルス応答は、同じ中心周波数に対応する。換言すれば、（１セットのフィルタ・インパルス応答によって定義されるように）複数の中心周波数のそれぞれ中心周波数に、少なくとも圧縮の前に、それぞれのセットのフィルタ・インパルス応答において（正確に）１つのフィルタ・インパルス応答を対応する。 Also, filter impulse responses that correspond to the same subband index k, but belong to different sets of filter impulse responses as indicated by index ν, correspond to the same center frequency. In other words, to each center frequency of the plurality of center frequencies (as defined by a set of filter impulse responses), at least prior to compression, (exactly) one in each set of filter impulse responses. Corresponds to the filter impulse response.

それぞれのフィルタ・コンバータ１０１は、それぞれの時間ドメイン・フィルタに対して１セットの複素数値のサブバンド・フィルタ・インパルス応答Ｈν（ｎ，ｋ）を提供し、それは、図５に示されるフィルタ・コンプレッサ５０１の実施形態に提供される。Ｎ個の異なる時間ドメイン・フィルタのためのそれぞれのサブバンド・フィルタ・インパルス応答は、個々の絶対値表現モジュール３０３に提供され、それは、Ｎ個の時間ドメイン・フィルタのそれぞれのための絶対値表現または評価表現をフィルタ・マスク・ジェネレータ５０２に提供する。絶対値表現モジュール３０３は、同じ引用符号によって示されるように、本願において概説される本発明のフィルタ・コンプレッサの他の実施形態のうちの１つから選ぶことができる。 Each filter converter 101 provides a set of complex-valued subband filter impulse responses Hν (n, k) for each time domain filter, which is the filter compressor shown in FIG. 501 embodiments. Each subband filter impulse response for N different time domain filters is provided to an individual absolute value representation module 303, which is an absolute value representation for each of the N time domain filters. Alternatively, an evaluation expression is provided to the filter mask generator 502. The absolute value representation module 303 can be selected from one of the other embodiments of the filter compressor of the present invention outlined in this application, as indicated by the same reference signs.

この（共同の）絶対値表現は、正確に前の実施形態において単一のフィルタ・マスク・ジェネレータ３０４におけるように単一のマスク生成Ｍ（ｎ，ｋ）のための基礎を形成する。白色化ステップが実行される場合において、これは、個々の絶対値表現モジュール３０３のために実行することもできまたは共同の絶対値表現のために一度だけ実行することもできる。 This (joint) absolute value representation forms the basis for a single mask generation M (n, k) exactly as in the single filter mask generator 304 in the previous embodiment. In the case where a whitening step is performed, this can be performed for the individual absolute value representation module 303 or only once for the joint absolute value representation.

図１５との関連で、フィルタ・コンプレッサ５０１の実施形態が述べられ、そこにおいて、（スペクトル）白色化がそれぞれのフィルタν＝０、・・・、（Ｎ−１）に対して個々に実行される。この実施形態におけるフィルタ・マスク・ジェネレータ５０２は、全てのフィルタのＮ個の絶対値表現に基づいて、全てのフィルタのための単一のフィルタ・マスクＭ（ｎ，ｋ）を作る。これは、本発明の実施形態の大きな利点であり、その理由は、フィルタ・マスク・ジェネレータ５０２が、圧縮フィルタが後の状態のおいてどのように組み合わされるかについて考慮することができるからである。それぞれの元のフィルタは、概説されるように、フィルタ計算機３０５に入力され、その後に、フィルタ・コンプレッサは、それぞれのフィルタ計算機に同じマスクＭ（ｎ，ｋ）が提供されるように、Ｎ個の新しいフィルタＡWν（ｎ，ｋ）を作り出す。 In the context of FIG. 15, an embodiment of a filter compressor 501 is described, in which (spectral) whitening is performed individually for each filter ν = 0,..., (N−1). The The filter mask generator 502 in this embodiment creates a single filter mask M (n, k) for all filters based on the N absolute value representations of all filters. This is a major advantage of embodiments of the present invention because the filter mask generator 502 can take into account how the compression filters are combined in a later state. . Each original filter is input to a filter calculator 305, as outlined, after which the filter compressor provides N filters so that each filter calculator is provided with the same mask M (n, k). Create a new filter AWν (n, k).

方程式（１１）と比較して、方程式（１１´）の（共同の）絶対値表現Ａ（ｎ，ｋ）は、等しく分散された重み係数ω（ν）＝１／Ｎを定義することによって方程式（１１）の結果に変換することができる。換言すれば、方程式（１１）による絶対値表現の計算は、インデックスνによって示されるそれぞれのフィルタの知覚的な重要性の重み付けを可能にするように、より大きな柔軟性を提供する方程式（１１´）による絶対値表現の特殊な形式を表す。 Compared to equation (11), the (joint) absolute value representation A (n, k) of equation (11 ′) is obtained by defining an equally distributed weighting factor ω (ν) = 1 / N. It can convert into the result of (11). In other words, the calculation of the absolute value representation according to equation (11) provides more flexibility so as to allow weighting of the perceptual importance of each filter indicated by the index ν. ) Represents a special form of absolute value representation.

時間ドメインにおいてＮ個の個々のフィルタのそれぞれに対して同じフィルタ・マスクＭ（ｎ，ｋ）を用いることによって、フィルタ・コンプレッサ５０１の実施形態は、フィルタ・コンプレッサ５０１の後のＮ個の個々のサブバンドの後処理さえ、他のフィルタのうちの１つにおいて対応する選択されたインパルス応答値を有しない関連したインパルス応答値を有するエントリーを有する結果として生じる圧縮サブバンド・フィルタ・インパルス応答を導かないように、Ｎ個のフィルタのそれぞれに対して１セットの圧縮サブバンド・フィルタ・インパルス応答を作り出すことができる。図５に示される実施形態のマスク・ジェネレータ５０２を図３および図４に示される実施形態のマスク・ジェネレータ３０４と比較して、マスク・ジェネレータ５０２は、時間ドメインだけにおいてＮ個の個々のフィルタのためのＮ個の入力サブバンド・フィルタ・インパルス応答が提供されるにもかかわらず、Ｎ個のサブバンド・フィルタ・インパルス応答の全てを示す単一のマスクＭ（ｎ，ｋ）を作り出す点に留意することが重要である。 By using the same filter mask M (n, k) for each of the N individual filters in the time domain, an embodiment of the filter compressor 501 makes the N individual after the filter compressor 501 Even the subband post-processing leads to a resulting compressed subband filter impulse response having entries with associated impulse response values that do not have a corresponding selected impulse response value in one of the other filters. A set of compressed subband filter impulse responses can be created for each of the N filters. Compared to the mask generator 304 of the embodiment shown in FIGS. 3 and 4 with the mask generator 502 of the embodiment shown in FIG. 5, the mask generator 502 uses N individual filters only in the time domain. In spite of providing N input subband filter impulse responses for generating a single mask M (n, k) showing all of the N subband filter impulse responses. It is important to note.

フィルタ・コンプレッサ５０１のさらなる実施形態において、異なるマスク・ジェネレータ５０２が使用でき、それは、時間ドメインにおいてＮ個のフィルタの全てに対して共通の評価表現を提供するために異なるスキームを原則として用いることができる。換言すれば、方程式（１１）に示すように、平均を適用することは別として、絶対値表現モジュール３０３によって提供されるように個々の評価表現は、例えば、関連するサブバンドに関する重み付けが実施できる、それぞれの値を合計することによって、それぞれの値を線形結合することによって、または、評価表現のそれぞれの値のさらに複雑な組合せ（例えば二次または高次の組合せ）を使用することによって、単一の評価表現に組み合わせることができる。 In a further embodiment of the filter compressor 501, different mask generators 502 can be used, which in principle use different schemes to provide a common evaluation representation for all N filters in the time domain. it can. In other words, apart from applying an average, as shown in equation (11), the individual evaluation representations can be weighted, for example, with respect to the associated subbands as provided by the absolute value representation module 303. Simply by summing each value, by linearly combining each value, or by using more complex combinations (eg, secondary or higher order combinations) of each value in the evaluation expression. Can be combined into one evaluation expression.

すでに前に説明されるように、バイノーラル・デコーダ６０２は、１０個（２つの音声出力チャネル（ステレオ）のための５つの音声入力チャンネル）を４個のＨＲＴＦフィルタに組み合わせ、それは、ステレオ入力信号６０３に直ちに適用することができる。しかしながら、ＨＲＴＦフィルタは、バイノーラル・ステレオ信号６０５をレンダリングするためにバイノーラル・デコーダ６０２に提供される空間パラメータ６０４に依存する。前述のように、特にＨＲＴＦフィルタは、多くの場合に人間の耳および音源のためのバイノーラル・ステレオ出力信号間の非常に複雑な相互作用がモデル化されなければならないように、フィルタ・タップとしてかなりの数の非自明な、非ゼロのまたはゼロになることのないサブバンド・フィルタ・インパルス応答値を含む。それぞれのＨＲＴＦフィルタは、例えばモデル化される環境および他の影響の室内特性を効率的にモデル化するために大幅に長くなる得る。 As already explained before, binaural decoder 602 combines 10 (5 audio input channels for 2 audio output channels (stereo)) into 4 HRTF filters, which are connected to stereo input signal 603. Can be applied immediately. However, the HRTF filter relies on spatial parameters 604 that are provided to the binaural decoder 602 to render the binaural stereo signal 605. As mentioned above, HRTF filters, in particular, often serve as filter taps so that very complex interactions between binaural stereo output signals for the human ear and sound source must be modeled in many cases. Number of non-obvious, non-zero or non-zero subband filter impulse response values. Each HRTF filter can be significantly longer, for example, to efficiently model the environment being modeled and the room characteristics of other influences.

特にこれに関連して、フィルタ・コンプレッサ５０１の実施形態は、著しくバイノーラル・デコーダ６０２に関して計算の複雑性を低減するために効率的に適用することができる。バイノーラル・デコーダ６０２の枠組みにおいて考慮される関連したサブバンド・フィルタ・インパルス応答値の数を低減することによって、バイノーラル・デコーダ６０２は、より小さい計算する能力で実施することができ、それは、例えば対応するバイノーラル・デコーダのクロック速度が所定の期間において低い数の計算のため低減され得るように、低いエネルギー消費量を最終的に導く。あるいは、バイノーラル・デコーダ６０２は、原則として第２の処理コアが回避できるように、同じ理由からより小さく構築することができる。 In particular in this regard, embodiments of the filter compressor 501 can be efficiently applied to significantly reduce the computational complexity for the binaural decoder 602. By reducing the number of associated subband filter impulse response values considered in the binaural decoder 602 framework, the binaural decoder 602 can be implemented with less computing power, for example correspondingly The low energy consumption is ultimately derived so that the clock speed of the binaural decoder to be reduced can be reduced due to the low number of calculations in a given period. Alternatively, the binaural decoder 602 can be made smaller for the same reason, in principle so that the second processing core can be avoided.

図７〜図１３との関連でさらに詳細に概説されるように、１０個の時間ドメインＨＲＴＦフィルタを複素ＱＭＦドメインまたは複素サブバンド・ドメインに変換するために用いられる１９２個（＝３・６４）のフィルタ・タップを有するフィルタ・コンバータを使用すると、８９６個（＝１４・６４）のフィルタ・タップを有する時間ドメインにおけるＨＲＴＦフィルタは、例えば図１に示されるように、フィルタ・コンバータ６０１またはむしろ１０個のフィルタ・コンバータ１０１によって、それそれが１６個（＝１４＋３−１）のフィルタ・タップを含む６４個の個々のサブバンド・フィルタ・インパルス応答に転送される。１０個の時間ドメインＨＲＴＦフィルタのそれぞれに対して結果として生じる１０２４個のフィルタ・タップは、フィルタ・コンプレッサ５０１の実施形態がフィルタ・タップの全体の数を例えば４〜２５６（＝１０２４／４）倍だけ低減するために使用されない限り、バイノーラル・デコーダ６０２に対するかなりの計算負担をもたらす。この実施形態が複素ＱＭＦまたはサブバンド・ドメインにおいて１０個のＨＲＴＦフィルタのそれぞれのためのＬ＝６４個のサブバンドを含むシステムに基づくにもかかわらず、原則としていかなる数のＬ個のサブバンドも使用され得る。 As outlined in more detail in connection with FIGS. 7-13, 192 (= 3 · 64) used to convert 10 time domain HRTF filters to complex QMF domain or complex subband domain HRTF filter in the time domain with 896 (= 14 · 64) filter taps, for example, filter converter 601 or rather 10 Each of the filter converters 101 forwards it to 64 individual subband filter impulse responses including 16 (= 14 + 3-1) filter taps. The resulting 1024 filter taps for each of the 10 time domain HRTF filters is such that the filter compressor 501 embodiment multiplies the total number of filter taps by, for example, 4 to 256 (= 1024/4). Unless used to reduce only a significant computational burden on the binaural decoder 602. Although this embodiment is based on a system that includes L = 64 subbands for each of 10 HRTF filters in the complex QMF or subband domain, in principle any number of L subbands Can be used.

フィルタ・コンプレッサのさらなる実施形態および圧縮サブバンド・フィルタ・インパルス応答フィルタを作るための方法が述べられる前に、フィルタ・コンバータのための可能な解決策および複素サブバンド・ドメイン（ＱＭＦドメイン）において作動するフィルタがさらに詳細に説明される。しかしながら、特にフィルタ・コンバータについて、技術背景をさらに詳細に述べる前に、デジタル・フィルタを（時間ドメインにおいてまたはサブバンド・ドメインにおいて）デジタル音声入力に適用する一般概念が述べられるべきである。 Possible solutions for filter converters and operation in the complex subband domain (QMF domain) before further embodiments of filter compressors and methods for making compressed subband filter impulse response filters are described The filter is described in more detail. However, before discussing the technical background in more detail, particularly for filter converters, the general concept of applying a digital filter (in the time domain or in the subband domain) to digital audio input should be described.

図７は、フィルタまたはフィルタ・エレメント７００のための可能な解決策を示し、それには、デジタル音声入力が提供される。原則としてデジタル音声入力は時間ドメイン信号および（複素）サブバンド・ドメインにおける信号の両方であり得る点に留意すべきである。フィルタ・エレメントは、出力でデジタル音声出力を提供し、それは、フィルタ定義信号またはそれぞれのフィルタ・インパルス応答信号に応じて、フィルタにかけられたデジタル音声入力を表す。 FIG. 7 shows a possible solution for a filter or filter element 700, which is provided with a digital audio input. It should be noted that in principle the digital audio input can be both a time domain signal and a signal in the (complex) subband domain. The filter element provides a digital audio output at the output, which represents the filtered digital audio input in response to a filter definition signal or a respective filter impulse response signal.

フィルタ・コンバータ１０１は、図８に示されるように、対応するフィルタ・インパルス応答信号が提供される中心コンポーネントとして複素分析フィルタバンク７１０を含む。複素分析フィルタバンク７１０は、時間ドメインにおいてフィルタのインパルス応答信号を分析し、それは、１セットのＬ個の分析フィルタを有するフィルタリングによって、ＱＭＦドメインに転送され、その後に係数Ｌの任意のダウンサンプリングが続き、Ｌは、また正の整数であり、好ましくは１より大きくかつ複素分析フィルタバンク７１０のサブバンドの数を示す。分析フィルタは、プロトタイプ・フィルタｑ（ｎ）の複素変調によって通常得られ、またｎは、データの配列におけるインデックスまたは信号における値のインデックスを示す正の整数である。フィルタバンク７１０の出力は、Ｌ個のサブバンド信号からなり、全体として、複素ＱＭＦドメインの時間ドメインにおいてそのフィルタ・インパルス応答によって特徴づけられるフィルタを表す。具体的には、複素分析フィルタバンク７１０の出力は、１セットのサブバンド・フィルタ・インパルス応答であり、それは、複素ＱＭＦドメインにおいて音声入力信号のフィルタリングを実行するためにフィルタ・エレメント７００に提供することができ、それは、時間ドメインにおいて直接のフィルタリングと比較して、音声出力信号の知覚的に見分けのつかない違いを導く。 The filter converter 101 includes a complex analysis filter bank 710 as a central component where a corresponding filter impulse response signal is provided, as shown in FIG. The complex analysis filter bank 710 analyzes the impulse response signal of the filter in the time domain, which is forwarded to the QMF domain by filtering with a set of L analysis filters, after which any downsampling of the coefficient L is performed. Subsequently, L is also a positive integer, preferably greater than 1 and indicates the number of subbands in the complex analysis filter bank 710. The analysis filter is usually obtained by complex modulation of the prototype filter q (n), where n is a positive integer indicating the index in the array of data or the index of the value in the signal. The output of the filter bank 710 is composed of L subband signals and generally represents a filter characterized by its filter impulse response in the time domain of the complex QMF domain. Specifically, the output of the complex analysis filter bank 710 is a set of subband filter impulse responses that are provided to the filter element 700 to perform filtering of the speech input signal in the complex QMF domain. It can lead to perceptually indistinguishable differences in the audio output signal compared to direct filtering in the time domain.

プロトタイプ・フィルタｑ（ｎ）および複素変調された分析フィルタバンクの基本設計は、それらの両方に関するさらなる詳細が概説され、その後にさらに詳細に説明される。さらに、以下において、サブバンドの数は、Ｌ＝６４個で固定されると仮定される。しかしながら、前に説明されるように、これは、本発明の実施形態の制限でなく、単に適切な例として働くだけである。 The basic design of the prototype filter q (n) and the complex-modulated analysis filter bank is outlined in more detail, followed by further details. Further, in the following, it is assumed that the number of subbands is fixed at L = 64. However, as explained previously, this is not a limitation of the embodiments of the present invention, but merely serves as a suitable example.

図９は、複素分析バンク７１０の可能な解決策をさらに詳細に示す。複素分析バンク７１０は、複素分析バンク７１０によって出力されるそれぞれのサブバンドのための複数のＬ個の中間分析フィルタ７２０を含む。具体的には、Ｌ個の中間分析フィルタ７２０のそれぞれは、処理される入力信号として時間ドメイン・インパルス応答信号が提供されるノード７３０に並列に接続される。中間分析フィルタ７２０のそれぞれは、それぞれのサブバンドの中心周波数に関して複素分析バンク７１０の入力信号をフィルタリングするために構成される。異なるサブバンドの中心周波数によれば、それぞれのサブバンドは、サブバンド・インデックスまたはインデックスｋによって表示され、またｋは、典型的に０から（Ｌ−１）の範囲において、非負の整数である。複素分析バンク７１０の中間分析フィルタ７２０は、中間分析フィルタ７２０が適用されるサブバンドのサブバンド・インデックスｋによる複素変調によってプロトタイプ・フィルタｐ（ｎ）から導き出すことができる。プロトタイプ・フィルタの複素変調に関するさらなる詳細が以下に説明される。 FIG. 9 shows in more detail a possible solution of the complex analysis bank 710. Complex analysis bank 710 includes a plurality of L intermediate analysis filters 720 for each subband output by complex analysis bank 710. Specifically, each of the L intermediate analysis filters 720 is connected in parallel to a node 730 where a time domain impulse response signal is provided as an input signal to be processed. Each of the intermediate analysis filters 720 is configured to filter the input signal of the complex analysis bank 710 with respect to the center frequency of the respective subband. According to the center frequencies of the different subbands, each subband is denoted by a subband index or index k, and k is a non-negative integer, typically in the range of 0 to (L-1). . The intermediate analysis filter 720 of the complex analysis bank 710 can be derived from the prototype filter p (n) by complex modulation with the subband index k of the subband to which the intermediate analysis filter 720 is applied. Further details regarding the complex modulation of the prototype filter are described below.

直接的に中間分析フィルタ７２０によってまたは（図９において点線によって示される）任意のダウンサンプラ７４０によって、中間分析フィルタ７２０によって出力される信号のサンプリング周波数は、係数Ｌ倍だけ低減される。前記のように、対応する中間分析フィルタ７２０によって出力されるそれぞれのサブバンド信号に供給されるダウンサンプラ７４０は、具体的な実施に応じて、ダウンサンプリングが中間分析フィルタ７２０の枠組みにおいて実施することもできるように、任意である。原則として、中間分析フィルタ７２０によって出力される信号のダウンサンプリングは必要でない。それにもかかわらず、明示的なまたは暗黙的なダウンサンプラ７４０の存在は、複素分析バンク７１０によって提供されるデータ量がデータの著しい冗長性を導くＬ倍に選択的に増加されるので、いくつかのアプリケーションにおいて有利なオプションであってもよい。 The sampling frequency of the signal output by the intermediate analysis filter 720, either directly by the intermediate analysis filter 720 or by any downsampler 740 (shown by the dotted line in FIG. 9) is reduced by a factor L. As described above, the downsampler 740 supplied to each subband signal output by the corresponding intermediate analysis filter 720 may perform downsampling in the framework of the intermediate analysis filter 720, depending on the specific implementation. It is optional so that you can also. In principle, downsampling of the signal output by the intermediate analysis filter 720 is not necessary. Nevertheless, the presence of an explicit or implicit downsampler 740 can increase the amount of data provided by the complex analysis bank 710 selectively by L times leading to significant data redundancy. This may be an advantageous option in other applications.

図１０は、サブバンド・フィルタリング７５０およびフィルタ・コンバータ１０１とのその相互作用の可能な解決策をさらに詳細に示す。サブバンド・フィルタリング７５０は、複数の中間フィルタ７６０を含み、１つの中間フィルタ７６０には、サブバンド・フィルタリング７５０に提供される複素数値のそれぞれのサブバンド信号が提供される。そのため、サブバンド・フィルタリング７５０は、Ｌ個の中間フィルタ７６０を含む。 FIG. 10 shows in more detail a possible solution of the subband filtering 750 and its interaction with the filter converter 101. Subband filtering 750 includes a plurality of intermediate filters 760, where one intermediate filter 760 is provided with a complex-valued respective subband signal provided to subband filtering 750. As such, subband filtering 750 includes L intermediate filters 760.

フィルタ・コンバータ１０１は、それぞれの中間フィルタ７６０に接続される。結果として、フィルタ・コンバータ１０１は、サブバンド・フィルタリング７５０のそれぞれの中間フィルタ７６０のためのフィルタ・タップを提供することができる。中間フィルタ７６０によって行われるフィルタリングに関するさらなる詳細が、アプリケーションのさらなる過程において説明される。そのため、フィルタ・コンバータ１０１によって出力されかつ異なる中間フィルタ７６０に提供されるフィルタ・タップは、中間フィルタ定義信号を形成する。 The filter converter 101 is connected to each intermediate filter 760. As a result, filter converter 101 can provide a filter tap for each intermediate filter 760 of subband filtering 750. Further details regarding the filtering performed by the intermediate filter 760 are described in the further course of the application. Thus, the filter taps output by filter converter 101 and provided to different intermediate filters 760 form an intermediate filter definition signal.

さらに、実施形態、解決策および実施が、図面において省略された、信号のいずれかまたは信号のサブセットを遅延するためのさらなるおよび／または任意の遅延を含むことができる点に留意すべきである。それにもかかわらず、遅延またはディレイヤは、それらの具体的な実施に応じて、全ての実施形態、解決策および実施において任意のエレメントとして示されまたは加えられるエレメント（例えばフィルタ）に含ませることができる。 Furthermore, it should be noted that embodiments, solutions and implementations can include additional and / or optional delays for delaying any of the signals or subsets of signals, which are omitted in the drawings. Nevertheless, delays or delayers can be included in elements (eg, filters) shown or added as optional elements in all embodiments, solutions and implementations, depending on their specific implementation. .

図１１は、複素合成バンク７７０のための可能な解決策を図示する。複素合成バンク７７０は、Ｌ個のサブバンド信号が提供されるＬ個の中間合成フィルタ７８０を含む。中間合成フィルタ７８０の枠組みにおいてフィルタリングより前の複素合成バンク７７０の具体的な実施に応じて、サブバンド信号は、Ｌ個のアップサンプラ７９０によってアップサンプリングされ、それは、サンプリング周波数をＬ倍に増加することによってサブバンド信号のサンプルされた周波数を再構成する。換言すれば、任意のアップサンプラ７９０は、それぞれのサブバンド信号に含まれる情報が保持される一方でサンプリング周波数がＬ倍に増加されるというようなやり方で、アップサンプラ７９０に提供されるサブバンド信号を再構成またはリフォームする。 FIG. 11 illustrates a possible solution for the complex synthesis bank 770. Complex synthesis bank 770 includes L intermediate synthesis filters 780 from which L subband signals are provided. Depending on the specific implementation of the complex synthesis bank 770 prior to filtering in the framework of the intermediate synthesis filter 780, the subband signal is upsampled by L upsamplers 790, which increases the sampling frequency by a factor of L. Thereby reconstructing the sampled frequency of the subband signal. In other words, any upsampler 790 is provided with a subband provided to the upsampler 790 in such a way that the information contained in the respective subband signal is retained while the sampling frequency is increased L times. Reconfigure or reform the signal.

それにもかかわらず、図９との関連ですでに説明されるように、アップサンプラ７９０は、アップサンプリングが中間合成フィルタ７８０の枠組みにおいて実施することができるように、任意のコンポーネントである。そのため、アップサンプラ７９０によって実施されるサブバンド信号をアップサンプリングするステップは、中間合成フィルタ７８０の枠組みにおいて同時に処理することができる。しかしながら、ダウンサンプラ７４０が明示的にも暗黙的にも実施されない場合、アップサンプラ７９０は、明示的にまたは暗黙的に実施される必要もない。 Nevertheless, as already described in connection with FIG. 9, upsampler 790 is an optional component so that upsampling can be implemented in the framework of intermediate synthesis filter 780. Therefore, the step of upsampling the subband signal performed by upsampler 790 can be processed simultaneously in the framework of intermediate synthesis filter 780. However, if the downsampler 740 is not explicitly or implicitly implemented, the upsampler 790 need not be explicitly or implicitly implemented.

中間合成フィルタ７８０は、Ｌ個の中間合成フィルタ７８０によって出力されるフィルタにかけられたサブバンド信号を合計する加算器８００に出力を介して接続される。加算器８００は、実数部エクストラクタ８１０にさらに接続され、それは、加算器８００によって提供される複素数値の信号に基づいて実数値の信号またはむしろ（実数値の）時間ドメイン出力信号を抽出しまたは形成する。実数部エクストラクタ８１０は、このタスクを、例えば、加算器８１０によって提供される複素数値の信号の実数部を抽出することによって、加算器８１０によって提供される複素数値の信号の絶対値を計算することによって、または、複素数値の入力信号に基づいて実数値の出力信号を形成する他の方法によって、実行することができる。 Intermediate synthesis filter 780 is connected via an output to adder 800 that sums the filtered subband signals output by L intermediate synthesis filters 780. The adder 800 is further connected to a real part extractor 810 that extracts a real-valued signal or rather a (real-valued) time domain output signal based on the complex-valued signal provided by the adder 800 or Form. The real part extractor 810 calculates the absolute value of the complex value signal provided by the adder 810, for example, by extracting the real part of the complex value signal provided by the adder 810. Or by other methods of forming a real-valued output signal based on a complex-valued input signal.

図１２に示される複素合成バンク７７０のための第２の可能な解決策は、実数部エクストラクタ８１０および加算器８００に関してだけ図１１に示される第１の可能な解決策と異なる。具体的には、中間合成フィルタ７８０の出力は、それぞれのサブバンドから、中間合成フィルタ７８０によって出力される複素数値の信号に基づいて実数値の信号を抽出しまたは形成する実数部エクストラクタ８１０に別々に接続される。そして、実数部エクストラクタ８１０は、加算器８００に接続され、それは、加算器８００によって提供される実数値の出力信号を形成するためにＬ個のフィルタにかけられたサブバンド信号から導き出されるＬ個の実数値の信号を合計する。 The second possible solution for the complex synthesis bank 770 shown in FIG. 12 differs from the first possible solution shown in FIG. 11 only with respect to the real part extractor 810 and the adder 800. Specifically, the output of the intermediate synthesis filter 780 is sent from each subband to a real part extractor 810 that extracts or forms a real value signal based on the complex value signal output by the intermediate synthesis filter 780. Connected separately. The real part extractor 810 is then connected to the adder 800, which is derived from the L filtered subband signals to form the real-valued output signal provided by the adder 800. Sum all real-valued signals.

前に説明されるように、図３は、フィルタ・コンバータ１０１の可能な選択を図示する。フィルタは、そのインパルス応答によって与えられると仮定される。このインパルス応答を離散時間信号として見ると、それは、Ｌ個のバンド複素分析（フィルタ）バンク７１０によって分析される。そして、結果として生じるサブバンド信号出力は、正確に図１０に示されるサブバンド・フィルタリング７５０のそれぞれのサブバンドにおいて別々に適用されるフィルタのインパルス応答である。図８に示される場合において、フィルタ・コンバータ１０１およびその複素分析バンクまたは複素分析フィルタバンク７１０に提供されるフィルタ定義信号は、フィルタの振幅／周波数特性を示すインパルス応答信号であり、それは、サブバンド・ドメインに転送される。そのため、Ｌ個のサブバンドのそれぞれの複素分析（フィルタ）バンク７１０の出力は、サブバンド・フィルタリング７５０に含まれる中間フィルタのインパルス応答を表す。 As previously described, FIG. 3 illustrates a possible choice of filter converter 101. The filter is assumed to be given by its impulse response. Looking at this impulse response as a discrete time signal, it is analyzed by L band complex analysis (filter) banks 710. The resulting subband signal output is then exactly the impulse response of the filter applied separately in each subband of the subband filtering 750 shown in FIG. In the case shown in FIG. 8, the filter definition signal provided to the filter converter 101 and its complex analysis bank or complex analysis filter bank 710 is an impulse response signal indicating the amplitude / frequency characteristics of the filter, which is a subband. • Transferred to the domain. Thus, the output of the complex analysis (filter) bank 710 for each of the L subbands represents the impulse response of the intermediate filter included in the subband filtering 750.

複素分析バンク７１０は、音声出力信号のための分析バンクから原則として導き出されるが、異なるプロトタイプ・フィルタおよびわずかに異なる変調構成を有し、それの詳細が以下の記載に概説される。プロトタイプ・フィルタｑ（ν）の長さは、比較的小さいように設計することができる。また、係数Ｌによるダウンサンプリングのため、サブバンド・フィルタの長さは、所定の時間ドメイン・フィルタおよびプロトタイプ・フィルタｑ（ν）の長さの合計より小さいＬの係数である。 The complex analysis bank 710 is derived in principle from the analysis bank for the audio output signal, but has different prototype filters and slightly different modulation configurations, the details of which are outlined in the following description. The length of the prototype filter q (ν) can be designed to be relatively small. Further, because of the downsampling by the coefficient L, the length of the subband filter is a coefficient of L smaller than the sum of the lengths of the predetermined time domain filter and the prototype filter q (ν).

本願において、ゼロになることのないタップまたは値は、理想的にゼロに等しくないタップまたは値である。それにもかかわらず、このアプリケーションの枠組みにおいて実施制限のため、ゼロになることのない値またはタップは、所定の閾値、例えば１０^-bまたは２^-bより大きい絶対値を有する実数値または複素数値のタップまたは値であり、ｂは、具体的な実施の要求による正の整数である。デジタル・システムにおいて、この閾値は、バイナリー・システム（基数２）において好ましくは定義され、整数ｂは、実施の詳細に応じて所定の値を有する。典型的に、値ｂは、４、５、６、７、８、１０、１２、１４、１６または３２である。 In this application, a tap or value that is never zero is a tap or value that is ideally not equal to zero. Nevertheless, due to implementation limitations in the framework of this application, a value or tap that never becomes zero is a real or complex value with an absolute value greater than a predetermined threshold, eg, 10 ^−b or 2 ^−b . Is a tap or value, and b is a positive integer according to the specific implementation requirement. In digital systems, this threshold is preferably defined in binary systems (base 2), and the integer b has a predetermined value depending on the implementation details. Typically, the value b is 4, 5, 6, 7, 8, 10, 12, 14, 16 or 32.

複素変調されたフィルタバンク Complex modulated filter bank

方程式（１３）および（１４）において、θおよびψは、実数値の離散時間信号ｘ（ｎ）を複素数値のサブバンド信号にフィルタリングするための、および、複素数値のサブバンド信号ｄｋ（ｍ）から実数値の出力サンプルｙ（ｎ）を再構成するための（一定の）位相係数を表す。プロトタイプ・フィルタおよび一定の位相係数θおよびψは、サブバンド信号が不変であるｄｋ（ｍ）＝ｃｋ（ｍ）の場合において、完全な再構成ｙ（ｎ）＝ｘ（ｎ）を与えるために選択することができることはよく知られている。実際には、完全な再構成特性は、遅延（および／または符号変換）に至るまで当てはまるが、続く計算において、この詳細は、ＰＣＴ／ＳＥ０２／００６２６「複素指数変調されたフィルタバンクを用いるエイリアシングの低減」におけるような設計の疑似ＱＭＦタイプの場合において説明されるように、非因果的プロトタイプ・フィルタの使用を可能にすることによって無視される。ここで、プロトタイプ・フィルタは、対称ｐ（−ｎ）＝ｐ（ｎ）であり、さらにその離散時間フーリエ変換Ｐ（ω）は、インターバル｜ω｜＝ｐ／Ｌの外側で本質的にゼロになる。完全な再構成は、ほぼ完全な再構成特性とも置換される。後に続く導出に対して、説明を簡単にするために完全な再構成が保持しさらにｐ／Ｌ＜｜ω｜＝ｐに対してＰ（ω）＝０であると仮定される。さらに、位相係数は、ψ−θが４Ｌの整数の倍数に等しい状態を満たすように仮定される。 In equations (13) and (14), θ and ψ are used to filter the real-valued discrete-time signal x (n) into complex-valued subband signals and complex-valued subband signals dk (m). Represents a (constant) phase coefficient for reconstructing a real-valued output sample y (n) from. Prototype filter and constant phase coefficients θ and ψ to give a complete reconstruction y (n) = x (n) in the case of dk (m) = ck (m) where the subband signal is unchanged It is well known that you can choose. In practice, the full reconstruction characteristics apply up to the delay (and / or code conversion), but in the calculations that follow, this detail will be discussed in the PCT / SE02 / 00626 “Aliasing with Complex Exponential Modulated Filter Bank” section. As explained in the case of a pseudo-QMF type of design as in “reduction”, it is ignored by allowing the use of a non-causal prototype filter. Here, the prototype filter is symmetric p (−n) = p (n), and its discrete-time Fourier transform P (ω) is essentially zero outside the interval | ω | = p / L. Become. Complete reconstruction is also replaced with nearly complete reconstruction characteristics. For subsequent derivations, it is assumed that a complete reconstruction is maintained for simplicity of the description and that P (ω) = 0 for p / L <| ω | = p. Further, the phase coefficient is assumed to satisfy a state where ψ−θ is equal to an integer multiple of 4L.

きわめてサンプルされたフィルタバンクにおいて、合成より前のサブバンド信号の変更は、通常エイリアシング・アーチファクトの導入を導く。これは、係数２によるオーバーサンプリングが複素数値の信号を用いることによって導入されるという事実のため、ここで克服される。サブバンド・サンプルの総サンプリング・レートが離散時間入力信号のサンプリング・レートと同一であるにもかかわらず、入力信号は、実数値であり、さらにサブバンド・サンプルは、複素数値である。下に概説されるように、エイリアスのないことは、効率的な時間不変信号処理を可能にする。 In highly sampled filter banks, changes to subband signals prior to synthesis usually lead to the introduction of aliasing artifacts. This is overcome here by the fact that oversampling by a factor of 2 is introduced by using a complex-valued signal. Even though the total sampling rate of the subband samples is the same as the sampling rate of the discrete time input signal, the input signal is real-valued, and the subband samples are complex-valued. As outlined below, the absence of aliasing allows for efficient time-invariant signal processing.

複素変調されたフィルタバンクにおけるサブバンド・フィルタリング Subband filtering in complex modulated filter banks.

興味のある他の場合は、ｙ（ｎ）＝ｘ（ｎ−Ｌ）のように、Ｇ（ω）＝ｅｘｐ（ｉＬω）を導くＧ_k（ω）＝ｅｘｐ（−ｉω）である。 Another case of interest is G _k (ω) = exp (−iω) which leads to G (ω) = exp (iLω), such as y (n) = x (n−L).

サブバンド・フィルタリングによって所定のフィルタ応答の近似 Approximate a given filter response with subband filtering

この手順の利点は、いかなる所定のフィルタｈ（ｎ）も中間サブバンド・フィルタ応答に効率的に変換することができるということである。ｑ（ｎ）がＫ_Q・Ｌ個のタップを有する場合、Ｋ_H・Ｌ個のタップの時間ドメイン・フィルタｈ（ｎ）は、（Ｋ_H＋Ｋ_Q−１）個のタップを有するサブバンド・ドメイン・フィルタ（２４）に変換され、Ｋ_HおよびＫ_Qは、正の整数である。Ｋ_Qが３（Ｌ・Ｋ_Q＝１９２）に等しくさらに時間ドメイン・フィルタのインパルス応答がＫ_H・６４（Ｌ＝６４）の長さに対応する場合、それぞれの中間サブバンド・フィルタ７６０は、Ｋ_H＋３−１＝Ｋ_H＋２だけのタップのインパルス応答長を有する。 The advantage of this procedure is that any given filter h (n) can be efficiently converted to an intermediate subband filter response. If q (n) has K _Q · L taps, then the time domain filter h (n) of K _H · L taps has subbands with (K _H + K _Q −1) taps. Converted to a domain filter (24), K _H and K _Q are positive integers. If K _Q is equal to 3 (L · K _Q = 192) and the impulse response of the time domain filter corresponds to a length of K _H · 64 (L = 64), then each intermediate subband filter 760 is It has an impulse response length of taps by K _H + 3-1 = K _H +2.

フィルタ・コンバータのためのプロトタイプ・フィルタの設計 Prototype filter design for filter converter

以下において、ＨＲＴＦフィルタのマルチ・スロットＱＭＦ表現（サブバンド・ドメイン）の決定が記載される。時間ドメインから複素ＱＭＦサブバンド・ドメインへのフィルタ変換は、フィルタ・コンバータ１０１においてＦＩＲフィルタによって実行される。具体的には、以下の記載が、複素ＱＭＦサブバンド・ドメインにおける長さＮ_Hの所定のＦＩＲフィルタｈ（ｎ）を実施するための方法を概説する。 In the following, the determination of the multi-slot QMF representation (subband domain) of the HRTF filter is described. The filter transformation from the time domain to the complex QMF subband domain is performed in the filter converter 101 by the FIR filter. Specifically, the following description outlines a method for implementing a predetermined FIR filter h (n) of length _NH in the complex QMF subband domain.

それ自体をフィルタにかけるサブバンドは、サブバンド・フィルタリング７５０の内部で中間フィルタ７６０によって実施される。具体的には、サブバンド・フィルタリングは、インデックスｋ＝０、１、・・・、６３を有するそれぞれのＱＭＦサブバンドに対して１つの複素数値のＦＩＲ中間フィルタｇ_k（ｌ）の別のアプリケーションからなる。換言すれば、以下の記載において、特別な参照は、Ｌ＝６４の異なるサブバンド信号の場合になされる。それにもかかわらず、サブバンド信号のこの特別な数は、本質的でなく、さらに、適切な方程式は、さらに一般の形式でも与えられる。 The subband that filters itself is implemented by the intermediate filter 760 within the subband filtering 750. Specifically, subband filtering is another application of one complex-valued FIR intermediate filter g _k (l) for each QMF subband with index k = 0, 1,. Consists of. In other words, in the following description, a special reference is made in the case of L = 64 different subband signals. Nevertheless, this particular number of subband signals is not essential, and an appropriate equation is also given in a more general form.

所定の時間ドメインＦＩＲフィルタｈ（ｎ）を複素サブバンド・ドメイン・フィルタｇ_k（ｌ）に変換するフィルタ・コンバータ１０１は、複素分析バンク７１０を含む。Ｌ＝６４のサブバンド信号の特別の場合のための長さ１９２（＝３・６４）のフィルタ・コンバータ１０１ｑ（ｎ）の複素分析フィルタバンク７１０のプロトタイプ・フィルタは、方程式（２８）の過剰に決定されたシステムを最小二乗法において解決することによって作り出される。フィルタ係数ｑ（ｎ）は、Ｌ＝６４のサブバンドの場合について後にさらに詳細に記載される。 A filter converter 101 that converts a predetermined time domain FIR filter h (n) to a complex subband domain filter g _k (l) includes a complex analysis bank 710. The prototype filter of the complex analysis filter bank 710 of the filter converter 101q (n) of length 192 (= 3 · 64) for the special case of L = 64 subband signals is in excess of equation (28). Produced by solving the determined system in the least squares method. The filter coefficient q (n) will be described in more detail later for the case of L = 64 subbands.

方程式に基づく方程式の下の本願の枠組みにおいて、さらなる遅延（ｌ₀およびｎ₀を参照）係数、さらなる係数の導入、および、ウィンドウ関数または他の単関数の導入の導入が理解される点に留意すべきである。さらに、単純な定数、一定の加数などは、ドロップすることができる。さらに、全くまたは著しいやり方で方程式の結果を変えない代数変換、等価変換および近似（例えばテイラー近似）も、含まれる。換言すれば、結果として生じる一致に関して本質的に導くわずかな修正および変換の両方は、方程式または式が方程式または式が基づく場合において含まれる。 Note that in the framework of the present application under equations based equations, it is understood that additional delay (see l ₀ and n ₀ ) coefficients, the introduction of further coefficients, and the introduction of window functions or other simple functions. Should. In addition, simple constants, constant addends, etc. can be dropped. Also included are algebraic transformations, equivalent transformations and approximations (eg, Taylor approximation) that do not alter the result of the equation in any or significant manner. In other words, both slight modifications and transformations that essentially lead to the resulting match are included when the equation or expression is based on the equation or expression.

この場合において、整数ｋ＝０、１、・・・、６３は、またサブバンドのインデックスであり、さらに、ｌ＝０、１、・・・、（Ｋ_H＋１）は、結果として生じる中間フィルタ７６０のタップを示す整数である。 In this case, the integers k = 0, 1,..., 63 are also subband indices, and l = 0, 1,..., (K _H +1) is the resulting intermediate filter. An integer indicating 760 taps.

方程式（２４）と比較して方程式（３２）において（−２）の特別な加数があり、なぜなら、方程式（２４）は、フィルタの損失を考慮せずに開発されたからである。実際の実施は、遅延を常に生じる。そのため、具体的な実施に応じて、さらなるディレイヤまたは遅延は、実施することができ、それは、図において単純化のために省略されている。 There is a special addend of (−2) in equation (32) compared to equation (24) because equation (24) was developed without considering the loss of the filter. Actual implementation always introduces a delay. Thus, depending on the specific implementation, further delayering or delay can be implemented, which is omitted in the figure for simplicity.

多くの場合において、線形方程式（２８）のシステムは、過度に決定される。それにもかかわらず、それは、プロトタイプ・フィルタ係数ｑ（ｎ）に関して最小二乗法において解決しまたは近似することができる。最小二乗法において線形方程式（２８）のシステムを解決することは、０から１９１までの整数ｎのための以下の関係を満たすプロトタイプ・フィルタｑ（ｎ）の以下のフィルタ・タップを導く。
ｑ［０］＝−０．２０２９３４３３８０
ｑ［１］＝−０．１９８０３３１５８８
ｑ［２］＝−０．１９２９４１１５１９
ｑ［３］＝−０．１８７６７４４２２２
ｑ［４］＝−０．１８２２４７４０１１
ｑ［５］＝−０．１７６６７３０２０２
ｑ［６］＝−０．１７０９６２８６３６
ｑ［７］＝−０．１６５１２７３００５
ｑ［８］＝−０．１５９１７５６０２４
ｑ［９］＝−０．１５３１１６０４５５
ｑ［１０］＝−０．１４６９５６０００５
ｑ［１１］＝−０．１４０７０２０１３２
ｑ［１２］＝−０．１３４３５９８７３８
ｑ［１３］＝−０．１２７９３４６７９０
ｑ［１４］＝−０．１２１４３０８８７６
ｑ［１５］＝−０．１１４８５２３６８６
ｑ［１６］＝−０．１０８２０２４４５４
ｑ［１７］＝−０．１０１４８３９３４１
ｑ［１８］＝−０．０９４６９９１７８３
ｑ［１９］＝−０．０８７８５００７９９
ｑ［２０］＝−０．０８０９３８１２６８
ｑ［２１］＝−０．０７３９６４４１７４
ｑ［２２］＝−０．０６６９２９６８３１
ｑ［２３］＝−０．０５９８３４３０８１
ｑ［２４］＝−０．０５２６７８３４６６
ｑ［２５］＝−０．０４５４６１５３８８
ｑ［２６］＝−０．０３８１８３３２４９
ｑ［２７］＝−０．０３０８４２８５７２
ｑ［２８］＝−０．０２３４３９０１１５
ｑ［２９］＝−０．０１５９７０３９５７
ｑ［３０］＝−０．００８４３５３５８４
ｑ［３１］＝−０．０００８３１９９５６
ｑ［３２］＝０．００６８４１８４３５
ｑ［３３］＝０．０１４５８８５５２７
ｑ［３４］＝０．０２２４１０７６４８
ｑ［３５］＝０．０３０３１１３４９５
ｑ［３６］＝０．０３８２９３４１２６
ｑ［３７］＝０．０４６３６０２９５９
ｑ［３８］＝０．０５４５１５５７８９
ｑ［３９］＝０．０６２７６３０８１０
ｑ［４０］＝０．０７１１０６８６５７
ｑ［４１］＝０．０７９５５１２４５３
ｑ［４２］＝０．０８８１００７８７９
ｑ［４３］＝０．０９６７６０３２５９
ｑ［４４］＝０．１０５５３４９６５８
ｑ［４５］＝０．１１４４３０１０００
ｑ［４６］＝０．１２３４５１４２２２
ｑ［４７］＝０．１３２６０４９４３４
ｑ［４８］＝０．１４１８９７０１２３
ｑ［４９］＝０．１５１３３４３３７０
ｑ［５０］＝０．１６０９２４０１２６
ｑ［５１］＝０．１７０６７３５５１７
ｑ［５２］＝０．１８０５９０９１９４
ｑ［５３］＝０．１９０６８４５７５３
ｑ［５４］＝０．２００９６３５１９１
ｑ［５５］＝０．２１１４３７３４５８
ｑ［５６］＝０．２２２１１６３０８０
ｑ［５７］＝０．２３３０１１３８６８
ｑ［５８］＝０．２４４１３４３７４２
ｑ［５９］＝０．２５５４９７９６６４
ｑ［６０］＝０．２６７１１５８７００
ｑ［６１］＝０．２７９００２９２３６
ｑ［６２］＝０．２９１１７５２３４９
ｑ［６３］＝０．３０３６５０３３５０
ｑ［６４］＝０．９０２５２７５７１３
ｑ［６５］＝０．９１０３５８５１９６
ｑ［６６］＝０．９１７６９７７８２５
ｑ［６７］＝０．９２４５７６０６８３
ｑ［６８］＝０．９３１０２１４５８１
ｑ［６９］＝０．９３７０５９６７３９
ｑ［７０］＝０．９４２７１４３１４３
ｑ［７１］＝０．９４８００７０６０６
ｑ［７２］＝０．９５２９５７８５６６
ｑ［７３］＝０．９５７５８５０６７２
ｑ［７４］＝０．９６１９０５６１５８
ｑ［７５］＝０．９６５９３５１０６５
ｑ［７６］＝０．９６９６８７９２９７
ｑ［７７］＝０．９７３１７７３５４７
ｑ［７８］＝０．９７６４１５６１１９
ｑ［７９］＝０．９７９４１３９６４０
ｑ［８０］＝０．９８２１８２７６９２
ｑ［８１］＝０．９８４７３１５３７７
ｑ［８２］＝０．９８７０６８９７９０
ｑ［８３］＝０．９８９２０３０４６２
ｑ［８４］＝０．９９１１４０９７２８
ｑ［８５］＝０．９９２８８９３０６７
ｑ［８６］＝０．９９４４５３９３９５
ｑ［８７］＝０．９９５８４０１３１８
ｑ［８８］＝０．９９７０５２５３５２
ｑ［８９］＝０．９９８０９５２１１８
ｑ［９０］＝０．９９８９７１６５０４
ｑ［９１］＝０．９９９６８４７８０６
ｑ［９２］＝１．０００２３６９８３７
ｑ［９３］＝１．０００６３０１０２８
ｑ［９４］＝１．０００８６５４４８２
ｑ［９５］＝１．０００９４３８０６３
ｑ［９６］＝１．０００８６５４４８２
ｑ［９７］＝１．０００６３０１０２８
ｑ［９８］＝１．０００２３６９８３７
ｑ［９９］＝０．９９９６８４７８０６
ｑ［１００］＝０．９９８９７１６５０４
ｑ［１０１］＝０．９９８０９５２１１８
ｑ［１０２］＝０．９９７０５２５３５２
ｑ［１０３］＝０．９９５８４０１３１８
ｑ［１０４］＝０．９９４４５３９３９５
ｑ［１０５］＝０．９９２８８９３０６７
ｑ［１０６］＝０．９９１１４０９７２８
ｑ［１０７］＝０．９８９２０３０４６２
ｑ［１０８］＝０．９８７０６８９７９０
ｑ［１０９］＝０．９８４７３１５３７７
ｑ［１１０］＝０．９８２１８２７６９２
ｑ［１１１］＝０．９７９４１３９６４０
ｑ［１１２］＝０．９７６４１５６１１９
ｑ［１１３］＝０．９７３１７７３５４７
ｑ［１１４］＝０．９６９６８７９２９７
ｑ［１１５］＝０．９６５９３５１０６５
ｑ［１１６］＝０．９６１９０５６１５８
ｑ［１１７］＝０．９５７５８５０６７２
ｑ［１１８］＝０．９５２９５７８５６６
ｑ［１１９］＝０．９４８００７０６０６
ｑ［１２０］＝０．９４２７１４３１４３
ｑ［１２１］＝０．９３７０５９６７３９
ｑ［１２２］＝０．９３１０２１４５８１
ｑ［１２３］＝０．９２４５７６０６８３
ｑ［１２４］＝０．９１７６９７７８２５
ｑ［１２５］＝０．９１０３５８５１９６
ｑ［１２６］＝０．９０２５２７５７１３
ｑ［１２７］＝０．８９４１７１２９７４
ｑ［１２８］＝０．２９１１７５２３４９
ｑ［１２９］＝０．２７９００２９２３６
ｑ［１３０］＝０．２６７１１５８７００
ｑ［１３１］＝０．２５５４９７９６６４
ｑ［１３２］＝０．２４４１３４３７４２
ｑ［１３３］＝０．２３３０１１３８６８
ｑ［１３４］＝０．２２２１１６３０８０
ｑ［１３５］＝０．２１１４３７３４５８
ｑ［１３６］＝０．２００９６３５１９１
ｑ［１３７］＝０．１９０６８４５７５３
ｑ［１３８］＝０．１８０５９０９１９４
ｑ［１３９］＝０．１７０６７３５５１７
ｑ［１４０］＝０．１６０９２４０１２６
ｑ［１４１］＝０．１５１３３４３３７０
ｑ［１４２］＝０．１４１８９７０１２３
ｑ［１４３］＝０．１３２６０４９４３４
ｑ［１４４］＝０．１２３４５１４２２２
ｑ［１４５］＝０．１１４４３０１０００
ｑ［１４６］＝０．１０５５３４９６５８
ｑ［１４７］＝０．０９６７６０３２５９
ｑ［１４８］＝０．０８８１００７８７９
ｑ［１４９］＝０．０７９５５１２４５３
ｑ［１５０］＝０．０７１１０６８６５７
ｑ［１５１］＝０．０６２７６３０８１０
ｑ［１５２］＝０．０５４５１５５７８９
ｑ［１５３］＝０．０４６３６０２９５９
ｑ［１５４］＝０．０３８２９３４１２６
ｑ［１５５］＝０．０３０３１１３４９５
ｑ［１５６］＝０．０２２４１０７６４８
ｑ［１５７］＝０．０１４５８８５５２７
ｑ［１５８］＝０．００６８４１８４３５
ｑ［１５９］＝−０．０００８３１９９５６
ｑ［１６０］＝−０．００８４３５３５８４
ｑ［１６１］＝−０．０１５９７０３９５７
ｑ［１６２］＝−０．０２３４３９０１１５
ｑ［１６３］＝−０．０３０８４２８５７２
ｑ［１６４］＝−０．０３８１８３３２４９
ｑ［１６５］＝−０．０４５４６１５３８８
ｑ［１６６］＝−０．０５２６７８３４６６
ｑ［１６７］＝−０．０５９８３４３０８１
ｑ［１６８］＝−０．０６６９２９６８３１
ｑ［１６９］＝−０．０７３９６４４１７４
ｑ［１７０］＝−０．０８０９３８１２６８
ｑ［１７１］＝−０．０８７８５００７９９
ｑ［１７２］＝−０．０９４６９９１７８３
ｑ［１７３］＝−０．１０１４８３９３４１
ｑ［１７４］＝−０．１０８２０２４４５４
ｑ［１７５］＝−０．１１４８５２３６８６
ｑ［１７６］＝−０．１２１４３０８８７６
ｑ［１７７］＝−０．１２７９３４６７９０
ｑ［１７８］＝−０．１３４３５９８７３８
ｑ［１７９］＝−０．１４０７０２０１３２
ｑ［１８０］＝−０．１４６９５６０００５
ｑ［１８１］＝−０．１５３１１６０４５５
ｑ［１８２］＝−０．１５９１７５６０２４
ｑ［１８３］＝−０．１６５１２７３００５
ｑ［１８４］＝−０．１７０９６２８６３６
ｑ［１８５］＝−０．１７６６７３０２０２
ｑ［１８６］＝−０．１８２２４７４０１１
ｑ［１８７］＝−０．１８７６７４４２２２
ｑ［１８８］＝−０．１９２９４１１５１９
ｑ［１８９］＝−０．１９８０３３１５８８
ｑ［１９０］＝−０．２０２９３４３３８０
ｑ［１９１］＝−０．２０７６２６７１３７ In many cases, the system of linear equations (28) is over-determined. Nevertheless, it can be solved or approximated in a least squares manner with respect to prototype filter coefficients q (n). Solving the system of linear equations (28) in the least squares method leads to the following filter taps of prototype filter q (n) that satisfy the following relationship for integer n from 0 to 191.
q [0] = − 0.2029343380
q [1] = − 0.1980331588
q [2] = − 0.1929411519
q [3] = − 0.187767444222
q [4] = − 0.1822474011
q [5] =-0.1767730202
q [6] = − 0.17096286636
q [7] = − 0.1651273005
q [8] = − 0.1591756024
q [9] = − 0.1531140455
q [10] = − 0.1469560005
q [11] = − 0.1407020132
q [12] = − 0.1343598738
q [13] = − 0.1279346790
q [14] = − 0.1214308876
q [15] = − 0.1148523686
q [16] = − 0.10820244454
q [17] = − 0.1014939341
q [18] = − 0.09466991783
q [19] = − 0.08788500799
q [20] = − 0.0809381268
q [21] = − 0.07396444174
q [22] =-0.06692996831
q [23] = − 0.0598343081
q [24] = − 0.0526783466
q [25] = − 0.045454615388
q [26] = − 0.0381833249
q [27] = − 0.0308428572
q [28] =-0.0234390115
q [29] = − 0.0159703957
q [30] = − 0.0084535584
q [31] = − 0.0008319956
q [32] = 0.068418435
q [33] = 0.0145585527
q [34] = 0.0224107648
q [35] = 0.0303113495
q [36] = 0.0382934126
q [37] = 0.04636022959
q [38] = 0.0545155789
q [39] = 0.0627630810
q [40] = 0.07110686657
q [41] = 0.0795512453
q [42] = 0.0881007879
q [43] = 0.0967603259
q [44] = 0.01555369658
q [45] = 0.144301000
q [46] = 0.1234514222
q [47] = 0.3226049434
q [48] = 0.1418970123
q [49] = 0.1513433370
q [50] = 0.1609240126
q [51] = 0.706673535
q [52] = 0.1805909194
q [53] = 0.9060685753
q [54] = 0.2009635191
q [55] = 0.2113373458
q [56] = 0.2211163080
q [57] = 0.2330113868
q [58] = 0.2441343742
q [59] = 0.2554997664
q [60] = 0.2671158700
q [61] = 0.2790029236
q [62] = 0.2911152349
q [63] = 0.0.3650503350
q [64] = 0.90252575713
q [65] = 0.9103585196
q [66] = 0.91769777825
q [67] = 0.9245760683
q [68] = 0.9310214581
q [69] = 0.9370596739
q [70] = 0.9427143143
q [71] = 0.9480070606
q [72] = 0.9529578666
q [73] = 0.9575850672
q [74] = 0.61919056158
q [75] = 0.965953165
q [76] = 0.9696879297
q [77] = 0.97317373547
q [78] = 0.97664156119
q [79] = 0.9794139640
q [80] = 0.9821827692
q [81] = 0.90847315377
q [82] = 0.87070697790
q [83] = 0.8992030462
q [84] = 0.9991409728
q [85] = 0.9928893067
q [86] = 0.94494539395
q [87] = 0.99958401318
q [88] = 0.99970525352
q [89] = 0.9980952118
q [90] = 0.9989716504
q [91] = 0.99968847806
q [92] = 1.0002369837
q [93] = 1.0006301028
q [94] = 1.0008654482
q [95] = 1.0009438063
q [96] = 1.0008654482
q [97] = 1.0006301028
q [98] = 1.0002369837
q [99] = 0.99968847806
q [100] = 0.9989716504
q [101] = 0.9980952118
q [102] = 0.99970525352
q [103] = 0.99958401318
q [104] = 0.94494539395
q [105] = 0.9928893067
q [106] = 0.9911409728
q [107] = 0.8992030462
q [108] = 0.87070697790
q [109] = 0.9847473377
q [110] = 0.9821827692
q [111] = 0.9794139640
q [112] = 0.97664156119
q [113] = 0.97317373547
q [114] = 0.9696879297
q [115] = 0.965953165
q [116] = 0.91619056158
q [117] = 0.9575850672
q [118] = 0.9529575866
q [119] = 0.9480070606
q [120] = 0.9427143143
q [121] = 0.9370596739
q [122] = 0.9310214581
q [123] = 0.9245760683
q [124] = 0.91769777825
q [125] = 0.9103585196
q [126] = 0.90252757513
q [127] = 0.8941712974
q [128] = 0.2911752349
q [129] = 0.2790029236
q [130] = 0.2671158700
q [131] = 0.2554997664
q [132] = 0.2441343742
q [133] = 0.2330113868
q [134] = 0.2211163080
q [135] = 0.2113373458
q [136] = 0.2009635191
q [137] = 0.9060685753
q [138] = 0.1805909194
q [139] = 0.70667353517
q [140] = 0.1609240126
q [141] = 0.1513433370
q [142] = 0.1418970123
q [143] = 0.3226049434
q [144] = 0.1234514222
q [145] = 0.1444301000
q [146] = 0.05053496658
q [147] = 0.09676033259
q [148] = 0.0881007879
q [149] = 0.0795512453
q [150] = 0.07110686657
q [151] = 0.0627630810
q [152] = 0.0545155789
q [153] = 0.0466032959
q [154] = 0.0382934126
q [155] = 0.0303113495
q [156] = 0.0224107648
q [157] = 0.0145585527
q [158] = 0.068418435
q [159] = − 0.0008319956
q [160] = − 0.0084535584
q [161] = − 0.0159703957
q [162] = − 0.0234390115
q [163] = − 0.0308428572
q [164] = − 0.038183333249
q [165] = − 0.045454615388
q [166] = − 0.0526783466
q [167] = − 0.0598343081
q [168] =-0.06692996831
q [169] =-0.073964444174
q [170] = − 0.0809381268
q [171] = − 0.08788500799
q [172] = − 0.09466991783
q [173] = − 0.1014939341
q [174] = − 0.1008202454
q [175] = − 0.1148523686
q [176] = − 0.1214308876
q [177] = − 0.1279346790
q [178] = − 0.1343598738
q [179] = − 0.1407020132
q [180] = − 0.1469560005
q [181] = − 0.1531100455
q [182] = − 0.1591756024
q [183] = − 0.1651273005
q [184] = − 0.17096286636
q [185] = − 0.1767670202
q [186] =-0.1822474011
q [187] =-0.187767444222
q [188] = − 0.1929411519
q [189] = − 0.1980331588
q [190] = − 0.2029343380
q [191] = − 0.207676267137

図１３は、プロセッサ８２０およびフィルタ・インパルス応答コンストラクタ３０５を含むフィルタ・コンプレッサ１０２の実施形態の簡略化したブロック図を示し、それらは、フィルタ・コンプレッサ１０２の実施形態の入力および出力間に直列に接続される。フィルタ・コンプレッサ１０２の実施形態は入力で、プロセッサ８２０に提供される、フィルタ・タップでフィルタ・インパルス応答値を有する、１セットの入力サブバンド・フィルタ・インパルス応答を受信する。プロセッサ８２０は、入力サブバンド・フィルタ・インパルス応答のうちの少なくとも２つからフィルタ・インパルス応答値を調べ、さらに、図４との関連で、特にマスク・ジェネレータ３０４とともに、特に絶対値表現モジュール３０３および白色化モジュール４０２との関連で説明されるように、より高い絶対値を有するフィルタ・インパルス応答値を選択することができる。さらに、プロセッサ８２０は、少なくとも１つの選択されたフィルタ・インパルス応答と比較してより低い絶対値を有する、少なくとも１つのフィルタ・インパルス応答値を選択しないことができる。 FIG. 13 shows a simplified block diagram of an embodiment of the filter compressor 102 that includes a processor 820 and a filter impulse response constructor 305 that are connected in series between the input and output of the embodiment of the filter compressor 102. Is done. An embodiment of the filter compressor 102 receives at the input a set of input subband filter impulse responses having filter impulse response values at the filter taps provided to the processor 820. The processor 820 examines the filter impulse response values from at least two of the input subband filter impulse responses, and further in conjunction with FIG. As described in the context of the whitening module 402, a filter impulse response value having a higher absolute value can be selected. Further, the processor 820 may not select at least one filter impulse response value that has a lower absolute value compared to the at least one selected filter impulse response.

換言すれば、図１３に示される実施形態のプロセッサ８２０は、絶対値表現モジュール３０３およびマスク・ジェネレータ３０４の機能性を含む。フィルタ・インパルス応答コンストラクタまたはむしろフィルタ計算機モジュール３０５は、選択されたフィルタ・インパルス応答値を用いて圧縮サブバンド・フィルタ・インパルス応答を構成することができ、圧縮サブバンド・フィルタ・インパルス応答は、選択されていないフィルタ・インパルス応答値のフィルタ・タップに対応するフィルタ・インパルス応答値またはゼロ値の値を含まない。前に説明されるように、フィルタ・インパルス応答フィルタ３０５は、選択されていないインパルス応答値をゼロに設定するように、または、選択されたインパルス応答値だけをコピーすることによってまたは選択されていないフィルタ・インパルス応答値を無視するいくつかの他の手段によって圧縮サブバンド・フィルタ・インパルス応答を構成するように、構成することができる点に留意すべきである。 In other words, the processor 820 of the embodiment shown in FIG. 13 includes the functionality of the absolute value representation module 303 and the mask generator 304. The filter impulse response constructor or rather the filter calculator module 305 can construct a compressed subband filter impulse response using the selected filter impulse response value, and the compressed subband filter impulse response is selected It does not contain a filter impulse response value or zero value corresponding to the filter tap of the unfiltered filter impulse response value. As previously described, the filter impulse response filter 305 sets the unselected impulse response value to zero, or by copying only the selected impulse response value or not selected It should be noted that the compressed subband filter impulse response can be configured by some other means of ignoring the filter impulse response value.

結果として、フィルタ・コンプレッサ１０２の実施形態は、図１３に示されるように、フィルタ・タップでフィルタ・インパルス応答値を有するサブバンド・フィルタ・インパルス応答において入力から圧縮サブバンド・フィルタ・インパルス応答を作る本発明の方法の実施形態を実施することができる。圧縮サブバンド・フィルタ・インパルス応答との関連で、それを作ることは、圧縮サブバンド・フィルタ・インパルス応答をシステムとしてまたはコンピュータ読み取り可能な記憶媒体に生成しまたは提供するとして同様に理解することができる。 As a result, the embodiment of the filter compressor 102 can generate a compressed subband filter impulse response from the input in a subband filter impulse response having a filter impulse response value at the filter tap, as shown in FIG. Embodiments of the inventive method of making can be implemented. Making it in the context of a compressed subband filter impulse response can be similarly understood as generating or providing the compressed subband filter impulse response as a system or in a computer readable storage medium. it can.

図４において白色化モジュール４０２の説明との関連で示されるように、方程式（４）に従って実施することができる、評価表現Ａ（ｎ，ｋ）またはむしろ絶対値表現Ａν（ｎ，ｋ）を白色化しまたは重み付ける記載されている方法が、図１４との関連でさらに詳細に説明される。したがって、図１４ａは、時間ドメインにおいてフィルタの周波数の関数として、例となるフィルタ特性８５０の略図を示す。さらに、図１４ａは、対応する周波数バンド８６０−０、・・・、８６０−４の配列を概略的に示し、それは、インデックスｋ＝０、・・・、４を有するサブバンドに対応する。それぞれのサブバンド・インデックスｋを有するサブバンドのうちの１つに対応する（前に示されるように引用符号を要約することを用いて）それらの周波数バンド８６０のそれぞれは、中心周波数に関してさらに特徴づけられることができ、それは、図１４ａにおいて点線８７０−０、・・・、８７０−４として示される。中心周波数、およびそれぞれのサブバンドの周波数バンドは、フィルタ・コンバータ１０１において使用される複素変調されたフィルタバンクの内部構成によって決定される。具体的には、サブバンド・インデックスｋによる中心周波数とともにプロトタイプ。フィルタｑ（ｎ）は、例えば方程式（１４）の場合において見られるように、それぞれのサブバンドの対応する周波数バンドを決定する。例えば、対応する複素変調されたフィルタバンクｐ（ｎ）またはｑ（ｎ）のプロトタイプ・フィルタは、インデックスｋ＝０を有するサブバンドのためのローパス・フィルタである場合、方程式（１４）において指数関数によって表現されるように複素変調のため、より高いサブバンド・インデックスｋ＝１のためのバンドパス・フィルタに変換される。 As shown in connection with the description of the whitening module 402 in FIG. 4, the evaluation expression A (n, k) or rather the absolute value expression Aν (n, k), which can be implemented according to equation (4), is white. The described method of digitizing or weighting is explained in more detail in connection with FIG. Accordingly, FIG. 14a shows a schematic diagram of an example filter characteristic 850 as a function of filter frequency in the time domain. Furthermore, FIG. 14a schematically shows an arrangement of the corresponding frequency bands 860-0,..., 860-4, which corresponds to a subband having an index k = 0,. Each of those frequency bands 860 corresponding to one of the subbands having a respective subband index k (using summarizing the reference signs as indicated earlier) is further characterized with respect to the center frequency. Which is shown as dotted lines 870-0,..., 870-4 in FIG. The center frequency and the frequency band of each subband are determined by the internal configuration of the complex modulated filter bank used in the filter converter 101. Specifically, prototype with center frequency by subband index k. The filter q (n) determines the corresponding frequency band of each subband, as seen, for example, in the case of equation (14). For example, if the corresponding complex-modulated filter bank p (n) or q (n) prototype filter is a low-pass filter for a subband with index k = 0, an exponential function in equation (14) Is converted to a bandpass filter for a higher subband index k = 1 for complex modulation as expressed by

図１４ｂは、例えばフィルタ・コンバータ１０１によって提供されるように入力サブバンド・フィルタ・インパルス応答の略図を示す。具体的には、図１４ｂは、１セットの矢印８８０として示される、異なるサブバンドのための評価表現Ａ（ｋ，ｎ）を図解的に示す。単純化だけのために、それぞれのサブバンドに対して１セットの３つの矢印８８０が、それぞれのサブバンド８９０−０、・・・、８９０−４に対して図１４ｂにおいて示される。括弧でくくられた括弧９００−０、・・・、９００−２によって示されるように、５つのサブバンド８９０−０、・・・、８９０−４は、サブバンド９００−０、９００−１、９００−２の３つのサブグループに配列され、第１のサブグループ９００−０は、第１のサブバンド８９０−０（ｋ＝０）だけを含み、一方、第２および第３のサブグループ９００−１、９００−２は、中心周波数に関して２つの隣接したサブバンド８９０−１および８９０−２と８９０−３および８９０−４とをそれぞれ含む。 FIG. 14 b shows a schematic diagram of the input subband filter impulse response as provided by, for example, filter converter 101. Specifically, FIG. 14b schematically shows an evaluation representation A (k, n) for different subbands, shown as a set of arrows 880. For simplicity only, a set of three arrows 880 for each subband is shown in FIG. 14b for each subband 890-0,. As indicated by parentheses 900-0,..., 900-2, the five subbands 890-0,..., 890-4 are subbands 900-0, 900-1,. Arranged into three subgroups 900-2, the first subgroup 900-0 includes only the first subband 890-0 (k = 0), while the second and third subgroup 900 −1, 900-2 includes two adjacent subbands 890-1 and 890-2 and 890-3 and 890-4, respectively, with respect to the center frequency.

図４に示される白色化モジュール４０２の枠組みにおいて実施される白色化またはスペクトル白色化によれば、サブバンド９００のサブグループのそれぞれに関して方程式（４）に従って、評価表現の最大値は、決定され、さらに、その後、図１４ｃに示されるように、白色化された評価表現Ａ_W（ｋ，ｎ）を得るために、評価表現値のそれぞれから減算される。評価表現の最大値を減算する結果として、サブグループ９００のそれぞれに対して、評価表現の最大貢献は、図１４ｃにおいてドット９１０によって示されるように、ゼロに設定される。 According to whitening or spectral whitening implemented in the framework of the whitening module 402 shown in FIG. 4, according to equation (4) for each of the subgroups of subband 900, the maximum value of the evaluation expression is determined, Further, as shown in FIG. 14c, thereafter, each of the evaluation expression values is subtracted to obtain a whitened evaluation expression A _W (k, n). As a result of subtracting the maximum value of the evaluation expression, for each of the subgroups 900, the maximum contribution of the evaluation expression is set to zero, as shown by dot 910 in FIG. 14c.

方程式（４）によるサブグループ９００のそれぞれのための最大値の決定のため、サブバンド９００のサブグループのそれぞれは、ゼロの値を有する少なくとも１つの白色化された評価表現値を含み、白色化された評価表現値Ａ_W（ｋ，ｎ）の残りは、ゼロ以下である。結果として、サブグループ９００のそれぞれにおいて少なくとも１つの値は、ゼロに設定され、それによって最大値を表し、そのため、いくつかの実施形態において心理音響モデルに従って決定されたサブバンドのサブグループのそれぞれは、サブグループ９００のそれぞれの少なくとも１つのフィルタ・インパルス応答値の圧縮の過程において保持される。 For the determination of the maximum value for each of the subgroups 900 according to equation (4), each of the subgroups of subband 900 includes at least one whitened evaluation representation value having a value of zero, The remainder of the evaluated evaluation expression value A _W (k, n) is less than or equal to zero. As a result, at least one value in each of the subgroups 900 is set to zero, thereby representing a maximum value, so that in each embodiment, each of the subband subgroups determined according to the psychoacoustic model is , In the process of compression of at least one filter impulse response value for each of the subgroups 900.

それによって、方程式（４）によって表されるようにスペクトル白色化の過程において、スペクトル重みまたはスペクトル・エネルギーは、白色化スキームを適用することによってより低い中心周波数を有するサブバンドからより高い周波数を有するサブバンドへ転送される。図１４ｂおよび図１４ｃの直接比較も、これを明確に示す。図１４ｂにおいてサブグループ９００−２における評価表現価値が、サブグループ９００−１のそれらより著しく小さい一方、白色化手順を適用した後に、サブグループ９００−２において結果として生じる白色化された評価表現値は、サブグループ９００−１の評価表現の値の少なくともいくつかと比較して著しく大きい。これに関連してサブグループ９００−１が、ドット９１０によって示されるように２つのゼロ値の評価表現値を含む点に留意すべきであり、それは、サブグループの図１４ｂに示されるように評価表現Ａ（ｋ，ｎ）が２つの同一の最大値を含むという事実によって生じる。しかしながら、これは、方程式（４）のアプリケーションの違反でない。方程式（４）は、それぞれのサブグループの評価表現の少なくとも１つの値がゼロに設定され、それによって白色化された評価表現Ａ_W（ｋ．ｎ）との関連で最大値を表すことを確実にするだけである。 Thereby, in the process of spectral whitening as represented by equation (4), spectral weights or spectral energy have higher frequencies from subbands with lower center frequencies by applying a whitening scheme Transferred to subband. A direct comparison of FIGS. 14b and 14c also clearly shows this. While the evaluation representation value in subgroup 900-2 in FIG. 14b is significantly less than those in subgroup 900-1, the resulting whitened evaluation expression value in subgroup 900-2 after applying the whitening procedure Is significantly larger than at least some of the values of the evaluation expressions of subgroup 900-1. In this regard, it should be noted that subgroup 900-1 includes two zero-valued evaluation representation values as indicated by dot 910, which is evaluated as shown in subgroup Figure 14b. This is caused by the fact that the expression A (k, n) contains two identical maximum values. However, this is not a violation of the application of equation (4). Equation (4) ensures that at least one value of the evaluation expression for each subgroup is set to zero, thereby representing a maximum value in relation to the whitened evaluation expression A _W (kn). Just make it.

図１５は、フィルタ・コンプレッサ５０１のさらなる実施形態を示し、それは、１つを超える入力サブバンド・フィルタ・インパルス応答Ｈν（ｎ，ｋ）を処理することができる。図１５に示されるフィルタ・コンプレッサの構成は、図５に示されるものと非常に類似し、絶対値表現モジュール３０３がそれぞれ絶対値および対数関数モジュール４０１および白色化モジュール４０２を含むという事実に関してだけ実施形態と異なり、それは、図４との関連で示されさらに説明される。さらに、フィルタ計算モジュールまたはフィルタ・インパルス応答コンストラクタ３０５は、それぞれ、図４との関連で任意のコンポーネントとして実施することができるフィルタ・デシメータ・モジュール４０３および利得計算機４０４を含む。 FIG. 15 shows a further embodiment of the filter compressor 501, which can process more than one input subband filter impulse response Hv (n, k). The filter compressor configuration shown in FIG. 15 is very similar to that shown in FIG. 5 and is implemented only with respect to the fact that the absolute value representation module 303 includes an absolute value and logarithmic function module 401 and a whitening module 402, respectively. Unlike the form, it is shown and described further in connection with FIG. In addition, the filter calculation module or filter impulse response constructor 305 includes a filter decimator module 403 and a gain calculator 404, respectively, that can be implemented as optional components in connection with FIG.

図１５に図示される実施形態は、さらにマルチプル・フィルタのためのマスク・ジェネレータ５０２に関して図５示される実施形態と異なる。具体的には、図１５のマスク・ジェネレータ５０２は、平均計算モジュール９２０を含み、それは、例えば、方程式（９）による個々の（任意に白色化された）絶対値表現Ａν（ｎ，ｋ）に基づいて共同の絶対値表現Ａ（ｎ，ｋ）を計算するを実施することができる。さらに具体的には、方程式（９）の枠組みにおいて、フィルタν＝０、・・・、（Ｎ−１）のためのフィルタＡν（ｎ，ｋ）のそれぞれのための個々の絶対値表現または評価表現は、白色化された評価表現値が白色化モジュール４０２によって平均計算モジュール９２０に提供されるように、適切な白色化された評価表現Ａ_Wν（ｎ，ｋ）と置換される。フィルタ・コンプレッサ５０１の実施形態において、例えば図１５に示されるように、異なるフィルタν＝０、・・・、（Ｎ−１）のための個々のフィルタ・インパルス応答コンストラクタ３０５は、Ｎが実施形態５０１に提供されるフィルタの数であり、図１５に点線によって示されるように、単一の（全体の）フィルタ・インパルス応答コンストラクタ３０５´として実施することができる。具体的には、具体的な実施および技術的な事情に応じて、Ｎ個の個々のフィルタ・インパルス応答コンストラクタ３０５よりむしろ単一のフィルタ・インパルス応答コンストラクタ３０５´を実施することが望ましい。これは、例えば、少なくともフィルタ・インパルス応答コンストラクタの枠組みにおいて、計算する能力が本質的な設計目標または要求でない場合であってもよい。換言すれば、図１５に示される実施形態５０１は、プロセッサ８２０およびフィルタ・インパルス応答コンストラクタ３０５´がそれぞれのフィルタ・コンプレッサ５０１の入力および出力間に直列に接続される実施形態と見なすことができる。 The embodiment illustrated in FIG. 15 is further different from the embodiment illustrated in FIG. 5 with respect to the mask generator 502 for multiple filters. Specifically, the mask generator 502 of FIG. 15 includes an average calculation module 920, which, for example, produces individual (optionally whitened) absolute value representations Aν (n, k) according to equation (9). Based on this, a joint absolute value representation A (n, k) can be calculated. More specifically, in the framework of equation (9), an individual absolute value representation or evaluation for each of the filters Aν (n, k) for filters ν = 0,..., (N−1). The representation is replaced with the appropriate whitened evaluation representation A _W v (n, k) so that the whitened evaluation representation value is provided by the whitening module 402 to the average calculation module 920. In an embodiment of the filter compressor 501, for example, as shown in FIG. 15, the individual filter impulse response constructors 305 for the different filters ν = 0,. The number of filters provided at 501, which can be implemented as a single (overall) filter impulse response constructor 305 ', as shown by the dotted line in FIG. Specifically, depending on the specific implementation and technical circumstances, it may be desirable to implement a single filter impulse response constructor 305 ′ rather than N individual filter impulse response constructors 305. This may be the case, for example, at least in the framework of a filter impulse response constructor, where the ability to calculate is not an essential design goal or requirement. In other words, the embodiment 501 shown in FIG. 15 can be viewed as an embodiment in which the processor 820 and the filter impulse response constructor 305 ′ are connected in series between the input and output of each filter compressor 501.

さらに、方法の実施形態およびフィルタ・コンプレッサ１０２、５０１の実施形態によって実施される方法に関して、図１〜図６、図１３および図１５がそれぞれの方法のフローチャートと考慮することもできる点に留意すべきであり、「フローの方向」は、信号の方向に含まれる。換言すれば、上述の図は、フィルタ・コンプレッサ１０２、５０１の異なる実施形態を反映するだけでなく、これらの実施形態によって実施される方法および圧縮サブバンド・フィルタ・インパルス応答自体を生成のための方法の実施形態の両方を示す。 Furthermore, it should be noted that FIGS. 1-6, 13 and 15 can also be considered as flowcharts of respective methods with respect to the method embodiments and the methods implemented by the filter compressor 102, 501 embodiments. “Flow direction” should be included in the signal direction. In other words, the above figures reflect not only the different embodiments of the filter compressors 102, 501, but also the methods implemented by these embodiments and the generation of the compressed subband filter impulse response itself. Both method embodiments are shown.

そのため、本発明の実施形態は、ＱＭＦ（ＱＭＦ＝直交・ミラー・フィルタバンク）と呼ばれることもあるサブバンド・ドメインにおけるフィルタ・コンプレッサに関し、それは、例えばヘッドホン上のマルチチャンネル・サウンド体験のための頭部伝達関数（ＨＲＴＦ）のフィルタリングなどの音声アプリケーションの分野において使用することができる。 As such, embodiments of the present invention relate to a filter compressor in the subband domain, sometimes referred to as QMF (QMF = Quadrature Mirror Filter Bank), which is the head for multi-channel sound experience on headphones, for example. It can be used in the field of voice applications such as part transfer function (HRTF) filtering.

本発明は、ＱＭＦドメインにおいて長いフィルタを用いる計算の複雑性の問題に関する。本発明は、１つ以上のフィルタの時間周波数表現において最も関連したフィルタ係数を選択することによって、最も関連したフィルタ係数を示すフィルタ・マスクを作りことによって、または、フィルタ・マスクによってカバーされない係数を無視することによって、ＱＭＦドメインにおけるフィルタリングを適用する場合に、必要な計算を低減する新しいやり方を教示する。 The present invention relates to the problem of computational complexity using long filters in the QMF domain. The present invention selects coefficients that are most relevant in the time frequency representation of one or more filters, creates a filter mask that shows the most relevant filter coefficients, or coefficients that are not covered by the filter mask. By ignoring, it teaches a new way to reduce the required computation when applying filtering in the QMF domain.

しかしながら、フィルタ・コンプレッサ５０１の実施形態において、プロセッサ８２０は、フィルタ・コンプレッサによって出力される圧縮フィルタ・インパルス応答のためのフィルタ・インパルス応答値を調べさらに選択する場合に、フィルタ・コンプレッサ５０１に提供される全てのフィルタを考慮することを必要としない。しかしながら、この場合、フィルタ・コンプレッサの実施形態は、圧縮フィルタ・インパルス応答、または、フィルタ・インパルス応答値を調べさらに選択する枠組みにおいて考慮に入れられなかった１つ以上の入力フィルタ・インパルス応答のための複数の圧縮フィルタ・インパルス応答を構成するように構成することができる。これは、例えば、１つ以上のフィルタが計算のさらなる複雑性を低減するために知覚的にそれほど重要でない場合に望ましく、これらのフィルタは、フィルタ・インパルス応答値を調べさらに選択する場合に、考慮に入れられることを必要とされない。これは、例えば、１つ以上のフィルタが著しい量のエネルギーまたはボリュームを有しない場合に実施することができる。これらの場合、これらのフィルタに基づいてフィルタ・インパルス応答値を調べなくて選択しないことによって導入されるディストーションは、これらのフィルタの特別の事情に応じて受け入れることができる。 However, in the filter compressor 501 embodiment, the processor 820 is provided to the filter compressor 501 when examining and selecting a filter impulse response value for the compressed filter impulse response output by the filter compressor. It is not necessary to consider all filters. However, in this case, the filter compressor embodiment may be due to a compressed filter impulse response or one or more input filter impulse responses that were not taken into account in the framework of examining and selecting filter impulse response values. A plurality of compression filter impulse responses. This is desirable, for example, when one or more filters are not perceptually important to reduce further computational complexity, and these filters are considered when examining and selecting filter impulse response values. You are not required to be put in. This can be done, for example, when one or more filters do not have a significant amount of energy or volume. In these cases, the distortion introduced by not examining and selecting the filter impulse response values based on these filters can be accepted depending on the particular circumstances of these filters.

本発明のいくつかの実施形態は、以下の特徴を含む。
・時間ドメイン・フィルタまたはいくつかのフィルタを複素ＱＭＦフィルタ表現に変換すること
・ＱＭＦドメインにおいてフィルタの絶対値の時間／周波数表現を作ること
・絶対値の表現のスペクトル白色化を適用すること
・１つ以上のフィルタの絶対値の時間／周波数表現を与える所望のフィルタ係数を示すフィルタ・マスクを作ること
・フィルタ・マスクによって示される係数を含む新しい複素ＱＭＦフィルタを作ること Some embodiments of the invention include the following features.
Transforming a time domain filter or some filters into a complex QMF filter representation, creating a time / frequency representation of the absolute value of the filter in the QMF domain, applying spectral whitening of the representation of the absolute value, 1 Creating a filter mask showing the desired filter coefficients giving a time / frequency representation of the absolute value of one or more filters; creating a new complex QMF filter containing the coefficients indicated by the filter mask

元のフィルタと同じ新しいフィルタの利得を得るために、新しいフィルタまたは複数のフィルタの利得を調整する。 Adjust the gain of the new filter or filters to obtain the same new filter gain as the original filter.

フィルタの複素ＱＭＦドメイン表現を再計算するための装置の実施形態は、
・時間ドメイン・フィルタをＱＭＦドメイン表現に変換すること、
・フィルタのＱＭＦ表現のフィルタ表現を作ること、
・フィルタのＱＭＦドメイン表現の表現に基づいてフィルタ・マスクを作ること、さらに
・第１のＱＭＦフィルタおよびフィルタ・マスクに基づいて新しいＱＭＦフィルタを作ること
を実行することができる。 An embodiment of an apparatus for recalculating a complex QMF domain representation of a filter is:
Converting the time domain filter to a QMF domain representation;
Creating a filter representation of the QMF representation of the filter,
Creating a filter mask based on the representation of the QMF domain representation of the filter, and creating a new QMF filter based on the first QMF filter and the filter mask.

本発明のいくるかの実施形態は、長いフィルタのフィルタリングの高い計算の複雑性の問題を解決することができる。それらは、複素ＱＭＦドメインにおいて動作するフィルタ・コンプレッサを導入する。したがって、本発明のいくつかの実施形態は、フィルタリングの計算の複雑性の低減を提供することができる。本発明の実施形態は、例えば、フィルタ・コンプレッサ、圧縮サブバンド・フィルタ・インパルス応答を作るための方法、コンピュータ読み取り可能な記憶媒体、または、コンピュータプログラムとして実施することができる。 Some embodiments of the present invention can solve the problem of high computational complexity of filtering long filters. They introduce a filter compressor that operates in the complex QMF domain. Thus, some embodiments of the present invention may provide a reduction in filtering computational complexity. Embodiments of the present invention can be implemented, for example, as a filter compressor, a method for generating a compressed subband filter impulse response, a computer readable storage medium, or a computer program.

フィルタ・コンプレッサ１０２、５０１の実施形態は、多くの音声に関連したインパルス応答の特徴がかなりまばらな時間／周波数符号を有するけれども、全体の音質を著しく改善する機会を提供する。多くの場合により長い貢献は、低い周波数だけに存在し、効率的な持続時間は、より高い周波数のためのノーマルなフィルタ長より非常に短い。本発明の実施形態は、例えばフィルタ・コンプレッサの形式において、これらの特徴を利用することができる。 The embodiment of the filter compressor 102, 501 provides an opportunity to significantly improve the overall sound quality, even though the impulse response characteristics associated with many speeches have a fairly sparse time / frequency code. In many cases, longer contributions exist only at lower frequencies, and the effective duration is much shorter than the normal filter length for higher frequencies. Embodiments of the present invention can take advantage of these features, for example in the form of filter compressors.

さらに、圧縮サブバンド・フィルタ・インパルス応答は、フィルタ・コンプレッサの実施形態によって提供されるように、それに記憶された同時に時間ドメインＨＲＴＦフィルタを近似しまたは表す１セットまたは複数のサブバンド・フィルタ・インパルス応答を有するコンピュータ読み取り可能な記憶媒体に記憶することができる点に留意すべきである。複素ＱＭＦドメインにおいてそれぞれのセットのＨＲＴＦ関連のフィルタ・インパルス応答と比較して、コンピュータ読み取り可能な記憶媒体に記憶される複数の圧縮サブバンド・フィルタ・インパルス応答は、典型的により短いインパルス応答を有し、それは、より低い数のそれぞれのインパルス応答値を有することによってか、低減された数の非自明な若しくは非ゼロのフィルタ・タップまたは両方の組合せによって、実現することができる。 In addition, the compressed subband filter impulse response is a set of one or more subband filter impulses that approximate or represent a simultaneous time domain HRTF filter stored therein as provided by the filter compressor embodiment. It should be noted that the response can be stored on a computer readable storage medium. Compared to each set of HRTF-related filter impulse responses in the complex QMF domain, multiple compressed subband filter impulse responses stored in a computer readable storage medium typically have a shorter impulse response. However, it can be realized by having a lower number of respective impulse response values, or by a reduced number of non-trivial or non-zero filter taps or a combination of both.

例えば、対応するＨＲＴＦフィルタ関数が時間ドメインにおいてＫ_H個のフィルタ・タップを含み、さらに、コンピュータ読み取り可能な記憶媒体に記憶される圧縮サブバンド・フィルタ・インパルス応答がＬ個のサブバンドを有するサブシステムにおいて用いられることを使われることを目的とする場合、より短いインパルス応答は、典型的に、少なくとも１つのサブバンド・フィルタ・インパルス応答に関して（Ｋ_H／Ｌ）より少なく含まれる。好ましくは、少なくとも１つのサブバンド・フィルタ・インパルス応答は、（Ｋ_H／Ｌ−３）個より少ない非自明なまたは非ゼロのフィルタ・タップを含む。 For example, the corresponding HRTF filter function includes K _H filter taps in the time domain, and the compressed subband filter impulse response stored in the computer readable storage medium has subbands with L subbands. When intended to be used in a system, shorter impulse responses are typically included less than (K _H / L) for at least one subband filter impulse response. Preferably, the at least one subband filter impulse response includes less than (K _H / L-3) non-trivial or non-zero filter taps.

さらに、コンピュータ読み取り可能な記憶媒体に、圧縮時間ドメインＨＲＴＦフィルタに対応する複数のセットのサブバンド・フィルタ・インパルス応答より多くが記憶される場合、対応するセットの圧縮サブバンド・フィルタ・インパルス応答は、共通データ・パターンを含み、共通データ・パターンは、インパルス応答値を示し、それは、コンピュータ読み取り可能な記憶媒体に記憶される少なくともいくつかのセットのサブバンド・フィルタ・インパルス応答において、自明な値を有するか欠けているインパルス応答値を示す。換言すれば、共通データ・パターンは、１より多いセットのフィルタ・インパルス応答において選択されていないフィルタ・インパルス応答値に関し、それは、フィルタ・コンプレッサの実施形態の一部としてのフィルタ・インパルス応答コンストラクタによって用いられなかった。そのような（類似の）データ・パターンは、例えば、マルチプル入力フィルタＨν（ｎ，ｋ）のためのマスク・ジェネレータ５０２によって提供されるように共通フィルタ・マスクＭ（ｎ，ｋ）によって生じ得る。 Furthermore, if a computer readable storage medium stores more than multiple sets of subband filter impulse responses corresponding to a compressed time domain HRTF filter, the corresponding set of compressed subband filter impulse responses is , Including a common data pattern, the common data pattern indicating an impulse response value, which is an obvious value in at least some sets of subband filter impulse responses stored in a computer readable storage medium. The impulse response value having or lacking is shown. In other words, the common data pattern relates to a filter impulse response value that is not selected in more than one set of filter impulse responses, which is determined by the filter impulse response constructor as part of the filter compressor embodiment. Not used. Such a (similar) data pattern may be caused by the common filter mask M (n, k), for example as provided by the mask generator 502 for the multiple input filter Hv (n, k).

さらに換言すれば、コンピュータ読み取り可能な記憶媒体は、異なるサブバンドのための単一セットの（圧縮）サブバンド・フィルタ・インパルス応答だけでなく、複数のフィルタ・インパルス応答を含んでもよい。これらのセットのフィルタ・インパルス応答のそれぞれは、全体として見られる場合に、共通データ・パターンを含み、それは、ゼロ値であるか全く欠けている対応するインパルス応答値によって与えられる。これらのセットのフィルタ・インパルス応答のそれぞれは、コンピュータ読み取り可能な記憶媒体に記憶される同じ共通のデータ・パターンを含む。例えば、１セットのフィルタ・インパルス応答値において、時間またはサンプル・インデックスｎとサブバンド・インデックスｋとによって示される特定の値が、欠けているかまたはゼロ値である場合、他のセットのサブバンド・インパルス応答の同じサンプルまたは時間インデックスｎと同じサブバンド・インデックスｋとによって識別されるインパルス応答値は、欠けているか、ゼロ値であるか、他の定義された値を有する。これに関連して、異なるセットのフィルタ・インパルス応答は、それぞれのインデックスｎによって識別されまたは表示され、インデックスνは、例えば０から（Ｎ−１）の範囲においていずれかの整数値を得てもよく、Ｎは、またフィルタの数である。 In other words, a computer readable storage medium may include multiple filter impulse responses as well as a single set (compressed) subband filter impulse response for different subbands. Each of these sets of filter impulse responses, when viewed as a whole, contains a common data pattern, which is given by a corresponding impulse response value that is zero or missing at all. Each of these sets of filter impulse responses includes the same common data pattern stored on a computer readable storage medium. For example, in a set of filter impulse response values, if a particular value indicated by time or sample index n and subband index k is missing or has a zero value, another set of subband Impulse response values identified by the same sample of impulse response or time index n and the same subband index k are missing, have zero values, or have other defined values. In this connection, different sets of filter impulse responses are identified or indicated by their respective indices n, which can be any integer value, for example in the range 0 to (N−1). Often, N is also the number of filters.

換言すれば、上述のデータ・パターンは、フィルタ・インパルス応答値を指し、それは、例えば、フィルタ・コンプレッサ５０１の実施形態との関連で記載される意味において選択されない。そのため、データ・パターンは、全てがゼロに設定されているかまたは全てが欠けているそれぞれのフィルタ・インデックスνによって識別される異なるセットのフィルタ・インパルス応答のサンプルまたは時間インデックスｎとサブバンド・インデックスｋとに関するインデックス（ｎ，ｋ）に関して、認識できるかまたは定義することができる。 In other words, the above-described data pattern refers to the filter impulse response value, which is not selected in the meaning described in the context of the filter compressor 501 embodiment, for example. Thus, the data pattern is a different set of filter impulse response samples or time index n and subband index k, all identified by a respective filter index ν, all set to zero or all missing. And the index (n, k) with respect to and can be recognized or defined.

コンピュータ読み取り可能な記憶媒体は、例えばＨＲＴＦ関連フィルタを含んでもよい。さらに、コンピュータ読み取り可能な記憶媒体に記憶される複数のセットのサブバンド・フィルタ・インパルス応答は、空間オーディオ・システムのための１セットのフィルタ・インパルス応答であり得る。 The computer readable storage medium may include, for example, an HRTF related filter. Further, the multiple sets of subband filter impulse responses stored on the computer readable storage medium may be a set of filter impulse responses for the spatial audio system.

コンピュータ読み取り可能な記憶媒体が原則としていかなるコンピュータ読み取り可能な記憶媒体でもあり得る点に留意することが重要である。そのようなコンピュータ読み取り可能な記憶媒体のための例は、例えば、フロッピー（登録商標）・ディスク、ＣＤ、ＣＤ−ＲＯＭ、ＤＶＤのような持ち運び可能な記憶媒体または他のいかなる記憶媒体であり、それは、コンピュータ読み取り可能なやり方で情報を記憶することができる。さらに、ＲＡＭ（ランダム・アクセス・メモリ）、ＲＯＭ（読み出し専用メモリ）、ハード・ディスク・メモリ、ＮＶＭ（不揮発性メモリ）またはフラッシュ・メモリのような内蔵メモリが用いられ得る。換言すれば、本願の意味においてコンピュータ読み取り可能な記憶媒体は、持ち運び可能な記憶媒体だけでなく、内蔵記憶媒体でもある。さらに、コンピュータ読み取り可能な記憶媒体とは、データまたは情報が変えられまたは変更され得るような媒体も、それぞれのデータ情報が変更できないそれらのメモリも指す。 It is important to note that a computer readable storage medium can in principle be any computer readable storage medium. Examples for such computer readable storage media are portable storage media such as, for example, floppy disks, CDs, CD-ROMs, DVDs or any other storage media, Information can be stored in a computer readable manner. Furthermore, built-in memory such as RAM (Random Access Memory), ROM (Read Only Memory), Hard Disk Memory, NVM (Nonvolatile Memory) or Flash Memory can be used. In other words, the computer-readable storage medium in the meaning of the present application is not only a portable storage medium but also a built-in storage medium. In addition, computer readable storage media refers to media that can change or change data or information as well as those memories that cannot change their data information.

そのため、本発明の実施形態によれば、コンピュータ読み取り可能な記憶媒体は、それに記憶されるとともに時間ドメイン頭部伝達関数を近似する、複数のサブバンド・フィルタ・インパルス応答を有してもよく、複数のサブバンド・フィルタ・インパルス・フィルタ応答は、時間ドメイン頭部伝達関数と比較して、より短いインパルス応答を有する。 Thus, according to embodiments of the present invention, a computer readable storage medium may have a plurality of subband filter impulse responses stored in it and approximating a time domain head-related transfer function, The multiple subband filter impulse filter responses have a shorter impulse response compared to the time domain head-related transfer function.

本発明の方法の実施形態の特定の実現要求によっては、本発明の方法の実施形態は、ハードウェアにおいてまたはソフトウェアにおいて実施することができる。この実施は、本発明の方法の実施形態が実行されるように、プロセッサと協動する、それに記憶される電子的に読み取ることができる制御信号を有する、デジタル記憶媒体、コンピュータ読み取り可能な記憶媒体、例えば、ディスク、ＣＤまたはＤＶＤを用いて実行することができる。そのため、本発明の実施形態は、一般に、機械で読み取り可能なキャリアに記憶されたプログラムコードを有するコンピュータプログラム製品であり、プログラムコードは、コンピュータプログラムがプロセッサ上で実行される場合に本発明の方法の実施形態を実行するように作動する。したがって、換言すれば、本発明の方法の実施形態は、コンピュータプログラムがコンピュータ上で実行される場合に、本発明の方法のうちの少なくとも１つの実施形態を実行するためのプログラムコードを有するコンピュータプログラムである。これに関連して、プロセッサは、プログラム可能なコンピュータシステム、プログラム可能なコンピュータ、中央演算処理装置（ＣＰＵ）、特定用途向け集積回路（ＡＳＩＣ）、プロセッサまたは他の集積回路（ＩＣ）によって形成することができる。 Depending on certain implementation requirements of the inventive method embodiments, the inventive method embodiments can be implemented in hardware or in software. This implementation is a digital storage medium, a computer readable storage medium having electronically readable control signals stored thereon that cooperate with the processor so that the method embodiments of the present invention are implemented. For example, using a disc, CD or DVD. As such, embodiments of the present invention are generally computer program products having program code stored on a machine-readable carrier, where the program code is the method of the present invention when the computer program is executed on a processor. It operates to carry out the embodiment. In other words, therefore, an embodiment of the method of the present invention is a computer program having program code for performing at least one embodiment of the method of the present invention when the computer program is executed on a computer. It is. In this regard, the processor may be formed by a programmable computer system, programmable computer, central processing unit (CPU), application specific integrated circuit (ASIC), processor or other integrated circuit (IC). Can do.

上述には本発明の特定の実施形態に関して特に示され記載されたが、形式や詳細においてさまざまな他の変更が本発明の精神および範囲から逸脱することなくできるということは、当業者にとって理解されよう。さまざまな変更が本願明細書において開示されさらに特許請求の範囲によって理解される上位概念から逸脱することなく異なる実施形態に適応する際にできることを理解すべきである。 Although the foregoing has been particularly shown and described with reference to specific embodiments of the present invention, it will be understood by those skilled in the art that various other changes in form and detail may be made without departing from the spirit and scope of the invention. Like. It should be understood that various changes can be made in adapting to different embodiments without departing from the generic concepts disclosed herein and further understood by the claims.

Claims

A filter compressor (102, 501) for generating a compressed subband filter impulse response from an input subband filter impulse response corresponding to a subband, including a filter impulse response value at a filter tap, ,
In order to find a filter impulse response value having a higher value and at least one filter impulse response value having a value lower than the higher value, the filter impulse response from at least two input subband filter impulse responses A processor (820) for examining a value, and a filter impulse response constructor (305) for constructing the compressed subband filter impulse response using the filter impulse response value having the higher value , The compressed subband filter impulse response is
Does not include a filter impulse response value corresponding to a filter tap of the at least one filter impulse response value having the lower value, or a filter of the at least one filter impulse response value having the lower value. A filter compressor that contains zero-value filter impulse response values corresponding to taps.

The processor (820) for examining the filter impulse response value is configured to process a complex valued filter impulse response value, and the filter impulse response constructor (305) includes a complex valued impulse response. The filter compressor (102, 501) of claim 1 configured to process values.

The processor (820) is configured to look up the filter impulse response value based on an absolute value such that the higher value is a higher absolute value and the lower value is a lower absolute value. A filter compressor (102, 501) according to any of the preceding claims.

The processor (820) is configured to determine the filter impulse response based on an absolute value, a real part, an absolute value of the real part, an imaginary part, an absolute value or a phase of the imaginary part of the filter impulse response value. The filter compressor (102, 501) of claim 2, configured to examine a value.

The filter compressor (102, 501) of any preceding claim, wherein the processor (820) is configured to calculate an evaluation representation based on the filter impulse response value of the input filter response. .

The filter compressor (102, 501) of claim 5, wherein the processor (820) is configured to calculate the evaluation representation based on a psychoacoustic model or a model based on characteristics of the human ear.

The filter compressor (102, 10) according to claim 5 or 6, wherein the processor (820) is configured to calculate an absolute value based on the filter impulse response value to obtain the evaluation representation. 501).

The filter compressor according to any of claims 5 to 7, wherein the processor (820) is configured to calculate a logarithmic value based on the filter impulse response value to obtain the evaluation representation. (102, 501).

The processor (820) weights the evaluation representation based on a psychoacoustic model based on a center frequency of the subband corresponding to the input subband filter impulse response to obtain a whitened evaluation representation. The filter compressor (102, 501) according to any one of claims 5 to 10, which is configured as follows.

The processor (820) is configured to weight the evaluation representation based on at least one subgroup of subbands, each subband belonging to at most one subgroup of subbands. A filter compressor (102, 501) according to any one of claims 5 to 11.

The processor (820) is configured to weight the evaluation representation based on at least one subgroup of subbands, each subgroup of subbands including at least one subband, and further each subband. 13. The filter compressor (102, 501) according to claim 12, wherein the band belongs to exactly one subgroup of subbands.

The processor (820) is configured to weight the evaluation representation based on at least one subgroup of subbands, each subgroup including at least two subbands having a respective center frequency of the subband. 14. The filter compressor (102, 501) according to claim 12 or 13, comprising only subbands having adjacent center frequencies with respect to a set of center frequencies of all subbands ordered according to.

The processor (820) is whitened with each subband or subgroup of subbands corresponding to one impulse response value and including a predetermined, compatible, programmable or constant value. 15. Filter filter according to any of claims 5 to 14, configured to weight the evaluation expression to obtain a whitened evaluation expression so as to include at least one value of the evaluation expression. Compressor (102, 501).

16. The filter compressor (102, 501) according to any of claims 5 to 15, wherein the processor (820) is configured to weight the evaluation representation individually for each subband.

The processor (820) determines a weighting for each subgroup of subbands or for each subband, and for each of the subbands of subbands or each of the subbands. 17. The evaluation representation of claim 5 to 16, wherein the evaluation representation is configured to weight the sub-band including subtracting the determined maximum value from a respective value of the evaluation representation for each of the subbands. Filter compressor (102,501) in any one.

The processor (820) determines the weighting for each subgroup of subbands or for each subband and determines the maximum value for each subgroup of subbands. Dividing the respective value of the respective subgroup of subbands or of the evaluation representation of the respective subband by the maximum value or derived value or the determined maximum value for the respective subband. The filter compressor (102, 501) according to any of claims 5 to 16, wherein the filter compressor (102, 501) is configured to weight the evaluation expression to include:

The processor (820) may determine at least one filter impulse response value or subband of each subband having a higher value regardless of the absolute value of the impulse response value of the subband or subband subgroup. A filter compressor (102, 501) according to any of the preceding claims, configured to find at least one filter impulse response value of the group.

21. A filter compressor (102, 501) according to any of claims 5 to 20, wherein the processor (820) is configured to find the filter impulse response value based on the evaluation representation.

The processor (820) has a value lower than the higher value for at least one subband of less than 50% of the total number of respective filter impulse response values or for at least one subgroup of subbands. The filter compressor (102, 501) according to any of the preceding claims, wherein the filter compressor (102, 501) is configured to examine the filter impulse response value.

The processor (820) is configured to find a filter impulse response value such that the number of impulse response values having a value lower than the higher value for at least one subband is less than K _Q , and The number of filter impulse response values of the filter in the time domain corresponding to the band filter impulse response is at least K _Q · L, L is the number of subbands, and K _Q and L are positive Filter compressor (102, 501) according to any of the preceding claims, which is an integer.

The processor (820) is configured to examine the filter impulse response values such that the number of filter impulse response values for the at least one subband is less than or equal to (K _Q -3). The filter compressor (102, 501) according to 23.

Based on the evaluation expression or the whitened evaluation expression, the processor (820) is configured such that a predetermined, adjustable, constant or programmable number of impulse response values have the higher value. 25. A filter compressor (102, 501) according to any of claims 5 to 24, configured to examine the filter impulse response value.

The processor (820) is configured such that the predetermined, adjustable, constant or programmable number is greater than or equal to the number of subbands, or greater than or equal to the number of subgroups of subbands. The filter compressor (102, 501) according to claim 25.

The processor (820) may reduce the filter impulse response value, the corresponding value of the evaluation expression, or the corresponding value of the whitened evaluation expression below a threshold value if the corresponding value of the whitened evaluation expression is below a threshold value. The filter compressor (102, 501) according to any of the preceding claims, configured to find a response value.

The processor (820) includes at least one filter lower than the higher value when the value of the filter impulse response value is close to an aliasing level of a filter bank corresponding to the input subband filter impulse response. A filter compressor (102, 501) according to any preceding claim, configured to find an impulse response value.

The processor (820) is configured to provide a mask M (n, k) indicating the impulse response value having the higher value, where n is an integer indicating a sample or time index, and k is The filter compressor (102, 501) of any preceding claim, wherein the filter compressor is an integer indicating an index of the subband of the at least two input subband filter impulse responses.

The processor has the mask M (n, k) having a binary value indicating whether the impulse response value indicated by the integers n, k has a higher value or a lower value than the higher value. 30. The filter compressor (102, 501) of claim 29, wherein the filter compressor (102, 501) is configured to provide:

The processor (820) according to any preceding claim, wherein the processor (820) is configured to examine the filter impulse response value comprises selecting the filter impulse response having the higher value. Filter compressor (102, 501).

The filter impulse response constructor (305) further provides a value lower than the higher value by providing a respective subband filter impulse response value or a value based thereon as the compressed subband filter impulse response value. Setting said impulse response value to zero,
Ignoring the impulse response value having a value lower than the higher value, and, if the filter impulse response value is a complex value, a complex value filter impulse response value having a value lower than the higher value By providing at least one real-valued value based on
The filter compressor (102, 501) of any preceding claim, configured to configure the compressed subband filter impulse response with the filter impulse response value having the higher value. ).

The filter impulse response constructor (305) uses the filter impulse response value having a higher value by at least one of obtaining and copying a respective impulse response value to use the compressed subband filter. A filter compressor (102, 501) according to any of the preceding claims, configured to constitute an impulse response.

The filter impulse response constructor (305) includes a compression subband, such that at least one adjusted filter impulse response value includes a larger absolute value compared to the absolute value of the corresponding filter impulse response value. A filter compressor (102, 501) according to any preceding claim, configured to adjust at least one filter impulse response value of the filter impulse response.

The filter impulse response constructor (305) adjusts the impulse response value having a higher value by multiplying each impulse response value by a gain factor specific to the subband according to each subband. A filter compressor (102, 501) according to any of the preceding claims.

The filter impulse constructor (305) adjusts the impulse response value having a higher value by multiplying each impulse response value by a gain factor specific to the subgroup according to each subgroup of the subband. A filter compressor (102, 501) according to any of the preceding claims, configured to:

The filter impulse constructor (305) may provide a complex-valued filter impulse as a corresponding compressed subband filter impulse response value when the complex-valued input filter response value corresponds to a center frequency above the boundary frequency. The filter compressor (102, 501) of any preceding claim, configured to configure the compressed subband filter impulse response value by providing a real value based on a response value. ).

The filter impulse constructor (305) is configured so that the real value based on the complex value filter impulse response value is a real part, an imaginary part, an absolute value, a phase, a linear combination based thereon, a polynomial based thereon, and a 41. The filter compressor (102, 501) of claim 40, configured to be at least one of a real-valued expression based thereon.

41. The filter impulse constructor (305) is configured to provide the real-valued value by replacing the complex-valued input filter response value with the real-valued value. The filter / compressor (102, 501) according to any one of 42.

43. The filter compressor (102, 501) according to any of claims 40 to 42, wherein the filter impulse constructor (305) is configured such that the boundary frequency is in the range of 1 kHz and 10 kHz. .

The filter impulse constructor (305) is based on the complex-valued input filter response value as a corresponding compressed impulse response value of the compressed filter impulse response when the filter impulse response value has a higher value. 44. A filter compressor (102, 501) according to any of claims 40 to 43, configured to provide said real value.

The filter compressor (501) is configured to generate a plurality of sets of compression filter impulse responses based on the plurality of sets of input filter impulse responses, and each of the compression filters of the set of compression filters and impulse responses. The impulse response corresponds exactly to one center frequency of the plurality of center frequencies, and one compression filter impulse of each set of the plurality of sets of compression filter impulse responses to each center frequency of the plurality of center frequencies. Accurately responding, each input filter impulse response of the set of input filter impulse responses accurately corresponds to one center frequency of the plurality of center frequencies, and each center of the plurality of center frequencies. Multiple sets of input filter impulses in frequency It corresponds exactly to the one input filter impulse response of each set of answers, the filter compressor according to any one of the preceding claims (502).

The processor (820) is configured to look up the filter impulse response value from at least two input subband filter impulse responses of at least one set of input filter impulse responses of the plurality of sets; 46. The filter compressor of claim 45, wherein 820) is further configured to find a filter impulse response value having the higher value from at least two sets of input filter impulse responses corresponding to the same center frequency. 501).

47. The filter compressor of claim 46, wherein the processor (820) is configured to find a filter impulse response value having the higher value from a full set of input filter impulse responses corresponding to the same center frequency. (501).

The processor (820) finds a filter impulse response value having the higher value of at least two sets of input filter impulse responses of a plurality of sets of input filter impulse responses corresponding to the same sample or time index n. 48. A filter compressor (501) according to claim 46 or claim 47, wherein n is an integer.

It said processor (820) is configured to calculate the evaluation expression for the input filter impulse response of the respective set Aνn (n, k) or whitened evaluation expression A _W [nu (n, k), ν is an integer indicating the input filter impulse response of the set, n is an integer indicating a sample or time index, k is an integer indicating the index of the subband, and the processor (820) Calculates said evaluation expression A (n, k) based on at least two evaluation expressions Aν (n, k) or based on at least two whitened evaluation expressions A _W ν (n, k) 49. The filter compressor (501) according to any one of claims 45 to 48, further configured to:

The filter impulse response constructor (305) is configured such that the filter impulse response value of the compression filter impulse response corresponding to the same center frequency and the same sample or time index is set to zero, and the plurality of sets of compression filters The compression filter of the impulse response, the plurality of sets of compressions so that if not included in the impulse response or if the corresponding filter impulse response value is a complex value, it is replaced with the respective real value value 51. A filter compressor (501) according to any of claims 45 to 50, configured to constitute the compressed subband filter impulse response of a filter impulse response.

A method for creating a compressed subband filter impulse response from an input subband filter impulse response corresponding to a subband, including a filter impulse response value at a filter tap, comprising:
In order to find a filter impulse response value having a higher value and at least one filter impulse response value having a value lower than the higher value, the filter impulse response from at least two input subband filter impulse responses Examining a value; and configuring the compressed subband filter impulse response with the filter impulse response value having the higher value;
The compressed subband filter impulse response is
Does not include a filter impulse response value corresponding to a filter tap of the at least one filter impulse response value having the lower value, or a filter of the at least one filter impulse response value having the lower value. A method comprising a zero value filter impulse response value corresponding to a tap.

53. A computer program for performing the method of claim 52 when executed on a processor.

A computer readable storage medium having multiple sets of subband filter impulse responses stored therein, each set of subband filter impulse responses approximated to a time domain head related transfer function related filter And each filter impulse response of the time domain head related transfer function related filter is greater than the sum of the lengths of the subband filter impulse responses of each set of subband filter impulse responses, or Each filter impulse response of the time domain head related transfer function filter is the subband filter impulse response of each set of subband filter impulse responses when the filter impulse response values are complex values. Complex-valued Greater than the sum of the length of the filter impulse response values, a computer readable storage medium.

55. The computer readable storage medium of claim 54, wherein the plurality of sets of subband filter impulse responses includes a common data pattern.

The common data pattern includes at least two sets of subband filter impulse response subband filter impulse response zero or missing filter impulse response values, or the filter impulse response values are complex numbers. 56. The computer readable storage medium of claim 55, wherein if it is a value, the value is associated with a real value.

57. The computer readable storage medium of any of claims 54 to 56, wherein at least one set of subband filter impulse responses approximates a spatial audio filter.