JP6345633B2

JP6345633B2 - Sound field reproducing apparatus and method

Info

Publication number: JP6345633B2
Application number: JP2015151874A
Authority: JP
Inventors: 圭吾若山; 高田　英明; 英明高田
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2015-07-31
Filing date: 2015-07-31
Publication date: 2018-06-20
Anticipated expiration: 2035-07-31
Also published as: JP2017034441A

Description

本発明は音場再生技術に関し、特に空間マルチゾーン音場再生技術に関する。 The present invention relates to a sound field reproduction technique, and more particularly to a spatial multi-zone sound field reproduction technique.

空間マルチゾーン音場再生とは、スピーカアレイを駆動することで、分離された空間に独立した複数の音場を再生する技術である（例えば、非特許文献１参照）。従来技術では、複数の領域で音場を再現するため、再現領域を取り囲むように無指向性スピーカを等間隔に円状に配置した円状スピーカアレイを用いている。 Spatial multi-zone sound field reproduction is a technique for reproducing a plurality of independent sound fields in a separated space by driving a speaker array (see, for example, Non-Patent Document 1). In the prior art, in order to reproduce a sound field in a plurality of areas, a circular speaker array in which omnidirectional speakers are arranged in a circle at equal intervals so as to surround the reproduction area is used.

Y. J. Wu, et al. “Spatial Multizone Soundfield Reproduction：Theory and Design,” Speech and Audio Processing, IEEE Transactions on 19.6 (2011): 1711-1720.Y. J. Wu, et al. “Spatial Multizone Soundfield Reproduction: Theory and Design,” Speech and Audio Processing, IEEE Transactions on 19.6 (2011): 1711-1720.

従来の円状スピーカアレイによって半径Ｒの円周内で波数ｋまで正確に音場を再現するには、２ｋＲ＋１個以上の膨大な数のスピーカが必要となる。本発明の課題は、従来よりも少ない個数の音源で空間マルチゾーン音場再生を実現することである。 In order to accurately reproduce the sound field up to the wave number k within the circumference of the radius R by the conventional circular speaker array, a huge number of speakers of 2kR + 1 or more are required. An object of the present invention is to realize spatial multi-zone sound field reproduction with a smaller number of sound sources than in the prior art.

指向性パターンが異なる互いに独立な複数個の放射モードの何れかまたは前記複数個の放射モードの少なくとも一部の組み合わせからなる、再生パターンで音響信号を再生する所定個の高次音源を用いる。当該再生パターンは、音響信号の反響を利用して複数個の空間領域に所望の音場を生成するために放射モードのそれぞれの重みが制御されたものである。 A predetermined number of higher-order sound sources that reproduce an acoustic signal with a reproduction pattern, which is composed of any one of a plurality of independent radiation modes having different directivity patterns or a combination of at least a part of the plurality of radiation modes, are used. In the reproduction pattern, each weight of the radiation mode is controlled in order to generate a desired sound field in a plurality of spatial regions using the echo of the acoustic signal.

本発明では、放射モードのそれぞれの重みを制御しつつ反響を利用することで、従来よりも少ない個数の音源で空間マルチゾーン音場再生を実現できる。 In the present invention, spatial multi-zone sound field reproduction can be realized with a smaller number of sound sources than in the prior art by using echoes while controlling the respective weights of the radiation modes.

図１は実施形態の音場再生装置の構成を説明するための概念図である。FIG. 1 is a conceptual diagram for explaining the configuration of the sound field reproducing apparatus of the embodiment. 図２は実施形態の制御部を例示したブロック図である。FIG. 2 is a block diagram illustrating the control unit of the embodiment. 図３は定理１に関する幾何学図である。FIG. 3 is a geometric diagram relating to Theorem 1. 図４は実施形態の係数設定装置の構成を説明するための概念図である。FIG. 4 is a conceptual diagram for explaining the configuration of the coefficient setting device of the embodiment. 図５は実施形態の設定部を例示したブロック図である。FIG. 5 is a block diagram illustrating the setting unit of the embodiment.

以下、本発明の実施形態を説明する。
〔概要〕
反響環境下（残響環境下）に所定個の高次音源（higher order source）を配置する。各高次音源は、指向性パターンが異なる互いに独立な複数個の放射モードの何れかまたは複数個の放射モードの少なくとも一部の組み合わせ（線形和）からなる「再生パターン」で音響信号を再生する音源である。高次音源の例は高次スピーカ（higher order loudspeaker）である（例えば、参考文献１「M. Poletti, T. Betlehem, “Design of a prototype variable directivity loudspeaker for improved surround sound reproduction in rooms,” AES 52nd International Conference, Guildford, UK, 2013, September 2-4」等参照）。ここで、音響信号の反響を利用して複数個の空間領域に所望の音場を生成するために放射モードのそれぞれの重みが制御された「再生パターン」を用いることで、従来よりも少ない個数の音源で空間マルチゾーン音場再生を実現できる。すなわち、非特許文献１の式（２０）および（２１）等にあるように、非特許文献１では、各スピーカのフィルタ係数を制御して空間マルチゾーン音場再生を実現する。これに対し、本形態では高次音源の放射モードのそれぞれを制御する。放射モードを制御することにより、反響による鏡像の制御の自由度を向上でき、少ない個数の音源で空間マルチゾーン音場再生を実現できる。 Embodiments of the present invention will be described below.
〔Overview〕
A predetermined number of higher order sources are arranged in a reverberant environment (in a reverberant environment). Each higher-order sound source reproduces an acoustic signal with a “reproduction pattern” composed of any one of a plurality of independent radiation modes having different directivity patterns or a combination (linear sum) of at least a part of a plurality of radiation modes. It is a sound source. An example of a higher order sound source is a higher order loudspeaker (for example, reference 1 “M. Poletti, T. Betlehem,“ Design of a prototype variable directivity loudspeaker for improved surround sound reproduction in rooms, ”AES 52nd. International Conference, Guildford, UK, 2013, September 2-4 "). Here, by using the “reproduction pattern” in which the weight of each radiation mode is controlled in order to generate the desired sound field in a plurality of spatial regions using the echo of the acoustic signal, a smaller number than in the past It is possible to realize spatial multi-zone sound field playback with any sound source. That is, as shown in Equations (20) and (21) of Non-Patent Document 1, in Non-Patent Document 1, spatial multi-zone sound field reproduction is realized by controlling the filter coefficient of each speaker. On the other hand, in this embodiment, each of the radiation modes of the higher-order sound source is controlled. By controlling the radiation mode, it is possible to improve the degree of freedom of mirror image control due to reverberation and to realize spatial multi-zone sound field reproduction with a small number of sound sources.

「再生パターン」は、例えば、高次音源ｈｓ（ｕ）の波数ｋでの放射モードｍ（ν）に対するフィルタ重みを

としたものである。ただし、ｈｓ（１），…，ｈｓ（Ｌ）がＬ個の高次音源を表し、Ｌが正整数（例えば、Ｌは２以上の整数）であり、ｍ（１），…，ｍ（２Ｎ_Ｖ＋１）が２Ｎ_Ｖ＋１個の放射モードを表し、Ｎ_Ｖが正整数であり（Ｎ_Ｖは１以上の整数）、ｕ＝１，…，Ｌであり、ν＝１，…，２Ｎ_Ｖ＋１である。Ｈ_νｕ（ｒ，θ；ｋ）が高次音源ｈｓ（ｕ）から大域的な極座標（ｒ，θ）までの放射モードｍ（ν）の波数ｋでの音響伝達関数であり、ｒが動径であり、θが偏角である。Ｍ_０（ｋ）が正整数であり、ｍ＝−Ｍ_０（ｋ），…，Ｍ_０（ｋ）であり、α_ｍνｕ（ｋ）が音響伝達関数Ｈ_νｕ（ｒ，θ；ｋ）のｍについての展開係数であり、Ａ（ｋ）がα_ｍνｕ（ｋ）を要素とする（２Ｍ_０（ｋ）＋１）×Ｌ’の行列であり、Ｌ’＝Ｌ（２Ｎ_Ｖ＋１）であり、

である。

であり、［・］^Ｔは［・］の転置を表し、β^ｄ _ｍ（ｋ）は所望の音場が生成された複数個の空間領域を含む領域の極座標（ｒ，θ）での波数ｋの周波数領域信号Ｓ^ｄ（ｒ，θ；ｋ）のｍについての展開係数であり、ｇ（ｋ）がＡ（ｋ）ｇ（ｋ）＝β^ｄ（ｋ）の解または近似解である。 The “reproduction pattern” is, for example, the filter weight for the radiation mode m (ν) at the wave number k of the higher-order sound source hs (u).

It is what. Here, hs (1),..., Hs (L) represent L high-order sound sources, L is a positive integer (for example, L is an integer of 2 or more), and m (1),. _V +1) represents 2N _V +1 radiation modes, N _V is a positive integer (N _V is an integer of 1 or more), u = 1,..., L, ν = 1,..., 2N _V +1 It is. H _νu (r, θ; k) is an acoustic transfer function at the wave number k of the radiation mode m (ν) from the higher-order sound source hs (u) to the global polar coordinate (r, θ), and r is the radius And θ is a declination. M ₀ (k) is a positive integer, m = −M ₀ (k),..., M ₀ (k), and α _mνu (k) is m of the acoustic transfer function H _νu (r, θ; k). A (k) is a (2M ₀ (k) +1) × L ′ matrix with α _mνu (k) as elements, and L ′ = L (2N _V +1).

It is.

, And the represents a transpose of [·] ^T is ^{_{[·], β d m (}} k) is the wave number k in the polar region including a plurality of spatial regions where desired sound field is generated (r, theta) Of the frequency domain signal S ^d (r, θ; k) for m, and g (k) is a solution or an approximate solution of A (k) g (k) = β ^d (k).

α_ｍνｕ（ｋ）は音響伝達関数Ｈ_νｕ（ｒ，θ；ｋ）から得ることができ、Ｈ_νｕ（ｒ，θ；ｋ）は高次音源からの出力とマイクロホンでの集音結果から計算できる。しかしながら、環境によってはＨ_νｕ（ｒ，θ；ｋ）の誤差が大きくなってしまう場合がある。この場合、

によってＨ_νｕ（ａ，θ_ｑ；ｋ）をフィルタリングして

を得てもよい。これにより音響伝達関数の推定誤差の影響を低減できる。ただし、Ｈ_νｕ（ａ，θ_ｑ；ｋ）は大域的な極座標（ａ，θ_ｑ）に配置されたマイクロホンＭＩＣ（ｑ）（ただし、ｑ＝１，…，Ｑ）での観測音響信号に基づいて得られる、高次音源ｈｓ（ｕ）から極座標（ａ，θ_ｑ）までの放射モードｍ（ν）の波数ｋでの音響伝達関数である。ａは動径であり、θ_ｑは偏角であり、

であり、ｉが虚数単位であり、Ｈ’_ｍ（・）が次数ｍの第一種のハンケル関数の導関数であり、（・）^*が（・）の複素共役を表し、λ_ｅｑが等価正則化パラメータであり、｜（・）｜が（・）の絶対値を表す。 α _mνu (k) can be obtained from the acoustic transfer function H _νu (r, θ; k), and H _νu (r, θ; k) can be calculated from the output from the higher-order sound source and the sound collection result of the microphone. . However, depending on the environment, the error of H _νu (r, θ; k) may increase. in this case,

_Filter H _νu (a, θ _q ; k) by

You may get Thereby, the influence of the estimation error of the acoustic transfer function can be reduced. However, H _νu (a, θ _q ; k) is based on the observed acoustic signal at the microphone MIC (q) (where q = 1,..., Q) arranged at the global polar coordinates (a, θ _q ). Is an acoustic transfer function at a wave number k of the radiation mode m (ν) from the higher-order sound source hs (u) to the polar coordinates (a, θ _q ). a is a moving radius, θ _q is a declination,

, I is an imaginary unit, H ′ _m (•) is a derivative of the first kind Hankel function of degree m, (•) ^* represents the complex conjugate of (•), and λ _eq is equivalent It is a regularization parameter, and | (•) | represents the absolute value of (•).

〔第１実施形態〕
図面を参照して第１実施形態を説明する。
＜構成＞
図１に例示するように、本形態の音場再生装置１は、制御部１１と、壁面１３で囲まれた残響室（反響環境下）に配置されたＬ個の高次音源（高次スピーカ）１２−ｕ（ただし、ｕ＝１，…，Ｌ）とを有する。図２に例示するように、制御部１１は記憶部１１１と重み取得部１１２とフィルタリング部１１３−ｕと駆動信号生成部１１４−ｕとを有する。制御部１１は、例えば、ＣＰＵ（central processing unit）等のプロセッサ（ハードウェア・プロセッサ）およびＲＡＭ（random-access memory）・ＲＯＭ（read-only memory）等のメモリ等を備える汎用または専用のコンピュータが所定のプログラムを実行することで構成される。このコンピュータは１個のプロセッサやメモリを備えていてもよいし、複数個のプロセッサやメモリを備えていてもよい。このプログラムはコンピュータにインストールされてもよいし、予めＲＯＭ等に記録されていてもよい。また、ＣＰＵのようにプログラムが読み込まれることで機能構成を実現する電子回路（circuitry）ではなく、プログラムを用いることなく処理機能を実現する電子回路を用いて一部またはすべての処理部が構成されてもよい。また、１個の装置を構成する電子回路が複数のＣＰＵを含んでいてもよい。なお、図１および２はＬ＝５の例であるが、これは本発明を限定しない。 [First Embodiment]
A first embodiment will be described with reference to the drawings.
<Configuration>
As illustrated in FIG. 1, the sound field reproducing apparatus 1 of this embodiment includes a control unit 11 and L high-order sound sources (high-order speakers) arranged in a reverberation room (in a reverberation environment) surrounded by a wall surface 13. ) 12-u (where u = 1,..., L). As illustrated in FIG. 2, the control unit 11 includes a storage unit 111, a weight acquisition unit 112, a filtering unit 113-u, and a drive signal generation unit 114-u. The control unit 11 is, for example, a general-purpose or dedicated computer that includes a processor (hardware processor) such as a CPU (central processing unit) and a memory such as random-access memory (RAM) and read-only memory (ROM). It is configured by executing a predetermined program. The computer may include a single processor and memory, or may include a plurality of processors and memory. This program may be installed in a computer, or may be recorded in a ROM or the like in advance. In addition, some or all of the processing units are configured using an electronic circuit that realizes a processing function without using a program, instead of an electronic circuit (circuitry) that realizes a functional configuration by reading a program like a CPU. May be. In addition, an electronic circuit constituting one device may include a plurality of CPUs. 1 and 2 are examples of L = 5, this does not limit the present invention.

≪幾何学配置≫
２次元の大域的（グローバル）な空間領域、および、互いにオーバーラップしないＳ個の２次元の局所的（ローカル）な所望の空間領域（円形領域）１４−１，…，１４−Ｓならびにそれらに対応する所望の音場を想定する。ただし、Ｓは２以上の整数である。ｓ番目（ただし、ｓ＝１，…，Ｓ）の空間領域１４−ｓの半径と原点をそれぞれＲ_ｚ ^（ｓ）とＯ_ｓで表現する。ここでＯ_ｓは大域的な原点Ｏに対する極座標（ｒ^（ｓ０），θ^（ｓ０））に位置する。ｒ^（ｓ０）およびθ^（ｓ０）はそれぞれ動径および偏角を表す。ｓ番目の空間領域１４−ｓの中にある任意の観測点の局所的な極座標を（Ｒ^（ｓ），Ω^（ｓ））と表現する。（Ｒ^（ｓ），Ω^（ｓ））は局所的な原点Ｏ_ｓを中心とした極座標であり、Ｒ^（ｓ）およびΩ^（ｓ）はそれぞれ動径および偏角を表す。局所的な極座標（Ｒ^（ｓ），Ω^（ｓ））は、大域的な原点Ｏに対する極座標（ｒ，θ）に位置する。ｒおよびθはそれぞれ動径および偏角を表す。すべての空間領域１４−１，…，１４−Ｓは、大域的な原点Ｏを中心とした半径Ｒ_Ｐ≧ｒの円形領域内に存在する。Ｒ_Ｐはすべての空間領域１４−１，…，１４−Ｓを内側に含む円形領域の半径である。Ｌ個の高次音源１２−ｕは、大域的な原点Ｏから半径Ｒ_Ｓ≧Ｒ_Ｐの円周上に配置される。高次音源１２−１，…，１２−Ｌは等間隔で配置されてもよいし、等間隔で配置されなくてもよい。Ｌは正整数であり、例えばＬは２以上の整数である。 ≪Geometric arrangement≫
Two-dimensional global (global) spatial regions, and S two-dimensional local (circular) desired spatial regions (circular regions) 14-1,. Assume a corresponding desired sound field. However, S is an integer of 2 or more. The radius and origin of the s-th (where s = 1,..., S) space region 14-s are expressed by R _z ^(s) and O _s , respectively. Here, O _s is located at polar coordinates (r ^(s0) , θ ^(s0) ) with respect to the global origin O. r ^(s0) and θ ^(s0) represent a radius vector and a declination angle, respectively. A local polar coordinate of an arbitrary observation point in the s-th spatial region 14-s is expressed as (R ^(s) , Ω ^(s) ). (R ^(s) , Ω ^(s) ) are polar coordinates centered on the local origin O _s , and R ^(s) and Ω ^(s) represent the radial and declination, respectively. Local polar coordinates (R ^(s) , Ω ^(s) ) are located at polar coordinates (r, θ) with respect to the global origin O. r and θ represent a radius vector and a declination angle, respectively. All the spatial regions 14-1,..., 14-S are present in a circular region having a radius R _P ≧ r with the global origin O as the center. R _P is all the spatial regions 14-1, ..., the radius of the circular area including the 14-S inwards. L pieces of high-order tone 12-u are arranged on the circumference of a radius R _S ≧ R _P from global origin O. The higher order sound sources 12-1, ..., 12-L may be arranged at equal intervals or may not be arranged at equal intervals. L is a positive integer. For example, L is an integer of 2 or more.

≪所望の音場≫
２次元の空間領域１４−ｓでの所望の音場を、ｘ_{ｓｏｕｒｃｅ（ｓ）}＝（ｒ_ｓｃｏｓθ_ｓ，ｒ_ｓｓｉｎθ_ｓ）^Ｔに位置する２次元グリーン関数（２次元平面では点音源、３次元空間では線音源）に起因する音場と定義する。この音源を仮想音源と呼ぶ。ただし、局所的な原点Ｏ_ｓに対する任意の極座標を（ｒ_ｓ，θ_ｓ）と表記する。ｒ_ｓおよびθ_ｓはそれぞれ動径および偏角を表す。また（・）^Ｔは（・）の転置を表す。すなわち、本形態の目的は高次音源１２−１，…，１２−Ｌを用い、ｘ_{ｓｏｕｒｃｅ（ｓ）}に位置する仮想音源に起因する音場を空間領域１４−ｓでの「所望の音場」として再現することである。空間領域１４−ｓでの所望の音場を以下の時間周波数領域信号Ｓ^ｄ（ｓ）（ｘ_ｄ（ｓ）；ｋ）で表記する。

ただし、ｋは波数であり、ｉは虚数単位であり、||（・）||は（・）のノルムを表す。Ｈ_ｎ（・）は次数ｎの第一種のハンケル関数である。ｘ_ｄ（ｓ）は空間領域１４−ｓでの任意の観測点である。ｄ（ｓ）は空間領域１４−ｓの所望の音場を表すインデックスである。ｘ_ｄ（ｓ）を局所的な原点Ｏ_ｓに対する極座標で表すと（Ｒ^（ｓ），Ω^（ｓ））となり、大域的な原点Ｏに対する極座標で表すと（ｒ，θ）となる。Ｓ^ｄ（ｓ）（ｘ_ｄ（ｓ）；ｋ）に係数Ｓ（ｋ）を乗じることで任意の音響信号による音場を表現できる。 ≪ Desired sound field≫
A desired sound field in the two-dimensional space region 14-s is expressed as a two-dimensional Green function (point source 3 in the two-dimensional plane, x _{source (s)} = (r _s cos θ _s , r _s sin θ _s ) ^T It is defined as a sound field caused by a linear sound source in a dimensional space. This sound source is called a virtual sound source. However, an arbitrary polar coordinate with respect to the local origin O _s is expressed as (r _s , θ _s ). r _s and θ _s represent a radius vector and a declination angle, respectively. (•) ^T represents transposition of (•). That is, the purpose of this embodiment is to use higher-order sound sources 12-1,..., 12-L, and the sound field caused by the virtual sound source located at x _{source (s)} To reproduce. A desired sound field in the spatial domain 14-s is represented by the following time frequency domain signal S ^{d (s)} (x _{d (s)} ; k).

However, k is a wave number, i is an imaginary unit, and || (•) || represents a norm of (•). H _n (·) is a first type Hankel function of order n. _{xd (s)} is an arbitrary observation point in the spatial region 14-s. d (s) is an index representing a desired sound field in the spatial region 14-s. When x _{d (s)} is expressed in polar coordinates with respect to the local origin O _s , it becomes (R ^(s) , Ω ^(s) ), and when expressed in polar coordinates with respect to the global origin O, it becomes (r, θ). By multiplying S ^{d (s)} (x _{d (s)} ; k) by a coefficient S (k), a sound field by an arbitrary acoustic signal can be expressed.

≪所望の音場の円筒調和関数展開≫
ｘ_ｄ（ｓ）を局所的な原点Ｏ_ｓに対する極座標（Ｒ^（ｓ），Ω^（ｓ））で表す。この場合、空間領域１４−ｓの所望の音場を表す時間周波数領域信号Ｓ^ｄ（ｓ）（Ｒ^（ｓ），Ω^（ｓ）；ｋ）は、円筒調和関数展開によって以下のように表現される。

ただし、Ｊ_ｍ（・）は次数ｍのベッセル関数であり、α_ｍ ^ｄ（ｓ）（ｋ）はＳ^ｄ（ｓ）（Ｒ^（ｓ），Ω^（ｓ）；ｋ）を一意に表現するためのｍについての展開係数（音場係数）であり、ｅはネイピア数（自然対数の底）である。式（２）はフーリエ級数展開となっており、これによって任意の数の円筒波や平面波に由来する任意の２次元音場を表現できる。なお、「α_ｍ ^ｄ（ｓ）（ｋ）」の「ｄ（ｓ）」は「ｍ」の真上に表記すべきであるが、記載表記の制約上、「α_ｍ ^ｄ（ｓ）（ｋ）」と表記する。その他の記号でも同様な表記を行う場合がある。 ≪Cylinder harmonic function expansion of desired sound field≫
x _{d (s)} is represented by polar coordinates (R ^(s) , Ω ^(s) ) with respect to the local origin O _s . In this case, the time-frequency domain signal S ^{d (s)} (R ^(s) , Ω ^(s) ; k) representing the desired sound field in the spatial domain 14-s is expressed as follows by cylindrical harmonic function expansion. The

However, J _m (•) is a Bessel function of degree m, and α _m ^{d (s)} (k) uniquely represents S ^{d (s)} (R ^(s) , Ω ^(s) ; k). Expansion coefficient (sound field coefficient) for m, and e is the Napier number (the base of natural logarithm). Expression (2) is a Fourier series expansion, which can represent an arbitrary two-dimensional sound field derived from an arbitrary number of cylindrical waves and plane waves. In addition, “ ^{d (s)} ” of “α _m ^{d (s)} (k)” should be written directly above “m”, but “α _m ^{d (s)} (k” is restricted due to the limitation of the description. ) ”. The same notation may be used for other symbols.

式（２）は無限個の直交モードを持っている。しかしながら、ベッセル関数の性質とすべての音源が外側にある空間領域では音場は限定されるという事実により、この級数展開を有限個の直交モードによる展開で打ち切ることができる。この場合、Ｓ^ｄ（ｓ）（Ｒ^（ｓ），Ω^（ｓ）；ｋ）は以下のように近似できる。

ただし、Ｍ_ｓ（ｋ）は正の整数である。この場合、Ｓ^ｄ（ｓ）（Ｒ^（ｓ），Ω^（ｓ）；ｋ）は、少なくとも２Ｍ_ｓ（ｋ）＋１個のモードで表現される。Ｍ_ｓ（ｋ）＝ｃｅｉｌ（ｋｅＲ_ｚ ^（ｓ）／２）の場合、打ち切り誤差は１６．１％以下である。ただし、ｃｅｉｌ（・）は（・）の天井関数である。 Equation (2) has an infinite number of orthogonal modes. However, due to the nature of the Bessel function and the fact that the sound field is limited in the spatial region where all sound sources are outside, this series expansion can be discontinued by a finite number of orthogonal modes. In this case, S ^{d (s)} (R ^(s) , Ω ^(s) ; k) can be approximated as follows.

However, M _s (k) is a positive integer. In this case, S ^{d (s)} (R ^(s) , Ω ^(s) ; k) is expressed by at least 2M _s (k) +1 modes. When M _s (k) = ceil (keR _z ^(s) / 2), the truncation error is 16.1% or less. However, ceil (•) is a ceiling function of (•).

≪等価なグローバル音場≫
上述したＳ個の空間領域１４−１，・・・，Ｓ（マルチゾーン）の音場から構成される等価なグローバル音場を定義する。すなわち、複数の空間領域１４−１，・・・，Ｓの音場の再現の問題を全域にわたるグローバルな所望の音場の再現に帰着させる。所望のグローバル音場を表す時間周波数領域信号Ｓ^ｄ（ｒ，θ；ｋ）は、円筒調和関数展開によって以下のように近似される。

ただし、β_ｍ ^ｄ（ｋ）はＳ^ｄ（ｒ，θ；ｋ）を一意に表現するためのｍについての展開係数（音場係数）である。この場合、Ｓ^ｄ（ｒ，θ；ｋ）は、少なくとも２Ｍ_０（ｋ）＋１個のモードで表現される。例えば、Ｍ_０（ｋ）＝ｃｅｉｌ（ｋｅＲ_Ｐ／２）である。すべてのマルチゾーンが半径Ｒ_Ｐの円形領域内に収まる場合、Ｍ_０（ｋ）によるモード制限ですべてのマルチゾーンで音場を再現するためには、
Ｍ_０（ｋ）≧（Ｍ_１（ｋ）＋Ｍ_２（ｋ）＋・・・＋Ｍ_Ｓ（ｋ））（５）
を満たす必要がある。 ≪Equivalent global sound field≫
An equivalent global sound field composed of the sound fields of the S spatial regions 14-1,..., S (multi-zone) described above is defined. That is, the sound field reproduction problem of the plurality of spatial regions 14-1,..., S is reduced to reproduction of a global desired sound field over the entire area. A time-frequency domain signal S ^d (r, θ; k) representing a desired global sound field is approximated by cylindrical harmonic function expansion as follows.

Here, β _m ^d (k) is an expansion coefficient (sound field coefficient) for m for uniquely expressing S ^d (r, θ; k). In this case, S ^d (r, θ; k) is expressed by at least 2M ₀ (k) +1 modes. For example, M ₀ (k) = ceil (keR _P / 2). If all multizone falls within a circular region of radius R _P, in order to reproduce the sound field in all multi-zone mode limitation by M 0 _(k) is
M ₀ (k) ≧ (M ₁ (k) + M ₂ (k) +... + M _S (k)) (5)
It is necessary to satisfy.

《空間調和係数変換》
音源の存在しない領域の中の音場を考える。図３のようにＯ_１，Ｏ_２は、２つの座標系の原点であり、それらは同じ方向の軸を持っており、Ｏ_１を既知の変換で移動させたものがＯ_２であるとする。Ｏ_１を原点とする座標系での極座標（ｒ^（１），θ^（１））を、Ｏ_２を原点とする座標系での極座標で表すと（ｒ^（２），θ^（２））となる。ここで、Ｏ_１に対するＯ_２の極座標（ｒ^（１２），θ^（１２））と表す。また｛α_ｍ ^（１）（ｋ）｝，｛α_ｍ ^（２）（ｋ）｝が、それぞれ原点をＯ_１，Ｏ_２とした２つの座標系における、音源の存在しない領域中の音場を一意に表現するためのｍについての展開係数の集合であるとする。この場合、以下の定理１，２が成り立つ。《Spatial harmonic coefficient conversion》
Consider a sound field in an area where no sound source exists. O _1, O ₂ as shown in FIG. 3, is the origin of the two coordinate systems, they have the axis of the same direction, which moves the O ₁ a known conversion is assumed to be O ₂ . When the polar coordinates (r ⁽¹⁾ , θ ⁽¹⁾ ) in the coordinate system with O ₁ as the origin are represented by the polar coordinates in the coordinate system with O ₂ as the origin, (r ⁽²⁾ , θ ⁽²⁾ ) and Become. Here, the polar coordinates of O ₂ with respect to O ₁ (r ⁽¹²⁾ , θ ⁽¹²⁾ ) are represented. Also, {α _m ⁽¹⁾ (k)} and {α _m ⁽²⁾ (k)} denote the sound field in the region where no sound source exists in the two coordinate systems with the origins as O ₁ and O ₂ , respectively. It is assumed that this is a set of expansion coefficients for m for uniquely expressing. In this case, the following theorems 1 and 2 hold.

［定理１］
α_ｍ ^（１）（ｋ）とα_ｍ ^（２）（ｋ）は以下によって関係付けられる。

ただし、

であり、「＊」はモード次数ｍでの離散畳み込みを表している。なお、Ｔ_ｍ ^（２１）は原点Ｏ_２からＯ_１への変換作用素を表しており、Ｔ_ｍ ^（１２）は原点Ｏ_１からＯ_２への変換作用素を表している。 [Theorem 1]
α _m ⁽¹⁾ (k) and α _m ⁽²⁾ (k) are related by:

However,

Where “*” represents discrete convolution at mode order m. T _m ⁽²¹⁾ represents a conversion operator from the origin O ₂ to O ₁ , and T _m ⁽¹²⁾ represents a conversion operator from the origin O ₁ to O ₂ .

［定理２］
Ｏ_２に対するＯ_１の極座標を（ｒ^（２１），θ^（２１））と表記し、

および空間調和係数変換（Spatial Harmonic Coefficient Translation）の定理を適用すると、以下の関係を導出できる。

ここで、Ｔ_ｍ ^（０ｓ）はＯを原点とした大域的な座標系から、ｓ番目の空間領域１４−ｓのＯ_ｓを原点とした局所的な座標系への変換作用素である。 [Theorem 2]
The polar coordinates of O ₁ with respect to O ₂ are expressed as (r ⁽²¹⁾ , θ ⁽²¹⁾ ),

And applying the theorem of Spatial Harmonic Coefficient Translation, the following relation can be derived.

Here, T _m ^(0s) is a conversion operator from a global coordinate system having O as the origin to a local coordinate system having O _s in the sth spatial region 14- _s as the origin.

《グローバル音場係数の探索》
α_ｍ ^ｄ（ｓ）（ｋ）からβ_ｍ ^ｄ（ｋ）への係数変換を行う。式（９）の畳み込みを線形和で書き下すと以下のようになる。

この式（１０）をｓ＝１，…，ＳについてＳ個書き下し、これら同時方程式を構成し、行列形式で表現すると以下のようになる。
α^ｄ（ｋ）＝Ｔ（ｋ）β^ｄ（ｋ）（１１）
ただし、以下を満たす。
《Search for global sound field coefficient》
Coefficient conversion from α _m ^{d (s)} (k) to β _m ^d (k) is performed. The convolution of equation (9) is written as a linear sum as follows.

When this equation (10) is written down for S = 1,..., S, these simultaneous equations are constructed and expressed in matrix form as follows.
α ^d (k) = T (k) β ^d (k) (11)
However, the following is satisfied.

式（１１）は以下のように変形できる。
β^ｄ（ｋ）＝Ｔ（ｋ）^＋α^ｄ（ｋ）（１４）
ただし、Ｔ（ｋ）^＋＝［Ｔ（ｋ）^ＨＴ（ｋ）］^−１Ｔ（ｋ）^ＨはＴ（ｋ）のムーア−ペローンズの擬似逆行列であり、（・）^Ｈは（・）の複素共役転置を表す。最小二乗法などを用い、式（１４）のβ^ｄ（ｋ）の解または近似解を求めることで、α_ｍ ^ｄ（ｓ）（ｋ）からβ_ｍ ^ｄ（ｋ）への係数変換を行うことができる。 Equation (11) can be modified as follows.
β ^d (k) = T (k) ⁺ α ^d (k) (14)
Where T (k) ⁺ = [T (k) ^H T (k)] ⁻¹ T (k) ^H is a Moore-Peron's pseudo-inverse matrix of T (k), and (·) ^H is (·) Represents the complex conjugate transpose of. Perform coefficient conversion from α _m ^{d (s)} (k) to β _m ^d (k) by obtaining a solution of β ^d (k) or an approximate solution of equation (14) using a least square method or the like. Can do.

《高次音源による残響室でのマルチゾーン音場再生》
残響室（反響環境下）に配置されたＬ個の高次音源１２−１，…，１２−Ｌによって空間マルチゾーンでの音場再生を行う。各高次音源１２−ｓは、互いに独立した複数の指向性パターンを生成できる。２次元平面で考えると、次数Ｎ_Ｖの高次音源１２−ｓは２Ｎ_Ｖ＋１個の独立した放射モードを生成できる。それらは｛ｅ^ｉｎθ：ｎ＝−Ｎ_Ｖ，…，Ｎ_Ｖ｝のの極応答を持つ位相モード、｛１，ｃｏｓ(ｎθ)■，ｓｉｎ(ｎθ)■：ｎ＝１，…，Ｎ_Ｖ｝の極応答を持つ振幅モード、あるいは、これらの極応答の線形和で表される。これらＬ個の高次音源１２−１，…，１２−Ｌによって生成される、入力信号Ｓ（ｋ）＝１における再現音場は、以下の時間周波数領域信号Ｓ（ｒ，θ；ｋ）で表記できる。

ここで、Ｇ_νｕ（ｋ）は高次音源１２−ｓの波数ｋの放射モード（radiation mode）ｍ（ν）に適用されるフィルタ重みを表す。ただし、ν＝１，…，２Ｎ_Ｖ＋１は各放射モードに対応するインデックスである。Ｈ_νｕ（ｒ，θ；ｋ）は、高次音源１２−ｓから極座標（ｒ，θ）までの放射モードｍ（ν）での波数ｋの音響伝達関数である。《Multi-zone sound field reproduction in reverberation room with high-order sound source》
Sound field reproduction in a spatial multi-zone is performed by L high-order sound sources 12-1, ..., 12-L arranged in a reverberation room (under reverberation environment). Each higher-order sound source 12-s can generate a plurality of directivity patterns independent of each other. Considering a two-dimensional plane, a higher-order sound source 12-s of order N _V can generate 2N _V +1 independent radiation modes. They ^{_{{e inθ: n = -N V}} , ..., N V} -phase mode with a polar response of the, {1, cos (nθ) ■, sin (nθ) ■: n = 1, ..., N V} It is represented by an amplitude mode having a polar response of or a linear sum of these polar responses. The reproduced sound field in the input signal S (k) = 1 generated by these L high-order sound sources 12-1,..., 12-L is the following time frequency domain signal S (r, θ; k). Can be written.

Here, G _νu (k) represents the filter weight applied to the radiation mode m (ν) of the wave number k of the higher-order sound source 12-s. However, ν = 1,..., 2N _V +1 is an index corresponding to each radiation mode. H _νu (r, θ; k) is an acoustic transfer function of wave number k in the radiation mode m (ν) from the higher-order sound source 12-s to the polar coordinates (r, θ).

《モードマッチング》
本形態では、所望の音場を円筒調和関数に分解し、所望の音場のモードに一致させるように高次音源１２−１，…，１２−Ｌのアクティブモード（ｍ＝−Ｍ_０（ｋ），…，Ｍ_０（ｋ）の範囲でのモード）を制御するモードマッチングを行う。具体的には、ｍ＝−Ｍ_０（ｋ），…，Ｍ_０（ｋ）の範囲で所望の音場のモードに一致させるようにＧ_νｕ（ｋ）（ただし、ｕ＝１，…，Ｌ）を調整する。モードマッチングのアプローチの利点は、高次音源１２−１，…，１２−Ｌの空間ナイキスト周波数までの制御領域で正確な音場再生が保証されるという点である。音場のアクティブモードを正確に再現するためには、自由度の数、すなわち、高次音源１２−１，…，１２−Ｌで提供される指向性パターンの数Ｌ’＝Ｌ（２Ｎ_Ｖ＋１）がアクティブモードの数（２Ｍ_０（ｋ）＋１）と等しいかそれ以上でないとならない。すなわち、Ｌ（２Ｎ_Ｖ＋１）≧２Ｍ_０（ｋ）＋１であることが望ましい。《Mode matching》
In this embodiment, the desired sound field is decomposed into a cylindrical harmonic function, and the active modes (m = −M ₀ (k ),..., Mode matching for controlling the mode in the range of M ₀ (k). Specifically, G _ν u (k) (where u = 1,..., L so as to match the desired sound field mode within a range of m = −M ₀ (k),..., M ₀ (k). ). The advantage of the mode matching approach is that accurate sound field reproduction is guaranteed in the control region up to the spatial Nyquist frequency of the higher-order sound sources 12-1, ..., 12-L. In order to accurately reproduce the active mode of the sound field, the number of degrees of freedom, that is, the number of directivity patterns provided by the higher-order sound sources 12-1,..., 12-L L ′ = L (2N _V +1 ) Must be equal to or greater than the number of active modes (2M ₀ (k) +1). That is, it is desirable that L (2N _V +1) ≧ 2M ₀ (k) +1.

各高次音源１２−ｓから極座標（ｒ，θ）までの放射モードｍ（ν）での波数ｋの音響伝達関数は以下のように表現できる。

ここで、α_ｍνｕ（ｋ）は高次音源１２−ｓの放射モードｍ（ν）での音響伝達関数Ｈ_νｕ（ｒ，θ；ｋ）のｍについての展開係数である。 The acoustic transfer function of wave number k in the radiation mode m (ν) from each higher-order sound source 12-s to polar coordinates (r, θ) can be expressed as follows.

Here, α _mνu (k) is an expansion coefficient for m of the acoustic transfer function H _νu (r, θ; k) in the radiation mode m (ν) of the higher-order sound source 12-s.

式（４）（１５）（１６）より、以下の関係が導出できる。

ただし、ｍ＝−Ｍ_０（ｋ），…，Ｍ_０（ｋ）である。この方程式の集合を行列で表現すると以下のようになる。
Ａ（ｋ）ｇ（ｋ）＝β^ｄ（ｋ）（１８）
ただし、Ａ（ｋ）はα_ｍνｕ（ｋ）を要素とする（２Ｍ_０（ｋ）＋１）×Ｌ’の行列であり、

である。［（・）］_ω，ιは行列（・）のω行ι列要素を表す。式（１６）に示すように、実測によって音響伝達関数Ｈ_νｕ（ｒ，θ；ｋ）が得ることでＡ（ｋ）を定めることができる。

である。［・］^Ｔは［・］の転置を表し、β^ｄ _ｍ（ｋ）は所望の音場が生成された複数個の空間領域を含む領域の極座標（ｒ，θ）での波数ｋの周波数領域信号Ｓ^ｄ（ｒ，θ；ｋ）のｍについての展開係数である。式（３）（１３）（１４）に示すように、高次音源１２−１，…，１２−Ｌおよび空間領域１４−１，…，１４−Ｓの所望の音場を定めればβ^ｄ（ｋ）を定めることができる。ｇ（ｋ）はＧ_νｕ（ｋ）を要素とするベクトルであり、

である。［（・）］_ωはベクトル（・）のω個目の要素を表す。 From the equations (4), (15) and (16), the following relationship can be derived.

However, m = −M ₀ (k),..., M ₀ (k). This set of equations can be expressed as a matrix as follows.
A (k) g (k) = β ^d (k) (18)
However, A (k) is a matrix of (2M ₀ (k) +1) × L ′ having α _mνu (k) as an element,

It is. [(•)] _{ω, ι} represents the ω row ι column element of the matrix (•). As shown in Equation (16), A (k) can be determined by obtaining the acoustic transfer function H _νu (r, θ; k) by actual measurement.

It is. [·] ^T represents transposition of [·], β ^d _m (k) is a frequency region of wave number k in polar coordinates (r, θ) of a region including a plurality of spatial regions in which a desired sound field is generated This is the expansion coefficient for m of the signal S ^d (r, θ; k). As shown in equation (3) (13) (14), higher sound 12-1, ..., 12-L and the spatial regions 14-1, ..., be determined desired sound field 14-S beta ^d (K) can be defined. g (k) is a vector having G _νu (k) as an element,

It is. [(•)] _ω represents the ω-th element of the vector (•).

以上より、式（１８）を解き、ｇ（ｋ）を求めることでモードマッチングを実現できる。本形態では、式（１８）のｇ（ｋ）の解または近似解を求める。Ａ（ｋ）ｇ（ｋ）とβ^ｄ（ｋ）との二乗誤差Ｊ（ｋ）は、以下のように表記できる。

ただし、Ｗ（ｋ）は［ｗ_ｍ（ｋ）］_{Ｍ０（ｋ）＋ｍ＋１，Ｍ０（ｋ）＋ｍ＋１}＝ｗ_ｍ（ｋ）（ただし、ｍ＝−Ｍ_０（ｋ），…，Ｍ_０（ｋ））で定義される（２Ｍ_０（ｋ）＋１）行（２Ｍ_０（ｋ）＋１）列の対角重み行列である。ただし、下付き添え字の「Ｍ０（ｋ）」は「Ｍ_０（ｋ）」を表す。また、

である。ｗ_ｍ（ｋ）は制御領域に対するモードｍの相対的なエネルギーの寄与の大きさを表している。二乗誤差Ｊ（ｋ）の最小値（最小二乗誤差）に対応するｇ（ｋ）は以下のようになる。

ただし、ηはティコノフの正則化パラメータであり、Ｉは（２Ｍ_０（ｋ）＋１）×（２Ｍ_０（ｋ）＋１）の単位行列である。式（２０）の結果を式（１８）のｇ（ｋ）の解または近似解とする。 From the above, mode matching can be realized by solving equation (18) and obtaining g (k). In this embodiment, a solution or approximate solution of g (k) in Expression (18) is obtained. The square error J (k) between A (k) g (k) and β ^d (k) can be expressed as follows.

Where W (k) is [w _m (k)] _{M0 (k) + m + 1, M0 (k) + m + 1} = w _m (k) (where m = −M ₀ (k),..., M ₀ (k) ) Defined by (2M ₀ (k) +1) rows (2M ₀ (k) +1) columns. However, the subscript “M0 (k)” represents “M ₀ (k)”. Also,

It is. w _m (k) represents the magnitude of the relative energy contribution of mode m to the control region. G (k) corresponding to the minimum value of the square error J (k) (least square error) is as follows.

Here, η is a Tikhonov regularization parameter, and I is a unit matrix of (2M ₀ (k) +1) × (2M ₀ (k) +1). The result of equation (20) is taken as the solution or approximate solution of g (k) in equation (18).

＜動作＞
以上のように得られるｇ（ｋ）は制御部１１（図２）の記憶部１１１に格納される。また入力信号Ｓ（ｋ）も記憶部１１１に格納される。重み取得部１１２は、記憶部１１１から読み込んだｇ（ｋ）からＧ_νｕ（ｋ）を取得し、Ｇ_νｕ（ｋ）をフィルタリング部１１３−ｕに送る（ただし、ｕ＝１，…，Ｌ）。フィルタリング部１１３−ｕは記憶部１１１から入力信号Ｓ（ｋ）を読み込み、高次音源１２−ｕの波数ｋでの放射モードｍ（ν）に対するフィルタ重みをＧ_νｕ（ｋ）とするフィルタを入力信号Ｓ（ｋ）に適用し、出力信号Ｓ_νｕ（ｋ）を得る。フィルタリング部１１３−ｕの処理は時間領域で行われてもよいし、時間周波数領域で行われてもよい。各高次音源１２−ｕは、例えば、円筒形のバッフルの外周に等間隔で環状に配置されたＤ_ｕ個のスピーカｓｐ_ｕ（χ）（ただしχ∈［０，Ｄ_ｕ−１］）による環状アレイによって構成できる（例えば、参考文献１）。この場合、高次音源１２−ｕから放射モードｍ（ν）で放射を行う各スピーカｓｐ_ｕ（χ）の音響信号の重みは
Ｗ_ｕ，χ＝（１／Ｄ_ｕ）ｅ^{−ｉνφ（χ）}
となる。ただし、χ∈［０，Ｄ_ｕ−１］であり、φ（χ）は高次音源１２−ｕをなする環状アレイの中心に対する二次元平面上での角度φ（χ）＝２χπ／Ｄ_ｕである。この例の場合、出力信号Ｓ_νｕ（ｋ）は以下のようになる。
Ｓ_νｕ（ｋ）＝Ｇ_νｕ（ｋ）Ｗ_ｕ，χＳ（ｋ） <Operation>
G (k) obtained as described above is stored in the storage unit 111 of the control unit 11 (FIG. 2). The input signal S (k) is also stored in the storage unit 111. The weight acquisition unit 112 acquires G _νu (k) from g (k) read from the storage unit 111, and sends G _νu (k) to the filtering unit 113-u (where u = 1,..., L). . The filtering unit 113-u reads the input signal S (k) from the storage unit 111, and inputs a filter whose filter weight is G _νu (k) for the radiation mode m (ν) at the wave number k of the higher-order sound source 12-u. Apply to the signal S (k) to obtain the output signal S _νu (k). The processing of the filtering unit 113-u may be performed in the time domain or may be performed in the time frequency domain. Each higher-order sound source 12-u is formed by, for example, D _u speakers sp _u (χ) (provided that χ∈ [0, D _u −1]) arranged annularly at equal intervals on the outer periphery of a cylindrical baffle. It can be constituted by an annular array (for example, Reference 1). In this case, the weight of the acoustic signal of each speaker sp _u (χ) that radiates from the higher-order sound source 12-u in the radiation mode m (ν) is W _{u, χ} = (1 / D _u ) e ^{−iνφ (χ)}
It becomes. However, χ∈ [0, D _u −1], and φ (χ) is an angle φ (χ) = 2χπ / D _u on the two-dimensional plane with respect to the center of the annular array forming the higher-order sound source 12- _u. It is. In this example, the output signal S _νu (k) is as follows.
S _νu (k) = G _νu (k) W _{u, χ} S (k)

出力信号Ｓ_νｕ（ｋ）またはその組み合わせ（線形和）は駆動信号生成部１１４−１に送られ、駆動信号生成部１１４−１は出力信号Ｓ_νｕ（ｋ）またはその組み合わせに対応する駆動信号Ｓ’_νｕ（ｋ）またはその組み合わせを生成して高次音源１２−ｕに出力する。高次音源１２−ｕは駆動信号Ｓ’_νｕ（ｋ）またはその組み合わせに応じた再生パターンの音響信号を放射する。これにより、所望の空間領域１４−１，…，１４−Ｓに所望の音場が生成され、空間マルチゾーン音場再生が実現される。本形態ではＧ_νｕ（ｋ）によって高次音源１２−ｕ（ただし、ｕ＝１，…，Ｌ）の放射モードｍ（ν）（ただし、ν＝１，…，２Ｎ_Ｖ＋１）のそれぞれを制御する。これにより、反響による鏡像の制御の自由度を向上でき、残響室で複数の領域に正確な音場を再現するために必要な次数Ｎ_Ｖの高次音源１２−ｕ（次数Ｎ_Ｖ）の個数Ｌを、非特許文献１に比べて最大Ｎ_Ｖ＋１分の１にまで削減できる。 The output signal S _νu (k) or a combination (linear sum) thereof is sent to the drive signal generation unit 114-1, and the drive signal generation unit 114-1 outputs the drive signal S corresponding to the output signal S _νu (k) or a combination thereof. ' _νu (k) or a combination thereof is generated and output to the higher-order sound source 12-u. The higher order sound source 12-u radiates an acoustic signal having a reproduction pattern corresponding to the drive signal S ′ _νu (k) or a combination thereof. Thereby, a desired sound field is generated in the desired spatial region 14-1,..., 14-S, and spatial multi-zone sound field reproduction is realized. In this embodiment, each of the radiation modes m (ν) (where ν = 1,..., 2N _V +1) of the higher-order sound source 12-u (where u = 1,..., L) is controlled by G _νu (k). To do. This can improve the flexibility of the control of the mirror image due to reverberation, the number of high-order tone 12-u of order N _V required to reproduce an accurate sound field into a plurality of regions in the reverberation room (order N _V) L can be reduced to a maximum of N _V + 1 / compared with Non-Patent Document 1.

〔第２実施形態〕
第１実施形態で説明したように、式（１８）のｇ（ｋ）の解または近似解を得るためには、Ａ（ｋ）を特定する必要があり、Ａ（ｋ）を正確に特定するためには、音響伝達関数Ｈ_νｕ（ｒ，θ；ｋ）を正確に測定する必要がある。音響伝達関数はＨ_νｕ（ｒ，θ；ｋ）は高次音源１２−ｕから放射された音響信号をマイクロホンアレイで観測することで計測可能である。しかしながら、音響伝達関数Ｈ_νｕ（ｒ，θ；ｋ）を精度良く計測することは容易ではない。マイクロホンのミスマッチ、センサーノイズ、空間エイリアジング、不完全なモード等価の各々が音響伝達関数の係数の誤差に寄与する。本形態では音響伝達関数の測定誤差の影響を低減させ、Ａ（ｋ）の精度を向上させる方法を説明する。 [Second Embodiment]
As described in the first embodiment, in order to obtain a solution or approximate solution of g (k) in Expression (18), A (k) needs to be specified, and A (k) is specified accurately. For this purpose, it is necessary to accurately measure the acoustic transfer function H _νu (r, θ; k). The acoustic transfer function H _νu (r, θ; k) can be measured by observing the acoustic signal radiated from the higher-order sound source 12-u with a microphone array. However, it is not easy to accurately measure the acoustic transfer function H _νu (r, θ; k). Microphone mismatch, sensor noise, spatial aliasing, and imperfect mode equivalence each contribute to the error in the coefficients of the acoustic transfer function. In this embodiment, a method for reducing the influence of measurement error of the acoustic transfer function and improving the accuracy of A (k) will be described.

＜構成＞
図１に例示するように、本形態の係数設定装置２は、設定部２１と、壁面１３で囲まれた残響室（反響環境下）に配置されたＬ個の高次音源１２−ｕ（ただし、ｕ＝１，…，Ｌ）とマイクロホン２４−ｑ（ただし、ｑ＝１，…，Ｑ）を有する。Ｑは正整数であり、例えばＱ≧２である。図２に例示するように、設定部２１は制御部２１１と音響伝達関数推定部２１２と係数算出部２１３とを有する。設定部２１は、例えば、前述した汎用または専用のコンピュータが所定のプログラムを実行することで構成される。各マイクロホン２４−ｑは、大域的な原点Ｏを中心とした半径ａ≦Ｒ_Ｐの円筒形の底面を持つバッフルに埋め込まれる。バッフルは硬い素材からできていてもよいし、やわらかい素材からできていてもよい。マイクロホン２４−ｑは原点Ｏに対する極座標（ａ，θ_ｑ）に配置される。なお、図４および５はＬ＝５，Ｑ＝６の例であるが、これは本発明を限定しない。 <Configuration>
As illustrated in FIG. 1, the coefficient setting device 2 according to the present embodiment includes a setting unit 21 and L high-order sound sources 12-u (provided that the reverberation room is surrounded by a wall surface 13). , U = 1,..., L) and a microphone 24-q (where q = 1,..., Q). Q is a positive integer, for example, Q ≧ 2. As illustrated in FIG. 2, the setting unit 21 includes a control unit 211, an acoustic transfer function estimation unit 212, and a coefficient calculation unit 213. The setting unit 21 is configured by, for example, the above-described general-purpose or dedicated computer executing a predetermined program. Each microphone 24-q is embedded in the baffle with the cylindrical bottom surface of radius a ≦ R _P around the global origin O. The baffle may be made of a hard material or a soft material. The microphone 24-q is arranged at polar coordinates (a, θ _q ) with respect to the origin O. 4 and 5 are examples of L = 5 and Q = 6, this does not limit the present invention.

マイクロホン２４−ｑ（ただし、ｑ＝１，…，Ｑ）での観測音響信号に基づいて得られる、高次音源１２−ｕから極座標（ａ，θ_ｑ）までの放射モードｍ（ν）の波数ｋでの音響伝達関数Ｈ_νｕ（ａ，θ_ｑ；ｋ）は以下のようになる。

ただし、Ｈ’_ｍ（・）は次数ｍの第一種のハンケル関数の導関数（微分）である。ロンスキアン・リレーション（Wronskian relation）に基づき、式（２１）は式（２２）のように変形できる。

ここで

は前述の円筒部分でのモードの強さを表す。 The wave number of the radiation mode m (ν) from the higher-order sound source 12-u to the polar coordinates (a, θ _q ) obtained based on the observed acoustic signal with the microphone 24-q (where q = 1,..., Q). The acoustic transfer function H _νu (a, θ _q ; k) at k is as follows.

Here, H ′ _m (•) is a derivative (derivative) of the first kind Hankel function of order m. Based on the Wronskian relation, equation (21) can be transformed into equation (22).

here

Represents the strength of the mode in the cylindrical portion described above.

式（２２）の両辺にｅ^−ｉｍθを掛け、積分すると以下の関係が導出される。

さらに式（２３）は以下のように変形できる。

ここでＢ_ｍ ^−１（ｋａ）はモード等価フィルタである。この表現では、十分な数Ｑのマイクロホン２４−ｑによってアレイが構成されていることを前提としてエイリアジング誤差を無視している。マイクロホン２４−１，…，２４−Ｑからなる円状マイクロホンアレイにより測定できるＨ_νｕ（ａ，θ_ｑ；ｋ）の最大モード次数Ｍ_０（ｋ）はＱにより上から押さえられＭ_０（ｋ）≦（Ｑ−１）／２となる。

ただし、λ_ｅｑは等価正則化パラメータであり、｜（・）｜は（・）の絶対値を表す。この正則化はモード等価フィルタの最大ゲインを−（６＋１０ｌｏｇ_１０λ_ｅｑ）ｄＢに制限している。これにより、マイクロホンノイズの極端な増幅を防いでいる。 ^Multiplying both sides of equation (22) by e ^−imθ and integrating, the following relationship is derived.

Furthermore, equation (23) can be modified as follows.

Here, B _m ⁻¹ (ka) is a mode equivalent filter. In this expression, aliasing errors are ignored on the assumption that an array is constituted by a sufficient number of microphones 24-q. The maximum mode order M ₀ (k) of H _νu (a, θ _q ; k) that can be measured by a circular microphone array composed of microphones 24-1,..., 24-Q is suppressed from above by Q and M ₀ (k) ≦ (Q−1) / 2.

However, λ _eq is an equivalent regularization parameter, and | (•) | represents the absolute value of (•). This regularization limits the maximum gain of the mode equivalent filter to − (6 + ₁₀ log ₁₀ λ _eq ) dB. As a result, extreme amplification of microphone noise is prevented.

＜動作＞
まず、音響伝達関数Ｈ_νｕ（ａ，θ_ｑ；ｋ）の測定を行う。Ｈ_νｕ（ａ，θ_ｑ；ｋ）を測定する場合、設定部２１（図５）の制御部２１１は、駆動信号生成部１１４−ｕおよび音響伝達関数推定部２１２にモードｍ（ν）で波数ｋのテスト信号ｔ_νｕ（ｋ）を送る。駆動信号生成部１１４−ｕは、テスト信号ｔ_νｕ（ｋ）に対応する駆動信号ｔ’_νｕ（ｋ）を生成して高次音源１２−ｕに送る。高次音源１２−ｕは駆動信号ｔ’_νｕ（ｋ）に応じた音響信号を放射する。マイクロホン２４−ｑで観測された観測信号は音響伝達関数推定部２１２に送られる。音響伝達関数推定部２１２は、テスト信号ｔ_νｕ（ｋ）および観測信号を用いて音響伝達関数Ｈ_νｕ（ａ，θ_ｑ；ｋ）を計算して出力する。係数算出部２１３はＨ_νｕ（ａ，θ_ｑ；ｋ）を入力とし、式（２４）に従ってα_ｍνｕ（ｋ）を生成して出力する。出力されたα_ｍνｕ（ｋ）は第１実施形態で説明したＡ（ｋ）の要素として利用される。 <Operation>
First, the acoustic transfer function H _νu (a, θ _q ; k) is measured. When _measuring H _νu (a, θ _q ; k), the control unit 211 of the setting unit 21 (FIG. 5) sets the wave number in the mode m (ν) to the drive signal generation unit 114-u and the acoustic transfer function estimation unit 212. Send k test signals t _νu (k). The drive signal generation unit 114-u generates a drive signal t ′ _νu (k) corresponding to the test signal t _νu (k) and sends it to the higher-order sound source 12-u. The higher order sound source 12-u emits an acoustic signal corresponding to the drive signal t ′ _νu (k). The observation signal observed by the microphone 24-q is sent to the acoustic transfer function estimation unit 212. The acoustic transfer function estimation unit 212 calculates and outputs an acoustic transfer function H _νu (a, θ _q ; k) using the test signal t _νu (k) and the observation signal. The coefficient calculation unit 213 receives H _νu (a, θ _q ; k) as input, generates α _mνu (k) according to the equation (24), and outputs it. The output α _mνu (k) is used as an element of A (k) described in the first embodiment.

本形態では、式（２５）のモード等価フィルタＢ_ｍ ^−１（ｋａ）を用いることでマイクロホンノイズの極端な増幅を防ぎ、音響伝達関数Ｈ_νｕ（ａ，θ_ｑ；ｋ）の測定誤差の影響を低減させ、Ａ（ｋ）の精度を向上させることができる。 In the present embodiment, extreme amplification of microphone noise is prevented by using the mode equivalent filter B _m ⁻¹ (ka) of Expression (25), and the influence of measurement error of the acoustic transfer function H _νu (a, θ _q ; k). And the accuracy of A (k) can be improved.

〔変形例等〕
なお、本発明は上述の実施の形態に限定されるものではない。例えば、上述の制御部や設定部の処理は、記載に従って時系列に実行されるのみならず、処理を実行する装置の処理能力あるいは必要に応じて並列的にあるいは個別に実行されてもよい。その他、本発明の趣旨を逸脱しない範囲で適宜変更が可能であることはいうまでもない。 [Modifications, etc.]
The present invention is not limited to the embodiment described above. For example, the processes of the control unit and the setting unit described above are not only executed in time series according to the description, but may also be executed in parallel or individually as required by the processing capability of the apparatus that executes the process. Needless to say, other modifications are possible without departing from the spirit of the present invention.

制御部や設定部の構成をコンピュータによって実現する場合、これらが有すべき機能の処理内容はプログラムによって記述される。このプログラムをコンピュータで実行することにより、それらの処理機能がコンピュータ上で実現される。この処理内容を記述したプログラムは、コンピュータで読み取り可能な記録媒体に記録しておくことができる。コンピュータで読み取り可能な記録媒体の例は、非一時的な（non-transitory）記録媒体である。このような記録媒体の例は、磁気記録装置、光ディスク、光磁気記録媒体、半導体メモリ等である。 When the configuration of the control unit and the setting unit is realized by a computer, the processing contents of the functions that these should have are described by a program. By executing this program on a computer, these processing functions are realized on the computer. The program describing the processing contents can be recorded on a computer-readable recording medium. An example of a computer-readable recording medium is a non-transitory recording medium. Examples of such a recording medium are a magnetic recording device, an optical disk, a magneto-optical recording medium, a semiconductor memory, and the like.

このプログラムの流通は、例えば、そのプログラムを記録したＤＶＤ、ＣＤ−ＲＯＭ等の可搬型記録媒体を販売、譲渡、貸与等することによって行う。さらに、このプログラムをサーバコンピュータの記憶装置に格納しておき、ネットワークを介して、サーバコンピュータから他のコンピュータにそのプログラムを転送することにより、このプログラムを流通させる構成としてもよい。 This program is distributed, for example, by selling, transferring, or lending a portable recording medium such as a DVD or CD-ROM in which the program is recorded. Furthermore, the program may be distributed by storing the program in a storage device of the server computer and transferring the program from the server computer to another computer via a network.

このようなプログラムを実行するコンピュータは、例えば、まず、可搬型記録媒体に記録されたプログラムもしくはサーバコンピュータから転送されたプログラムを、一旦、自己の記憶装置に格納する。処理の実行時、このコンピュータは、自己の記録装置に格納されたプログラムを読み取り、読み取ったプログラムに従った処理を実行する。このプログラムの別の実行形態として、コンピュータが可搬型記録媒体から直接プログラムを読み取り、そのプログラムに従った処理を実行することとしてもよく、さらに、このコンピュータにサーバコンピュータからプログラムが転送されるたびに、逐次、受け取ったプログラムに従った処理を実行することとしてもよい。サーバコンピュータから、このコンピュータへのプログラムの転送は行わず、その実行指示と結果取得のみによって処理機能を実現する、いわゆるＡＳＰ（Application Service Provider）型のサービスによって、上述の処理を実行する構成としてもよい。 A computer that executes such a program first stores, for example, a program recorded on a portable recording medium or a program transferred from a server computer in its own storage device. When executing the process, this computer reads a program stored in its own recording device and executes a process according to the read program. As another execution form of the program, the computer may read the program directly from the portable recording medium and execute processing according to the program, and each time the program is transferred from the server computer to the computer. The processing according to the received program may be executed sequentially. The above-described processing may be executed by a so-called ASP (Application Service Provider) type service that realizes a processing function only by an execution instruction and result acquisition without transferring a program from the server computer to the computer. Good.

上記実施形態では、コンピュータ上で所定のプログラムを実行させて本装置の処理機能が実現されたが、これらの処理機能の少なくとも一部がハードウェアで実現されてもよい。 In the above embodiment, the processing functions of the apparatus are realized by executing a predetermined program on a computer. However, at least a part of these processing functions may be realized by hardware.

１音場再生装置
２係数設定装置 1 Sound field reproduction device 2 Coefficient setting device

Claims

A predetermined number of higher-order sound sources for reproducing an acoustic signal in a reproduction pattern, comprising any one of a plurality of independent radiation modes having different directivity patterns or a combination of at least a part of the plurality of radiation modes,
The sound field reproducing apparatus, wherein the weight of each of the radiation modes is controlled so that the reproduction pattern generates a desired sound field in a plurality of spatial regions using the echo of an acoustic signal.

The sound field reproducing apparatus according to claim 1,
The predetermined high-order sound sources are L high-order sound sources hs (1),..., Hs (L), L is a positive integer, and the plurality of radiation modes are 2N _V +1 radiation modes. m (1), ..., m (2N _V +1), N _V is a positive integer, u = 1, ..., L, ν = 1, ..., 2N _V +1,
H _νu (r, θ; k) is an acoustic transfer function at the wave number k of the radiation mode m (ν) from the higher-order sound source hs (u) to the global polar coordinate (r, θ), and r , Θ is a declination, M ₀ (k) is a positive integer, m = −M ₀ (k),..., M ₀ (k), and α _mνu (k) is the acoustic transmission. The expansion coefficient of the function H _νu (r, θ; k) with respect to m, and A (k) is a matrix of (2M ₀ (k) +1) × L ′ with α _mνu (k) as elements, and L '= L (2N _V +1),

[•] ^T represents the transpose of [•]

Β ^d _m (k) is a frequency domain signal S ^d (r, θ; wavenumber k) in polar coordinates (r, θ) of an area including the plurality of spatial areas in which the desired sound field is generated. k) is the expansion coefficient for m, and g (k) is a solution or approximate solution of A (k) g (k) = β ^d (k),
The reproduction pattern is a filter weight for the radiation mode m (ν) at the wave number k of the higher-order sound source hs (u).

The sound field reproduction device which is the pattern.

The sound field reproducing device according to claim 2,
H _νu (a, θ _q ; k) is obtained based on the acoustic signal observed by the microphone MIC (q) (where q = 1,..., Q) arranged at the global polar coordinates (a, θ _q ). Is an acoustic transfer function at a wave number k of the radiation mode m (ν) from the higher-order sound source hs (u) to the polar coordinates (a, θ _q ), a is a radial, and θ _q is a declination Yes,

I is an imaginary unit, H ′ _m (·) is a derivative of the first-class Hankel function of degree m,

(•) ^* represents the complex conjugate of (•), λ _eq is the equivalent regularization parameter, | (•) | represents the absolute value of (•),

A sound field reproduction device.

A predetermined number of high-order sound sources reproduces an acoustic signal with a reproduction pattern consisting of any one of a plurality of independent radiation modes having different directivity patterns or a combination of at least some of the plurality of radiation modes,
The sound reproduction method according to claim 1, wherein the weight of each of the radiation modes is controlled so that the reproduction pattern generates a desired sound field in a plurality of spatial regions using the echo of an acoustic signal.

The sound field reproduction method according to claim 4,
The predetermined high-order sound sources are L high-order sound sources hs (1),..., Hs (L), L is a positive integer, and the plurality of radiation modes are 2N _V +1 radiation modes. m (1), ..., m (2N _V +1), N _V is a positive integer, u = 1, ..., L, ν = 1, ..., 2N _V +1,
H _νu (r, θ; k) is an acoustic transfer function at the wave number k of the radiation mode m (ν) from the higher-order sound source hs (u) to the global polar coordinate (r, θ), and r , Θ is a declination, M ₀ (k) is a positive integer, m = −M ₀ (k),..., M ₀ (k), and α _mνu (k) is the acoustic transmission. The expansion coefficient of the function H _νu (r, θ; k) with respect to m, and A (k) is a matrix of (2M ₀ (k) +1) × L ′ with α _mνu (k) as elements, and L '= L (2N _V +1),

[•] ^T represents the transpose of [•]

Β ^d _m (k) is a frequency domain signal S ^d (r, θ; wavenumber k) in polar coordinates (r, θ) of an area including the plurality of spatial areas in which the desired sound field is generated. k) is the expansion coefficient for m, and g (k) is a solution or approximate solution of A (k) g (k) = β ^d (k),
The reproduction pattern is a filter weight for the radiation mode m (ν) at the wave number k of the higher-order sound source hs (u).

The sound field reproduction method that is the pattern.

The sound field reproduction method according to claim 5,
H _νu (a, θ _q ; k) is obtained based on the acoustic signal observed by the microphone MIC (q) (where q = 1,..., Q) arranged at the global polar coordinates (a, θ _q ). Is an acoustic transfer function at a wave number k of the radiation mode m (ν) from the higher-order sound source hs (u) to the polar coordinates (a, θ _q ), a is a radial, and θ _q is a declination Yes,

I is an imaginary unit, H ′ _m (·) is a derivative of the first-class Hankel function of degree m,

(·) ^* Is the complex conjugate of (·), λ _eq is the equivalent regularization parameter, | (·) | represents the absolute value of (·),

A sound field reproduction method.