JP6087760B2

JP6087760B2 - Sound field recording / reproducing apparatus, method, and program

Info

Publication number: JP6087760B2
Application number: JP2013156922A
Authority: JP
Inventors: 翔一小山; 祐介日和▲崎▼
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2013-07-29
Filing date: 2013-07-29
Publication date: 2017-03-01
Anticipated expiration: 2033-07-29
Also published as: JP2015027046A

Description

この発明は、ある音場に設置されたマイクロホンで音信号を収音し、その音信号を用いてスピーカでその音場を再現する波面合成法（Wave Field Synthesis）の技術、アンビソニックス(Ambisonics)の技術に関する。 The present invention relates to a wave field synthesis technique (Ambisonics) that collects a sound signal with a microphone installed in a certain sound field and reproduces the sound field with a speaker using the sound signal. Related to technology.

波面合成法及びアンビソニックスは、複数のマイクロホン及び複数のスピーカを用いて、遠隔地の音場を仮想的に再現する技術である。そのような技術として例えば非特許文献１に記載された技術が知られている。遠隔コミュニケーションシステムなどの応用では、リアルタイムの収音・再現が必要になるため、一般的なマイクロホンアレーで収音した音圧を、一般的なスピーカアレーで出力するための音場再現信号へと一意に変換可能であることが必要となる。 The wavefront synthesis method and ambisonics are technologies that virtually reproduce a sound field in a remote place using a plurality of microphones and a plurality of speakers. As such a technique, for example, a technique described in Non-Patent Document 1 is known. In applications such as remote communication systems, real-time sound collection and reproduction is required, so the sound pressure collected by a general microphone array is unique to a sound field reproduction signal for output by a general speaker array. It must be convertible to

小山,古家,日和崎,羽田,”音場収音・再現のための時空間周波数領域信号変換法,” 2011年9月, pp. 635-636.Oyama, Furuya, Hiwazaki, Haneda, “Spatio-temporal frequency domain signal conversion for sound field collection and reproduction,” September 2011, pp. 635-636.

非特許文献１に記載された技術では、音場を同じ位置又は前後左右に平行移動した位置で再現することはできたが、音場を所定の角度で回転して再現することはできなかった。 In the technique described in Non-Patent Document 1, the sound field could be reproduced at the same position or a position translated in the front / rear and left / right directions, but the sound field could not be reproduced by rotating it at a predetermined angle. .

この発明の目的は、音場を所定の角度で回転して再現することができる音場収音再生装置、方法及びプログラムを提供することである。 An object of the present invention is to provide a sound field collecting and reproducing apparatus, method, and program capable of reproducing a sound field by rotating it at a predetermined angle.

上記の課題を解決するために、この発明の一実施形態による音場収音再生装置は、２個以上のマイクロホンを含むマイクロホンアレーで収音された信号から生成された周波数領域信号が所定の回転角で空間周波数シフトされた信号である空間周波数変調信号を生成する空間周波数変調部を含み、空間周波数変調信号から再生信号を得る。 In order to solve the above-described problem, a sound field sound collecting / reproducing apparatus according to an embodiment of the present invention rotates a frequency domain signal generated from a signal collected by a microphone array including two or more microphones with a predetermined rotation. It includes a spatial frequency modulation section that generates a spatial frequency modulation signal that is a signal that is spatially shifted by an angle, and obtains a reproduction signal from the spatial frequency modulation signal.

音場を所定の角度で回転して再現することができる。 The sound field can be reproduced by rotating it at a predetermined angle.

第一実施形態の音場収音再生装置の例を示す機能ブロック図。The functional block diagram which shows the example of the sound field sound collection reproducing | regenerating apparatus of 1st embodiment. 第一実施形態のマイクロホン及びスピーカの配置の例を説明するための図。The figure for demonstrating the example of arrangement | positioning of the microphone and speaker of 1st embodiment. 第二実施形態の音場収音再生装置の例を示す機能ブロック図。The functional block diagram which shows the example of the sound field sound collection reproducing | regenerating apparatus of 2nd embodiment. 第二実施形態のマイクロホン及びスピーカの配置の例を説明するための図。The figure for demonstrating the example of arrangement | positioning of the microphone and speaker of 2nd embodiment. 音場収音再生方法の例を示す流れ図。The flowchart which shows the example of the sound field sound collection reproduction | regeneration method.

以下、図面を参照してこの発明の実施形態を説明する。以下の説明において、テキスト中で使用する記号「~」等は、本来直前の文字の真上に記載されるべきものであるが、テキスト記法の制限により、当該文字の直後に記載する。式中においてはこれらの記号は本来の位置に記述している。また、ベクトルや行列の各要素単位で行われる処理は、特に断りが無い限り、そのベクトルやその行列の全ての要素に対して適用されるものとする。 Embodiments of the present invention will be described below with reference to the drawings. In the following description, the symbol “˜” and the like used in the text should be described immediately above the immediately preceding character, but are described immediately after the character due to restrictions on text notation. In the formula, these symbols are written in their original positions. Further, the processing performed for each element of a vector or matrix is applied to all elements of the vector or matrix unless otherwise specified.

［第一実施形態］
＜マイクロホンアレー及びスピーカアレーの配置＞
第一実施形態の音場収音再生装置及び方法は、図２に示すように、第一の空間のy=0,z=0の位置に直線状に配置されたN_x個のマイクロホンで構成される一次元マイロホンアレーＭ１，Ｍ２，…，ＭＮ_ｘと、第一の空間とは異なる第二の空間のy=0,z=0の位置に直線状に配置されたN_x個のスピーカで構成される一次元スピーカアレーＳ１，Ｓ２，…，ＳＮ_ｘとを用いて、音源Sで発生した音によって形成された第一の空間の音場を第二の空間で再現する。 [First embodiment]
<Arrangement of microphone array and speaker array>
As shown in FIG. 2, the sound field sound collecting and reproducing apparatus and method according to the first embodiment includes N _x microphones arranged linearly at positions y = 0 and z = 0 in the first space. .., MN _x and N _x speakers arranged linearly at positions y = 0, z = 0 in a second space different from the first space. , SN _x is used to reproduce the sound field of the first space formed by the sound generated by the sound source S in the second space.

この際、音場は所定の回転角だけ回転した状態で再現される。具体的には、図１に点線で例示された、所定の回転角だけ回転されたスピーカアレーＳ１，Ｓ２，…，ＳＮ_ｘにより音場が再現されたような知覚を与えることができる。 At this time, the sound field is reproduced in a state rotated by a predetermined rotation angle. Specifically, it is possible to provide exemplified by dotted lines in FIG. 1, the speaker array S1, is rotated by a predetermined rotation angle, S2, ..., the perception that the sound field is reproduced by the SN _x.

第一実施形態は、後述する第二実施形態と比較すると、マイク数、スピーカ数及びチャネル数を少なくすることができるため、実装が比較的容易となる。 Compared with the second embodiment described later, the first embodiment can reduce the number of microphones, the number of speakers, and the number of channels, so that mounting is relatively easy.

N_xは２以上の整数である。この実施形態では、マイクロホンアレーＭ１，Ｍ２，…，ＭＮ_ｘを構成するマイクロホンの数とスピーカアレーＳ１，Ｓ２，…，ＳＮ_ｘを構成するスピーカの数は同じである。マイクロホンアレーＭ１，Ｍ２，…，ＭＮ_ｘを構成するマイクロホンＭｉは等間隔に配置されている。また、スピーカアレーＳ１，Ｓ２，…，ＳＮ_ｘを構成するスピーカも等間隔に配置されている。マイクロホンアレーＭ１，Ｍ２，…，ＭＮ_ｘの大きさと、スピーカアレーＳ１，Ｓ２，…，ＳＮ_ｘの大きさはほぼ同じである。各マイクロホンＭｉのマイクロホンアレーＭ１，Ｍ２，…，ＭＮ_ｘにおける位置は、その各マイクロホンＭｉに対応するスピーカＳｉのスピーカアレーＳ１，Ｓ２，…，ＳＮ_ｘにおける位置と同じであることが望ましいが、異なっていても良い。この位置が同じであれば、より忠実に音場の再生を行うことができる。 N _x is an integer of 2 or more. In this embodiment, the microphone array M1, M2, ..., the number and the speaker array S1 of microphone constituting the MN _x, S2, ..., the number of speakers constituting the SN _x is the same. The microphones Mi constituting the microphone arrays M1, M2,..., MN _x are arranged at equal intervals. Further, the speakers constituting the speaker arrays S1, S2,..., SN _x are also arranged at equal intervals. Microphone array M1, M2, ..., and the size of MN _x, speaker array S1, S2, ..., the magnitude of SN _x is about the same. Microphone array M1, M2 of the microphone Mi, ..., position in MN _x is speaker Si speaker array S1, S2 corresponding to the respective microphones Mi, ..., but it is desirable that the same as the position in the SN _x, different May be. If this position is the same, the sound field can be reproduced more faithfully.

第一の空間のy=0,z=0の位置に配置されたマイクロホンアレーＭ１，Ｍ２，…，ＭＮ_ｘを構成するインデックスi[i=1,…,N_x]に対応するマイクロホンＭｉの位置をr_m,i=(x_m,i,0,0)と表わすことにする。 The position of the microphone Mi corresponding to the index i [i = 1,..., N _x ] constituting the microphone arrays M1, M2,..., MN _x arranged at the positions y = 0, z = 0 in the first space. Is expressed as r _{m, i} = (x _{m, i} , 0,0).

なお、第一の空間に配置されたマイクロホンの数と第二の空間に配置されたスピーカの数は異なっていてもよい。マイクロホンの数が、第二の空間に配置されたスピーカの数よりも多い場合には、再生信号を間引けばよい。一方、マイクロホンの数が、第二の空間に配置されたスピーカの数よりも少ない場合には、再生信号をチャネル間で平均を取るなどして補間を行えばよい。補間を行う方法は、例えば、線形補間やsinc補間などを適用することができる。 Note that the number of microphones arranged in the first space and the number of speakers arranged in the second space may be different. If the number of microphones is larger than the number of speakers arranged in the second space, the reproduction signal may be thinned out. On the other hand, when the number of microphones is smaller than the number of speakers arranged in the second space, the reproduction signal may be interpolated by taking an average between channels. As a method for performing the interpolation, for example, linear interpolation, sinc interpolation, or the like can be applied.

＜音場収音再生装置＞
第一実施形態の音場収音再生装置は、図１に示すように周波数変換部１、空間周波数変調部２、空間周波数変換部３、変換フィルタ部４、空間周波数逆変換部５、周波数逆変換部６及び窓関数部７を例えば含み、図Ｆ１に例示された各ステップの処理を行う。 <Sound field recording and playback device>
As shown in FIG. 1, the sound field sound collecting / reproducing apparatus of the first embodiment includes a frequency conversion unit 1, a spatial frequency modulation unit 2, a spatial frequency conversion unit 3, a conversion filter unit 4, a spatial frequency inverse conversion unit 5, and a frequency inverse unit. The conversion unit 6 and the window function unit 7 are included, for example, and the processing of each step illustrated in FIG. F1 is performed.

第一の空間に配置されたマイクロホンアレーＭ１，Ｍ２，…，ＭＮ_ｘは、第一の空間の音源Ｓで発せられた音を収音して時間領域の信号を生成する。生成された信号は、周波数変換部１に送られる。インデックスiに対応するr_m,i=(x_m,i,0,0)に位置するマイクロホンＭｉで収音された時間領域の時刻ｔの信号をp_i(t)と表記する。 The microphone arrays M1, M2,..., MN _x arranged in the first space pick up the sound emitted from the sound source S in the first space and generate a time domain signal. The generated signal is sent to the frequency converter 1. A signal at time t in the time domain picked up by the microphone Mi located at r _{m, i} = (x _{m, i} , 0,0) corresponding to the index i is denoted as p _i (t).

＜周波数変換部１＞
周波数変換部１は、マイクロホンアレーＭ１，Ｍ２，…，ＭＮ_ｘで収音された信号p_i(t)をフーリエ変換により周波数領域信号P_i(ω)に変換する（ステップＳ１）。生成された周波数領域信号P_i(ω)は、空間周波数変調部２に提供される。ωは周波数である。例えば、短時間離散フーリエ変換により周波数領域信号P_i(ω)が生成される。もちろん、他の既存の方法により周波数領域信号P_i(ω)を生成してもよい。また、オーバーラップアド等の方法を用いて周波数領域信号P_i(ω)を生成してもよい。入力信号が長い場合や、リアルタイム処理のように連続して信号が入力される場合には、例えば１０ｍｓごとといったフレームごとに処理を行う。周波数領域信号P_i(ω)は、例えば以下のように定義される。関数expの引数の中のjは虚数単位である。 <Frequency conversion unit 1>
The frequency converter 1 converts the signal p _i (t) collected by the microphone arrays M1, M2,..., MN _x into a frequency domain signal P _i (ω) by Fourier transform (step S1). The generated frequency domain signal P _i (ω) is provided to the spatial frequency modulation unit 2. ω is a frequency. For example, the frequency domain signal P _i (ω) is generated by short-time discrete Fourier transform. Of course, the frequency domain signal P _i (ω) may be generated by other existing methods. Further, the frequency domain signal P _i (ω) may be generated using a method such as overlap add. When the input signal is long or when the signal is continuously input as in real time processing, the processing is performed for each frame such as every 10 ms. For example, the frequency domain signal P _i (ω) is defined as follows. J in the argument of the function exp is an imaginary unit.

＜空間周波数変調部２＞
空間周波数変調部２は、周波数領域信号P_i(ω)が所定の回転角で空間周波数シフトされた信号である空間周波数変調信号P_mod,i(ω)を生成する（ステップＳ２）。生成された空間周波数変調信号P_mod,i(ω)は、空間周波数変換部３に提供される。所定の回転角は、仰角方向の角度がθ_rotである回転角である。 <Spatial frequency modulation unit 2>
The spatial frequency modulation unit 2 generates a spatial frequency modulation signal P _{mod, i} (ω) that is a signal obtained by shifting the frequency domain signal P _i (ω) by a spatial frequency at a predetermined rotation angle (step S2). The generated spatial frequency modulation signal P _{mod, i} (ω) is provided to the spatial frequency conversion unit 3. The predetermined rotation angle is a rotation angle whose angle in the elevation angle direction is θ _rot .

空間周波数変調部２は、例えば下記式により定義される空間周波数変調信号P_mod,i(ω)を計算する。jは虚数単位であり、kは波数でありcを音速とするとk=ω/cである。 The spatial frequency modulation unit 2 calculates a spatial frequency modulation signal P _{mod, i} (ω) defined by the following equation, for example. j is an imaginary unit, k is a wave number, and k = ω / c where c is the speed of sound.

このように、空間周波数変調部２は、２個以上のマイクロホンを含むマイクロホンアレーで収音された信号から生成された周波数領域信号が所定の回転角で空間周波数シフトされた信号である空間周波数変調信号を生成する。その後、音場収音再生装置は、以下のようにして、空間周波数変調信号から再生信号を得る。 As described above, the spatial frequency modulation unit 2 is a spatial frequency modulation which is a signal obtained by shifting a frequency domain signal generated from a signal collected by a microphone array including two or more microphones by a predetermined rotational angle. Generate a signal. Thereafter, the sound field sound collecting / reproducing apparatus obtains a reproduced signal from the spatial frequency modulation signal as follows.

＜空間周波数変換部３＞
空間周波数変換部３は、空間のフーリエ変換により周波数領域信号P_mod,i(ω)を時空間周波数領域信号P~_n(ω)に変換する（ステップＳ３）。時空間周波数領域信号P~_n(ω)は、周波数領域信号P_mod,i(ω)に由来するものである。時空間周波数領域信号P~_n(ω)は、各ωごとに計算される。変換された時空間周波数領域信号P~_n(ω)は、変換フィルタ部４に提供される。空間周波数変換部３は、具体的には下記式（１）により定義されるP~_n(ω)を計算する。 <Spatial frequency converter 3>
The spatial frequency conversion unit 3 converts the frequency domain signal P _{mod, i} (ω) into a spatio-temporal frequency domain signal P _n (ω) by Fourier transform of space (step S3). Spatio-temporal frequency domain signal P ~ _n (ω) is derived from the frequency domain signal P _mod, the _i (omega). The spatio-temporal frequency domain signal P _n (ω) is calculated for each ω. The converted spatio-temporal frequency domain signal P _n (ω) is provided to the conversion filter unit 4. Specifically, the spatial frequency conversion unit 3 calculates P _n (ω) defined by the following equation (1).

k_x,nはx軸方向の波数であり、nは波数k_x,nのインデックスであり、波数とは、いわゆる空間周波数又は角度スペクトルのことである。上記式（１）は、時空間周波数領域への変換の一例であり、他の方法により空間のフーリエ変換を行ってもよい。 k _{x, n} is a wave number in the x-axis direction, n is an index of the wave number k _{x, n} , and the wave number is a so-called spatial frequency or angular spectrum. The above formula (1) is an example of conversion to the spatio-temporal frequency domain, and spatial Fourier transform may be performed by other methods.

＜変換フィルタ部４＞
変換フィルタ部４は、時空間周波数領域信号P~_n(ω)に対して式（２）により定義されるフィルタF~_n(ω)を適用してフィルタ処理後信号D~_n(ω)を生成する（ステップＳ４）。生成されたフィルタ処理後信号D~_n(ω)は、空間周波数逆変換部５に提供される。 <Conversion filter unit 4>
Conversion filter unit 4, the spatio-temporal frequency domain signal P ~ _n (omega) by applying the equation (2) filters F ~ _n defined by (omega) to the filter processed signal D ~ _n the (omega) Generate (step S4). The generated post-filtering signal D _n (ω) is provided to the spatial frequency inverse transform unit 5.

A(ω)は、周波数特性を調整するための所定の複素数である。例えば、A(ω)=1+0×ｊ=1である。y_refは、スピーカアレーＳ１，Ｓ２，…，ＳＮ_ｘから振幅を一致させる位置までのy軸方向における距離である。言い換えれば、y_refは、振幅を一致させる位置ともいえる。 A (ω) is a predetermined complex number for adjusting the frequency characteristic. For example, A (ω) = 1 + 0 × j = 1. y _ref is a speaker array S1, S2, ..., a distance in the y-axis direction from the SN _x to a position to match the amplitude. In other words, y _ref can be said to be a position where the amplitudes are matched.

また、w_nはnに基づいて例えば以下のように定まる、エバネッセント波を減衰させるための所定の重みである。以下の式において、k_cは、予め定められた値でありそれぞれk_x,nのカットオフ値である。k_cは、例えばエバネッセント波を抑制するような値に設定する。α_xは、カットオフの滑らかさを決めるための予め定められた値であり例えば0.05である。もちろん、w_nとして、他の重み関数を用いてもよい。 Also, w _n is determined by, for example, in the following manner based on n, it is a predetermined weight to attenuate the evanescent wave. In the following equations, k _c is a predetermined value and is a cutoff value of k _{x, n} respectively. k _c is set to a value that suppresses evanescent waves, for example. α _x is a predetermined value for determining the smoothness of the cutoff, and is 0.05, for example. Of course, other weight functions may be used as w _n .

・を任意の実数として、H_n ⁽¹⁾(・)はn次の第一種ハンケル関数である。したがって、H₀ ⁽¹⁾(・)は０次の第一種ハンケル関数である。H_n ⁽¹⁾(・)は、以下のように定義される。J_n(・)はn次のベッセル関数であり、Γ(z)はガンマ関数であり、Y_n(z)はノイマン関数である。 H _n ⁽¹⁾ (·) is an nth-order first-class Hankel function, where • is an arbitrary real number. Therefore, H ₀ ⁽¹⁾ (·) is a zeroth-order first-class Hankel function. H _n ⁽¹⁾ (•) is defined as follows. J _n (•) is an nth-order Bessel function, Γ (z) is a gamma function, and Y _n (z) is a Neumann function.

なお、変換フィルタ部４は、上記式（２）のフィルタF~_n(ω)に代えて、式（３）により定義されるフィルタF~_n(ω)を適用してフィルタ処理後信号D~_n(ω)を生成してもよい。d_x,d_yは、再生する音場をそれぞれx軸方向及びy軸方向にシフトさせる量である。このフィルタF~_n(ω)を適用すると、第二の空間において、音場をx軸方向にd_xだけy軸方向にd_yだけシフトして再現することができる。 The conversion filter unit 4, instead of the filter F ~ _n (omega) of the above formula (2), Equation (3) applies a filter F ~ _n (omega) which is defined by the filter processed signal D ~ _n (ω) may be generated. d _x and d _y are amounts by which the sound field to be reproduced is shifted in the x-axis direction and the y-axis direction, respectively. Applying this filter F ~ _n (ω), in the second space, the sound field can be reproduced shifted by d _y in the y-axis direction by d _x in the x-axis direction.

＜空間周波数逆変換部５＞
空間周波数逆変換部５は、フィルタ処理後信号D~_n(ω)を空間の逆フーリエ変換により周波数領域信号D_i(ω)に変換する（ステップＳ５）。変換された周波数領域信号D_i(ω)は、周波数逆変換部６に提供される。空間周波数逆変換部５は、具体的には下記式（４）により定義される周波数領域信号D_i(ω)を計算する。関数expの引数の中のjは虚数単位である。 <Spatial frequency inverse transform unit 5>
The spatial frequency inverse transform unit 5 transforms the filtered signal _D˜n (ω) into the frequency domain signal D _i (ω) by inverse Fourier transform of the space (step S5). The transformed frequency domain signal D _i (ω) is provided to the frequency inverse transform unit 6. Specifically, the spatial frequency inverse transform unit 5 calculates a frequency domain signal D _i (ω) defined by the following equation (4). J in the argument of the function exp is an imaginary unit.

＜周波数逆変換部６＞
周波数逆変換部６は、周波数領域信号D_i(ω)を逆フーリエ変換により時間領域信号P^d _i(t)に変換する（ステップＳ６）。逆フーリエ変換によりフレーム毎に得られた時間領域信号P^d _i(t)は適宜シフトされて線形和が取られて、連続した時間領域信号となる。逆フーリエ変換は短時間離散逆フーリエ変換等の既存の方法を用いればよい。時間領域信号P^d _i(t)は、窓関数部７に送られる。 <Inverse frequency converter 6>
The frequency inverse transform unit 6 transforms the frequency domain signal D _i (ω) into a time domain signal P ^d _i (t) by inverse Fourier transform (step S6). The time domain signal P ^d _i (t) obtained for each frame by the inverse Fourier transform is appropriately shifted to obtain a linear sum to be a continuous time domain signal. For the inverse Fourier transform, an existing method such as a short-time discrete inverse Fourier transform may be used. The time domain signal P ^d _i (t) is sent to the window function unit 7.

＜窓関数部７＞
窓関数部７は、時間領域信号P^d _i(t)に窓関数を乗じて窓関数後時間領域信号d_i（t）を生成する（ステップＳ７）。窓関数後時間領域信号d_i（t）は、スピーカアレーＳ１，Ｓ２，…，ＳＮ_ｘに提供される。 <Window function part 7>
The window function unit 7 multiplies the time domain signal P ^d _i (t) by the window function to generate a post-window function time domain signal d _i (t) (step S7). Window function after a time-domain signal d _i (t) is the speaker array S1, S2, ..., are provided in the SN _x.

窓関数として、以下の式より定義されるいわゆるターキー（Tukey）窓関数w_iを例えば用いる。N_tprは、テーパーを適用する点数であり１以上N_x以下の整数である。もちろん、他の窓関数を用いてもよい。 For example, a so-called Tukey window function w _i defined by the following equation is used as the window function. N _tpr is the number of points to which the taper is applied, and is an integer from 1 to N _x . Of course, other window functions may be used.

スピーカアレーＳ１，Ｓ２，…，ＳＮ_ｘは、窓関数後時間領域信号d_i（t）に基づいて音を再生する。具体的には、i=1,…,N_xとして、スピーカＳｉが窓関数後時間領域信号d_i（t）に基づいて音を再生する。 The speaker arrays S1, S2,..., SN _x reproduce sound based on the time domain signal d _i (t) after the window function. Specifically, with i = 1,..., N _x , the speaker Si reproduces sound based on the time domain signal d _i (t) after the window function.

これにより、第一の空間のy=0,z=0の位置の波面を第二の空間のスピーカアレーＳ１，Ｓ２，…，ＳＮ_ｘで再現して、第一の空間の音場を第二の空間に再現することができる。 Thus, the first y = 0 of the space, z wavefront position = 0 in the second space speaker array S1, S2, ..., and reproduced in SN _x, the sound field of the first space second Can be reproduced in the space.

この際、再現される信号の振幅は、y_refで表される直線上の位置で振幅が一致する。具体的には、図１に示すように、スピーカアレーＳ１，Ｓ２，…，ＳＮ_ｘからy軸方向にy_refだけ離れた位置にあり、スピーカアレーＳ１，Ｓ２，…，ＳＮ_ｘが配置されている直線と平行な直線上の位置で振幅が一致する。 At this time, the amplitude of the reproduced signal matches at a position on a straight line represented by y _ref . Specifically, as shown in FIG. 1, the speaker array S1, S2, ..., in a position at a distance y _ref from SN _x to y-axis direction, the speaker array S1, S2, ..., and SN _x is located The amplitude matches at a position on a straight line parallel to the straight line.

また、音場は所定の回転角だけ回転した状態で再現される。具体的には、図１に点線で示された、所定の回転角だけ回転されたスピーカアレーＳ１，Ｓ２，…，ＳＮ_ｘにより音場が再現されたような知覚を与えることができる。 In addition, the sound field is reproduced in a state rotated by a predetermined rotation angle. Specifically, it is possible to provide indicated by a dotted line in FIG. 1, the speaker array S1, it is rotated by a predetermined rotation angle, S2, ..., the perception that the sound field is reproduced by the SN _x.

［第二実施形態］
＜マイクロホンアレー及びスピーカアレーの配置＞
第二実施形態の音場収音再生装置及び方法は、図３に示すように、第一の空間のy=0の位置に平面状に配置されたN_x×N_z個のマイクロホンで構成される二次元マイクロホンアレーＭ１−１，Ｍ２−１，…，ＭＮ_ｘ−Ｎ_ｚと、第一の空間とは異なる第二の空間のy=0の位置に平面状に配置されたN_x×N_z個のスピーカで構成される二次元スピーカアレーＳ１−１，Ｓ２−１，…，ＳＮ_ｘ−Ｎ_ｚとを用いて、音源Sで発生した音によって形成された第一の空間の音場を第二の空間で再現する。 [Second Embodiment]
<Arrangement of microphone array and speaker array>
As shown in FIG. 3, the sound field sound collecting / reproducing apparatus and method according to the second embodiment are configured by N _x × N _z microphones arranged in a plane at a position y = 0 in the first space. , MN _x -N _z and N _x × N arranged in a plane at a position y = 0 in a second space different from the first space. _{Using the} two-dimensional speaker arrays S1-1, S2-1,..., SN _x -N _z composed of _z speakers, the sound field of the first space formed by the sound generated by the sound source S is obtained. Reproduce in the second space.

この際、音場は所定の回転角だけ回転した状態で再現される。具体的には、図１に点線で例示された、所定の回転角だけ回転されたスピーカアレーＳ１−１，Ｓ２−１，…，ＳＮ_ｘ−Ｎ_ｚにより音場が再現されたような知覚を与えることができる。 At this time, the sound field is reproduced in a state rotated by a predetermined rotation angle. Specifically, the perception that the sound field is reproduced by the speaker arrays S1-1, S2-1,..., SN _x -N _z rotated by a predetermined rotation angle, as exemplified by the dotted line in FIG. Can be given.

N_x,N_zは２以上の整数である。N_x及びN_zは、同じ値でもよいし、互いに異なる値であってもよい。この実施形態では、マイクロホンアレーＭ１−１，Ｍ２−１，…，ＭＮ_ｘ−Ｎ_ｚを構成するマイクロホンの数とスピーカアレーＳ１−１，Ｓ２−１，…，ＳＮ_ｘ−Ｎ_ｚを構成するスピーカの数は同じである。マイクロホンアレーＭ１−１，Ｍ２−１，…，ＭＮ_ｘ−Ｎ_ｚを構成するマイクロホンＭｉ−ｊは等間隔に配置されている。また、スピーカアレーＳ１−１，Ｓ２−１，…，ＳＮ_ｘ−Ｎ_ｚを構成するスピーカも等間隔に配置されている。マイクロホンアレーＭ１−１，Ｍ２−１，…，ＭＮ_ｘ−Ｎ_ｚの大きさと、スピーカアレーＳ１−１，Ｓ２−１，…，ＳＮ_ｘ−Ｎ_ｚの大きさはほぼ同じである。各マイクロホンＭｉ−ｊのマイクロホンアレーＭ１−１，Ｍ２−１，…，ＭＮ_ｘ−Ｎ_ｚにおける位置は、その各マイクロホンＭｉ−ｊに対応するスピーカＳｉ−ｊのスピーカアレーＳ１−１，Ｓ２−１，…，ＳＮ_ｘ−Ｎ_ｚにおける位置と同じであることが望ましいが、異なっていてもよい。この位置が同じであれば、より忠実に音場の再生を行うことができる。 N _x and N _z are integers of 2 or more. N _x and N _z may be the same value or different values. In this embodiment, the microphone array M1-1, M2-1, ..., _MN x the number and the speaker array microphones constituting the -N _z S1-1, S2-1, ..., constituting the _SN x -N _z speaker The number of is the same. The microphones Mi-j constituting the microphone arrays M1-1, M2-1,..., MN _x -N _z are arranged at equal intervals. Further, the speakers constituting the speaker arrays S1-1, S2-1,..., SN _x -N _z are also arranged at equal intervals. Microphone array M1-1, M2-1, ..., and the size of _MN x -N _z, a speaker array S1-1, S2-1, ..., the magnitude of _SN x -N _z are approximately the same. Microphone array M1-1 of the microphones Mi-j, M2-1, ..., position in _MN x -N _z is the speaker Si-j of the speaker array S1-1 corresponding to the respective microphones Mi-j, S2-1 ,..., SN _x −N _z is desirably the same as the position, but may be different. If this position is the same, the sound field can be reproduced more faithfully.

第一の空間のy=0の位置に配置されたマイクロホンアレーＭ１−１，Ｍ２−１，…，ＭＮ_ｘ−Ｎ_ｚを構成するインデックス(i,j) [i=1,…,N_x,j=1,…,N_z]に対応するマイクロホンＭｉ−ｊの位置をr_m,ij=(x_m,i,0,z_m,j)と表わすことにする。 Indexes (i, j) constituting the microphone arrays M1-1, M2-1,..., MN _x −N _z arranged at the position of y = 0 in the first space [i = 1 _,. The position of the microphone Mi-j corresponding to j = 1,..., N _z ] is represented as r _{m, ij} = (x _{m, i} , 0, z _{m, j} ).

＜音場収音再生装置＞
第二実施形態の音場収音再生装置は、図１に示すように周波数変換部１、空間周波数変調部２、空間周波数変換部３、変換フィルタ部４、空間周波数逆変換部５、周波数逆変換部６及び窓関数部７を例えば含み、図Ｆ１に例示された各ステップの処理を行う。 <Sound field recording and playback device>
As shown in FIG. 1, the sound field sound collecting and reproducing apparatus according to the second embodiment includes a frequency conversion unit 1, a spatial frequency modulation unit 2, a spatial frequency conversion unit 3, a conversion filter unit 4, a spatial frequency inverse conversion unit 5, and a frequency inverse unit. The conversion unit 6 and the window function unit 7 are included, for example, and the processing of each step illustrated in FIG. F1 is performed.

第一の空間に配置されたマイクロホンアレーＭ１−１，Ｍ２−１，…，ＭＮ_ｘ−Ｎ_ｚは、第一の空間の音源Ｓで発せられた音を収音して時間領域の信号を生成する。生成された信号は、周波数変換部１に送られる。インデックスi,jに対応するr_m,ij=(x_m,i,0,z_m,j)に位置するマイクロホンＭｉ−ｊで収音された時間領域の時刻ｔの信号をp_ij(t)と表記する。 The microphone arrays M1-1, M2-1,..., MN _x -N _z arranged in the first space pick up sounds emitted from the sound source S in the first space and generate signals in the time domain. To do. The generated signal is sent to the frequency converter 1. A signal at time t in the time domain picked up by the microphone Mi-j located at r _{m, ij} = (x _{m, i} , 0, z _{m, j} ) corresponding to the index i, j is represented by p _ij (t) Is written.

＜周波数変換部１＞
周波数変換部１は、マイクロホンアレーＭ１−１，Ｍ２−１，…，ＭＮ_ｘ−Ｎ_ｚで収音された信号p_ij(t)をフーリエ変換により周波数領域信号P_ij(ω)に変換する（ステップＳ１）。生成された周波数領域信号P_ij(ω)は、空間周波数変調部２に提供される。ωは周波数である。例えば、短時間離散フーリエ変換により周波数領域信号P_ij(ω)が生成される。もちろん、他の既存の方法により周波数領域信号P_ij(ω)を生成してもよい。また、オーバーラップアド等の方法を用いて周波数領域信号P_ij(ω)を生成してもよい。入力信号が長い場合や、リアルタイム処理のように連続して信号が入力される場合には、例えば１０ｍｓごとといったフレームごとに処理を行う。周波数領域信号P_ij(ω)は、例えば以下のように定義される。関数expの引数の中のjは虚数単位である。 <Frequency conversion unit 1>
The frequency converter 1 converts the signal p _ij (t) collected by the microphone arrays M1-1, M2-1,..., MN _x −N _z into a frequency domain signal P _ij (ω) by Fourier transform ( Step S1). The generated frequency domain signal P _ij (ω) is provided to the spatial frequency modulation unit 2. ω is a frequency. For example, the frequency domain signal P _ij (ω) is generated by short-time discrete Fourier transform. Of course, the frequency domain signal P _ij (ω) may be generated by other existing methods. Further, the frequency domain signal P _ij (ω) may be generated using a method such as overlap add. When the input signal is long or when the signal is continuously input as in real time processing, the processing is performed for each frame such as every 10 ms. For example, the frequency domain signal P _ij (ω) is defined as follows. J in the argument of the function exp is an imaginary unit.

＜空間周波数変調部２＞
空間周波数変調部２は、周波数領域信号P_ij(ω)が所定の回転角で空間周波数シフトされた信号である空間周波数変調信号P_mod,ij(ω)を生成する（ステップＳ２）。生成された空間周波数変調信号P_mod,ij(ω)は、空間周波数変換部３に提供される。所定の回転角は、方位角方向の角度がθ_rotであり仰角方向の角度がφ_rotである回転角である。 <Spatial frequency modulation unit 2>
The spatial frequency modulation unit 2 generates a spatial frequency modulation signal P _{mod, ij} (ω) that is a signal obtained by shifting the frequency domain signal P _ij (ω) by a spatial frequency by a predetermined rotation angle (step S2). The generated spatial frequency modulation signal P _{mod, ij} (ω) is provided to the spatial frequency conversion unit 3. The predetermined rotation angle is a rotation angle in which the azimuth angle is θ _rot and the elevation angle is φ _rot .

空間周波数変調部２は、例えば下記式により定義される空間周波数変調信号P_mod,ij(ω)を計算する。関数expの引数の中のjは虚数単位である。kは波数でありcを音速とするとk=ω/cである。 The spatial frequency modulation unit 2 calculates a spatial frequency modulation signal P _{mod, ij} (ω) defined by the following equation, for example. J in the argument of the function exp is an imaginary unit. k is the wave number, and k = ω / c where c is the speed of sound.

＜空間周波数変換部３＞
空間周波数変換部３は、空間のフーリエ変換により周波数領域信号P_mod,ij(ω)を時空間周波数領域信号P~_nm(ω)に変換する（ステップＳ３）。時空間周波数領域信号P~_nm(ω)は、周波数領域信号P_mod,ij(ω)に由来するものである。時空間周波数領域信号P~_nm(ω)は、各ωごとに計算される。変換された時空間周波数領域信号P~_nm(ω)は、変換フィルタ部４に提供される。空間周波数変換部３は、具体的には下記式（６）により定義されるP~_nｍ(ω)を計算する。関数expの引数の中のjは虚数単位である。 <Spatial frequency converter 3>
Spatial frequency transformation unit 3, the frequency domain signal P _mod by the Fourier transform of the _{spatial, ij} (omega) is converted to the spatio-temporal frequency domain signal P ~ _nm (ω) (Step S3). Spatio-temporal frequency domain signal P ~ _nm (ω) are derived from the frequency domain signal P _mod, the _{ij (ω).} The spatio-temporal frequency domain signal P _nm (ω) is calculated for each ω. The converted spatio-temporal frequency domain signal P _nm (ω) is provided to the conversion filter unit 4. Specifically, the spatial frequency converter 3 calculates _P˜nm (ω) defined by the following equation (6). J in the argument of the function exp is an imaginary unit.

k_x,nはx軸方向の波数であり、nは波数k_x,nのインデックスであり、k_z,mはz軸方向の波数であり、mは波数k_z,mのインデックスであり、波数とは、いわゆる空間周波数又は角度スペクトルのことである。上記式（６）は、時空間周波数領域への変換の一例であり、他の方法により空間のフーリエ変換を行ってもよい。 k _{x, n} is the wave number in the x-axis direction, n is the index of the wave number k _{x, n} , k _{z, m} is the wave number in the z-axis direction, m is the index of the wave number k _{z, m} , The wave number is a so-called spatial frequency or angular spectrum. The above equation (6) is an example of conversion to the spatio-temporal frequency domain, and spatial Fourier transform may be performed by other methods.

＜変換フィルタ部４＞
変換フィルタ部４は、時空間周波数領域信号P~_nm(ω)に対して式（７）により定義されるフィルタF~_nm(ω)を適用してフィルタ処理後信号D~_nm(ω)を生成する（ステップＳ４）。生成されたフィルタ処理後信号D~_nm(ω)は、空間周波数逆変換部５に提供される。 <Conversion filter unit 4>
Conversion filter unit 4, the spatio-temporal frequency domain signal P ~ _nm filter F is defined for (omega) by Equation (7) ~ _nm (ω) applied to filtering after signal D ~ _nm to the (omega) Generate (step S4). The generated filtered signal _D˜nm (ω) is provided to the spatial frequency inverse transform unit 5.

A(ω)は、周波数特性を調整するための所定の複素数である。例えば、A(ω)=1+0×ｊ=1である。 A (ω) is a predetermined complex number for adjusting the frequency characteristic. For example, A (ω) = 1 + 0 × j = 1.

また、w_nmはn,mに基づいて例えば以下のように定まる、エバネッセント波を減衰させるための所定の重みである。以下の式において、k_cは、予め定められた値でありk_x,n,k_z,mのカットオフ値である。k_cは、例えばエバネッセント波を抑制するような値に設定する。α_x,α_zは、カットオフの滑らかさを決めるための予め定められた値であり例えば0.05である。もちろん、w_nmとして、他の重み関数を用いてもよい。 W _nm is a predetermined weight for attenuating the evanescent wave, which is determined as follows based on n and m, for example. In the following equation, k _c is a predetermined value and is a cutoff value of k _{x, n} , k _{z, m} . k _c is set to a value that suppresses evanescent waves, for example. α _x and α _z are predetermined values for determining the smoothness of the cutoff, and are 0.05, for example. Of course, other weight functions may be used as w _nm .

なお、変換フィルタ部４は、上記式（７）のフィルタF~_nm(ω)に代えて、式（８）により定義されるフィルタF~_nm(ω)を適用してフィルタ処理後信号D~_nm(ω)を生成してもよい。d_x,d_y,d_zは、再生する音場をそれぞれx軸方向、y軸方向及びz軸方向にシフトさせる量である。このフィルタF~_nm(ω)を適用すると、第二の空間において、音場をx軸方向にd_xだけy軸方向にd_yだけz軸方向にd_zだけシフトして再現することができる。 Note that the conversion filter unit 4 applies the filter _F˜nm (ω) defined by the equation (8) instead of the filter _F˜nm (ω) of the above equation (7) to apply the filtered signal D˜ _nm (ω) may be generated. d _x , d _y , and d _z are amounts by which the sound field to be reproduced is shifted in the x-axis direction, the y-axis direction, and the z-axis direction, respectively. Applying this filter F ~ _nm (ω), can be first in a two-space, to reproduce the sound field shifted by d _y in the z-axis direction by the y-axis direction d _x in the x-axis direction by d _z .

＜空間周波数逆変換部５＞
空間周波数逆変換部５は、フィルタ処理後信号D~_nm(ω)を空間の逆フーリエ変換により周波数領域信号D_ij(ω)に変換する（ステップＳ５）。変換された周波数領域信号D_ij(ω)は、周波数逆変換部６に提供される。空間周波数逆変換部５は、具体的には下記式（９）により定義される周波数領域信号D_ij(ω)を計算する。関数expの引数の中のjは虚数単位である。 <Spatial frequency inverse transform unit 5>
The spatial frequency inverse transform unit 5, converts the filtered signal after D ~ _nm (ω) on the frequency domain signal D _ij (omega) by inverse Fourier transform of the space (step S5). The converted frequency domain signal D _ij (ω) is provided to the frequency inverse transform unit 6. Specifically, the spatial frequency inverse transform unit 5 calculates a frequency domain signal D _ij (ω) defined by the following equation (9). J in the argument of the function exp is an imaginary unit.

＜周波数逆変換部６＞
周波数逆変換部６は、周波数領域信号D_ij(ω)を逆フーリエ変換により時間領域信号P^d _ij(t)に変換する（ステップＳ６）。逆フーリエ変換によりフレーム毎に得られた時間領域信号P^d _ij(t)は適宜シフトされて線形和が取られて、連続した時間領域信号となる。逆フーリエ変換は短時間離散逆フーリエ変換等の既存の方法を用いればよい。時間領域信号P^d _ij(t)は、窓関数部７に送られる。 <Inverse frequency converter 6>
The frequency inverse transform unit 6 transforms the frequency domain signal D _ij (ω) to the time domain signal P ^d _ij (t) by inverse Fourier transform (step S6). The time domain signal P ^d _ij (t) obtained for each frame by the inverse Fourier transform is appropriately shifted to obtain a linear sum to be a continuous time domain signal. For the inverse Fourier transform, an existing method such as a short-time discrete inverse Fourier transform may be used. The time domain signal P ^d _ij (t) is sent to the window function unit 7.

＜窓関数部７＞
窓関数部７は、時間領域信号P^d _ij(t)に窓関数を乗じて窓関数後時間領域信号d_ij（t）を生成する（ステップＳ７）。窓関数後時間領域信号d_ij（t）は、スピーカアレーＳ１，Ｓ２，…，ＳＮ_ｘに提供される。 <Window function part 7>
The window function unit 7 multiplies the time domain signal P ^d _ij (t) by the window function to generate a post-window function time domain signal d _ij (t) (step S7). After the window function time domain signal d _ij (t) is the speaker array S1, S2, ..., it is provided in the SN _x.

窓関数として、以下の式より定義されるいわゆるターキー（Tukey）窓関数w_ijを例えば用いる。N_tprは、テーパーを適用する点数であり１以上N_x,N_z以下の整数である。もちろん、他の窓関数を用いてもよい。 For example, a so-called tukey window function w _ij defined by the following equation is used as the window function. N _tpr is the number of points to which the taper is applied, and is an integer of 1 or more and N _x or N _z . Of course, other window functions may be used.

スピーカアレーＳ１−１，Ｓ２−１，…，ＳＮ_ｘ−Ｎ_ｚは、窓関数後時間領域信号d_ij（t）に基づいて音を再生する。具体的には、i=1,…,N_x,j=1,…,N_zとして、スピーカＳｉ−ｊが窓関数後時間領域信号d_ij（t）に基づいて音を再生する。 The speaker arrays S1-1, S2-1,..., SN _x -N _z reproduce sound based on the time domain signal _dij (t) after the window function. Specifically, with i = 1,..., N _x , j = 1,..., N _z , the speaker Si-j reproduces the sound based on the post-window function time domain signal _dij (t).

これにより、第一の空間のy=0の位置の波面を第二の空間のスピーカアレーＳ１−１，Ｓ２−１，…，ＳＮ_ｘ−Ｎ_ｚで再現して、第一の空間の音場を第二の空間に再現することができる。 Thereby, the wavefront of the position of y = 0 in the first space the second space of the speaker array S1-1, S2-1, ..., and reproduced in SN _x -N _z, the sound field of the first space Can be reproduced in the second space.

［変形例等］
フィルタは、収音が行われた音場を再現する信号に変換するための、再生信号が出力される２個以上のスピーカを含むスピーカアレーの配置に応じたフィルタであればどのようなフィルタであってもよい。第一実施形態及び第二実施形態で説明したフィルタは、一例である。 [Modifications, etc.]
Any filter can be used as long as it is a filter according to the arrangement of speaker arrays including two or more speakers from which a reproduction signal is output for converting the sound field where the sound is collected into a signal to be reproduced. There may be. The filters described in the first embodiment and the second embodiment are examples.

例えば、第一実施形態において、以下に示す式（１０）又は式（１１）で定義されるフィルタを用いてもよい。 For example, in the first embodiment, a filter defined by the following formula (10) or formula (11) may be used.

式（１０）において、p,qは予め設定された次数とし、d_p,qはスピーカアレーを構成する各スピーカの伝達特性を前記次数p,qで多重極展開した多重極係数である。例えばp,qの全てを１とする等、p,qのうち何れか１つ以上は０でない正値である。 In Equation (10), p and q are preset orders, and d _{p and q} are multipole coefficients obtained by multipole expansion of the transfer characteristics of the speakers constituting the speaker array with the orders p and q. For example, one or more of p and q are positive values other than 0, such as setting all of p and q to 1.

H₀ ⁽²⁾は、n=0の第二種ハンケル関数である。第二種ハンケル関数H_n ⁽²⁾は、第一種ベッセル関数J_n(x)及び第二種ベッセル関数Y_n(x)を用いて、以下のように定義される。 H ₀ ⁽²⁾ is a second kind Hankel function with n = 0. The second kind Hankel function H _n ⁽²⁾ is defined as follows using the first kind Bessel function J _n (x) and the second kind Bessel function Y _n (x).

k_ρは次式により定義される。 k _ρ is defined by the following equation.

式（１１）において、G~_2Dは、理想的な伝達特性を表す２次元自由空間グリーン関数をｘ軸方向に空間のフーリエ変換をした関数である。G~_spは、音場が再現される空間のスピーカアレーの位置とその位置からy_refだけ離れた位置との間の予め測定された伝達特性をｘ軸方向に空間のフーリエ変換をした関数である。 In Expression (11), G to _2D are functions obtained by subjecting a two-dimensional free space Green function representing ideal transfer characteristics to a spatial Fourier transform in the x-axis direction. G ~ _sp is a function obtained by performing a Fourier transform of the space in the x-axis direction on the pre-measured transfer characteristics between the position of the speaker array in the space where the sound field is reproduced and the position away from that position by _yref. is there.

また、例えば、第二実施形態において、以下に示す式（１２）又は式（１３）で定義されるフィルタを用いてもよい。 Further, for example, in the second embodiment, a filter defined by the following formula (12) or formula (13) may be used.

式（１２）において、p,q,sは予め設定された次数であり、d_p,q,sはスピーカアレーを構成する各スピーカの伝達特性を前記次数p,q,sで多重極展開した多重極係数である。例えばp,q,sの全てを１とする等、p,q,sのうち何れか１つ以上は０でない正値である。 In Equation (12), p, q, and s are preset orders, and d _{p, q, and s} are multipole expansions of the transfer characteristics of each speaker constituting the speaker array at the orders p, q, and s. Multipole coefficient. For example, one or more of p, q, and s are positive values that are not 0, such as setting all of p, q, and s to 1.

式（１３）において、G~は、理想的な伝達特性を表す３次元自由空間グリーン関数をｘ軸方向及びｚ軸方向に空間のフーリエ変換をした関数である。G~_spは、音場が再現される空間のスピーカアレーの位置とその位置からy_refだけ離れた位置との間の予め測定された伝達特性をｘ軸方向及びｚ軸方向に空間のフーリエ変換をした関数である。 In Expression (13), G˜ is a function obtained by subjecting a three-dimensional free space Green function representing ideal transfer characteristics to a spatial Fourier transform in the x-axis direction and the z-axis direction. G ~ _sp is the Fourier transform of space in the x-axis and z-axis directions, measured in advance in the x-axis and z-axis directions, between the position of the speaker array in the space where the sound field is reproduced and the position away from that position by y _ref It is a function that

第一の空間と第二の空間の位置は、図２，４に示したものに限定されない。第一の空間と第二の空間は、隣接していても互いに離れた位置にあってもよい。また、第一の空間と第二の空間の向きもどのようなものであってもよい。 The positions of the first space and the second space are not limited to those shown in FIGS. The first space and the second space may be adjacent to each other or separated from each other. Also, the orientation of the first space and the second space may be any.

窓関数部７による窓関数の処理は、どの段階で行ってもよいし、多段で行ってもよい。すなわち、窓関数部７は、マイクロホンアレーと周波数変換部１との間、周波数変換部１と空間周波数変調部２との間、空間周波数変調部２と空間周波数変換部３との間、空間周波数変換部３と変換フィルタ部４との間、変換フィルタ部４と空間周波数逆変換部５との間、空間周波数逆変換部５と周波数逆変換部６との間の少なくとも１つの間に備えられていてもよい。音場収音再生装置の各部は、その各部に入力される信号について窓関数の処理が行われた場合には、その入力される信号に代えて上記と同様にしてその窓関数の処理がされた後の信号に対して処理を行う。 The window function processing by the window function unit 7 may be performed at any stage or in multiple stages. That is, the window function unit 7 is connected between the microphone array and the frequency conversion unit 1, between the frequency conversion unit 1 and the spatial frequency modulation unit 2, between the spatial frequency modulation unit 2 and the spatial frequency conversion unit 3, and spatial frequency. It is provided between at least one between the conversion unit 3 and the conversion filter unit 4, between the conversion filter unit 4 and the spatial frequency inverse transform unit 5, and between the spatial frequency inverse transform unit 5 and the frequency inverse transform unit 6. It may be. When the window function processing is performed on the signal input to each section, each section of the sound field sound collecting / reproducing apparatus performs the window function processing in the same manner as described above instead of the input signal. The processed signal is processed.

音場収音再生装置は、空間周波数変調部２を含みさえすれば、他の部を備えていなくてもよい。言い換えれば、周波数変換部１、空間周波数変換部３、変換フィルタ部４、空間周波数逆変換部５、周波数逆変換部６及び窓関数部７は必須ではない。 As long as the sound field sound collecting / reproducing apparatus includes the spatial frequency modulation unit 2, it may not include other units. In other words, the frequency conversion unit 1, the spatial frequency conversion unit 3, the conversion filter unit 4, the spatial frequency reverse conversion unit 5, the frequency reverse conversion unit 6, and the window function unit 7 are not essential.

例えば、窓関数部７はなくてもよい。この場合、第一実施形態においてはｉ＝１，…，Ｎ_ｘとしてスピーカＳｉが時間領域信号P^d _i(t)に基づいて音を再生し、第二実施形態においてはｉ＝１，…，Ｎ_ｘ，ｊ＝１，…，Ｎ_ｚとしてスピーカＳｉ−ｊが時間領域信号P^d _ij(t)に基づいて音を再生する。 For example, the window function unit 7 may not be provided. In this case, in the first embodiment, i = 1,..., _Nx , the speaker Si reproduces the sound based on the time domain signal ^Pd _i (t), and in the second embodiment, i = 1,. The speaker Si-j reproduces the sound based on the time domain signal P ^d _ij (t) as N _x , j = 1,..., N _z .

空間周波数変換部３及び空間周波数逆変換部５がない場合には、空間周波数変調信号が、変換フィルタ部４に提供される。変換フィルタ部４は、空間周波数変調信号に対してフィルタを適用して、フィルタ処理後信号を得る。フィルタ処理後信号は、周波数逆変換部６で時間領域信号に変換される。 When the spatial frequency conversion unit 3 and the spatial frequency inverse conversion unit 5 are not provided, a spatial frequency modulation signal is provided to the conversion filter unit 4. The conversion filter unit 4 applies a filter to the spatial frequency modulation signal to obtain a filtered signal. The filtered signal is converted into a time domain signal by the frequency inverse converter 6.

周波数変換部１の処理と空間周波数変調部２の処理と空間周波数変換部３の処理とを同時に行ってもよい。同様に、空間周波数逆変換部５の処理と周波数逆変換部６の処理とを同時に行ってもよい。また、空間周波数変換部３と空間周波数逆変換部５とを入れ替えてもよい。 The processing of the frequency conversion unit 1, the processing of the spatial frequency modulation unit 2, and the processing of the spatial frequency conversion unit 3 may be performed simultaneously. Similarly, the process of the spatial frequency inverse transform unit 5 and the process of the frequency inverse transform unit 6 may be performed simultaneously. Further, the spatial frequency conversion unit 3 and the spatial frequency inverse conversion unit 5 may be interchanged.

音場収音再生装置は、コンピュータによって実現することができる。この場合、この装置の各部の処理内容はプログラムによって記述される。そして、このプログラムをコンピュータで実行することにより、この装置における各部がコンピュータ上で実現される。 The sound field sound collecting / reproducing apparatus can be realized by a computer. In this case, the processing content of each part of this apparatus is described by a program. Then, by executing this program on a computer, each unit in this apparatus is realized on the computer.

この処理内容を記述したプログラムは、コンピュータで読み取り可能な記録媒体に記録しておくことができる。また、この形態では、コンピュータ上で所定のプログラムを実行させることにより、これらの装置を構成することとしたが、これらの処理内容の少なくとも一部をハードウェア的に実現することとしてもよい。 The program describing the processing contents can be recorded on a computer-readable recording medium. In this embodiment, these apparatuses are configured by executing a predetermined program on a computer. However, at least a part of these processing contents may be realized by hardware.

この発明は、上述の実施形態に限定されるものではなく、本発明の趣旨を逸脱しない範囲で適宜変更が可能である。 The present invention is not limited to the above-described embodiment, and can be modified as appropriate without departing from the spirit of the present invention.

１周波数変換部
２空間周波数変調部
３空間周波数変換部
４変換フィルタ部
５離散球面調和逆変換部
６周波数逆変換部
７窓関数部 DESCRIPTION OF SYMBOLS 1 Frequency conversion part 2 Spatial frequency modulation part 3 Spatial frequency conversion part 4 Conversion filter part 5 Discrete spherical harmonic inverse transformation part 6 Frequency inverse transformation part 7 Window function part

Claims

A spatial frequency modulation unit that generates a spatial frequency modulation signal, which is a signal obtained by shifting a frequency domain signal generated from a signal collected by a microphone array including two or more microphones at a predetermined rotational angle, and
A sound field collecting and reproducing apparatus for obtaining a reproduction signal from the spatial frequency modulation signal.

The sound field collecting and reproducing device according to claim 1,
A filter corresponding to the arrangement of speaker arrays including two or more speakers from which the reproduction signal is output for converting the sound field where the sound is collected into a signal to be reproduced is used as the spatial frequency modulation signal or the space. Further including a transform filter unit that applies a spatiotemporal frequency domain signal derived from the frequency modulation signal to generate a filtered signal;
A sound field sound collecting / reproducing apparatus that obtains the reproduced signal from the filtered signal instead of the spatial frequency modulation signal.

The sound field collecting and reproducing apparatus according to claim 2 ,
The microphone array is linear, the speaker array is linear, the array direction of the speaker array is the x-axis direction, and the predetermined rotation angle is a rotation angle whose angle of elevation is θ _rot. , P _i (ω) is the frequency domain signal, j is an imaginary unit, ω is the frequency, c is the speed of sound, k = ω / c, and the position of the microphone corresponding to the index i constituting the microphone array R _{m, i} = (x _{m, i} , 0,0), P _{mod, i} (ω) as the spatial frequency modulation signal,
The spatial frequency modulation unit calculates a spatial frequency modulation signal P _{mod, i} (ω) defined by the following equation:

Sound field recording and playback device.

The sound field collecting and reproducing device according to claim 3,
k _{x, n} is the wave number in the x-axis direction, n is its index, w _n is a weight determined based on n, A (ω) is a predetermined complex number, and the sound field to be reproduced is in the x-axis direction and the y-axis The amount of shift in the direction is dx, dy, H ₀ ⁽¹⁾ (・) is the 0th kind Hankel function, and y _ref is the position where the amplitudes match,
The transform filter unit applies a filter F _n (ω) defined by one of the following two expressions to the spatio-temporal frequency domain signal to generate a filtered signal D _n (ω).

Sound field recording and playback device.

The sound field collecting and reproducing apparatus according to claim 2 ,
The microphone array is arranged on the xz plane, and the predetermined rotation angle is a rotation angle in which the azimuth angle is θ _rot and the elevation angle is φ _rot , and P _ij (ω) Is the frequency domain signal, j is an imaginary unit, ω is the frequency, c is the speed of sound, k = ω / c, and the position of the microphone corresponding to the index i, j constituting the microphone array is r _{m, ij} = (x _{m, i} , 0, z _{m, j} ) and P _{mod, ij} (ω) as the spatial frequency modulation signal,
The spatial frequency modulation unit calculates a spatial frequency modulation signal P _{mod, ij} (ω) defined by the following equation:

Sound field recording and playback device.

The sound field collecting and reproducing apparatus according to claim 5,
k _{x, n} is the wave number in the x-axis direction, n is its index, k _{z, m} is the wave number in the z-axis direction, m is its index, w _nm is a weight determined based on n, m, and A (ω) is a predetermined complex number, and the amounts by which the sound field to be reproduced is shifted in the x-axis direction, y-axis direction, and z-axis direction are dx, dy, and dz, respectively.
The transform filter unit applies a filter F _n (ω) defined by one of the following two expressions to the spatio-temporal frequency domain signal to generate a filtered signal D _n (ω).

Sound field recording and playback device.

The sound field recording and reproducing device according to any one of claims 2 to 6,
A spatial frequency converter that converts the spatial frequency modulation signal into the spatio-temporal frequency domain signal by Fourier transform of the space;
A spatial frequency inverse transform unit for transforming the filtered signal into a frequency domain signal by inverse Fourier transform of space;
A frequency inverse transform unit for transforming the frequency domain signal into the reproduction signal that is a time domain signal by inverse Fourier transform;
A sound field collecting and reproducing apparatus further comprising:

A space in which a spatial frequency modulation unit generates a spatial frequency modulation signal, which is a signal obtained by shifting a frequency domain signal generated from a signal collected by a microphone array including two or more microphones at a predetermined rotation angle. Including a frequency modulation step;
A sound field collection and reproduction method for obtaining a reproduction signal from the spatial frequency modulation signal.

A sound field sound recording / reproducing program for causing a computer to function as each unit of the sound field sound collecting / reproducing apparatus according to claim 1.