JP2016092562A

JP2016092562A - Audio processing device and method, and program

Info

Publication number: JP2016092562A
Application number: JP2014223959A
Authority: JP
Inventors: 将文高橋; Masafumi Takahashi
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2014-11-04
Filing date: 2014-11-04
Publication date: 2016-05-23

Abstract

PROBLEM TO BE SOLVED: To obtain audio having sound pressure distribution with less variation.SOLUTION: An acoustic sensor arranged at each of a plurality of control points collects audio output from a sound source device on the basis of a measurement acoustic signal and outputs a measurement signal. A transfer function calculator calculates a transfer function from the sound source device to the acoustic sensor on the basis of the measurement signal and the measurement acoustic signal. An information input unit sets a control point in a listening area and a non-listening area. A filter coefficient calculation unit calculates an input filter coefficient by solving a linear programming problem for determining, on the basis of the transfer function and the setting result of each control point, the input filter coefficient for minimizing the maximum value of sound pressure in the non-listening area under the condition that an error of sound pressure in the listening area is within an allowable error range. The technology can be applied to an acoustic system.SELECTED DRAWING: Figure 1

Description

本技術は音声処理装置および方法、並びにプログラムに関し、特に、よりばらつきの少ない音圧分布の音声を得ることができるようにした音声処理装置および方法、並びにプログラムに関する。 The present technology relates to a sound processing apparatus and method, and a program, and more particularly, to a sound processing apparatus and method, and a program capable of obtaining sound with a sound pressure distribution with less variation.

例えば、音響信号を空間中に出力するために一般的なスピーカのような音源を用いる場合、その音源の指向特性による影響はあるものの概ね周囲にいる全ての人が音源から出力された音声を聴取することが可能である。 For example, when a sound source such as a general speaker is used to output an acoustic signal into the space, almost all people around it listen to the sound output from the sound source, although it is affected by the directivity characteristics of the sound source. Is possible.

そのため、音声を聴取できる空間または方向（以下、聴取エリアとも称する）と、音声を聴取できない空間または方向（以下、非聴取エリアとも称する）を設定し、聴取エリア内にいる者に限定して音声を聞かせるということはできず、そのような目的を達成するためには再生装置に対する何らかの工夫が必要となる。 Therefore, a space or direction in which sound can be heard (hereinafter also referred to as a listening area) and a space or direction in which sound cannot be heard (hereinafter also referred to as a non-listening area) are set, and the sound is limited to those who are in the listening area. In order to achieve such a purpose, some device for the playback device is required.

特定の位置に限定して音声を再生することが可能となれば、聴取者以外の人に対してその音声が騒音となることを防げる他、１つの空間内で複数の聴取者がそれぞれ別個の音響信号を聴取することが可能となる。このような要求に対する実現手段として、所望の方向に音波を放射する指向性スピーカシステムを用いることが考えられる。 If it is possible to reproduce the sound limited to a specific position, it is possible to prevent the sound from becoming a noise to a person other than the listener, and a plurality of listeners are individually separated in one space. An acoustic signal can be heard. It is conceivable to use a directional speaker system that emits sound waves in a desired direction as means for realizing such a requirement.

ここで指向性スピーカシステムには、主に幾何的な形状によって指向性を作り出すホーンスピーカや、振幅変調された強い超音波ビームの自己復調によって狭指向性を実現するパラメトリックスピーカ、複数のスピーカを設置することで全体としての指向性を形成するスピーカアレイなどがある。 Here, the directional speaker system is equipped with a horn speaker that creates directivity mainly by its geometric shape, a parametric speaker that realizes narrow directivity by self-demodulation of an amplitude-modulated strong ultrasonic beam, and multiple speakers. Thus, there is a speaker array that forms directivity as a whole.

これらの指向性スピーカシステムのうち、ホーンスピーカでは低周波で指向性を得るためには幾何学的な構造が大きくならざるを得ず、用途が限られてしまう。 Among these directional speaker systems, a horn speaker must have a large geometric structure in order to obtain directivity at a low frequency, and its application is limited.

また、超音波を用いるパラメトリックスピーカは狭い範囲の指向性を容易に得ることができるが、低周波域において音圧が不足するなどの理由で音質面において不十分である。さらに、パラメトリックスピーカでは、放射する超音波に対して自己復調される音波の比率が低いため、十分な音圧を確保するためには非常に強い超音波の曝露を許容しなければならない。 Moreover, a parametric speaker using ultrasonic waves can easily obtain directivity in a narrow range, but is insufficient in sound quality due to a lack of sound pressure in a low frequency range. Furthermore, since the ratio of the self-demodulated sound wave to the radiated ultrasonic wave is low in the parametric speaker, it is necessary to allow a very strong ultrasonic wave exposure to ensure a sufficient sound pressure.

これに対して、スピーカアレイを用いる方法では、ホーンスピーカに比べて小規模でありながら指向性を得ることができ、また可聴帯域の音を用いるため超音波暴露なども生じない。さらに、スピーカアレイによる方法では、各スピーカへと入力される入力信号間の相対的な振幅位相特性をフィルタにより変化させることで、スピーカアレイの配置を変えずに指向性を変更することが可能である。 On the other hand, in the method using the speaker array, the directivity can be obtained while being smaller than the horn speaker, and since the sound in the audible band is used, no ultrasonic exposure occurs. Furthermore, in the method using the speaker array, it is possible to change the directivity without changing the arrangement of the speaker array by changing the relative amplitude and phase characteristics between the input signals input to the speakers using a filter. is there.

しかしながら、スピーカアレイを用いる方法では、どのように振幅位相特性を変化させれば所望の指向特性を得られるかは自明ではなく、またスピーカの配置の仕方が指向性に与える影響も大きい。そのため、様々なフィルタ係数の算出方法やスピーカの配置方法が提案されている。 However, in the method using the speaker array, it is not obvious how the desired directional characteristics can be obtained by changing the amplitude and phase characteristics, and the way the speakers are arranged has a great influence on the directivity. For this reason, various filter coefficient calculation methods and speaker arrangement methods have been proposed.

例えば、聴取エリアと非聴取エリアにおける平均音響エネルギ密度と、それらの平均音響エネルギ密度の差を計算し、音源から出力される全体音響エネルギに対する音響エネルギー密度の差の比が最大になるように音源から放射される信号を計算する方法が提案されている（例えば、特許文献１参照）。この方法によれば、スピーカアレイにより音声を再生したときに、聴取エリアにいる聴取者のみが音声を聞き取ることができる。 For example, the average acoustic energy density in the listening area and the non-listening area and the difference between the average acoustic energy densities are calculated, and the ratio of the difference in acoustic energy density to the total acoustic energy output from the sound source is maximized. There has been proposed a method for calculating a signal radiated from (see, for example, Patent Document 1). According to this method, only the listener in the listening area can hear the sound when the sound is reproduced by the speaker array.

特許第５０７３７２４号公報Japanese Patent No. 5073724

しかしながら、上述した技術では、聴取エリアや非聴取エリアの平均音響エネルギ密度を用いて計算を行っているため、聴取エリアや非聴取エリアにおける音圧分布に空間的なばらつきが生じてしまう。 However, in the above-described technique, calculation is performed using the average acoustic energy density of the listening area and the non-listening area, so that spatial variation occurs in the sound pressure distribution in the listening area and the non-listening area.

本技術は、このような状況に鑑みてなされたものであり、よりばらつきの少ない音圧分布の音声を得ることができるようにするものである。 The present technology has been made in view of such a situation, and makes it possible to obtain sound with a sound pressure distribution with less variation.

本技術の一側面の音声処理装置は、聴取エリア内にいくつかの制御点を設定するとともに、非聴取エリア内にいくつかの制御点を設定するエリア設定部と、前記聴取エリア内の各前記制御点の目的音圧を設定する目的音圧設定部と、前記目的音圧の許容誤差を設定する許容誤差設定部と、全前記制御点の合計個数未満の所定個数の各音源について、前記音源と各前記制御点のそれぞれとの間の伝達関数に基づいて、前記聴取エリア内の前記制御点の音圧の前記目的音圧に対する誤差が前記許容誤差の範囲内に収まるという条件下で、前記非聴取エリア内の前記制御点の音圧を最小化する、前記音源に対応するフィルタのフィルタ係数を算出するフィルタ係数計算部とを備える。 An audio processing device according to an aspect of the present technology sets an area setting unit that sets several control points in a non-listening area, and sets each control point in the listening area. A target sound pressure setting unit that sets a target sound pressure at a control point; an allowable error setting unit that sets an allowable error of the target sound pressure; and a predetermined number of sound sources less than the total number of all the control points On the basis of the transfer function between the control point and each of the control points, under the condition that the error of the sound pressure of the control point in the listening area with respect to the target sound pressure falls within the allowable error range. A filter coefficient calculation unit for calculating a filter coefficient of a filter corresponding to the sound source that minimizes the sound pressure of the control point in the non-listening area.

前記フィルタ係数計算部には、前記フィルタのゲインが所定の上限値を超えないという制約条件をさらに加えて前記フィルタ係数を算出させることができる。 The filter coefficient calculation unit may calculate the filter coefficient by further adding a constraint that the gain of the filter does not exceed a predetermined upper limit value.

前記エリア設定部には、前記聴取エリアおよび前記非聴取エリアとは異なるその他のエリアにいくつかの制御点を設定させ、前記フィルタ係数計算部には、前記その他のエリア内の前記制御点の音圧が、所定の最大音圧を超えないという制約条件をさらに加えて前記フィルタ係数を算出させることができる。 The area setting unit is configured to set some control points in other areas different from the listening area and the non-listening area, and the filter coefficient calculation unit is configured to generate sound of the control points in the other areas. The filter coefficient can be calculated by further adding a constraint that the pressure does not exceed a predetermined maximum sound pressure.

前記最大音圧を、前記聴取エリア内の前記制御点の前記目的音圧の最小値に基づいて定めるようにすることができる。 The maximum sound pressure may be determined based on a minimum value of the target sound pressure at the control point in the listening area.

前記フィルタ係数計算部には、線型計画問題を解くことにより、前記フィルタ係数を算出させることができる。 The filter coefficient calculation unit can calculate the filter coefficient by solving a linear programming problem.

本技術の一側面の音声処理方法またはプログラムは、聴取エリア内にいくつかの制御点を設定するとともに、非聴取エリア内にいくつかの制御点を設定し、前記聴取エリア内の各前記制御点の目的音圧を設定し、前記目的音圧の許容誤差を設定し、全前記制御点の合計個数未満の所定個数の各音源について、前記音源と各前記制御点のそれぞれとの間の伝達関数に基づいて、前記聴取エリア内の前記制御点の音圧の前記目的音圧に対する誤差が前記許容誤差の範囲内に収まるという条件下で、前記非聴取エリア内の前記制御点の音圧を最小化する、前記音源に対応するフィルタのフィルタ係数を算出するステップを含む。 An audio processing method or program according to one aspect of the present technology sets several control points in a listening area, sets several control points in a non-listening area, and sets each control point in the listening area. A target sound pressure, a tolerance of the target sound pressure, and a transfer function between the sound source and each of the control points for a predetermined number of sound sources less than the total number of all the control points. The sound pressure of the control point in the non-listening area is minimized under the condition that the error of the sound pressure of the control point in the listening area with respect to the target sound pressure is within the allowable error range. Calculating a filter coefficient of a filter corresponding to the sound source.

本技術の一側面においては、聴取エリア内にいくつかの制御点が設定されるとともに、非聴取エリア内にいくつかの制御点が設定され、前記聴取エリア内の各前記制御点の目的音圧が設定され、前記目的音圧の許容誤差が設定され、全前記制御点の合計個数未満の所定個数の各音源について、前記音源と各前記制御点のそれぞれとの間の伝達関数に基づいて、前記聴取エリア内の前記制御点の音圧の前記目的音圧に対する誤差が前記許容誤差の範囲内に収まるという条件下で、前記非聴取エリア内の前記制御点の音圧を最小化する、前記音源に対応するフィルタのフィルタ係数が算出される。 In one aspect of the present technology, several control points are set in the listening area and several control points are set in the non-listening area, and the target sound pressure of each control point in the listening area is set. Is set, an allowable error of the target sound pressure is set, and for a predetermined number of sound sources less than the total number of all the control points, based on a transfer function between the sound source and each of the control points, Minimizing the sound pressure of the control point in the non-listening area under the condition that the error of the sound pressure of the control point in the listening area with respect to the target sound pressure is within the allowable error range, A filter coefficient of a filter corresponding to the sound source is calculated.

本技術の一側面によれば、よりばらつきの少ない音圧分布の音声を得ることができる。 According to one aspect of the present technology, it is possible to obtain sound having a sound pressure distribution with less variation.

なお、ここに記載された効果は必ずしも限定されるものではなく、本開示中に記載された何れかの効果であってもよい。 Note that the effects described here are not necessarily limited, and may be any of the effects described in the present disclosure.

音響システムの構成例を示す図である。It is a figure which shows the structural example of an acoustic system. フィルタ係数算出処理を説明するフローチャートである。It is a flowchart explaining a filter coefficient calculation process. 音声再生処理を説明するフローチャートである。It is a flowchart explaining audio | voice reproduction | regeneration processing. 音響システムの構成例を示す図である。It is a figure which shows the structural example of an acoustic system. 音声再生処理を説明するフローチャートである。It is a flowchart explaining audio | voice reproduction | regeneration processing. ユースケースについて説明する図である。It is a figure explaining a use case. ユースケースについて説明する図である。It is a figure explaining a use case. コンピュータの構成例を示す図である。It is a figure which shows the structural example of a computer.

以下、図面を参照して、本技術を適用した実施の形態について説明する。 Hereinafter, embodiments to which the present technology is applied will be described with reference to the drawings.

〈第１の実施の形態〉
〈本技術の概要〉
本技術は、複数のスピーカ間での相対的な振幅位相特性を変化させることにより空間の音場特性を変化させ、予め設定した所望の聴取エリア内部のみで音響信号を聴取できるようにするものである。 <First Embodiment>
<Outline of this technology>
This technology changes the sound field characteristics of a space by changing the relative amplitude and phase characteristics among a plurality of speakers so that an acoustic signal can be heard only within a predetermined desired listening area. is there.

まず、本技術の概要について説明する。 First, an outline of the present technology will be described.

（本技術の基本的な原理について）
例えば、音響信号に含まれるある周波数成分ωについて考える。音響信号に基づく音声の再生空間上に、聴取者に音声を聴取できるようにする聴取エリアと、音声を聴取できないようにする非聴取エリアとを設け、聴取エリアをV_targetと表し、非聴取エリアをV_cancelと表すとする。また、再生空間上に音源としてN個のスピーカが存在するとする。 (About the basic principle of this technology)
For example, consider a certain frequency component ω included in an acoustic signal. On the sound reproduction space based on the acoustic signal, a listening area for allowing the listener to listen to the sound and a non-listening area for preventing the sound from being heard are provided, and the listening area is represented as V _target, and the non-listening area. _Is expressed as V _cancel . Further, it is assumed that N speakers exist as sound sources in the reproduction space.

ここで、N個のスピーカのなかのn番目（但し0≦n≦N-1）のスピーカから、聴取エリアV_targetまたは非聴取エリアV_cancel内にある所定の位置rまでの伝達関数をH_n(r)とする。また、n番目のスピーカに供給される音響信号に対して施されるフィルタ処理に用いる入力フィルタ係数をW_nとする。すなわち、入力された音響信号が、入力フィルタ係数W_nが用いられてフィルタ処理された後、スピーカで再生されるものとする。 Here, the transfer function from the n-th speaker among the N speakers (where 0 ≦ n ≦ N−1) to a predetermined position r within the listening area V _target or the non-listening area V _{cancel is} represented by H _n. (r). Also, let W _n be an input filter coefficient used for filter processing performed on the acoustic signal supplied to the nth speaker. In other words, it is assumed that the input acoustic signal is filtered using the input filter coefficient W _n and then reproduced by a speaker.

ここで、入力フィルタ係数W_nは、どのようなものであってもよいが、ここでは各周波数成分ごとの係数、つまり各周波数成分に対して乗算される係数を要素とする行列であるものとする。また、以下、入力フィルタ係数W_nからなるフィルタを入力フィルタとも称することとする。 Here, the input filter coefficient W _n may be any value, but here, it is assumed that the coefficient is a coefficient for each frequency component, that is, a matrix having a coefficient multiplied by each frequency component as an element. To do. Further, hereinafter, it will be referred to as both an input filter filters consisting of the input filter coefficients W _n.

さらに、聴取エリアV_targetにおける所望の音圧特性、すなわち目的音圧レベルをd(r)とし、音圧特性d(r)からの許容誤差をσとする。ここで、音圧特性d(r)は複素音圧である。 Further, a desired sound pressure characteristic in the listening area V _target , that is, a target sound pressure level is d (r), and an allowable error from the sound pressure characteristic d (r) is σ. Here, the sound pressure characteristic d (r) is a complex sound pressure.

このとき、N個のスピーカからなるスピーカアレイの所定位置rにおける指向特性D(r)、つまり実際に得られるであろう音圧は、次式（１）に示されるようになる。 At this time, the directivity characteristic D (r) at a predetermined position r of the speaker array composed of N speakers, that is, the sound pressure that would actually be obtained is expressed by the following equation (1).

すなわち、指向特性D(r)は、N個の各スピーカの伝達関数H_n(r)と入力フィルタ係数W_nとの積の総和により表すことができる。 That is, the directivity characteristic D (r) can be represented by the sum of products of the transfer function H _n (r) of each of the N speakers and the input filter coefficient W _n .

そこで、入力フィルタ係数W_nを変数とする次式（２）に示す最大値最小化問題を考える。 Therefore, the maximum value minimization problem shown in the following equation (2) using the input filter coefficient W _n as a variable is considered.

式（２）に示される最大値最小化問題は、聴取エリアV_targetにおける音圧ΣH_n(r)W_n（但し位置rは聴取エリアV_target内）と所望の音圧特性d(r)との間の誤差が許容誤差σ以内、つまり許容誤差の範囲内であるという条件下で、非聴取エリアV_cancelにおける音圧ΣH_n(r)W_n（但し位置rは非聴取エリアV_cancel内）を最小化する入力フィルタ係数W_nを求めるものである。より詳細には、非聴取エリアV_cancelにおいて音圧が最大となる位置rにおける音圧を最小化する入力フィルタ係数W_nが求められる。 Maximum minimization problem shown in equation (2), the sound pressure .SIGMA.H _n (r) W _n (where position r within the listening area V _target) in the listening area V _target and the desired sound pressure characteristic d (r) within the error tolerances σ between, under the condition that that is in the range of allowable error, non-listening area V _cancel sound pressure at ΣH _{_n} (r) W _n (where position r is in a non-listening area V _cancel) To obtain an input filter coefficient W _n that minimizes. More specifically, the input filter coefficient W _n that minimizes the sound pressure at the position r where the sound pressure is maximum in the non-listening area V _cancel is obtained.

なお、音圧特性d(r)と許容誤差σは位置rごとに定められるようにしてもよいし、いくつかの位置rで同じものが用いられてもよい。 The sound pressure characteristic d (r) and the allowable error σ may be determined for each position r, or the same one may be used at several positions r.

この最大値最小化問題を解き、入力フィルタ係数W_nを求めれば、入力フィルタ係数W_nを用いたフィルタ処理によってスピーカアレイを構成する各スピーカから出力される音声の相互間の周波数特性および振幅位相特性を制御することができる。 If this maximum value minimization problem is solved and the input filter coefficient W _n is obtained, the frequency characteristics and amplitude phases between the sounds output from the speakers constituting the speaker array by the filter processing using the input filter coefficient W _n Properties can be controlled.

これにより、聴取エリアV_targetで聴取される音声の音圧に比べて、非聴取エリアV_cancelで聴取される音声の音圧を減少させ、聴取エリアV_targetのみで音声を聞き取ることができるようにすることができる。 Thus, compared to the sound pressure of the sound heard at the listening area V _target, to reduce the sound pressure of the sound heard by the non-listening area V _cancel, so as to be able to listen to audio only listening area V _target can do.

しかも式（２）に示す最大値最小化問題では、比較的人間の聴覚上問題となりにくい聴取エリアV_targetにおける音質のばらつきが許容誤差σの分だけ許容されるようにしている。そのため、聴覚上、比較的問題となりやすい非聴取エリアV_cancelにおける音圧をその許容範囲の中で最大限に抑えることが可能となり、聴取エリアや非聴取エリアにおける空間的な音圧のばらつきを抑えるとともに、より高い分離性能を得ることが可能となる。 In addition, in the maximum value minimization problem shown in Expression (2), the sound quality variation in the listening area V _target that is relatively difficult to cause human hearing problems is allowed by the allowable error σ. Therefore, it is possible to suppress the sound pressure in the non-listening area V _cancel , which is relatively problematic from the viewpoint of hearing, to the maximum within the permissible range, and to suppress variations in spatial sound pressure in the listening and non-listening areas. At the same time, higher separation performance can be obtained.

また、この最大値最小化問題では、聴取エリアV_targetにおける音圧の許容誤差σを調整することにより、聴取エリアV_targetにおける音質と非聴取エリアV_cancelにおける音圧の間のトレードオフを容易に調整することができる。 Further, in this maximum minimization problem, by adjusting the tolerance σ sound pressure at the listening area V _target, easily a trade-off between sound quality and non-listening area V _cancel sound in pressure in the listening area V _target Can be adjusted.

ところで、式（２）に示した最大値最小化問題は、このままでは複素数の絶対値演算のような非線型な要素が含まれるため、数学的な取り扱いが難しい。そこで、この最大値最小化問題を線型計画問題へと変形すれば、容易に入力フィルタ係数W_nを求めることができるようになる。 By the way, the maximum value minimization problem shown in Equation (2) is difficult to mathematically handle because it includes a non-linear element such as a complex absolute value calculation. Therefore, if this maximum value minimization problem is transformed into a linear programming problem, the input filter coefficient W _n can be easily obtained.

線型計画問題への変形を行うにあたり、まず、非聴取エリアV_cancel内の音声の音圧の最大値δを次式（３）により表すこととする。 In performing the transformation to the linear planning problem, first, the maximum value δ of the sound pressure of the voice in the non-listening area V _cancel is expressed by the following equation (3).

そして、式（３）に示す最大値δを用いて、上述した式（２）を以下の式（４）に示すように書き換える。 Then, using the maximum value δ shown in Equation (3), Equation (2) described above is rewritten as shown in Equation (4) below.

ここで、式（４）における｜ΣH_n(r)W_n｜≦δは、非聴取エリアV_cancel内の各位置rにおける音圧が最大値δを超えないという制約を表している。 Here, | ΣH _n (r) W _n | ≦ δ in Expression (4) represents a constraint that the sound pressure at each position r in the non-listening area V _cancel does not exceed the maximum value δ.

次に、実回転定理より、任意の複素数Zについて次式（５）が成立することから、この式（５）を用いて、式（４）における複素数の絶対値に関する条件を実数の条件に変換することができる。これにより、式（４）は以下の式（６）へと変形される。 Next, according to the real rotation theorem, the following equation (5) holds for any complex number Z. Using this equation (5), the condition relating to the absolute value of the complex number in equation (4) is converted into a real number condition. can do. Thereby, Formula (4) is deform | transformed into the following formula | equation (6).

なお、式（５）および式（６）においてθは複素数の位相、すなわち回転角度を示している。また、jは虚数単位を示しており、Re{}は複素数の実部を示している。 In Equations (5) and (6), θ represents a complex phase, that is, a rotation angle. J represents an imaginary unit, and Re {} represents a real part of a complex number.

また、複素数の虚部をIm{}で表すこととし、位置rを変数とする関数A_n(r)を次式（７）で表すとする。なお、式（７）においてmは自然数である。 Further, it is assumed that an imaginary part of a complex number is represented by Im {}, and a function _An (r) having a position r as a variable is represented by the following equation (7). In Equation (7), m is a natural number.

さらに、位置rに応じた関数a(r)を次式（８）に示すものとし、式（７）に示した関数A_n(r)を要素としてもつ大きさ2N+1の行ベクトルA(r,θ)を式（９）に示すものとする。 Further, a function a (r) corresponding to the position r is represented by the following equation (8), and a row vector A (2N + 1) having the function A _n (r) represented by equation (7) as an element: r, θ) is represented by equation (9).

また、各入力フィルタ係数W_nの実部と虚部、および最大値δを要素としてもつ大きさ2N+1の列ベクトルxを式（１０）に示すものとし、大きさ2N+1の行ベクトルcを式（１１）に示すものとし、位置rと回転角度θを変数とする関数b(r,θ)を式（１２）に示すものとする。なお、式（１１）では、行ベクトルcの最後の要素以外の他の要素の値は0となっている。 Further, a column vector x having a size 2N + 1 having the real part and the imaginary part of each input filter coefficient W _n and the maximum value δ as elements is shown in Expression (10), and a row vector having a size 2N + 1. It is assumed that c is expressed by equation (11), and a function b (r, θ) having variables of position r and rotation angle θ is expressed by equation (12). In Expression (11), the values of the elements other than the last element of the row vector c are 0.

このとき、行ベクトルc、列ベクトルx、行ベクトルA(r,θ)、および関数b(r,θ)を用いて、式（６）を次式（１３）に示す線型半無限計画問題として表すことができる。 At this time, using the row vector c, the column vector x, the row vector A (r, θ), and the function b (r, θ), the equation (6) is expressed as a linear semi-infinite programming problem represented by the following equation (13). Can be represented.

これにより、式（２）に示した最大値最小化問題が、式（１３）に示す線型半無限計画問題へと変形されたことになる。 As a result, the maximum value minimization problem shown in Expression (2) is transformed into a linear semi-infinite planning problem shown in Expression (13).

式（１３）に示す線型半無限計画問題では、位置rおよび回転角度θが連続値であるため制約条件の数が無限個存在するが、位置rおよび回転角度θについて離散化を行うことで、式（１３）に示す線型半無限計画問題を通常の線型計画問題として扱うことができる。これにより、その線型計画問題を主双対内点法などの線型計画法により計算することで、最適な入力フィルタ係数W_nを算出することが可能となる。 In the linear semi-infinite programming problem shown in Equation (13), the position r and the rotation angle θ are continuous values, and thus there are an infinite number of constraints. By discretizing the position r and the rotation angle θ, The linear semi-infinite programming problem expressed by Equation (13) can be treated as a normal linear programming problem. Thus, the optimal input filter coefficient W _n can be calculated by calculating the linear programming problem by a linear programming method such as the main dual interior point method.

ここで、回転角度θの離散化は、実回転定理の近似精度にのみ関わる。これに対して、位置rの離散化では、聴取エリアV_targetと非聴取エリアV_cancelの両空間を、音圧を制御する位置rである制御点で離散化し、空間中の音場の音圧分布を制御点に配置したマイクロフォン等のセンサの値によって代表させることになる。そのため、位置rの離散化は、得られる解の性質に大きな影響を持つ。具体的には、マイクロフォン等のセンサの数が、スピーカ等の音源の数以下である場合、聴取エリアV_targetにおける許容誤差σ＝0という条件下でも、非聴取エリアV_cancel内の制御点での最大音圧を0とする解xが存在し得る。 Here, the discretization of the rotation angle θ is related only to the approximation accuracy of the actual rotation theorem. On the other hand, in the discretization of the position r, both the listening area V _target and the non-listening area V _cancel are discretized at the control point that is the position r for controlling the sound pressure, and the sound pressure of the sound field in the space is determined. The distribution is represented by the value of a sensor such as a microphone arranged at the control point. Therefore, discretization of the position r has a great influence on the properties of the obtained solution. Specifically, when the number of sensors such as microphones is equal to or less than the number of sound sources such as speakers, the control point in the non-listening area V _cancel can be used even under the condition that the tolerance σ = 0 in the listening area V _target . There may be a solution x with a maximum sound pressure of 0.

しかしながら、そのようにして得られる解xは、定在波の節のようにセンサのある位置においてピンポイントに音圧を0とする解となり得るため、センサのある位置以外の非聴取エリアV_cancelにおける音圧は必ずしも最小化されない。したがって、センサの数は少なくとも音源数より1つ以上多い数だけ配置することが求められる。 However, since the solution x thus obtained can be a solution where the sound pressure is zero at a certain position of the sensor as in the standing wave section, the non-listening area V _cancel other than the position where the sensor is located. The sound pressure at is not necessarily minimized. Therefore, it is required to arrange at least one sensor more than the number of sound sources.

（入力フィルタの安定化について）
また、式（１３）から得られる線型計画問題に対して、さらに他の制約条件を追加することで、得られる入力フィルタの性質を調整することが可能である。例えば、線型計画問題の制約条件として、次式（１４）に示す制約条件を追加することで、入力フィルタのゲインについて上限を設定することができる。 (Stabilization of input filter)
Moreover, it is possible to adjust the property of the input filter obtained by adding another constraint condition to the linear programming problem obtained from Expression (13). For example, the upper limit can be set for the gain of the input filter by adding the constraint condition shown in the following equation (14) as the constraint condition of the linear programming problem.

なお、式（１４）においてg_maxはスカラ値であり、入力フィルタ係数W_nの上限値を示している。したがって、式（１４）により示される制約条件は、入力フィルタ係数W_nの各要素の大きさ（絶対値）が上限値g_maxを超えないという条件となる。 In Expression (14), g _max is a scalar value and indicates the upper limit value of the input filter coefficient W _n . Therefore, the constraint condition expressed by the equation (14) is a condition that the size (absolute value) of each element of the input filter coefficient W _n does not exceed the upper limit value g _max .

式（１４）により示される制約条件は複素数の絶対値を含んでいるが、以上において説明した最大値最小化問題を線型計画問題へと変形する場合と同様の変形を施すことにより、この制約条件を容易に線型計画問題に組み込むことが可能である。 The constraint condition represented by the equation (14) includes the absolute value of a complex number, but this constraint condition can be obtained by performing the same modification as the case where the maximum value minimization problem described above is transformed into a linear programming problem. Can be easily incorporated into linear planning problems.

このように入力フィルタのゲイン（入力フィルタ係数W_n）に対して上限値g_maxを設定することにより、入力とする音響信号に基づいて音声を再生する際に、スピーカやアンプリファイアなどの音源装置に不具合が生じ得るような非常に高いゲインとなる周波数帯を持つ入力フィルタが得られてしまうことを回避することができる。 Thus, by setting the upper limit value g _max for the gain of the input filter (input filter coefficient W _n ), a sound source device such as a speaker or an amplifier is used to reproduce sound based on the input acoustic signal. Thus, it is possible to avoid an input filter having a frequency band with a very high gain that may cause problems.

すなわち、スピーカ等の音源の種類によっては、特定の周波数帯に高い振幅の成分を持つ信号を入力すると音が割れてしまうなどの不具合を有することがある。そこで、式（１４）に示したフィルタゲインの上限値に関する制約条件を加えることで、そのような不具合の発生を抑えつつも聴取エリアにおける音質を確保し、非聴取エリアにおける音圧を最小化することができる。 That is, depending on the type of sound source such as a speaker, there is a problem that sound is broken when a signal having a high amplitude component in a specific frequency band is input. Therefore, by adding the constraint condition regarding the upper limit value of the filter gain shown in Expression (14), the sound quality in the listening area is ensured while suppressing the occurrence of such a problem, and the sound pressure in the non-listening area is minimized. be able to.

また、入力フィルタ係数W_nに対して上限値g_maxを設定すると、入力フィルタ係数W_nの各要素の大きさが上限値g_max以下となるので、入力フィルタの周波数軸方向の過度なゲインのばらつきを抑えることができる。その結果、インパルス応答が短く実装が容易な入力フィルタを得ることができる。換言すれば、上限値g_maxを設定することで、入力フィルタとして、時間領域でみたときにタップ数が短く、エリアシング等の生じない安定なフィルタを得ることができる。 The input when the filter coefficients W _n to set the upper limit value g _max, since the size of each element of the input filter coefficient W _n is equal to or less than the upper limit value g _max, the input filter frequency axis excessive gain Variation can be suppressed. As a result, an input filter with a short impulse response and easy mounting can be obtained. In other words, by setting the upper limit value g _max , it is possible to obtain a stable filter that has a small number of taps when viewed in the time domain and does not cause aliasing or the like as an input filter.

このような入力フィルタのゲインのばらつきを抑える効果は、上述した線型計画問題を解くことで得られる非聴取エリアにおける音声の音圧とのトレードオフになっている。すなわち、上限値g_maxの値をより小さくすると、よりゲインのばらつきを抑えることができ、逆に上限値g_maxの値をより大きくすると、非聴取エリアにおける音圧の最大値をより小さくし、聴取エリアと非聴取エリアとの音圧差、つまり分離性能を向上させることができる。 Such an effect of suppressing the variation in the gain of the input filter is a trade-off with the sound pressure of the sound in the non-listening area obtained by solving the above-described linear planning problem. That is, if the value of the upper limit value g _max is made smaller, variations in gain can be suppressed, and conversely, if the value of the upper limit value g _max is made larger, the maximum value of the sound pressure in the non-listening area is made smaller, The sound pressure difference between the listening area and the non-listening area, that is, separation performance can be improved.

したがって、入力フィルタ係数W_nの上限値g_maxの値を調整することで、算出される入力フィルタの安定性と、得られる聴取エリアおよび非聴取エリアの分離性能との間のトレードオフを容易に調整することができるようになる。 Therefore, by adjusting the value of the upper limit value g _max of the input filter coefficient W _n , it is easy to make a trade-off between the calculated stability of the input filter and the separation performance of the obtained listening area and non-listening area. Will be able to adjust.

（非限定エリアの設定と拘束条件について）
また、上述した聴取エリアV_targetにも非聴取エリアV_cancelにも含まれない空間または方向を、その他のエリア（以下、非限定エリアとも称する）とし、非限定エリアについて別の制約条件を設定するようにしてもよい。 (Regarding non-limited area settings and constraint conditions)
In addition, a space or direction that is not included in the above-described listening area V _target or non-listening area V _cancel is set as another area (hereinafter also referred to as a non-limited area), and another restriction condition is set for the non-limited area. You may do it.

ここで、非限定エリアは、例えば聴取エリアV_targetではないものの積極的に音圧を低減させる必要性の低い空間または方向とされる。 Here, the non-limited area is, for example, a space or a direction that is not the listening area V _target but has a low necessity of actively reducing the sound pressure.

例として、複数の聴取エリアでそれぞれ別個の音響信号（音声）を聴取できるようにすることを考える。 As an example, let us consider that it is possible to listen to different acoustic signals (voices) in a plurality of listening areas.

このように複数の聴取エリアで、互いに異なる音声を聴取できるようにする方法としては、次のような実現方法が考えられる。すなわち、まず各音声の音響信号について再生空間上に１つの聴取エリアを設定し、聴取エリア以外の空間を非聴取エリアとして音響信号ごとに入力フィルタを算出する。そして、各音響信号に対して入力フィルタを用いたフィルタ処理を施し、その結果得られた音響信号を足し合わせて音源であるスピーカ等に供給して音声を再生させればよい。 As a method for enabling different sounds to be heard in a plurality of listening areas as described above, the following realization methods can be considered. That is, first, a listening area is set on the reproduction space for each sound signal, and an input filter is calculated for each sound signal with a space other than the listening area as a non-listening area. Then, filtering processing using an input filter may be performed on each acoustic signal, and the resultant acoustic signals may be added and supplied to a speaker or the like as a sound source to reproduce sound.

ここで、１つの音響信号に注目すると、再生空間のうち、注目する音響信号とは異なる他の音響信号に対して聴取エリアとされた空間は、その注目する音響信号の入力フィルタの算出時には、真の非聴取エリアとされる。しかし、再生空間において、どの音響信号の聴取エリアでもない空間は非限定エリアとすることができる。 Here, when attention is paid to one acoustic signal, a space that is set as a listening area for another acoustic signal different from the acoustic signal of interest in the reproduction space is calculated when calculating an input filter of the acoustic signal of interest. It is a true non-listening area. However, a space that is not a listening area for any acoustic signal in the reproduction space can be a non-limited area.

この場合、非限定エリアにおける音の漏れをある程度許容することにより、変数の自由度が真の非聴取エリアにおける音圧の抑圧に回され、各音響信号の分離性能の向上を期待することができる。 In this case, by allowing a certain amount of sound leakage in the non-limited area, the degree of freedom of the variable is reduced to the suppression of the sound pressure in the true non-listening area, and it can be expected to improve the separation performance of each acoustic signal. .

非限定エリアをV_otherwiseとすると、例えば非限定エリアV_otherwiseにおける制約条件として、次式（１５）に示すものが考えられる。 _Assuming that the non-limited area is V _otherwise , for example, as a constraint condition in the non-limited area V _otherwise , the following equation (15) can be considered.

なお、式（１５）におけるd_minは、次式（１６）により示されるように聴取エリアV_targetにおける音声の目的音圧、つまり音圧特性d(r)の最小値を示している。

Note that d _min in Expression (15) indicates the _target sound pressure of the sound in the listening area V _target , that is, the minimum value of the sound pressure characteristic d (r), as shown by the following Expression (16).

したがって、式（１５）に示される制約条件は、非限定エリアV_otherwise内の各位置r、つまり各制御点における音圧が、許容誤差σを考慮した聴取エリアV_target内の目的音圧の最小値以下となる（超えない）という条件となる。 Therefore, the constraint condition shown in Expression (15) is that the sound pressure at each position r in the non-limited area V _otherwise , that is, at each control point, is the minimum of the target sound pressure in the listening area V _target considering the allowable error σ. The condition is that the value is below (not exceeding) the value.

換言すれば、式（１５）に示される制約条件は、非限定エリアV_otherwise内の各制御点の音圧が、非限定エリアV_otherwiseに対して定めた所定の最大音圧を超えないという制約条件となる。この例では、式（１５）の右辺であるd_min-σが最大音圧を示している。 In other words, the constraints shown in equation (15), the constraint that the sound pressure of the respective control points within the non-limiting area V _otherwise does not exceed a predetermined maximum sound pressure determined for non-limiting area V _otherwise It becomes a condition. In this example, d _min -σ, which is the right side of Equation (15), indicates the maximum sound pressure.

なお、最大音圧は、その他、αd_min（但し、α＜1）など、聴取エリアV_targetにおける目的音圧の最小値に基づいて定められるもの、つまり聴取エリアV_targetにおける音圧の最小値を超えないものであれば、どのようなものであってもよい。また、非限定エリアV_otherwiseに対して定める最大音圧は、制御点ごとに異なる値としてもよい。そのような場合、例えば複数の非限定エリアV_otherwiseごとに段階的に最大音圧を設定すること等が可能となる。 The maximum sound pressure is determined based on the minimum value of the target sound pressure in the listening area V _target , such as αd _min (where α <1), that is, the minimum sound pressure in the listening area V _target . As long as it does not exceed, it may be anything. Further, the maximum sound pressure determined for the non-limited area V _otherwise may be a different value for each control point. In such a case, for example, the maximum sound pressure can be set stepwise for each of the plurality of non-limited areas V _otherwise .

このような式（１５）に示される制約条件も、最大値最小化問題を線型計画問題へと変形する場合と同様の変形を施すことにより、容易に式（１３）から得られる線型計画問題の制約条件として組み込むことが可能である。 The constraint condition shown in the equation (15) is also applied to the linear programming problem easily obtained from the equation (13) by performing the same modification as the case where the maximum value minimization problem is transformed into the linear programming problem. It can be incorporated as a constraint.

式（１５）に示される制約条件は、単に非限定エリアを設定するだけで、非限定エリアに特に拘束条件を設けない場合、聴取エリア以外のエリアに強い音圧を発生させることが生じ得るため、そのようなことを防止するための条件である。 The constraint condition shown in the equation (15) is that a non-limited area is simply set, and if there is no particular constraint condition in the non-limited area, strong sound pressure may be generated in an area other than the listening area. This is a condition for preventing such a situation.

以上のように再生空間に非限定エリアを設定し、上述した線型計画問題に非限定エリアにおける制約条件をさらに加えることで、意図せぬ大音圧のエリアの発生を防止し、最適な入力フィルタを得ることができるようになる。 As described above, the unrestricted area is set in the reproduction space, and the constraint condition in the unrestricted area is further added to the above-mentioned linear planning problem, thereby preventing the occurrence of an unintended high sound pressure area and the optimum input filter. You will be able to get

〈音響システムの構成例〉
次に、以上において説明した本技術の具体的な実施の形態について説明する。 <Example of acoustic system configuration>
Next, specific embodiments of the present technology described above will be described.

図１は、本技術を適用した音響システムの一実施の形態の構成例を示す図である。 FIG. 1 is a diagram illustrating a configuration example of an embodiment of an acoustic system to which the present technology is applied.

図１に示す音響システム１１は、フィルタ計算部２１および音声再生部２２を有しており、この音響システム１１は、音声を再生する前に、事前測定により予め伝達関数および入力フィルタを求めておくフィードフォワード型のシステムとなっている。 The acoustic system 11 shown in FIG. 1 includes a filter calculation unit 21 and an audio reproduction unit 22, and the acoustic system 11 obtains a transfer function and an input filter in advance by prior measurement before reproducing the audio. It is a feedforward type system.

すなわち、フィルタ計算部２１は、入力された各種の情報および伝達関数に基づいて入力フィルタを算出し、音声再生部２２に供給する。 That is, the filter calculation unit 21 calculates an input filter based on various types of input information and transfer functions, and supplies the input filter to the audio reproduction unit 22.

音声再生部２２は例えば再生空間に配置され、フィルタ計算部２１から供給された入力フィルタを用いて、供給された入力音響信号にフィルタ処理を施し、その結果得られた出力音響信号に基づいて音声を出力する。 The audio reproduction unit 22 is arranged in a reproduction space, for example, and performs filtering processing on the supplied input acoustic signal using the input filter supplied from the filter calculation unit 21, and audio is generated based on the output acoustic signal obtained as a result. Is output.

この例では、再生空間はエリアR1乃至エリアR3の3つのエリアに分けられている。そして、エリアR1およびエリアR3が、音声を再生すべきでない空間または方向である非聴取エリアとされており、エリアR2が、音声を再生すべき空間または方向である聴取エリアとされている。 In this example, the reproduction space is divided into three areas R1 to R3. Area R1 and area R3 are non-listening areas that are spaces or directions in which sound should not be reproduced, and area R2 is a listening area that is a space or direction in which sounds are to be reproduced.

フィルタ計算部２１は、スイッチ３１−１乃至スイッチ３１−Ｎ、D/A（Digital/Analog）変換器３２−１乃至D/A変換器３２−Ｎ、音源装置３３−１乃至音源装置３３−Ｎ、音響センサ３４−１乃至音響センサ３４−Ｋ、伝達関数計算器３５、情報入力部３６、およびフィルタ係数計算部３７を有している。 The filter calculation unit 21 includes switches 31-1 to 31-N, D / A (Digital / Analog) converters 32-1 to D / A converters 32-N, sound source devices 33-1 to 33-N. , Acoustic sensor 34-1 to acoustic sensor 34-K, transfer function calculator 35, information input unit 36, and filter coefficient calculation unit 37.

スイッチ３１−１乃至スイッチ３１−Ｎのそれぞれは、それらの出力端がD/A変換器３２−１乃至D/A変換器３２−Ｎに接続されており、フィルタ計算部２１の制御に応じてオンまたはオフする。スイッチ３１−１乃至スイッチ３１−Ｎは、オン状態である場合、供給された測定用音響信号をD/A変換器３２−１乃至D/A変換器３２−Ｎに供給する。 Each of the switches 31-1 to 31-N has an output terminal connected to the D / A converter 32-1 to D / A converter 32-N, and according to the control of the filter calculation unit 21. Turn on or off. When the switches 31-1 to 31-N are in the on state, the supplied measurement acoustic signals are supplied to the D / A converter 32-1 to D / A converter 32-N.

D/A変換器３２−１乃至D/A変換器３２−Ｎは、スイッチ３１−１乃至スイッチ３１−Ｎを介して供給された測定用音響信号に対してD/A変換を行い、音源装置３３−１乃至音源装置３３−Ｎに供給する。 The D / A converter 32-1 to D / A converter 32-N performs D / A conversion on the measurement acoustic signals supplied via the switches 31-1 to 31-N, and generates a sound source device. 33-1 to the sound source device 33-N.

音源装置３３−１乃至音源装置３３−Ｎは、例えば空間中に音声（音響信号）を放射するスピーカなどからなり、D/A変換器３２−１乃至D/A変換器３２−Ｎから供給された測定用音響信号に基づいて、伝達関数測定用の音声を再生する。 The sound source device 33-1 to sound source device 33-N include, for example, speakers that emit sound (acoustic signals) in the space, and are supplied from the D / A converter 32-1 to D / A converter 32-N. Based on the measured measurement acoustic signal, the transfer function measurement sound is reproduced.

なお、以下、スイッチ３１−１乃至スイッチ３１−Ｎを特に区別する必要のない場合、単にスイッチ３１とも称し、D/A変換器３２−１乃至D/A変換器３２−Ｎを特に区別する必要のない場合、単にD/A変換器３２とも称する。さらに、以下、音源装置３３−１乃至音源装置３３−Ｎを特に区別する必要のない場合、単に音源装置３３とも称する。ここで、音源装置３３の個数Nは、2以上とされる。 Hereinafter, when it is not necessary to particularly distinguish the switches 31-1 to 31 -N, they are also simply referred to as switches 31, and it is necessary to particularly distinguish the D / A converters 32-1 to 32-N. When there is no signal, it is also simply referred to as a D / A converter 32. Further, hereinafter, the sound source device 33-1 to the sound source device 33-N are also simply referred to as the sound source device 33 when it is not necessary to particularly distinguish them. Here, the number N of the sound source devices 33 is 2 or more.

音響センサ３４−１乃至音響センサ３４−Ｋは、例えばマイクロフォンなどからなり、音源装置３３から出力された音声を収音し、その結果得られた測定信号を伝達関数計算器３５に供給する。なお、以下、音響センサ３４−１乃至音響センサ３４−Ｋを特に区別する必要のない場合、単に音響センサ３４とも称することとする。 The acoustic sensors 34-1 to 34-K are composed of, for example, a microphone and the like, pick up the sound output from the sound source device 33, and supply the measurement signal obtained as a result to the transfer function calculator 35. Hereinafter, the acoustic sensors 34-1 to 34-K are also simply referred to as acoustic sensors 34 when it is not necessary to distinguish them.

また、フィルタ計算部２１が配置された空間では、例えば非聴取エリアであるエリアR1やエリアR3に対応するエリアに、音声の音圧を制御する点（位置）であるL個の制御点が設定される。同様に、フィルタ計算部２１が配置された空間における、聴取エリアであるエリアR2に対応するエリアにM個の制御点が設定される。ここでL＋M＝Kとされる。なお、以下では、再生空間上の聴取エリアや非聴取エリアに対応する、フィルタ計算部２１が配置された空間上のエリアを、単に聴取エリアや非聴取エリアとも称することとする。 In the space where the filter calculation unit 21 is arranged, for example, L control points that are points (positions) for controlling the sound pressure of sound are set in areas corresponding to areas R1 and R3 that are non-listening areas. Is done. Similarly, M control points are set in an area corresponding to the area R2 that is the listening area in the space where the filter calculation unit 21 is arranged. Here, L + M = K. In the following, the area on the space where the filter calculation unit 21 corresponding to the listening area and the non-listening area on the reproduction space is also simply referred to as a listening area and a non-listening area.

さらに、各制御点に音響センサ３４が配置される。このとき、全制御点の合計個数、つまり全音響センサ３４の合計個数Kは、全音源装置３３の合計個数N以上の数となるようにされる。これは、上述したように、制御点以外の非聴取エリアにおける音圧が確実に最小化されるようにするためである。なお、より詳細にはどのエリアのどの位置に制御点を設定するか、換言すれば各音響センサ３４の配置位置が聴取エリアとされるか、または非聴取エリアとされるかは、情報入力部３６により決定される。 Further, an acoustic sensor 34 is arranged at each control point. At this time, the total number of all control points, that is, the total number K of all acoustic sensors 34 is set to be equal to or greater than the total number N of all sound source devices 33. This is to ensure that the sound pressure in the non-listening area other than the control point is minimized as described above. In more detail, the information input unit determines which control point is set in which area, in other words, whether the position of each acoustic sensor 34 is set as a listening area or a non-listening area. 36.

伝達関数計算器３５は、音響センサ３４−１乃至音響センサ３４−Ｋから供給された測定信号と、供給された測定用音響信号とに基づいて、各音源装置３３から音響センサ３４までの間の伝達特性を示す伝達関数H_n(r)を算出し、フィルタ係数計算部３７に供給する。伝達関数計算器３５では、各音源装置３３について、音響センサ３４ごとに伝達関数が求められる。つまり、合計でN×K個の伝達関数が得られる。 The transfer function calculator 35 is provided between each sound source device 33 and the acoustic sensor 34 based on the measurement signal supplied from the acoustic sensors 34-1 to 34-K and the supplied measurement acoustic signal. A transfer function H _n (r) indicating transfer characteristics is calculated and supplied to the filter coefficient calculation unit 37. In the transfer function calculator 35, a transfer function is obtained for each sound sensor 33 for each acoustic sensor 34. That is, a total of N × K transfer functions are obtained.

情報入力部３６は、管理者の操作入力等に応じて入力フィルタの算出に必要な各種の設定を行い、その設定結果をフィルタ係数計算部３７に供給する。情報入力部３６はエリア設定部４１、目的音圧設定部４２、および許容誤差設定部４３を有している。 The information input unit 36 performs various settings necessary for calculating the input filter in accordance with an operation input by the administrator, and supplies the setting result to the filter coefficient calculation unit 37. The information input unit 36 includes an area setting unit 41, a target sound pressure setting unit 42, and an allowable error setting unit 43.

エリア設定部４１は、例えば管理者の操作入力等に応じて、空間上にK個の制御点を設定する。具体的には、聴取エリア、非聴取エリア、非限定エリア等の各種別のエリアにいくつかの制御点が設定され、それらの設定された制御点の位置rと、制御点を含むエリアの種別を示す情報とが制御点の設定結果などとされる。 The area setting unit 41 sets K control points on the space in accordance with, for example, an operation input by the administrator. Specifically, several control points are set in various areas such as a listening area, a non-listening area, and a non-limited area, and the position r of the set control points and the type of the area including the control points are set. Information indicating the control point setting result and the like.

目的音圧設定部４２は、例えば管理者の操作入力等に応じて、K個の制御点のうちの聴取エリア内に設定された制御点、すなわち上述したM個の制御点について、それらの制御点における目的音圧を設定する。ここで、聴取エリア内の制御点の目的音圧は、上述した音圧特性d(r)である。 The target sound pressure setting unit 42 controls the control points set in the listening area among the K control points, that is, the M control points described above, for example, in response to an operation input by the administrator. Set the target sound pressure at the point. Here, the target sound pressure at the control point in the listening area is the sound pressure characteristic d (r) described above.

許容誤差設定部４３は、例えば管理者の操作入力等に応じて、聴取エリアの各制御点について、それらの制御点における目的音圧の許容誤差σを設定する。 The allowable error setting unit 43 sets an allowable error σ of the target sound pressure at each control point for each control point in the listening area, for example, according to an operation input by the administrator.

情報入力部３６は、エリア設定部４１乃至許容誤差設定部４３による設定の結果、すなわち、各制御点の位置、各制御点があるエリアの種別、目的音圧、および許容誤差をフィルタ係数計算部３７に供給する。 The information input unit 36 uses the area setting unit 41 to the allowable error setting unit 43 to set the result of the setting, that is, the position of each control point, the type of area in which each control point is located, the target sound pressure, and the allowable error. 37.

フィルタ係数計算部３７は、伝達関数計算器３５から供給された伝達関数H_n(r)と、情報入力部３６から供給された各種の設定結果とに基づいて入力フィルタを算出し、音声再生部２２に供給する。 The filter coefficient calculation unit 37 calculates an input filter based on the transfer function H _n (r) supplied from the transfer function calculator 35 and various setting results supplied from the information input unit 36, and the sound reproduction unit 22 is supplied.

また、再生空間に配置された音声再生部２２は、振幅位相制御フィルタ処理部６１−１乃至振幅位相制御フィルタ処理部６１−Ｎ、D/A変換器６２−１乃至D/A変換器６２−Ｎ、および音源装置６３−１乃至音源装置６３−Ｎを有している。 The audio reproduction unit 22 arranged in the reproduction space includes an amplitude phase control filter processing unit 61-1 to an amplitude phase control filter processing unit 61-N, a D / A converter 62-1 to a D / A converter 62-. N, and sound source devices 63-1 to 63-N.

振幅位相制御フィルタ処理部６１−１乃至振幅位相制御フィルタ処理部６１−Ｎには、再生しようとする音声の入力音響信号が供給される。 An input acoustic signal of the sound to be reproduced is supplied to the amplitude phase control filter processing unit 61-1 to the amplitude phase control filter processing unit 61-N.

振幅位相制御フィルタ処理部６１−１乃至振幅位相制御フィルタ処理部６１−Ｎは、供給された入力音響信号に対して、フィルタ係数計算部３７から供給された入力フィルタを用いたフィルタ処理を施し、その結果得られた出力音響信号をD/A変換器６２−１乃至D/A変換器６２−Ｎに供給する。振幅位相制御フィルタ処理部６１−１乃至振幅位相制御フィルタ処理部６１−Ｎにおけるフィルタ処理により、N個の入力音響信号間の相対的な振幅や位相が制御される。 The amplitude phase control filter processing unit 61-1 to the amplitude phase control filter processing unit 61-N perform filter processing using the input filter supplied from the filter coefficient calculation unit 37 on the supplied input acoustic signal, The output acoustic signal obtained as a result is supplied to the D / A converter 62-1 to D / A converter 62-N. The relative amplitude and phase between the N input acoustic signals are controlled by the filter processing in the amplitude phase control filter processing unit 61-1 to the amplitude phase control filter processing unit 61-N.

D/A変換器６２−１乃至D/A変換器６２−Ｎは、振幅位相制御フィルタ処理部６１−１乃至振幅位相制御フィルタ処理部６１−Ｎから供給された出力音響信号に対してD/A変換を行い、音源装置６３−１乃至音源装置６３−Ｎに供給する。 The D / A converter 62-1 through D / A converter 62-N perform D / A conversion on the output acoustic signals supplied from the amplitude phase control filter processing unit 61-1 through amplitude phase control filter processing unit 61-N. A conversion is performed and supplied to the sound source device 63-1 to 63-N.

音源装置６３−１乃至音源装置６３−Ｎは、例えば空間中に音声を放射するスピーカなどからなり、D/A変換器６２−１乃至D/A変換器６２−Ｎから供給された出力音響信号に基づいて音声を再生する。ここで、音源装置６３−１乃至音源装置６３−Ｎのそれぞれは、音源装置３３−１乃至音源装置３３−Ｎのそれぞれに対応する。 The sound source devices 63-1 to 63-N include, for example, speakers that emit sound into the space, and output acoustic signals supplied from the D / A converter 62-1 to D / A converter 62-N. Play audio based on Here, each of the sound source devices 63-1 to 63-N corresponds to each of the sound source devices 33-1 to 33-N.

なお、以下、振幅位相制御フィルタ処理部６１−１乃至振幅位相制御フィルタ処理部６１−Ｎを特に区別する必要のない場合、単に振幅位相制御フィルタ処理部６１とも称し、D/A変換器６２−１乃至D/A変換器６２−Ｎを特に区別する必要のない場合、単にD/A変換器６２とも称する。また、以下、音源装置６３−１乃至音源装置６３−Ｎを特に区別する必要のない場合、単に音源装置６３とも称する。 Hereinafter, when it is not necessary to distinguish between the amplitude phase control filter processing unit 61-1 to the amplitude phase control filter processing unit 61-N, the amplitude phase control filter processing unit 61-N is also simply referred to as an amplitude phase control filter processing unit 61, and The 1 to D / A converters 62 -N are also simply referred to as D / A converters 62 when it is not necessary to distinguish them. Hereinafter, the sound source device 63-1 to the sound source device 63-N are also simply referred to as the sound source device 63 when it is not necessary to distinguish between them.

図１に示す音響システム１１では、伝達関数を得るための音響センサ３４を再生空間に配置する必要がないため、ユーザは再生空間の所望の位置で音声を聴取したり、所望の位置に移動したりして、より自由な聴取を行うことができる。 In the acoustic system 11 shown in FIG. 1, since it is not necessary to arrange the acoustic sensor 34 for obtaining the transfer function in the reproduction space, the user listens to the voice at a desired position in the reproduction space or moves to the desired position. Or more free listening.

〈フィルタ係数算出処理の説明〉
次に、図１に示した音響システム１１の動作について説明する。 <Description of filter coefficient calculation processing>
Next, the operation of the acoustic system 11 shown in FIG. 1 will be described.

例えば音響システム１１のフィルタ計算部２１は、再生空間において音声再生部２２が音声を再生するにあたり、事前にフィルタ係数算出処理を行って入力フィルタを算出し、音声再生部２２に供給する。 For example, the filter calculation unit 21 of the acoustic system 11 calculates the input filter by performing filter coefficient calculation processing in advance when the audio reproduction unit 22 reproduces sound in the reproduction space, and supplies the input filter to the audio reproduction unit 22.

以下、図２のフローチャートを参照して、音響システム１１によるフィルタ係数算出処理について説明する。 Hereinafter, the filter coefficient calculation processing by the acoustic system 11 will be described with reference to the flowchart of FIG.

ステップＳ１１において、エリア設定部４１は、空間上に配置されるK個の制御点についてエリア設定を行う。すなわち、エリア設定部４１は、必要に応じて再生空間上に聴取エリアや非聴取エリアといった各種別のエリアを設定するとともに、それらの各エリアに対応する、フィルタ計算部２１が配置された空間上のエリアに制御点を設定する。 In step S11, the area setting unit 41 performs area setting for K control points arranged in the space. That is, the area setting unit 41 sets various types of areas such as a listening area and a non-listening area on the reproduction space as necessary, and on the space where the filter calculation unit 21 corresponding to each area is arranged. Set control points in the area.

この例では、非聴取エリアであるエリアR1やエリアR3に対応するエリアに、いくつかの制御点が設定され、聴取エリアであるエリアR2に対応するエリアに、いくつかの制御点が設定される。 In this example, some control points are set in the areas corresponding to the non-listening areas R1 and R3, and some control points are set in the areas corresponding to the listening area R2. .

ステップＳ１２において、目的音圧設定部４２は、聴取エリア内にある各制御点について、それらの制御点における音圧特性d(r)を目的音圧として設定する。 In step S12, the target sound pressure setting unit 42 sets, for each control point in the listening area, the sound pressure characteristic d (r) at those control points as the target sound pressure.

ステップＳ１３において、許容誤差設定部４３は、聴取エリアの各制御点について、それらの制御点における目的音圧の許容誤差σを設定する。なお、各制御点における許容誤差σは同じ値であってもよいし、異なる値であってもよい。 In step S 13, the allowable error setting unit 43 sets the allowable error σ of the target sound pressure at each control point for each control point in the listening area. The allowable error σ at each control point may be the same value or a different value.

以上の処理が行われると、情報入力部３６は各制御点の位置r、各制御点があるエリアの種別、目的音圧としての音圧特性d(r)、および許容誤差σをフィルタ係数計算部３７に供給する。 When the above processing is performed, the information input unit 36 calculates the filter coefficient of the position r of each control point, the type of area where each control point is, the sound pressure characteristic d (r) as the target sound pressure, and the allowable error σ. To the unit 37.

ステップＳ１４において、フィルタ計算部２１は、N個の音源装置３３のうちのまだ処理対象としていない音源装置３３を1つ選択する。例えば音源装置３３−１から音源装置３３−Ｎまで順番に1つずつ処理対象とする音源装置３３が選択されていく。 In step S 14, the filter calculation unit 21 selects one sound source device 33 that has not yet been processed from among the N sound source devices 33. For example, the sound source devices 33 to be processed are sequentially selected one by one from the sound source device 33-1 to the sound source device 33-N.

フィルタ計算部２１は処理対象とする1つの音源装置３３を選択すると、その音源装置３３にのみ測定用音響信号が供給されるようにスイッチ３１の開閉を制御する。 When the filter calculation unit 21 selects one sound source device 33 to be processed, the filter calculation unit 21 controls the opening and closing of the switch 31 so that the measurement sound signal is supplied only to the sound source device 33.

すなわち、フィルタ計算部２１は、処理対象の音源装置３３に接続されているスイッチ３１をオンさせ、他の処理対象ではない音源装置３３に接続されているスイッチ３１をオフさせる。 That is, the filter calculation unit 21 turns on the switch 31 connected to the sound source device 33 to be processed, and turns off the switch 31 connected to the sound source device 33 that is not another processing target.

ステップＳ１５において、音源装置３３は伝達関数測定用の音声を出力する。 In step S15, the sound source device 33 outputs a transfer function measurement sound.

すなわち、D/A変換器３２は、スイッチ３１を介して供給された測定用音響信号をD/A変換し、音源装置３３に供給する。これにより、測定用音響信号がデジタル信号からアナログ信号に変換される。また、音源装置３３は、D/A変換器３２から供給された測定用音響信号に基づいて、伝達関数測定用の音声を出力する。 That is, the D / A converter 32 performs D / A conversion on the measurement acoustic signal supplied via the switch 31 and supplies it to the sound source device 33. As a result, the measurement acoustic signal is converted from a digital signal to an analog signal. The sound source device 33 outputs a transfer function measurement sound based on the measurement acoustic signal supplied from the D / A converter 32.

ここでは、フィルタ計算部２１によるスイッチ３１の開閉制御により、処理対象の音源装置３３にのみ測定用音響信号が供給されるので、処理対象の音源装置３３から伝達関数測定用の音声が出力され、他の音源装置３３からは音声は出力されない。 Here, since the measurement acoustic signal is supplied only to the sound source device 33 to be processed by the opening / closing control of the switch 31 by the filter calculation unit 21, the sound for transfer function measurement is output from the sound source device 33 to be processed, No sound is output from the other sound source device 33.

ステップＳ１６において、K個の各音響センサ３４は、処理対象の音源装置３３から出力された音声を収音し、その結果得られた測定信号を伝達関数計算器３５に供給する。 In step S 16, each of the K acoustic sensors 34 collects the sound output from the sound source device 33 to be processed, and supplies the measurement signal obtained as a result to the transfer function calculator 35.

ステップＳ１７において、伝達関数計算器３５は、音響センサ３４から供給された測定信号と、供給された測定用音響信号とに基づいて、処理対象の音源装置３３からK個の各音響センサ３４のそれぞれまでの間の伝達特性を示す伝達関数H_n(r)を算出し、フィルタ係数計算部３７に供給する。これにより、K個の音響センサ３４ごとに、処理対象の音源装置３３との間の伝達関数が得られる。 In step S 17, the transfer function calculator 35 determines each of the K acoustic sensors 34 from the sound source device 33 to be processed based on the measurement signal supplied from the acoustic sensor 34 and the supplied measurement acoustic signal. A transfer function H _n (r) indicating the transfer characteristic until is calculated and supplied to the filter coefficient calculation unit 37. Thereby, a transfer function between the K acoustic sensors 34 and the sound source device 33 to be processed is obtained.

ステップＳ１８において、フィルタ計算部２１は、全ての音源装置３３について処理を行ったか否かを判定する。例えば、全ての音源装置３３が処理対象とされ、各音源装置３３についての伝達関数H_n(r)が得られた場合、全ての音源装置３３について処理を行ったと判定される。 In step S 18, the filter calculation unit 21 determines whether or not all sound source devices 33 have been processed. For example, when all the sound source devices 33 are to be processed and the transfer function H _n (r) for each sound source device 33 is obtained, it is determined that the processing has been performed for all the sound source devices 33.

ステップＳ１８において、まだ全ての音源装置３３について処理を行っていないと判定された場合、処理はステップＳ１４に戻り、上述した処理が繰り返し行われる。すなわち、次に処理対象とする音源装置３３が選択され、伝達関数H_n(r)が算出される。 If it is determined in step S18 that processing has not been performed for all the sound source devices 33, the processing returns to step S14, and the above-described processing is repeated. That is, the sound source device 33 to be processed next is selected, and the transfer function H _n (r) is calculated.

これに対して、ステップＳ１８において全ての音源装置３３について処理を行ったと判定されると、処理はステップＳ１９に進む。 On the other hand, if it is determined in step S18 that the processing has been performed for all sound source devices 33, the process proceeds to step S19.

ステップＳ１９において、フィルタ係数計算部３７は、伝達関数計算器３５から供給された伝達関数H_n(r)と、情報入力部３６から供給された各制御点の位置r、各制御点があるエリアの種別、目的音圧としての音圧特性d(r)、および許容誤差σとに基づいて入力フィルタの入力フィルタ係数を算出し、振幅位相制御フィルタ処理部６１に供給する。 In step S19, the filter coefficient calculation unit 37 includes the transfer function H _n (r) supplied from the transfer function calculator 35, the position r of each control point supplied from the information input unit 36, and the area where each control point exists. The input filter coefficient of the input filter is calculated based on the type of sound pressure, the sound pressure characteristic d (r) as the target sound pressure, and the allowable error σ, and supplied to the amplitude phase control filter processing unit 61.

具体的には、フィルタ係数計算部３７は、上述した式（１３）に示した線型半無限計画問題について位置rおよび回転角度θを離散化することで得られる線型計画問題を解くことにより、N個の音源装置３３ごとの入力フィルタ係数W_nを算出する。 Specifically, the filter coefficient calculation unit 37 solves the linear programming problem obtained by discretizing the position r and the rotation angle θ with respect to the linear semi-infinite programming problem shown in the above-described equation (13), thereby obtaining N N An input filter coefficient W _n for each sound source device 33 is calculated.

すなわち、フィルタ係数計算部３７は、聴取エリアにおける音圧の目的音圧に対する誤差が許容誤差σの範囲内に収まるという条件下で、非聴取エリアにおける音圧の最大値を最小化する入力フィルタ係数W_nを求める線型計画問題を解くことで、入力フィルタ係数W_nを算出する。 In other words, the filter coefficient calculator 37 minimizes the maximum value of the sound pressure in the non-listening area under the condition that the error of the sound pressure in the listening area with respect to the target sound pressure is within the allowable error σ. by solving the linear programming problem of finding W _n, to calculate the input filter coefficients W _n.

このとき、フィルタ係数計算部３７は、例えば式（１４）に示した上限値g_maxについての制約条件を線型計画問題に追加して、線型計画問題を解くようにしてもよい。また、フィルタ係数計算部３７は、非限定エリアが設定される場合には、式（１５）に示した非限定エリアについての制約条件を線型計画問題に追加して、線型計画問題を解くようにしてもよい。 At this time, the filter coefficient calculation unit 37 may solve the linear planning problem by adding a constraint condition for the upper limit value g _max shown in the equation (14) to the linear planning problem, for example. In addition, when the non-limited area is set, the filter coefficient calculation unit 37 adds the constraint condition for the non-limited area shown in Expression (15) to the linear programming problem to solve the linear programming problem. May be.

入力フィルタ係数W_nが得られると、フィルタ係数計算部３７は、各音源装置３３について求められた入力フィルタ係数W_nを、それらの音源装置３３に対応する音源装置６３に接続されている振幅位相制御フィルタ処理部６１に供給する。 When the input filter coefficient W _n is obtained, the filter coefficient calculating unit 37, the input filter coefficients W _n obtained for each sound source device 33, their source device corresponding amplitude phase that is connected to the tone generator 63 to 33 This is supplied to the control filter processing unit 61.

すなわち、n番目の音源装置３３−ｎについて求められた入力フィルタ係数W_nが、音源装置３３−ｎに対応するn番目の音源装置６３−ｎに接続されている振幅位相制御フィルタ処理部６１−ｎに供給される。 That is, the input filter coefficient W _n obtained for the _n -th sound source device 33-n is the amplitude / phase control filter processing unit 61- connected to the n-th sound device 63-n corresponding to the sound source device 33-n. n.

入力フィルタ係数W_nが振幅位相制御フィルタ処理部６１に供給されると、フィルタ係数算出処理は終了する。 When the input filter coefficient W _n is supplied to the amplitude / phase control filter processing unit 61, the filter coefficient calculation process ends.

以上のようにして音響システム１１は、エリアの種別や目的音圧などの各制御点に関する情報を設定するとともに伝達関数を測定により求める。そして、音響システム１１は、各制御点に関する設定および伝達関数に基づいて、聴取エリアの音圧の誤差が許容誤差σの範囲内に収まるという条件下で、非聴取エリアにおける音圧の最大値を最小化する入力フィルタ係数を求めるという線型計画問題を解くことで、入力フィルタ係数を算出する。 As described above, the acoustic system 11 sets information regarding each control point such as the type of area and the target sound pressure, and obtains a transfer function by measurement. Then, the acoustic system 11 calculates the maximum value of the sound pressure in the non-listening area under the condition that the sound pressure error in the listening area is within the allowable error σ based on the setting and transfer function regarding each control point. An input filter coefficient is calculated by solving a linear programming problem of obtaining an input filter coefficient to be minimized.

これにより、聴取エリアの音圧を許容誤差により定まる範囲内に収めつつ非聴取エリアの音圧を最大限に抑えられる入力フィルタ係数を得ることができ、この入力フィルタ係数によって、より空間的なばらつきの少ない音圧分布の音声を得ることができるようになる。 As a result, it is possible to obtain an input filter coefficient that can suppress the sound pressure in the non-listening area to the maximum while keeping the sound pressure in the listening area within the range determined by the allowable error. It is possible to obtain a sound with a small sound pressure distribution.

〈音声再生処理の説明〉
続いて、音響システム１１が、フィルタ係数算出処理により得られた入力フィルタを用いて音声の再生を行う音声再生処理について説明する。 <Description of audio playback processing>
Next, a sound reproduction process in which the sound system 11 reproduces sound using the input filter obtained by the filter coefficient calculation process will be described.

ステップＳ４１において、振幅位相制御フィルタ処理部６１は、供給された入力音響信号に対して、フィルタ係数計算部３７から供給された入力フィルタを用いたフィルタ処理を施し、その結果得られた出力音響信号をD/A変換器６２に供給する。 In step S41, the amplitude phase control filter processing unit 61 performs a filtering process using the input filter supplied from the filter coefficient calculation unit 37 on the supplied input acoustic signal, and the output acoustic signal obtained as a result thereof Is supplied to the D / A converter 62.

具体的には、振幅位相制御フィルタ処理部６１は、供給された入力音響信号に対して時間周波数変換を行うことで、時間領域の信号（時間信号）である入力音響信号を、周波数領域の信号である周波数スペクトルに変換する。 Specifically, the amplitude phase control filter processing unit 61 performs time-frequency conversion on the supplied input acoustic signal, thereby converting the input acoustic signal, which is a time domain signal (time signal), into a frequency domain signal. Is converted to a frequency spectrum.

そして、振幅位相制御フィルタ処理部６１は、得られた周波数スペクトルの各周波数成分に、フィルタ係数計算部３７から供給された入力フィルタの入力フィルタ係数W_nを構成する各周波数成分の要素（係数）を乗算することで、入力音響信号の相対的な振幅や位相を調整する。すなわち、入力音響信号に対して、周波数領域で入力フィルタを用いたフィルタ処理が施される。 Then, the amplitude / phase control filter processing unit 61 includes, for each frequency component of the obtained frequency spectrum, elements (coefficients) of each frequency component constituting the input filter coefficient W _n of the input filter supplied from the filter coefficient calculation unit 37. Is used to adjust the relative amplitude and phase of the input acoustic signal. That is, the input acoustic signal is subjected to filter processing using an input filter in the frequency domain.

さらに、振幅位相制御フィルタ処理部６１は、フィルタ処理された周波数スペクトルに対して、周波数時間変換、つまり時間周波数変換の逆変換を行うことで、フィルタ処理された時間領域の信号である出力音響信号を得る。 Further, the amplitude phase control filter processing unit 61 performs frequency-time conversion, that is, inverse conversion of time-frequency conversion, on the filtered frequency spectrum, thereby outputting an output acoustic signal that is a filtered time-domain signal. Get.

ステップＳ４２において、D/A変換器６２は、振幅位相制御フィルタ処理部６１から供給された出力音響信号をD/A変換し、音源装置６３に供給する。これにより、出力音響信号がデジタル信号からアナログ信号へと変換される。 In step S 42, the D / A converter 62 performs D / A conversion on the output acoustic signal supplied from the amplitude phase control filter processing unit 61 and supplies it to the sound source device 63. As a result, the output acoustic signal is converted from a digital signal to an analog signal.

ステップＳ４３において、音源装置６３は、D/A変換器６２から供給された出力音響信号に基づいて音声を再生し、音声再生処理は終了する。 In step S43, the sound source device 63 reproduces sound based on the output acoustic signal supplied from the D / A converter 62, and the sound reproduction process ends.

各音源装置６３で音声が再生されると、再生空間内のエリアR1およびエリアR3、つまり非聴取エリアにいるユーザは再生された音声を聴取することができず、再生空間内のエリアR2、つまり聴取エリアにいるユーザは再生された音声を聴取することができる。 When the sound is reproduced by each sound source device 63, the users in the areas R1 and R3 in the reproduction space, that is, the non-listening area cannot listen to the reproduced sound, and the area R2 in the reproduction space, that is, A user in the listening area can listen to the reproduced sound.

以上のようにして、音響システム１１は、事前に算出された入力フィルタを用いて入力音響信号にフィルタ処理を施し、その結果得られた出力音響信号に基づいて、音声を再生する。このように、入力フィルタを用いて入力音響信号にフィルタ処理を施すことで、より空間的なばらつきの少ない音圧分布の音声を得ることができるとともに、より高い分離性能を得ることができる。 As described above, the acoustic system 11 performs the filtering process on the input acoustic signal using the input filter calculated in advance, and reproduces the sound based on the output acoustic signal obtained as a result. As described above, by performing the filtering process on the input acoustic signal using the input filter, it is possible to obtain sound with a sound pressure distribution with less spatial variation and to obtain higher separation performance.

〈第２の実施の形態〉
〈音響システムの構成例〉
ところで、上述したように複数の入力音響信号に対して、それぞれ聴取エリアを設定することも可能である。そのような場合、音響システム１１は、例えば図４に示すように構成される。なお、図４において図１における場合と対応する部分には同一の符号を付してあり、その説明は省略する。 <Second Embodiment>
<Example of acoustic system configuration>
By the way, as described above, a listening area can be set for each of a plurality of input sound signals. In such a case, the acoustic system 11 is configured as shown in FIG. 4, for example. In FIG. 4, parts corresponding to those in FIG. 1 are denoted by the same reference numerals, and the description thereof is omitted.

図４に示す音響システム１１は、フィルタ計算部２１と音声再生部２２とから構成される。また、音声再生部２２には、互いに異なる音声を再生するための2つの入力音響信号が供給され、音声再生部２２は、それらの2つの入力音響信号の再生を同時に行う。 The acoustic system 11 shown in FIG. 4 includes a filter calculation unit 21 and an audio reproduction unit 22. In addition, two input acoustic signals for reproducing different sounds are supplied to the audio reproducing unit 22, and the audio reproducing unit 22 reproduces these two input acoustic signals simultaneously.

ここで、これらの入力音響信号を区別するために、一方の入力音響信号を入力IP1の入力音響信号SG1とも称し、他方の入力音響信号を入力IP2の入力音響信号SG2とも称することとする。 Here, in order to distinguish these input sound signals, one input sound signal is also referred to as an input sound signal SG1 of the input IP1, and the other input sound signal is also referred to as an input sound signal SG2 of the input IP2.

また、この例では、再生空間がエリアR11乃至エリアR15の5つのエリアに分けられている。そして、エリアR12が入力IP1の聴取エリアとされ、エリアR14が入力IP2の聴取エリアとされ、他のエリアR11、エリアR13、およびエリアR15は非聴取エリアとされている。 Further, in this example, the reproduction space is divided into five areas R11 to R15. The area R12 is the listening area for the input IP1, the area R14 is the listening area for the input IP2, and the other areas R11, R13, and R15 are non-listening areas.

したがって、この例では、入力IP1の入力音響信号SG1についての入力フィルタ係数を算出するときには、エリアR12が聴取エリアとされ、他のエリアR11、およびエリアR13乃至エリアR15は非聴取エリアとされる。 Therefore, in this example, when calculating the input filter coefficient for the input acoustic signal SG1 of the input IP1, the area R12 is set as a listening area, and the other areas R11 and the areas R13 to R15 are set as non-listening areas.

また、入力IP2の入力音響信号SG2についての入力フィルタ係数を算出するときには、エリアR14が聴取エリアとされ、他のエリアR11乃至エリアR13、およびエリアR15が非聴取エリアとされる。 Further, when calculating the input filter coefficient for the input acoustic signal SG2 of the input IP2, the area R14 is set as a listening area, and the other areas R11 to R13 and the area R15 are set as non-listening areas.

なお、ここでは各エリアが聴取エリアまたは非聴取エリアとされる例について説明するが、例えば入力IP1についても入力IP2についても非聴取エリアとなっているエリアR11、エリアR13、およびエリアR15のうちのいくつかが非限定エリアとされてもよい。 Here, an example in which each area is a listening area or a non-listening area will be described. For example, the input IP1 and the input IP2 are the non-listening areas of the areas R11, R13, and R15. Some may be non-limited areas.

また、例えば入力IP1の入力フィルタ係数を算出するときには、エリアR12およびエリアR13を聴取エリアとし、入力IP2の入力フィルタ係数を算出するときには、エリアR13およびエリアR14を聴取エリアとしてもよい。そのようにすると、エリアR12では入力IP1の音声のみが聴取でき、エリアR14では入力IP2の音声のみが聴取でき、エリアR13では入力IP1と入力IP2の両方の音声が聴取できるようになる。 For example, when calculating the input filter coefficient of the input IP1, the area R12 and the area R13 may be set as the listening area, and when calculating the input filter coefficient of the input IP2, the area R13 and the area R14 may be set as the listening area. By doing so, only the voice of the input IP1 can be heard in the area R12, only the voice of the input IP2 can be heard in the area R14, and the voices of both the input IP1 and the input IP2 can be heard in the area R13.

このような図４の例では、フィルタ計算部２１は、スイッチ３１−１乃至スイッチ３１−Ｎ、D/A変換器３２−１乃至D/A変換器３２−Ｎ、音源装置３３−１乃至音源装置３３−Ｎ、音響センサ３４−１乃至音響センサ３４−Ｋ、伝達関数計算器３５、情報入力部３６、情報入力部１０１、フィルタ係数計算部３７、およびフィルタ係数計算部１０２を有している。 In the example of FIG. 4, the filter calculation unit 21 includes the switches 31-1 to 31-N, the D / A converters 32-1 to 32-N, the sound source device 33-1 to the sound source. The apparatus 33-N, the acoustic sensors 34-1 to 34-K, the transfer function calculator 35, the information input unit 36, the information input unit 101, the filter coefficient calculation unit 37, and the filter coefficient calculation unit 102 are included. .

図４に示すフィルタ計算部２１の構成は、新たに情報入力部１０１およびフィルタ係数計算部１０２が設けられている点で図１に示したフィルタ計算部２１の構成と異なり、その他の点では図１のフィルタ計算部２１と同じ構成となっている。 The configuration of the filter calculation unit 21 shown in FIG. 4 is different from the configuration of the filter calculation unit 21 shown in FIG. 1 in that an information input unit 101 and a filter coefficient calculation unit 102 are newly provided. The configuration is the same as that of one filter calculation unit 21.

図４のフィルタ計算部２１では、情報入力部３６が、入力IP1の入力音響信号SG1についての各種の設定を行い、その設定結果をフィルタ係数計算部３７に供給する。また、フィルタ係数計算部３７が、伝達関数計算器３５から供給された伝達関数H_n(r)と、情報入力部３６から供給された各種の設定結果とに基づいて、入力IP1についての入力フィルタを算出し、音声再生部２２に供給する。 In the filter calculation unit 21 of FIG. 4, the information input unit 36 performs various settings for the input acoustic signal SG1 of the input IP1, and supplies the setting results to the filter coefficient calculation unit 37. Further, the filter coefficient calculation unit 37 inputs the input filter for the input IP1 based on the transfer function H _n (r) supplied from the transfer function calculator 35 and various setting results supplied from the information input unit 36. Is calculated and supplied to the audio playback unit 22.

これに対して、情報入力部１０１は、例えば管理者の操作入力等に応じて、入力IP2の入力音響信号SG2についての各種の設定を行い、その設定結果をフィルタ係数計算部１０２に供給する。また、情報入力部１０１は、エリア設定部４１乃至許容誤差設定部４３のそれぞれに対応するエリア設定部１１１、目的音圧設定部１１２、および許容誤差設定部１１３を有している。 On the other hand, the information input unit 101 performs various settings for the input acoustic signal SG2 of the input IP2 according to, for example, an operation input by the administrator, and supplies the setting results to the filter coefficient calculation unit 102. Further, the information input unit 101 includes an area setting unit 111, a target sound pressure setting unit 112, and an allowable error setting unit 113 corresponding to each of the area setting unit 41 to the allowable error setting unit 43.

さらに、フィルタ係数計算部１０２は、伝達関数計算器３５から供給された伝達関数H_n(r)と、情報入力部１０１から供給された各種の設定結果とに基づいて、入力IP2についての入力フィルタを算出し、音声再生部２２に供給する。 Further, the filter coefficient calculation unit 102 inputs the input filter for the input IP2 based on the transfer function H _n (r) supplied from the transfer function calculator 35 and various setting results supplied from the information input unit 101. Is calculated and supplied to the audio playback unit 22.

また、図４に示す音声再生部２２は、振幅位相制御フィルタ処理部６１−１乃至振幅位相制御フィルタ処理部６１−Ｎ、振幅位相制御フィルタ処理部１３１−１乃至振幅位相制御フィルタ処理部１３１−Ｎ、加算部１３２−１乃至加算部１３２−Ｎ、D/A変換器６２−１乃至D/A変換器６２−Ｎ、および音源装置６３−１乃至音源装置６３−Ｎを有している。 Also, the audio reproduction unit 22 shown in FIG. 4 includes an amplitude phase control filter processing unit 61-1 through an amplitude phase control filter processing unit 61-N, an amplitude phase control filter processing unit 131-1 through an amplitude phase control filter processing unit 131-. N, an adding unit 132-1 to an adding unit 132-N, a D / A converter 62-1 to a D / A converter 62-N, and a sound source device 63-1 to a sound source device 63-N.

図４に示す音声再生部２２の構成は、新たに振幅位相制御フィルタ処理部１３１−１乃至振幅位相制御フィルタ処理部１３１−Ｎ、および加算部１３２−１乃至加算部１３２−Ｎが設けられている点で図１の音声再生部２２と異なり、その他の点では図１の音声再生部２２と同じ構成となっている。 The configuration of the audio reproduction unit 22 shown in FIG. 4 is newly provided with an amplitude phase control filter processing unit 131-1 through amplitude phase control filter processing unit 131-N and an addition unit 132-1 through addition unit 132-N. 1 is different from the audio reproduction unit 22 in FIG. 1 in other respects and has the same configuration as the audio reproduction unit 22 in FIG.

図４の音声再生部２２では、振幅位相制御フィルタ処理部６１−１乃至振幅位相制御フィルタ処理部６１−Ｎは、供給された入力IP1の入力音響信号SG1に対して、フィルタ係数計算部３７から供給された入力フィルタを用いたフィルタ処理を施し、その結果得られた出力音響信号を加算部１３２−１乃至加算部１３２−Ｎに供給する。 In the audio reproduction unit 22 of FIG. 4, the amplitude phase control filter processing unit 61-1 to the amplitude phase control filter processing unit 61-N apply the input acoustic signal SG1 of the input IP1 from the filter coefficient calculation unit 37. Filter processing using the supplied input filter is performed, and the output acoustic signal obtained as a result is supplied to the adder 132-1 to adder 132-N.

これに対して、振幅位相制御フィルタ処理部１３１−１乃至振幅位相制御フィルタ処理部１３１−Ｎは、供給された入力IP2の入力音響信号SG2に対して、フィルタ係数計算部１０２から供給された入力フィルタを用いたフィルタ処理を施し、その結果得られた出力音響信号を加算部１３２−１乃至加算部１３２−Ｎに供給する。 On the other hand, the amplitude phase control filter processing unit 131-1 through the amplitude phase control filter processing unit 131-N receive the input supplied from the filter coefficient calculation unit 102 with respect to the input acoustic signal SG2 of the supplied input IP2. Filter processing using a filter is performed, and an output acoustic signal obtained as a result is supplied to the adder 132-1 to adder 132-N.

加算部１３２−１乃至加算部１３２−Ｎは、それぞれ振幅位相制御フィルタ処理部６１−１乃至振幅位相制御フィルタ処理部６１−Ｎから供給された出力音響信号と、振幅位相制御フィルタ処理部１３１−１乃至振幅位相制御フィルタ処理部１３１−Ｎから供給された出力音響信号とを加算して、D/A変換器６２−１乃至D/A変換器６２−Ｎに供給する。 The adding units 132-1 to 132-N respectively output acoustic signals supplied from the amplitude phase control filter processing unit 61-1 to the amplitude phase control filter processing unit 61-N and the amplitude phase control filter processing unit 131-. 1 to the output acoustic signal supplied from the amplitude / phase control filter processor 131-N are added and supplied to the D / A converter 62-1 to D / A converter 62-N.

なお、以下、振幅位相制御フィルタ処理部１３１−１乃至振幅位相制御フィルタ処理部１３１−Ｎを特に区別する必要のない場合、単に振幅位相制御フィルタ処理部１３１とも称することとする。また、以下、加算部１３２−１乃至加算部１３２−Ｎを特に区別する必要のない場合、単に加算部１３２とも称することとする。 Hereinafter, the amplitude phase control filter processing unit 131-1 to the amplitude phase control filter processing unit 131 -N are also simply referred to as the amplitude phase control filter processing unit 131 when it is not necessary to distinguish between them. Hereinafter, the addition units 132-1 to 132-N are also simply referred to as the addition unit 132 when it is not necessary to distinguish between them.

〈音声再生処理の説明〉
続いて、図４に示した音響システム１１の動作について説明する。 <Description of audio playback processing>
Next, the operation of the acoustic system 11 shown in FIG. 4 will be described.

まず、音響システム１１は、入力IP1と入力IP2の音声を再生する前に、事前にそれらの入力IP1についての入力フィルタと、入力IP2についての入力フィルタとを計算して求めておく。このとき、音響システム１１は、入力IP1と入力IP2について、それぞれ図２を参照して説明したフィルタ係数算出処理を行う。 First, the acoustic system 11 calculates and obtains an input filter for the input IP1 and an input filter for the input IP2 in advance before reproducing the audio of the input IP1 and the input IP2. At this time, the acoustic system 11 performs the filter coefficient calculation process described with reference to FIG. 2 for the input IP1 and the input IP2.

ここで、入力IP1についての入力フィルタを求めるフィルタ係数算出処理では、第１の実施の形態における場合と同様に、情報入力部３６によりステップＳ１１乃至ステップＳ１３の処理が行われ、ステップＳ１９の処理はフィルタ係数計算部３７により行われる。この場合、情報入力部３６のエリア設定部４１は、エリアR12を聴取エリアとし、エリアR11、およびエリアR13乃至エリアR15を非聴取エリアとして制御点の設定を行う。 Here, in the filter coefficient calculation process for obtaining the input filter for the input IP1, as in the case of the first embodiment, the process from step S11 to step S13 is performed by the information input unit 36, and the process of step S19 is performed. This is performed by the filter coefficient calculation unit 37. In this case, the area setting unit 41 of the information input unit 36 sets control points with the area R12 as a listening area and the areas R11 and R13 to R15 as non-listening areas.

これに対して、入力IP2についての入力フィルタを求めるフィルタ係数算出処理では、情報入力部１０１のエリア設定部１１１、目的音圧設定部１１２、および許容誤差設定部１１３により、ステップＳ１１、ステップＳ１２、およびステップＳ１３の処理が行われる。このとき、ステップＳ１１では、エリア設定部１１１は、エリアR14を聴取エリアとし、エリアR11乃至エリアR13、およびエリアR15を非聴取エリアとして制御点の設定を行う。 In contrast, in the filter coefficient calculation process for obtaining the input filter for the input IP2, the area setting unit 111, the target sound pressure setting unit 112, and the allowable error setting unit 113 of the information input unit 101 perform steps S11, S12, And the process of step S13 is performed. At this time, in step S11, the area setting unit 111 sets a control point with the area R14 as a listening area and the areas R11 to R13 and the area R15 as non-listening areas.

また、入力IP2についてのフィルタ係数算出処理が行われる前に、入力IP1についてのフィルタ係数算出処理が行われた場合には、既に伝達関数H_n(r)が得られているので、入力IP2についてのフィルタ係数算出処理では、ステップＳ１４乃至ステップＳ１８の処理は行われなくてもよい。 In addition, when the filter coefficient calculation process for the input IP1 is performed before the filter coefficient calculation process for the input IP2, the transfer function H _n (r) has already been obtained. In the filter coefficient calculation process of step S14 to step S18, the process may not be performed.

さらに、入力IP2についてのフィルタ係数算出処理では、フィルタ係数計算部１０２によりステップＳ１９の処理が行われ、入力IP2についての入力フィルタ係数W_nが算出され、振幅位相制御フィルタ処理部１３１に供給される。 Furthermore, the filter coefficient calculation processing for the input IP2, the process of step S19 by the filter coefficient calculation unit 102 is performed, the input filter coefficients W _n for the input IP2 is calculated and supplied to the amplitude phase control filter processing unit 131 .

このようにして入力IP1についての入力フィルタと、入力IP2についての入力フィルタが得られると、その後、任意のタイミングで音響システム１１は、入力IP1の音声と入力IP2の音声を再生する音声再生処理を行う。 When the input filter for the input IP1 and the input filter for the input IP2 are obtained in this way, the acoustic system 11 then performs a sound reproduction process for reproducing the sound of the input IP1 and the sound of the input IP2 at an arbitrary timing. Do.

以下、図５のフローチャートを参照して、図４に示す音響システム１１により行われる音声再生処理について説明する。 Hereinafter, the sound reproduction process performed by the acoustic system 11 shown in FIG. 4 will be described with reference to the flowchart of FIG.

ステップＳ７１において、音声再生部２２は、入力ごとに入力音響信号に対してフィルタ処理を施す。 In step S71, the sound reproducing unit 22 performs a filtering process on the input acoustic signal for each input.

すなわち、振幅位相制御フィルタ処理部６１は、供給された入力IP1の入力音響信号SG1に対して、フィルタ係数計算部３７から供給された入力フィルタを用いたフィルタ処理を施し、その結果得られた出力音響信号を加算部１３２に供給する。なお、振幅位相制御フィルタ処理部６１により行われるフィルタ処理は、図３のステップＳ４１において行われるフィルタ処理と同様であるので、その説明は省略する。 That is, the amplitude phase control filter processing unit 61 performs the filtering process using the input filter supplied from the filter coefficient calculation unit 37 on the input acoustic signal SG1 of the supplied input IP1, and the output obtained as a result thereof The acoustic signal is supplied to the adding unit 132. The filter process performed by the amplitude / phase control filter processing unit 61 is the same as the filter process performed in step S41 of FIG.

同様に、振幅位相制御フィルタ処理部１３１は、供給された入力IP2の入力音響信号SG2に対して、フィルタ係数計算部１０２から供給された入力フィルタを用いたフィルタ処理を施し、その結果得られた出力音響信号を加算部１３２に供給する。なお、振幅位相制御フィルタ処理部１３１により行われるフィルタ処理は、振幅位相制御フィルタ処理部６１により行われるフィルタ処理と同様であるので、その説明は省略する。 Similarly, the amplitude / phase control filter processing unit 131 performs a filter process using the input filter supplied from the filter coefficient calculation unit 102 on the input acoustic signal SG2 of the supplied input IP2, and obtained as a result. The output acoustic signal is supplied to the adding unit 132. Note that the filter processing performed by the amplitude phase control filter processing unit 131 is the same as the filter processing performed by the amplitude phase control filter processing unit 61, and thus the description thereof is omitted.

ステップＳ７２において、加算部１３２は、振幅位相制御フィルタ処理部６１から供給された入力IP1の出力音響信号と、振幅位相制御フィルタ処理部１３１から供給された入力IP2の出力音響信号とを加算して、D/A変換器６２に供給する。 In step S72, the adding unit 132 adds the output acoustic signal of the input IP1 supplied from the amplitude / phase control filter processing unit 61 and the output acoustic signal of the input IP2 supplied from the amplitude / phase control filter processing unit 131. To the D / A converter 62.

入力IP1の出力音響信号と入力IP2の出力音響信号が加算されると、その後、ステップＳ７３およびステップＳ７４の処理が行われて音声再生処理は終了するが、これらの処理は図３のステップＳ４２およびステップＳ４３の処理と同様であるので、その説明は省略する。 When the output acoustic signal of the input IP1 and the output acoustic signal of the input IP2 are added, the processes of step S73 and step S74 are performed thereafter, and the sound reproduction process is terminated. These processes are performed in steps S42 and S42 of FIG. Since it is the same as the process of step S43, the description is abbreviate | omitted.

このようにして出力音響信号に基づく音声が再生されると、再生空間内のエリアR12では入力IP1の音声を聴取することができ、エリアR14では入力IP2の音声を聴取することができるようになる。また、エリアR11、エリアR13、およびエリアR15では、音声を聴取することができない。 When the sound based on the output acoustic signal is reproduced in this way, the sound of the input IP1 can be heard in the area R12 in the reproduction space, and the sound of the input IP2 can be heard in the area R14. . In addition, in area R11, area R13, and area R15, it is impossible to listen to audio.

以上のようにして、音響システム１１は、入力ごとに事前に算出された入力フィルタを用いて入力音響信号にフィルタ処理を施し、その結果得られた各入力の出力音響信号を加算して音声を再生する。このように、入力ごとに入力フィルタを用いて入力音響信号にフィルタ処理を施すことで、より空間的なばらつきの少ない音圧分布の音声を得ることができるとともに、より高い分離性能を得ることができる。 As described above, the acoustic system 11 performs the filtering process on the input acoustic signal using the input filter calculated in advance for each input, and adds the output acoustic signals of the respective inputs obtained as a result to generate the sound. Reproduce. As described above, by performing the filtering process on the input acoustic signal using the input filter for each input, it is possible to obtain sound with a sound pressure distribution with less spatial variation and to obtain higher separation performance. it can.

〈ユースケース１〉
また、図４に示した音響システム１１のユースケースとして、様々なケースが考えられる。具体的には、例えば図６に示すように、画像提示装置１８１でコンテンツの映像を再生しつつ、スピーカアレイ１８２でコンテンツの音声を再生する場合に音響システム１１を適用することができる。 <Use Case 1>
Various cases are conceivable as use cases of the acoustic system 11 shown in FIG. Specifically, for example, as shown in FIG. 6, the acoustic system 11 can be applied when the audio of the content is reproduced by the speaker array 182 while the video of the content is reproduced by the image presentation device 181.

図６では、音響システム１１は図示されていないが、スピーカアレイ１８２を構成する各スピーカが音源装置６３に対応し、D/A変換器６２からの出力音響信号がスピーカアレイ１８２に供給される。 In FIG. 6, the acoustic system 11 is not shown, but each speaker constituting the speaker array 182 corresponds to the sound source device 63, and an output acoustic signal from the D / A converter 62 is supplied to the speaker array 182.

例えば、テレビジョン放送における音声多重放送や映画など、言語1の主音声と言語2の副音声が含まれる場合がある。通常は、それらの2つの音声を同時に再生すると主音声と副音声が混ざり合って聞こえてしまうため、言語1を解する人と言語2を解する人が同じ部屋でコンテンツを試聴することは困難であった。 For example, there may be cases where a primary audio of language 1 and a secondary audio of language 2 are included, such as audio multiplex broadcasting in television broadcasting and movies. Normally, when those two sounds are played back at the same time, the main sound and sub sound will be mixed and heard, so it is difficult for people who understand language 1 and people who understand language 2 to listen to the content in the same room. Met.

これに対して、図４に示した音響システム１１を利用すれば、再生空間に主音声のみの聴取エリアと、副音声のみの聴取エリアとを設けることができる。したがって、言語1を解する人のいる場所を主音声の聴取エリアかつ副音声の非聴取エリアとし、言語2を解する人のいる場所を副音声の聴取エリアかつ主音声の非聴取エリアとすれば、互いに相手の言語の音声に邪魔されることなくコンテンツを楽しむことができる。 On the other hand, if the acoustic system 11 shown in FIG. 4 is used, it is possible to provide a listening area for only the main sound and a listening area for only the sub sound in the reproduction space. Therefore, the place where the person who understands language 1 is the main sound listening area and the sub sound non-listening area, and the place where the person who understands language 2 is the sub sound listening area and the main sound non-listening area. Thus, the user can enjoy the content without being disturbed by the voice of the other party's language.

この例では、再生空間がエリアR21乃至エリアR25の5つのエリアに分けられている。いま、言語1を解するユーザがエリアR22におり、言語2を解するユーザがエリアR24にいるとする。そのような場合、エリアR22を主音声の聴取エリアかつ副音声の非聴取エリアとし、エリアR24を副音声の聴取エリアかつ主音声の非聴取エリアとして、音響システム１１において、それぞれ主音声の入力フィルタと副音声の入力フィルタを算出すればよい。この場合、エリアR21、エリアR23、およびエリアR25は、非聴取エリアや非限定エリアとすればよい。 In this example, the reproduction space is divided into five areas R21 to R25. It is assumed that a user who understands language 1 is in area R22 and a user who understands language 2 is in area R24. In such a case, in the acoustic system 11, each of the input filters for the main sound is set as the area R22 as the main sound listening area and the sub sound non-listening area, and the area R24 as the sub sound listening area and the main sound non-listening area. And a sub audio input filter may be calculated. In this case, the area R21, the area R23, and the area R25 may be non-listening areas or non-limited areas.

〈ユースケース２〉
また、図４に示した音響システム１１のユースケースとして、例えば図７に示すケースも考えられる。 <Use Case 2>
Further, as a use case of the acoustic system 11 shown in FIG. 4, for example, a case shown in FIG.

例えば、博物館や展示会などにおいて各映像や画像に対応した複数の音声を同一の部屋で同時に再生するような場合、所定の場所で再生した音声が別の場所で再生する音声の邪魔となってしまうことが生じ得る。 For example, when a plurality of sounds corresponding to each video and image are reproduced simultaneously in the same room in a museum or an exhibition, the sound reproduced at a predetermined place obstructs the sound reproduced at another place. Can occur.

そのような場合、図４に示した音響システム１１を利用すれば、それらの複数の音声を互いの音声が他の音声に邪魔されるようなことなく、再生することが可能となる。 In such a case, if the acoustic system 11 shown in FIG. 4 is used, the plurality of sounds can be reproduced without the mutual sounds being disturbed by other sounds.

図７では、画像提示装置２１１でコンテンツC1の映像が再生されるとともに、画像提示装置２１２でコンテンツC2の映像が再生され、さらにスピーカアレイ２１３でコンテンツC1の音声およびコンテンツC2の音声が再生される。 In FIG. 7, the video of the content C1 is reproduced by the image presentation device 211, the video of the content C2 is reproduced by the image presentation device 212, and the audio of the content C1 and the audio of the content C2 are reproduced by the speaker array 213. .

また、図７では音響システム１１は図示されていないが、スピーカアレイ２１３を構成する各スピーカが音源装置６３に対応し、D/A変換器６２からの出力音響信号がスピーカアレイ２１３に供給される。 Although the acoustic system 11 is not illustrated in FIG. 7, each speaker constituting the speaker array 213 corresponds to the sound source device 63, and an output acoustic signal from the D / A converter 62 is supplied to the speaker array 213. .

この例では、再生空間がエリアR31乃至エリアR35の5つのエリアに分けられている。そして、エリアR32をコンテンツC1の音声の聴取エリア、かつコンテンツC2の音声の非聴取エリアとし、エリアR34をコンテンツC2の音声の聴取エリア、かつコンテンツC1の音声の非聴取エリアとして、コンテンツC1とコンテンツC2の入力フィルタを算出すればよい。この場合、エリアR31、エリアR33、およびエリアR35は、非聴取エリアや非限定エリアとすればよい。 In this example, the reproduction space is divided into five areas R31 to R35. Then, the content C1 and the content C1 and the content C1 are listened to as the area R32 as the audio listening area for the content C2 and the content C2 as the audio non-listening area, and the area R34 as the content C2 as the audio listening area. What is necessary is just to calculate the input filter of C2. In this case, the area R31, the area R33, and the area R35 may be non-listening areas or non-limited areas.

このようにすることで、コンテンツC1の映像が再生される画像提示装置２１１の正面、つまりエリアR32にいるユーザは、コンテンツC1の音声のみを聴取することができる。同様に、コンテンツC2の映像が再生される画像提示装置２１２の正面、つまりエリアR34にいるユーザは、コンテンツC2の音声のみを聴取することができる。 By doing in this way, the user in the front of the image presentation device 211 where the video of the content C1 is reproduced, that is, the user in the area R32 can listen only to the sound of the content C1. Similarly, the user in the front of the image presentation device 212 where the video of the content C2 is reproduced, that is, the user in the area R34 can listen only to the audio of the content C2.

ところで、上述した一連の処理は、ハードウェアにより実行することもできるし、ソフトウェアにより実行することもできる。一連の処理をソフトウェアにより実行する場合には、そのソフトウェアを構成するプログラムが、コンピュータにインストールされる。ここで、コンピュータには、専用のハードウェアに組み込まれているコンピュータや、各種のプログラムをインストールすることで、各種の機能を実行することが可能な、例えば汎用のパーソナルコンピュータなどが含まれる。 By the way, the above-described series of processing can be executed by hardware or can be executed by software. When a series of processing is executed by software, a program constituting the software is installed in the computer. Here, the computer includes, for example, a general-purpose personal computer capable of executing various functions by installing a computer incorporated in dedicated hardware and various programs.

図８は、上述した一連の処理をプログラムにより実行するコンピュータのハードウェアの構成例を示すブロック図である。 FIG. 8 is a block diagram illustrating an example of a hardware configuration of a computer that executes the above-described series of processes using a program.

コンピュータにおいて、CPU（Central Processing Unit）５０１，ROM（Read Only Memory）５０２，RAM（Random Access Memory）５０３は、バス５０４により相互に接続されている。 In a computer, a CPU (Central Processing Unit) 501, a ROM (Read Only Memory) 502, and a RAM (Random Access Memory) 503 are connected to each other by a bus 504.

バス５０４には、さらに、入出力インターフェース５０５が接続されている。入出力インターフェース５０５には、入力部５０６、出力部５０７、記録部５０８、通信部５０９、及びドライブ５１０が接続されている。 An input / output interface 505 is further connected to the bus 504. An input unit 506, an output unit 507, a recording unit 508, a communication unit 509, and a drive 510 are connected to the input / output interface 505.

入力部５０６は、キーボード、マウス、マイクロフォン、撮像素子などよりなる。出力部５０７は、ディスプレイ、スピーカなどよりなる。記録部５０８は、ハードディスクや不揮発性のメモリなどよりなる。通信部５０９は、ネットワークインターフェースなどよりなる。ドライブ５１０は、磁気ディスク、光ディスク、光磁気ディスク、又は半導体メモリなどのリムーバブルメディア５１１を駆動する。 The input unit 506 includes a keyboard, a mouse, a microphone, an image sensor, and the like. The output unit 507 includes a display, a speaker, and the like. The recording unit 508 includes a hard disk, a nonvolatile memory, and the like. The communication unit 509 includes a network interface or the like. The drive 510 drives a removable medium 511 such as a magnetic disk, an optical disk, a magneto-optical disk, or a semiconductor memory.

以上のように構成されるコンピュータでは、CPU５０１が、例えば、記録部５０８に記録されているプログラムを、入出力インターフェース５０５及びバス５０４を介して、RAM５０３にロードして実行することにより、上述した一連の処理が行われる。 In the computer configured as described above, the CPU 501 loads the program recorded in the recording unit 508 to the RAM 503 via the input / output interface 505 and the bus 504 and executes the program, for example. Is performed.

コンピュータ（CPU５０１）が実行するプログラムは、例えば、パッケージメディア等としてのリムーバブルメディア５１１に記録して提供することができる。また、プログラムは、ローカルエリアネットワーク、インターネット、デジタル衛星放送といった、有線または無線の伝送媒体を介して提供することができる。 The program executed by the computer (CPU 501) can be provided by being recorded on a removable medium 511 as a package medium or the like, for example. The program can be provided via a wired or wireless transmission medium such as a local area network, the Internet, or digital satellite broadcasting.

コンピュータでは、プログラムは、リムーバブルメディア５１１をドライブ５１０に装着することにより、入出力インターフェース５０５を介して、記録部５０８にインストールすることができる。また、プログラムは、有線または無線の伝送媒体を介して、通信部５０９で受信し、記録部５０８にインストールすることができる。その他、プログラムは、ROM５０２や記録部５０８に、あらかじめインストールしておくことができる。 In the computer, the program can be installed in the recording unit 508 via the input / output interface 505 by attaching the removable medium 511 to the drive 510. Further, the program can be received by the communication unit 509 via a wired or wireless transmission medium and installed in the recording unit 508. In addition, the program can be installed in the ROM 502 or the recording unit 508 in advance.

なお、コンピュータが実行するプログラムは、本明細書で説明する順序に沿って時系列に処理が行われるプログラムであっても良いし、並列に、あるいは呼び出しが行われたとき等の必要なタイミングで処理が行われるプログラムであっても良い。 The program executed by the computer may be a program that is processed in time series in the order described in this specification, or in parallel or at a necessary timing such as when a call is made. It may be a program for processing.

また、本技術の実施の形態は、上述した実施の形態に限定されるものではなく、本技術の要旨を逸脱しない範囲において種々の変更が可能である。 The embodiments of the present technology are not limited to the above-described embodiments, and various modifications can be made without departing from the gist of the present technology.

例えば、本技術は、１つの機能をネットワークを介して複数の装置で分担、共同して処理するクラウドコンピューティングの構成をとることができる。 For example, the present technology can take a configuration of cloud computing in which one function is shared by a plurality of devices via a network and is jointly processed.

また、上述のフローチャートで説明した各ステップは、１つの装置で実行する他、複数の装置で分担して実行することができる。 In addition, each step described in the above flowchart can be executed by being shared by a plurality of apparatuses in addition to being executed by one apparatus.

さらに、１つのステップに複数の処理が含まれる場合には、その１つのステップに含まれる複数の処理は、１つの装置で実行する他、複数の装置で分担して実行することができる。 Further, when a plurality of processes are included in one step, the plurality of processes included in the one step can be executed by being shared by a plurality of apparatuses in addition to being executed by one apparatus.

さらに、本技術は、以下の構成とすることも可能である。 Furthermore, this technique can also be set as the following structures.

［１］
聴取エリア内にいくつかの制御点を設定するとともに、非聴取エリア内にいくつかの制御点を設定するエリア設定部と、
前記聴取エリア内の各前記制御点の目的音圧を設定する目的音圧設定部と、
前記目的音圧の許容誤差を設定する許容誤差設定部と、
全前記制御点の合計個数未満の所定個数の各音源について、前記音源と各前記制御点のそれぞれとの間の伝達関数に基づいて、前記聴取エリア内の前記制御点の音圧の前記目的音圧に対する誤差が前記許容誤差の範囲内に収まるという条件下で、前記非聴取エリア内の前記制御点の音圧を最小化する、前記音源に対応するフィルタのフィルタ係数を算出するフィルタ係数計算部と
を備える音声処理装置。
［２］
前記フィルタ係数計算部は、前記フィルタのゲインが所定の上限値を超えないという制約条件をさらに加えて前記フィルタ係数を算出する
［１］に記載の音声処理装置。
［３］
前記エリア設定部は、前記聴取エリアおよび前記非聴取エリアとは異なるその他のエリアにいくつかの制御点を設定し、
前記フィルタ係数計算部は、前記その他のエリア内の前記制御点の音圧が、所定の最大音圧を超えないという制約条件をさらに加えて前記フィルタ係数を算出する
［１］または［２］に記載の音声処理装置。
［４］
前記最大音圧は、前記聴取エリア内の前記制御点の前記目的音圧の最小値に基づいて定められる
［３］に記載の音声処理装置。
［５］
前記フィルタ係数計算部は、線型計画問題を解くことにより、前記フィルタ係数を算出する
［１］乃至［４］の何れか一項に記載の音声処理装置。
［６］
聴取エリア内にいくつかの制御点を設定するとともに、非聴取エリア内にいくつかの制御点を設定し、
前記聴取エリア内の各前記制御点の目的音圧を設定し、
前記目的音圧の許容誤差を設定し、
全前記制御点の合計個数未満の所定個数の各音源について、前記音源と各前記制御点のそれぞれとの間の伝達関数に基づいて、前記聴取エリア内の前記制御点の音圧の前記目的音圧に対する誤差が前記許容誤差の範囲内に収まるという条件下で、前記非聴取エリア内の前記制御点の音圧を最小化する、前記音源に対応するフィルタのフィルタ係数を算出する
ステップを含む音声処理方法。
［７］
聴取エリア内にいくつかの制御点を設定するとともに、非聴取エリア内にいくつかの制御点を設定し、
前記聴取エリア内の各前記制御点の目的音圧を設定し、
前記目的音圧の許容誤差を設定し、
全前記制御点の合計個数未満の所定個数の各音源について、前記音源と各前記制御点のそれぞれとの間の伝達関数に基づいて、前記聴取エリア内の前記制御点の音圧の前記目的音圧に対する誤差が前記許容誤差の範囲内に収まるという条件下で、前記非聴取エリア内の前記制御点の音圧を最小化する、前記音源に対応するフィルタのフィルタ係数を算出する
ステップを含む処理をコンピュータに実行させるプログラム。 [1]
An area setting unit for setting some control points in the listening area and setting some control points in the non-listening area;
A target sound pressure setting unit for setting a target sound pressure of each of the control points in the listening area;
An allowable error setting unit for setting an allowable error of the target sound pressure;
For a predetermined number of sound sources less than the total number of all control points, the target sound of the sound pressure of the control points in the listening area is based on the transfer function between the sound source and each of the control points. A filter coefficient calculation unit for calculating a filter coefficient of a filter corresponding to the sound source that minimizes the sound pressure of the control point in the non-listening area under a condition that an error with respect to pressure falls within the allowable error range. And a voice processing device.
[2]
The sound processing apparatus according to [1], wherein the filter coefficient calculation unit further calculates a filter coefficient by further adding a constraint condition that a gain of the filter does not exceed a predetermined upper limit value.
[3]
The area setting unit sets some control points in other areas different from the listening area and the non-listening area,
The filter coefficient calculation unit calculates the filter coefficient by further adding a constraint that the sound pressure at the control point in the other area does not exceed a predetermined maximum sound pressure. [1] or [2] The speech processing apparatus according to the description.
[4]
The sound processing device according to [3], wherein the maximum sound pressure is determined based on a minimum value of the target sound pressure at the control point in the listening area.
[5]
The speech processing apparatus according to any one of [1] to [4], wherein the filter coefficient calculation unit calculates the filter coefficient by solving a linear programming problem.
[6]
Set some control points in the listening area and set some control points in the non-listening area,
Set the target sound pressure of each control point in the listening area,
Set the target sound pressure tolerance,
For a predetermined number of sound sources less than the total number of all control points, the target sound of the sound pressure of the control points in the listening area is based on the transfer function between the sound source and each of the control points. A sound including a step of calculating a filter coefficient of a filter corresponding to the sound source that minimizes a sound pressure at the control point in the non-listening area under a condition that an error with respect to pressure falls within the allowable error range. Processing method.
[7]
Set some control points in the listening area and set some control points in the non-listening area,
Set the target sound pressure of each control point in the listening area,
Set the target sound pressure tolerance,
For a predetermined number of sound sources less than the total number of all control points, the target sound of the sound pressure of the control points in the listening area is based on the transfer function between the sound source and each of the control points. A process including a step of calculating a filter coefficient of a filter corresponding to the sound source that minimizes the sound pressure of the control point in the non-listening area under a condition that an error with respect to pressure falls within the allowable error range. A program that causes a computer to execute.

１１音響システム，２１フィルタ計算部，２２音声再生部，３３−１乃至３３−Ｎ，３３音源装置，３４−１乃至３４−Ｎ，３４音響センサ，３５伝達関数計算器，３６情報入力部，３７フィルタ係数計算部，４１エリア設定部，４２目的音圧設定部，４３許容誤差設定部，６１−１乃至６１−Ｎ，６１振幅位相制御フィルタ処理部，６３−１乃至６３−Ｎ，６３音源装置，１０１情報入力部，１０２フィルタ係数計算部，１１１エリア設定部，１１２目的音圧設定部，１１３許容誤差設定部，１３１−１乃至１３１−Ｎ，１３１振幅位相制御フィルタ処理部，１３２−１乃至１３２−Ｎ，１３２加算部 DESCRIPTION OF SYMBOLS 11 Sound system, 21 Filter calculation part, 22 Sound reproduction part, 33-1 thru | or 33-N, 33 Sound source device, 34-1 thru | or 34-N, 34 Acoustic sensor, 35 Transfer function calculator, 36 Information input part, 37 Filter coefficient calculation unit, 41 area setting unit, 42 target sound pressure setting unit, 43 tolerance setting unit, 61-1 to 61-N, 61 amplitude phase control filter processing unit, 63-1 to 63-N, 63 sound source device , 101 Information input unit, 102 Filter coefficient calculation unit, 111 Area setting unit, 112 Target sound pressure setting unit, 113 Allowable error setting unit, 131-1 to 131 -N, 131 Amplitude phase control filter processing unit, 132-1 to 132-N, 132 adder

Claims

An area setting unit for setting some control points in the listening area and setting some control points in the non-listening area;
A target sound pressure setting unit for setting a target sound pressure of each of the control points in the listening area;
An allowable error setting unit for setting an allowable error of the target sound pressure;
For a predetermined number of sound sources less than the total number of all control points, the target sound of the sound pressure of the control points in the listening area is based on the transfer function between the sound source and each of the control points. A filter coefficient calculation unit for calculating a filter coefficient of a filter corresponding to the sound source that minimizes the sound pressure of the control point in the non-listening area under a condition that an error with respect to pressure falls within the allowable error range. And a voice processing device.

The audio processing apparatus according to claim 1, wherein the filter coefficient calculation unit calculates the filter coefficient by further adding a constraint that the gain of the filter does not exceed a predetermined upper limit value.

The area setting unit sets some control points in other areas different from the listening area and the non-listening area,
The audio processing according to claim 1, wherein the filter coefficient calculation unit further calculates the filter coefficient by further adding a constraint that a sound pressure at the control point in the other area does not exceed a predetermined maximum sound pressure. apparatus.

The sound processing apparatus according to claim 3, wherein the maximum sound pressure is determined based on a minimum value of the target sound pressure at the control point in the listening area.

The speech processing apparatus according to claim 1, wherein the filter coefficient calculation unit calculates the filter coefficient by solving a linear planning problem.

Set some control points in the listening area and set some control points in the non-listening area,
Set the target sound pressure of each control point in the listening area,
Set the target sound pressure tolerance,
For a predetermined number of sound sources less than the total number of all control points, the target sound of the sound pressure of the control points in the listening area is based on the transfer function between the sound source and each of the control points. A sound including a step of calculating a filter coefficient of a filter corresponding to the sound source that minimizes a sound pressure at the control point in the non-listening area under a condition that an error with respect to pressure falls within the allowable error range. Processing method.

Set some control points in the listening area and set some control points in the non-listening area,
Set the target sound pressure of each control point in the listening area,
Set the target sound pressure tolerance,
For a predetermined number of sound sources less than the total number of all control points, the target sound of the sound pressure of the control points in the listening area is based on the transfer function between the sound source and each of the control points. A process including a step of calculating a filter coefficient of a filter corresponding to the sound source that minimizes the sound pressure of the control point in the non-listening area under a condition that an error with respect to pressure falls within the allowable error range. A program that causes a computer to execute.