JP2009065452A

JP2009065452A - Sound image localization control device, sound image localization control method, program, and integrated circuit

Info

Publication number: JP2009065452A
Application number: JP2007231561A
Authority: JP
Inventors: Ko Mizuno; 耕水野
Original assignee: Panasonic Corp
Current assignee: Panasonic Corp
Priority date: 2007-09-06
Filing date: 2007-09-06
Publication date: 2009-03-26

Abstract

【課題】制御処理負担の軽減、および回路の小規模化を実現しつつ、音像の大きさや明瞭度を調整することが可能な音像定位制御装置を提供する。
【解決手段】本発明に係る音像定位制御装置は、複数のスピーカから音を出力して各スピーカとは異なる位置に音像を定位させる音像定位制御装置であって、複数のスピーカそれぞれに対応して設けられ、予め設定された１つ以上の処理パラメータに従って、音像が所定の位置に定位するように入力音響信号を処理し、対応するスピーカへ出力する複数の音像定位手段と、各音像定位手段に予め設定された処理パラメータの少なくとも１つを、ユーザーの所望する音像の大きさまたは音像の明瞭度に応じた処理パラメータに変更する処理パラメータ変更手段とを備える。
【選択図】図１A sound image localization control device capable of adjusting the size and clarity of a sound image while reducing the control processing burden and reducing the circuit scale.
A sound image localization control device according to the present invention is a sound image localization control device that outputs sound from a plurality of speakers and localizes the sound image at a position different from each speaker, and corresponds to each of the plurality of speakers. A plurality of sound image localization means for processing the input acoustic signal so that the sound image is localized at a predetermined position according to one or more processing parameters set in advance and outputting the sound image to a corresponding speaker; and for each sound image localization means And processing parameter changing means for changing at least one of the preset processing parameters into a processing parameter corresponding to the size of the sound image desired by the user or the clarity of the sound image.
[Selection] Figure 1

Description

本発明は、音像定位制御装置、音像定位制御方法、プログラム、および集積回路に関し、より特定的には、音を出力する複数のスピーカを用いて各スピーカとは異なる位置に音像を定位させる音像定位制御装置、音像定位制御方法、プログラム、および集積回路に関する。 The present invention relates to a sound image localization control device, a sound image localization control method, a program, and an integrated circuit, and more specifically, a sound image localization that uses a plurality of speakers that output sound to localize a sound image at a position different from each speaker. The present invention relates to a control device, a sound image localization control method, a program, and an integrated circuit.

ユーザーがＤＶＤ等のマルチチャンネル音声を含むコンテンツを視聴する場合、居住スペースの制約の問題から、ユーザーの前方に設置した２つのスピーカで音響再生を行うことが多い。このような状況において、より高い臨場感をユーザーに与えるべく、マルチチャンネル音声それぞれを音像として所定位置に定位させる制御技術、いわゆるバーチャルサラウンド技術が広く実用化されている。通常、バーチャルサラウンド技術は、ユーザーの両耳における音響伝達関数が所定特性となるように、入力音響信号の振幅と位相をデジタルフィルタ処理を施して調整することによって、音像を所定位置に定位させている。 When a user views content including multi-channel audio such as a DVD, sound reproduction is often performed with two speakers installed in front of the user due to the problem of the limitation of living space. In such a situation, in order to give the user a higher sense of realism, a control technique that localizes each multi-channel sound as a sound image at a predetermined position, so-called virtual surround technique, has been widely put into practical use. In general, virtual surround technology localizes a sound image at a predetermined position by adjusting the amplitude and phase of an input acoustic signal by applying a digital filter process so that the acoustic transfer function in the user's both ears has a predetermined characteristic. Yes.

また、音像を所定位置に定位させるだけでなく、音像に距離感をもたせる制御技術も提案されている。この制御技術によれば、デジタルフィルタ処理後の出力信号に複数の遅延信号を加算する。これにより、非遅延信号である出力信号が直接音となり、複数の遅延信号が模擬的な間接音（反射音）となるため、ユーザーは、音像の距離感を感じることができる。 In addition, a control technique has been proposed that not only localizes a sound image at a predetermined position but also gives the sound image a sense of distance. According to this control technique, a plurality of delay signals are added to the output signal after the digital filter processing. Thereby, since the output signal which is a non-delayed signal becomes a direct sound, and a plurality of delayed signals become a simulated indirect sound (reflected sound), the user can feel a sense of distance of the sound image.

更に近年、ユーザーの好みあるいはコンテンツの種類に応じて音像の距離感を調整する制御技術が提案されている。音像の距離感を調整する制御技術として例えば、上記遅延信号の遅延時間を調整したり、非遅延信号である直接音と遅延信号である間接音との振幅比を変更したりする制御技術が提案されている。この制御技術では、遅延時間の異なる複数の遅延信号をメモリ等の記憶装置に記憶しておく。そして、音像の距離感に応じた遅延信号を記憶装置から読み出し、読み出した遅延信号を適切なタイミング、振幅比で非遅延信号に加算する制御を行っている。 Furthermore, in recent years, a control technique for adjusting a sense of distance of a sound image according to a user's preference or content type has been proposed. For example, a control technique that adjusts the delay time of the delayed signal or changes the amplitude ratio of the direct sound that is a non-delayed signal and the indirect sound that is a delayed signal is proposed as a control technique that adjusts the sense of distance of the sound image Has been. In this control technique, a plurality of delay signals having different delay times are stored in a storage device such as a memory. Then, the delay signal corresponding to the sense of distance of the sound image is read from the storage device, and the read delay signal is controlled to be added to the non-delay signal at an appropriate timing and amplitude ratio.

また例えば、音像の距離感を調整する制御技術として、複数の位置に音像を定位させる制御技術（例えば、特許文献１など）が提案されている。以下、複数の位置に音像を定位させる音像定位制御装置の動作について、図１８を参照して説明する。図１８は、複数の位置に音像を定位させる従来の音像定位制御装置の構成を示した図である。 For example, as a control technique for adjusting the sense of distance between sound images, a control technique (for example, Patent Document 1) that localizes a sound image at a plurality of positions has been proposed. Hereinafter, the operation of the sound image localization control device that localizes the sound image at a plurality of positions will be described with reference to FIG. FIG. 18 is a diagram illustrating a configuration of a conventional sound image localization control device that localizes a sound image at a plurality of positions.

図１８において、ユーザーは、定位制御器１０の鉛直角スライダ１０ａを用いて音像定位位置の上下方向を調整し、距離スライダ１０ｂを用いてユーザーおよび音像定位位置間の距離を調整し、方向ダイアル１０ｃを用いて音像定位位置の水平方向を調整する。定位制御器１０からは、音像定位位置の上下方向角度の情報Ｓθと、ユーザーおよび音像定位位置間の距離の情報ＳＤと、音像定位位置の水平方向角度の情報Ｓφとが出力される。 In FIG. 18, the user adjusts the vertical direction of the sound image localization position using the vertical angle slider 10a of the localization controller 10, adjusts the distance between the user and the sound image localization position using the distance slider 10b, and the direction dial 10c. Use to adjust the horizontal direction of the sound image localization position. The localization controller 10 outputs information Sθ on the vertical angle of the sound image localization position, information SD on the distance between the user and the sound image localization position, and information Sφ on the horizontal angle of the sound image localization position.

所定位置に定位した音像から再生されるべき入力音響信号は、入力端子１を経てノッチフィルタ２に入力され、ノッチフィルタ２において信号処理が行われる。ここで、人間の知覚は、受聴信号の振幅周波数特性を手がかりにして、音像の上下方向の角度を判断することが知られている。ノッチフィルタ２には、所定の振幅周波数特性を実現するフィルタ係数が設定されている。所定の振幅周波数特性は、ユーザーの両耳における振幅周波数特性を上下方向角度の情報Ｓθに対応する頭部音響伝達関数の示す特性とするように入力音響信号の所定周波数成分を除去する特性となっている。このため、ノッチフィルタ２の信号処理によって、音像は、上下方向角度がＳθとなる位置に定位する。ノッチフィルタ２の出力信号は、可変ゲイン３ａ、３ｂに入力される。 An input acoustic signal to be reproduced from a sound image localized at a predetermined position is input to the notch filter 2 through the input terminal 1, and signal processing is performed in the notch filter 2. Here, it is known that human perception determines the angle in the vertical direction of a sound image by using the amplitude frequency characteristic of a listening signal as a clue. The notch filter 2 is set with a filter coefficient that realizes a predetermined amplitude frequency characteristic. The predetermined amplitude frequency characteristic is a characteristic that removes the predetermined frequency component of the input acoustic signal so that the amplitude frequency characteristic in both ears of the user is the characteristic indicated by the head acoustic transfer function corresponding to the vertical direction angle information Sθ. ing. For this reason, the sound image is localized at a position where the vertical angle is Sθ by the signal processing of the notch filter 2. The output signal of the notch filter 2 is input to the variable gains 3a and 3b.

可変ゲイン３ａおよび３ｂには、距離の情報ＳＤに対応するゲインが設定されている。ゲインは、後述する制御器１１によって与えられる。可変ゲイン３ａの出力信号は、近距離用分配器４に入力され、可変ゲイン３ｂの出力信号は、近距離用分配器４に入力される。近距離用分配器４は、１２個の可変ゲイン４ａ−１〜４ａ−１２で構成され、遠距離用分配器５は、１２個の可変ゲイン５ａ−１〜５ａ−１２で構成される。可変ゲイン４ａ−１〜４ａ−１２、５ａ−１〜５ａ−１２には、水平方向角度の情報Ｓφに対応する所定ゲインが与えられている。水平方向角度φと可変ゲイン４ａ−１〜４ａ−１２との関係を図１９に示す。図１９において、可変ゲイン４ａ−ｋ（２≦ｋ≦１２）は可変ゲイン４ａ−（ｋ−１）を図中の右方向に３０°平行移動したものとなっており、このことは図示しない他の可変ゲインについても適用される。また、水平方向角度φと可変ゲイン５ａ−１〜５ａ−１２の関係を図２０に示す。可変ゲイン５ａ−ｋ（２≦ｋ≦１２）は可変ゲイン５ａ−（ｋ−１）を図中の右方向に３０°平行移動したものとなっており、このことは図示しない他の可変ゲインについても適用される。 A gain corresponding to the distance information SD is set in the variable gains 3a and 3b. The gain is given by the controller 11 described later. The output signal of the variable gain 3 a is input to the short distance distributor 4, and the output signal of the variable gain 3 b is input to the short distance distributor 4. The short distance distributor 4 includes twelve variable gains 4a-1 to 4a-12, and the long distance distributor 5 includes twelve variable gains 5a-1 to 5a-12. The variable gains 4a-1 to 4a-12 and 5a-1 to 5a-12 are given predetermined gains corresponding to the horizontal direction angle information Sφ. FIG. 19 shows the relationship between the horizontal angle φ and the variable gains 4a-1 to 4a-12. In FIG. 19, the variable gain 4a-k (2 ≦ k ≦ 12) is obtained by translating the variable gain 4a- (k−1) 30 ° in the right direction in the drawing, and this is not shown in the figure. This also applies to the variable gain. FIG. 20 shows the relationship between the horizontal direction angle φ and the variable gains 5a-1 to 5a-12. The variable gain 5a-k (2 ≦ k ≦ 12) is obtained by translating the variable gain 5a- (k−1) by 30 ° in the right direction in the drawing, and this is about other variable gains not shown. Also applies.

近距離用分配器４の出力信号のうち可変ゲイン４ａ−１からの出力信号と、遠距離用分配器５の出力のうち可変ゲイン５ａ−１の出力信号とが加算器６ａ−１で加算される。同様に、近距離用分配器４を構成する他の１１個の可変ゲインの出力信号と、遠距離用分配器５を構成する他の１１個の可変ゲインの出力信号とが１１個の加算器６ａ−２〜６ａ−１２においてそれぞれ加算される。加算器６ａ−１〜６ａ−１２の各出力信号は、２個の方向付器７ａ−１〜７ａ−１２、７ｂ−１〜７ｂ−１２に入力される。方向付器７ａ−１〜７ａ−１２、７ｂ−１〜７ｂ−１２は、ＦＩＲフィルタで構成されており、３０°間隔の各水平方向角度に対応する頭部音響伝達関数がフィルタ係数として与えられている。方向付器７ａ−１〜７ａ−１２、７ｂ−１〜７ｂ−１２のうち左耳の頭部音響伝達関数に相当する方向付器の出力信号は、加算器６ｂにおいて加算される。右耳の頭部音響伝達関数に相当する方向付器の出力信号は、加算器６ｃにおいて加算される。加算器６ｂおよび６ｃの出力信号は、クロストークキャンセラ８にそれぞれ入力される。クロストークキャンセラ８では、公知の手法を用いて、音響再生空間におけるクロストークをキャンセルする信号処理が行われる。クロストークキャンセラ８からの各出力信号は、アンプ９で増幅され、図示しないスピーカに供給される。 The output signal from the variable gain 4a-1 among the output signals from the short distance distributor 4 and the output signal from the variable gain 5a-1 among the outputs from the long distance distributor 5 are added by the adder 6a-1. The Similarly, 11 other variable gain output signals constituting the short distance distributor 4 and 11 other variable gain output signals constituting the long distance distributor 5 are 11 adders. Addition is performed in 6a-2 to 6a-12. The output signals of the adders 6a-1 to 6a-12 are input to the two directional devices 7a-1 to 7a-12 and 7b-1 to 7b-12. Directional devices 7a-1 to 7a-12 and 7b-1 to 7b-12 are composed of FIR filters, and a head acoustic transfer function corresponding to each horizontal direction angle of 30 ° is given as a filter coefficient. ing. Out of the directional devices 7a-1 to 7a-12 and 7b-1 to 7b-12, the output signal of the directional device corresponding to the head acoustic transfer function of the left ear is added in the adder 6b. The output signal of the directional device corresponding to the head transfer function of the right ear is added in the adder 6c. The output signals of the adders 6b and 6c are input to the crosstalk canceller 8, respectively. In the crosstalk canceller 8, signal processing for canceling crosstalk in the sound reproduction space is performed using a known method. Each output signal from the crosstalk canceller 8 is amplified by an amplifier 9 and supplied to a speaker (not shown).

制御器１１は、距離の情報ＳＤに基づいて、近距離用分配器４および遠距離用分配器５で信号処理すべき信号の割合を決定し、決定した割合に応じて可変ゲイン３ａ、３ｂにゲインを与える。このとき、距離が遠いほど遠距離用分配器５で処理する信号の割合が増えるようなゲインが与えられる。近距離用分配器４では、例えば水平方向角度が６０°の場合、６０°の方向付器のみにゲインが与えられる。また、遠距離用分配器５では、０°の方向付器と１２０°の方向付器に０．１、３０°の方向付器と９０°の方向付器に０．４の割合の信号が供給されるように、ゲインが与えられる。このように、ユーザーの設定した距離に応じて複数の位置（水平方向角度が０°、３０°、９０°、１２０°となる位置）に音像を定位させる。これにより、音像に遠距離感をもさせている。 Based on the distance information SD, the controller 11 determines the ratio of signals to be processed by the short-distance distributor 4 and the long-distance distributor 5, and sets the variable gains 3a and 3b according to the determined ratio. Give gain. At this time, a gain is given such that the longer the distance, the greater the proportion of signals processed by the long-distance distributor 5. In the short distance distributor 4, for example, when the horizontal angle is 60 °, the gain is given only to the 60 ° directional device. Further, in the long-distance distributor 5, a signal with a ratio of 0.4 is output to the 0 ° directional device and the 120 ° directional device, 0.1, and the 30 ° directional device and the 90 ° directional device have a ratio of 0.4. Gain is provided to be supplied. Thus, the sound image is localized at a plurality of positions (positions where the horizontal direction angle is 0 °, 30 °, 90 °, and 120 °) according to the distance set by the user. As a result, the sound image has a long distance.

ここで、マルチチャンネルコンテンツの多様化に伴い、サラウンドチャンネルの音像の大きさや明瞭度に対するユーザーの好みも多様化している。例えば、コンテンツがスポーツである場合、サラウンドチャンネルには主に歓声が収録されているので、明瞭度が小さく、ぼやけた大きな音像が好まれる。逆に、コンテンツが映画である場合、オブジェクトの移動感を忠実に再現するため、明瞭度が高く、小さな音像が好まれる。 Here, with the diversification of multi-channel content, user preferences for the size and clarity of the sound image of the surround channel are also diversifying. For example, when the content is a sport, the surround channel mainly includes cheers, so a large sound image with low clarity and a small blur is preferred. On the contrary, when the content is a movie, in order to faithfully reproduce the sense of movement of the object, high clarity and a small sound image are preferred.

このようなユーザーの好みに応じて音像の大きさや明瞭度を調整するには、上述した従来の技術において音像の距離感を調整すればよい。上記従来の技術において、音像に近距離感をもたせるように制御すれば、音像が小さくなり、明瞭度が高くなる。逆に、音像に遠距離感をもたせるように制御すれば、音像が大きくなり、明瞭度が低くなる。このように、従来の技術を用いて音像の距離感を調整することによって、音像の大きさや明瞭度を調整することができる。
特開平６−１３３３９９号公報 In order to adjust the size and intelligibility of a sound image according to such user preferences, it is only necessary to adjust the sense of distance of the sound image in the conventional technology described above. In the above conventional technique, if the sound image is controlled so as to have a sense of short distance, the sound image becomes small and the clarity becomes high. Conversely, if the sound image is controlled to give a sense of distance, the sound image becomes larger and the clarity becomes lower. In this way, the size and clarity of the sound image can be adjusted by adjusting the sense of distance of the sound image using conventional techniques.
JP-A-6-133399

ここで、音像の大きさや明瞭度に対するユーザーの好みが多様化するほど、音像の大きさや明瞭度を調整する音像定位制御装置において、制御処理負担の軽減、回路の小規模化が望まれる。 Here, as the user's preference for the size and intelligibility of the sound image is diversified, in the sound image localization control device that adjusts the size and intelligibility of the sound image, it is desired to reduce the control processing load and reduce the circuit scale.

しかしながら、複数の遅延信号を用いた従来の制御技術では、音像の大きさや明瞭度を調整するにあたり、遅延信号の種類や、遅延信号を加算するタイミング、遅延信号の振幅比などを制御する必要がある。また、音像を大きくして明瞭度を下げるほど、より多種類の遅延信号を制御する必要がある。つまり、音像を大きくして明瞭度を下げるほど、制御処理での演算量が増加する。これらの原因により、複数の遅延信号を用いた従来の制御技術では、制御処理負担の軽減を実現することが実質的に困難であった。また、複数の遅延信号を用いた従来の制御技術では、メモリ等の記憶装置を設ける必要がある。このため、回路の小規模化が困難であった。 However, in the conventional control technique using a plurality of delay signals, it is necessary to control the type of the delay signal, the timing of adding the delay signal, the amplitude ratio of the delay signal, and the like when adjusting the size and clarity of the sound image. is there. In addition, as the sound image is enlarged and the intelligibility is lowered, it is necessary to control more types of delayed signals. That is, the larger the sound image and the lower the clarity, the greater the amount of computation in the control process. For these reasons, it has been substantially difficult to reduce the control processing load with the conventional control technique using a plurality of delay signals. In the conventional control technique using a plurality of delay signals, it is necessary to provide a storage device such as a memory. For this reason, it has been difficult to reduce the scale of the circuit.

また、複数の位置に音像を定位させる従来の制御技術では、音像の大きさや明瞭度を調整するにあたり、複数の位置に音像を定位させる必要がある。このため、当然ながら、１つの位置に音像を定位させる場合よりも制御処理に対して多くの演算量を必要とする。また、音像を大きくして明瞭度を下げるほど、より多くの位置に音像を定位させる必要がある。つまり、音像を大きくして明瞭度を下げるほど、制御処理での演算量が増加する。これらの原因により、複数の位置に音像を定位させる従来の制御技術では、制御処理負担の軽減を実現することが実質的に困難であった。また、複数の位置に音像を定位させる従来の制御技術では、１つの位置に音像を定位させる場合よりも装置の構成は複雑になる。このため、回路の小規模化が困難であった。 Further, in the conventional control technique for localizing a sound image at a plurality of positions, it is necessary to localize the sound image at a plurality of positions when adjusting the size and clarity of the sound image. Therefore, as a matter of course, a larger amount of calculation is required for the control process than when a sound image is localized at one position. In addition, the larger the sound image is, the lower the intelligibility is, and it is necessary to localize the sound image at more positions. That is, the larger the sound image and the lower the clarity, the greater the amount of computation in the control process. For these reasons, it has been substantially difficult to realize a reduction in control processing load with a conventional control technique that localizes sound images at a plurality of positions. In addition, in the conventional control technique for localizing a sound image at a plurality of positions, the configuration of the apparatus is more complicated than when a sound image is localized at one position. For this reason, it has been difficult to reduce the scale of the circuit.

それ故、本発明は、制御処理負担の軽減、および回路の小規模化を実現しつつ、音像の大きさや明瞭度を調整することが可能な音像定位制御装置を提供することを目的とする。 Therefore, an object of the present invention is to provide a sound image localization control device capable of adjusting the size and the intelligibility of a sound image while realizing a reduction in control processing load and a reduction in circuit scale.

本発明に係る音像定位制御装置は、上記課題を解決するものであり、本発明に係る音像定位制御装置は、複数のスピーカから音を出力して各スピーカとは異なる位置に音像を定位させる音像定位制御装置であって、複数のスピーカそれぞれに対応して設けられ、予め設定された１つ以上の処理パラメータに従って、音像が所定の位置に定位するように入力音響信号を処理し、対応するスピーカへ出力する複数の音像定位手段と、各音像定位手段に予め設定された処理パラメータの少なくとも１つを、ユーザーの所望する音像の大きさおよび／または音像の明瞭度に応じた処理パラメータに変更する処理パラメータ変更手段とを備える。 The sound image localization control device according to the present invention solves the above-mentioned problem, and the sound image localization control device according to the present invention outputs a sound from a plurality of speakers and localizes the sound image at a position different from each speaker. A localization control device that is provided corresponding to each of a plurality of speakers, processes an input acoustic signal so that a sound image is localized at a predetermined position in accordance with one or more preset processing parameters, and a corresponding speaker A plurality of sound image localization means to be output to and at least one of processing parameters preset in each of the sound image localization means is changed to a processing parameter in accordance with the size of the sound image desired by the user and / or the clarity of the sound image Processing parameter changing means.

なお、上記音像定位手段は、例えば、後述する実施形態におけるＦＩＲフィルタ１６ａ、ＦＩＲフィルタ１６ｂ、ＦＩＲフィルタ１６ａおよび振幅位相周波数特性調整部２４ａの組み合わせ、ＦＩＲフィルタ１６ｂおよび振幅位相周波数特性調整部２４ｂの組み合わせ、ＦＩＲフィルタ１６ａおよび混合部２６ａの組み合わせ、または、ＦＩＲフィルタ１６ｂおよび混合部２６ｂの組み合わせに相当するものである。 The sound image localization means is, for example, a combination of FIR filter 16a, FIR filter 16b, FIR filter 16a and amplitude phase frequency characteristic adjustment unit 24a, and a combination of FIR filter 16b and amplitude phase frequency characteristic adjustment unit 24b in the embodiments described later. This corresponds to a combination of the FIR filter 16a and the mixing unit 26a, or a combination of the FIR filter 16b and the mixing unit 26b.

上記本発明に係る音像定位制御装置によれば、処理パラメータ自体を直接変更することにより音像定位制御に誤差が生じ、音像の大きさや明瞭度を調整することができる。このため、本発明に係る音像定位制御装置によれば、音像の大きさや明瞭度を調整するにあたり、従来のように複数の遅延信号を制御する必要もなく、複数の位置に音像を定位させる必要もない。したがって、本発明に係る音像定位制御装置によれば、従来の制御技術に比べ、制御処理負担の軽減、および回路の小規模化をより容易に実現することができる。その結果、制御処理負担の軽減、および回路の小規模化を実現しつつ、音像の大きさや明瞭度を調整することが可能な音像定位制御装置を提供することができる。 According to the sound image localization control apparatus according to the present invention, an error occurs in the sound image localization control by directly changing the processing parameter itself, and the size and clarity of the sound image can be adjusted. For this reason, according to the sound image localization control device according to the present invention, it is not necessary to control a plurality of delay signals as in the prior art, and it is necessary to localize a sound image at a plurality of positions when adjusting the size and clarity of the sound image. Nor. Therefore, according to the sound image localization control device according to the present invention, it is possible to more easily realize a reduction in control processing load and a circuit scale as compared with the conventional control technique. As a result, it is possible to provide a sound image localization control device capable of adjusting the size and the intelligibility of a sound image while reducing the control processing burden and reducing the circuit scale.

好ましくは、音像定位手段の各々は、フィルタ係数が処理パラメータとして予め設定されており、当該フィルタ係数に従って音像が所定の位置に定位するように入力音響信号を処理し、対応するスピーカへ出力するＦＩＲフィルタを有し、処理パラメータ変更手段は、各ＦＩＲフィルタに予め設定されたフィルタ係数のフィルタ長を、ユーザーの所望する音像の大きさおよび／または音像の明瞭度に応じたフィルタ長にそれぞれ変更するとよい。この場合において、さらに、処理パラメータ変更手段は、ユーザーの所望する音像が大きいほど、および／またはユーザーの所望する音像の明瞭度が小さいほど、フィルタ係数のフィルタ長を短くするとよい。 Preferably, each of the sound image localization means has a filter coefficient set in advance as a processing parameter, processes the input acoustic signal so that the sound image is localized at a predetermined position in accordance with the filter coefficient, and outputs it to a corresponding speaker. The processing parameter changing means has a filter, and the processing parameter changing means changes the filter length of the filter coefficient preset in each FIR filter to a filter length according to the size of the sound image desired by the user and / or the clarity of the sound image. Good. In this case, the processing parameter changing means may further shorten the filter length of the filter coefficient as the sound image desired by the user is larger and / or as the clarity of the sound image desired by the user is smaller.

好ましくは、音像定位手段の各々は、フィルタ係数が処理パラメータとして予め設定されており、当該フィルタ係数に従って、音像が所定の位置に定位するように入力音響信号を処理するＦＩＲフィルタと、所定の調整量が処理パラメータとして予め設定されており、当該所定の調整量に従って、ＦＩＲフィルタの出力信号の振幅周波数特性および位相周波数特性のうち少なくとも一方を調整し、対応するスピーカへ出力する振幅位相周波数特性調整手段とを有し、処理パラメータ変更手段は、各振幅位相周波数特性調整手段に予め設定された所定の調整量を、ユーザーの所望する音像の大きさおよび／または音像の明瞭度に応じた調整量にそれぞれ変更するとよい。 Preferably, each of the sound image localization means has a filter coefficient set in advance as a processing parameter, and an FIR filter that processes an input acoustic signal so that the sound image is localized at a predetermined position according to the filter coefficient, and a predetermined adjustment The amount is preset as a processing parameter, and at least one of the amplitude frequency characteristic and the phase frequency characteristic of the output signal of the FIR filter is adjusted according to the predetermined adjustment amount, and the amplitude phase frequency characteristic adjustment is output to the corresponding speaker And the processing parameter changing means sets a predetermined adjustment amount preset in each amplitude phase frequency characteristic adjusting means to an adjustment amount according to the size of the sound image desired by the user and / or the clarity of the sound image. It is good to change each.

好ましくは、音像定位手段の各々は、フィルタ係数が処理パラメータとして予め設定されており、当該フィルタ係数に従って、音像が所定の位置に定位するように入力音響信号を処理するＦＩＲフィルタと、所定の混合量が処理パラメータとして予め設定されており、当該所定の混合量に従って、ＦＩＲフィルタの出力信号と、他の音像定位手段が有するＦＩＲフィルタの出力信号とを混合し、対応するスピーカへ出力する混合手段とを有し、処理パラメータ変更手段は、各混合手段に予め設定された所定の混合量を、ユーザーの所望する音像の大きさおよび／または音像の明瞭度に応じた混合量にそれぞれ変更するとよい。この場合において、さらに、混合手段の各々は、所定の周波数成分が処理パラメータとして予め設定されており、他の音像定位手段が有するＦＩＲフィルタの出力信号のうち、所定の周波数成分を有する信号のみを抽出する所定周波数成分抽出手段と、所定のゲインが処理パラメータとして予め設定されており、所定周波数成分抽出手段の抽出信号を所定のゲインで増幅する増幅手段と、増幅手段の出力信号と、自身の音像定位手段が有するＦＩＲフィルタの出力信号とを加算し、対応するスピーカへ出力する加算手段とを有するとよい。 Preferably, each of the sound image localization means has a filter coefficient set in advance as a processing parameter, and an FIR filter that processes an input acoustic signal so that the sound image is localized at a predetermined position according to the filter coefficient, and a predetermined mixing A mixing unit that presets the amount as a processing parameter, mixes the output signal of the FIR filter and the output signal of the FIR filter included in another sound image localization unit in accordance with the predetermined mixing amount, and outputs the mixed signal to the corresponding speaker. The processing parameter changing means may change the predetermined mixing amount preset in each mixing means to a mixing amount according to the size of the sound image desired by the user and / or the clarity of the sound image. . In this case, each of the mixing means further has a predetermined frequency component set in advance as a processing parameter, and among the output signals of the FIR filters of other sound image localization means, only the signal having the predetermined frequency component is used. Predetermined frequency component extracting means for extracting, a predetermined gain is set in advance as a processing parameter, an amplifying means for amplifying an extraction signal of the predetermined frequency component extracting means with a predetermined gain, an output signal of the amplifying means, It is preferable to have addition means for adding the output signal of the FIR filter included in the sound image localization means and outputting to the corresponding speaker.

好ましくは、音像定位手段の各々は、フィルタ係数が処理パラメータとして予め設定されており、当該フィルタ係数に従って、音像が所定の位置に定位するように入力音響信号を処理するＦＩＲフィルタと、所定の混合量が処理パラメータとして予め設定されており、当該所定の混合量に従って、ＦＩＲフィルタの出力信号と入力音響信号とを混合し、対応するスピーカへ出力する混合手段とを有し、処理パラメータ変更手段は、各混合手段に予め設定された所定の混合量を、ユーザーの所望する音像の大きさおよび／または音像の明瞭度に応じた混合量にそれぞれ変更するとよい。 Preferably, each of the sound image localization means has a filter coefficient set in advance as a processing parameter, and an FIR filter that processes an input acoustic signal so that the sound image is localized at a predetermined position according to the filter coefficient, and a predetermined mixing An amount is preset as a processing parameter, and according to the predetermined mixing amount, the output signal of the FIR filter and the input acoustic signal are mixed and output to a corresponding speaker. The predetermined mixing amount set in advance in each mixing unit may be changed to a mixing amount according to the size of the sound image desired by the user and / or the clarity of the sound image.

好ましくは、音像定位制御装置は、入力音響信号のうち、所定のカットオフ周波数以上の周波数を有する信号のみを通過させるハイパスフィルタと、入力音響信号のうち、所定のカットオフ周波数以下の周波数を有する信号のみを通過させるローパスフィルタとをさらに備え、音像定位手段の各々は、フィルタ係数が処理パラメータとして予め設定されており、当該フィルタ係数に従って、ハイパスフィルタの通過信号を処理するＦＩＲフィルタと、所定の混合量が処理パラメータとして予め設定されており、当該所定の混合量に従って、ＦＩＲフィルタの出力信号とローパスフィルタの通過信号とを混合し、対応するスピーカへ出力する混合手段とを有し、処理パラメータ変更手段は、各混合手段に予め設定された所定の混合量を、ユーザーの所望する音像の大きさおよび／または音像の明瞭度に応じた混合量にそれぞれ変更するとよい。この場合において、さらに、各ハイパスフィルタおよび各ローパスフィルタの所定のカットオフ周波数を、ユーザーの所望する音像の大きさおよび／または音像の明瞭度に応じたカットオフ周波数に変更するカットオフ周波数変更手段をさらに備えるとよい。 Preferably, the sound image localization control device has a high-pass filter that passes only a signal having a frequency equal to or higher than a predetermined cutoff frequency among the input acoustic signals, and a frequency equal to or lower than the predetermined cutoff frequency among the input acoustic signals. A low-pass filter that passes only the signal, and each of the sound image localization means has a filter coefficient set in advance as a processing parameter, and an FIR filter that processes a pass signal of the high-pass filter according to the filter coefficient; A mixing amount is set in advance as a processing parameter, and according to the predetermined mixing amount, the output signal of the FIR filter and the passing signal of the low-pass filter are mixed and output to a corresponding speaker. The changing means is configured to change a predetermined mixing amount preset in each mixing means to a user The desired mixing amount corresponding to the magnitude and / or the sound image of the clarity of the sound image may be changed respectively. In this case, the cutoff frequency changing means further changes the predetermined cutoff frequency of each high-pass filter and each low-pass filter to a cutoff frequency corresponding to the size and / or clarity of the sound image desired by the user. May be further provided.

好ましくは、入力音響信号の振幅周波数特性を所定の補正特性に従って補正し、各音像定位手段へそれぞれ出力する振幅周波数特性補正手段と、所定の補正特性を、処理パラメータ変更手段が変更した処理パラメータに応じた補正特性に変更する補正特性変更手段とを備えるとよい。この場合において、さらに、補正特性変更手段は、ユーザーのいずれか一方の耳近傍において所定の位置から入力音響信号が音として出力されたと仮定したときに当該音が有する振幅周波数特性と、一方の耳近傍において各スピーカから出力された音が有する振幅周波数特性との差を小さくするように、所定の補正特性を変更するとよい。 Preferably, the amplitude frequency characteristic of the input acoustic signal is corrected according to a predetermined correction characteristic, and the amplitude frequency characteristic correction unit that outputs the sound signal to each sound image localization unit, and the predetermined correction characteristic is changed to the processing parameter changed by the processing parameter changing unit. It is preferable to provide correction characteristic changing means for changing to a corresponding correction characteristic. In this case, the correction characteristic changing means further includes the amplitude frequency characteristic of the sound and the one ear when the input acoustic signal is assumed to be output as a sound from a predetermined position near one of the user's ears. The predetermined correction characteristic may be changed so as to reduce the difference from the amplitude frequency characteristic of the sound output from each speaker in the vicinity.

好ましくは、音像の大きさおよび／または音像の明瞭度に関する情報と、ユーザーの一部を表したオブジェクト画像との対応を示す対応情報を、当該オブジェクト画像の種類に応じて記憶する記憶手段と、ユーザーを撮影し、撮影した画像データを出力する撮影手段と、記憶手段に記憶される各オブジェクト画像のうち、撮影手段から出力された画像データに含まれるオブジェクト画像を検出するオブジェクト画像検出手段と、記憶手段に記憶された対応情報を参照して、オブジェクト画像検出手段において検出されたオブジェクト画像に対応する音像の大きさおよび／または音像の明瞭度に関する情報を特定し、特定した音像の大きさおよび／または音像の明瞭度に関する情報を処理パラメータ変更手段へ出力する出力手段とをさらに備えるとよい。この場合において、撮像手段において所定のタイミングで撮影された画像データから、記憶部に記憶されたオブジェクト画像とは異なるオブジェクト画像を検出し、記憶部に記憶されたオブジェクト画像を、検出したオブジェクト画像に更新するオブジェクト画像更新手段をさらに備えるとよい。 Preferably, storage means for storing correspondence information indicating the correspondence between the size of the sound image and / or the clarity of the sound image and the object image representing a part of the user according to the type of the object image, A photographing means for photographing a user and outputting photographed image data; an object image detecting means for detecting an object image included in the image data output from the photographing means among the object images stored in the storage means; With reference to the correspondence information stored in the storage means, information on the size of the sound image and / or the clarity of the sound image corresponding to the object image detected by the object image detection means is specified, and the size of the specified sound image and And / or output means for outputting information on the clarity of the sound image to the processing parameter changing means. When may. In this case, an object image different from the object image stored in the storage unit is detected from image data captured at a predetermined timing by the imaging unit, and the object image stored in the storage unit is converted into the detected object image. An object image update means for updating may be further provided.

好ましくは、ユーザーの動作を検出する加速度センサーと、音像の大きさおよび／または音像の明瞭度に関する情報と、ユーザーの動作の状態に関する情報との対応を示す対応情報を、ユーザーの動作の種類に応じて記憶する記憶手段と、記憶手段に記憶された対応情報を参照して、加速度センサーにおいて検出されたユーザーの動作に関する情報に対応する音像の大きさおよび／または音像の明瞭度に関する情報を特定し、特定した音像の大きさおよび／または音像の明瞭度に関する情報を、処理パラメータ変更手段へ出力する出力手段とをさらに備えるとよい。 Preferably, correspondence information indicating correspondence between an acceleration sensor for detecting a user's movement, information on the size and / or clarity of the sound image, and information on the state of the user's movement is used as the type of the user's movement. Referring to the storage means to be stored in response and the correspondence information stored in the storage means, information relating to the size of the sound image and / or the clarity of the sound image corresponding to the information related to the user's action detected by the acceleration sensor is specified. The information processing apparatus may further include output means for outputting information relating to the specified sound image size and / or clarity of the sound image to the processing parameter changing means.

好ましくは、ユーザーの音声を検出するマイクと、音像の大きさおよび／または音像の明瞭度に関する情報と、ユーザーの音声に関する情報との対応を示す対応情報を、ユーザーの音声の種類に応じて記憶する記憶手段と、記憶手段に記憶された対応情報を参照して、加速度センサーにおいて検出されたユーザーの音声に関する情報に対応する音像の大きさおよび／または音像の明瞭度に関する情報を、処理パラメータ変更手段へ出力する出力手段とを有するとよい。 Preferably, correspondence information indicating correspondence between a microphone for detecting the user's voice, information on the size and / or clarity of the sound image, and information on the user's voice is stored according to the type of the user's voice. And processing parameter change by referring to the correspondence information stored in the storage means and information relating to the size of the sound image and / or the clarity of the sound image corresponding to the information relating to the user's voice detected by the acceleration sensor Output means for outputting to the means.

また、本発明は、上記課題を解決する音像定位制御方法にも向けられており、本発明に係る音像定位制御方法は、複数のスピーカから音を出力して各スピーカとは異なる位置に音像を定位させる音像定位制御方法であって、複数のスピーカそれぞれについて別々に、予め設定された１つ以上の処理パラメータに従って、音像が所定の位置に定位するように入力音響信号を処理する音像定位ステップと、音像定位ステップにおいて用いられた処理パラメータの少なくとも１つを、ユーザーの所望する音像の大きさおよび／または音像の明瞭度に応じた処理パラメータに変更する処理パラメータ変更ステップとを有する。 The present invention is also directed to a sound image localization control method that solves the above-described problems. The sound image localization control method according to the present invention outputs sound from a plurality of speakers and places the sound image at a position different from each speaker. A sound image localization control method for localization, wherein each of a plurality of speakers separately processes a sound image localization step for processing an input sound signal so that a sound image is localized at a predetermined position in accordance with one or more preset processing parameters. And a processing parameter changing step for changing at least one of the processing parameters used in the sound image localization step into a processing parameter according to the size of the sound image desired by the user and / or the clarity of the sound image.

また、本発明は、上記課題を解決するプログラムにも向けられており、本発明に係るプログラムは、複数のスピーカから音を出力することによって、各スピーカとは異なる位置に音像を定位させるコンピュータに実行させるためのプログラムであって、複数のスピーカそれぞれについて別々に、予め設定された１つ以上の処理パラメータに従って、音像が所定の位置に定位するように入力音響信号を処理する音像定位ステップと、音像定位ステップにおいて用いられた処理パラメータの少なくとも１つを、ユーザーの所望する音像の大きさおよび／または音像の明瞭度に応じた処理パラメータに変更する処理パラメータ変更ステップとを、コンピュータに実行させる。 The present invention is also directed to a program that solves the above problems. The program according to the present invention outputs a sound from a plurality of speakers to localize a sound image at a position different from each speaker. A sound image localization step for processing the input acoustic signal so that the sound image is localized at a predetermined position according to one or more preset processing parameters separately for each of the plurality of speakers, A computer executes a processing parameter changing step of changing at least one of the processing parameters used in the sound image localization step into a processing parameter corresponding to the size of the sound image desired by the user and / or the clarity of the sound image.

また、本発明は、上記課題を解決する集積回路にも向けられており、本発明に係る集積回路は、複数のスピーカから音を出力して各スピーカとは異なる位置に音像を定位させる集積回路であって、複数のスピーカそれぞれに対応して設けられ、予め設定された１つ以上の処理パラメータに従って、音像が所定の位置に定位するように入力音響信号を処理し、対応するスピーカへ出力する複数の音像定位手段と、各音像定位手段に予め設定された処理パラメータの少なくとも１つを、ユーザーの所望する音像の大きさおよび／または音像の明瞭度に応じた処理パラメータに変更する処理パラメータ変更手段とを備える。 The present invention is also directed to an integrated circuit that solves the above-described problems. The integrated circuit according to the present invention outputs sound from a plurality of speakers and localizes a sound image at a position different from each speaker. The input acoustic signal is processed so that the sound image is localized at a predetermined position according to one or more processing parameters set in advance corresponding to each of the plurality of speakers, and is output to the corresponding speaker. Processing parameter change for changing a plurality of sound image localization means and at least one processing parameter preset in each sound image localization means to a processing parameter according to the size and / or clarity of the sound image desired by the user Means.

本発明によれば、制御処理負担の軽減、および回路の小規模化を実現しつつ、音像の大きさや明瞭度を調整することが可能な音像定位制御装置を提供することができる。 According to the present invention, it is possible to provide a sound image localization control device capable of adjusting the size and clarity of a sound image while reducing the control processing burden and reducing the circuit scale.

以下、本発明の実施の形態について、図面を用いて説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

（第１の実施形態）
図１を用いて、本発明の第１の実施形態に係る音像定位制御装置の構成について説明する。図１は、第１の実施形態に係る音像定位制御装置の構成を示す図である。図１において、１２ａはユーザーであり、１３は音像を定位させるべき所定位置である。本実施形態では、所定位置１３は、一例としてユーザー１２ａの正面を０°したときの右後方１２０°の位置であるとする。１４ａは所定位置１３で再生されるべき音響信号である。図１において、音像定位制御装置は、振幅周波数特性補正部１５、ＦＩＲフィルタ１６ａおよび１６ｂ、スピーカ１７ａおよび１７ｂ、入力部１８、補正特性変更部１９、および第１の処理パラメータ変更部２０を備える。 (First embodiment)
The configuration of the sound image localization control device according to the first embodiment of the present invention will be described with reference to FIG. FIG. 1 is a diagram illustrating a configuration of a sound image localization control device according to the first embodiment. In FIG. 1, 12a is a user, and 13 is a predetermined position where the sound image should be localized. In the present embodiment, the predetermined position 13 is, for example, a position 120 ° to the right rear when the front of the user 12a is 0 °. Reference numeral 14 a denotes an acoustic signal to be reproduced at the predetermined position 13. In FIG. 1, the sound image localization control device includes an amplitude frequency characteristic correction unit 15, FIR filters 16 a and 16 b, speakers 17 a and 17 b, an input unit 18, a correction characteristic change unit 19, and a first processing parameter change unit 20.

振幅周波数特性補正部１５は、音響信号１４ａを入力とし、所定の補正特性に従って音響信号１４ａの振幅周波数特性を補正する。補正特性については、後述にて詳細に説明する。振幅周波数特性補正部１５の出力信号は、ＦＩＲフィルタ１６ａおよび１６ｂそれぞれへ出力される。 The amplitude frequency characteristic correction unit 15 receives the acoustic signal 14a and corrects the amplitude frequency characteristic of the acoustic signal 14a according to a predetermined correction characteristic. The correction characteristics will be described in detail later. The output signal of the amplitude frequency characteristic correction unit 15 is output to each of the FIR filters 16a and 16b.

ＦＩＲフィルタ１６ａおよび１６ｂそれぞれには、フィルタ係数が処理パラメータとして予め設定されている。なお、フィルタ係数のフィルタ長は可変である。ＦＩＲフィルタ１６ａおよび１６ｂは、予め設定されたフィルタ係数に従って、音像が所定位置１３に定位するように、振幅周波数特性補正部１５からの出力信号を処理する。ＦＩＲフィルタ１６ａの出力信号は、音を再生する手段であるスピーカ１７ａに入力される。ＦＩＲフィルタ１６ｂの出力信号は、音を再生する手段であるスピーカ１７ｂに入力される。 In each of the FIR filters 16a and 16b, filter coefficients are set in advance as processing parameters. Note that the filter length of the filter coefficient is variable. The FIR filters 16a and 16b process the output signal from the amplitude frequency characteristic correction unit 15 so that the sound image is localized at the predetermined position 13 in accordance with a preset filter coefficient. The output signal of the FIR filter 16a is input to the speaker 17a which is a means for reproducing sound. The output signal of the FIR filter 16b is input to the speaker 17b which is a means for reproducing sound.

入力部１８は、ユーザー１２ａの所望する音像の大きさ、および／または音像の明瞭度を入力するための入力手段である。音像の大きさと明瞭度との間には、音像が大きい程、明瞭度は低くなるという関係がある。したがって、入力部１８の構成は、音像の大きさと音像の明瞭度とをそれぞれ別々に入力できる構成であってもよいし、いずれか一方のみ入力できる構成であってもよい。以下の説明では、一例として、入力部１８が音像の明瞭度のみ入力可能な構成を有するとする。 The input unit 18 is input means for inputting the size of the sound image desired by the user 12a and / or the clarity of the sound image. There is a relationship between the size of the sound image and the intelligibility that the greater the sound image, the lower the intelligibility. Accordingly, the configuration of the input unit 18 may be a configuration in which the size of the sound image and the clarity of the sound image can be input separately, or a configuration in which only one of them can be input. In the following description, as an example, it is assumed that the input unit 18 has a configuration capable of inputting only the clarity of a sound image.

補正特性変更部１９は、振幅周波数特性補正部１５における所定の補正特性を、後述する第１の処理パラメータ変更部２０が変更したフィルタ長に応じた補正特性に変更する。 The correction characteristic changing unit 19 changes the predetermined correction characteristic in the amplitude frequency characteristic correcting unit 15 to a correction characteristic corresponding to the filter length changed by the first processing parameter changing unit 20 described later.

第１の処理パラメータ変更部２０は、ＦＩＲフィルタ１６ａおよび１６ｂに設定された処理パラメータであるフィルタ係数を、入力部１８で入力された音像の明瞭度に応じたフィルタ係数に変更する。具体的には、第１の処理パラメータ変更部２０は、フィルタ長を変更することによって、フィルタ係数を変更する。 The first processing parameter changing unit 20 changes the filter coefficient, which is a processing parameter set in the FIR filters 16a and 16b, to a filter coefficient corresponding to the clarity of the sound image input by the input unit 18. Specifically, the first processing parameter changing unit 20 changes the filter coefficient by changing the filter length.

次に、図２を用いて、所定位置１３に音像を定位させるためのＦＩＲフィルタ１６ａおよび１６ｂのフィルタ係数の設計について説明する。図２は、所定位置１３に音像を定位させるための伝達関数を測定する構成を示す図である。図２において、図１に示した構成要素と同じ構成要素については、同じ符号を付して説明を省略する。 Next, the design of the filter coefficients of the FIR filters 16a and 16b for localizing the sound image at the predetermined position 13 will be described with reference to FIG. FIG. 2 is a diagram showing a configuration for measuring a transfer function for localizing a sound image at a predetermined position 13. In FIG. 2, the same components as those shown in FIG.

図２において、１２ｂは、図１に示したユーザー１２ａと同じ位置に設置したダミーヘッドである。１４ｂは、測定信号である。測定信号１４ｂとして望ましくは、Ｍ系列信号のように可聴帯域において周波数一定のエネルギーをもつ信号を用いるのがよい。測定信号１４ｂは、ＦＩＲフィルタ１６ａおよび１６ｂで所定の伝達関数を有するように調整された後、スピーカ１７ａおよび１７ｂから音としてそれぞれ出力される。２１ａは、ダミーヘッド１２ｂの左耳に取り付けられた測定マイクである。２１ｂは、ダミーヘッド１２ｂの右耳に取り付けられた測定マイクである。２２は、伝達関数算出器である。 In FIG. 2, 12b is a dummy head installed at the same position as the user 12a shown in FIG. 14b is a measurement signal. The measurement signal 14b is preferably a signal having an energy with a constant frequency in the audible band, such as an M-sequence signal. The measurement signal 14b is adjusted to have a predetermined transfer function by the FIR filters 16a and 16b, and then output as sound from the speakers 17a and 17b, respectively. 21a is a measurement microphone attached to the left ear of the dummy head 12b. 21b is a measurement microphone attached to the right ear of the dummy head 12b. Reference numeral 22 denotes a transfer function calculator.

伝達関数算出器２２は、測定信号１４ｂと測定マイク２１ａおよび２１ｂの出力信号とを用いて、ＦＩＲフィルタ１６ａおよび１６ｂにおいて調整されるべき所定の伝達関数を算出する。算出方法については、公知の手法を用いるとし、ここでは詳細な説明を省略する。一般に、ＦＩＲフィルタ１６ａおよび１６ｂにおいて調整されるべき所定の伝達関数Ｇ１およびＧ２は、式（１）によって表されることが知られている。なお、式（１）において、所定位置１３からダミーヘッド１２ｂの左耳、右耳への音響伝達関数をＨ１、Ｈ２とし、スピーカ１７ａからダミーヘッド１２ｂの左耳、右耳への音響伝達関数をＣ１、Ｃ２とし、スピーカ１７ｂからダミーヘッド１２ｂの左耳、右耳への音響伝達関数をＣ３、Ｃ４とする。

The transfer function calculator 22 calculates a predetermined transfer function to be adjusted in the FIR filters 16a and 16b using the measurement signal 14b and the output signals of the

measurement microphones

21a and 21b. As the calculation method, a known method is used, and detailed description thereof is omitted here. In general, it is known that the predetermined transfer functions G1 and G2 to be adjusted in the FIR filters 16a and 16b are expressed by Expression (1). In Equation (1), the acoustic transfer functions from the predetermined position 13 to the left and right ears of the dummy head 12b are H1 and H2, and the acoustic transfer functions from the speaker 17a to the left and right ears of the dummy head 12b are Let C1 and C2 be the acoustic transfer functions from the speaker 17b to the left and right ears of the dummy head 12b.

ＦＩＲフィルタ１６ａに予め設定されるべきフィルタ係数は、例えば、伝達関数算出器２２で算出された所定の伝達関数Ｇ１を逆フーリエ変換して得られるインパルス応答から、算出される。ＦＩＲフィルタ１６ｂに予め設定されるべきフィルタ係数は、例えば、伝達関数算出器２２で算出された所定の伝達関数Ｇ２を逆フーリエ変換して得られるインパルス応答から、算出される。なお、ＦＩＲフィルタ１６ａおよび１６ｂのフィルタ係数は、その他の公知の適応フィルタ設計手法を用いて算出されてもよい。 The filter coefficient to be set in advance in the FIR filter 16a is calculated from, for example, an impulse response obtained by performing inverse Fourier transform on the predetermined transfer function G1 calculated by the transfer function calculator 22. The filter coefficient to be set in advance in the FIR filter 16b is calculated from, for example, an impulse response obtained by inverse Fourier transform of the predetermined transfer function G2 calculated by the transfer function calculator 22. The filter coefficients of the FIR filters 16a and 16b may be calculated using other known adaptive filter design methods.

図３は、所定の伝達関数Ｇ１およびＧ２を用いて算出された１０２４タップのフィルタ係数と、当該フィルタ係数を用いて音像定位制御を行った結果と、伝達関数両耳差の制御誤差とを示す図である。図３（ａ）は、ＦＩＲフィルタ１６ｂに設定される、所定の伝達関数Ｇ２を用いて算出された１０２４タップのフィルタ係数を示す図である。図３（ａ）に示すフィルタ係数は１０２４タップのフィルタ長をもち、係数値はほぼ収束している。図３（ｂ）は、１０２４タップのフィルタ係数をＦＩＲフィルタ１６ａおよび１６ｂに与えて音像定位制御を行い、ダミーヘッド１２ｂの左耳での伝達関数の測定結果Ｓ１を振幅周波数特性として示した図である。図３（ｃ）は、１０２４タップのフィルタ係数をＦＩＲフィルタ１６ａおよび１６ｂに与えて音像定位制御を行い、ダミーヘッド１２ｂの右耳での伝達関数の測定結果Ｓ２を振幅周波数特性として示した図である。 FIG. 3 shows a filter coefficient of 1024 taps calculated using predetermined transfer functions G1 and G2, a result of sound image localization control using the filter coefficient, and a control error of the transfer function binaural difference. FIG. FIG. 3A is a diagram illustrating a filter coefficient of 1024 taps calculated using a predetermined transfer function G2 set in the FIR filter 16b. The filter coefficient shown in FIG. 3A has a filter length of 1024 taps, and the coefficient values are almost converged. FIG. 3B is a diagram showing the result of measurement of the transfer function S1 at the left ear of the dummy head 12b as amplitude frequency characteristics by applying a filter coefficient of 1024 taps to the FIR filters 16a and 16b to perform sound image localization control. is there. FIG. 3C is a diagram showing the result of measuring the transfer function S2 at the right ear of the dummy head 12b as an amplitude frequency characteristic by applying a filter coefficient of 1024 taps to the FIR filters 16a and 16b to perform sound image localization control. is there.

ここで、人間は、伝達関数両耳差を手がかりとして、音像の定位方向を認知することが一般的に知られている。図３（ｂ）および図３（ｃ）において、実現すべき伝達関数両耳差は｜Ｈ１−Ｈ２｜であり、音像定位制御後の実際の伝達関数両耳差は｜Ｓ１−Ｓ２｜である。そこで、音像定位制御の誤差（制御誤差）として、伝達関数両耳差｜Ｓ１−Ｓ２｜と実現すべき伝達関数両耳差｜Ｈ１−Ｈ２｜との差を算出し、算出した結果を図３（ｄ）に示した。図３（ｄ）では、｜Ｈ１−Ｈ２｜と｜Ｓ１−Ｓ２｜の差が、１ｋＨｚ以下においてほぼ０ｄＢ（１０ｄＢ以上の制御誤差はない）を示しており（つまり、｜Ｓ１−Ｓ２｜と｜Ｈ１−Ｈ２｜とほぼ一致しており）、制御誤差が殆ど無いことが確認できる。これにより、明瞭度が高く、小さい音像が所定位置１３に定位する。なお、係数値が大きいタップほど、｜Ｓ１−Ｓ２｜と｜Ｈ１−Ｈ２｜とを一致させることに対して寄与する割合が高い。つまり、図３（ａ）でいえば、４００タップ付近が、｜Ｓ１−Ｓ２｜と｜Ｈ１−Ｈ２｜とを一致させることに対して最も大きく寄与する部分となる。 Here, it is generally known that a human recognizes a localization direction of a sound image using a transfer function binaural difference as a clue. 3 (b) and 3 (c), the transfer function binaural difference to be realized is | H1-H2 |, and the actual transfer function binaural difference after the sound image localization control is | S1-S2 |. . Therefore, the difference between the transfer function binaural difference | S1-S2 | and the transfer function binaural difference | H1-H2 | to be realized is calculated as an error (control error) of the sound image localization control, and the calculated result is shown in FIG. Shown in (d). In FIG. 3D, the difference between | H1−H2 | and | S1−S2 | shows almost 0 dB (no control error of 10 dB or more) at 1 kHz or less (that is, | S1−S2 | and | H1−H2 |) and almost no control error. Thereby, a small sound image with high clarity is localized at the predetermined position 13. A tap having a larger coefficient value has a higher contribution ratio to matching | S1-S2 | and | H1-H2 |. That is, in FIG. 3A, the vicinity of 400 taps is the portion that makes the largest contribution to matching | S1-S2 | and | H1-H2 |.

次に、図４〜図５を用いて、ＦＩＲフィルタ１６ａおよび１６ｂのフィルタ長と音像定位制御の誤差との関係について説明する。図４は、２５６タップのフィルタ係数と、当該フィルタ係数を用いて音像定位制御を行った結果と、伝達関数両耳差の制御誤差とを示す図である。図４（ａ）は、２５６タップのフィルタ係数を示す図である。図４（ａ）に示すフィルタ係数は、図３（ａ）に示したフィルタ係数のうち、係数値が大きい部分が含まれる３０１タップ目から５５６タップ目までの２５６タップ分のフィルタ係数に相当する。図４（ａ）に示すフィルタ係数の中心タップは、図３（ａ）に示したフィルタ係数において最も係数値が高い４００タップ付近となっている。図４（ｂ）は、２５６タップのフィルタ係数をＦＩＲフィルタ１６ａおよび１６ｂに与えて音像定位制御を行い、ダミーヘッド１２ｂの左耳での伝達関数の測定結果Ｓ１を振幅周波数特性として示した図である。図４（ｃ）は、２５６タップのフィルタ係数をＦＩＲフィルタ１６ａおよび１６ｂに与えて音像定位制御を行い、ダミーヘッド１２ｂの右耳での伝達関数の測定結果Ｓ２を振幅周波数特性として示した図である。図４（ｄ）は、図３（ｄ）と同様に制御誤差を算出した結果を示す図である。図４（ｄ）では、全周波数帯域において制御誤差が増大しており、１ｋＨｚ以下の周波数帯域においても１０ｄＢ以上の制御誤差が生じる周波数が存在する。 Next, the relationship between the filter lengths of the FIR filters 16a and 16b and the sound image localization control error will be described with reference to FIGS. FIG. 4 is a diagram illustrating a 256-tap filter coefficient, a result of performing sound image localization control using the filter coefficient, and a transfer function binaural difference control error. FIG. 4A is a diagram illustrating a 256-tap filter coefficient. The filter coefficients shown in FIG. 4A correspond to the filter coefficients for 256 taps from the 301st tap to the 556th tap including a portion having a large coefficient value among the filter coefficients shown in FIG. 3A. . The center tap of the filter coefficient shown in FIG. 4A is around 400 taps having the highest coefficient value in the filter coefficient shown in FIG. FIG. 4B is a diagram showing the result of measurement of the transfer function S1 at the left ear of the dummy head 12b as amplitude frequency characteristics by applying 256 tap filter coefficients to the FIR filters 16a and 16b to perform sound image localization control. is there. FIG. 4C is a diagram showing the result of measuring the transfer function S2 at the right ear of the dummy head 12b as an amplitude frequency characteristic by applying a 256-tap filter coefficient to the FIR filters 16a and 16b to perform sound image localization control. is there. FIG. 4D is a diagram showing the result of calculating the control error as in FIG. In FIG. 4D, the control error increases in the entire frequency band, and there is a frequency at which a control error of 10 dB or more occurs in the frequency band of 1 kHz or less.

図５は、６４タップのフィルタ係数と、当該フィルタ係数を用いて音像定位制御を行った結果と、伝達関数両耳差の制御誤差とを示す図である。図５（ａ）は、６４タップのフィルタ係数を示す図である。図５（ａ）に示すフィルタ係数は、図３（ａ）に示したフィルタ係数のうち、係数値が大きい部分が含まれる３７６タップ目から４３９タップ目までの６４タップ分のフィルタ係数に相当する。図５（ａ）に示すフィルタ係数の中心タップは、図３（ａ）に示したフィルタ係数において最も係数値が高い４００タップ付近となっている。図５（ｂ）は、６４タップのフィルタ係数をＦＩＲフィルタ１６ａおよび１６ｂに与えて音像定位制御を行い、ダミーヘッド１２ｂの左耳での伝達関数の測定結果Ｓ１を振幅周波数特性として示した図である。図５（ｃ）は、６４タップのフィルタ係数をＦＩＲフィルタ１６ａおよび１６ｂに与えて音像定位制御を行い、ダミーヘッド１２ｂの右耳での伝達関数の測定結果Ｓ２を振幅周波数特性として示した図である。図５（ｄ）は、図３（ｄ）と同様に制御誤差を算出した結果を示す図である。図５（ｄ）では、制御誤差を生じる周波数帯域が図４（ｄ）よりも広がっている。 FIG. 5 is a diagram illustrating a 64-tap filter coefficient, a result of performing sound image localization control using the filter coefficient, and a transfer function binaural difference control error. FIG. 5A shows a 64-tap filter coefficient. The filter coefficients shown in FIG. 5A correspond to the filter coefficients for 64 taps from the 376th tap to the 439th tap including a portion having a large coefficient value among the filter coefficients shown in FIG. 3A. . The center tap of the filter coefficient shown in FIG. 5A is around 400 taps having the highest coefficient value in the filter coefficient shown in FIG. FIG. 5B is a diagram showing the result of measurement of the transfer function S1 at the left ear of the dummy head 12b as an amplitude frequency characteristic by applying a 64-tap filter coefficient to the FIR filters 16a and 16b to perform sound image localization control. is there. FIG. 5C is a diagram showing the result of measuring the transfer function S2 at the right ear of the dummy head 12b as the amplitude frequency characteristic by applying a 64-tap filter coefficient to the FIR filters 16a and 16b to perform sound image localization control. is there. FIG. 5D is a diagram showing the result of calculating the control error as in FIG. In FIG. 5 (d), the frequency band causing the control error is wider than in FIG. 4 (d).

このように、ＦＩＲフィルタを用いた音像定位制御では、フィルタ長が短くなるほど、音像定位制御の誤差として伝達関数両耳差｜Ｓ１−Ｓ２｜と｜Ｈ１−Ｈ２｜との差が大きくなる。 As described above, in the sound image localization control using the FIR filter, the difference between the transfer function binaural difference | S1−S2 | and | H1−H2 | increases as the error in the sound image localization control as the filter length becomes shorter.

次に、図６を用いて、伝達関数両耳差｜Ｓ１−Ｓ２｜と｜Ｈ１−Ｈ２｜との差と、ユーザー１２ａが音響信号１４ａを受聴した際に感じる音像の大きさとの関係について説明する。図６は、伝達関数両耳差｜Ｓ１−Ｓ２｜と｜Ｈ１−Ｈ２｜との差と、ユーザー１２ａが音響信号１４ａを受聴した際に感じる音像の大きさとの関係を説明するための図である。図６において、２３ａはフィルタ長が１０２４タップの場合の音像を示し、２３ｂはフィルタ長が２５６タップの場合の音像を示し、２３ｃはフィルタ長が６４タップの場合の音像を示している。 Next, the relationship between the difference between the transfer function binaural differences | S1-S2 | and | H1-H2 | and the size of the sound image that the user 12a feels when listening to the acoustic signal 14a will be described with reference to FIG. To do. FIG. 6 is a diagram for explaining the relationship between the difference between the transfer function binaural differences | S1-S2 | and | H1-H2 | and the size of the sound image that the user 12a feels when listening to the acoustic signal 14a. is there. In FIG. 6, 23a shows a sound image when the filter length is 1024 taps, 23b shows a sound image when the filter length is 256 taps, and 23c shows a sound image when the filter length is 64 taps.

音像２３ａは、所定位置１３とほぼ一致した位置に定位し、その大きさは比較的小さく、明瞭度も高い。このような音像２３ａを用いて、サラウンドチャンネルの音響信号にオブジェクトが移動するような信号を含むコンテンツ、あるいはサラウンドチャンネルの音響信号に明瞭度が高い信号を含むコンテンツを再生した場合、ユーザー１２ａはコンテンツの制作者が意図した音場を体感することができる。 The sound image 23a is localized at a position substantially coincident with the predetermined position 13, and the size thereof is relatively small and the clarity is high. When such a sound image 23a is used to reproduce content including a signal that moves an object to an audio signal of a surround channel, or content including a signal with high clarity in an audio signal of a surround channel, the user 12a You can experience the sound field intended by the producer.

一方、音像２３ｂは、所定位置１３とは異なる位置（所定位置１３からスピーカ１７ｂ側に移動した位置、あるいは所定位置１３からユーザー１２ａの頭内側に移動した位置）に定位する。これは、伝達関数両耳差｜Ｓ１−Ｓ２｜と｜Ｈ１−Ｈ２｜との差が大きくなったためである。これにより、音像２３ｂの大きさは音像２３ａに比べて大きくなり、音像２３ｂの明瞭度も音像２３ａに比べて低くなる。このように音像２３ｂは、音像２３ａに比べてぼんやりとした音像となる。さらに音像２３ｃは、伝達関数両耳差｜Ｓ１−Ｓ２｜と｜Ｈ１−Ｈ２｜との差が全周波数帯域に渡って生じているため、ユーザー１２ａの右方向に定位する。そして、音像２３ｃの大きさは、ユーザー１２ａの右側の全領域に相当し、明瞭度も顕著に低くなる。この場合、ユーザー１２ａは、音像２３ｃがどこに定位しているのか判別が難しい状況となる。このような音像２３ｂおよび音像２３ｃを用いて、サラウンドチャンネルの音響信号に歓声のような明瞭度が低い信号を含むコンテンツを再生した場合、ユーザー１２ａは周囲を音で包まれたような音場を体感することができる。また、音像が頭内側に定位する方が迫力を感じるユーザー１２ａにとっては、音像２３ｂおよび音像２３ｃを用いてコンテンツを再生した場合の方が、より好ましい音場を体感することができる。以上に説明したように、コンテンツの種類やユーザーの好みに応じてフィルタ長を適切に変更することによって、ユーザー１２ａにとって好ましい明瞭度、大きさをもつ音像を生成することができる。 On the other hand, the sound image 23b is localized at a position different from the predetermined position 13 (a position moved from the predetermined position 13 to the speaker 17b side or a position moved from the predetermined position 13 to the inside of the head of the user 12a). This is because the difference between the transfer function binaural difference | S1−S2 | and | H1−H2 | Accordingly, the size of the sound image 23b is larger than that of the sound image 23a, and the clarity of the sound image 23b is also lower than that of the sound image 23a. Thus, the sound image 23b becomes a sound image that is more blurred than the sound image 23a. Furthermore, since the difference between the transfer function binaural difference | S1-S2 | and | H1-H2 | occurs over the entire frequency band, the sound image 23c is localized in the right direction of the user 12a. The size of the sound image 23c corresponds to the entire area on the right side of the user 12a, and the intelligibility is significantly reduced. In this case, it becomes difficult for the user 12a to determine where the sound image 23c is localized. When the sound image 23b and the sound image 23c are used to reproduce the content including a low clarity signal such as a cheer in the sound signal of the surround channel, the user 12a creates a sound field that is surrounded by sound. You can experience it. In addition, for the user 12a who feels more powerful when the sound image is localized inside the head, a more preferable sound field can be experienced when the content is reproduced using the sound image 23b and the sound image 23c. As described above, by appropriately changing the filter length according to the type of content and the user's preference, it is possible to generate a sound image having a clarity and size that is preferable for the user 12a.

次に、第１の処理パラメータ変更部２０がＦＩＲフィルタ１６ａおよび１６ｂに設定されたフィルタ係数を変更する処理について詳細に説明する。入力部１８には、例えば「明瞭度高」、「明瞭度中」、および「明瞭度低」のいずれか１つを選択する選択ボタンが設けられているとする。入力部１８は、選択された明瞭度を、補正特性変更部１９と第１の処理パラメータ変更部２０へそれぞれ出力する。第１の処理パラメータ変更部２０は、「明瞭度高」が入力された場合、ＦＩＲフィルタ１６ａおよび１６ｂのフィルタ長を１０２４タップに変更する。「明瞭度中」が入力された場合、ＦＩＲフィルタ１６ａおよび１６ｂのフィルタ長を２５６タップに変更する。「明瞭度低」が入力された場合、ＦＩＲフィルタ１６ａおよび１６ｂのフィルタ長を６４タップに変更する。なお、入力部１８には、３つのモード（「明瞭度高」、「明瞭度中」、および「明瞭度低」）のいずれか１つを選択するようにしていたが、これに限定されない。入力部１８には、ユーザー１２ａに明瞭度を連続的に入力させるためのスライダーなどが設けられてもよい。 Next, processing in which the first processing parameter changing unit 20 changes the filter coefficients set in the FIR filters 16a and 16b will be described in detail. For example, the input unit 18 is provided with a selection button for selecting one of “high clarity”, “medium clarity”, and “low clarity”. The input unit 18 outputs the selected clarity to the correction characteristic changing unit 19 and the first processing parameter changing unit 20, respectively. The first processing parameter changing unit 20 changes the filter lengths of the FIR filters 16a and 16b to 1024 taps when “high clarity” is input. When “medium clarity” is input, the filter lengths of the FIR filters 16a and 16b are changed to 256 taps. When “low clarity” is input, the filter lengths of the FIR filters 16a and 16b are changed to 64 taps. The input unit 18 selects one of the three modes (“high clarity”, “medium clarity”, and “low clarity”), but is not limited to this. The input unit 18 may be provided with a slider or the like for allowing the user 12a to continuously input clarity.

次に、図７および図８を用いて、振幅周波数特性補正部１５における補正特性について詳細に説明する。図７は、図３（ｃ）、図４（ｃ）、図５（ｃ）で示したＨ２とＳ２との差（Ｈ２−Ｓ２）を各フィルタ長毎に算出した結果を示す図である。図８は、振幅周波数特性補正部１５における補正特性を示した図である。図７により、実現すべき音響伝達関数Ｈ２と測定結果Ｓ２とでは、とりわけ低い周波数帯域において振幅レベルに差が生じていることがわかる。つまり、測定結果Ｓ２の振幅レベルは、とりわけ低い周波数帯域において、音響伝達関数Ｈ２よりも低くなっている。これにより、音量感が不足する、あるいは音質的に違和感が生じる。図７に示したＨ２とＳ２との差をなくすためには、振幅周波数特性補正部１５において図８に示すような補正特性が設定されればよい。図８において、補正特性は、フィルタ長毎に異なっている。これは、図７に示したＨ２とＳ２との差もフィルタ長毎に異なっているためである。したがって、補正特性変更部１９は、入力部１８において「明瞭度高」が入力された場合、振幅周波数特性補正部１５の補正特性を、１０２４タップのフィルタ長に相当する補正特性に変更する。また、「明瞭度中」が入力された場合、２５６タップのフィルタ長に相当する補正特性に変更する。「明瞭度低」が入力された場合、６４タップのフィルタ長に相当する補正特性に変更する。このように、補正特性変更部１９は、振幅周波数特性補正部１５における所定の補正特性を、第１の処理パラメータ変更部２０が変更したフィルタ長に応じた補正特性に変更する。なお、補正特性は、音響伝達関数Ｈ２と測定結果Ｓ２との差に基づく特性であったが、音響伝達関数Ｈ１と測定結果Ｓ１との差に基づく特性であってもよい。 Next, the correction characteristic in the amplitude frequency characteristic correction part 15 is demonstrated in detail using FIG. 7 and FIG. FIG. 7 is a diagram illustrating a result of calculating a difference (H2−S2) between H2 and S2 illustrated in FIGS. 3C, 4C, and 5C for each filter length. FIG. 8 is a diagram showing correction characteristics in the amplitude frequency characteristic correction unit 15. FIG. 7 shows that there is a difference in amplitude level between the acoustic transfer function H2 to be realized and the measurement result S2, particularly in a low frequency band. That is, the amplitude level of the measurement result S2 is lower than the acoustic transfer function H2 particularly in a low frequency band. Thereby, a feeling of volume is insufficient or a sense of incongruity occurs in sound quality. In order to eliminate the difference between H2 and S2 shown in FIG. 7, the amplitude frequency characteristic correction unit 15 may set the correction characteristic as shown in FIG. In FIG. 8, the correction characteristics are different for each filter length. This is because the difference between H2 and S2 shown in FIG. 7 is also different for each filter length. Therefore, when “high clarity” is input to the input unit 18, the correction characteristic changing unit 19 changes the correction characteristic of the amplitude frequency characteristic correcting unit 15 to a correction characteristic corresponding to a filter length of 1024 taps. Further, when “medium intelligibility” is input, the correction characteristic corresponding to the filter length of 256 taps is changed. When “low clarity” is input, the correction characteristic corresponding to a 64-tap filter length is changed. As described above, the correction characteristic changing unit 19 changes the predetermined correction characteristic in the amplitude frequency characteristic correcting unit 15 to a correction characteristic corresponding to the filter length changed by the first processing parameter changing unit 20. The correction characteristic is a characteristic based on a difference between the acoustic transfer function H2 and the measurement result S2, but may be a characteristic based on a difference between the acoustic transfer function H1 and the measurement result S1.

なお、振幅周波数特性補正部１５の出力信号は、ＦＩＲフィルタ１６ａおよび１６ｂの両方に出力されている。つまり、振幅周波数特性補正部１５での補正は、スピーカ１７ａから出力される音と、スピーカ１７ｂから出力される音とに適用される。このため、音像の大きさや明瞭度を変更するための、伝達関数両耳差｜Ｓ１−Ｓ２｜と｜Ｈ１−Ｈ２｜との差は、振幅周波数特性補正部１５での補正に影響しない。したがって、補正特性変更部１９および振幅周波数特性補正部１５の処理動作により、音像の大きさや明瞭度を変更可能にしつつ、音量感や音質を劣化をフィルタ長に応じて抑制することができる。 In addition, the output signal of the amplitude frequency characteristic correction | amendment part 15 is output to both FIR filters 16a and 16b. That is, the correction by the amplitude frequency characteristic correction unit 15 is applied to the sound output from the speaker 17a and the sound output from the speaker 17b. Therefore, the difference between the transfer function binaural differences | S1−S2 | and | H1−H2 | for changing the size and intelligibility of the sound image does not affect the correction in the amplitude frequency characteristic correction unit 15. Therefore, by the processing operations of the correction characteristic changing unit 19 and the amplitude frequency characteristic correcting unit 15, it is possible to change the size and clarity of the sound image, and to suppress deterioration in volume feeling and sound quality according to the filter length.

以上のように、本実施形態では、ＦＩＲフィルタ１６ａおよび１６ｂのフィルタ係数自体を直接変更して音像定位制御に誤差を生じさせることにより、音像の大きさや明瞭度を調整している。このため、本実施形態によれば、従来のように複数の遅延信号を制御する必要もなく、複数の位置に音像を定位させる必要もない。したがって、本実施形態によれば、従来の制御技術に比べ、制御処理負担の軽減、および回路の小規模化を容易に実現することができるといえる。 As described above, in the present embodiment, the size and clarity of the sound image are adjusted by directly changing the filter coefficients of the FIR filters 16a and 16b to cause an error in the sound image localization control. For this reason, according to this embodiment, it is not necessary to control a plurality of delay signals as in the prior art, and it is not necessary to localize a sound image at a plurality of positions. Therefore, according to the present embodiment, it can be said that the control processing load can be reduced and the circuit scale can be easily reduced as compared with the conventional control technique.

なお、図１に示した構成では、第１の処理パラメータ変更部２０は、ＦＩＲフィルタ１６ａおよび１６ｂに設定されたフィルタ係数を変更するようにしていたが、いずれか一方のフィルタ係数のみ変更するようにしてもよい。この場合であっても、ＦＩＲフィルタ１６ａおよび１６ｂに設定されたフィルタ係数を変更した場合と同様の効果を得ることができる。 In the configuration shown in FIG. 1, the first processing parameter changing unit 20 changes the filter coefficient set in the FIR filters 16a and 16b, but changes only one of the filter coefficients. It may be. Even in this case, the same effect as that obtained when the filter coefficients set in the FIR filters 16a and 16b are changed can be obtained.

なお、図４（ａ）および図５（ａ）に示したフィルタ係数の中心タップは、図３（ａ）に示したフィルタ係数において最も係数値が高い４００タップ付近としたが、これに限定されない。図３（ａ）に示したフィルタ係数のフィルタ長を短くするにあたり、係数値が低い部分を取り出すようにしてもよい。係数値が低い部分は、｜Ｓ１−Ｓ２｜と｜Ｈ１−Ｈ２｜とを一致させることに対して寄与する割合も低い。よって、係数値が低い部分を取り出すようにしても、音像定位制御の誤差である｜Ｓ１−Ｓ２｜と｜Ｈ１−Ｈ２｜との差を生じさせることができる。 The center taps of the filter coefficients shown in FIGS. 4 (a) and 5 (a) are around 400 taps having the highest coefficient value in the filter coefficients shown in FIG. 3 (a), but are not limited thereto. . When the filter length of the filter coefficient shown in FIG. 3A is shortened, a portion having a low coefficient value may be taken out. The portion where the coefficient value is low has a low contribution rate to matching | S1-S2 | with | H1-H2 |. Therefore, even if a portion having a low coefficient value is extracted, a difference between | S1−S2 | and | H1−H2 |, which is an error in sound image localization control, can be generated.

また、図１に示した音像定位制御装置は、音響信号１４ａに対して、２つのスピーカ１７ａおよび１７ｂと、２つのＦＩＲフィルタ１６ａおよび１６ｂを備えていたが、これに限定されない。音像定位制御装置は、音響信号１４ａに対して、３つ以上のスピーカと、各スピーカと対応するように設けられた、スピーカと同数のＦＩＲフィルタとを備えていてもよい。 The sound image localization control device shown in FIG. 1 includes the two speakers 17a and 17b and the two FIR filters 16a and 16b for the acoustic signal 14a, but is not limited thereto. The sound image localization control device may include three or more speakers and the same number of FIR filters as the speakers provided to correspond to each speaker with respect to the acoustic signal 14a.

また、図１に示した音像定位制御装置は、チャンネル１つ分の音響信号１４ａに対して音像定位制御を行っていたが、これに限定されない。複数のチャンネルの音響信号に対して音像定位制御を行うようにしてもよい。この場合、スピーカについては、複数のチャンネルの音響信号に対して、２つ以上備えていればよい。また、ＦＩＲフィルタについては、チャンネル１つ分の音響信号につき、スピーカと同数のＦＩＲフィルタを備えていればよい。 The sound image localization control device shown in FIG. 1 performs sound image localization control on the acoustic signal 14a for one channel, but is not limited to this. Sound image localization control may be performed on acoustic signals of a plurality of channels. In this case, two or more speakers may be provided for the sound signals of a plurality of channels. As for the FIR filters, it is only necessary to provide the same number of FIR filters as the speakers for one channel of acoustic signals.

（第２の実施形態）
図９を用いて、本発明の第２の実施形態に係る音像定位制御装置の構成について説明する。図９は、第２の実施形態に係る音像定位制御装置の構成を示す図である。図９において、音像定位制御装置は、振幅周波数特性補正部１５、ＦＩＲフィルタ１６ａおよび１６ｂ、スピーカ１７ａおよび１７ｂ、入力部１８、補正特性変更部１９ａ、振幅位相周波数特性調整部２４ａおよび２４ｂ、および第２の処理パラメータ変更部２５を備える。 (Second Embodiment)
The configuration of the sound image localization control device according to the second embodiment of the present invention will be described with reference to FIG. FIG. 9 is a diagram illustrating a configuration of a sound image localization control device according to the second embodiment. 9, the sound image localization control device includes an amplitude frequency characteristic correction unit 15, FIR filters 16a and 16b, speakers 17a and 17b, an input unit 18, a correction characteristic change unit 19a, amplitude phase frequency characteristic adjustment units 24a and 24b, and 2 processing parameter changing unit 25.

第１の実施形態では、ＦＩＲフィルタのフィルタ長を調節することにより、音像定位制御に誤差を生じさせていた。これに対し、本実施形態では、ＦＩＲフィルタ１６ａおよび１６ｂの出力信号の振幅周波数特性および／または位相周波数特性を調整することにより、音像定位制御に誤差を生じさせる。具体的には、本実施形態に係る音像定位制御装置は、図１に示した音像定位制御装置に対して、振幅位相周波数特性調整部２４ａおよび２４ｂが追加された点と、補正特性変更部１９が補正特性変更部１９ａに入れ代わった点と、第１の処理パラメータ変更部２０が第２の処理パラメータ変更部２５に入れ代わった点で異なる。それ以外の構成要素については、図１に示した構成要素と同じであり、同じ符号を付して説明を省略する。以下、異なる点を中心に説明する。 In the first embodiment, an error is caused in the sound image localization control by adjusting the filter length of the FIR filter. In contrast, in the present embodiment, an error is caused in the sound image localization control by adjusting the amplitude frequency characteristics and / or phase frequency characteristics of the output signals of the FIR filters 16a and 16b. Specifically, the sound image localization control device according to the present embodiment is different from the sound image localization control device shown in FIG. 1 in that amplitude phase frequency characteristic adjustment units 24 a and 24 b are added, and a correction characteristic change unit 19. Is different from the point that the correction characteristic changing unit 19a is replaced with the point that the first processing parameter changing unit 20 is replaced with the second processing parameter changing unit 25. The other constituent elements are the same as those shown in FIG. 1 and are given the same reference numerals, and the description thereof is omitted. Hereinafter, different points will be mainly described.

図９において、振幅位相周波数特性調整部２４ａは、ＦＩＲフィルタ１６ａの出力信号を入力とする。振幅位相周波数特性調整部２４ａには、所定の調整量が処理パラメータとして予め設定されている。振幅位相周波数特性調整部２４ａは、所定の調整量に従って、音像定位制御に誤差を生じさせるように（つまり、音像定位位置の手がかりとなる伝達関数両耳差に誤差を生じさせるように）、入力された出力信号の振幅周波数特性および／または位相周波数特性を調整する。振幅位相周波数特性調整部２４ａの出力信号は、スピーカ１７ａへ出力される。 In FIG. 9, the amplitude phase frequency characteristic adjustment unit 24a receives the output signal of the FIR filter 16a. In the amplitude phase frequency characteristic adjusting unit 24a, a predetermined adjustment amount is set in advance as a processing parameter. The amplitude phase frequency characteristic adjustment unit 24a is configured to generate an error in the sound image localization control according to a predetermined adjustment amount (that is, to generate an error in the transfer function binaural difference that is a clue to the sound image localization position). The amplitude frequency characteristic and / or phase frequency characteristic of the output signal is adjusted. The output signal of the amplitude phase frequency characteristic adjusting unit 24a is output to the speaker 17a.

振幅位相周波数特性調整部２４ｂは、ＦＩＲフィルタ１６ｂの出力信号を入力とする。振幅位相周波数特性調整部２４ｂには、所定の調整量が処理パラメータとして予め設定されている。振幅位相周波数特性調整部２４ｂは、所定の調整量に従って、音像定位制御に誤差を生じさせるように（つまり、音像定位位置の手がかりとなる伝達関数両耳差に誤差を生じさせるように）、入力された出力信号の振幅周波数特性および／または位相周波数特性を調整する。振幅位相周波数特性調整部２４ｂの出力信号は、スピーカ１７ｂへ出力される。 The amplitude phase frequency characteristic adjusting unit 24b receives the output signal of the FIR filter 16b. In the amplitude phase frequency characteristic adjustment unit 24b, a predetermined adjustment amount is set in advance as a processing parameter. The amplitude phase frequency characteristic adjustment unit 24b is configured to generate an error in the sound image localization control according to a predetermined adjustment amount (that is, to generate an error in the transfer function binaural difference that is a clue to the sound image localization position). The amplitude frequency characteristic and / or phase frequency characteristic of the output signal is adjusted. The output signal of the amplitude phase frequency characteristic adjusting unit 24b is output to the speaker 17b.

第２の処理パラメータ変更部２５は、振幅位相周波数特性調整部２４ａおよび２４ｂに設定された処理パラメータである所定の調整量を、入力部１８で入力された音像の明瞭度に応じた調整量に変更する。例えば、音像の明瞭度を小さくするとき、第２の処理パラメータ変更部２５は、所定の調整量を、音像定位制御の誤差が大きくなるように変更する。 The second processing parameter changing unit 25 changes the predetermined adjustment amount, which is the processing parameter set in the amplitude phase frequency characteristic adjusting units 24a and 24b, to an adjustment amount according to the clarity of the sound image input by the input unit 18. change. For example, when reducing the clarity of the sound image, the second processing parameter changing unit 25 changes the predetermined adjustment amount so that the error of the sound image localization control becomes large.

ここで、図７および図８で示したように、振幅周波数特性補正部１５の補正特性は、音響伝達関数Ｈ２と測定結果Ｓ２との差に基づく特性、または、音響伝達関数Ｈ１と測定結果Ｓ１との差に基づく特性であった。また、本実施形態では、振幅位相周波数特性調整部２４ａおよび２４ｂにおいて、音像定位制御に誤差が生じる。したがって、振幅周波数特性補正部１５の補正特性は、振幅位相周波数特性調整部２４ａによって調整された結果である測定結果Ｓ１と音響伝達関数Ｈ１との差に基づく特性、または、振幅位相周波数特性調整部２４ｂによって調整された結果である測定結果Ｓ２と音響伝達関数Ｈ２との差に基づく特性となる。したがって、補正特性変更部１９ａは、入力部１８において入力された音像の明瞭度に応じて振幅位相周波数特性調整部２４ａによって調整された結果である測定結果Ｓ１と音響伝達関数Ｈ１との差に基づく特性となるように、振幅周波数特性補正部１５の補正特性を変更する。または、補正特性変更部１９ａは、入力部１８において入力された音像の明瞭度に応じて振幅位相周波数特性調整部２４ｂによって調整された結果である測定結果Ｓ２と音響伝達関数Ｈ２との差に基づく特性となるように、振幅周波数特性補正部１５の補正特性を変更する。 Here, as shown in FIGS. 7 and 8, the correction characteristic of the amplitude frequency characteristic correction unit 15 is a characteristic based on the difference between the acoustic transfer function H2 and the measurement result S2, or the acoustic transfer function H1 and the measurement result S1. It was a characteristic based on the difference. In the present embodiment, an error occurs in the sound image localization control in the amplitude phase frequency characteristic adjusting units 24a and 24b. Therefore, the correction characteristic of the amplitude frequency characteristic correction unit 15 is a characteristic based on the difference between the measurement result S1 and the acoustic transfer function H1 that is the result adjusted by the amplitude phase frequency characteristic adjustment unit 24a, or the amplitude phase frequency characteristic adjustment unit. The characteristic is based on the difference between the measurement result S2 that is the result adjusted by 24b and the acoustic transfer function H2. Therefore, the correction characteristic changing unit 19a is based on the difference between the measurement result S1 and the acoustic transfer function H1 that are the results adjusted by the amplitude phase frequency characteristic adjusting unit 24a according to the clarity of the sound image input in the input unit 18. The correction characteristic of the amplitude frequency characteristic correction unit 15 is changed so as to obtain the characteristic. Alternatively, the correction characteristic changing unit 19a is based on the difference between the measurement result S2 and the acoustic transfer function H2, which is the result adjusted by the amplitude phase frequency characteristic adjusting unit 24b according to the clarity of the sound image input by the input unit 18. The correction characteristic of the amplitude frequency characteristic correction unit 15 is changed so as to obtain the characteristic.

以上のように、本実施形態によれば、ＦＩＲフィルタ１６ａおよび１６ｂの出力信号の振幅周波数特性または位相周波数特性を調整することにより、音像定位制御に誤差を生じさせている。これにより、フィルタ長を調整していた第１の実施形態と比べて、音像定位制御の誤差をより精密に調整することができる。その結果、音像の大きさや音像の明瞭度をより精密に調整することができる。 As described above, according to the present embodiment, an error is caused in the sound image localization control by adjusting the amplitude frequency characteristics or phase frequency characteristics of the output signals of the FIR filters 16a and 16b. Thereby, the error of sound image localization control can be adjusted more precisely than in the first embodiment in which the filter length is adjusted. As a result, the size of the sound image and the clarity of the sound image can be adjusted more precisely.

なお、図９に示した構成では、第２の処理パラメータ変更部２５は、振幅位相周波数特性調整部２４ａおよび２４ｂに設定された所定の調整量を変更するようにしていたが、いずれか一方の所定の調整量のみ変更するようにしてもよい。この場合であっても、振幅位相周波数特性調整部２４ａおよび２４ｂに設定された所定の調整量を変更した場合と同様の効果を得ることができる。 In the configuration shown in FIG. 9, the second processing parameter changing unit 25 changes the predetermined adjustment amount set in the amplitude phase frequency characteristic adjusting units 24a and 24b. Only a predetermined adjustment amount may be changed. Even in this case, it is possible to obtain the same effect as when the predetermined adjustment amount set in the amplitude phase frequency characteristic adjustment units 24a and 24b is changed.

（第３の実施形態）
図１０および図１１を用いて、本発明の第３の実施形態に係る音像定位制御装置の構成について説明する。図１０は、第３の実施形態に係る音像定位制御装置の構成を示す図である。図１０において、音像定位制御装置は、振幅周波数特性補正部１５、ＦＩＲフィルタ１６ａおよび１６ｂ、スピーカ１７ａおよび１７ｂ、入力部１８、補正特性変更部１９ｂ、混合部２６ａおよび２６ｂ、第３の処理パラメータ変更部２７、および第４の処理パラメータ変更部２８を備える。図１１は、混合部２６ａおよび２６ｂの構成を示す図である。図１１において、混合部２６ａおよび２６ｂは、所定周波数成分抽出部２９、可変ゲイン３０、および加算器３１をそれぞれ備える。なお、図１１に示すａ〜ｃは入出力端子であり、図１０の混合部２６ａおよび２６ｂの入出力端子に付されたａ〜ｃと対応している。 (Third embodiment)
The configuration of the sound image localization control apparatus according to the third embodiment of the present invention will be described with reference to FIGS. 10 and 11. FIG. 10 is a diagram illustrating a configuration of a sound image localization control device according to the third embodiment. 10, the sound image localization control device includes an amplitude frequency characteristic correction unit 15, FIR filters 16a and 16b, speakers 17a and 17b, an input unit 18, a correction characteristic change unit 19b, mixing units 26a and 26b, and a third processing parameter change. Unit 27 and a fourth processing parameter changing unit 28. FIG. 11 is a diagram illustrating the configuration of the mixing units 26a and 26b. In FIG. 11, the mixing units 26a and 26b include a predetermined frequency component extraction unit 29, a variable gain 30, and an adder 31, respectively. In addition, a to c shown in FIG. 11 are input / output terminals, and correspond to a to c attached to the input / output terminals of the mixing units 26a and 26b in FIG.

第１の実施形態では、ＦＩＲフィルタのフィルタ長を調節することにより、音像定位制御に誤差を生じさせていた。これに対し、本実施形態では、混合部２６ａおよび２６ｂにおいてＦＩＲフィルタ１６ａおよび１６ｂの出力信号を互いに混合することにより、音像定位制御に誤差を生じさせる。具体的には、本実施形態に係る音像定位制御装置は、図１に示した音像定位制御装置に対して、混合部２６ａおよび２６ｂが追加された点と、補正特性変更部１９が補正特性変更部１９ｂに入れ代わった点と、第１の処理パラメータ変更部２０が第３の処理パラメータ変更部２７および第４の処理パラメータ変更部２８に入れ代わった点で異なる。それ以外の構成要素については、図１に示した構成要素と同じであり、同じ符号を付して説明を省略する。以下、異なる点を中心に説明する。 In the first embodiment, an error is caused in the sound image localization control by adjusting the filter length of the FIR filter. On the other hand, in the present embodiment, the mixing units 26a and 26b mix the output signals of the FIR filters 16a and 16b with each other, thereby causing an error in the sound image localization control. Specifically, the sound image localization control device according to this embodiment is different from the sound image localization control device shown in FIG. 1 in that mixing units 26a and 26b are added, and the correction characteristic changing unit 19 changes the correction characteristic. The difference is that the first processing parameter changing unit 20 is replaced by the third processing parameter changing unit 27 and the fourth processing parameter changing unit 28. The other constituent elements are the same as those shown in FIG. 1 and are given the same reference numerals, and the description thereof is omitted. Hereinafter, different points will be mainly described.

図１４において、所定周波数成分抽出部２９には、所定の周波数成分が処理パラメータとして予め設定されている。所定周波数成分抽出部２９は、入力端子ｂから入力される信号のうち、所定の周波数成分を有する信号のみを抽出する。所定周波数成分抽出部２９の出力信号は、増幅手段である可変ゲイン３０へ出力される。可変ゲイン３０には、所定のゲインが処理パラメータとして予め設定されている。可変ゲイン３０は、所定周波数成分抽出部２９の出力信号を所定のゲインで増幅する。可変ゲイン３０の出力信号は、加算器３１へ出力される。加算器３１は、入力端子ａから入力される信号と可変ゲイン３０の出力信号とを加算して、出力端子ｃへ出力する。 In FIG. 14, a predetermined frequency component is preset in the predetermined frequency component extraction unit 29 as a processing parameter. The predetermined frequency component extraction unit 29 extracts only a signal having a predetermined frequency component from signals input from the input terminal b. The output signal of the predetermined frequency component extraction unit 29 is output to the variable gain 30 that is an amplification means. In the variable gain 30, a predetermined gain is set in advance as a processing parameter. The variable gain 30 amplifies the output signal of the predetermined frequency component extraction unit 29 with a predetermined gain. The output signal of the variable gain 30 is output to the adder 31. The adder 31 adds the signal input from the input terminal a and the output signal of the variable gain 30 and outputs the result to the output terminal c.

図１３に示す混合部２６ａは、入力端子ａがＦＩＲフィルタ１６ａと接続され、入力端子ｂがＦＩＲフィルタ１６ｂと接続され、出力端子ｃがスピーカ１７ａと接続されている。したがって、混合部２６ａは、ＦＩＲフィルタ１６ａの出力信号と、ＦＩＲフィルタ１６ｂの出力信号のうちの所定の周波数成分を有する信号のみに所定のゲインを与えた信号とを混合し、スピーカ１７ａへ出力することとなる。換言すれば、混合部２６ａには所定の混合量が予め設定されており、混合部２６ａは、所定の混合量に従って、ＦＩＲフィルタ１６ａの出力信号と、ＦＩＲフィルタ１６ｂの出力信号とを混合することになる。混合部２６ｂは、入力端子ａがＦＩＲフィルタ１６ｂと接続され、入力端子ｂがＦＩＲフィルタ１６ａと接続され、出力端子ｃがスピーカ１７ｂと接続されている。したがって、混合部２６ｂは、ＦＩＲフィルタ１６ｂの出力信号と、ＦＩＲフィルタ１６ａの出力信号のうちの所定の周波数成分を有する信号のみに所定のゲインを与えた信号とを混合し、スピーカ１７ｂへ出力することとなる。換言すれば、混合部２６ｂには所定の混合量が予め設定されており、混合部２６ｂは、所定の混合量に従って、ＦＩＲフィルタ１６ａの出力信号と、ＦＩＲフィルタ１６ｂの出力信号とを混合することになる。この混合部２６ａおよび２６ｂの処理により、音像定位制御に誤差（つまり、音像定位位置の手がかりとなる伝達関数両耳差の誤差）を生じさせることができる。 13, the input terminal a is connected to the FIR filter 16a, the input terminal b is connected to the FIR filter 16b, and the output terminal c is connected to the speaker 17a. Therefore, the mixing unit 26a mixes the output signal of the FIR filter 16a and a signal obtained by giving a predetermined gain only to a signal having a predetermined frequency component of the output signal of the FIR filter 16b, and outputs the mixed signal to the speaker 17a. It will be. In other words, a predetermined mixing amount is preset in the mixing unit 26a, and the mixing unit 26a mixes the output signal of the FIR filter 16a and the output signal of the FIR filter 16b according to the predetermined mixing amount. become. The mixing unit 26b has an input terminal a connected to the FIR filter 16b, an input terminal b connected to the FIR filter 16a, and an output terminal c connected to the speaker 17b. Therefore, the mixing unit 26b mixes the output signal of the FIR filter 16b and a signal obtained by giving a predetermined gain only to a signal having a predetermined frequency component of the output signal of the FIR filter 16a, and outputs the mixed signal to the speaker 17b. It will be. In other words, a predetermined mixing amount is preset in the mixing unit 26b, and the mixing unit 26b mixes the output signal of the FIR filter 16a and the output signal of the FIR filter 16b in accordance with the predetermined mixing amount. become. By the processing of the mixing units 26a and 26b, an error in the sound image localization control (that is, an error of the transfer function binaural difference that is a clue to the sound image localization position) can be generated.

第３の処理パラメータ変更部２７は、所定周波数成分抽出部２９に設定された処理パラメータである所定の周波数成分を変更する。第４の処理パラメータ変更部２８は、可変ゲイン３０に設定された処理パラメータである所定のゲインを変更する。なお、第３の処理パラメータ変更部２７および第４の処理パラメータ変更部２８それぞれにおいて処理パラメータが変更されることで、所定位置１３に定位すべき音像が入力部１８で入力された音像の明瞭度を有するようになる。例えば、音像の明瞭度を小さくするとき、第３の処理パラメータ変更部２７は、より幅広い周波数成分を抽出するように所定周波数成分抽出部２９に設定された処理パラメータを変更する。また、第４の処理パラメータ変更部２８は、可変ゲイン３０に設定されたゲインをより大きなゲインに変更する。このように、第３の処理パラメータ変更部２７および第４の処理パラメータ変更部２８は、混合部２６ａおよび２６ｂにおける所定の混合量を、入力部１８で入力された音像の明瞭度に応じた混合量に変更している。 The third processing parameter changing unit 27 changes a predetermined frequency component that is a processing parameter set in the predetermined frequency component extracting unit 29. The fourth processing parameter changing unit 28 changes a predetermined gain that is a processing parameter set in the variable gain 30. In addition, the clarity of the sound image in which the sound image to be localized at the predetermined position 13 is input by the input unit 18 by changing the processing parameter in each of the third processing parameter changing unit 27 and the fourth processing parameter changing unit 28. Will have. For example, when reducing the clarity of the sound image, the third processing parameter changing unit 27 changes the processing parameter set in the predetermined frequency component extracting unit 29 so as to extract a wider frequency component. Further, the fourth processing parameter changing unit 28 changes the gain set in the variable gain 30 to a larger gain. As described above, the third processing parameter changing unit 27 and the fourth processing parameter changing unit 28 mix the predetermined mixing amount in the mixing units 26a and 26b according to the clarity of the sound image input from the input unit 18. The amount has been changed.

ここで、図７および図８で示したように、振幅周波数特性補正部１５の補正特性は、音響伝達関数Ｈ２と測定結果Ｓ２との差に基づく特性、または、音響伝達関数Ｈ１と測定結果Ｓ１との差に基づく特性であった。また、本実施形態では、混合部２６ａおよび２６ｂにおいて、音像定位制御の誤差が生じる。したがって、振幅周波数特性補正部１５の補正特性は、混合部２６ａによって混合された結果である測定結果Ｓ１と音響伝達関数Ｈ１との差に基づく特性、または、混合部２６ｂによって混合された結果である測定結果Ｓ２と音響伝達関数Ｈ２との差に基づく特性となる。したがって、補正特性変更部１９ｂは、入力部１８において入力された音像の明瞭度に応じて混合部２６ａによって混合された結果である測定結果Ｓ１と音響伝達関数Ｈ１との差に基づく特性となるように、振幅周波数特性補正部１５の補正特性を変更する。または、補正特性変更部１９ｂは、入力部１８において入力された音像の明瞭度に応じて混合部２６ｂによって混合された結果である測定結果Ｓ２と音響伝達関数Ｈ２との差に基づく特性となるように、振幅周波数特性補正部１５の補正特性を変更する。 Here, as shown in FIGS. 7 and 8, the correction characteristic of the amplitude frequency characteristic correction unit 15 is a characteristic based on the difference between the acoustic transfer function H2 and the measurement result S2, or the acoustic transfer function H1 and the measurement result S1. It was a characteristic based on the difference. In this embodiment, an error in sound image localization control occurs in the mixing units 26a and 26b. Therefore, the correction characteristic of the amplitude frequency characteristic correction unit 15 is a characteristic based on a difference between the measurement result S1 and the acoustic transfer function H1 that is a result of mixing by the mixing unit 26a, or a result of mixing by the mixing unit 26b. The characteristic is based on the difference between the measurement result S2 and the acoustic transfer function H2. Therefore, the correction characteristic changing unit 19b has a characteristic based on the difference between the measurement result S1 and the acoustic transfer function H1, which is a result of mixing by the mixing unit 26a according to the clarity of the sound image input at the input unit 18. Further, the correction characteristic of the amplitude frequency characteristic correction unit 15 is changed. Alternatively, the correction characteristic changing unit 19b has characteristics based on the difference between the measurement result S2 and the acoustic transfer function H2 that are the result of mixing by the mixing unit 26b according to the clarity of the sound image input at the input unit 18. Further, the correction characteristic of the amplitude frequency characteristic correction unit 15 is changed.

以上のように、本実施形態によれば、混合部２６ａおよび２６ｂにおいてＦＩＲフィルタ１６ａおよび１６ｂの出力信号を互いに混合することにより、音像定位制御に誤差を生じさせている。これにより、フィルタ長を調整していた第１の実施形態と比べて、音像定位制御の誤差をより精密に調整することができる。その結果、音像の大きさや音像の明瞭度をより精密に調整することができる。 As described above, according to the present embodiment, the mixing units 26a and 26b mix the output signals of the FIR filters 16a and 16b with each other, thereby causing an error in the sound image localization control. Thereby, the error of sound image localization control can be adjusted more precisely than in the first embodiment in which the filter length is adjusted. As a result, the size of the sound image and the clarity of the sound image can be adjusted more precisely.

なお、図１０に示した構成では、混合部２６ａおよび２６ｂを備える構成であったが、いずれか一方を省略した構成であってもよい。 In addition, in the structure shown in FIG. 10, although it was the structure provided with the mixing parts 26a and 26b, the structure which abbreviate | omitted any one may be sufficient.

また、図１０に示した構成では、第３の処理パラメータ変更部２７および第４の処理パラメータ変更部２８は、混合部２６ａおよび２６ｂにおける所定の混合量を、入力部１８で入力された音像の明瞭度に応じた混合量に変更するようにしていたが、いずれか一方の所定の混合量のみ変更するようにしてもよい。この場合であっても、混合部２６ａおよび２６ｂにおける所定の混合量を変更した場合と同様の効果を得ることができる。 In the configuration shown in FIG. 10, the third processing parameter changing unit 27 and the fourth processing parameter changing unit 28 set the predetermined mixing amount in the mixing units 26 a and 26 b of the sound image input from the input unit 18. Although the mixing amount is changed according to the clarity, only one of the predetermined mixing amounts may be changed. Even in this case, it is possible to obtain the same effect as when the predetermined mixing amount in the mixing units 26a and 26b is changed.

（第４の実施形態）
図１２を用いて、本発明の第４の実施形態に係る音像定位制御装置の構成について説明する。図１２は、第４の実施形態に係る音像定位制御装置の構成を示す図である。図１２において、音像定位制御装置は、振幅周波数特性補正部１５、ＦＩＲフィルタ１６ａおよび１６ｂ、スピーカ１７ａおよび１７ｂ、入力部１８、補正特性変更部１９ｃ、混合部２６ａおよび２６ｂ、第３の処理パラメータ変更部２７、第４の処理パラメータ変更部２８、ハイパスフィルタ３２ａ、ローパスフィルタ３２ｂ、およびカットオフ周波数変更部３３を備える。 (Fourth embodiment)
The configuration of a sound image localization control device according to the fourth embodiment of the present invention will be described with reference to FIG. FIG. 12 is a diagram illustrating a configuration of a sound image localization control device according to the fourth embodiment. In FIG. 12, the sound image localization control apparatus includes an amplitude frequency characteristic correction unit 15, FIR filters 16a and 16b, speakers 17a and 17b, an input unit 18, a correction characteristic change unit 19c, mixing units 26a and 26b, and a third processing parameter change. 27, a fourth processing parameter changing unit 28, a high pass filter 32a, a low pass filter 32b, and a cutoff frequency changing unit 33.

第３の実施形態では、混合部２６ａおよび２６ｂにおいてＦＩＲフィルタ１６ａおよび１６ｂの出力信号を互いに混合することにより、音像定位制御に誤差を生じさせていた。これに対し、本実施形態では、混合部２６ａにおいて、ＦＩＲフィルタ１６ａの出力信号と、ＦＩＲフィルタ１６ｂにおいて処理されていない信号とを混合する。また、混合部２６ｂにおいて、ＦＩＲフィルタ１６ｂの出力信号と、ＦＩＲフィルタ１６ａにおいて処理されていない信号とを混合する。これらにより、音像定位制御に誤差を生じさせる。具体的には、本実施形態に係る音像定位制御装置は、図１０に示した音像定位制御装置に対して、ハイパスフィルタ３２ａ、ローパスフィルタ３２ｂ、およびカットオフ周波数変更部３３が追加された点と、補正特性変更部１９が補正特性変更部１９ｃに入れ代わった点と、混合部２６ａの入力端子ｂの接続先がＦＩＲフィルタ１６ｂからローパスフィルタ３２ｂに変更された点と、混合部２６ｂの入力端子ｂの接続先がＦＩＲフィルタ１６ａからローパスフィルタ３２ｂに変更された点で異なる。それ以外の構成要素については、図１０および図１１に示した構成要素と同じであり、同じ符号を付して説明を省略する。以下、異なる点を中心に説明する。 In the third embodiment, the output signals of the FIR filters 16a and 16b are mixed with each other in the mixing units 26a and 26b, thereby causing an error in the sound image localization control. On the other hand, in the present embodiment, the mixing unit 26a mixes the output signal of the FIR filter 16a and the signal not processed in the FIR filter 16b. Further, the mixing unit 26b mixes the output signal of the FIR filter 16b and the signal not processed in the FIR filter 16a. Thus, an error is caused in the sound image localization control. Specifically, the sound image localization control device according to the present embodiment has a high-pass filter 32a, a low-pass filter 32b, and a cutoff frequency changing unit 33 added to the sound image localization control device shown in FIG. The correction characteristic changing unit 19 is replaced with the correction characteristic changing unit 19c, the connection destination of the input terminal b of the mixing unit 26a is changed from the FIR filter 16b to the low-pass filter 32b, and the input terminal of the mixing unit 26b. b is different in that the connection destination of b is changed from the FIR filter 16a to the low-pass filter 32b. The other components are the same as those shown in FIGS. 10 and 11, and the same reference numerals are given and description thereof is omitted. Hereinafter, different points will be mainly described.

ハイパスフィルタ３２ａは、振幅周波数特性補正部１５の出力信号を入力とする。ハイパスフィルタ３２ａは、振幅周波数特性補正部１５の出力信号のうち、所定のカットオフ周波数以上の信号のみを通過させる。ハイパスフィルタ３２ａの出力信号は、ＦＩＲフィルタ１６ａおよび１６ｂへそれぞれ出力される。 The high pass filter 32a receives the output signal of the amplitude frequency characteristic correction unit 15 as an input. The high-pass filter 32a passes only a signal having a predetermined cutoff frequency or higher among the output signals of the amplitude frequency characteristic correction unit 15. The output signal of the high pass filter 32a is output to the FIR filters 16a and 16b, respectively.

ローパスフィルタ３２ｂは、振幅周波数特性補正部１５の出力信号を入力とする。ローパスフィルタ３２ｂには、ハイパスフィルタ３２ａと同じ周波数となる所定のカットオフ周波数が設定されている。ローパスフィルタ３２ｂは、振幅周波数特性補正部１５の出力信号のうち、所定のカットオフ周波数以下の信号のみを通過させる。ローパスフィルタ３２ｂの出力信号は、混合部２６ａの入力端子ｂと混合部２６ａの入力端子ｂとにそれぞれ出力される。ここで、ローパスフィルタ３２ｂの出力信号は、ＦＩＲフィルタ１６ａおよび１６ｂで処理しない信号となる。 The low pass filter 32b receives the output signal of the amplitude frequency characteristic correction unit 15 as an input. The low-pass filter 32b is set with a predetermined cutoff frequency that is the same frequency as the high-pass filter 32a. The low-pass filter 32b passes only a signal having a frequency equal to or lower than a predetermined cutoff frequency among the output signals of the amplitude frequency characteristic correction unit 15. The output signal of the low-pass filter 32b is output to the input terminal b of the mixing unit 26a and the input terminal b of the mixing unit 26a. Here, the output signal of the low-pass filter 32b is a signal that is not processed by the FIR filters 16a and 16b.

カットオフ周波数変更部３３は、ハイパスフィルタ３２ａおよびローパスフィルタ３２ｂに設定された所定のカットオフ周波数を変更する。なお、第３の処理パラメータ変更部２７、第４の処理パラメータ変更部２８、およびカットオフ周波数変更部３３それぞれにおいて処理パラメータやカットオフ周波数が変更されることで、所定位置１３に定位すべき音像が入力部１８で入力された音像の明瞭度を有するようになる。例えば、音像の明瞭度を小さくするとき、第３の処理パラメータ変更部２７は、より幅広い周波数成分を抽出するように所定周波数成分抽出部２９に設定された処理パラメータを変更する。また、第４の処理パラメータ変更部２８は、可変ゲイン３０に設定されたゲインをより大きなゲインに変更する。また、カットオフ周波数変更部３３は、ハイパスフィルタ３２ａおよびローパスフィルタ３２ｂに設定されたカットオフ周波数をより高い周波数に変更する。 The cut-off frequency changing unit 33 changes a predetermined cut-off frequency set in the high pass filter 32a and the low pass filter 32b. Note that the sound image to be localized at the predetermined position 13 is obtained by changing the processing parameter and the cutoff frequency in each of the third processing parameter changing unit 27, the fourth processing parameter changing unit 28, and the cutoff frequency changing unit 33. Has the clarity of the sound image input through the input unit 18. For example, when reducing the clarity of the sound image, the third processing parameter changing unit 27 changes the processing parameter set in the predetermined frequency component extracting unit 29 so as to extract a wider frequency component. Further, the fourth processing parameter changing unit 28 changes the gain set in the variable gain 30 to a larger gain. Further, the cut-off frequency changing unit 33 changes the cut-off frequency set in the high pass filter 32a and the low pass filter 32b to a higher frequency.

図１３は、入力部１８で入力された明瞭度の高低と、カットオフ周波数変更部３３が変更するカットオフ周波数との関係を示した図である。図１３に示すように、例えば、入力部１８で入力された音像の明瞭度が「明瞭度高」である場合、カットオフ周波数変更部３３はカットオフ周波数をｆａに変更する。入力部１８で入力された音像の明瞭度が「明瞭度中」である場合、カットオフ周波数変更部３３はカットオフ周波数をｆｂに変更する。入力部１８で入力された音像の明瞭度が「明瞭度低」である場合、カットオフ周波数変更部３３はカットオフ周波数をｆｃに変更する。ここで、ローパスフィルタ３２ｂの出力信号は、ＦＩＲフィルタ１６ａおよび１６ｂで処理しない信号となる。したがって、図１３に示したように、カットオフ周波数を高くしていくほど、処理しない信号が増え、音像定位制御の誤差も大きくなる。その結果、明瞭度が低くなる。 FIG. 13 is a diagram illustrating the relationship between the level of clarity input by the input unit 18 and the cutoff frequency changed by the cutoff frequency changing unit 33. As illustrated in FIG. 13, for example, when the clarity of the sound image input from the input unit 18 is “high clarity”, the cutoff frequency changing unit 33 changes the cutoff frequency to fa. When the intelligibility of the sound image input by the input unit 18 is “medium intelligibility”, the cut-off frequency changing unit 33 changes the cut-off frequency to fb. When the clarity of the sound image input by the input unit 18 is “low clarity”, the cutoff frequency changing unit 33 changes the cutoff frequency to fc. Here, the output signal of the low-pass filter 32b is a signal that is not processed by the FIR filters 16a and 16b. Therefore, as shown in FIG. 13, the higher the cut-off frequency, the more signals that are not processed, and the greater the error in sound image localization control. As a result, the clarity is lowered.

ここで、図７および図８で示したように、振幅周波数特性補正部１５の補正特性は、音響伝達関数Ｈ２と測定結果Ｓ２との差に基づく特性、または、音響伝達関数Ｈ１と測定結果Ｓ１との差に基づく特性であった。また、本実施形態では、ハイパスフィルタ３２ａおよびローパスフィルタ３２ｂを介した混合部２６ａおよび２６ｂにおいて、音像定位制御に誤差が生じる。したがって、振幅周波数特性補正部１５の補正特性は、ハイパスフィルタ３２ａおよびローパスフィルタ３２ｂを介した混合部２６ａによって混合された結果である測定結果Ｓ１と音響伝達関数Ｈ１との差に基づく特性、または、ハイパスフィルタ３２ａおよびローパスフィルタ３２ｂを介した混合部２６ｂによって混合された結果である測定結果Ｓ２と音響伝達関数Ｈ２との差に基づく特性となる。したがって、補正特性変更部１９ｃは、入力部１８において入力された音像の明瞭度に応じて混合部２６ａによって混合された結果である測定結果Ｓ１と音響伝達関数Ｈ１との差に基づく特性となるように、振幅周波数特性補正部１５の補正特性を変更する。または、補正特性変更部１９ｃは、入力部１８において入力された音像の明瞭度に応じて混合部２６ｂによって混合された結果である測定結果Ｓ２と音響伝達関数Ｈ２との差に基づく特性となるように、振幅周波数特性補正部１５の補正特性変更する。 Here, as shown in FIGS. 7 and 8, the correction characteristic of the amplitude frequency characteristic correction unit 15 is a characteristic based on the difference between the acoustic transfer function H2 and the measurement result S2, or the acoustic transfer function H1 and the measurement result S1. It was a characteristic based on the difference. In this embodiment, an error occurs in the sound image localization control in the mixing units 26a and 26b via the high-pass filter 32a and the low-pass filter 32b. Therefore, the correction characteristic of the amplitude frequency characteristic correction unit 15 is a characteristic based on a difference between the measurement result S1 and the acoustic transfer function H1 that is a result of mixing by the mixing unit 26a via the high pass filter 32a and the low pass filter 32b, or The characteristic is based on the difference between the measurement result S2 and the acoustic transfer function H2, which is a result of mixing by the mixing unit 26b via the high-pass filter 32a and the low-pass filter 32b. Accordingly, the correction characteristic changing unit 19c has characteristics based on the difference between the measurement result S1 and the acoustic transfer function H1 that are the result of mixing by the mixing unit 26a according to the clarity of the sound image input at the input unit 18. Further, the correction characteristic of the amplitude frequency characteristic correction unit 15 is changed. Alternatively, the correction characteristic changing unit 19c has characteristics based on the difference between the measurement result S2 and the acoustic transfer function H2 that are the result of mixing by the mixing unit 26b according to the clarity of the sound image input at the input unit 18. Further, the correction characteristic of the amplitude frequency characteristic correction unit 15 is changed.

以上のように、本実施形態によれば、混合部２６ａにおいて、ＦＩＲフィルタ１６ａの出力信号と、ＦＩＲフィルタ１６ｂにおいて処理されていない信号とを混合する。また、混合部２６ｂにおいて、ＦＩＲフィルタ１６ｂの出力信号と、ＦＩＲフィルタ１６ａにおいて処理されていない信号とを混合する。これらにより、音像定位制御に誤差を生じさせている。これにより、第３の実施形態と比べて、音像定位制御を行わない信号がスピーカ１７ａおよび１７ｂから出力されるので、音質を向上させることができる。 As described above, according to the present embodiment, the mixing unit 26a mixes the output signal of the FIR filter 16a and the signal not processed in the FIR filter 16b. Further, the mixing unit 26b mixes the output signal of the FIR filter 16b and the signal not processed in the FIR filter 16a. Thus, an error is caused in the sound image localization control. Thereby, as compared with the third embodiment, since the signal not performing the sound image localization control is output from the speakers 17a and 17b, the sound quality can be improved.

なお、図１２に示した構成では、混合部２６ａおよび２６ｂを備える構成であったが、いずれか一方を省略した構成であってもよい。また、図１２に示した構成では、ハイパスフィルタ３２ａおよび３２ｂを備える構成であったが、演算量を削減するため、いずれも省略した構成であってもよい。また、図１２に示した構成のうち、ハイパスフィルタ３２ａおよびローパスフィルタ３２ｂを入れ代えた構成であってもよい。 In addition, in the structure shown in FIG. 12, although it was the structure provided with the mixing parts 26a and 26b, the structure which abbreviate | omitted any one may be sufficient. In the configuration shown in FIG. 12, the high pass filters 32a and 32b are provided. However, in order to reduce the amount of calculation, any configuration may be omitted. Moreover, the structure which replaced the high-pass filter 32a and the low-pass filter 32b among the structures shown in FIG. 12 may be sufficient.

また、図１２に示した構成では、第３の処理パラメータ変更部２７および第４の処理パラメータ変更部２８は、混合部２６ａおよび２６ｂにおける所定の混合量を、入力部１８で入力された音像の明瞭度に応じた混合量に変更するようにしていたが、いずれか一方の所定の混合量のみ変更するようにしてもよい。この場合であっても、混合部２６ａおよび２６ｂにおける所定の混合量を変更した場合と同様の効果を得ることができる。 In the configuration shown in FIG. 12, the third processing parameter changing unit 27 and the fourth processing parameter changing unit 28 set the predetermined mixing amount in the mixing units 26 a and 26 b to the sound image input from the input unit 18. Although the mixing amount is changed according to the clarity, only one of the predetermined mixing amounts may be changed. Even in this case, it is possible to obtain the same effect as when the predetermined mixing amount in the mixing units 26a and 26b is changed.

なお、図１、図９、図１０、および図１２で示した第２〜第４の処理パラメータ変更部およびカットオフ周波数変更部３３に関わる回路構成を複数組み合わせた構成にし、より精密に明瞭度を調整できるようにしてもよい。また、図１、図９、図１０、および図１２で示した構成では、明瞭度の設定に関わらず同等の音量感、音質を提供するため、振幅周波数特性補正部１５および補正特性変更部１９ａ〜１９ｃを設けていた。しかしながら、図１、図９、図１０、および図１２で示した構成において、演算量を削減するため、振幅周波数特性補正部１５および補正特性変更部１９ａ〜１９ｃが省略されてもよい。 It should be noted that a plurality of circuit configurations related to the second to fourth processing parameter changing units and the cut-off frequency changing unit 33 shown in FIGS. 1, 9, 10, and 12 are combined to provide a more precise clarity. May be adjusted. In the configurations shown in FIGS. 1, 9, 10, and 12, the amplitude frequency characteristic correction unit 15 and the correction characteristic change unit 19a are provided in order to provide the same volume feeling and sound quality regardless of the clarity setting. To 19c. However, in the configurations shown in FIGS. 1, 9, 10, and 12, the amplitude frequency characteristic correction unit 15 and the correction characteristic change units 19a to 19c may be omitted in order to reduce the amount of calculation.

（第５の実施形態）
図１４を用いて、本発明の第５の実施形態に係る音像定位制御装置の構成について説明する。図１４は、第５の実施形態に係る音像定位制御装置の構成を示す図である。図１４において、音像定位制御装置は、振幅周波数特性補正部１５、ＦＩＲフィルタ１６ａおよび１６ｂ、スピーカ１７ａおよび１７ｂ、補正特性変更部１９、第１の処理パラメータ変更部２０、記憶部４０、撮影部４１、オブジェクト画像検出部４２、および出力部４３を備える。 (Fifth embodiment)
The configuration of a sound image localization control device according to the fifth embodiment of the present invention will be described with reference to FIG. FIG. 14 is a diagram illustrating a configuration of a sound image localization control device according to the fifth embodiment. 14, the sound image localization control device includes an amplitude frequency characteristic correction unit 15, FIR filters 16a and 16b, speakers 17a and 17b, a correction characteristic change unit 19, a first processing parameter change unit 20, a storage unit 40, and a photographing unit 41. An object image detection unit 42 and an output unit 43.

第１の実施形態では、ユーザー１２ａは、音像の大きさまたは音像の明瞭度を入力部１８で入力していた。これに対し、本実施形態では、ユーザー１２ａの入力作業を伴わずに、より直感的かつより容易に音像の明瞭度を入力することを可能にする。具体的には、本実施形態に係る音像定位制御装置は、図１に示した音像定位制御装置に対して、記憶部４０、撮影部４１、およびオブジェクト画像検出部４２が追加された点と、入力部１８が出力部４３に入れ代わった点で異なる。それ以外の構成要素については、図１に示した構成要素と同じであり、同じ符号を付して説明を省略する。以下、異なる点を中心に説明する。 In the first embodiment, the user 12 a inputs the size of the sound image or the clarity of the sound image through the input unit 18. On the other hand, in the present embodiment, it is possible to input the intelligibility of the sound image more intuitively and easily without the input work of the user 12a. Specifically, the sound image localization control device according to the present embodiment has a storage unit 40, a photographing unit 41, and an object image detection unit 42 added to the sound image localization control device shown in FIG. The difference is that the input unit 18 is replaced with the output unit 43. The other constituent elements are the same as those shown in FIG. 1 and are given the same reference numerals, and the description thereof is omitted. Hereinafter, different points will be mainly described.

図１４において、記憶部４０には、音像の大きさまたは音像の明瞭度に関する情報と、ユーザー１２ａの一部（手など）の形状を表したオブジェクト画像（静止画や動画など）との対応を示す対応情報が記憶されている。また、記憶部４０には、異なる形状を示すオブジェクト画像の種類に応じて、上記対応情報が記憶されている。図１５は、記憶部４０に記憶された対応情報の一例を模式的に示した図である。図１５において、「明瞭度高」は、手を完全に握りしめたときの手の形状を表したオブジェクト画像Ｇ１と対応付けされている。「明瞭度中」は、手を半分に握りしめたときの手の形状を表したオブジェクト画像Ｇ２と対応付けされている。「明瞭度低」は、手を完全に開いたときの手の形状を表したオブジェクト画像Ｇ３と対応付けされている。 In FIG. 14, the storage unit 40 associates information regarding the size of the sound image or the clarity of the sound image with an object image (such as a still image or a moving image) that represents the shape of a part of the user 12a (such as a hand). The correspondence information shown is stored. Further, the correspondence information is stored in the storage unit 40 in accordance with the types of object images showing different shapes. FIG. 15 is a diagram schematically illustrating an example of correspondence information stored in the storage unit 40. In FIG. 15, “high clarity” is associated with the object image G1 representing the shape of the hand when the hand is completely gripped. “Medium clarity” is associated with the object image G2 representing the shape of the hand when the hand is gripped in half. “Low clarity” is associated with the object image G3 representing the shape of the hand when the hand is fully opened.

撮影部４１は、ユーザー１２ａを撮影し、画像データを出力する。撮影部４１は、常時、または所定のタイミングでユーザー１２ａを撮影する。 The photographing unit 41 photographs the user 12a and outputs image data. The photographing unit 41 photographs the user 12a at all times or at a predetermined timing.

オブジェクト画像検出部４２は、記憶部４０に記憶された各オブジェクト画像のうち、撮影部４１から出力される画像データに含まれるオブジェクト画像を検出する。具体的には、オブジェクト画像検出部４２は、撮影部４１から出力される画像データ内に、記憶部４０に記憶されたオブジェクト画像と相関の高い部分が有るか無いかを判定する。この判定処理は、記憶部４０に記憶されたオブジェクト画像毎に行われる。また、この判定処理は、例えば公知のパターンマッチング技術を用いることで実現される。そして、オブジェクト画像検出部４２は、記憶部４０に記憶された各オブジェクト画像のうち、撮影部４１から出力される画像データ内に相関の高い部分が有ると判定したオブジェクト画像を、撮影部４１から出力される画像データに含まれるオブジェクト画像として検出する。ここで、例えば、撮影部４１から出力される画像データが図１６に示すような画像データであるとする。図１６は、画像データの一例を示す図である。また、記憶部４０には、図１５に示した対応情報が記憶されているとする。この場合、オブジェクト画像検出部４２は、上記判定処理を行うことによって、画像データに含まれるオブジェクト画像として、オブジェクト画像Ｇ３を検出することになる。 The object image detection unit 42 detects an object image included in the image data output from the photographing unit 41 among the object images stored in the storage unit 40. Specifically, the object image detection unit 42 determines whether or not the image data output from the photographing unit 41 has a portion highly correlated with the object image stored in the storage unit 40. This determination process is performed for each object image stored in the storage unit 40. Moreover, this determination process is implement | achieved by using a well-known pattern matching technique, for example. Then, the object image detection unit 42 detects, from the imaging unit 41, an object image that is determined to have a highly correlated portion in the image data output from the imaging unit 41 among the object images stored in the storage unit 40. It is detected as an object image included in the output image data. Here, for example, it is assumed that the image data output from the photographing unit 41 is image data as shown in FIG. FIG. 16 is a diagram illustrating an example of image data. Further, it is assumed that the correspondence information illustrated in FIG. 15 is stored in the storage unit 40. In this case, the object image detection unit 42 detects the object image G3 as the object image included in the image data by performing the determination process.

出力部４３は、オブジェクト画像検出部４２で検出されたオブジェクト画像に対応する音像の明瞭度を、記憶部４０の対応情報を参照して特定する。出力部４３は、特定した音像の明瞭度を補正特性変更部１９および第１の処理パラメータ変更部２０それぞれへ出力する。補正特性変更部１９および第１の処理パラメータ変更部２０での処理は、第１の実施形態で説明した処理を同様であり、ここでは説明を省略する。 The output unit 43 specifies the clarity of the sound image corresponding to the object image detected by the object image detection unit 42 with reference to the correspondence information in the storage unit 40. The output unit 43 outputs the specified clarity of the sound image to the correction characteristic changing unit 19 and the first processing parameter changing unit 20, respectively. The processing in the correction characteristic changing unit 19 and the first processing parameter changing unit 20 is the same as the processing described in the first embodiment, and the description thereof is omitted here.

以上のように、本実施形態によれば、ユーザー１２ａの入力作業を伴わずに、ユーザー１２ａの所望する音像の大きさや音像の明瞭度を、自動的に補正特性変更部１９および第１の処理パラメータ変更部２０それぞれへ出力することができる。これにより、ユーザー１２ａは、より直感的かつより容易に音像の明瞭度を入力することができる。 As described above, according to the present embodiment, the correction characteristic changing unit 19 and the first processing are automatically performed for the size of the sound image desired by the user 12a and the clarity of the sound image without the input work of the user 12a. Each parameter can be output to the parameter changing unit 20. Thereby, the user 12a can input the clarity of the sound image more intuitively and more easily.

なお、上述では、ユーザー１２ａの一部の形状に応じて異なるオブジェクト画像を用いていたが、これに限定されない。オブジェクト画像は、ユーザー１２ａの一部の動作に応じて異なる画像であってもよいし、ユーザー１２ａの一部の大きさに応じて異なる画像であってもよい。 In the above description, different object images are used depending on a part of the shape of the user 12a. However, the present invention is not limited to this. The object image may be a different image according to a part of the operation of the user 12a, or may be a different image according to a part of the size of the user 12a.

また、本実施形態に係る音像定位制御装置の構成は、図１７に示すように、記憶部４０に記憶されたオブジェクト画像を所定のタイミングで更新することが可能な構成であってもよい。図１７は、記憶部４０に記憶されたオブジェクト画像を所定のタイミングで更新することが可能な音像定位制御装置の構成を示す図である。 Further, the configuration of the sound image localization control device according to the present embodiment may be a configuration capable of updating the object image stored in the storage unit 40 at a predetermined timing, as shown in FIG. FIG. 17 is a diagram illustrating a configuration of a sound image localization control device that can update an object image stored in the storage unit 40 at a predetermined timing.

図１７に示す音像定位制御装置は、図１４に示した構成に対して、オブジェクト画像更新部４４をさらに備えている。それ以外の構成要素については、図１４に示した構成要素を同じであるので、同じ符号を付して説明を省略する。図１７において、オブジェクト画像更新部４４は、所定のタイミングで撮影された画像データから、記憶部４０に記憶されたオブジェクト画像とは異なるオブジェクト画像を検出する。オブジェクト画像更新部４４は、記憶部４０に記憶されたオブジェクト画像を、検出したオブジェクト画像に更新する。これにより、音像の大きさや音像の明瞭度を、ユーザー１２ａの手以外の部分で入力することができる。その結果、図１７の構成を用いれば、図１４で示した構成に比べて、ユーザー１２ａに合わせて操作性を向上させることができる。 The sound image localization control device shown in FIG. 17 further includes an object image update unit 44 in addition to the configuration shown in FIG. Since the other components are the same as those shown in FIG. 14, the same reference numerals are given and description thereof is omitted. In FIG. 17, the object image update unit 44 detects an object image different from the object image stored in the storage unit 40 from image data captured at a predetermined timing. The object image update unit 44 updates the object image stored in the storage unit 40 to the detected object image. Thereby, the magnitude | size of a sound image and the clarity of a sound image can be input in parts other than the user's 12a hand. As a result, if the configuration of FIG. 17 is used, the operability can be improved in accordance with the user 12a as compared to the configuration shown in FIG.

なお、上述では、撮影部４１を用いた画像認識を行うことにより、音像の大きさや音像の明瞭度の入力を自動的に行っていたが、これに限定されない。例えば、加速度センサー（図示なし）を用いてユーザー１２ａの動作を直接検出し、検出した動作の状態に応じて音像の大きさや音像の明瞭度の入力を自動的に行うようにしてもよい。この場合、記憶部４０には、音像の大きさまたは音像の明瞭度に関する情報と、ユーザー１２ａの動作の状態に関する情報との対応を示す対応情報が記憶されている。また、記憶部４０には、ユーザー１２ａの動作の種類に応じて、上記対応情報が記憶されている。出力部４３は、加速度センサーで検出されたユーザー１２ａの動作に関する情報に対応する音像の大きさや音像の明瞭度を、記憶部４０の対応情報を参照して特定する。出力部４３は、特定した音像の大きさや音像の明瞭度を補正特性変更部１９および第１の処理パラメータ変更部２０それぞれへ出力する。また例えば、マイク（図示なし）を用いてユーザー１２ａの音声を検出し、検出した音声に応じて音像の大きさや音像の明瞭度の入力を自動的に行うようにしてもよい。この場合、記憶部４０には、音像の大きさまたは音像の明瞭度に関する情報と、ユーザー１２ａの音声に関する情報との対応を示す対応情報が記憶されている。また、記憶部４０には、ユーザー１２ａの音声の種類に応じて、上記対応情報が記憶されている。出力部４３は、マイクで検出されたユーザー１２ａの音声に関する情報に対応する音像の大きさや音像の明瞭度を、記憶部４０の対応情報を参照して特定する。出力部４３は、特定した音像の大きさや音像の明瞭度を補正特性変更部１９および第１の処理パラメータ変更部２０それぞれへ出力する。 In the above description, the image recognition using the photographing unit 41 is performed to automatically input the size of the sound image and the clarity of the sound image. However, the present invention is not limited to this. For example, the motion of the user 12a may be directly detected using an acceleration sensor (not shown), and the sound image size and the clarity of the sound image may be automatically input according to the detected motion state. In this case, the storage unit 40 stores correspondence information indicating correspondence between the information on the size of the sound image or the clarity of the sound image and the information on the state of the operation of the user 12a. In addition, the storage unit 40 stores the correspondence information according to the type of operation of the user 12a. The output unit 43 specifies the size of the sound image and the clarity of the sound image corresponding to the information related to the operation of the user 12a detected by the acceleration sensor with reference to the correspondence information in the storage unit 40. The output unit 43 outputs the specified size of the sound image and the clarity of the sound image to the correction characteristic changing unit 19 and the first processing parameter changing unit 20, respectively. Further, for example, the sound of the user 12a may be detected using a microphone (not shown), and the sound image size and the clarity of the sound image may be automatically input according to the detected sound. In this case, the storage unit 40 stores correspondence information indicating the correspondence between the information about the size of the sound image or the clarity of the sound image and the information about the sound of the user 12a. Further, the correspondence information is stored in the storage unit 40 according to the type of the voice of the user 12a. The output unit 43 specifies the size of the sound image corresponding to the information related to the voice of the user 12a detected by the microphone and the clarity of the sound image with reference to the correspondence information in the storage unit 40. The output unit 43 outputs the specified size of the sound image and the clarity of the sound image to the correction characteristic changing unit 19 and the first processing parameter changing unit 20, respectively.

なお、上述した第１〜第５の実施形態で説明した音像定位制御装置は、例えば音響信号を入力とし、処理した音響信号をスピーカ１７ａおよび１７ｂへ出力する一般的なコンピュータシステム等の情報処理装置で実現可能である。この場合、上述した処理をコンピュータに実行させるプログラムを所定の情報記録媒体に格納し、当該情報記録媒体に格納されたプログラムをコンピュータが読み出して実行することによって、第１〜第５の実施形態で説明した音像定位装置が実現される。また、図１４および図１７に示した記憶部４０は、例えば情報処理装置内のハードディスク内で構成される。また、上記プログラムを格納する情報記録媒体は、例えば、ＲＯＭまたはフラッシュメモリのような不揮発性半導体メモリやＣＤ−ＲＯＭ、ＤＶＤ、あるいはそれらに類する光学式ディスク状記録媒体である。また、プログラムを他の媒体や通信回線を通じて上記情報処理装置に供給してもかまわない。また、記憶部４０を例えば情報処理装置内のハードディスク内で構成されるとしたが、情報処理装置内のメモリや情報処理装置外の他の記録媒体で構成されてもよい。 The sound image localization control apparatus described in the first to fifth embodiments described above is an information processing apparatus such as a general computer system that receives, for example, an acoustic signal and outputs the processed acoustic signal to the speakers 17a and 17b. It is feasible. In this case, a program for causing a computer to execute the above-described processing is stored in a predetermined information recording medium, and the computer reads and executes the program stored in the information recording medium, whereby the first to fifth embodiments are used. The described sound image localization apparatus is realized. The storage unit 40 shown in FIGS. 14 and 17 is configured in, for example, a hard disk in the information processing apparatus. The information recording medium for storing the program is, for example, a nonvolatile semiconductor memory such as a ROM or a flash memory, a CD-ROM, a DVD, or an optical disk-like recording medium similar to them. Further, the program may be supplied to the information processing apparatus through another medium or a communication line. Further, although the storage unit 40 is configured in, for example, a hard disk in the information processing apparatus, it may be configured in a memory in the information processing apparatus or another recording medium outside the information processing apparatus.

なお、上述した第１〜第５の実施形態で説明した音像定位制御装置の各構成要素は、ＬＳＩなどの集積回路や、専用の信号処理回路を用いて１チップ化したものによって実現されてもよい。また上述した第１〜第５の実施形態で説明した音像定位制御装置は、上記各構成要素の機能に相当するものをそれぞれチップ化したものによって実現されてもよい。なお、ここでは、ＬＳＩとしたが、集積度の違いにより、ＩＣ、システムＬＳＩ、スーパーＬＳＩ、ウルトラＬＳＩと呼称されることもある。また集積回路化の手法は、ＬＳＩに限るものではなく、専用回路又は汎用プロセッサで実現してもよい。ＬＳＩ製造後に、プログラムすることが可能なＦＰＧＡ（ＦｉｅｌｄＰｒｏｇｒａｍｍａｂｌｅＧａｔｅＡｒｒａｙ）や、ＬＳＩ内部の回路セルの接続や設定を再構成可能なリコンフィギュラブル・プロセッサを利用してもよい。さらには、半導体技術の進歩又は派生する別技術によりＬＳＩに置き換わる集積回路化の技術が登場すれば、当然、その技術を用いて機能ブロックの集積化を行ってもよい。 Note that each component of the sound image localization control device described in the first to fifth embodiments described above may be realized by a single chip using an integrated circuit such as an LSI or a dedicated signal processing circuit. Good. In addition, the sound image localization control device described in the first to fifth embodiments may be realized by a chip that is equivalent to the function of each component. Note that the name used here is LSI, but it may also be called IC, system LSI, super LSI, or ultra LSI depending on the degree of integration. Further, the method of circuit integration is not limited to LSI's, and implementation using dedicated circuitry or general purpose processors is also possible. An FPGA (Field Programmable Gate Array) that can be programmed after manufacturing the LSI or a reconfigurable processor that can reconfigure the connection and setting of circuit cells inside the LSI may be used. Further, if integrated circuit technology comes out to replace LSI's as a result of the advancement of semiconductor technology or a derivative other technology, it is naturally also possible to carry out function block integration using this technology.

本発明に係る音像定位制御装置は、制御処理負担の軽減、および回路の小規模化を実現しつつ、音像の大きさや明瞭度を調整することができ、オーディオ装置等に適用される。 The sound image localization control device according to the present invention can adjust the size and clarity of a sound image while reducing the burden of control processing and reducing the circuit scale, and is applied to an audio device or the like.

第１の実施形態に係る音像定位制御装置の構成を示す図The figure which shows the structure of the sound image localization control apparatus which concerns on 1st Embodiment. 所定位置１３に音像を定位させるための伝達関数を測定する構成を示す図The figure which shows the structure which measures the transfer function for localizing a sound image to the predetermined position 13 所定の伝達関数Ｇ１およびＧ２を用いて算出された１０２４タップのフィルタ係数と、当該フィルタ係数を用いて音像定位制御を行った結果と、伝達関数両耳差の制御誤差とを示す図The figure which shows the filter coefficient of 1024 taps calculated using predetermined | prescribed transfer functions G1 and G2, the result of having performed sound image localization control using the said filter coefficient, and the control error of a transfer function binaural difference ２５６タップのフィルタ係数と、当該フィルタ係数を用いて音像定位制御を行った結果と、伝達関数両耳差の制御誤差とを示す図The figure which shows the filter coefficient of 256 taps, the result of having performed sound image localization control using the said filter coefficient, and the control error of a transfer function binaural difference ６４タップのフィルタ係数と、当該フィルタ係数を用いて音像定位制御を行った結果と、伝達関数両耳差の制御誤差とを示す図The figure which shows the filter coefficient of 64 taps, the result of having performed sound image localization control using the said filter coefficient, and the control error of a transfer function binaural difference 伝達関数両耳差｜Ｓ１−Ｓ２｜と｜Ｈ１−Ｈ２｜との差と、ユーザー１２ａが音響信号１４ａを受聴した際に感じる音像の大きさとの関係を説明するための図The figure for explaining the relation between the difference between the transfer function binaural difference | S1-S2 | and | H1-H2 | and the size of the sound image that the user 12a feels when listening to the acoustic signal 14a. 図３（ｃ）、図４（ｃ）、図５（ｃ）で示したＨ２とＳ２との差（Ｈ２−Ｓ２）を各フィルタ長毎に算出した結果を示す図The figure which shows the result of having calculated the difference (H2-S2) of H2 and S2 shown in FIG.3 (c), FIG.4 (c), and FIG.5 (c) for every filter length. 振幅周波数特性補正部１５における補正特性を示した図The figure which showed the correction characteristic in the amplitude frequency characteristic correction | amendment part 15 第２の実施形態に係る音像定位制御装置の構成を示す図The figure which shows the structure of the sound image localization control apparatus which concerns on 2nd Embodiment. 第３の実施形態に係る音像定位制御装置の構成を示す図The figure which shows the structure of the sound image localization control apparatus which concerns on 3rd Embodiment. 混合部２６ａおよび２６ｂの構成を示す図The figure which shows the structure of the mixing parts 26a and 26b 第４の実施形態に係る音像定位制御装置の構成を示す図The figure which shows the structure of the sound image localization control apparatus which concerns on 4th Embodiment. 入力部１８で入力された明瞭度の高低と、カットオフ周波数変更部３３が変更するカットオフ周波数との関係を示した図The figure which showed the relationship between the level of the clarity input by the input part 18, and the cutoff frequency which the cutoff frequency change part 33 changes 第５の実施形態に係る音像定位制御装置の構成を示す図The figure which shows the structure of the sound image localization control apparatus which concerns on 5th Embodiment. 記憶部４０に記憶された対応情報の一例を模式的に示した図The figure which showed typically an example of the correspondence information memorize | stored in the memory | storage part 40 画像データの一例を示す図Diagram showing an example of image data 記憶部４０に記憶されたオブジェクト画像を所定のタイミングで更新することが可能な音像定位制御装置の構成を示す図The figure which shows the structure of the sound image localization control apparatus which can update the object image memorize | stored in the memory | storage part 40 at predetermined timing. 複数の位置に音像を定位させる従来の音像定位制御装置の構成を示した図The figure which showed the composition of the conventional sound image localization control device which localizes a sound image in a plurality of positions 水平方向角度φと可変ゲイン４ａ−１〜４ａ−１２との関係を示す図The figure which shows the relationship between horizontal direction angle (phi) and the variable gains 4a-1 to 4a-12. 水平方向角度φと可変ゲイン５ａ−１〜５ａ−１２の関係を示す図The figure which shows the relationship between horizontal direction angle (phi) and the variable gains 5a-1 to 5a-12.

Explanation of symbols

１入力端子
２ノッチフィルタ
３ａ、３ｂ、４ａ−１〜４ａ−１２、５ａ−１〜５ａ−１２、３０可変ゲイン
４近距離用分配器
５遠距離用分配器
６ａ−１〜６ａ−１２、６ｂ、６ｃ、３１加算器
７ａ−１〜７ａ−１２、７ｂ−１〜７ｂ−１２方向付器
８クロストークキャンセラ
９アンプ
１０定位制御器
１０ａ鉛直角スライダ
１０ｂ距離スライダ
１０ｃ方向ダイアル
１１制御器
１２ａユーザー
１２ｂダミーヘッド
１３所定位置
１４ａ音響信号
１４ｂ測定信号
１５振幅周波数特性補正部
１６ａ、１６ｂＦＩＲフィルタ
１７ａ、１７ｂスピーカ
１８入力部
１９、１９ａ、１９ｂ補正特定変更部
２０第１の処理パラメータ変更部
２１ａ、２１ｂ測定マイク
２２伝達関数算出器
２３ａ、２３ｂ、２３ｃ音像
２４ａ、２４ｂ振幅位相周波数特性調整部
２５第２の処理パラメータ変更部
２６ａ、２６ｂ混合部
２７第３の処理パラメータ変更部
２８第４の処理パラメータ変更部
２９所定周波数成分抽出部
３２ａハイパスフィルタ
３２ｂローパスフィルタ
３３カットオフ周波数変更部
４０記憶部
４１撮影部
４２オブジェクト画像検出部
４３出力部
４４オブジェクト画像更新部 DESCRIPTION OF SYMBOLS 1 Input terminal 2 Notch filter 3a, 3b, 4a-1 to 4a-12, 5a-1 to 5a-12, 30 Variable gain 4 Short distance distributor 5 Long distance distributor 6a-1 to 6a-12, 6b , 6c, 31 Adder 7a-1 to 7a-12, 7b-1 to 7b-12 Directional device 8 Crosstalk canceller 9 Amplifier 10 Localization controller 10a Vertical angle slider 10b Distance slider 10c Direction dial 11 Controller 12a User 12b Dummy head 13 Predetermined position 14a Acoustic signal 14b Measurement signal 15 Amplitude frequency characteristic correction unit 16a, 16b FIR filter 17a, 17b Speaker 18 Input unit 19, 19a, 19b Correction specific change unit 20 First processing parameter change unit 21a, 21b Measurement Microphone 22 Transfer function calculator 23a, 23b, 23c Sound image 24a 24b Amplitude phase frequency characteristic adjusting unit 25 Second processing parameter changing unit 26a, 26b Mixing unit 27 Third processing parameter changing unit 28 Fourth processing parameter changing unit 29 Predetermined frequency component extracting unit 32a High pass filter 32b Low pass filter 33 Cut Off-frequency changing unit 40 Storage unit 41 Imaging unit 42 Object image detection unit 43 Output unit 44 Object image update unit

Claims

A sound image localization control device that outputs sound from a plurality of speakers and localizes a sound image at a position different from each of the speakers,
A plurality of speakers that are provided corresponding to each of the plurality of speakers, process an input acoustic signal so that the sound image is localized at a predetermined position in accordance with one or more processing parameters set in advance, and output to the corresponding speakers Sound image localization means,
Processing parameter changing means for changing at least one of the processing parameters set in advance in each of the sound image localization means to a processing parameter according to the size of the sound image desired by the user and / or the clarity of the sound image; A sound image localization control device.

Each of the sound image localization means has a filter coefficient set in advance as the processing parameter, processes an input acoustic signal so that the sound image is localized at a predetermined position in accordance with the filter coefficient, and outputs it to a corresponding speaker. Have a filter,
The processing parameter changing means changes the filter length of the filter coefficient preset in each FIR filter to a filter length according to the size of the sound image desired by the user and / or the clarity of the sound image. The sound image localization control device according to claim 1, wherein

The processing parameter changing means shortens the filter length of the filter coefficient as the sound image desired by the user is larger and / or as the clarity of the sound image desired by the user is smaller. The sound image localization control apparatus according to claim 2.

Each of the sound image localization means,
A filter coefficient is preset as the processing parameter, and an FIR filter that processes an input acoustic signal so that the sound image is localized at a predetermined position according to the filter coefficient;
A predetermined adjustment amount is preset as the processing parameter, and at least one of the amplitude frequency characteristic and the phase frequency characteristic of the output signal of the FIR filter is adjusted according to the predetermined adjustment amount, and is output to the corresponding speaker. Amplitude phase frequency characteristic adjusting means,
The processing parameter changing means adjusts the predetermined adjustment amount preset in each amplitude phase frequency characteristic adjusting means to an adjustment amount according to the size of the sound image desired by the user and / or the clarity of the sound image. The sound image localization control device according to claim 1, wherein the sound image localization control device is changed to

Each of the sound image localization means,
A filter coefficient is preset as the processing parameter, and an FIR filter that processes an input acoustic signal so that the sound image is localized at a predetermined position according to the filter coefficient;
A predetermined mixing amount is set in advance as the processing parameter, and according to the predetermined mixing amount, the output signal of the FIR filter and the output signal of the FIR filter included in the other sound image localization means are mixed to correspond. Mixing means for outputting to a speaker
The processing parameter changing unit changes the predetermined mixing amount preset in each mixing unit to a mixing amount according to the size of the sound image desired by the user and / or the clarity of the sound image. The sound image localization control apparatus according to claim 1, wherein

Each of the mixing means includes
A predetermined frequency component extracting unit configured to extract only a signal having the predetermined frequency component from the output signal of the FIR filter included in the other sound image localization unit, wherein a predetermined frequency component is preset as the processing parameter; ,
A predetermined gain is preset as the processing parameter, and an amplifying means for amplifying the extraction signal of the predetermined frequency component extracting means with the predetermined gain;
6. The sound image localization control apparatus according to claim 5, further comprising an adding unit that adds the output signal of the amplification unit and the output signal of the FIR filter included in the sound image localization unit of the amplification unit and outputs the added signal to the corresponding speaker. .

Each of the sound image localization means,
A filter coefficient is preset as the processing parameter, and an FIR filter that processes an input acoustic signal so that the sound image is localized at a predetermined position according to the filter coefficient;
A predetermined mixing amount is set in advance as the processing parameter, and according to the predetermined mixing amount, the output signal of the FIR filter and the input acoustic signal are mixed and output to a corresponding speaker. ,
The processing parameter changing unit changes the predetermined mixing amount preset in each mixing unit to a mixing amount according to the size of the sound image desired by the user and / or the clarity of the sound image. The sound image localization control apparatus according to claim 1, wherein

The sound image localization control device includes:
A high-pass filter that passes only a signal having a frequency equal to or higher than a predetermined cutoff frequency among the input acoustic signals;
A low-pass filter that passes only a signal having a frequency equal to or lower than the predetermined cutoff frequency among the input acoustic signals;
Each of the sound image localization means,
A filter coefficient is preset as the processing parameter, and an FIR filter that processes a passing signal of the high-pass filter according to the filter coefficient;
A predetermined mixing amount is set in advance as the processing parameter, and according to the predetermined mixing amount, mixing means for mixing the output signal of the FIR filter and the passing signal of the low-pass filter and outputting to the corresponding speaker. Have
The processing parameter changing unit changes the predetermined mixing amount preset in each mixing unit to a mixing amount according to the size of the sound image desired by the user and / or the clarity of the sound image. The sound image localization control apparatus according to claim 1, wherein

Cut-off frequency changing means for changing the predetermined cut-off frequency of each high-pass filter and each low-pass filter to a cut-off frequency corresponding to the size of the sound image desired by the user and / or the clarity of the sound image The sound image localization control apparatus according to claim 8, further comprising:

Amplitude frequency characteristic correction means for correcting the amplitude frequency characteristic of the input acoustic signal according to a predetermined correction characteristic and outputting to each of the sound image localization means,
The sound image localization control apparatus according to claim 1, further comprising correction characteristic changing means for changing the predetermined correction characteristic to a correction characteristic corresponding to the processing parameter changed by the processing parameter changing means.

The correction characteristic changing means includes an amplitude frequency characteristic of the sound when the input acoustic signal is output as sound from the predetermined position in the vicinity of one of the user's ears, and the vicinity of the one ear. The sound image localization control apparatus according to claim 10, wherein the predetermined correction characteristic is changed so as to reduce a difference from an amplitude frequency characteristic of sounds output from the speakers.

Storage means for storing correspondence information indicating correspondence between the size of the sound image and / or the clarity of the sound image and the object image representing a part of the user according to the type of the object image;
Photographing means for photographing the user and outputting the photographed image data;
Object image detection means for detecting an object image included in the image data output from the photographing means among the object images stored in the storage means;
With reference to the correspondence information stored in the storage means, information relating to the size of the sound image and / or the clarity of the sound image corresponding to the object image detected by the object image detection means is specified and specified. The sound image localization control apparatus according to claim 1, further comprising: an output unit that outputs information regarding the size of the sound image and / or the clarity of the sound image to the processing parameter changing unit.

An object image different from the object image stored in the storage unit is detected from image data captured at a predetermined timing in the imaging unit, and the object image stored in the storage unit is updated to the detected object image. The sound image localization control apparatus according to claim 12, further comprising an object image update means for performing the operation.

An acceleration sensor for detecting the user's movement;
Storage means for storing correspondence information indicating the correspondence between the information about the size of the sound image and / or the clarity of the sound image and the information about the state of the user's operation according to the type of the user's operation;
With reference to the correspondence information stored in the storage means, the information on the size of the sound image and / or the clarity of the sound image corresponding to the information on the user's action detected by the acceleration sensor is specified, The sound image localization control apparatus according to claim 1, further comprising: an output unit that outputs information regarding the specified size of the sound image and / or the clarity of the sound image to the processing parameter changing unit.

A microphone for detecting the user's voice;
Storage means for storing correspondence information indicating correspondence between the information about the magnitude of the sound image and / or the clarity of the sound image and the information about the voice of the user according to the type of the voice of the user;
With reference to the correspondence information stored in the storage means, the information about the size of the sound image and / or the clarity of the sound image corresponding to the information about the user's voice detected by the acceleration sensor, The sound image localization control apparatus according to claim 1, further comprising output means for outputting to the parameter changing means.

A sound image localization control method for outputting sound from a plurality of speakers and localizing a sound image at a position different from each of the speakers,
A sound image localization step for processing an input acoustic signal so that the sound image is localized at a predetermined position according to one or more preset processing parameters separately for each of the plurality of speakers;
A processing parameter changing step for changing at least one of the processing parameters used in the sound image localization step to a processing parameter according to a size of the sound image desired by a user and / or a clarity of the sound image, Sound image localization control method.

A program for causing a computer to localize a sound image at a position different from each speaker by outputting sound from a plurality of speakers,
A sound image localization step for processing an input acoustic signal so that the sound image is localized at a predetermined position according to one or more preset processing parameters separately for each of the plurality of speakers;
A processing parameter changing step for changing at least one of the processing parameters used in the sound image localization step to a processing parameter according to the size of the sound image desired by a user and / or the clarity of the sound image; A program that causes a computer to execute.

An integrated circuit that outputs sound from a plurality of speakers and localizes a sound image at a position different from each of the speakers,
A plurality of speakers that are provided corresponding to each of the plurality of speakers, process an input acoustic signal so that the sound image is localized at a predetermined position in accordance with one or more processing parameters set in advance, and output to the corresponding speakers Sound image localization means,
Processing parameter changing means for changing at least one of the processing parameters set in advance in each of the sound image localization means to a processing parameter according to the size of the sound image desired by the user and / or the clarity of the sound image; An integrated circuit comprising: