JP2008092411A

JP2008092411A - Audio signal generating device

Info

Publication number: JP2008092411A
Application number: JP2006272725A
Authority: JP
Inventors: Katsumi Hasegawa; 勝巳長谷川; Takuma Suzuki; 琢磨鈴木; Toshiko Murata; 寿子村田
Original assignee: Victor Company of Japan Ltd
Current assignee: Victor Company of Japan Ltd
Priority date: 2006-10-04
Filing date: 2006-10-04
Publication date: 2008-04-17

Abstract

<P>PROBLEM TO BE SOLVED: To provide an audio signal generating device which generates stereo signals as expanded stereo signals having a wide interval through narrow-interval speakers, and reproduces a center sound source as a signal giving no feeling of absence of center localization. <P>SOLUTION: The audio signal generating device includes: FFT means 21 and 22 for obtaining sine wave components and cosine wave components at predetermined frequency intervals by Fourier-transforming a stereo signal; a frequency detecting means 24 for comparing the sine wave components and cosine wave components with each other respectively to detect frequencies having level differences within a predetermined range; an adding means 27 for obtaining addition components by adding sine wave components and cosine wave components of the frequencies detected by the frequency detecting means respectively among sine and cosine wave components obtained by the FFT means; an IFFT means 28 for inversely Fourier-transforming the addition components to obtain addition stereo signals; and signal generating means 3 and 4 for adding the addition audio signals to the expanded stereo signals obtained by processing the stereo signals to generate audio signals for reproduction. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

本発明は、ステレオ信号を所定間隔の左右スピーカよりも狭い間隔に配置される狭間隔スピーカから発音させ、前記ステレオ音響信号を狭間隔スピーカよりも広い間隔の広がりを有する拡大ステレオ信号として生成する音響信号生成装置に関する。 The present invention generates a stereo signal from a narrow-spaced speaker arranged at a narrower interval than left and right speakers at a predetermined interval, and generates the stereo sound signal as an enlarged stereo signal having a wider interval than the narrow-spaced speaker. The present invention relates to a signal generation device.

最近になり、小型で高忠実度の音響信号を発音可能なスピーカも実用化されるようになり、小型でありながら臨場感が優れたステレオ再生装置も市場導入されるようになってきた。
一方、臨場感のあるステレオ再生を行うためには、通常は視聴者の前方に正三角形となる位置、即ち±３０度の位置に左右スピーカの設置が想定されるが、視聴空間の制約のため狭い空間にスピーカを設置しなければならない場合もある。スピーカの設置角が小さい場合であっても所定のステレオ効果が得られるようにする。 Recently, small speakers capable of producing sound signals with high fidelity have come into practical use, and stereo reproduction devices that are small but have excellent realism have come into the market.
On the other hand, in order to perform stereo reproduction with a sense of presence, it is usually assumed that left and right speakers are installed at a position that becomes an equilateral triangle, that is, a position of ± 30 degrees in front of the viewer. In some cases, it is necessary to install a speaker in a narrow space. A predetermined stereo effect is obtained even when the installation angle of the speaker is small.

設置角の小さなスピーカ配置において、左右スピーカの配置される位置よりも外側に音源を定位させる拡大ステレオ音によりスピーカから発音させ、所望のステレオ効果を得ることができる。しかし、拡大ステレオ音による再生では、左右スピーカの中央に配置される中央音源も拡大されて再生されるため、拡大された中央音源は左右スピーカの中央位置に定位しなく、いわゆる中抜け音として再生されてしまう。スピーカの中央で歌われるボーカルの音源が周囲に分散された音として視聴されてしまう。左右スピーカの設置角が小さい場合であっても所望のステレオ効果が得られると共に、スピーカの中央に定位する音源は中抜けを伴わない音で再生できることは好ましい。 In a loudspeaker arrangement with a small installation angle, a desired stereo effect can be obtained by generating sound from the loudspeaker with an enlarged stereo sound that localizes the sound source outside the position where the left and right speakers are arranged. However, in playback with expanded stereo sound, the central sound source located in the center of the left and right speakers is also enlarged and played, so the enlarged central sound source is not localized at the center position of the left and right speakers, and is played back as a so-called hollow sound. Will be. The vocal sound source sung in the center of the speaker is viewed as a sound dispersed around. It is preferable that a desired stereo effect can be obtained even when the installation angle of the left and right speakers is small, and that the sound source localized at the center of the speaker can be reproduced with a sound without any hollows.

特許文献１には、入力信号がステレオかモノラルかによりステレオ感の拡大処理の効果を切り替えることにより、モノラル信号時の不自然感をなくすようにした音場再生装置が開示されている。入力信号がステレオ信号かモノラル信号かを判定回路により判定し、この結果によりＦＩＲフィルタの出力信号と入力信号の加算の割合を変化させる。入力信号がステレオであればＦＩＲフィルタの出力信号の方を大きく、入力信号がモノラルの場合はＦＩＲフィルタの出力信号の方を小さくするよう加算器での加算の割合を制御することにより、入力信号がモノラルの場合の音像のボケや音質の劣化を目立たないようにした音場再生装置が開示されている。
特開平５−２４３８８２号公報 Patent Document 1 discloses a sound field reproduction device that eliminates the unnatural feeling at the time of a monaural signal by switching the effect of the stereo enhancement process depending on whether the input signal is stereo or monaural. Whether the input signal is a stereo signal or a monaural signal is determined by a determination circuit, and based on this result, the ratio of the addition of the output signal of the FIR filter and the input signal is changed. When the input signal is stereo, the output signal of the FIR filter is larger, and when the input signal is monaural, the input signal is controlled by controlling the rate of addition in the adder so that the output signal of the FIR filter is smaller. Has disclosed a sound field reproduction device in which the blurring of the sound image and the deterioration of the sound quality are not conspicuous when the sound is monaural.
JP-A-5-243882

しかしながら、特許文献１に開示されている音場再生装置では入力信号がステレオ信号か、又はモノラルに近い信号であるかによりステレオ感の拡大効果を可変させるようにしているのみであり、例えば伴奏用楽器がステレオ配置され、ボーカルが中央でモノラルに近いボケのない音で歌っている場合などでは伴奏用楽器がステレオ感の拡大効果により拡大されて再生されると共に、ボーカルも拡大効果を伴って再生されるため、ボーカル音の音像のボケや音質の劣化が目立つようになってしまう。特に、ステレオ信号を所定間隔の左右スピーカよりも狭い間隔に配置される狭間隔スピーカから発音させ、ステレオ信号を狭間隔スピーカよりも広い間隔の広がりを有する拡大ステレオ信号として生成する場合において、中央位置に定位させるボーカル音源に対しても拡大ステレオ音響信号の効果が与えられてしまい、再生用音響信号としてボケを伴わないボーカル音源と共に生成する音響信号生成装置を実現することはできなかった。 However, the sound field reproducing device disclosed in Patent Document 1 only changes the effect of expanding the stereo feeling depending on whether the input signal is a stereo signal or a signal close to monaural. When the instruments are arranged in stereo and the vocals are singing in the center with a monaural sound with no blur, the accompaniment instruments are enlarged and played with the stereo expansion effect, and the vocals are also played with the expansion effect Therefore, the blur of the sound image of the vocal sound and the deterioration of the sound quality become conspicuous. In particular, when a stereo signal is sounded from a narrowly spaced speaker arranged at a narrower interval than left and right speakers at a predetermined interval, and the stereo signal is generated as an enlarged stereo signal having a wider interval than that of the narrowly spaced speaker, the center position The effect of the enlarged stereo sound signal is also given to the vocal sound source that is localized to the sound source, and it has not been possible to realize a sound signal generation device that generates a sound source for reproduction together with a vocal sound source without blur.

そこで、本発明は、上記のような問題点を解消するためになされたもので、ステレオ信号を所定間隔の左右スピーカよりも狭い間隔に配置される狭間隔スピーカから発音させ、ステレオ信号を狭間隔スピーカよりも広い間隔の広がりを有する拡大ステレオ信号として生成すると共に、左右スピーカの中央に定位される中央音源のみは拡大ステレオ音響信号の効果を与えなく、中抜け感を伴わない再生用音響信号として生成を可能とする音響信号生成装置を提供することを目的とする。 Accordingly, the present invention has been made to solve the above-described problems. A stereo signal is generated by a narrowly spaced speaker arranged at a narrower interval than left and right speakers having a predetermined interval, and the stereo signal is narrowly spaced. It is generated as an enlarged stereo signal with a wider interval than the speakers, and only the central sound source localized at the center of the left and right speakers does not give the effect of the enlarged stereo sound signal, and as a playback acoustic signal without a sense of hollow It is an object of the present invention to provide an acoustic signal generation device that enables generation.

本願発明における第１の発明は、入力される左右２チャンネルのオーディオ信号を左右１組のスピーカにおいて再生した際に得られる音像定位位置よりも左右に拡大した位置に音像定位させる処理を前記入力されるオーディオ信号に対して行って得た拡大オーディオ信号を出力する音響信号生成装置において、前記入力される左右２チャンネルのオーディオ信号のそれぞれをフーリエ変換して所定周波数間隔ごとの各周波数の正弦波成分及び余弦波成分を得るＦＦＴ手段と、前記ＦＦＴ手段より得られた各周波数毎の左チャンネルオーディオ信号の正弦波成分と右チャンネルオーディオ信号の正弦波成分とのレベルを比較しレベル差が所定の範囲にある周波数を１つのまとまりとした第１の周波数群を検出すると共に、前記ＦＦＴ手段より得られた各周波数毎の左チャンネルオーディオ信号の余弦波成分と右チャンネルオーディオ信号の余弦波成分とのレベルを比較しレベル差が所定の範囲にある周波数を１つのまとまりとした第２の周波数群を検出する周波数群検出手段と、前記周波数群検出手段により検出された第１の周波数群の周波数及び第２の周波数群の周波数のうち共通な周波数を検出する周波数検出手段と、前記ＦＦＴ手段により得られた各周波数の左チャンネルオーディオ信号の正弦波成分及び各周波数の右チャンネルオーディオ信号の正弦波成分のうち、前記周波数検出手段で検出された共通な周波数の正弦波成分同士を加算して加算正弦波成分を得ると共に、前記ＦＦＴ手段により得られた各周波数の左チャンネルオーディオ信号の余弦波成分及び各周波数の右チャンネルオーディオ信号の余弦波成分のうち、前記周波数検出手段で検出された共通な周波数の余弦波成分同士を加算して加算余弦波成分を得る加算手段と、前記加算手段より得られた加算正弦波成分及び加算余弦波成分を逆フーリエ変換して加算オーディオ信号を得るＩＦＦＴ手段と、前記左右に拡大した位置に音像定位させる処理を行った拡大オーディオ信号に、前記ＩＦＦＴ手段で得られた加算オーディオ信号を加算した信号を新たな拡大オーディオ信号として生成する信号生成手段と、を備えることを特徴とする音響信号生成装置を提供する。 According to a first aspect of the present invention, the processing for localizing a sound image to a position expanded to the left and right relative to the sound image localization position obtained when the input audio signals of two left and right channels are reproduced by a pair of left and right speakers is input. In an acoustic signal generating apparatus for outputting an enlarged audio signal obtained by performing an audio signal, a sine wave component of each frequency for each predetermined frequency interval by Fourier transforming each of the input left and right channel audio signals And the FFT means for obtaining the cosine wave component, and the level difference between the sine wave component of the left channel audio signal and the sine wave component of the right channel audio signal for each frequency obtained from the FFT means is compared and the level difference is within a predetermined range. Is detected from the FFT means, and a first frequency group having a single frequency group is detected. The second frequency group in which the levels of the cosine wave component of the left channel audio signal and the cosine wave component of the right channel audio signal for each frequency are compared and the frequencies having a level difference within a predetermined range are grouped together. The frequency group detecting means for detecting, the frequency detecting means for detecting a common frequency among the frequencies of the first frequency group and the frequency of the second frequency group detected by the frequency group detecting means, and the FFT means. Of the sine wave component of the left channel audio signal of each frequency and the sine wave component of the right channel audio signal of each frequency, the sine wave components of the common frequency detected by the frequency detection means are added together to add sine And a cosine wave component of the left channel audio signal of each frequency obtained by the FFT means and the right of each frequency. Among the cosine wave components of the channel audio signal, adding means for adding the cosine wave components of the common frequency detected by the frequency detecting means to obtain an added cosine wave component, and the addition sine obtained from the adding means IFFT means for obtaining an added audio signal by performing inverse Fourier transform on the wave component and the added cosine wave component, and the added audio signal obtained by the IFFT means to the enlarged audio signal subjected to the process of localizing the sound image to the position expanded to the left and right There is provided an acoustic signal generation device comprising: signal generation means for generating a signal obtained by adding signals as a new enlarged audio signal.

本発明によれば、入力される左右２チャンネルのオーディオ信号のそれぞれをフーリエ変換して所定周波数間隔ごとの各周波数の正弦波成分及び余弦波成分を得るＦＦＴ手段と、ＦＦＴ手段より得られた各周波数毎の左チャンネルオーディオ信号の正弦波成分と右チャンネルオーディオ信号の正弦波成分とのレベルを比較しレベル差が所定の範囲にある周波数を１つのまとまりとした第１の周波数群を検出すると共に、ＦＦＴ手段より得られた各周波数毎の左チャンネルオーディオ信号の余弦波成分と右チャンネルオーディオ信号の余弦波成分とのレベルを比較しレベル差が所定の範囲にある周波数を１つのまとまりとした第２の周波数群を検出する周波数群検出手段と、周波数群検出手段により検出された第１の周波数群の周波数及び第２の周波数群の周波数のうち共通な周波数を検出する周波数検出手段と、ＦＦＴ手段により得られた各周波数の左チャンネルオーディオ信号の正弦波成分及び各周波数の右チャンネルオーディオ信号の正弦波成分のうち、周波数検出手段で検出された共通な周波数の正弦波成分同士を加算して加算正弦波成分を得ると共に、ＦＦＴ手段により得られた各周波数の左チャンネルオーディオ信号の余弦波成分及び各周波数の右チャンネルオーディオ信号の余弦波成分のうち、周波数検出手段で検出された共通な周波数の余弦波成分同士を加算して加算余弦波成分を得る加算手段と、加算手段より得られた加算正弦波成分及び加算余弦波成分を逆フーリエ変換して加算オーディオ信号を得るＩＦＦＴ手段と、左右に拡大した位置に音像定位させる処理を行った拡大オーディオ信号に、ＩＦＦＴ手段で得られた加算オーディオ信号を加算した信号を新たな拡大オーディオ信号として生成する信号生成手段とを備えるので、ステレオ信号を所定間隔の左右スピーカよりも狭い間隔に配置される狭間隔スピーカから発音させ、ステレオ信号を狭間隔スピーカよりも広い間隔の広がりを有する拡大ステレオ信号として生成すると共に、左右スピーカの中央に定位される中央音源のみは拡大ステレオ音響信号の効果を与えなく、中抜け感を伴わない再生用音響信号として生成することを可能とする音響信号生成装置を実現できる。 According to the present invention, the FFT means for obtaining the sine wave component and the cosine wave component of each frequency for each predetermined frequency interval by Fourier transforming each of the input left and right two-channel audio signals, and each obtained by the FFT means The levels of the sine wave component of the left channel audio signal and the sine wave component of the right channel audio signal for each frequency are compared to detect a first frequency group in which frequencies having a level difference within a predetermined range are grouped together. , The levels of the cosine wave component of the left channel audio signal and the cosine wave component of the right channel audio signal obtained for each frequency obtained by the FFT means are compared, and the frequencies whose level difference is within a predetermined range are defined as one unit. Frequency group detecting means for detecting two frequency groups, and the frequency of the first frequency group detected by the frequency group detecting means and the second frequency group The frequency detection means for detecting a common frequency among the frequencies of the frequency group, and the frequency of the sine wave component of the left channel audio signal of each frequency and the sine wave component of the right channel audio signal of each frequency obtained by the FFT means The sine wave components of the common frequency detected by the detecting means are added together to obtain an added sine wave component, and the cosine wave component of the left channel audio signal of each frequency and the right channel audio of each frequency obtained by the FFT means. Of the cosine wave components of the signal, an adding means for adding the cosine wave components of the common frequency detected by the frequency detecting means to obtain an added cosine wave component, and an added sine wave component and an added cosine obtained from the adding means IFFT means for obtaining an added audio signal by performing inverse Fourier transform on wave components, and processing for localizing a sound image at a position expanded left and right And a signal generation unit that generates a signal obtained by adding the added audio signal obtained by the IFFT unit to the expanded audio signal performed as a new expanded audio signal, so that the stereo signal is spaced at a narrower interval than the left and right speakers at a predetermined interval. Generates sound from the arranged narrow-speaker speakers and generates a stereo signal as an enlarged stereo signal with a wider interval than the narrow-speaker speaker, and only the central sound source localized at the center of the left and right speakers has the effect of the enlarged stereo sound signal Therefore, it is possible to realize an acoustic signal generation device that can generate a reproduction acoustic signal without a sense of hollowing out.

以下に本発明の実施例に係る音響信号生成装置について図１〜図６を用いて説明する。
図１は、本発明の実施に係る音響信号生成装置の構成例を示すブロック図である。図２は、本発明の実施に係る音響信号生成装置の音像定位拡大の動作を説明するための図である。図３は、本発明の実施に係る音響信号生成装置のトランスオーラル処理を説明するための図である。図４は、本発明の実施に係る音響信号生成装置の音像定位拡大処理器の構成例を示すブロック図である。図５は、本発明の実施に係る音響信号生成装置の動作例をフローチャートで示した図である。図６は、本発明の実施に係る音響信号生成装置の応用動作例をフローチャートで示した図である。 Hereinafter, an acoustic signal generation device according to an embodiment of the present invention will be described with reference to FIGS.
FIG. 1 is a block diagram illustrating a configuration example of an acoustic signal generation device according to an embodiment of the present invention. FIG. 2 is a diagram for explaining the sound image localization expansion operation of the acoustic signal generation device according to the embodiment of the present invention. FIG. 3 is a diagram for explaining transoral processing of the acoustic signal generation device according to the embodiment of the present invention. FIG. 4 is a block diagram illustrating a configuration example of the sound image localization expansion processor of the acoustic signal generation device according to the embodiment of the present invention. FIG. 5 is a flowchart illustrating an operation example of the acoustic signal generation device according to the embodiment of the present invention. FIG. 6 is a flowchart illustrating an application operation example of the acoustic signal generation device according to the embodiment of the present invention.

その音響信号生成装置は、ステレオ信号を所定間隔の左右スピーカよりも狭い間隔に配置される狭間隔スピーカから発音させ、ステレオ信号を狭間隔スピーカよりも広い間隔の広がりを有する拡大ステレオ信号として生成すると共に、左右スピーカの中央に定位される中央音源のみは拡大ステレオ音響信号の効果を与えなく、中抜け感を伴わない再生用音響信号として生成することを可能とする装置を実現するという目的を、入力される左右２チャンネルのオーディオ信号のそれぞれをフーリエ変換して所定周波数間隔ごとの各周波数の正弦波成分及び余弦波成分を得るＦＦＴ手段と、ＦＦＴ手段より得られた各周波数毎の左チャンネルオーディオ信号の正弦波成分と右チャンネルオーディオ信号の正弦波成分とのレベルを比較しレベル差が所定の範囲にある周波数を１つのまとまりとした第１の周波数群を検出すると共に、ＦＦＴ手段より得られた各周波数毎の左チャンネルオーディオ信号の余弦波成分と右チャンネルオーディオ信号の余弦波成分とのレベルを比較しレベル差が所定の範囲にある周波数を１つのまとまりとした第２の周波数群を検出する周波数群検出手段と、周波数群検出手段により検出された第１の周波数群の周波数及び第２の周波数群の周波数のうち共通な周波数を検出する周波数検出手段と、ＦＦＴ手段により得られた各周波数の左チャンネルオーディオ信号の正弦波成分及び各周波数の右チャンネルオーディオ信号の正弦波成分のうち、周波数検出手段で検出された共通な周波数の正弦波成分同士を加算して加算正弦波成分を得ると共に、ＦＦＴ手段により得られた各周波数の左チャンネルオーディオ信号の余弦波成分及び各周波数の右チャンネルオーディオ信号の余弦波成分のうち、周波数検出手段で検出された共通な周波数の余弦波成分同士を加算して加算余弦波成分を得る加算手段と、加算手段より得られた加算正弦波成分及び加算余弦波成分を逆フーリエ変換して加算オーディオ信号を得るＩＦＦＴ手段と、左右に拡大した位置に音像定位させる処理を行った拡大オーディオ信号に、ＩＦＦＴ手段で得られた加算オーディオ信号を加算した信号を新たな拡大オーディオ信号として生成する信号生成手段とを備えるようにして実現した。 The acoustic signal generation device generates a stereo signal as an enlarged stereo signal having a wider interval than that of the narrow interval speaker by causing the stereo signal to sound from a narrow interval speaker arranged at a narrower interval than the left and right speakers at a predetermined interval. In addition, only the central sound source localized at the center of the left and right speakers does not give the effect of the enlarged stereo sound signal, and the object is to realize a device that can be generated as a reproduction sound signal without a sense of hollowing out, FFT means for obtaining a sine wave component and a cosine wave component of each frequency for each predetermined frequency interval by Fourier transforming each of the input left and right two-channel audio signals, and left channel audio for each frequency obtained from the FFT means The level difference between the sine wave component of the signal and the sine wave component of the right channel audio signal is compared. A first frequency group having a set of frequencies in a certain range is detected, and a cosine wave component of the left channel audio signal and a cosine wave component of the right channel audio signal for each frequency obtained by the FFT means Frequency group detecting means for detecting a second frequency group in which frequencies having a level difference within a predetermined range are grouped together, and the frequency of the first frequency group detected by the frequency group detecting means, A frequency detection means for detecting a common frequency among the frequencies of the second frequency group; a sine wave component of the left channel audio signal of each frequency and a sine wave component of the right channel audio signal of each frequency obtained by the FFT means; Among them, the sine wave components of the common frequency detected by the frequency detection means are added together to obtain an addition sine wave component, and the FFT means Among the obtained cosine wave components of the left channel audio signal of each frequency and the cosine wave components of the right channel audio signal of each frequency, add and add the cosine wave components of the common frequency detected by the frequency detection means. An adding means for obtaining a cosine wave component, an IFFT means for obtaining an added audio signal by performing inverse Fourier transform on the added sine wave component and the added cosine wave component obtained from the adding means, and a process for localizing the sound image to a position expanded left and right This is realized by including signal generation means for generating a signal obtained by adding the added audio signal obtained by the IFFT means to the performed enlarged audio signal as a new enlarged audio signal.

音響信号生成装置の構成について述べる。
図１に示す音響信号生成装置１０は、音像定位拡大処理器１１及びトランスオーラル再生処理器１２よりなるステレオ音拡大処理部１と、ＦＦＴ（Fast Fourier Transform：高速フーリエ変換器）２１、２２、振幅位相比較器２３、中央成分判定器２４、中央成分抽出器２５、２６、成分加算器２７、ＩＦＦＴ（Inverse fast Fourier transform；逆高速フーリエ変換器）２８、及び制御器２９よりなるセンター音処理部２と、加算器３、４とより構成される。
図４に示す音響信号生成装置のはフィルタ１１１、１１２よりなる音像定位拡大処理器１１と、フィルタ１２１、１２２、加算回路１２３、１２４、フィルタ１２５、及び１２６よりなるトランスオーラル再生処理器１２とより構成される。 The configuration of the acoustic signal generation device will be described.
The acoustic signal generation device 10 shown in FIG. 1 includes a stereo sound expansion processing unit 1 including a sound image localization expansion processor 11 and a transoral reproduction processor 12, FFTs (Fast Fourier Transforms) 21, 22, and amplitude. A center sound processing unit 2 including a phase comparator 23, a central component determiner 24, central component extractors 25 and 26, a component adder 27, an IFFT (Inverse fast Fourier transform) 28, and a controller 29. And adders 3 and 4.
The acoustic signal generation apparatus shown in FIG. 4 includes a sound image localization expansion processor 11 including filters 111 and 112, and a transoral reproduction processor 12 including filters 121 and 122, addition circuits 123 and 124, filters 125 and 126. Composed.

音響信号生成装置の動作について述べる。
まず、音響信号生成装置１０に入力される左右音響入力信号はステレオ音拡大処理部１及びセンター音処理部２に入力される。ステレオ音拡大処理部１の音像定位拡大処理器１１は、左右音響入力信号を狭い間隔で配置される再生用スピーカから発音した際に、発音された音が通常の間隔で配置されるスピーカの位置から発音された音としてヘッドフォンを用いた場合に受聴されるように音像定位を拡大処理した拡大音響信号を生成する。トランスオーラル再生処理器１２はヘッドフォン受聴用として生成された拡大音響信号をスピーカにより再生した場合に、視聴者はスピーカからの再生音がヘッドフォン受聴したと同様に視聴されるスピーカ再生信号として生成する。 The operation of the acoustic signal generator will be described.
First, the left and right sound input signals input to the sound signal generation device 10 are input to the stereo sound expansion processing unit 1 and the center sound processing unit 2. When the sound image localization enlargement processor 11 of the stereo sound enlargement processing unit 1 produces a sound from the playback speakers arranged at narrow intervals, the position of the speaker at which the generated sounds are arranged at normal intervals when the left and right sound input signals are produced from the reproduction speakers. Then, an enlarged acoustic signal is generated by enlarging the sound image localization so that the sound is heard when using headphones as a sound generated from the sound. When the transoral reproduction processor 12 reproduces the enlarged sound signal generated for listening to headphones through a speaker, the viewer generates the reproduced sound from the speaker as a speaker reproduction signal that is viewed in the same manner as when listening to the headphones.

センター音処理部２は、入力される左右音響入力信号から中央に定位される音響信号のみをセンター音信号として取得する。加算器３はトランスオーラル再生処理器１２でスピーカ再生信号として生成された左側の出力信号とセンター音処理部２で取得されたセンター音信号とを加算して出力する。同様にして、加算器４はスピーカ再生用信号として生成された右側の出力信号とセンター音号とを加算して出力する。音響信号生成装置１０から出力される左右出力信号は、狭い間隔で配置されるスピーカから発した音が通常の間隔で配置されるスピーカの位置から発音される拡大音響信号として再生されると共に、左右音響入力信号により中央に定位される音源は拡大音響信号処理がなされなく、中央音源のボケや中抜け感を伴わない再生用音響信号として生成される音響信号生成装置１０を実現している。 The center sound processing unit 2 acquires only the sound signal localized at the center from the input left and right sound input signals as the center sound signal. The adder 3 adds the left output signal generated as the speaker reproduction signal by the transoral reproduction processor 12 and the center sound signal acquired by the center sound processing unit 2 and outputs the result. Similarly, the adder 4 adds and outputs the right output signal generated as the speaker reproduction signal and the center sound. The left and right output signals output from the acoustic signal generation device 10 are reproduced as enlarged sound signals generated from the positions of the speakers arranged at normal intervals when the sound emitted from the speakers arranged at narrow intervals is reproduced. The sound source localized in the center by the sound input signal is not subjected to the enlarged sound signal processing, and the sound signal generation device 10 is generated that is generated as a reproduction sound signal without blurring of the center sound source and a feeling of hollowing out.

次に、詳細に説明する。
図１に示すセンター音処理部２についてさらに述べる。
ＦＦＴ２１は、例えば４４．１ｋＨｚで標本化され、ディジタル信号に変換されて入力される左音響入力信号を、例えば１０２４の標本点を窓区間として高速周波数変換処理を行う。左音響入力信号の正弦周波数成分、及び余弦周波数成分が得られる。ＦＦＴ２２は同様にして右音響入力信号の正弦周波数成分、及び余弦周波数成分を得る。振幅位相比較器２３はＦＦＴ２１及び２２から出力される左右音響入力信号の正弦周波数成分、及び余弦周波数成分のそれぞれを比較し、比較して得られる正弦周波数成分、及び余弦周波数成分のそれぞれのレベル差を、分析して得られる周波数毎に出力する。中央成分判定器２４は、分析して得られた周波数毎に左側音響入力信号の正弦周波数成分、及び余弦周波数成分と右側音響入力信号の正弦周波数成分、及び余弦周波数成分の相対応するそれぞれを比較し、同一周波数成分で正弦周波数成分、及び余弦周波数成分それぞれのレベル差が例えば３ｄＢ以下である周波数値を視聴用左右スピーカの中央に定位するセンター音信号であるとして判定する。 Next, this will be described in detail.
The center sound processing unit 2 shown in FIG. 1 will be further described.
The FFT 21 performs a high-speed frequency conversion process on the left sound input signal sampled at, for example, 44.1 kHz and converted into a digital signal, for example, using 1024 sampling points as window sections. A sine frequency component and a cosine frequency component of the left acoustic input signal are obtained. The FFT 22 similarly obtains a sine frequency component and a cosine frequency component of the right sound input signal. The amplitude phase comparator 23 compares the sine frequency component and the cosine frequency component of the left and right acoustic input signals output from the FFTs 21 and 22, and compares the level difference between the sine frequency component and the cosine frequency component obtained by comparison. Are output for each frequency obtained by analysis. The central component determination unit 24 compares the sine frequency component of the left acoustic input signal, the cosine frequency component, the sine frequency component of the right acoustic input signal, and the corresponding cosine frequency component for each frequency obtained by analysis. Then, it is determined that the center sound signal is localized at the center of the right and left speakers for viewing, with the frequency value having a level difference of, for example, 3 dB or less between the sine frequency component and the cosine frequency component at the same frequency component.

中央成分抽出器２５はＦＦＴ２１で周波数分析して得られた左音響入力信号を構成する周波数のうち、センター音信号であるとして判定された正弦周波数成分及び余弦周波数成分のそれぞれを取得する。中央成分抽出器２６は、ＦＦＴ２２で周波数分析して得られた右音響入力信号を構成する周波数のうち、センター音信号であるとして判定された正弦周波数成分及び余弦周波数成分のそれぞれを取得する。成分加算器２７は中央成分抽出器２５及び２６で取得された正弦周波数成分及び余弦周波数成分のそれぞれを加算する。ＩＦＦＴ２８は、加算して得られた正弦周波数成分及び余弦周波数成分のそれぞれを、ＦＦＴ２１及び２２の窓期間に相当する窓期間で逆高速フーリエ変換を行う。左右音響入力信号に含まれるセンター音信号のみが得られる。加算器３及び４のそれぞれは、トランスオーラル再生処理のなされた左右拡大音響信号に抽出されたセンター音信号を加算する。 The central component extractor 25 acquires each of the sine frequency component and the cosine frequency component determined to be the center sound signal among the frequencies constituting the left sound input signal obtained by frequency analysis by the FFT 21. The central component extractor 26 acquires each of the sine frequency component and the cosine frequency component determined to be the center sound signal among the frequencies constituting the right sound input signal obtained by frequency analysis by the FFT 22. The component adder 27 adds the sine frequency component and the cosine frequency component acquired by the central component extractors 25 and 26, respectively. The IFFT 28 performs inverse fast Fourier transform on each of the sine frequency component and cosine frequency component obtained by the addition in a window period corresponding to the window periods of the FFTs 21 and 22. Only the center sound signal included in the left and right sound input signals is obtained. Each of the adders 3 and 4 adds the extracted center sound signal to the left and right enlarged acoustic signals subjected to the transoral reproduction process.

音響信号生成装置１０は、拡大処理された左右拡大音響信号と、抽出され拡大処理のなされないセンター音信号とを加算して得られた左右出力信号を出力する。センター音信号は拡大処理された信号と拡大処理のなされない信号の両者が出力されるため、拡大処理された周囲音と拡大処理のなされないセンター音とは融合された音として視聴される。センター音信号の拡大処理を弱めたり、あるいは拡大処理されたセンター音と拡大処理のなされないセンター音とのレベル比の調整は設計事項であり、センター音の音源の中抜けが少なく、且つ左右音とセンター音とが分離することなく視聴される適当な配分比を設定することになる。
制御器２９は、上記の各回路ブロックの制御を行う。 The acoustic signal generation device 10 outputs a left and right output signal obtained by adding the enlarged left and right enlarged acoustic signals and the center sound signal that has been extracted and not enlarged. Since the center sound signal is output as both a signal that has undergone enlargement processing and a signal that has not undergone enlargement processing, the ambient sound that has undergone enlargement processing and the center sound that has not undergone enlargement processing are viewed as a fused sound. It is a design matter to weaken the center sound signal expansion process, or to adjust the level ratio between the center sound that has been expanded and the center sound that has not been expanded. Therefore, an appropriate distribution ratio is set so that the center sound can be viewed without being separated.
The controller 29 controls each circuit block described above.

なお、ここで、ＦＦＴ２１、２２によるセンター音源の抽出を正弦周波数成分及び余弦周波数成分のそれぞれを用いて行うとして述べた。ＦＦＴ２１、２２によるセンター音源の抽出は周波数成分のレベル、周波数成分の位相差を調べ、例えばレベル差が３ｄＢ程度であり、且つ位相差が例えば３０度以内である場合にセンター音源であるとして抽出するようにしても良い。 Here, it has been described that the center sound source is extracted by the FFTs 21 and 22 using the sine frequency component and the cosine frequency component, respectively. The center sound source is extracted by the FFTs 21 and 22 by checking the level of the frequency component and the phase difference of the frequency component. For example, when the level difference is about 3 dB and the phase difference is within 30 degrees, for example, the center sound source is extracted. You may do it.

音像定位拡大処理器１１及びトランスオーラル再生処理器１２での処理についてさらに述べる。
図２は、視聴者５９の前面に、例えば開き角２０度で配置される左右のスピーカ５１、５２から発音される音場を、標準的な開き角６０度で配置される仮想スピーカ５３、５４から発音された音として視聴させるための音像定位拡大について示している。即ち、スピーカ５１、５２から発音される音が、仮想スピーカ５３、５４から発音された音として視聴者５９に聞こえさせるようにする。その場合の左側仮想スピーカ５３から発音されて視聴者５９の左側の耳元に到達する音の伝達関数をｈ_ll、右側の耳元に到達する音の伝達関数をｈ_lrとする。右側仮想スピーカ５４から発音されて視聴者５９の右側の耳元に到達する音の伝達関数をｈ_rr、左側の耳元に到達する音の伝達関数をｈ_rlとする。左右の仮想スピーカ５３、５４から再生する信号をｘ_l（ｔ）、ｘ_r（ｔ）とするとき、左右の耳元に到達する信号ｆ_l（ｔ）、ｆ_r（ｔ）は次式で示される。 The processing in the sound image localization enlargement processor 11 and the transoral reproduction processor 12 will be further described.
FIG. 2 shows, for example, sound fields generated from left and right speakers 51 and 52 arranged at an opening angle of 20 degrees on the front of the viewer 59, and virtual speakers 53 and 54 arranged at a standard opening angle of 60 degrees. This shows the expansion of the sound image localization for viewing as a sound generated from the sound. That is, the sound generated from the speakers 51 and 52 is heard by the viewer 59 as the sound generated from the virtual speakers 53 and 54. In this case, the transfer function of the sound that is generated from the left virtual speaker 53 and reaches the left ear of the viewer 59 is h _ll , and the transfer function of the sound that reaches the right ear is h _lr . The transfer function of the sound that is generated from the right virtual speaker 54 and reaches the right ear of the viewer 59 is h _rr , and the transfer function of the sound that reaches the left ear is h _rl . When signals reproduced from the left and right virtual speakers 53 and 54 are x _l (t) and x _r (t), the signals f _l (t) and f _r (t) reaching the left and right ears are expressed by the following equations. It is.

図３を参照し、左右のスピーカ５５、５６から発音させ、式（１）の信号ｆ_l（ｔ）、ｆ_r（ｔ）を視聴者５８の左右の耳元に到達させるためのトランスオーラル信号処理について述べる。
同図は、そのトランスオーラル再生処理を説明するための図であり、収録現場において人工頭５７の左右の耳元で収音されるＰ_l（ｔ）、Ｐ_r（ｔ）の音を、視聴者５８の左右の耳元に生じさせる信号処理について説明している。
まず、人工頭５７の左右の耳元で収音された信号Ｐ_l（ｔ）、Ｐ_r（ｔ）は再生現場に設置されるトランスオーラル再生処理器１２に入力される。トランスオーラル再生処理器から出力される信号をｌ（ｔ）、ｒ（ｔ）とし、信号ｌ（ｔ）、ｒ（ｔ）を左右のスピーカ５５、５６から発音させる。 Referring to FIG. 3, transoral signal processing for causing sound from left and right speakers 55 and 56 to reach signals f ₁ (t) and f _r (t) of Expression (1) to the left and right ears of viewer 58. Is described.
This figure is a diagram for explaining the transoral reproduction process. The sounds of P _l (t) and P _r (t) collected at the right and left ears of the artificial head 57 at the recording site are shown to the viewer. The signal processing generated in the left and right ears 58 is described.
First, the signals P _l (t) and P _r (t) collected by the left and right ears of the artificial head 57 are input to the trans-oral reproduction processor 12 installed at the reproduction site. The signals output from the transoral reproduction processor are l (t) and r (t), and the signals l (t) and r (t) are generated from the left and right speakers 55 and 56, respectively.

ここでスピーカ５５から視聴者５８の左右の耳元までの伝達関数をｈ_LS（ｔ）、ｈ_LO（ｔ）とし、スピーカ５６から視聴者５８の右左の耳元までの伝達関数をｈ_RS（ｔ）、ｈ_RO（ｔ）とし、視聴者５８の左右の耳元に到達すべき音をＰ_l（ｔ）、Ｐ_r（ｔ）とし、次式で表される。

Here, the transfer functions from the speaker 55 to the left and right ears of the viewer 58 are h _LS (t) and h _LO (t), and the transfer functions from the speaker 56 to the right and left ears of the viewer 58 are h _RS (t). , H _RO (t), and the sounds that should reach the left and right ears of the viewer 58 are P ₁ (t) and P _r (t), and are expressed by the following equations.

式（２）の両辺をフーリエ変換し、行列の形で表すと式（３）が得られる。

When both sides of Expression (2) are Fourier transformed and expressed in the form of a matrix, Expression (3) is obtained.

ここで、システムのフィルタ行列をＦとすると式（３）は次式で示される。

Here, when the filter matrix of the system is F, Equation (3) is expressed by the following equation.

式（４）において、Ｆは式（３）中の４つの伝達関数行列式Ｈの逆行列である。このＦで構成されたトランスオーラルシステムをフィードバック方式と呼ぶ。
行列Ｈはさらに次式の様に変形できる。以下、（ｔ）を省いて記述する。

In Expression (4), F is an inverse matrix of the four transfer function determinants H in Expression (3). The transoral system constituted by F is called a feedback system.
The matrix H can be further transformed as follows: Hereinafter, description will be made with (t) omitted.

式（５）を式（３）に代入し、両辺をＨ_LSＨ_RSで除す。

Equation (5) into equation (3), Dividing both sides by H _LS H _RS.

式（６）をフーリエ逆変換し、移項する。

Equation (6) is inverse Fourier transformed and transposed.

図４を参照し、音像定位拡大処理器１１及びトランスオーラル再生処理器１２で行うフィルタ処理の特性について述べる。
音像定位拡大処理器１１は式（１）に示した処理を行う回路ブロックであり、フィルタ１１１は入力される左側の信号ｘ_l（ｔ）にｈ_ll（ｔ）＋ｈ_rl（ｔ）の特性を与え、フィルタ１１２は入力される右側の信号ｘ_r（ｔ）にｈ_lr（ｔ）＋ｈ_rr（ｔ）の特性を与える。
トランスオーラル再生処理器１２は式（７）に示した処理を行う回路ブロックである。即ち、音像定位拡大処理器１１から出力されるｆ_l（ｔ）、ｆ_r（ｔ）をそのまま視聴者５８に視聴させようとするものである。
ｆ_l（ｔ）＝Ｐ_l（ｔ）、ｆ_r（ｔ）＝Ｐ_r（ｔ）とさせるための信号処理を行う。
式（７）に示した特性はトランスオーラル再生処理器１２に示すフィルタ１２１、１２２、１２５、１２６、加算回路（減算回路）１２３、及び１２４により得ることが出来る。 With reference to FIG. 4, the characteristics of the filter processing performed by the sound image localization enlargement processor 11 and the transoral reproduction processor 12 will be described.
The sound image localization enlargement processor 11 is a circuit block that performs the processing shown in Expression (1), and the filter 111 _applies a characteristic of h _ll (t) + h _rl (t) to the input left signal x _l (t). The filter 112 gives a characteristic of h _lr (t) + h _rr (t) to the input right signal x _r (t).
The transoral reproduction processor 12 is a circuit block that performs the processing shown in Expression (7). That is, it is an attempt to view f _l (t) outputted from the sound image localization enlargement processor 11, f _r a (t) as it is to the viewer 58.
Signal processing is performed so that f _l (t) = P _l (t) and f _r (t) = P _r (t).
The characteristic shown in the equation (7) can be obtained by the filters 121, 122, 125, 126 and the addition circuits (subtraction circuits) 123 and 124 shown in the transoral regeneration processor 12.

音像定位拡大処理器１１及びトランスオーラル再生処理器１２により、狭い間隔で配置される再生用スピーカを用い、再生音を通常の間隔で配置されるスピーカの位置から発音された音としてヘッドフォン受聴されるように音像定位を拡大処理して視聴できるものの、左右スピーカの中央に定位されるセンター音源も拡大音響信号処理されて視聴される。センター音処理部２は、センター音源を左右音響入力信号から抽出し、拡大音響信号処理を行うことなく再生させるためセンター音源の音像が必要以上に大きくなったり、中抜け音として視聴されるのを防止している。 The sound image localization enlargement processor 11 and the transoral reproduction processor 12 use a reproduction speaker arranged at a narrow interval, and listen to the reproduced sound as a sound generated from the position of the speaker arranged at a normal interval. As described above, although the sound image localization can be viewed and enlarged, the center sound source localized at the center of the left and right speakers is also viewed with the enlarged acoustic signal processing. The center sound processing unit 2 extracts the center sound source from the left and right sound input signals and reproduces it without performing the enlarged sound signal processing, so that the sound image of the center sound source becomes larger than necessary or is viewed as a hollow sound. It is preventing.

図５を参照し、音響信号生成装置１０の処理の流れについて説明する。
まず、Ｓ（ステップ）７１で音響信号生成装置１０には左右音響入力信号が入力される。Ｓ７２で左右音響入力信号はステレオ音拡大処理部１とセンター音処理部２とに供給される。Ｓ７３でステレオ音拡大処理部１は左右音響入力信号の音像定位拡大処理及びトランスオーラル再生処理を行い、拡大音響信号を生成する。Ｓ７４でセンター音処理部２は左右音響入力信号をＦＦＴ処理し、振幅および位相情報、又は正弦波成分及び余弦波成分の成分情報を得る。Ｓ７５で、ＦＦＴ処理して得られた成分情報が左右の音響入力信号同士で同一である、即ちセンター音信号であるかが、左右に配置されるスピーカの近辺から再生される音響信号との比較において判定される。Ｓ７６で、左右で似通った周波数及びレベル成分が検出される場合はセンター音信号であるとして判定される。そのセンター音成分が抽出される。Ｓ７７で、抽出された中央成分はＩＦＦＴされ、センター音信号が得られる。
Ｓ７８で、Ｓ７３で拡大処理された入力信号と、Ｓ７７で抽出されたセンター音信号とが加算される。Ｓ７９で加算して得られる音響信号が出力信号として音響信号生成装置１０から出力される。 With reference to FIG. 5, the process flow of the acoustic signal generation device 10 will be described.
First, in S (step) 71, the left and right sound input signals are input to the sound signal generation device 10. In S72, the left and right sound input signals are supplied to the stereo sound expansion processing unit 1 and the center sound processing unit 2. In S73, the stereo sound enlargement processing unit 1 performs sound image localization enlargement processing and transoral reproduction processing of the left and right sound input signals, and generates an enlarged sound signal. In S74, the center sound processing unit 2 performs FFT processing on the left and right sound input signals to obtain amplitude and phase information, or component information of sine wave components and cosine wave components. In S75, the component information obtained by the FFT processing is the same between the left and right acoustic input signals, that is, whether it is the center sound signal or not, compared with the acoustic signal reproduced from the vicinity of the left and right speakers. Is determined. In S76, if frequency and level components similar to the left and right are detected, it is determined that the signal is a center sound signal. The center sound component is extracted. In S77, the extracted center component is IFFT to obtain a center sound signal.
In S78, the input signal expanded in S73 and the center sound signal extracted in S77 are added. The acoustic signal obtained by the addition in S79 is output from the acoustic signal generation device 10 as an output signal.

図６を参照し、音響信号生成装置１０の応用処理の流れについて説明する。
図６に示す応用処理は、図５に示した処理に比し、入力信号からセンター音信号を抽出する処理方法で異なっている。図５に示したと同じ処理については同じ符号を付し説明を省く。
Ｓ８１において、Ｓ７５で判定して得られた左右音響信号の中央成分が存在する周波数特性から、中央成分のみを通過させるためのフィルタ特性を得る。Ｓ８２で、得られたバンドパス特性を有するフィルタを実現する。そのフィルタは急峻な通過、遮断特性を有するイコライザ特性とも類似している。Ｓ８３で、Ｓ７２により供給された入力信号を、Ｓ８２で実現された特性で通過させることによりセンター音信号を抽出する。Ｓ７８以降は同様に動作し、音響信号が音響信号生成装置１０から出力される。 With reference to FIG. 6, the flow of application processing of the acoustic signal generation device 10 will be described.
The application process shown in FIG. 6 differs from the process shown in FIG. 5 in the processing method for extracting the center sound signal from the input signal. The same processes as those shown in FIG. 5 are denoted by the same reference numerals and description thereof is omitted.
In S81, a filter characteristic for passing only the central component is obtained from the frequency characteristic in which the central component of the left and right acoustic signals obtained by the determination in S75 is present. In S82, a filter having the obtained bandpass characteristic is realized. The filter is also similar to an equalizer characteristic having a steep passage and cutoff characteristic. In S83, the center sound signal is extracted by passing the input signal supplied in S72 with the characteristics realized in S82. After S78, the operation is the same, and an acoustic signal is output from the acoustic signal generation device 10.

以上のように、本実施例で示した音響信号生成装置１０によれば、入力される左右２チャンネルのオーディオ信号のそれぞれをフーリエ変換して所定周波数間隔ごとの各周波数の正弦波成分及び余弦波成分を得るＦＦＴ手段２１、２２と、ＦＦＴ手段より得られた各周波数毎の左チャンネルオーディオ信号の正弦波成分と右チャンネルオーディオ信号の正弦波成分とのレベルを比較しレベル差が所定の範囲にある周波数を１つのまとまりとした第１の周波数群を検出すると共に、ＦＦＴ手段より得られた各周波数毎の左チャンネルオーディオ信号の余弦波成分と右チャンネルオーディオ信号の余弦波成分とのレベルを比較しレベル差が所定の範囲にある周波数を１つのまとまりとした第２の周波数群を検出する周波数群検出手段２３と、周波数群検出手段により検出された第１の周波数群の周波数及び第２の周波数群の周波数のうち共通な周波数を検出する周波数検出手段２４と、ＦＦＴ手段により得られた各周波数の左チャンネルオーディオ信号の正弦波成分及び各周波数の右チャンネルオーディオ信号の正弦波成分のうち、周波数検出手段で検出された共通な周波数の正弦波成分同士を加算して加算正弦波成分を得ると共に、ＦＦＴ手段により得られた各周波数の左チャンネルオーディオ信号の余弦波成分及び各周波数の右チャンネルオーディオ信号の余弦波成分のうち、周波数検出手段で検出された共通な周波数の余弦波成分同士を加算して加算余弦波成分を得る加算手段２７と、加算手段より得られた加算正弦波成分及び加算余弦波成分を逆フーリエ変換して加算オーディオ信号を得るＩＦＦＴ手段２８と、左右に拡大した位置に音像定位させる処理を行った拡大オーディオ信号に、ＩＦＦＴ手段で得られた加算オーディオ信号を加算した信号を新たな拡大オーディオ信号として生成する信号生成手段３、４とを備えるので、ステレオ信号を所定間隔の左右スピーカよりも狭い間隔に配置される狭間隔スピーカから発音させ、ステレオ信号を狭間隔スピーカよりも広い間隔の広がりを有する拡大ステレオ信号として生成すると共に、左右スピーカの中央に定位される中央音源のみは拡大ステレオ音響信号の効果を与えなく、中抜け感を伴わない再生用音響信号として生成することを可能とする音響信号生成装置を実現できる。 As described above, according to the acoustic signal generation device 10 shown in the present embodiment, the left and right two-channel audio signals that are input are Fourier-transformed and the sine wave component and cosine wave of each frequency for each predetermined frequency interval. Compare the levels of the FFT means 21 and 22 for obtaining the components and the sine wave component of the left channel audio signal and the sine wave component of the right channel audio signal for each frequency obtained by the FFT means, and the level difference is within a predetermined range. Detects the first frequency group with a certain frequency as one unit, and compares the level of the cosine wave component of the left channel audio signal and the cosine wave component of the right channel audio signal for each frequency obtained from the FFT means Frequency group detecting means 23 for detecting a second frequency group in which frequencies having a level difference within a predetermined range are grouped, and a frequency A frequency detection means 24 for detecting a common frequency among the frequencies of the first frequency group and the second frequency group detected by the group detection means; and the left channel audio signal of each frequency obtained by the FFT means. Of the sine wave component and the sine wave component of the right channel audio signal of each frequency, the sine wave components of the common frequency detected by the frequency detecting means are added together to obtain an added sine wave component and obtained by the FFT means. Among the cosine wave component of the left channel audio signal of each frequency and the cosine wave component of the right channel audio signal of each frequency, the cosine wave components of the common frequency detected by the frequency detecting means are added together to add the cosine wave component The addition means 27 for obtaining the sum and the addition sine wave component and the addition cosine wave component obtained by the addition means are subjected to inverse Fourier transform to obtain an addition audio IFFT means 28 for obtaining a signal, and signal generation for generating a signal obtained by adding the added audio signal obtained by the IFFT means to the enlarged audio signal subjected to the process of localizing the sound image at a position enlarged left and right as a new enlarged audio signal Since the means 3 and 4 are provided, the stereo signal is emitted from a narrow-spaced speaker arranged at a narrower interval than the left and right speakers at a predetermined interval, and the stereo signal is expanded as a widened stereo signal having a wider interval than the narrow-spaced speaker. Realizes an acoustic signal generation device that can be generated as a reproduction acoustic signal without the effect of an enlarged stereo sound signal, and only the central sound source localized at the center of the left and right speakers does not give the effect of an enlarged stereo sound signal it can.

本発明の実施に係る音響信号生成装置の構成例を示すブロック図である。It is a block diagram which shows the structural example of the acoustic signal generation apparatus which concerns on implementation of this invention. 本発明の実施に係る音響信号生成装置の音像定位拡大の動作を説明するための図である。It is a figure for demonstrating the operation | movement of sound image localization expansion of the acoustic signal generation apparatus which concerns on implementation of this invention. 本発明の実施に係る音響信号生成装置のトランスオーラル処理を説明するための図である。It is a figure for demonstrating the trans-oral process of the acoustic signal generator based on implementation of this invention. 本発明の実施に係る音響信号生成装置の音像定位拡大処理器の構成例を示すブロック図である。It is a block diagram which shows the structural example of the sound image localization expansion processor of the acoustic signal generator which concerns on implementation of this invention. 本発明の実施に係る音響信号生成装置の動作例をフローチャートで示した図である。It is the figure which showed the operation example of the acoustic signal generation apparatus which concerns on implementation of this invention with the flowchart. 本発明の実施に係る音響信号生成装置の応用動作例をフローチャートで示した図である。It is the figure which showed the example of application operation | movement of the acoustic signal generator based on implementation of this invention with the flowchart.

Explanation of symbols

１ステレオ音拡大処理部
２センター音処理部
３、４加算器
１０音響信号生成装置
１１音像定位拡大処理器
１２トランスオーラル再生処理器
２１、２２ＦＦＴ
２３振幅位相比較器
２４中央成分判定器
２５、２６中央成分抽出器
２７成分加算器
２８ＩＦＦＴ
２９制御器
１１１、１１２、１２１、１２２、１２５、１２６フィルタ
１２３、１２４加算回路 DESCRIPTION OF SYMBOLS 1 Stereo sound expansion process part 2 Center sound process part 3, 4 Adder 10 Acoustic signal production | generation apparatus 11 Sound image localization expansion processer 12 Trans-oral reproduction | regeneration processor 21, 22 FFT
23 Amplitude / phase comparator 24 Central component determination unit 25, 26 Central component extractor 27 Component adder 28 IFFT
29 Controller 111, 112, 121, 122, 125, 126 Filter 123, 124 Adder circuit

Claims

Obtained by performing sound image localization processing on the input audio signal to a position expanded to the left and right with respect to the sound image localization position obtained when the input left and right two-channel audio signals are reproduced by a pair of left and right speakers. In an acoustic signal generation device that outputs an enlarged audio signal,
FFT means for obtaining a sine wave component and a cosine wave component of each frequency for each predetermined frequency interval by Fourier transforming each of the input left and right two-channel audio signals;
The levels of the sine wave component of the left channel audio signal and the sine wave component of the right channel audio signal obtained for each frequency obtained from the FFT means are compared, and the frequencies having a level difference within a predetermined range are defined as one unit. 1 frequency group is detected, and the levels of the cosine wave component of the left channel audio signal and the cosine wave component of the right channel audio signal for each frequency obtained from the FFT means are compared, and the level difference is within a predetermined range. A frequency group detection means for detecting a second frequency group in which a certain frequency is a unit;
Frequency detection means for detecting a common frequency among the frequencies of the first frequency group and the second frequency group detected by the frequency group detection means;
Of the sine wave component of the left channel audio signal of each frequency and the sine wave component of the right channel audio signal of each frequency obtained by the FFT unit, sine wave components of the common frequency detected by the frequency detection unit are combined. Addition to obtain an addition sine wave component, and the frequency detection means detects the cosine wave component of the left channel audio signal of each frequency and the cosine wave component of the right channel audio signal of each frequency obtained by the FFT means. Adding means for adding the cosine wave components having the same common frequency to obtain an added cosine wave component;
IFFT means for obtaining an added audio signal by performing inverse Fourier transform on the added sine wave component and the added cosine wave component obtained from the adding means;
Signal generating means for generating, as a new enlarged audio signal, a signal obtained by adding the added audio signal obtained by the IFFT means to the enlarged audio signal subjected to the process of localizing the sound image at the position expanded to the left and right;
An acoustic signal generation device comprising: