JP5547432B2

JP5547432B2 - Automatic volume control device

Info

Publication number: JP5547432B2
Application number: JP2009168407A
Authority: JP
Inventors: 武志橋本
Original assignee: Clarion Co Ltd
Current assignee: Faurecia Clarion Electronics Co Ltd
Priority date: 2009-07-17
Filing date: 2009-07-17
Publication date: 2014-07-16
Anticipated expiration: 2029-07-17
Also published as: JP2011024058A

Description

本発明は自動音量制御装置に関し、より詳細には、音楽が流されている空間において会話が行われた場合に、発話に応じて音楽の出力レベルを音声帯域のモノラル成分のみ低減させることにより、円滑な会話を実現することが可能な自動音量制御装置に関する。 The present invention relates to an automatic volume control device, and more particularly, by reducing only the monaural component of the audio band according to the utterance when the conversation is performed in a space where music is played, The present invention relates to an automatic volume control device capable of realizing smooth conversation.

走行中の車両の室内では、運転（走行）中に音楽やラジオ番組等を流すことが多い。このような状況において、運転手と同乗者とが会話を行う場合には、音楽等の再生音によって、円滑な会話（会話の聞き取り等）が妨げられてしまうおそれがあった。 In the interior of a running vehicle, music and radio programs are often played while driving (running). In such a situation, when the driver and the passenger have a conversation, there is a possibility that a smooth conversation (listening of the conversation, etc.) may be hindered by the reproduced sound such as music.

一般的な車載用オーディオ装置には、音量を調節するための音量調節スイッチや、音量を一時的に低減させるためのミュートスイッチなどが設けられている（例えば、特許文献１および特許文献２参照）。このため、運転者等は、音量調節スイッチやミュートスイッチを操作することにより、会話を妨げない程度まで音楽等の再生音量を低減させることが多かった。 A general in-vehicle audio apparatus is provided with a volume control switch for adjusting the volume, a mute switch for temporarily reducing the volume, and the like (see, for example, Patent Document 1 and Patent Document 2). . For this reason, the driver or the like often reduces the playback volume of music or the like to such an extent that the conversation is not hindered by operating the volume control switch and the mute switch.

特開２００８−６２９０６号公報JP 2008-62906 A 特開２００６−６７４９０号公報JP 2006-67490 A

しかしながら、会話を行う度に音量調節スイッチを操作して再生音を低減する方法では、操作が煩雑になり、かえって円滑な会話を妨げてしまうおそれがあるという問題があった。一方で、ミュートスイッチを用いて音楽の再生音（出力レベル）を低減させる方法では、会話が途切れた状態においてもそのまま音楽の再生音が低減された状態となってしまい、音楽やラジオ番組等を楽しむことができないという問題があった。 However, the method of operating the volume control switch every time a conversation is performed to reduce the reproduced sound has a problem that the operation becomes complicated and may hinder smooth conversation. On the other hand, with the method of reducing the music playback sound (output level) using the mute switch, the music playback sound is reduced even when the conversation is interrupted. There was a problem that I could not enjoy it.

このため、音量調節スイッチやミュートスイッチを操作することなく会話が成立するような大きな声を、発話者が発することにより、音楽等を再生させたままの状態で会話を行うこともしばしば行われるが、会話が続く場合には、発話者はもちろんのこと会話の相手側においても会話に疲労を感じてしまうおそれがあるという問題があった。 For this reason, it is often the case that a speaker utters a loud voice that can establish a conversation without operating a volume control switch or a mute switch, and the conversation is performed while music is being reproduced. When the conversation continues, there is a problem that the conversation partner may feel tired as well as the speaker.

本発明は、上記問題に鑑みてなされたものであり、運転者又は同乗者が音量調節スイッチやミュートスイッチを操作することなく、さらに、大きな声を発することなく通常の状態で円滑な会話を行うことが可能な自動音量制御装置を提供することを課題とする。 The present invention has been made in view of the above problems, and a driver or a passenger does a smooth conversation in a normal state without operating a volume control switch or a mute switch and without generating a loud voice. It is an object of the present invention to provide an automatic sound volume control device that can be used.

上記課題を解決するために、本発明に係る自動音量制御装置は、マイクにより取得された音響信号より音声信号の検出を行う音声検出手段と、音源により出力されるＬチャンネル用の音楽信号とＲチャンネル用の音楽信号とのそれぞれを、音声帯域に対応する周波数帯域からなる第１音声帯域信号と、前記音声帯域に対応しない周波数帯域からなる非音声帯域信号とに帯域分割する帯域分割手段と、Ｌチャンネル用の前記第１音声帯域信号とＲチャンネル用の前記第１音声帯域信号とのいずれか一方の音声帯域信号だけ位相反転を行う位相反転手段と、該位相反転手段により位相反転されたＬチャンネル用の前記第１音声帯域信号あるいはＲチャンネル用の前記第１音声帯域信号のいずれか一方の音声帯域信号と、前記位相反転手段により位相反転されなかった他方の音声帯域信号とを合成することにより、Ｌチャンネル用の第２音声帯域信号とＲチャンネル用の第２音声帯域信号とを生成する音声帯域信号合成手段と、前記音声検出手段により前記音声信号の検出が行われた場合に、前記音声帯域信号合成手段により生成されたＬチャンネル用の前記第２音声帯域信号とＲチャンネル用の前記第２音声帯域信号とを各チャンネル用の音声帯域信号として出力し、前記音声検出手段により前記音声信号の検出が行われなかった場合には、前記帯域分割手段により帯域分割されたＬチャンネル用の前記第１音声帯域信号とＲチャンネル用の前記第１音声帯域信号とをそのまま各チャンネル用の音声帯域信号として出力する音声帯域信号選択手段と、該音声帯域信号選択手段により出力されたＬチャンネル用の音声帯域信号およびＲチャンネル用の音声帯域信号と、前記帯域分割手段において帯域分割されたＬチャンネル用の非音声帯域信号およびＲチャンネル用の非音声帯域信号とを、それぞれのチャンネル毎に合成することにより、Ｌチャンネル用の全帯域の音楽信号およびＲチャンネル用の全帯域の音楽信号とを生成する音楽信号合成手段とを備えることを特徴とする。 In order to solve the above-described problems, an automatic volume control device according to the present invention includes an audio detection unit that detects an audio signal from an acoustic signal acquired by a microphone, an L channel music signal output from a sound source, and an R signal. Band division means for dividing each of the channel music signals into a first audio band signal composed of a frequency band corresponding to an audio band and a non-audio band signal composed of a frequency band not corresponding to the audio band; Phase inversion means for performing phase inversion for only one of the audio band signals of the first audio band signal for the L channel and the first audio band signal for the R channel, and L that has been phase inverted by the phase inversion means. Either the first audio band signal for the channel or the first audio band signal for the R channel and the phase inverting means A voice band signal synthesizing unit that generates a second voice band signal for the L channel and a second voice band signal for the R channel by synthesizing the other voice band signal that has not been inverted; and the voice detection unit When the audio signal is detected by the above, the second audio band signal for the L channel and the second audio band signal for the R channel generated by the audio band signal synthesis means are used for each channel. When the audio signal is output as an audio band signal and the audio signal is not detected by the audio detection unit, the first audio band signal for the L channel and the R channel for the channel divided by the band division unit Voice band signal selection means for outputting the first voice band signal as it is as a voice band signal for each channel, and output by the voice band signal selection means The L-channel audio band signal and the R-channel audio band signal, and the L-channel non-audio band signal and the R-channel non-audio band signal that are band-divided by the band dividing unit, respectively. It is characterized by comprising music signal synthesizing means for generating a music signal for the entire band for the L channel and a music signal for the entire band for the R channel by combining each channel.

本発明に係る自動音量制御装置によれば、Ｌチャンネル用の音楽信号とＲチャンネル用の音楽信号とのそれぞれを、音声帯域に対応する周波数帯域からなる第１音声帯域信号と、音声帯域に対応しない周波数帯域からなる非音声帯域信号とに帯域分割した後に、Ｌチャンネル用の第１音声帯域信号とＲチャンネル用の第１音声帯域信号とのいずれか一方の音声帯域信号だけ位相反転させて、位相反転されなかった他方の音声帯域信号に合成させるので、合成された第２音声帯域信号は、モノラル成分の出力（信号レベル）が合成により低減された状態になる。 According to the automatic volume control device of the present invention, each of the L channel music signal and the R channel music signal corresponds to the first audio band signal having a frequency band corresponding to the audio band and the audio band. After the frequency is divided into non-voice band signals composed of non-frequency bands, only one of the voice band signals of the first voice band signal for the L channel and the first voice band signal for the R channel is inverted, Since the other voice band signal that has not undergone phase inversion is synthesized, the synthesized second voice band signal is in a state where the output (signal level) of the monaural component is reduced by synthesis.

このようにして音声帯域におけるモノラル成分の出力（信号レベル）が低減された音楽信号（第２音声帯域信号）と帯域分割手段により帯域分割された非音声帯域信号とを合成することにより、音楽信号の全体的なステレオ感を維持した状態で、会話の妨げとなり得る音声帯域のモノラル成分の信号レベルだけを効果的に低減させることができるので、音楽が流れる状況であっても円滑な会話を楽しむことが可能となる。 The music signal is synthesized by synthesizing the music signal (second audio band signal) in which the output (signal level) of the monaural component in the audio band is reduced in this way and the non-audio band signal band-divided by the band dividing unit. Because it can effectively reduce only the signal level of the monaural component of the voice band that can hinder conversation while maintaining the overall stereo feeling, enjoy smooth conversation even when music flows It becomes possible.

また、会話中であっても音楽信号における全体的なステレオ感（特に非音声帯域信号のステレオ成分）は維持されているので、円滑な会話を実現させつつ、音楽の音質（臨場感や迫力など）の低下を抑制することが可能となる。 In addition, the overall stereo feeling of the music signal (especially the stereo component of the non-speech band signal) is maintained even during a conversation, so the sound quality of the music (realism, power, etc.) can be achieved while realizing a smooth conversation. ) Can be suppressed.

さらに、上述したように音楽信号合成手段により生成された全帯域の音楽信号において、音声帯域のモノラル成分における信号レベルを低減させる場合とは、音声検出手段により音声信号が検出された時だけ、つまり、発話者により言葉が発せられた時だけであるため、会話が行われる時だけ自動的に音楽の音声帯域におけるモノラル成分の信号レベルを低減させることができる。 Furthermore, as described above, in the music signal of the entire band generated by the music signal synthesizing unit, the signal level in the monaural component of the audio band is reduced only when the audio signal is detected by the audio detecting unit. Since it is only when words are spoken by the speaker, the signal level of the monaural component in the audio band of music can be automatically reduced only when conversation is performed.

また、このように音声信号の検出に応じて音楽信号の音量制御が行われるため、会話が行われていない状態（会話が途切れた状態など）では、音声帯域におけるモノラル成分の低減処理（音量制御）は行われない。このため、会話が行われていない状態（音声信号の検出がない状態）では、音楽信号において低減処理（音量制御）が全く行われないので、良質な音質の音楽を楽しむことが可能となる。 In addition, since the volume control of the music signal is performed in response to the detection of the audio signal in this way, in a state where the conversation is not being performed (such as a state where the conversation is interrupted), the monaural component reduction processing (volume control) ) Is not performed. For this reason, in a state where no conversation is performed (a state in which no audio signal is detected), no reduction processing (volume control) is performed on the music signal, so that it is possible to enjoy high-quality music.

また、前記音声帯域信号合成手段は、前記音声帯域信号合成手段により合成処理されたＬチャンネル用の前記第２音声帯域信号とＲチャンネル用の前記第２音声帯域信号とに対して、それぞれのチャンネルに応じたゲインの重み付けを行うゲイン付加手段を有するものであってもよい。 Further, the voice band signal synthesizing unit is configured to provide a channel for each of the second voice band signal for the L channel and the second voice band signal for the R channel synthesized by the voice band signal synthesizing unit. There may be provided gain adding means for weighting the gain in accordance with.

このように、本発明に係る自動音量制御装置では、ゲイン付加手段により、モノラル成分の低減が成された第２音声帯域信号に対して、それぞれのチャンネルに応じたゲインの重み付けが行われるため、音声帯域信号においてモノラル成分の低減が行われていても、残されたステレオ成分が強調されることになるので、音声帯域（ボーカル成分）を含む全体的なステレオ感を高く維持することが可能となる。 As described above, in the automatic sound volume control device according to the present invention, the gain addition unit weights the gain corresponding to each channel for the second audio band signal in which the monaural component is reduced. Even if the monaural component is reduced in the audio band signal, the remaining stereo component is emphasized, so that the overall stereo feeling including the audio band (vocal component) can be kept high. Become.

さらに、ゲイン付加手段により第２音声帯域信号に対する重み付け処理が行われるため、Ｌチャンネル用の第２音声帯域信号とＲチャンネル用の第２音声帯域信号とが無相関な信号となり、音楽信号合成手段において音楽信号の合成を行う場合において、ステレオ成分の音楽信号が干渉してしまうことを防止することが可能となる。 Further, since the weight addition processing is performed on the second audio band signal by the gain adding means, the second audio band signal for the L channel and the second audio band signal for the R channel become an uncorrelated signal, and the music signal synthesizing means It is possible to prevent the stereo component music signal from interfering when the music signal is synthesized in FIG.

また、上述した自動音量制御装置において、前記帯域分割手段は、前記音源により出力されるＬチャンネル用の前記音楽信号とＲチャンネル用の前記音楽信号とのそれぞれを、音声帯域に対応する周波数帯域からなる前記第１音声帯域信号と、前記音声帯域よりも低い周波数帯域からなる低域信号と、前記音声帯域よりも高い周波数帯域からなる高域信号とに帯域分割すると共に、前記第１音声帯域信号のみ位相反転を行い、前記音楽信号合成手段は、前記音声帯域信号選択手段により出力されたＬチャンネル用の前記音声帯域信号およびＲチャンネル用の前記音声帯域信号と、Ｌチャンネル用の前記低域信号およびＲチャンネル用の前記低域信号と、Ｌチャンネル用の前記高域信号およびＲチャンネル用の前記高域信号とを、それぞれのチャンネル毎に合成することにより、Ｌチャンネル用の全帯域の音楽信号およびＲチャンネル用の全帯域の音楽信号とを生成するものであってもよい。 Further, in the automatic volume control device described above, the band dividing unit extracts each of the music signal for L channel and the music signal for R channel output from the sound source from a frequency band corresponding to a voice band. The first audio band signal is divided into the first audio band signal, the low frequency signal having a frequency band lower than the audio band, and the high frequency signal having a frequency band higher than the audio band, and the first audio band signal And the music signal synthesizing means outputs the audio band signal for the L channel and the audio band signal for the R channel output by the audio band signal selecting means, and the low frequency signal for the L channel. And the low frequency signal for the R channel, the high frequency signal for the L channel, and the high frequency signal for the R channel, respectively. By synthesizing each tunnel, it may be configured to generate a full-band music signals for music signals and R-channel of the total bandwidth of the L-channel.

このように、非音声帯域信号として、音声帯域よりも低い周波数帯域からなる低域信号と、音声帯域よりも高い周波数帯域からなる高域信号とに、音声信号を帯域分割することにより、音楽信号合成手段において合成処理が行われた音楽信号の高帯域および低帯域のステレオ成分をより効果的に発現することが可能となる。 Thus, as a non-speech band signal, the audio signal is divided into a low-frequency signal having a frequency band lower than the voice band and a high-frequency signal having a frequency band higher than the voice band. It becomes possible to express the high-band and low-band stereo components of the music signal subjected to the synthesis processing in the synthesis means more effectively.

さらに、帯域分割手段において第２音声帯域信号のみ、あらかじめ位相反転を行うことにより、音楽信号合成手段で各帯域の信号を合成する場合において、各信号間で干渉が生じてしまうことを低減させることが可能となる。 Furthermore, by performing phase inversion only for the second audio band signal in the band dividing means in advance, it is possible to reduce the occurrence of interference between the signals when the music signal synthesizing means synthesizes the signals of each band. Is possible.

本発明に係る自動音量制御装置によれば、Ｌチャンネル用の音楽信号とＲチャンネル用の音楽信号とのそれぞれを、音声帯域に対応する周波数帯域からなる第１音声帯域信号と、前記音声帯域に対応しない周波数帯域からなる非音声帯域信号とに帯域分割した後に、Ｌチャンネル用の第１音声帯域信号とＲチャンネル用の第１音声帯域信号とのいずれか一方の音声帯域信号だけ位相反転させて、位相反転されなかった他方の音声帯域信号に合成させるので、合成された第２音声帯域信号は、モノラル成分の出力（信号レベル）が合成により低減された状態になる。 According to the automatic volume control device of the present invention, each of the L channel music signal and the R channel music signal is divided into the first voice band signal having a frequency band corresponding to the voice band and the voice band. After band division into non-voice band signals having non-corresponding frequency bands, only one of the voice band signals of the first voice band signal for L channel and the first voice band signal for R channel is inverted in phase. Therefore, the synthesized second audio band signal is in a state in which the output of the monaural component (signal level) is reduced by the synthesis.

本実施の形態に係る自動音量制御装置の概略構成を示したブロック図である。It is the block diagram which showed schematic structure of the automatic volume control apparatus which concerns on this Embodiment. 本実施の形態に係る音声強調処理部の概略構成を示したブロック図である。It is the block diagram which showed schematic structure of the audio | voice emphasis processing part which concerns on this Embodiment. 本実施の形態に係るオーディオキャンセラ部の概略構成を示したブロック図である。It is the block diagram which showed schematic structure of the audio canceller part which concerns on this Embodiment. 本実施の形態に係る音声検出部の概略構成を示したブロック図である。It is the block diagram which showed schematic structure of the audio | voice detection part which concerns on this Embodiment. 本実施の形態に係る移動平均部において移動平均が求められた音声信号と、音声検出スレッショルド部において設定される音声検出スレッショルドとの関係を示した図である。It is the figure which showed the relationship between the audio | voice signal from which the moving average was calculated | required in the moving average part which concerns on this Embodiment, and the audio | voice detection threshold set in an audio | voice detection threshold part. 本実施の形態に係る第１帯域分割部の概略構成を示したブロック図である。It is the block diagram which showed schematic structure of the 1st zone | band division | segmentation part which concerns on this Embodiment. 本実施の形態に係る第１帯域分割部において、低域音楽信号Ｌ、中域音楽信号Ｌおよび高域音楽信号Ｌを生成するためのフィルタ特性を示した図である。It is the figure which showed the filter characteristic for producing | generating the low-pass music signal L, the mid-range music signal L, and the high-pass music signal L in the 1st band division part which concerns on this Embodiment. 本実施の形態に係る音量制御部の概略構成を示したブロック図である。It is the block diagram which showed schematic structure of the volume control part which concerns on this Embodiment. 本実施の形態に係るマイクにより取得される音声信号の周波数特性を一例として示した図である。It is the figure which showed as an example the frequency characteristic of the audio | voice signal acquired with the microphone which concerns on this Embodiment. Ｌチャンネル用のホワイト雑音とＲチャンネル用のホワイト雑音とが同一のモノラル信号である場合において、本実施の形態に係る自動音量制御装置により、音量制御が行われた場合と行われていない場合とのホワイト雑音の周波数特性を示した図である。When the white noise for the L channel and the white noise for the R channel are the same monaural signal, the volume control is performed and the volume control is not performed by the automatic volume control device according to the present embodiment. It is the figure which showed the frequency characteristic of white noise. Ｌチャンネル用のホワイト雑音とＲチャンネル用のホワイト雑音とがそれぞれ無相関のステレオ信号である場合において、本実施の形態に係る自動音量制御装置により、音量制御が行われた場合と行われていない場合とのホワイト雑音の周波数特性を示した図である。When the white noise for the L channel and the white noise for the R channel are respectively uncorrelated stereo signals, the automatic volume control device according to this embodiment does not perform the volume control. It is the figure which showed the frequency characteristic of the white noise with the case. Ｌチャンネル用とＲチャンネル用との音楽信号がステレオ成分を備えている場合において、自動音量制御装置による音量制御が行われた場合と行われていない場合との音楽信号の周波数特性を示した図である。The figure which showed the frequency characteristic of the music signal with the case where the volume control by an automatic volume control apparatus is not performed when the music signal for L channels and the channel for R has a stereo component. It is.

以下、本発明に係る自動音量制御装置について、図面を用いて詳細に説明を行う。 Hereinafter, an automatic sound volume control apparatus according to the present invention will be described in detail with reference to the drawings.

図１は、本実施の形態に係る自動音量制御装置の概略構成を示したブロック図である。なお、本実施の形態では、自動音量制御装置１が車両に設置される場合を一例として示して説明する。本実施の形態に係る自動音量制御装置１を車両に設置することにより、会話の有無に応じて、音源より出力される音楽信号のうち音声帯域のモノラル成分のみを自動的に低減させることが可能になる。 FIG. 1 is a block diagram showing a schematic configuration of the automatic volume control device according to the present embodiment. In the present embodiment, the case where the automatic volume control device 1 is installed in a vehicle will be described as an example. By installing the automatic volume control device 1 according to the present embodiment in a vehicle, it is possible to automatically reduce only the monaural component of the voice band in the music signal output from the sound source according to the presence or absence of conversation. become.

本実施の形態に係る自動音量制御装置１は、図１に示すように、音声強調処理部２、音声検出部（音声検出手段）３、第１帯域分割部（帯域分割手段）４、第２帯域分割部（帯域分割手段）５、音量制御部（位相反転手段、音声帯域信号合成手段、音声帯域信号選択手段、ゲイン付加手段）６、第１合成部（音楽信号合成手段）７、第２合成部（音楽信号合成手段）８、第１パワーアンプ９、第２パワーアンプ１０、スピーカ１１ａ，１１ｂ、マイクＭにより概略構成されている。 As shown in FIG. 1, the automatic sound volume control apparatus 1 according to the present embodiment includes a voice enhancement processing unit 2, a voice detection unit (speech detection unit) 3, a first band division unit (band division unit) 4, a second Band division unit (band division unit) 5, volume control unit (phase inversion unit, voice band signal synthesis unit, voice band signal selection unit, gain addition unit) 6, first synthesis unit (music signal synthesis unit) 7, second A synthesizing unit (music signal synthesizing means) 8, a first power amplifier 9, a second power amplifier 10, speakers 11a and 11b, and a microphone M are roughly configured.

［音声強調処理部］
まず、音声強調処理部２について説明する。図２は、音声強調処理部２の概略構成を示したブロック図である。音声強調処理部２は、オーディオキャンセラ部１５と、ノイズキャンセラ部１６とを有している。音声強調処理部２は、マイクＭで取得された音響信号の中から、音楽信号や車両走行騒音などのノイズ音を低減させる役割を有している。 [Speech enhancement processor]
First, the speech enhancement processing unit 2 will be described. FIG. 2 is a block diagram showing a schematic configuration of the speech enhancement processing unit 2. The speech enhancement processing unit 2 includes an audio canceller unit 15 and a noise canceller unit 16. The speech enhancement processing unit 2 has a role of reducing noise sounds such as music signals and vehicle running noise from the acoustic signals acquired by the microphone M.

［オーディオキャンセラ部］
次に、オーディオキャンセラ部１５について説明を行う。オーディオキャンセラ部１５は、図３に示すように、第１バンドパスフィルタ部２１と、第２バンドパスフィルタ部２２と、第１遅延部２３と、第２遅延部２４と、第１適応フィルタ部２５と、第２適応フィルタ部２６とを有している。 [Audio canceller]
Next, the audio canceller unit 15 will be described. As shown in FIG. 3, the audio canceller unit 15 includes a first bandpass filter unit 21, a second bandpass filter unit 22, a first delay unit 23, a second delay unit 24, and a first adaptive filter unit. 25 and a second adaptive filter unit 26.

第１バンドパスフィルタ部２１および第２バンドパスフィルタ部２２は、音源より出力された２チャンネルの音楽信号、すなわちＬチャンネル（左側）用の音楽信号ＬおよびＲチャンネル（右側）用の音楽信号Ｒに対して２００Ｈｚ〜２．６ｋＨｚ程度の帯域制限を行うことにより、音声信号のうち主に音声帯域の信号のみを通過させる役割を有している。 The first band-pass filter unit 21 and the second band-pass filter unit 22 are two-channel music signals output from the sound source, that is, the music signal L for the L channel (left side) and the music signal R for the R channel (right side). On the other hand, by limiting the band of about 200 Hz to 2.6 kHz, it mainly has a role of allowing only the voice band signal among the voice signals to pass.

第１遅延部２３および第２遅延部２４は、第１バンドパスフィルタ部２１および第２バンドパスフィルタ部２２により帯域制限処理が行われた音響信号に対して遅延処理を施す役割を有している。第１遅延部２３および第２遅延部２４による遅延処理によって、マイクＭにより取得された音響信号の伝搬遅延の補正を行うことが可能となる。 The first delay unit 23 and the second delay unit 24 have a role of performing a delay process on the acoustic signal subjected to the band limiting process by the first band pass filter unit 21 and the second band pass filter unit 22. Yes. By the delay processing by the first delay unit 23 and the second delay unit 24, it is possible to correct the propagation delay of the acoustic signal acquired by the microphone M.

第１適応フィルタ部２５は、第１ＦＩＲ部２８、第１ＬＭＳ部２９、第１加算部３０により概略構成されており、第２適応フィルタ部２６は、第２ＦＩＲ部３１、第２ＬＭＳ部３２、第２加算部３３により概略構成されている。 The first adaptive filter unit 25 is roughly configured by a first FIR unit 28, a first LMS unit 29, and a first addition unit 30, and the second adaptive filter unit 26 includes a second FIR unit 31, a second LMS unit 32, a second The adder 33 is roughly configured.

第１ＦＩＲ部２８は、有限のインパルス応答フィルタを備えており、第１ＬＭＳ部２９によって行われる係数制御に基づいて、マイクＭで集音された音響信号に対してフィルタ処理を施す機能を有している。第１加算部３０は、第１ＦＩＲ部２８によりフィルタ処理が行われた音楽信号Ｌの位相を反転させた状態で、マイクＭより入力される音響信号に対して加算する（実質的には、マイクＭの音響信号から、フィルタ処理が行われた音楽信号Ｌの音響信号を減算する）。第１加算部３０により加算処理された音響信号は、第１適応フィルタ部２５から出力されるとともに、第１ＬＭＳ部２９へ出力される。 The first FIR unit 28 includes a finite impulse response filter, and has a function of filtering the acoustic signal collected by the microphone M based on the coefficient control performed by the first LMS unit 29. Yes. The first adder 30 adds the sound signal input from the microphone M in a state where the phase of the music signal L filtered by the first FIR unit 28 is inverted (substantially the microphone). The acoustic signal of the music signal L that has been filtered is subtracted from the acoustic signal of M). The acoustic signal added by the first addition unit 30 is output from the first adaptive filter unit 25 and also output to the first LMS unit 29.

第１ＬＭＳ部２９は、第１加算部３０より取得した音響信号（マイクＭの音響信号からフィルタ処理が行われた音楽信号Ｌが減算された信号）と、音源からの音楽信号Ｌとを用い、最小二乗アルゴリズムに基づいて第１ＦＩＲ部２８におけるフィルタの係数制御を行う。このように第１ＬＭＳ部２９を第１適応フィルタ部２５に設けることによって、ＬＭＳアルゴリズムを適用することが可能となる。 The first LMS unit 29 uses the acoustic signal (a signal obtained by subtracting the music signal L subjected to the filter processing from the acoustic signal of the microphone M) acquired from the first adding unit 30 and the music signal L from the sound source, Filter coefficient control in the first FIR unit 28 is performed based on the least square algorithm. By providing the first LMS unit 29 in the first adaptive filter unit 25 in this way, the LMS algorithm can be applied.

また、第２適応フィルタ部２６は、図３に示すように、第１適応フィルタ部２５における第１ＦＩＲ部２８と、第１ＬＭＳ部２９と、第１加算部３０とを、第２ＦＩＲ部３１と、第２ＬＭＳ部３２と、第２加算部３３に置き換えたものであり、第２適応フィルタ部２６の構成およびその機能は、第１適応フィルタ部２５と同じであるため、説明を省略する。 Further, as shown in FIG. 3, the second adaptive filter unit 26 includes a first FIR unit 28, a first LMS unit 29, a first adder unit 30, a second FIR unit 31 in the first adaptive filter unit 25, The second LMS unit 32 and the second adder unit 33 are replaced. The configuration and function of the second adaptive filter unit 26 are the same as those of the first adaptive filter unit 25, and thus the description thereof is omitted.

第２適応フィルタ部２６の第２加算部３３では、第１適応フィルタ部２５においてフィルタ処理が行われた音響信号に対して、第２ＦＩＲ部３１によりフィルタ処理が行われた音楽信号Ｒを、位相を反転させた状態で加算する（実質的には、第１適応フィルタ部２５においてフィルタ処理が行われた音響信号から、第２ＦＩＲ部３１によりフィルタ処理が行われた音楽信号Ｒを減算する）。第２加算部３３により加算処理された音響信号は、第２適応フィルタ部２６からノイズキャンセラ部１６へ出力されるとともに、第２ＬＭＳ部３２へ出力される。 In the second addition unit 33 of the second adaptive filter unit 26, the music signal R filtered by the second FIR unit 31 with respect to the acoustic signal filtered by the first adaptive filter unit 25 is phase-converted. Is added (substantially, the music signal R filtered by the second FIR unit 31 is subtracted from the acoustic signal filtered by the first adaptive filter unit 25). The acoustic signal added by the second addition unit 33 is output from the second adaptive filter unit 26 to the noise canceller unit 16 and also output to the second LMS unit 32.

なお、オーディオキャンセラ部１５では、図３に示すように、第１適応フィルタ部２５および第２適応フィルタ部２６がカスケード接続されている。従って、オーディオキャンセラ部１５では、第１適応フィルタ部２５においてマイクＭから入力された音響信号を音楽信号Ｌで減算処理した後に、第２適応フィルタ部２６において第１適応フィルタ部２５で減算処理された音響信号を音楽信号Ｒで減算処理する構成となっている。この場合において、第１適応フィルタ部２５は第２適応フィルタ部２６よりも早く収束させることが必要となるため、適応速度を大きく設定している。なお、音源が２チャンネル以上ある場合は、チャンネル数に応じて適応フィルタ部の設置数を増加することにより同様の効果を奏することが可能である。 In the audio canceller unit 15, the first adaptive filter unit 25 and the second adaptive filter unit 26 are cascade-connected as shown in FIG. Accordingly, in the audio canceller unit 15, the acoustic signal input from the microphone M in the first adaptive filter unit 25 is subtracted with the music signal L, and then subtracted in the first adaptive filter unit 25 in the second adaptive filter unit 26. The audio signal is subtracted by the music signal R. In this case, since the first adaptive filter unit 25 needs to converge faster than the second adaptive filter unit 26, the adaptive speed is set large. When there are two or more sound sources, the same effect can be obtained by increasing the number of adaptive filter units installed according to the number of channels.

［ノイズキャンセラ部］
ノイズキャンセラ部１６は、スペクトル減算法などを用いて、オーディオキャンセラ部１５より入力される信号からノイズ信号の除去を行う役割を有している。具体的に説明すると、オーディオキャンセラ部１５から出力される信号には、音声信号とノイズ信号（音声以外の信号）とが含まれている。ノイズキャンセラ部１６は、周波数スペクトルにおける時間的に定常的な成分（会話が行われていない状況、音声がない区間における成分）
）をノイズパターンとして推定・保持しており、音声信号とノイズ信号とが含まれる入力信号（オーディオキャンセラ部１５より入力される信号）の周波数領域において、保持するノイズパターンを減算すると同時に、ノイズパターン自体を更新し続けることにより、入力信号（オーディオキャンセラ部１５より入力される信号）からノイズ信号の除去を行う。 [Noise canceller]
The noise canceller 16 has a role of removing a noise signal from a signal input from the audio canceller 15 using a spectral subtraction method or the like. More specifically, the signal output from the audio canceller 15 includes an audio signal and a noise signal (a signal other than audio). The noise canceller 16 is a temporally stationary component in the frequency spectrum (a component in a situation where there is no speech and a section where there is no speech).
) Is estimated and held as a noise pattern, and at the same time as the noise pattern to be subtracted in the frequency domain of the input signal including the audio signal and the noise signal (signal input from the audio canceller unit 15), the noise pattern By continuing to update itself, the noise signal is removed from the input signal (the signal input from the audio canceller 15).

上述したようにオーディオキャンセラ部１５においてフィルタ処理を行うことによって、さらに、ノイズキャンセラ部１６においてノイズ除去処理を行うことによって、マイクＭで取得された音響信号の中から、音楽信号や車両走行騒音などのノイズ音が低減されるので、発話者の音声を強調させることが可能となる。 As described above, by performing filter processing in the audio canceller unit 15 and further performing noise removal processing in the noise canceller unit 16, music signals, vehicle running noise, and the like can be obtained from the acoustic signals acquired by the microphone M. Since the noise sound is reduced, the voice of the speaker can be emphasized.

［音声検出部］
次に、音声検出部３について説明する。図４は、音声検出部３の概略構成を示したブロック図である。音声検出部３は、図４に示すように、実効値検出部４１と、移動平均部４２と、音声検出スレッショルド部４３と、音声信号ホールド部４４とを有している。 [Audio detector]
Next, the voice detection unit 3 will be described. FIG. 4 is a block diagram showing a schematic configuration of the voice detection unit 3. As shown in FIG. 4, the voice detection unit 3 includes an effective value detection unit 41, a moving average unit 42, a voice detection threshold unit 43, and a voice signal hold unit 44.

実効値検出部４１は、音声強調処理部２によって音声強調処理が行われた音声信号において、所定区間の実効値の検出を行う役割を有している。移動平均部４２は、実効値検出部４１において実効値の検出が行われた信号に対して、所定区間の移動平均を求める役割を有している。音声検出スレッショルド部４３は、あらかじめ設定された音声検出スレッショルド（閾値）に基づいて、信号レベルが音声検出スレッショルド（閾値）を超えた音声信号の検出を行う役割を有している。 The effective value detection unit 41 has a role of detecting an effective value in a predetermined section in the audio signal subjected to the audio enhancement processing by the audio enhancement processing unit 2. The moving average unit 42 has a role of obtaining a moving average of a predetermined section with respect to the signal whose effective value is detected by the effective value detecting unit 41. The voice detection threshold unit 43 has a role of detecting a voice signal whose signal level exceeds the voice detection threshold (threshold) based on a preset voice detection threshold (threshold).

図５は、移動平均部４２において移動平均が求められた音声信号と、音声検出スレッショルド部４３において設定される音声検出スレッショルドとの関係を示した図である。音声検出スレッショルド部４３において音声信号の検出を行う場合、音声強調処理部２より入力される音声信号は、音声強調処理部２においてノイズ音が低減されて発話者の音声が強調された状態となっているので、音声検出スレッショルドに基づく音声信号の検出を容易に行うことが可能である。図５において音声検出スレッショルドよりも高い信号レベルとなる部分（例えば、図５に示した音声区間に該当する部分など）が音声信号の検出部分として判断されることになる。 FIG. 5 is a diagram showing a relationship between the voice signal whose moving average is obtained by the moving average unit 42 and the voice detection threshold set by the voice detection threshold unit 43. When the speech signal is detected by the speech detection threshold unit 43, the speech signal input from the speech enhancement processing unit 2 is in a state where the noise sound is reduced and the speech of the speaker is enhanced by the speech enhancement processing unit 2. Therefore, it is possible to easily detect the audio signal based on the audio detection threshold. In FIG. 5, a part having a signal level higher than the voice detection threshold (for example, a part corresponding to the voice section shown in FIG. 5) is determined as a voice signal detection part.

音声信号ホールド部４４は、音声検出スレッショルド部４３において検出された音声検出信号に対して所定時間の保持を行う役割を有している。音声信号ホールド部４４によりホールド処理された音声信号は、音声検出信号として音量制御部６に出力される。 The audio signal holding unit 44 has a role of holding a predetermined time for the audio detection signal detected by the audio detection threshold unit 43. The audio signal that is hold-processed by the audio signal holding unit 44 is output to the volume control unit 6 as an audio detection signal.

［第１帯域分割部］
次に、第１帯域分割部４について説明する。図６は、第１帯域分割部４の概略構成を示したブロック図である。第１帯域分割部４は、図６に示すように、第１ローパスフィルタ部５１と第２ローパスフィルタ部５２と、第１ハイパスフィルタ部５３と、第２ハイパスフィルタ部５４と、位相反転部５５とを有している。 [First Band Division]
Next, the first band dividing unit 4 will be described. FIG. 6 is a block diagram showing a schematic configuration of the first band dividing unit 4. As shown in FIG. 6, the first band dividing unit 4 includes a first low-pass filter unit 51, a second low-pass filter unit 52, a first high-pass filter unit 53, a second high-pass filter unit 54, and a phase inverting unit 55. And have.

第１ローパスフィルタ部５１および第２ローパスフィルタ部５２は、低域通過型の３次のバタワース（Butterworth）・フィルタを２段カスケード接続することにより構成されたものであり、また、第１ハイパスフィルタ部５３および第２ハイパスフィルタ部５４は、高域通過型の３次のバタワース・フィルタを２段カスケード接続することにより構成されている。 The first low-pass filter unit 51 and the second low-pass filter unit 52 are configured by two-stage cascade connection of low-pass type third-order Butterworth filters, and the first high-pass filter The unit 53 and the second high-pass filter unit 54 are configured by cascade-connecting high-pass third-order Butterworth filters.

本実施の形態に係る第１ローパスフィルタ部５１、第２ローパスフィルタ部５２、第１ハイパスフィルタ部５３および第２ハイパスフィルタ部５４を用いて、入力された音楽信号Ｌを、低域音楽信号Ｌ、中域音楽信号Ｌおよび高域音楽信号Ｌへ帯域分割するためのフィルタ特性は、図７に示した状態になる。具体的に、第１ローパスフィルタ部５１のカットオフ周波数は３００Ｈｚ、第２ローパスフィルタ部５２のカットオフ周波数は６，０００Ｈｚ、第１ハイパスフィルタ部５３のカットオフ周波数は３００Ｈｚ、第２ハイパスフィルタ部５４のカットオフ周波数は６，０００Ｈｚに設定されている。 Using the first low-pass filter unit 51, the second low-pass filter unit 52, the first high-pass filter unit 53, and the second high-pass filter unit 54 according to the present embodiment, the input music signal L is converted into the low-frequency music signal L. The filter characteristics for dividing the band into the mid-range music signal L and the high-range music signal L are as shown in FIG. Specifically, the cutoff frequency of the first low-pass filter unit 51 is 300 Hz, the cutoff frequency of the second low-pass filter unit 52 is 6,000 Hz, the cutoff frequency of the first high-pass filter unit 53 is 300 Hz, and the second high-pass filter unit. The cutoff frequency of 54 is set to 6,000 Hz.

このようにしてカットオフ周波数が設定される第１ローパスフィルタ部５１、第２ローパスフィルタ部５２、第１ハイパスフィルタ部５３および第２ハイパスフィルタ部５４を、図６に示すようにそれぞれを組み合わせることによって、第１帯域分割部４に入力される音楽信号Ｌを、低域用の音楽信号として帯域分割が行われた低域音楽信号（非音声帯域信号）Ｌと、中域用の音楽信号として帯域分割が行われた中域音楽信号（第１音声帯域信号）Ｌと、高域用の音楽信号として帯域分割が行われた高域音楽信号（非音声帯域信号）Ｌとに分割することが可能となる。ここで、中域音楽信号の帯域幅は、第２ローパスフィルタ部５２のカットオフ周波数と、第１ハイパスフィルタ部５３のカットオフ周波数より明らかなように、音声信号の音声帯域に相当する帯域として設定されている。 The first low-pass filter unit 51, the second low-pass filter unit 52, the first high-pass filter unit 53, and the second high-pass filter unit 54 in which the cutoff frequency is set in this way are combined as shown in FIG. Thus, the music signal L input to the first band dividing unit 4 is divided into a low-frequency music signal (non-speech band signal) L subjected to band division as a low-frequency music signal and a mid-range music signal. Dividing into a mid-range music signal (first audio band signal) L subjected to band division and a high-frequency music signal (non-sound band signal) L subjected to band division as a high-frequency music signal. It becomes possible. Here, the bandwidth of the mid-range music signal is a band corresponding to the audio band of the audio signal, as is clear from the cutoff frequency of the second low-pass filter unit 52 and the cutoff frequency of the first high-pass filter unit 53. Is set.

位相反転部５５は、第２ローパスフィルタ部５２および第１ハイパスフィルタ部５３によりフィルタ処理が行われた中域音楽信号Ｌの位相を反転させる役割を有している。帯域分割により中域音楽信号を生成する場合には、図７に示すように、音楽信号に対して第２ローパスフィルタ部５２が適用されてカットオフ周波数が６，０００Ｈｚ以下の信号レベルのみが音楽信号の出力として認められ、さらに、第１ハイパスフィルタ部５３が適用されてカットオフ周波数が３００Ｈｚ以上の信号レベルのみが音楽信号の出力として認められることになる。このため、位相反転部５５によって中域音楽信号の位相を反転させることにより、低域周波数部分および高域周波数部分の出力値が強調されることになり、第１合成部７で帯域分割されたそれぞれの音楽信号を合成する場合において、それぞれの信号間で生ずる干渉を低減させることが可能となる。 The phase inverting unit 55 has a role of inverting the phase of the mid-range music signal L that has been filtered by the second low-pass filter unit 52 and the first high-pass filter unit 53. In the case of generating a mid-range music signal by band division, as shown in FIG. 7, the second low-pass filter unit 52 is applied to the music signal and only the signal level with a cutoff frequency of 6,000 Hz or less is music. Only the signal level with a cut-off frequency of 300 Hz or higher is recognized as the output of the music signal by applying the first high-pass filter unit 53. For this reason, by inverting the phase of the mid-range music signal by the phase inverting unit 55, the output values of the low-frequency part and the high-frequency part are emphasized, and the band is divided by the first synthesizing unit 7. In the case of synthesizing each music signal, it is possible to reduce interference generated between the respective signals.

なお、第２帯域分割部５は、第１帯域分割部４と実質的に同一の構成であり、入力された音楽信号Ｒをフィルタ処理により低域音楽信号Ｒと中域音楽信号Ｒと高域音楽信号Ｒとに帯域分割する。第２帯域分割部５の構成については、第１帯域分割部４と同一であるため、ここでの詳細な説明は省略する。 The second band dividing unit 5 has substantially the same configuration as that of the first band dividing unit 4, and the input music signal R is filtered and processed by a low frequency music signal R, a middle frequency music signal R, and a high frequency band. Band division into music signal R is performed. Since the configuration of the second band dividing unit 5 is the same as that of the first band dividing unit 4, a detailed description thereof is omitted here.

［音量制御部］
次に、音量制御部６について説明する。図８は、音量制御部６の概略構成を示したブロック図である。音量制御部６は、図８に示すように、位相反転部（位相反転手段）６１と、合成部（音声帯域信号合成手段）６２と、第１ゲイン部（ゲイン付加手段）６３と、第２ゲイン部（ゲイン付加手段）６４と、選択部（音声帯域信号選択手段）６５とを有している。 [Volume control section]
Next, the volume control unit 6 will be described. FIG. 8 is a block diagram showing a schematic configuration of the volume control unit 6. As shown in FIG. 8, the volume control unit 6 includes a phase inversion unit (phase inversion unit) 61, a synthesis unit (audio band signal synthesis unit) 62, a first gain unit (gain addition unit) 63, and a second unit. A gain section (gain addition means) 64 and a selection section (voice band signal selection means) 65 are provided.

位相反転部６１は、第２帯域分割部５により帯域分割された中域音楽信号Ｒに対して位相反転を行う役割を有しており、合成部６２は、位相反転された中域音楽信号Ｒと、第１帯域分割部４により帯域分割された中域音楽信号Ｌとの合成を行う役割を有している。このように、中域音楽信号Ｌに対して、位相反転された中域音楽信号Ｒを合成することにより、中域音楽信号Ｒと中域音楽信号Ｌとの共通する音楽成分であるモノラル成分が除去されることになる。 The phase inversion unit 61 has a role of performing phase inversion on the mid-range music signal R band-divided by the second band-splitting unit 5, and the synthesizing unit 62 has the phase-inverted mid-range music signal R And the middle band music signal L which has been band-divided by the first band dividing unit 4. In this way, by synthesizing the mid-range music signal R whose phase has been inverted with respect to the mid-range music signal L, a monaural component that is a common music component between the mid-range music signal R and the mid-range music signal L is obtained. Will be removed.

特に、中域音楽信号Ｒおよび中域音楽信号Ｌは、音楽信号における音声帯域成分、つまり、音楽信号におけるボーカル帯域に該当するため、合成部６２による合成処理によって、ボーカル帯域のモノラル成分の除去が行われることになる。一方で、合成された中域音楽信号（第２音声帯域信号）において、中域音楽信号Ｒと中域音楽信号Ｌとで共通していない音楽成分は残されることになるので、結果として中域音楽信号においてステレオ成分は保持された状態となる。 In particular, the mid-range music signal R and the mid-range music signal L correspond to the voice band component in the music signal, that is, the vocal band in the music signal. Therefore, the monaural component in the vocal band is removed by the synthesis processing by the synthesis unit 62. Will be done. On the other hand, in the synthesized mid-range music signal (second audio band signal), a music component that is not common to the mid-range music signal R and the mid-range music signal L is left. The stereo component is retained in the music signal.

第１ゲイン部６３および第２ゲイン部６４は、ボーカル帯域のモノラル成分が除去された中域音楽信号に対してＬチャンネル用およびＲチャンネル用のゲインの重み付けを行うことにより、位相とレベルの補正を行う役割を有している。ボーカル帯域のモノラル成分が除去された（ステレオ成分が保持された）中域音楽信号は、第１ゲイン部６３において０．７０７（１／√２）倍のゲインの重み付けが行われ、また、第２ゲイン部６４において、−０．７０７（−１／√２）倍のゲインの重み付けが行われる。第１ゲイン部６３においてゲインの重み付けが行われた中域音楽信号はＬチャンネル用の中域音楽信号となり、第２ゲイン部６４において重み付けが行われた中域音楽信号はＲチャンネル用の中域音楽信号となる。 The first gain unit 63 and the second gain unit 64 perform phase and level correction by weighting the L channel and R channel gains to the mid-range music signal from which the monaural component of the vocal band has been removed. Has a role to do. The mid-range music signal from which the monaural component of the vocal band has been removed (the stereo component is retained) is weighted with a gain of 0.707 (1 / √2) in the first gain unit 63, and In the 2-gain unit 64, gain weighting of -0.707 (-1 / √2) times is performed. The mid-range music signal weighted by the first gain unit 63 is an L-channel mid-range music signal, and the mid-range music signal weighted by the second gain unit 64 is the R-channel mid-range music signal. It becomes a music signal.

この重み付け処理によって、Ｌチャンネル用およびＲチャンネル用のそれぞれの中域音楽信号は、チャンネル間で無相関な信号になると共に、それぞれのステレオ成分が強調されることになるため、次述する第１合成部７および第２合成部８において各帯域の音楽信号を合成する場合において、音楽信号がステレオ成分において干渉してしまうことを防止することが可能となると共に、ステレオ成分の音質劣化を低減させることが可能になる。 As a result of this weighting process, the mid-range music signals for the L channel and the R channel become uncorrelated signals between the channels, and the respective stereo components are emphasized. When the music signal of each band is synthesized in the synthesis unit 7 and the second synthesis unit 8, it is possible to prevent the music signal from interfering with the stereo component, and to reduce the deterioration of the sound quality of the stereo component. It becomes possible.

選択部６５は、音声検出部３により出力された音声検出信号に応じて、音量制御部６より第１合成部７へ出力する中域音楽信号Ｌ'と、第２合成部８へ出力する中域音楽信号Ｒ'との決定を行う役割を有している。具体的に選択部６５は、音声検出信号に基づいて音声の検出が行われたと判断した場合には、第１合成部７へ出力する中域音楽信号Ｌ'としてモノラル成分の除去が行われたＬチャンネル用の中域音楽信号を選択すると共に、第２合成部８へ出力する中域音楽信号Ｒ'としてモノラル成分の除去が行われたＲチャンネル用の中域音楽信号を選択する。 The selection unit 65 outputs the mid-range music signal L ′ output from the volume control unit 6 to the first synthesis unit 7 and the output to the second synthesis unit 8 in accordance with the voice detection signal output from the voice detection unit 3. It has the role of making a determination with the regional music signal R ′. Specifically, when the selection unit 65 determines that the voice is detected based on the voice detection signal, the monaural component is removed as the mid-range music signal L ′ to be output to the first synthesis unit 7. The mid-range music signal for the L channel from which the monaural component has been removed is selected as the mid-range music signal R ′ to be output to the second synthesis unit 8 while the mid-range music signal for the L channel is selected.

一方で、選択部６５は、音声検出信号に基づいて音声の検出が行われなかったと判断した場合には、第１合成部７へ出力する中域音楽信号Ｌ'としてモノラル成分の除去が行われていないＬチャンネル用の中域音楽信号Ｌ（つまり、第１帯域分割部４において帯域分割された中域音楽信号Ｌ）を選択すると共に、第２合成部８へ出力する中域音楽信号Ｒ'としてモノラル成分の除去が行われていないＲチャンネル用の中域音楽信号Ｒ（つまり、第２帯域分割部５において帯域分割された中域音楽信号Ｒ）を選択する。選択部６５により選択されたＬチャンネル用の中域音楽信号（中域音楽信号Ｌ'）は、第１合成部７へ出力され、選択されたＲチャンネル用の中域音楽信号（中域音楽信号Ｒ'）は、第２合成部８に出力される。 On the other hand, when the selection unit 65 determines that no voice is detected based on the voice detection signal, the monaural component is removed as the mid-range music signal L ′ output to the first synthesis unit 7. A mid-range music signal L ′ for the L channel that is not yet selected (that is, the mid-range music signal L band-divided by the first band dividing unit 4) and output to the second synthesizing unit 8 is selected. The R channel mid-band music signal R from which the monaural component has not been removed (that is, the mid-band music signal R band-divided by the second band dividing unit 5) is selected. The mid-range music signal for L channel (mid-range music signal L ′) selected by the selection unit 65 is output to the first synthesizing unit 7, and the selected mid-range music signal (mid-range music signal for R channel). R ′) is output to the second synthesis unit 8.

［第１合成部および第２合成部］
第１合成部７は、第１帯域分割部４において帯域分割された低域音楽信号Ｌおよび高域音楽信号Ｌと、音量制御部６により出力された中域音楽信号Ｌ'とを合成して第１パワーアンプ９へと出力する役割を有している。マイクＭにより音声信号が取得された場合、第１合成部７において合成される中域音楽信号Ｌ'（音声帯域に該当）は、モノラル成分が除去された状態となる。このため、車室内で会話が行われた場合には、会話の妨げとなり得る中域成分のモノラル成分のみが低減された状態となり、円滑な会話を行うことが可能となる。さらに、マイクＭにより音声信号が取得された場合には、音楽のモノラル成分は除去されてしまうが、ステレオ成分は除去されることがないため、音楽の音質（臨場感や迫力など）を損なうことがない。 [First synthesis unit and second synthesis unit]
The first synthesizing unit 7 synthesizes the low-frequency music signal L and the high-frequency music signal L band-divided by the first band dividing unit 4 with the mid-range music signal L ′ output from the volume control unit 6. It has a role to output to the first power amplifier 9. When the audio signal is acquired by the microphone M, the mid-range music signal L ′ (corresponding to the audio band) synthesized by the first synthesis unit 7 is in a state where the monaural component is removed. For this reason, when a conversation is performed in the passenger compartment, only the monaural component of the middle region that can hinder the conversation is reduced, and a smooth conversation can be performed. Furthermore, when an audio signal is acquired by the microphone M, the monaural component of the music is removed, but the stereo component is not removed, so that the sound quality of the music (such as presence and power) is impaired. There is no.

第２合成部８も、第１合成部７と同様に、第２帯域分割部５において帯域分割された低域音楽信号Ｒおよび高域音楽信号Ｒと、音量制御部６により出力された中域音楽信号Ｒ'とを合成して第２パワーアンプ１０へと出力する役割を有している。第２合成部８においても、マイクＭで音声信号が取得された場合、音声帯域に該当する中域音楽信号のモノラル成分が除去された状態となるため、会話の妨げとなり得る中域成分のモノラル成分のみが低減され、円滑な会話を行うことが可能となる。さらに、ステレオ成分は除去されることがないため、音楽の音質（臨場感や迫力など）を損なうことがなくなる。 Similarly to the first synthesizing unit 7, the second synthesizing unit 8 also includes the low-frequency music signal R and the high-frequency music signal R that are band-divided by the second band dividing unit 5, and the midrange output by the volume control unit 6 The music signal R ′ is combined and output to the second power amplifier 10. Also in the second synthesizing unit 8, when the audio signal is acquired by the microphone M, the monaural component of the midrange music signal corresponding to the audio band is removed, so that the monaural component of the midband component that can hinder the conversation Only the components are reduced, and a smooth conversation can be performed. Furthermore, since the stereo component is not removed, the sound quality of the music (such as presence and power) is not impaired.

第１パワーアンプ９では、第１合成部７により出力された音楽信号を増幅した後にスピーカ１１ａより出力させ、第２パワーアンプ１０では、第２合成部８により出力された音楽信号を増幅した後にスピーカ１１ｂより出力させる。 The first power amplifier 9 amplifies the music signal output from the first synthesis unit 7 and then outputs it from the speaker 11a. The second power amplifier 10 amplifies the music signal output from the second synthesis unit 8 after amplification. Output from the speaker 11b.

次に、上述した自動音量制御装置１を用いて、音楽信号のモノラル成分が低減された場合と低減されていない場合とのそれぞれの音楽信号の周波数特性を、図９〜図１２を用いて説明する。 Next, the frequency characteristics of the music signal when the monaural component of the music signal is reduced and when it is not reduced will be described with reference to FIGS. 9 to 12 using the automatic volume control device 1 described above. To do.

図９は、マイクＭにより取得される音声信号の周波数特性を一例として示している。このような音声信号がマイクＭで取得される場合において、図１０および図１１に示すようなホワイト雑音が、音楽信号の一例として音源から出力される場合について説明する。なお、図１０に示すホワイト雑音は、Ｌチャンネル用のホワイト雑音とＲチャンネル用のホワイト雑音とが同一のモノラル信号である場合を示し、図１１に示すホワイト雑音は、Ｌチャンネル用のホワイト雑音とＲチャンネル用のホワイト雑音とがそれぞれ無相関のステレオ信号である場合を示している。 FIG. 9 shows an example of frequency characteristics of an audio signal acquired by the microphone M. In the case where such an audio signal is acquired by the microphone M, a case where white noise as shown in FIGS. 10 and 11 is output from a sound source as an example of a music signal will be described. The white noise shown in FIG. 10 shows a case where the white noise for the L channel and the white noise for the R channel are the same monaural signal, and the white noise shown in FIG. 11 is the same as the white noise for the L channel. A case is shown in which the white noise for the R channel is an uncorrelated stereo signal.

図１０に示されるように音量制御が行われていない状態では、ホワイト雑音に特有な周波数特性（周波数にかかわらず信号レベルが一定）が示されているが、音量制御が行われた状態では、中帯域の信号レベルが顕著に低減されている様子が示されている。図１０に示したホワイト雑音の場合には、Ｌチャンネル用のホワイト雑音とＲチャンネル用のホワイト雑音とが同一のモノラル信号であり、中帯域におけるモノラル成分が多いため、信号レベルの低減が顕著に表れることになる。この中帯域におけるモノラル成分の信号レベルの低減により、円滑な会話を実現することが可能となる。 In the state where the volume control is not performed as shown in FIG. 10, the frequency characteristic peculiar to white noise (the signal level is constant regardless of the frequency) is shown, but in the state where the volume control is performed, It is shown that the signal level in the middle band is significantly reduced. In the case of the white noise shown in FIG. 10, since the white noise for the L channel and the white noise for the R channel are the same monaural signal and there are many monaural components in the middle band, the signal level is significantly reduced. Will appear. By reducing the signal level of the monaural component in the middle band, smooth conversation can be realized.

一方で、図１１の場合、音量制御が行われていない状態では、ホワイト雑音に特有な周波数特性（周波数にかかわらず信号レベルが一定）が示されているが、音量制御が行われた状態であっても、ホワイト雑音に特有な周波数特性（周波数にかかわらず信号レベルが一定）が示され、制御の有無によって信号レベルの変化を判断することが困難となっている。図１１に示したホワイト雑音の場合には、Ｌチャンネル用のホワイト雑音とＲチャンネル用のホワイト雑音とがそれぞれ無相関のステレオ信号であるため、中帯域におけるモノラル成分が存在せず、信号レベルの低減が行われないためである。このようにモノラル成分以外のステレオ成分に関しては中帯域における信号レベルの低減が行われないので、音楽の音質（臨場感や迫力など）などを維持することが可能となる。 On the other hand, in the case of FIG. 11, in a state where the volume control is not performed, a frequency characteristic peculiar to white noise (a signal level is constant regardless of the frequency) is shown, but in a state where the volume control is performed. Even in such a case, the frequency characteristic peculiar to white noise (the signal level is constant regardless of the frequency) is shown, and it is difficult to determine a change in the signal level depending on the presence or absence of control. In the case of the white noise shown in FIG. 11, since the white noise for the L channel and the white noise for the R channel are uncorrelated stereo signals, there is no monaural component in the middle band, and the signal level This is because there is no reduction. As described above, since the signal level in the middle band is not reduced for stereo components other than the monaural component, it is possible to maintain the sound quality of the music (such as presence and power).

図１２は、ボーカル音を含んだ音楽信号を用いた場合における周波数特性を示しており、図１２における音楽信号では、Ｌチャンネル用とＲチャンネル用との音楽信号がステレオ成分（モノラル成分も含む）を備えているものとする。図１２に示すように、音量制御を行った場合と、音量制御を行わない場合とのそれぞれの音楽信号の周波数特性を比較すると、ボーカル帯域（音声帯域）に該当する中帯域の信号レベルだけが大きく減衰し、高帯域および低帯域の信号レベルは、音量制御の有無にかかわらずほとんど同じような信号レベルを維持した状態となっている。 FIG. 12 shows frequency characteristics when a music signal including a vocal sound is used. In the music signal in FIG. 12, the music signals for the L channel and the R channel are stereo components (including monaural components). It shall be equipped with. As shown in FIG. 12, when comparing the frequency characteristics of music signals when the volume control is performed and when the volume control is not performed, only the signal level of the middle band corresponding to the vocal band (voice band) is obtained. The signal levels of the high band and the low band are largely attenuated, and are almost in the same state regardless of whether or not the volume control is performed.

図１２に示す場合には、ボーカル帯域に該当する中帯域のモノラル成分のみ低減されるため、中帯域の信号レベルが低減されるが、高帯域および低帯域に関してはモノラル成分の低減処理が行われないため信号レベルに変化が生じないことになる。一方で、音楽信号のステレオ成分に関しては、音量制御の有無にかかわらず信号レベルの低減が生じないため、ほぼ全ての周波数において一様に出力が確保されている。 In the case shown in FIG. 12, only the middle band monaural component corresponding to the vocal band is reduced, so that the signal level of the middle band is reduced. However, the monaural component reduction processing is performed for the high band and the low band. Therefore, the signal level does not change. On the other hand, regarding the stereo component of the music signal, since the signal level does not decrease regardless of whether or not the volume control is performed, the output is ensured uniformly at almost all frequencies.

このように、ＬチャンネルとＲチャンネルとの間が無相関な信号となるステレオ信号においては、第１合成部７および第２合成部８において音楽信号の合成処理を行っても干渉が発生していない。一方で、中帯域（ボーカル帯域）におけるモノラル成分に関しては、図１２に示すように、約１０ｄＢ程度の信号レベルの減衰が生じており、会話の妨げとなり得る音声帯域（ボーカル帯域、中帯域）の信号レベル（音量）だけを効果的に低減させる（制御する）ことが可能となる。なお、中帯域における信号レベルの減衰量は、音楽のボーカルのレベルに応じて可変し、ボーカルのレベルが大きくなるほど減衰が相対的に大きくなる。 As described above, in the stereo signal that is an uncorrelated signal between the L channel and the R channel, interference occurs even if the first synthesis unit 7 and the second synthesis unit 8 perform the music signal synthesis process. Absent. On the other hand, with respect to the monaural component in the middle band (vocal band), as shown in FIG. 12, the signal level is attenuated by about 10 dB, and the voice band (vocal band, middle band) that can hinder the conversation. Only the signal level (volume) can be effectively reduced (controlled). Note that the amount of attenuation of the signal level in the middle band varies according to the vocal level of music, and the attenuation increases relatively as the vocal level increases.

以上、説明したように、本実施の形態に係る自動音量制御装置１によれば、Ｌチャンネル用の音楽信号Ｌを、低域音楽信号Ｌ、中域音楽信号Ｌ、高域音楽信号Ｌに帯域分割し、同様に、Ｒチャンネル用の音楽信号Ｒを、低域音楽信号Ｒ、中域音楽信号Ｒ、高域音楽信号Ｒに帯域分割した後に、Ｒチャンネル用の中域音楽信号Ｒの位相を反転させてＬチャンネル用の中域音楽信号Ｌと合成させることにより、スピーカ１１ａ、１１ｂより出力させる音楽信号の音声帯域（中帯域）におけるモノラル成分の出力（信号レベル）を低減させることができる。 As described above, according to the automatic volume control device 1 according to the present embodiment, the music signal L for the L channel is banded into the low frequency music signal L, the mid frequency music signal L, and the high frequency music signal L. Similarly, after dividing the music signal R for the R channel into the low-frequency music signal R, the mid-range music signal R, and the high-frequency music signal R, the phase of the mid-range music signal R for the R channel is divided. By inverting and synthesizing with the mid-range music signal L for the L channel, it is possible to reduce the output (signal level) of the monaural component in the audio band (mid-band) of the music signal output from the speakers 11a and 11b.

このため、スピーカ１１ａ，１１ｂより出力される音楽信号の全体的なステレオ感を維持した状態で、会話の妨げとなり得る音声帯域のモノラル成分の信号レベルだけを効果的に低減させることができ、音楽が流れる状況であっても円滑な会話環境を実現することが可能となる。また、会話中であっても音楽信号における全体的なステレオ感（特に低域音楽信号と高域音楽信号のステレオ成分）は維持することができるので、円滑な会話を実現させつつ、音楽の音質（臨場感や迫力など）の低下を抑制することが可能となる。 For this reason, it is possible to effectively reduce only the signal level of the monaural component in the voice band that can hinder conversation while maintaining the overall stereo feeling of the music signals output from the speakers 11a and 11b. It is possible to realize a smooth conversation environment even in a situation where the current flows. In addition, the overall stereo feeling of the music signal (especially the stereo components of the low-frequency music signal and high-frequency music signal) can be maintained even during a conversation, so the sound quality of the music can be achieved while realizing a smooth conversation. It is possible to suppress a decrease in (existence, force, etc.).

特に、本実施の形態に係る自動音量制御装置１では、第１ゲイン部６３および第２ゲイン部６４においてモノラル成分の低減が成された中域音楽信号に対してチャンネル毎に重み付け処理が行われているため、中域音楽信号においてモノラル成分の低減が行われた場合であっても残されたステレオ成分が強調され、音声帯域（ボーカル帯域）を含む全体的なステレオ感を高く維持することが可能となる。 In particular, in the automatic volume control device 1 according to the present embodiment, a weighting process is performed for each channel on the mid-range music signal whose monaural component has been reduced in the first gain unit 63 and the second gain unit 64. Therefore, even if the monaural component is reduced in the mid-range music signal, the remaining stereo component is emphasized, and the overall stereo feeling including the voice band (vocal band) can be maintained high. It becomes possible.

さらに、第１ゲイン部６３および第２ゲイン部６４において中域音楽信号に対する重み付け処理が行われるため、Ｌチャンネル用の中域音楽信号とＲチャンネル用の中域音楽信号とが無相関な信号となり、第１合成部７および第２合成部８において音楽信号の合成を行う場合において、ステレオ成分の音楽信号が干渉してしまうことを防止することが可能となる。 Further, since the first gain unit 63 and the second gain unit 64 perform weighting processing on the mid-range music signal, the L-channel mid-range music signal and the R-channel mid-range music signal become uncorrelated signals. In the case where the music signals are synthesized in the first synthesis unit 7 and the second synthesis unit 8, it is possible to prevent the stereo component music signals from interfering with each other.

さらに、上述した音声帯域におけるモノラル成分の出力低減は、マイクＭにより音声信号が検出されたと音声検出部３により判断された時に、音量制御部６において制御されるため、発話者による会話が行われた時だけ、自動的に音楽信号レベルを低減させることができる。また、このように音声検出に応じて音楽信号の音量制御が行われるため、会話が行われていない状態（会話が途切れた状態など）では、音声帯域におけるモノラル成分の音量低減制御は行われないため、会話が行われていない状態では（音声信号の検出がない場合には）、音楽信号の信号レベルの低減処理（音量制御）が全く行われなくなり、良質な音質の音楽を楽しむことが可能となる。 Furthermore, since the output reduction of the monaural component in the voice band described above is controlled by the volume control unit 6 when the voice detection unit 3 determines that a voice signal has been detected by the microphone M, a conversation by the speaker is performed. The music signal level can be automatically reduced only when In addition, since the volume control of the music signal is performed according to the voice detection in this way, the volume reduction control of the monaural component in the voice band is not performed in a state where the conversation is not performed (a state where the conversation is interrupted). Therefore, when there is no conversation (when no audio signal is detected), the signal level reduction processing (volume control) of the music signal is not performed at all, and it is possible to enjoy high-quality music. It becomes.

さらに、第１帯域分割部４および第２帯域分割部５において帯域分割を行った中域音楽信号には、位相反転部５５においてあらかじめ位相反転が行われているので、第１合成部７および第２合成部８において、高域音楽信号および低域音楽信号と共に音声信号の合成を行う場合において、各信号間で干渉が生じてしまうことを低減させることが可能となる。 Further, since the phase inversion is performed in advance in the phase inversion unit 55 for the mid-range music signal that has been subjected to the band division in the first band division unit 4 and the second band division unit 5, When the two synthesis unit 8 synthesizes the audio signal together with the high frequency music signal and the low frequency music signal, it is possible to reduce the occurrence of interference between the signals.

また、オーディオキャンセラ部１５の第１適応フィルタ部２５および第２適応フィルタ部２６においてＬＭＳ適応アルゴリズムを適用することにより、音声信号における音声信号成分以外の信号成分（ノイズ成分）が効果的に低減され、さらに、ノイズキャンセラ部１６においてノイズ信号の除去が行われているので、音声検出部３における音声信号の検出精度の向上を図ることが可能となる。 Further, by applying the LMS adaptive algorithm in the first adaptive filter unit 25 and the second adaptive filter unit 26 of the audio canceller unit 15, signal components (noise components) other than the audio signal component in the audio signal are effectively reduced. Furthermore, since the noise signal is removed by the noise canceller 16, it is possible to improve the detection accuracy of the audio signal in the audio detector 3.

以上、本発明に係る自動音量制御装置について、図面を用いて詳細に説明した、本発明に係る自動音量制御装置は、上述した実施の形態に限定されるものではない。当業者であれば、特許請求の範囲に記載された範疇内において、各種の変更例または修正例に想到し得ることは明らかであり、それらについても当然に本発明の技術的範囲に属するものと了解される。 The automatic volume control device according to the present invention described above in detail with reference to the drawings for the automatic volume control device according to the present invention is not limited to the above-described embodiment. It will be apparent to those skilled in the art that various changes and modifications can be made within the scope of the claims, and these are naturally within the technical scope of the present invention. Understood.

例えば、本実施の形態では、自動音量制御装置１を車両に設置する場合を一例として示したが、本発明に係る自動音量制御装置は、車両に設置されるものには限定されず、音楽などが流れる空間において、音楽などにより会話が妨げられるおそれがある環境であれば、車室内には限定されず、異なる環境・空間に設置することも可能である。 For example, in the present embodiment, the case where the automatic volume control device 1 is installed in a vehicle is shown as an example. However, the automatic volume control device according to the present invention is not limited to that installed in a vehicle, such as music. As long as it is an environment in which a conversation may be hindered by music or the like in a space where music flows, the present invention is not limited to a vehicle interior, and can be installed in a different environment / space.

また、本実施の形態では、Ｌチャンネル用の音楽信号Ｌを、低域音楽信号Ｌ、中域音楽信号Ｌ、高域音楽信号Ｌの３帯域に帯域分割し、同様に、Ｒチャンネル用の音楽信号Ｒを、低域音楽信号Ｒ、中域音楽信号Ｒ、高域音楽信号Ｒの３帯域に帯域分割した場合について説明を行ったが、本発明に係る自動音量制御装置は、３帯域に分割されるものには限定されない。本発明に係る自動音量制御装置では、音声帯域を含む音声帯域信号と、音声帯域以外の帯域を含む非音声帯域信号とに少なくとも帯域分割することができればよく、非音声帯域信号に該当する信号は、高域信号および低域信号のみに限定されず、さらに細かく帯域分割されるものであってもよい。 In the present embodiment, the music signal L for the L channel is divided into three bands, ie, the low frequency music signal L, the mid frequency music signal L, and the high frequency music signal L, and similarly the music for the R channel. Although the case where the signal R is band-divided into three bands of the low-frequency music signal R, the mid-range music signal R, and the high-frequency music signal R has been described, the automatic volume control device according to the present invention is divided into three bands. It is not limited to what is done. In the automatic sound volume control device according to the present invention, it is only necessary to be able to at least divide a band into a voice band signal including a voice band and a non-voice band signal including a band other than the voice band. However, the present invention is not limited to the high frequency signal and the low frequency signal, and may be further divided into bands.

１ …自動音量制御装置
２ …音声強調処理部
３ …音声検出部（音声検出手段）
４ …第１帯域分割部（帯域分割手段）
５ …第２帯域分割部（帯域分割手段）
６ …音量制御部（位相反転手段、音声帯域信号合成手段、音声帯域信号選択手段、ゲイン付加手段）
７ …第１合成部（音楽信号合成手段）
８ …第２合成部（音楽信号合成手段）
９ …第１パワーアンプ
１０ …第２パワーアンプ
１１ａ、１１ｂ …スピーカ
１５ …（音声強調処理部の）オーディオキャンセラ部
１６ …（音声強調処理部の）ノイズキャンセラ部
２１ …（オーディオキャンセラ部の）第１バンドパスフィルタ部
２２ …（オーディオキャンセラ部の）第２バンドパスフィルタ部
２３ …（オーディオキャンセラ部の）第１遅延部
２４ …（オーディオキャンセラ部の）第２遅延部
２５ …（オーディオキャンセラ部の）第１適応フィルタ部
２６ …（オーディオキャンセラ部の）第２適応フィルタ部
２８ …（第１適応フィルタ部の）第１ＦＩＲ部
２９ …（第１適応フィルタ部の）第１ＬＭＳ部
３０ …（第１適応フィルタ部の）第１加算部
３１ …（第２適応フィルタ部の）第２ＦＩＲ部
３２ …（第２適応フィルタ部の）第２ＬＭＳ部
３３ …（第２適応フィルタ部の）第２加算部
４１ …（音声検出部の）実効値検出部
４２ …（音声検出部の）移動平均部
４３ …（音声検出部の）音声検出スレッショルド部
４４ …（音声検出部の）音声信号ホールド部
５１ …（第１帯域分割部の）第１ローパスフィルタ部
５２ …（第１帯域分割部の）第２ローパスフィルタ部
５３ …（第１帯域分割部の）第１ハイパスフィルタ部
５４ …（第１帯域分割部の）第２ハイパスフィルタ部
５５ …（第１帯域分割部の）位相反転部
６１ …（音量制御部の）位相反転部（位相反転手段）
６２ …（音量制御部の）合成部（音声帯域信号合成手段）
６３ …（音量制御部の）第１ゲイン部（ゲイン付加手段）
６４ …（音量制御部の）第２ゲイン部（ゲイン付加手段）
６５ …（音量制御部の）選択部（音声帯域信号選択手段）
Ｍ …マイク DESCRIPTION OF SYMBOLS 1 ... Automatic volume control apparatus 2 ... Voice emphasis processing part 3 ... Voice detection part (voice detection means)
4 ... 1st band division part (band division means)
5: Second band dividing unit (band dividing means)
6 ... Volume control section (phase inversion means, voice band signal synthesis means, voice band signal selection means, gain addition means)
7 ... 1st synthetic | combination part (music signal synthetic | combination means)
8 ... 2nd synthetic | combination part (music signal synthetic | combination means)
9 ... 1st power amplifier 10 ... 2nd power amplifier 11a, 11b ... Speaker 15 ... Audio canceller part 16 (of speech enhancement processing part) ... Noise canceller part 21 (of speech enhancement processing part) ... 1st (of audio canceller part) Bandpass filter unit 22 ... second bandpass filter unit 23 (of the audio canceller unit) ... first delay unit 24 (of the audio canceller unit) ... second delay unit 25 (of the audio canceller unit) ... (of the audio canceller unit) First adaptive filter section 26 ... second adaptive filter section 28 (of the audio canceller section) ... first FIR section 29 (of the first adaptive filter section) ... first LMS section 30 (of the first adaptive filter section) (first adaptation) First addition unit 31 (of the filter unit) ... Second FIR unit 32 (of the second adaptive filter unit) ... (Second adaptive filter) Second LMS unit 33 (second adaptive filter unit) second addition unit 41 (speech detection unit) effective value detection unit 42 (speech detection unit) moving average unit 43 (sound detection unit) ) Audio detection threshold unit 44 ... Audio signal hold unit 51 (of the audio detection unit) ... First low-pass filter unit 52 (of the first band dividing unit) ... Second low-pass filter unit 53 (of the first band dividing unit) ( First high-pass filter unit 54 (of the first band division unit) ... Second high-pass filter unit 55 (of the first band division unit) ... Phase inversion unit 61 (of the first band division unit) ... Phase inversion (of the volume control unit) Part (phase inversion means)
62 ... Synthesizer (of sound volume controller) (voice band signal synthesizer)
63... First gain section (gain adding means) (of volume control section)
64 ... 2nd gain part (of gain control part) (gain addition means)
65 ... Selection unit (of sound volume control unit) (voice band signal selection means)
M ... Mike

Claims

Voice detection means for detecting a voice signal from an acoustic signal acquired by a microphone;
Each of the L channel music signal and the R channel music signal output by the sound source includes a first voice band signal having a frequency band corresponding to a voice band and a non-frequency band having a frequency band not corresponding to the voice band. Band dividing means for dividing a band into audio band signals;
Phase inversion means for performing phase inversion of only one audio band signal of the first audio band signal for the L channel and the first audio band signal for the R channel;
Either the first audio band signal for the L channel or the first audio band signal for the R channel that has been phase-inverted by the phase inversion means, and the phase inversion has not been inverted by the phase inversion means. Voice band signal synthesis means for generating a second voice band signal for the L channel and a second voice band signal for the R channel by synthesizing the other voice band signal;
Phase correction means for performing phase inversion correction on the other of the second audio band signal for L channel and the second audio band signal for R channel generated by the audio band signal synthesis means;
If the detection of the audio signal has been performed by the voice detection means, the first for the second voice band signal and R channel for L channel is one of the phase inversion correction has been performed by the phase correcting means When two audio band signals are output as audio band signals for each channel and the audio signal is not detected by the audio detecting means, the second channel for the L channel divided by the band dividing means is used. Audio band signal selection means for outputting one audio band signal and the first audio band signal for the R channel as they are as an audio band signal for each channel;
The voice band signal for L channel and the voice band signal for R channel output by the voice band signal selecting means, the non-voice band signal for L channel and the non-voice for R channel divided by the band dividing means. And a music signal synthesizing unit for synthesizing the audio band signal for each channel to generate a full-band music signal for the L channel and a full-band music signal for the R channel. Automatic volume control device.

The phase correction means has a gain corresponding to each channel for the second audio band signal for L channel and the second audio band signal for R channel synthesized by the audio band signal synthesis means. The automatic volume control device according to claim 1, further comprising a function as a gain addition unit that performs weighting of the sound.

The band dividing unit is configured to convert each of the music signal for L channel and the music signal for R channel output from the sound source into the first audio band signal having a frequency band corresponding to an audio band, The frequency band is divided into a low frequency signal consisting of a frequency band lower than the audio band and a high frequency signal consisting of a frequency band higher than the audio band, and only the first audio band signal is phase-inverted,
The music signal synthesizing unit is configured to output the audio band signal for the L channel and the audio band signal for the R channel output from the audio band signal selecting unit, the low frequency signal for the L channel, and the audio signal for the R channel. By combining the low frequency signal, the high frequency signal for the L channel and the high frequency signal for the R channel for each channel, the music signal for the entire band for the L channel and the total for the R channel The automatic sound volume control device according to claim 1 or 2, wherein a band music signal is generated.