JP7020257B2

JP7020257B2 - Audio equipment, sound effect factor calculation method, and program

Info

Publication number: JP7020257B2
Application number: JP2018073642A
Authority: JP
Inventors: 毅彦川▲原▼
Original assignee: Yamaha Corp
Current assignee: Yamaha Corp
Priority date: 2018-04-06
Filing date: 2018-04-06
Publication date: 2022-02-16
Anticipated expiration: 2038-04-06
Also published as: JP2019186687A

Description

本発明は、雑音の存在する環境での音の再生を支援する技術に関する。 The present invention relates to a technique for supporting sound reproduction in a noisy environment.

走行中の車両の車室のように雑音の存在する環境で楽音などの音を再生する場合に再生音が雑音に埋もれないように音の再生を支援する技術が種々提案されており、その一例としては特許文献１に開示の技術が挙げられる。 Various technologies have been proposed to support sound reproduction so that the reproduced sound is not buried in the noise when reproducing a sound such as a musical sound in a noisy environment such as the passenger compartment of a moving vehicle. Examples thereof include the technique disclosed in Patent Document 1.

特許文献１に開示の技術では、スピーカから再生音が放音される環境にマイクロフォンを設置し、当該マイクロフォンに再生音とその環境において聴取される雑音とを収音させる。そして、特許文献１に開示の技術では、雑音の振幅スペクトルの平均値と再生音の振幅スペクトルの平均値とが周波数帯域毎に算出され、雑音の振幅スペクトルに再生音の振幅スペクトルが埋もれないように、再生音の音量が周波数帯域毎に補正される。 In the technique disclosed in Patent Document 1, a microphone is installed in an environment where the reproduced sound is emitted from the speaker, and the reproduced sound and the noise heard in the environment are collected by the microphone. Then, in the technique disclosed in Patent Document 1, the average value of the amplitude spectrum of the noise and the average value of the amplitude spectrum of the reproduced sound are calculated for each frequency band so that the amplitude spectrum of the reproduced sound is not buried in the amplitude spectrum of the noise. In addition, the volume of the reproduced sound is corrected for each frequency band.

特開平１１－２９８９９０号公報Japanese Unexamined Patent Publication No. 11-298990

しかし、従来の技術では、ある周波数帯域において再生音の振幅スペクトルに急峻なピークが現れている場合、再生音の振幅スペクトルを平均することで当該周波数帯域の音量は小と判定され、当該周波数帯域全体に亘って再生音の音量が引き上げられる。したがって、ピークに対応する音を聞き分けることができれば十分であるにも拘らず、当該周波数帯域における再生音の音量が過大に引き上げられる、といった問題がある。 However, in the conventional technique, when a steep peak appears in the amplitude spectrum of the reproduced sound in a certain frequency band, the volume of the frequency band is determined to be low by averaging the amplitude spectrum of the reproduced sound, and the frequency band is determined. The volume of the reproduced sound is raised throughout. Therefore, although it is sufficient to be able to distinguish the sound corresponding to the peak, there is a problem that the volume of the reproduced sound in the frequency band is excessively raised.

本発明に係るオーディオ装置の一態様は、スピーカから環境に放射される再生音を表す第１音信号を時間軸方向に複数に分割して得られる各信号に短時間周波数解析を施して一定時間における振幅スペクトルのピークを周波数毎に特定し、特定した振幅スペクトルのピークに基づいて前記再生音の音量を周波数帯域毎に推定する第１推定部と、前記環境において聴取される雑音を表す第２音信号を時間軸方向に複数に分割して得られる各信号に短時間周波数解析を施して前記一定時間における振幅スペクトルのピークを周波数毎に特定し、特定した振幅スペクトルのピークに基づいて前記雑音の音量を周波数帯域毎に推定する第２推定部と、前記第１推定部により推定された音量と前記第２推定部により推定された音量とを周波数帯域毎に比較し、比較結果に基づいて前記第１音信号に付与する音響効果係数を算出する算出部と、を備える。 In one aspect of the audio device according to the present invention, a short-time frequency analysis is performed on each signal obtained by dividing a first sound signal representing a reproduced sound radiated from a speaker to the environment into a plurality of signals in the time axis direction for a certain period of time. The first estimation unit that specifies the peak of the amplitude spectrum in each frequency and estimates the volume of the reproduced sound for each frequency band based on the peak of the specified amplitude spectrum, and the second that represents the noise heard in the environment. A short-time frequency analysis is performed on each signal obtained by dividing the sound signal into a plurality of signals in the time axis direction, the peak of the amplitude spectrum in the fixed time is specified for each frequency, and the noise is based on the peak of the specified amplitude spectrum. The second estimation unit that estimates the volume of each frequency band, the volume estimated by the first estimation unit, and the volume estimated by the second estimation unit are compared for each frequency band, and based on the comparison result. A calculation unit for calculating an acoustic effect coefficient applied to the first sound signal is provided.

本発明に係る音響効果係数算出方法の一態様は、スピーカから環境に放射される再生音を表す第１音信号を時間軸方向に複数に分割して得られる各信号に短時間周波数解析を施して一定時間における振幅スペクトルのピークを周波数毎に特定し、特定した振幅スペクトルのピークに基づいて前記再生音の音量を周波数帯域毎に推定し、前記環境において聴取される雑音を表す第２音信号を時間軸方向に複数に分割して得られる各信号に短時間周波数解析を施して前記一定時間における振幅スペクトルのピークを周波数毎に特定し、特定した振幅スペクトルのピークに基づいて前記雑音の音量を周波数帯域毎に推定し、前記再生音について推定された音量と前記雑音について推定された音量とを周波数帯域毎に比較し、比較結果に基づいて前記第１音信号に付与する音響効果係数を算出する。 One aspect of the acoustic effect coefficient calculation method according to the present invention is to perform short-time frequency analysis on each signal obtained by dividing the first sound signal representing the reproduced sound radiated from the speaker to the environment into a plurality of signals in the time axis direction. The peak of the amplitude spectrum in a certain time is specified for each frequency, the volume of the reproduced sound is estimated for each frequency band based on the peak of the specified amplitude spectrum, and the second sound signal representing the noise heard in the environment is expressed. The peak of the amplitude spectrum in the fixed time is specified for each frequency by performing a short-time frequency analysis on each signal obtained by dividing the signal into a plurality of signals in the time axis direction, and the volume of the noise is based on the peak of the specified amplitude spectrum. Is estimated for each frequency band, the volume estimated for the reproduced sound and the volume estimated for the noise are compared for each frequency band, and the acoustic effect coefficient given to the first sound signal is obtained based on the comparison result. calculate.

本発明に係るプログラムは、プロセッサを、スピーカから環境に放射される再生音を表す第１音信号を時間軸方向に複数に分割して得られる各信号に短時間周波数解析を施して一定時間における振幅スペクトルのピークを周波数毎に特定し、特定した振幅スペクトルのピークに基づいて前記再生音の音量を周波数帯域毎に推定する第１推定部と、前記環境において聴取される雑音を表す第２音信号を時間軸方向に複数に分割して得られる各信号に短時間周波数解析を施して前記一定時間における振幅スペクトルのピークを周波数毎に特定し、特定した振幅スペクトルのピークに基づいて前記雑音の音量を周波数帯域毎に推定する第２推定部と、前記第１推定部により推定された音量と前記第２推定部により推定された音量とを周波数帯域毎に比較し、比較結果に基づいて前記第１音信号に付与する音響効果係数を算出する算出部と、して機能させる。 In the program according to the present invention, the processor performs a short-time frequency analysis on each signal obtained by dividing the first sound signal representing the reproduced sound radiated from the speaker to the environment into a plurality of signals in the time axis direction, and at a fixed time. The first estimation unit that specifies the peak of the amplitude spectrum for each frequency and estimates the volume of the reproduced sound for each frequency band based on the peak of the specified amplitude spectrum, and the second sound that represents the noise heard in the environment. A short-time frequency analysis is performed on each signal obtained by dividing the signal into a plurality of signals in the time axis direction to identify the peak of the amplitude spectrum for each frequency in the fixed time, and the noise of the noise is based on the peak of the specified amplitude spectrum. The second estimation unit that estimates the volume for each frequency band, the volume estimated by the first estimation unit, and the volume estimated by the second estimation unit are compared for each frequency band, and the above is based on the comparison result. It functions as a calculation unit that calculates the acoustic effect coefficient given to the first sound signal.

本発明の一実施形態のオーディオ装置１２０を含むオーディオシステムの構成例を示す図である。It is a figure which shows the structural example of the audio system including the audio apparatus 120 of one Embodiment of this invention. オーディオ装置１２０の構成例を示す図である。It is a figure which shows the configuration example of an audio apparatus 120. 補正テーブル３０１の格納内容の一例を示す図である。It is a figure which shows an example of the storage contents of the correction table 301. オーディオ装置１２０の演算部２２０が再生支援プログラム３０３にしたがって実行する再生支援処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the reproduction support processing which the arithmetic unit 220 of an audio apparatus 120 executes according to the reproduction support program 303. 再生音推定部２１２が実行する再生音推定処理の流れを示す図である。It is a figure which shows the flow of the reproduction sound estimation processing executed by the reproduction sound estimation unit 212. 再生音推定部２１２が再生音推定処理のステップＳＢ１００にて生成する振幅スペクトルの一例を示す図である。It is a figure which shows an example of the amplitude spectrum generated by the reproduction sound estimation unit 212 in the reproduction sound estimation process step SB100. 再生音推定部２１２が再生音推定処理のステップＳＢ１１０にて生成するデータＤＡ１の一例を示す図である。It is a figure which shows an example of the data DA1 generated by the reproduction sound estimation unit 212 in the reproduction sound estimation process step SB110. 再生音推定部２１２が再生音推定処理のステップＳＢ１２０にて生成するデータＤＡ２の一例を示す図である。It is a figure which shows an example of the data DA2 generated by the reproduction sound estimation unit 212 in the reproduction sound estimation process step SB120. 雑音推定部２１３が実行する雑音推定処理の流れを示す図である。It is a figure which shows the flow of the noise estimation processing executed by a noise estimation unit 213. 雑音推定部２１３が雑音推定処理のステップＳＢ１００にて生成する振幅スペクトルの一例を示す図である。It is a figure which shows an example of the amplitude spectrum generated by the noise estimation unit 213 in the step SB100 of a noise estimation process. 雑音推定部２１３が雑音推定処理のステップＳＢ１１０にて生成するデータＤＢ１の一例を示す図である。It is a figure which shows an example of the data DB 1 generated by the noise estimation unit 213 in the noise estimation process step SB110. 雑音推定部２１３が雑音推定処理のステップＳＣ１２０にて生成するデータＤＢ２の一例を示す図である。It is a figure which shows an example of the data DB 2 generated by the noise estimation unit 213 in the noise estimation process step SC120. 音響効果係数算出部２１４が実行する音響効果係数算出処理の流れを示す図である。It is a figure which shows the flow of the sound effect coefficient calculation process executed by the sound effect coefficient calculation unit 214. 補正処理ＳＤ１００を経たデータＤＣ１の一例を示す図である。It is a figure which shows an example of the data DC1 which has passed the correction process SD100. 比較処理ＳＤ１１０の処理結果を表すデータＤＣ２の一例を示す図である。It is a figure which shows an example of the data DC2 which shows the processing result of the comparison processing SD110. 再生音推定部２１２が音量推定処理のステップＳＢ１２０にて生成するデータＤＡ２の他の例を示す図である。It is a figure which shows the other example of the data DA2 generated by the reproduction sound estimation part 212 in step SB120 of a volume estimation process. 雑音推定部２１３が音量推定処理のステップＳＣ１２０にて生成するデータＤＢ２の他の例を示す図である。It is a figure which shows the other example of the data DB 2 generated by the noise estimation unit 213 in the volume estimation process step SC120.

以下、図面を参照しながら本発明に係る実施の形態を説明する。なお、図面において各部の寸法および縮尺は実際のものと適宜異なる。また、以下に記載する実施の形態は、本発明の好適な具体例である。このため、本実施形態には、技術的に好ましい種々の限定が付されている。しかしながら、本発明の範囲は、以下の説明において特に本発明を限定する旨の記載がない限り、これらの形態に限られるものではない。 Hereinafter, embodiments according to the present invention will be described with reference to the drawings. In the drawings, the dimensions and scale of each part may differ from the actual ones. Moreover, the embodiment described below is a suitable specific example of the present invention. For this reason, the present embodiment is provided with various technically preferable limitations. However, the scope of the present invention is not limited to these forms unless it is stated in the following description that the present invention is particularly limited.

＜１．実施形態＞
図１は、本発明の一実施形態のオーディオ装置１２０を含むオーディオシステムの構成例を示す図である。このオーディオシステムは、車両に搭載され、当該車両の車室１において楽音等を再生するカーオーディオシステムである。図１に示すように、このオーディオシステムは、オーディオ装置１２０と、スピーカ１３０－１から１３０－４と、マイクロフォン１４０と、を有する。図１に示すように、スピーカ１３０－１から１３０－４とマイクロフォン１４０とは、オーディオケーブルなどの信号線を介してオーディオ装置１２０に接続されている。 <1. Embodiment>
FIG. 1 is a diagram showing a configuration example of an audio system including the audio device 120 according to the embodiment of the present invention. This audio system is a car audio system that is mounted on a vehicle and reproduces musical sounds and the like in the vehicle compartment 1 of the vehicle. As shown in FIG. 1, the audio system includes an audio device 120, speakers 130-1 to 130-4, and a microphone 140. As shown in FIG. 1, the speakers 130-1 to 130-4 and the microphone 140 are connected to the audio device 120 via a signal line such as an audio cable.

図１に示すように、スピーカ１３０－１は、オーディオ装置１２０が搭載される車両の運転席１１１近傍に設けられており、スピーカ１３０－２は上記車両の助手席１１２近傍に設けられている。具体的には、スピーカ１３０－１は運転席側ドアの内側にその放音面を車室１内に向けた姿勢で設けられている。スピーカ１３０－２は助手席側ドアにその放音面を車室１内に向けた姿勢で設けられている。スピーカ１３０－３および１３０－４は、上記車両の後列席１１３を挟むように設けられている。具体的には、スピーカ１３０－３は上記車両の運転席側の後部ドアにその放音面を車室１内に向けた姿勢で設けられている。スピーカ１３０－４は上記車両の助手席側の後部ドアにその放音面を車室１内に向けた姿勢で設けられている。以下では、スピーカ１３０－１から１３０－４の各々を区別する必要がない場合には、「スピーカ１３０」と表記する。スピーカ１３０には、楽音等の再生音の音信号がオーディオ装置１２０から供給される。音信号とは、音の波形を表すアナログ信号のことをいう。スピーカ１３０は、オーディオ装置１２０から供給される音信号に応じた音を車室１内に放音する。 As shown in FIG. 1, the speaker 130-1 is provided in the vicinity of the driver's seat 111 of the vehicle in which the audio device 120 is mounted, and the speaker 130-2 is provided in the vicinity of the passenger seat 112 of the vehicle. Specifically, the speaker 130-1 is provided inside the driver's side door with its sound emitting surface facing the inside of the vehicle interior 1. The speaker 130-2 is provided on the passenger seat side door with its sound emitting surface facing the inside of the passenger compartment 1. The speakers 130-3 and 130-4 are provided so as to sandwich the rear row seat 113 of the vehicle. Specifically, the speaker 130-3 is provided on the rear door on the driver's side of the vehicle in a posture in which the sound emitting surface faces the inside of the vehicle interior 1. The speaker 130-4 is provided on the rear door on the passenger seat side of the vehicle in a posture in which the sound emitting surface faces the inside of the vehicle interior 1. In the following, when it is not necessary to distinguish each of the speakers 130-1 to 130-4, it is referred to as "speaker 130". A sound signal of a reproduced sound such as a musical sound is supplied to the speaker 130 from the audio device 120. A sound signal is an analog signal that represents a sound waveform. The speaker 130 emits a sound corresponding to the sound signal supplied from the audio device 120 into the vehicle interior 1.

マイクロフォン１４０は、例えば、オーディオ装置１２０が搭載される車両のダッシュボードに設けられている。マイクロフォン１４０は、その設置位置において収音した音の波形を表す音信号をオーディオ装置１２０へ出力する。マイクロフォン１４０により収音される音には、スピーカ１３０から車室１内に放音された再生音と当該音以外の雑音とが含まれている。マイクロフォン１４０により収音される雑音の具体例としては、車両の走行に伴って発生するロードノイズが挙げられる。 The microphone 140 is provided, for example, on the dashboard of a vehicle on which the audio device 120 is mounted. The microphone 140 outputs a sound signal representing a waveform of the sound picked up at the installation position to the audio device 120. The sound picked up by the microphone 140 includes a reproduced sound emitted from the speaker 130 into the vehicle interior 1 and noise other than the reproduced sound. Specific examples of the noise picked up by the microphone 140 include road noise generated when the vehicle travels.

図２は、オーディオ装置１２０の構成例を示す図である。図２に示すように、オーディオ装置１２０は、再生装置２１０と、演算部２２０と、記憶部２３０と、Ａ／Ｄ変換器２４０と、Ｄ／Ａ変換器２５０と、アンプ２６０と、を有する。 FIG. 2 is a diagram showing a configuration example of the audio device 120. As shown in FIG. 2, the audio device 120 includes a playback device 210, a calculation unit 220, a storage unit 230, an A / D converter 240, a D / A converter 250, and an amplifier 260.

再生装置２１０は、スピーカ１３０から放射する再生音の音データＤ１を演算部２２０へ出力する。音データとは、音の波形をサンプリングして得られるサンプル列などのデジタルデータのことをいう。再生装置２１０の具体例としては、ＣＤドライブなどの記録媒体読み取り装置或いはカーラジオなどの受信器が挙げられる。再生装置２１０が記録媒体読み取り装置である場合、再生装置２１０は、記録媒体から音データＤ１を読み出し、当該読み出した音データＤ１を演算部２２０へ出力する。 The reproduction device 210 outputs the sound data D1 of the reproduction sound radiated from the speaker 130 to the calculation unit 220. Sound data refers to digital data such as a sample sequence obtained by sampling a sound waveform. Specific examples of the playback device 210 include a recording medium reading device such as a CD drive or a receiver such as a car radio. When the reproduction device 210 is a recording medium reading device, the reproduction device 210 reads the sound data D1 from the recording medium and outputs the read sound data D1 to the calculation unit 220.

Ａ／Ｄ変換器２４０には、信号線を介してマイクロフォン１４０が接続されている。Ａ／Ｄ変換器２４０は、マイクロフォン１４０から与えられる音信号にアナログ／デジタル変換を施し、その変換結果である音データＤ２を演算部２２０に与える。 A microphone 140 is connected to the A / D converter 240 via a signal line. The A / D converter 240 performs analog / digital conversion on the sound signal given from the microphone 140, and gives the sound data D2 which is the conversion result to the calculation unit 220.

演算部２２０は、プロセッサであり、例えば単数または複数のチップで構成される。演算部２２０は、例えば、周辺装置とのインタフェース、演算装置およびレジスタ等を含む中央処理装置（ＣＰＵ：Central Processing Unit）で構成される。なお、演算部２２０の機能の一部または全部を、ＤＳＰ（Digital Signal Processor）、ＡＳＩＣ（Application Specific Integrated Circuit）、ＰＬＤ（Programmable Logic Device）、ＦＰＧＡ（Field Programmable Gate Array）等のハードウェアで実現してもよい。演算部２２０は、各種の処理を並列的または逐次的に実行する。
演算部２２０は、記憶部２３０に記憶されている再生支援プログラム３０３を実行し、オーディオ装置１２０の制御中枢として機能する。再生支援プログラム３０３にしたがって作動している演算部２２０は、再生装置２１０から与えられた音データＤ１の表す再生音の音量を過大に引き上げることなく、その再生音がロードノイズ等の雑音に埋もれないようにする再生支援処理を実行し、その処理結果である音データＤＤをＤ／Ａ変換器２５０へ出力する。この再生支援処理の詳細については後に明らかにする。 The arithmetic unit 220 is a processor, and is composed of, for example, a single or a plurality of chips. The arithmetic unit 220 is composed of, for example, a central processing unit (CPU) including an interface with peripheral devices, an arithmetic unit, registers, and the like. In addition, a part or all of the functions of the arithmetic unit 220 are realized by hardware such as DSP (Digital Signal Processor), ASIC (Application Specific Integrated Circuit), PLD (Programmable Logic Device), FPGA (Field Programmable Gate Array). You may. The arithmetic unit 220 executes various processes in parallel or sequentially.
The arithmetic unit 220 executes the reproduction support program 303 stored in the storage unit 230, and functions as a control center of the audio device 120. The calculation unit 220 operating according to the reproduction support program 303 does not excessively raise the volume of the reproduced sound represented by the sound data D1 given from the reproduced device 210, and the reproduced sound is not buried in noise such as road noise. The reproduction support process is executed, and the sound data DD, which is the result of the process, is output to the D / A converter 250. The details of this reproduction support process will be clarified later.

Ｄ／Ａ変換器２５０は、演算部２２０から与えられる音データＤＤにデジタル／アナログ変換を施して音信号に変換し、この音信号をアンプ２６０へ出力する。アンプ２６０には、スピーカ１３０－１から１３０－４の各々がそれぞれ信号線を介して接続されている。アンプ２６０は、Ｄ／Ａ変換器２５０から与えられた音信号の信号レベルをスピーカ１３０からの放音に適した信号レベルに増幅してスピーカ１３０へ出力する。 The D / A converter 250 performs digital / analog conversion on the sound data DD given from the calculation unit 220 to convert it into a sound signal, and outputs this sound signal to the amplifier 260. Each of the speakers 130-1 to 130-4 is connected to the amplifier 260 via a signal line. The amplifier 260 amplifies the signal level of the sound signal given from the D / A converter 250 to a signal level suitable for sound emission from the speaker 130, and outputs the signal level to the speaker 130.

記憶部２３０は、図２では詳細な図示を省略したが揮発性記憶部と不揮発性記憶部とを含んでいる。揮発性記憶部は、例えばＲＡＭ（Random Access Memory）である。この揮発性記憶部は、再生支援プログラム３０３を実行する際のワークエリアとして演算部２２０によって利用される。また、この揮発性記憶部は、Ａ／Ｄ変換器２４０から与えられた音データＤ２と、再生装置２１０から与えられた音データＤ１とを蓄積するバッファとしても利用される。不揮発性記憶部は、例えばハードディスクである。不揮発性記憶部には、補正テーブル３０１と、音響効果係数テーブル３０２と、再生支援プログラム３０３とが予め記憶されている。 The storage unit 230 includes a volatile storage unit and a non-volatile storage unit, although detailed illustration is omitted in FIG. The volatile storage unit is, for example, a RAM (Random Access Memory). This volatile storage unit is used by the arithmetic unit 220 as a work area when the reproduction support program 303 is executed. The volatile storage unit is also used as a buffer for storing the sound data D2 given by the A / D converter 240 and the sound data D1 given by the reproduction device 210. The non-volatile storage unit is, for example, a hard disk. The correction table 301, the sound effect coefficient table 302, and the reproduction support program 303 are stored in advance in the non-volatile storage unit.

補正テーブル３０１には、例えば２０Ｈｚから１００００Ｈｚの帯域を分割した複数の周波数帯域の各々において雑音の音量と再生音の音量とを比較する際の再生音の音量の補正量を表すデータが予め格納されている。本実施形態では、上記複数の周波数帯域として、各々の中心周波数が６３Ｈｚ，１２５Ｈｚ、２５０Ｈｚ、５００Ｈｚ、１０００Ｈｚ、２０００Ｈｚ、および４０００Ｈｚであり、各々１オクターブに相当するバンド幅を有するオクターブバンドが採用されている。以下では、上記複数の周波数帯域の各々を１から７のバンド番号で区別する。バンド番号＝１の周波数帯域の中心周波数は６３Ｈｚであり、バンド番号＝２の周波数帯域の中心周波数は１２５Ｈｚである。バンド番号＝３の周波数帯域の中心周波数は２５０Ｈｚであり、バンド番号＝４の周波数帯域の中心周波数は５００Ｈｚである。バンド番号＝５の周波数帯域の中心周波数は１０００Ｈｚであり、バンド番号＝６の周波数帯域の中心周波数は２０００Ｈｚであり、バンド番号＝７の周波数帯域の中心周波数は４０００Ｈｚである。 In the correction table 301, for example, data representing the correction amount of the volume of the reproduced sound when comparing the volume of the noise and the volume of the reproduced sound in each of the plurality of frequency bands divided into the bands of 20 Hz to 10000 Hz is stored in advance. ing. In the present embodiment, as the plurality of frequency bands, an octave band having a center frequency of 63 Hz, 125 Hz, 250 Hz, 500 Hz, 1000 Hz, 2000 Hz, and 4000 Hz, each having a bandwidth corresponding to one octave, is adopted. There is. In the following, each of the plurality of frequency bands will be distinguished by a band number from 1 to 7. The center frequency of the frequency band of band number = 1 is 63 Hz, and the center frequency of the frequency band of band number = 2 is 125 Hz. The center frequency of the frequency band of band number = 3 is 250 Hz, and the center frequency of the frequency band of band number = 4 is 500 Hz. The center frequency of the frequency band of band number = 5 is 1000 Hz, the center frequency of the frequency band of band number = 6 is 2000 Hz, and the center frequency of the frequency band of band number = 7 is 4000 Hz.

補正テーブル３０１の格納内容は、オーディオ装置１２０を含むオーディオシステムの特性に応じて定められる。本実施形態では、図３に示すように、バンド番号＝１のバンドについては音量を５ｄＢ引き下げ、バンド番号＝２から７の各バンドについては音量を２から８ｄＢ引き上げることを表すデータを格納した補正テーブル３０１が記憶部２３０に予め格納されている。 The stored contents of the correction table 301 are determined according to the characteristics of the audio system including the audio device 120. In the present embodiment, as shown in FIG. 3, a correction storing data indicating that the volume is lowered by 5 dB for the band with the band number = 1 and the volume is raised by 2 to 8 dB for each band with the band numbers = 2 to 7. The table 301 is stored in the storage unit 230 in advance.

音響効果係数テーブル３０２には、補正テーブル３０１の格納内容にしたがって補正された再生音の音量と雑音の音量との差に対応づけて、再生音が雑音に埋もれないように再生音の音量を補正する音響効果係数（例えば、音量の引き上げ量）を表すデータが格納されている。 In the sound effect coefficient table 302, the volume of the reproduced sound is corrected so that the reproduced sound is not buried in the noise by associating the difference between the volume of the reproduced sound corrected according to the stored contents of the correction table 301 and the volume of the noise. Data representing the sound effect coefficient (for example, the amount of increase in volume) to be performed is stored.

演算部２２０は、オーディオ装置１２０の電源（図２では図示略）の投入を契機として再生支援プログラム３０３を不揮発性記憶部から揮発性記憶部へ読み出し、当該再生支援プログラム３０３の実行を開始する。再生支援プログラム３０３にしたがって作動している演算部２２０は、再生支援処理（図４参照）を実行し、図２に示す音響効果付与部２１１、再生音推定部２１２、雑音推定部２１３、音響効果係数算出部２１４、およびエコーキャンセル部２１５として機能する。つまり、図２における音響効果付与部２１１、再生音推定部２１２、雑音推定部２１３、音響効果係数算出部２１４、およびエコーキャンセル部２１５の各々はソフトウェアにしたがって実現されるソフトウェアモジュールである。 The arithmetic unit 220 reads the reproduction support program 303 from the non-volatile storage unit to the volatile storage unit when the power supply of the audio device 120 (not shown in FIG. 2) is turned on, and starts executing the reproduction support program 303. The calculation unit 220 operating according to the reproduction support program 303 executes the reproduction support process (see FIG. 4), and the acoustic effect imparting unit 211, the reproduced sound estimation unit 212, the noise estimation unit 213, and the acoustic effect shown in FIG. It functions as a coefficient calculation unit 214 and an echo canceling unit 215. That is, each of the sound effect imparting unit 211, the reproduced sound estimation unit 212, the noise estimation unit 213, the sound effect coefficient calculation unit 214, and the echo canceling unit 215 in FIG. 2 is a software module realized according to software.

図２に示すように、音響効果付与部２１１、再生音推定部２１２、雑音推定部２１３、およびエコーキャンセル部２１５の各々には音データＤ１が与えられる。また、エコーキャンセル部２１５には音データＤ２も与えられる。 As shown in FIG. 2, sound data D1 is given to each of the sound effect imparting unit 211, the reproduced sound estimation unit 212, the noise estimation unit 213, and the echo canceling unit 215. Further, sound data D2 is also given to the echo canceling unit 215.

エコーキャンセル部２１５は、図４におけるエコーキャンセル処理ＳＡ１００を実行する。エコーキャンセル部２１５は、スピーカ１３０、当該スピーカ１３０からの再生音が放音される空間（車室１）、マイクロフォン１４０といった音の伝播経路の特性を考慮して音データＤ１´を上記バッファから読み出した音データＤ１に基づいて生成する。音の伝播経路の特性には、たとえば周波数ごとの振幅特性及び反射によるエコーなどが含まれる。具体的には、エコーキャンセル部２１５は、伝播経路の特性を表すＦＩＲ（finite impulse response）フィルタの係数をリアルタイムで推定し、推定した係数が設定されたＦＩＲフィルタに、音データＤ１を入力して音データＤ１´を得る。エコーキャンセル部２１５は、音データＤ２から当該音データＤ１´の表す信号成分を除外して新たな音データＤ３を生成し、雑音推定部２１３に出力する。 The echo canceling unit 215 executes the echo canceling process SA100 in FIG. The echo canceling unit 215 reads the sound data D1'from the above buffer in consideration of the characteristics of the sound propagation path such as the speaker 130, the space where the reproduced sound from the speaker 130 is emitted (vehicle compartment 1), and the microphone 140. It is generated based on the sound data D1. The characteristics of the sound propagation path include, for example, the amplitude characteristics for each frequency and the echo due to reflection. Specifically, the echo canceling unit 215 estimates the coefficient of the FIR (finite impulse response) filter representing the characteristics of the propagation path in real time, and inputs the sound data D1 to the FIR filter in which the estimated coefficient is set. Obtain the sound data D1'. The echo canceling unit 215 excludes the signal component represented by the sound data D1'from the sound data D2, generates new sound data D3, and outputs the new sound data D3 to the noise estimation unit 213.

再生音推定部２１２は、図４における再生音推定処理ＳＡ１１０を実行する。図５は、再生音推定処理の流れを示す図である。図５に示すように、再生音推定部２１２は、まず、一定時間分の音データＤ１をバッファから読み出し、当該音データＤ１を上記一定時間よりも短い時間分の処理単位データに区切って短時間周波数解析を施して周波数領域のデータに変換する（ステップＳＢ１００）。なお、本実施形態において一定時間とは、上記複数の周波数帯域における最も低い周波数の対応する音の複数周期（例えば４周期）に相当する時間であり、上記処理単位データは当該１周期に相当する時間分の音データである。図５では、短時間周波数解析はＳＴＦＴ（Short Time Fourier Transform）と略記されている。また、短時間周波数解析の具体的なアルゴリズムとしては、ＦＦＴ（Fast Fourier Transform）が挙げられる。例えば、上記一定時間分の音データＤ１が４個の処理単位データに分割される場合、各処理単位データに短時間周波数解析を施すことで図６に示す振幅スペクトルＳＰ１－１からＳＰ１－４が得られる。以下では、振幅スペクトルＳＰ１－１からＳＰ１－４の各々を区別する必要がない場合には「振幅スペクトルＳＰ１」と表記する。 The reproduced sound estimation unit 212 executes the reproduced sound estimation process SA110 in FIG. FIG. 5 is a diagram showing the flow of the reproduced sound estimation process. As shown in FIG. 5, the reproduced sound estimation unit 212 first reads the sound data D1 for a fixed time from the buffer, divides the sound data D1 into processing unit data for a time shorter than the fixed time, and shortens the time. Frequency analysis is performed and converted into data in the frequency region (step SB100). In the present embodiment, the fixed time is a time corresponding to a plurality of cycles (for example, 4 cycles) of the corresponding sounds of the lowest frequency in the plurality of frequency bands, and the processing unit data corresponds to the one cycle. Sound data for hours. In FIG. 5, short-time frequency analysis is abbreviated as STFT (Short Time Fourier Transform). Further, as a specific algorithm for short-time frequency analysis, FFT (Fast Fourier Transform) can be mentioned. For example, when the sound data D1 for a certain period of time is divided into four processing unit data, the amplitude spectra SP1-1 to SP1-4 shown in FIG. 6 can be obtained by performing a short-time frequency analysis on each processing unit data. can get. In the following, when it is not necessary to distinguish each of the amplitude spectra SP1-1 to SP1-4, it is referred to as “amplitude spectrum SP1”.

ステップＳＢ１１０では、再生音推定部２１２は、ステップＳＢ１００により得られた周波数領域のデータから、上記一定時間における振幅スペクトルのピーク（最大値）を周波数毎に特定し、上記一定時間における各周波数の振幅スペクトルのピークを表すデータＤＡ１を生成する。例えば、ステップＳＢ１００にて振幅ＳＰ１－１からＳＰ１－４を表すデータが得られた場合、再生音推定部２１２は図７に示すデータＤＡ１を生成する。 In step SB110, the reproduced sound estimation unit 212 identifies the peak (maximum value) of the amplitude spectrum in the fixed time for each frequency from the data in the frequency domain obtained in step SB100, and the amplitude of each frequency in the fixed time. Generate data DA1 representing the peak of the spectrum. For example, when data representing the amplitudes SP1-1 to SP1-4 are obtained in step SB100, the reproduced sound estimation unit 212 generates the data DA1 shown in FIG. 7.

ステップＳＢ１２０では、再生音推定部２１２は、ステップＳＢ１１０にて生成したデータを解析して上記一定時間における再生音の音量を周波数帯域毎に推定し、その推定結果を示すデータＤＡ２（図８参照）を音響効果係数算出部２１４へ出力する。より詳細に説明すると、再生音推定部２１２は、データＤＡ１を参照して周波数帯域毎にいずれの周波数の振幅スペクトルのピークが最大であるかを判定し、最大のピークを当該周波数帯域における再生音の音量とする。例えば、バンド番号＝１の周波数帯域ではその中心周波数の振幅スペクトルのピークが最大であったとする。この場合、再生音推定部２１２は、当該中心周波数（６３Ｈｚ）の振幅スペクトルのピークをバンド番号＝１の周波数帯域における再生音の音量とする。本実施形態において、上記一定時間における再生音の音量を周波数帯域毎にその周波数帯域に属する各周波数の振幅スペクトルの平均値ではなく、各周波数の振幅スペクトルのピークの最大値に基づいて推定するのは、平均値を用いることに起因する弊害を避けるためである。 In step SB120, the reproduced sound estimation unit 212 analyzes the data generated in step SB110 to estimate the volume of the reproduced sound in the fixed time for each frequency band, and the data DA2 (see FIG. 8) showing the estimation result. Is output to the sound effect coefficient calculation unit 214. More specifically, the reproduced sound estimation unit 212 determines which frequency has the maximum peak of the amplitude spectrum for each frequency band with reference to the data DA1, and the maximum peak is the reproduced sound in the frequency band. The volume is set to. For example, it is assumed that the peak of the amplitude spectrum of the center frequency is the maximum in the frequency band of band number = 1. In this case, the reproduced sound estimation unit 212 sets the peak of the amplitude spectrum of the center frequency (63 Hz) as the volume of the reproduced sound in the frequency band of band number = 1. In the present embodiment, the volume of the reproduced sound in the fixed time is estimated based on the maximum value of the peak of the amplitude spectrum of each frequency, not the average value of the amplitude spectrum of each frequency belonging to the frequency band for each frequency band. Is to avoid the harmful effects caused by using the average value.

雑音推定部２１３は、図４における雑音推定処理Ａ１２０を実行する。図９は、雑音推定処理の流れを示す図である。図９に示すように、雑音推定部２１３は、上記一定時間分の音データＤ３を処理対象として前述のステップＳＢ１００およびステップＳＢ１１０の処理を実行する。音データＤ３を処理対象としてステップＳＢ１００の処理を実行することで、例えば、図１０に示す振幅スペクトルＳＰ２－１からＳＰ２－４が生成される。以下では、振幅スペクトルＳＰ２－１からＳＰ２－４の各々を区別する必要がない場合には「振幅スペクトルＳＰ２」と表記する。そして、雑音推定部２１３は、振幅スペクトルＳＰ２－１からＳＰ２－４（図１０参照）を処理対象としてステップＳＢ１１０を実行することで、上記一定時間における雑音の各周波数の振幅スペクトルのピークを表すデータＤＢ１（図１１参照）を生成する。 The noise estimation unit 213 executes the noise estimation process A120 in FIG. FIG. 9 is a diagram showing a flow of noise estimation processing. As shown in FIG. 9, the noise estimation unit 213 executes the processing of the above-mentioned steps SB100 and SB110 with the sound data D3 for a certain period of time as the processing target. By executing the process of step SB100 with the sound data D3 as the processing target, for example, SP2-4 is generated from the amplitude spectra SP2-1 shown in FIG. In the following, when it is not necessary to distinguish each of the amplitude spectra SP2-1 to SP2-4, it is referred to as “amplitude spectrum SP2”. Then, the noise estimation unit 213 executes step SB110 with the amplitude spectra SP2-1 to SP2-4 (see FIG. 10) as processing targets, so that data representing the peak of the amplitude spectrum of each frequency of the noise in the fixed time. Generate DB1 (see FIG. 11).

雑音推定処理においてステップＳＢ１１０に後続して実行されるステップＳＣ１２０では、雑音推定部２１３は、データＤＢ１を解析して上記一定時間における雑音の音量を周波数帯域毎に推定し、その推定結果を示すデータＤＢ２（図１２参照）を音響効果係数算出部２１４へ出力する。より詳細に説明すると、このステップＳＣ１２０では、雑音推定部２１３は、データＤＢ１を参照して周波数帯域毎にいずれの周波数の振幅スペクトルのピークが最小であるかを判定し、最小のピークを当該周波数帯域における雑音の音量とする。例えば、バンド番号＝１の周波数帯域ではその上限周波数の振幅スペクトルのピークが最小であったとする。この場合、雑音推定部２１３は、当該周波数の振幅スペクトルのピークを当該周波数帯域における雑音の音量とする。本実施形態において、上記一定時間における雑音の音量を周波数帯域毎にその周波数帯域に属する各周波数の振幅スペクトルの平均値ではなく、各周波数の振幅スペクトルのピークの最小値に基づいて推定するのは、突発的な雑音の影響により雑音の音量が過大となることを回避しつつ、平均値を用いることに起因する弊害を避けるためである。 In step SC120, which is executed after step SB110 in the noise estimation process, the noise estimation unit 213 analyzes the data DB1 to estimate the noise volume in the fixed time for each frequency band, and the data showing the estimation result. DB2 (see FIG. 12) is output to the acoustic effect coefficient calculation unit 214. More specifically, in this step SC120, the noise estimation unit 213 determines which frequency has the smallest peak of the amplitude spectrum for each frequency band with reference to the data DB 1, and sets the minimum peak to the frequency. The volume of noise in the band. For example, it is assumed that the peak of the amplitude spectrum of the upper limit frequency is the smallest in the frequency band of band number = 1. In this case, the noise estimation unit 213 uses the peak of the amplitude spectrum of the frequency as the volume of noise in the frequency band. In the present embodiment, the volume of noise in the fixed time is estimated based on the minimum value of the peak of the amplitude spectrum of each frequency, not the average value of the amplitude spectrum of each frequency belonging to the frequency band for each frequency band. This is to prevent the volume of noise from becoming excessive due to the influence of sudden noise, and to avoid the harmful effects caused by using the average value.

本実施形態では、上記一定時間における雑音の音量を周波数帯域毎にその周波数帯域に属する各周波数の振幅スペクトルのピークの最小値に基づいて推定した。しかし、周波数帯域における各周波数の振幅スペクトルのピークの中央値、または各周波数の振幅スペクトルのピークの値をヒストグラム化することで得られた最頻出値に基づいて雑音の音量を推定してもよい。これらの態様によっても、突発的な雑音の影響により雑音の音量が過大となることを回避しつつ、平均値を用いることに起因する弊害を避けることができると考えられるからである。また、周波数帯域に属する各周波数の振幅スペクトルのピークから最大値および最小値を除外して振幅スペクトルのピークの平均値を算出し、その平均値を当該周波数帯域における雑音の音量としてもよい。このような態様によっても、最大値および最小値を除外せずに平均値を算出する態様に比較して急峻なピークを有する雑音の影響を受けないようにすることが可能になる。 In the present embodiment, the volume of noise in the fixed time is estimated for each frequency band based on the minimum value of the peak of the amplitude spectrum of each frequency belonging to the frequency band. However, the volume of noise may be estimated based on the median value of the peak of the amplitude spectrum of each frequency in the frequency band or the most frequent value obtained by making a histogram of the value of the peak of the amplitude spectrum of each frequency. .. This is because it is considered that these modes also prevent the noise volume from becoming excessive due to the influence of sudden noise, and at the same time, avoid the harmful effects caused by using the average value. Further, the average value of the peaks of the amplitude spectrum may be calculated by excluding the maximum value and the minimum value from the peak of the amplitude spectrum of each frequency belonging to the frequency band, and the average value may be used as the volume of noise in the frequency band. Even in such an embodiment, it is possible to avoid being affected by noise having a steep peak as compared with the embodiment in which the average value is calculated without excluding the maximum value and the minimum value.

音響効果係数算出部２１４は、図４における音響効果係数算出処理ＳＡ１３０を実行する。図１３は、音響効果係数算出処理の流れを示す図である。図１３における補正処理（ＳＤ１００）では、音響効果係数算出部２１４は、データＤＡ２の表す各周波数帯域の音量を補正テーブル３０１の格納内容（図３参照）に応じて補正し、補正後の音量を表すデータＤＣ１（図１４参照）を生成する。例えば、図３に示す補正テーブル３０１には、バンド番号＝１の周波数帯域の音量を５ｄＢ引き下げることを示すデータが格納されている。このため、本実施形態の補正処理では、バンド番号＝１の周波数帯域の音量は、データＤＡ２の示す当該周波数帯域の音量から５ｄＢ引き下げられる。 The sound effect coefficient calculation unit 214 executes the sound effect coefficient calculation process SA130 in FIG. FIG. 13 is a diagram showing the flow of the sound effect coefficient calculation process. In the correction process (SD100) in FIG. 13, the sound effect coefficient calculation unit 214 corrects the volume of each frequency band represented by the data DA2 according to the stored contents (see FIG. 3) of the correction table 301, and corrects the corrected volume. The data DC1 to be represented (see FIG. 14) is generated. For example, the correction table 301 shown in FIG. 3 stores data indicating that the volume of the frequency band of band number = 1 is reduced by 5 dB. Therefore, in the correction process of the present embodiment, the volume of the frequency band of band number = 1 is reduced by 5 dB from the volume of the frequency band indicated by the data DA2.

補正処理に後続する比較処理（図１３：ＳＤ１１０）では、音響効果係数算出部２１４は、複数の周波数帯域の各々における再生音の音量と雑音の音量とを周波数帯域毎に比較し、その比較結果を示すデータＤＣ２を生成する。具体的には、音響効果係数算出部２１４は、周波数帯域毎にデータＤＣ１の示す音量からデータＤＢ２の示す音量を減算してデータＤＣ２（図１５参照）を生成する。このデータＤＣ２において音量が負の値となっている周波数帯域は、再生音が雑音に埋もれていることを意味する。例えば、図１５に示すデータＤＣ２は、バンド番号＝１および２の各周波数帯域で再生音が雑音に埋もれていることを示している。そして、係数算出処理（図１３：ＳＤ１２０）では、音響効果係数算出部２１４は、周波数帯域毎に当該周波数帯域における再生音の音量と雑音の音量との差（データＤＣ２の示す音量差）に応じた音響効果係数を音響効果係数テーブル３０２から読み出して音響効果付与部２１１に与える。例えば、比較処理にて図１５に示すデータＤＣ２が算出された場合、バンド番号＝１および２の各周波数帯域の再生音の音量を、データＤＣ２の示す音量差に応じて引き上げることを示す音響効果係数が音響効果係数算出部２１４から音響効果付与部２１１へ与えられる。 In the comparison process (FIG. 13: SD110) following the correction process, the sound effect coefficient calculation unit 214 compares the volume of the reproduced sound and the volume of the noise in each of the plurality of frequency bands for each frequency band, and the comparison result is obtained. The data DC2 indicating the above is generated. Specifically, the acoustic effect coefficient calculation unit 214 generates the data DC2 (see FIG. 15) by subtracting the volume indicated by the data DB2 from the volume indicated by the data DC1 for each frequency band. The frequency band in which the volume is a negative value in the data DC2 means that the reproduced sound is buried in noise. For example, the data DC2 shown in FIG. 15 shows that the reproduced sound is buried in noise in each frequency band of band numbers = 1 and 2. Then, in the coefficient calculation process (FIG. 13: SD120), the acoustic effect coefficient calculation unit 214 responds to the difference between the volume of the reproduced sound and the volume of the noise in the frequency band (volume difference indicated by the data DC2) for each frequency band. The sound effect coefficient is read out from the sound effect coefficient table 302 and given to the sound effect imparting unit 211. For example, when the data DC2 shown in FIG. 15 is calculated by the comparison process, an acoustic effect indicating that the volume of the reproduced sound in each frequency band of band numbers = 1 and 2 is raised according to the volume difference indicated by the data DC2. The coefficient is given from the sound effect coefficient calculation unit 214 to the sound effect imparting unit 211.

音響効果付与部２１１は、再生装置２１０から与えられる音データＤ１に対して、音響効果係数算出部２１４から与えられた音響効果係数に応じた音響効果の付与（本実施形態では音量の調整）を行い、当該音響効果を付与済みの音データをＤ／Ａ変換器２５０に出力する。 The sound effect imparting unit 211 imparts an acoustic effect (adjustment of the volume in the present embodiment) according to the acoustic effect coefficient given by the acoustic effect coefficient calculation unit 214 to the sound data D1 given by the reproduction device 210. Then, the sound data to which the sound effect has been added is output to the D / A converter 250.

このような構成としたため、本実施形態のオーディオ装置１２０からスピーカ１３０へ与えられる音データには、各周波数帯域における再生音が当該周波数帯域における雑音に埋もれないようにする音響効果が付与されているので、いずれの周波数帯域においても再生音が雑音に埋もれることはない。 Due to such a configuration, the sound data given from the audio device 120 of the present embodiment to the speaker 130 is provided with an acoustic effect that prevents the reproduced sound in each frequency band from being buried in the noise in the frequency band. Therefore, the reproduced sound is not buried in the noise in any frequency band.

また、本実施形態のオーディオ装置１２０によれば、再生音の振幅スペクトルに急峻なピークが現れている周波数帯域の再生音の音量は当該ピークに基づいて推定される。このため、当該周波数帯域の再生音の音量が過小に推定され、当該周波数帯域の再生音の音量が過大に引き上げられることはない。また、ある周波数帯域において雑音の振幅スペクトルに急峻なピークが現れていても、当該周波数帯域における雑音の音量が過大に推定されることはなく、当該周波数帯域における再生音の音量が過大に引き上げられることはない。 Further, according to the audio device 120 of the present embodiment, the volume of the reproduced sound in the frequency band in which a steep peak appears in the amplitude spectrum of the reproduced sound is estimated based on the peak. Therefore, the volume of the reproduced sound in the frequency band is estimated to be underestimated, and the volume of the reproduced sound in the frequency band is not excessively increased. Further, even if a steep peak appears in the amplitude spectrum of noise in a certain frequency band, the volume of noise in the frequency band is not overestimated, and the volume of reproduced sound in the frequency band is excessively raised. There is no such thing.

このように本実施形態によれば、雑音の存在する環境で音を再生する際に、再生音の音量を過大に引き上げることなく、再生音が雑音に埋もれないようにすることが可能になる。 As described above, according to the present embodiment, when the sound is reproduced in an environment where noise is present, it is possible to prevent the reproduced sound from being buried in the noise without raising the volume of the reproduced sound excessively.

＜２．変形例＞
以上の実施態様は多様に変形され得る。具体的な変形の態様を以下に例示する。以下の例示から任意に選択された２以上の態様は相矛盾しない限り適宜に併合され得る。
＜２－１：変形例１＞
上記実施形態では、再生音の音量と雑音の音量とをオクターブバンド単位で推定し、オクターブバンド単位で再生音の音量を補正した。しかし、バンド幅が１／３の１／３オクターブバンド単位で、再生音および雑音の音量の推定と再生音の音量の補正をおこなってもよい。この場合、補正テーブル３０１には、１／３オクターブ単位の周波数帯域の各々において雑音の音量と再生音の音量とを比較する際の再生音の音量の補正量を表すデータを格納しておけばよい。また、再生音推定部２１２には、図８に示すデータＤ１に代えて、図１６に示すように、各々１／３オクターブのバンド幅を有するバンド番号＝１から２１の周波数帯域における再生音の音量を表すデータＤ１を生成させるようにすればよい。同様に、雑音推定部２１３には、図１２に示すデータＤ２に代えて、図１７に示すようにバンド番号＝１から２１の各周波数帯域における雑音の音量を表すデータＤ２を生成させるようにすればよい。なお、図１６および図１７において、バンド番号＝１の周波数帯域の中心周波数は６３Ｈｚ、バンド番号＝２の周波数帯域の中心周波数は８０Ｈｚ、…バンド番号＝２０の周波数帯域の中心周波数は５０００Ｈｚ、バンド番号＝２１の周波数帯域の中心周波数は６３００Ｈｚである。 <2. Modification example>
The above embodiments can be variously modified. Specific modes of modification are illustrated below. Two or more embodiments arbitrarily selected from the following examples can be appropriately merged as long as they do not conflict with each other.
<2-1: Modification 1>
In the above embodiment, the volume of the reproduced sound and the volume of the noise are estimated in octave band units, and the volume of the reproduced sound is corrected in octave band units. However, the volume of the reproduced sound and the noise may be estimated and the volume of the reproduced sound may be corrected in units of 1/3 octave bands having a bandwidth of 1/3. In this case, if the correction table 301 stores data representing the correction amount of the volume of the reproduced sound when comparing the volume of the noise and the volume of the reproduced sound in each of the frequency bands in units of 1/3 octave. good. Further, in the reproduced sound estimation unit 212, instead of the data D1 shown in FIG. 8, as shown in FIG. 16, the reproduced sound in the frequency band of band numbers = 1 to 21 each having a bandwidth of 1/3 octave is used. The data D1 representing the volume may be generated. Similarly, the noise estimation unit 213 is made to generate data D2 representing the volume of noise in each frequency band of band numbers = 1 to 21 as shown in FIG. 17, instead of the data D2 shown in FIG. Just do it. In FIGS. 16 and 17, the center frequency of the frequency band of band number = 1 is 63 Hz, the center frequency of the frequency band of band number = 2 is 80 Hz, ... The center frequency of the frequency band of band number = 20 is 5000 Hz, and the band. The center frequency of the frequency band of number = 21 is 6300 Hz.

＜２－２：変形例２＞
上記実施形態では、音響効果付与部２１１、再生音推定部２１２、雑音推定部２１３、音響効果係数算出部２１４、およびエコーキャンセル部２１５をソフトウェアで実現した。しかし、音響効果付与部２１１、再生音推定部２１２、雑音推定部２１３、音響効果係数算出部２１４、およびエコーキャンセル部２１５の各々を電子回路などのハードウェアで実現し、それらハードウェアを組み合わせてオーディオ装置１２０を構成してもよい。この場合、音響効果係数算出部２１４については、補正処理（図１３：ＳＤ１００）を実行する補正部と、比較処理（図１３：ＳＤ１１０）を実行する比較部と、係数算出処理（図１３；ＳＤ１２０）を実行する係数算出部の各部を電子回路で実現し、これら各部を組み合わせて構成すればよい。 <2-2: Modification 2>
In the above embodiment, the sound effect imparting unit 211, the reproduced sound estimation unit 212, the noise estimation unit 213, the sound effect coefficient calculation unit 214, and the echo canceling unit 215 are realized by software. However, each of the sound effect imparting unit 211, the reproduced sound estimation unit 212, the noise estimation unit 213, the sound effect coefficient calculation unit 214, and the echo canceling unit 215 is realized by hardware such as an electronic circuit, and these hardwares are combined. The audio device 120 may be configured. In this case, regarding the sound effect coefficient calculation unit 214, a correction unit that executes a correction process (FIG. 13: SD100), a comparison unit that executes a comparison process (FIG. 13: SD110), and a coefficient calculation process (FIG. 13; SD120). ) May be realized by an electronic circuit, and each part of the coefficient calculation part may be combined and configured.

＜２－３：変形例３＞
上記実施形態では、再生音が雑音に埋もれないように再生音の音量の補正量（音響効果係数）を決定した。しかし、補正後の再生音の音量を示すデータをバッファに蓄積しておき、当該バッファの格納内容の示す再生音の音量の時間変化から音のアタック部であるか否か、或いはリリース部であるか否かを判定し、その判定結果に応じて上記補正量を調整してもよい。例えば、アタック部である場合には、補正後の再生音の音量が前回の補正後の再生音の音量を上回るように上記補正量を調整することが好ましい。補正後の再生音の音量が前回の補正後の再生音の音量を下回っているとアタック感が薄れるからである。同様に、リリース部である場合には、補正後の再音の音量が前回の補正後の再生音の音量を下回るように補正量を調整することが好ましい。補正後の再生音の音量が前回の補正後の再生音の音量を上回っているとリリース感が薄れるからである。 <2-3: Modification 3>
In the above embodiment, the correction amount (acoustic effect coefficient) of the volume of the reproduced sound is determined so that the reproduced sound is not buried in the noise. However, the data indicating the volume of the reproduced sound after correction is stored in the buffer, and whether or not it is the attack part of the sound or the release part from the time change of the volume of the reproduced sound indicated by the stored contents of the buffer. It may be determined whether or not, and the correction amount may be adjusted according to the determination result. For example, in the case of the attack unit, it is preferable to adjust the correction amount so that the volume of the reproduced sound after the correction exceeds the volume of the reproduced sound after the previous correction. This is because if the volume of the reproduced sound after the correction is lower than the volume of the reproduced sound after the previous correction, the attack feeling is diminished. Similarly, in the case of the release unit, it is preferable to adjust the correction amount so that the volume of the re-sound after the correction is lower than the volume of the reproduced sound after the previous correction. This is because if the volume of the reproduced sound after the correction exceeds the volume of the reproduced sound after the previous correction, the feeling of release is diminished.

＜２－４：変形例４＞
上記実施形態のオーディオ装置１２０は再生装置２１０を含んでいたが、信号線を介して再生装置２１０をオーディオ装置１２０に接続する態様であってもよい。つまり、再生装置２１０は本発明のオーディオ装置の必須構成要素ではなく、省略可能である。同様に、記憶部２３０も本発明のオーディオ装置の必須構成要素ではなく、省略可能である。記憶部２３０を省略する場合には、ＵＳＢケーブル等を介してオーディオ装置１２０に接続されるハードディスクドライブ、或いはインターネットなどの電気通信回線経由で演算部２２０がアクセス可能なハードディスクドライブに記憶部２３０の役割を担わせればよい。 <2-4: Modification 4>
Although the audio device 120 of the above embodiment includes the playback device 210, the playback device 210 may be connected to the audio device 120 via a signal line. That is, the reproduction device 210 is not an essential component of the audio device of the present invention and can be omitted. Similarly, the storage unit 230 is not an essential component of the audio device of the present invention and can be omitted. When the storage unit 230 is omitted, the storage unit 230 plays a role in a hard disk drive connected to the audio device 120 via a USB cable or the like, or a hard disk drive accessible by the arithmetic unit 220 via a telecommunication line such as the Internet. You just have to carry.

マイクロフォン１４０がＡ／Ｄ変換器を含んでいる場合には、Ａ／Ｄ変換器２４０は不要であり、スピーカ１３０或いはアンプ２６０がＤ／Ａ変換器を含んでいる場合にはＤ／Ａ変換器２５０は不要である。また、信号線を介してオーディオ装置１２０に接続されるアンプにアンプ２６０の役割を担わせてもよく、この場合はアンプ２６０を省略可能である。 If the microphone 140 includes an A / D converter, the A / D converter 240 is unnecessary, and if the speaker 130 or the amplifier 260 includes a D / A converter, the D / A converter is unnecessary. 250 is unnecessary. Further, the amplifier connected to the audio device 120 via the signal line may play the role of the amplifier 260, and in this case, the amplifier 260 can be omitted.

また、マイクロフォン１４０として鋭い指向性を有するマイクロフォンを用い、当該マイクロフォンを雑音の音源近傍に設置する等、スピーカ１３０から放射される差音がマイクロフォン１４０によって収音されないようにすることができる場合には、エコーキャンセル部２１５を省略してもよい。また、音響効果付与部２１１をオーディオ装置１２０とは別箇のＤＳＰにより実現する場合には、音響効果付与部２１１も省略可能である。 Further, when a microphone having sharp directivity is used as the microphone 140 and the microphone is installed near a sound source of noise so that the difference tone radiated from the speaker 130 cannot be picked up by the microphone 140. , The echo canceling unit 215 may be omitted. Further, when the sound effect imparting unit 211 is realized by a DSP different from the audio device 120, the sound effect imparting unit 211 can also be omitted.

つまり、本発明のオーディオ装置は、スピーカから環境に放射される再生音を表す第１音信号を時間軸方向に複数に分割して得られる各信号に短時間周波数解析を施して一定時間における振幅スペクトルのピークを周波数毎に特定し、特定した振幅スペクトルのピークに基づいて前記再生音の音量を周波数帯域毎に推定する第１推定部（上記実施形態における再生音推定部２１２）と、前記環境において聴取される雑音を表す第２音信号を時間軸方向に複数に分割して得られる各信号に短時間周波数解析を施して前記一定時間における振幅スペクトルのピークを周波数毎に特定し、特定した振幅スペクトルのピークに基づいて前記雑音の音量を周波数帯域毎に推定する第２推定部（上記実施形態における雑音推定部２１３）と、前記第１推定部により推定された音量と前記第２推定部により推定された音量とを周波数帯域毎に比較し、比較結果に基づいて各周波数帯域において前記再生音の音量が前記雑音の音量を上回るように前記第１音信号に付与する音響効果係数を算出する算出部（上記実施形態における音響効果係数算出部２１４）と、を備えていればよい。 That is, in the audio device of the present invention, a short-time frequency analysis is performed on each signal obtained by dividing the first sound signal representing the reproduced sound radiated from the speaker to the environment into a plurality of sounds in the time axis direction, and the amplitude in a certain time is set. The first estimation unit (reproduced sound estimation unit 212 in the above embodiment) that specifies the peak of the spectrum for each frequency and estimates the volume of the reproduced sound for each frequency band based on the peak of the specified amplitude spectrum, and the environment. A short-time frequency analysis was performed on each signal obtained by dividing the second sound signal representing the noise heard in the above into a plurality of signals in the time axis direction, and the peak of the amplitude spectrum in the fixed time was specified and specified for each frequency. The second estimation unit (noise estimation unit 213 in the above embodiment) that estimates the volume of the noise for each frequency band based on the peak of the amplitude spectrum, the volume estimated by the first estimation unit, and the second estimation unit. The volume estimated by is compared for each frequency band, and the acoustic effect coefficient given to the first sound signal so that the volume of the reproduced sound exceeds the volume of the noise in each frequency band is calculated based on the comparison result. It suffices to include a calculation unit (acoustic effect coefficient calculation unit 214 in the above embodiment).

＜２－５：変形例５＞
上記実施形態では、再生支援プログラム３０３とは別箇に補正テーブル３０１と音響効果係数テーブル３０２とが記憶部２３０に記憶されていたが、補正テーブル３０１と音響効果係数テーブル３０２とを再生支援プログラム３０３に埋め込んで置いてもよい。また、上記実施形態では、オーディオ装置１２０の記憶部２３０に再生支援プログラム３０３を予め記憶させておいたが、補正テーブル３０１および音響効果係数テーブル３０２が埋め込まれているか否かを問わずに再生支援プログラム３０３を単体で提供してもよい。再生支援プログラム３０３の具体的な提供態様としては、ＣＤ－ＲＯＭなどのコンピュータ読み取り可能な記録媒体に書き込んで配布する態様、またはインターネットなどの電気通信回線経由のダウンロードにより配布する態様が考えられる。パーソナルコンピュータなどの一般的なコンピュータ装置に上記の要領で配布される再生支援プログラム３０３をインストールし、当該コンピュータ装置のプロセッサに当該再生支援プログラム３０３を実行させることで、当該コンピュータ装置を上記実施形態のオーディオ装置１２０として機能させることが可能になる。なお、補正テーブル３０１および音響効果係数テーブル３０２を再生支援プログラム３０３に埋め込んで置かない態様の場合、これらテーブルは当該再生支援プログラム３０３を実行するコンピュータ装置がアクセス可能な記憶装置に記憶されていればよい。 <2-5: Modification 5>
In the above embodiment, the correction table 301 and the sound effect coefficient table 302 are stored in the storage unit 230 separately from the reproduction support program 303, but the correction table 301 and the sound effect coefficient table 302 are stored in the reproduction support program 303. It may be embedded in. Further, in the above embodiment, the reproduction support program 303 is stored in advance in the storage unit 230 of the audio device 120, but the reproduction support is performed regardless of whether or not the correction table 301 and the sound effect coefficient table 302 are embedded. The program 303 may be provided alone. As a specific mode of providing the reproduction support program 303, a mode of writing and distributing it on a computer-readable recording medium such as a CD-ROM, or a mode of distributing it by downloading via a telecommunication line such as the Internet can be considered. By installing the reproduction support program 303 distributed in the above manner on a general computer device such as a personal computer and causing the processor of the computer device to execute the reproduction support program 303, the computer device of the above embodiment can be obtained. It becomes possible to function as an audio device 120. In the case where the correction table 301 and the sound effect coefficient table 302 are not embedded in the reproduction support program 303, these tables are stored in an accessible storage device of the computer device that executes the reproduction support program 303. good.

＜２－６：変形例６＞
上記実施形態では、カーオーディオシステムに含まれるオーディオ装置への本発明の適用例を説明したが、本発明の適用対象はカーオーディオシステムに含まれるオーディオ装置に限定される訳ではない。また、上記実施形態では、各周波数帯域において再生音が雑音に埋もれることがないように、各周波数帯域において再生音の音量が雑音の音量を上回るように第１音信号に付与する音響効果係数を算出した。しかし、再生音の音信号に付与する音響効果係数の算出（すなわち、再生音のチューニング）を、再生音について推定した音量と雑音について推定した音量との比較結果に応じて行う態様であればよい。要は、雑音の存在する環境で音の再生を行うオーディオ装置であれば、本発明を適用することで、再生音の音量が過小に推定されることを回避しつつ再生音のチューニングを行うことが可能になる。 <2-6: Modification 6>
In the above embodiment, an example of application of the present invention to an audio device included in a car audio system has been described, but the application target of the present invention is not limited to the audio device included in the car audio system. Further, in the above embodiment, the acoustic effect coefficient given to the first sound signal so that the volume of the reproduced sound exceeds the volume of the noise in each frequency band is set so that the reproduced sound is not buried in the noise in each frequency band. Calculated. However, the calculation of the sound effect coefficient given to the sound signal of the reproduced sound (that is, tuning of the reproduced sound) may be performed according to the comparison result between the volume estimated for the reproduced sound and the volume estimated for the noise. .. In short, if it is an audio device that reproduces sound in a noisy environment, by applying the present invention, tuning of the reproduced sound can be performed while avoiding that the volume of the reproduced sound is underestimated. Will be possible.

＜３．実施形態および各変形例の少なくとも１つから把握される態様＞
上述した各実施形態および各変形例の少なくとも１つから以下の態様が把握される。
オーディオ装置の一態様は、スピーカから環境に放射される再生音を表す第１音信号を時間軸方向に複数に分割して得られる各信号に短時間周波数解析を施して一定時間における振幅スペクトルのピークを周波数毎に特定し、特定した振幅スペクトルのピークに基づいて前記再生音の音量を周波数帯域毎に推定する第１推定部と、前記環境において聴取される雑音を表す第２音信号を時間軸方向に複数に分割して得られる各信号に短時間周波数解析を施して前記一定時間における振幅スペクトルのピークを周波数毎に特定し、特定した振幅スペクトルのピークに基づいて前記雑音の音量を周波数帯域毎に推定する第２推定部と、前記第１推定部により推定された音量と前記第２推定部により推定された音量とを周波数帯域毎に比較し、比較結果に基づいて前記第１音信号に付与する音響効果係数を算出する算出部と、を備える。
本態様によれば、各周波数帯域における再生音の音量は周波数帯域に属する各周波数の振幅スペクトルのピークに応じて推定される。このため、周波数帯域に属する各周波数の振幅スペクトルの平均値に応じて再生音の音量を推定する従来技術に比較して、再生音の振幅スペクトルに急峻なピークが現れている場合であっても、当該ピークに対応する周波数の属する周波数帯域における再生音の音量が過小に推定されることが回避される。本態様によれば、雑音の存在する環境で再生される再生音の振幅スペクトルに急峻なピークが現れている場合であっても、当該ピークに対応する周波数の属する周波数帯域における再生音の音量が過小に推定されることを回避しつつ、再生音をチューニングすることが可能になる。 <3. Aspects grasped from at least one of the embodiments and each modification>
The following aspects are grasped from at least one of each of the above-described embodiments and modifications.
One aspect of the audio device is to perform short-time frequency analysis on each signal obtained by dividing the first sound signal representing the reproduced sound radiated from the speaker to the environment into a plurality of signals in the time axis direction, and to obtain an amplitude spectrum at a fixed time. The first estimation unit that specifies the peak for each frequency and estimates the volume of the reproduced sound for each frequency band based on the peak of the specified amplitude spectrum, and the second sound signal that represents the noise heard in the environment are timed. A short-time frequency analysis is performed on each signal obtained by dividing into a plurality of signals in the axial direction to identify the peak of the amplitude spectrum for each frequency, and the volume of the noise is determined based on the peak of the specified amplitude spectrum. The second estimation unit estimated for each band, the volume estimated by the first estimation unit, and the volume estimated by the second estimation unit are compared for each frequency band, and the first sound is based on the comparison result. It is provided with a calculation unit for calculating an acoustic effect coefficient applied to a signal.
According to this aspect, the volume of the reproduced sound in each frequency band is estimated according to the peak of the amplitude spectrum of each frequency belonging to the frequency band. Therefore, even when a steep peak appears in the amplitude spectrum of the reproduced sound as compared with the conventional technique of estimating the volume of the reproduced sound according to the average value of the amplitude spectrum of each frequency belonging to the frequency band. , It is avoided that the volume of the reproduced sound in the frequency band to which the frequency corresponding to the peak belongs is underestimated. According to this aspect, even when a steep peak appears in the amplitude spectrum of the reproduced sound reproduced in a noisy environment, the volume of the reproduced sound in the frequency band to which the frequency corresponding to the peak belongs is It is possible to tune the reproduced sound while avoiding underestimation.

上述したオーディオ装置の一態様において、前記第１推定部は、周波数帯域における前記再生音の音量をその周波数帯域に属する各周波数の振幅スペクトルのピークの最大値に基づいて推定することが好ましい。
再生音の振幅スペクトルに急峻なピークが現れている場合、当該ピークは上記最大値に対応する。このため、本態様によれば、周波数帯域における再生音の音量が過小に推定されることを確実に回避することができる。 In one aspect of the audio apparatus described above, it is preferable that the first estimation unit estimates the volume of the reproduced sound in the frequency band based on the maximum value of the peak of the amplitude spectrum of each frequency belonging to the frequency band.
When a steep peak appears in the amplitude spectrum of the reproduced sound, the peak corresponds to the above maximum value. Therefore, according to this aspect, it is possible to surely prevent the volume of the reproduced sound in the frequency band from being underestimated.

上述したオーディオ装置の一態様において、前記第２推定部は、周波数帯域における前記雑音の音量をその周波数帯域に属する各周波数の振幅スペクトルのピークの最小値、中央値、最頻出値または各周波数に対応する振幅スペクトルのピークから最小値と最大値とを除外して算出される平均値のいずれかに基づいて推定することが好ましい。
本態様によれば、再生音の振幅スペクトルに急峻なピークが現れている場合、当該ピークは上記最大値に対応する。本態様によれば、各周波数帯域における雑音の音量の際に当該最大値は使用されないので、雑音の振幅スペクトルに急峻なピークが現れている周波数帯域における当該雑音の音量が過大に推定されることを確実に回避することができる。 In one aspect of the audio apparatus described above, the second estimation unit sets the volume of the noise in the frequency band to the minimum value, the median value, the most frequent value or each frequency of the peak of the amplitude spectrum of each frequency belonging to the frequency band. It is preferable to estimate based on one of the average values calculated by excluding the minimum value and the maximum value from the peak of the corresponding amplitude spectrum.
According to this aspect, when a steep peak appears in the amplitude spectrum of the reproduced sound, the peak corresponds to the above maximum value. According to this aspect, since the maximum value is not used for the volume of noise in each frequency band, the volume of the noise in the frequency band in which a steep peak appears in the amplitude spectrum of the noise is overestimated. Can be reliably avoided.

上述したオーディオ装置の一態様において、前記算出部は、前記第１推定部により推定された音量と前記第２推定部により推定された音量とを周波数帯域毎に比較し、比較結果に基づいて各周波数帯域において前記再生音の音量が前記雑音の音量を上回るように前記第１音信号に付与する音響効果係数を算出することを特徴とする。
本態様によれば、音の存在する環境で音を再生する際に、再生音の音量を過大に引き上げることなく、雑音に埋もれないように再生音をチューニングすることが可能になる。 In one aspect of the audio device described above, the calculation unit compares the volume estimated by the first estimation unit and the volume estimated by the second estimation unit for each frequency band, and each is based on the comparison result. It is characterized in that an acoustic effect coefficient given to the first sound signal is calculated so that the volume of the reproduced sound exceeds the volume of the noise in the frequency band.
According to this aspect, when the sound is reproduced in the environment where the sound is present, it is possible to tune the reproduced sound so as not to be buried in the noise without raising the volume of the reproduced sound excessively.

上記オーディオ装置の一態様において、前記再生音と前記雑音とを収音するマイクロフォンから出力される第３音信号から前記第１音信号に対応する信号成分を除外して前記第２音信号を生成することが好ましい。
本態様によれば、マイクロフォンの出力信号を用いて雑音の音量を簡便かつ正確に推定することが可能になる。 In one aspect of the audio device, the second sound signal is generated by excluding the signal component corresponding to the first sound signal from the third sound signal output from the microphone that collects the reproduced sound and the noise. It is preferable to do.
According to this aspect, it becomes possible to easily and accurately estimate the volume of noise by using the output signal of the microphone.

上記オーディオ装置の一態様において、前記算出部は、前記第１推定部により推定された各周波数帯域における前記再生音の音量についての補正量を周波数帯域毎に規定して補正テーブルにしたがって、前記第１推定部により推定された前記再生音の音量を周波数帯域毎に補正する補正部と、前記補正部による補正後の音量と前記第２推定部により推定された前記雑音の音量とを周波数帯域毎に比較する比較部と、を有し、前記比較部における比較結果に応じて前記音響効果係数を周波数帯域毎に算出することを特徴とすることが好ましい。
本態様によれば、補正テーブルを切り替えることで多様なオーディオシステムに対応することが可能になる。 In one aspect of the audio device, the calculation unit defines a correction amount for the volume of the reproduced sound in each frequency band estimated by the first estimation unit for each frequency band, and according to the correction table, the first unit. 1 A correction unit that corrects the volume of the reproduced sound estimated by the estimation unit for each frequency band, a volume after correction by the correction unit, and a volume of the noise estimated by the second estimation unit for each frequency band. It is preferable to have a comparison unit to be compared with, and to calculate the sound effect coefficient for each frequency band according to the comparison result in the comparison unit.
According to this aspect, it becomes possible to correspond to various audio systems by switching the correction table.

音響効果係数算出方法の一態様は、スピーカから環境に放射される再生音を表す第１音信号を時間軸方向に複数に分割して得られる各信号に短時間周波数解析を施して一定時間における振幅スペクトルのピークを周波数毎に特定し、特定した振幅スペクトルのピークに基づいて前記再生音の音量を周波数帯域毎に推定し、前記環境において聴取される雑音を表す第２音信号を時間軸方向に複数に分割して得られる各信号に短時間周波数解析を施して前記一定時間における振幅スペクトルのピークを周波数毎に特定し、特定した振幅スペクトルのピークに基づいて前記雑音の音量を周波数帯域毎に推定し、前記再生音について推定された音量と前記雑音について推定された音量とを周波数帯域毎に比較し、比較結果に基づいて前記第１音信号に付与する音響効果係数を算出する。
本態様によっても、雑音の存在する環境で再生される再生音の振幅スペクトルに急峻なピークが現れている場合であっても、当該ピークに対応する周波数の属する周波数帯域における再生音の音量が過小に推定されることを回避しつつ、再生音をチューニングすることが可能になる。 One aspect of the acoustic effect coefficient calculation method is to perform short-time frequency analysis on each signal obtained by dividing the first sound signal representing the reproduced sound radiated from the speaker to the environment into a plurality of signals in the time axis direction, and for a certain period of time. The peak of the amplitude spectrum is specified for each frequency, the volume of the reproduced sound is estimated for each frequency band based on the peak of the specified amplitude spectrum, and the second sound signal representing the noise heard in the environment is measured in the time axis direction. Short-time frequency analysis is performed on each signal obtained by dividing into a plurality of signals to identify the peak of the amplitude spectrum for each frequency, and the volume of the noise is determined for each frequency band based on the peak of the specified amplitude spectrum. The volume estimated for the reproduced sound and the volume estimated for the noise are compared for each frequency band, and the acoustic effect coefficient given to the first sound signal is calculated based on the comparison result.
Also in this embodiment, even if a steep peak appears in the amplitude spectrum of the reproduced sound reproduced in a noisy environment, the volume of the reproduced sound in the frequency band to which the frequency corresponding to the peak belongs is too low. It is possible to tune the reproduced sound while avoiding the estimation.

プログラムの一態様は、プロセッサを、スピーカから環境に放射される再生音を表す第１音信号を時間軸方向に複数に分割して得られる各信号に短時間周波数解析を施して一定時間における振幅スペクトルのピークを周波数毎に特定し、特定した振幅スペクトルのピークに基づいて前記再生音の音量を周波数帯域毎に推定する第１推定部と、前記環境において聴取される雑音を表す第２音信号を時間軸方向に複数に分割して得られる各信号に短時間周波数解析を施して前記一定時間における振幅スペクトルのピークを周波数毎に特定し、特定した振幅スペクトルのピークに基づいて前記雑音の音量を周波数帯域毎に推定する第２推定部と、前記第１推定部により推定された音量と前記第２推定部により推定された音量とを周波数帯域毎に比較し、比較結果に基づいて前記第１音信号に付与する音響効果係数を算出する算出部と、して機能させる。
本態様によっても、雑音の存在する環境で再生される再生音の振幅スペクトルに急峻なピークが現れている場合であっても、当該ピークに対応する周波数の属する周波数帯域における再生音の音量が過小に推定されることを回避しつつ、再生音をチューニングすることが可能になる。 One aspect of the program is that the processor performs a short-time frequency analysis on each signal obtained by dividing the first sound signal representing the reproduced sound radiated from the speaker to the environment into a plurality of sounds in the time axis direction, and the amplitude at a fixed time. The first estimation unit that specifies the peak of the spectrum for each frequency and estimates the volume of the reproduced sound for each frequency band based on the peak of the specified amplitude spectrum, and the second sound signal that represents the noise heard in the environment. The peak of the amplitude spectrum in the fixed time is specified for each frequency by performing a short-time frequency analysis on each signal obtained by dividing the signal into a plurality of parts in the time axis direction, and the volume of the noise is based on the peak of the specified amplitude spectrum. The second estimation unit that estimates for each frequency band, the volume estimated by the first estimation unit, and the volume estimated by the second estimation unit are compared for each frequency band, and the second estimation unit is based on the comparison result. It functions as a calculation unit that calculates the acoustic effect coefficient given to one sound signal.
Also in this embodiment, even if a steep peak appears in the amplitude spectrum of the reproduced sound reproduced in a noisy environment, the volume of the reproduced sound in the frequency band to which the frequency corresponding to the peak belongs is too low. It is possible to tune the reproduced sound while avoiding the estimation.

１…車室、１１１…運転席、１１２…助手席、１１３…後列席、１２０…オーディオ装置、１３０,１３０－１. １３０－２. １３０－３. １３０－４…スピーカ、１４０…マイクロフォン、２１０…再生装置、２２０…演算部、２３０…記憶部、２４０…Ａ／Ｄ変換器、２５０…Ｄ／Ａ変換器、２６０…アンプ、２１１…音響効果付与部、２１２…再生音推定部、２１３…雑音推定部、２１４…音響効果係数算出部、２１５…エコーキャンセル部、３０１…補正テーブル、３０２…音響効果係数テーブル、３０３…再生支援プログラム。
1 ... Car room, 111 ... Driver's seat, 112 ... Passenger seat, 113 ... Back row seat, 120 ... Audio device, 130,130-1. 130-2. 130-3. 130-4 ... Speaker, 140 ... Microphone, 210 ... reproduction device, 220 ... arithmetic unit, 230 ... storage unit, 240 ... A / D converter, 250 ... D / A converter, 260 ... amplifier, 211 ... acoustic effect addition unit, 212 ... reproduction sound estimation unit, 213 ... Noise estimation unit, 214 ... Acoustic effect coefficient calculation unit, 215 ... Echo canceling unit, 301 ... Correction table, 302 ... Acoustic effect coefficient table, 303 ... Playback support program.

Claims

A short-time frequency analysis is performed on each signal obtained by dividing the first sound signal representing the reproduced sound radiated from the speaker to the environment into a plurality of signals in the time axis direction, and the peak of the amplitude spectrum in a certain time is specified for each frequency. The first estimation unit that estimates the volume of the reproduced sound for each frequency band based on the peak of the specified amplitude spectrum,
A short-time frequency analysis is performed on each signal obtained by dividing the second sound signal representing the noise heard in the environment into a plurality of signals in the time axis direction, and the peak of the amplitude spectrum in the fixed time is specified for each frequency. A second estimation unit that estimates the volume of the noise for each frequency band based on the peak of the specified amplitude spectrum, and
Calculation to compare the volume estimated by the first estimation unit and the volume estimated by the second estimation unit for each frequency band, and calculate the acoustic effect coefficient given to the first sound signal based on the comparison result. Department and
An audio device equipped with.

The audio device according to claim 1, wherein the first estimation unit estimates the volume of the reproduced sound in a frequency band based on the maximum value of the peak of the amplitude spectrum of each frequency belonging to the frequency band.

The second estimation unit sets the volume of the noise in the frequency band from the minimum value, the median value, the most frequent value of the peak value of the amplitude spectrum of each frequency belonging to the frequency band, or the peak value of the amplitude spectrum corresponding to each frequency. The audio device according to claim 1 or 2, wherein the estimation is based on any of the average values calculated excluding the maximum value and the maximum value.

The calculation unit compares the volume estimated by the first estimation unit and the volume estimated by the second estimation unit for each frequency band, and the volume of the reproduced sound in each frequency band is based on the comparison result. The audio device according to any one of claims 1 to 3, wherein an acoustic effect coefficient applied to the first sound signal is calculated so as to exceed the volume of the noise.

Having an echo canceling unit that generates the second sound signal by excluding the signal component corresponding to the first sound signal from the third sound signal output from the microphone that collects the reproduced sound and the noise. The audio device according to any one of claims 1 to 4, wherein the audio device is characterized.

The calculation unit
The correction amount for the volume of the reproduced sound in each frequency band estimated by the first estimation unit is specified for each frequency band, and the volume of the reproduced sound estimated by the first estimation unit is calculated according to the correction table. A correction unit that corrects for each frequency band,
It has a comparison unit for comparing the volume corrected by the correction unit and the volume of the noise estimated by the second estimation unit for each frequency band.
The audio device according to any one of claims 1 to 5, wherein the acoustic effect coefficient is calculated for each frequency band according to the comparison result in the comparison unit.

A short-time frequency analysis is performed on each signal obtained by dividing the first sound signal representing the reproduced sound radiated from the speaker to the environment into a plurality of signals in the time axis direction, and the peak of the amplitude spectrum in a certain time is specified for each frequency. , The volume of the reproduced sound is estimated for each frequency band based on the peak of the specified amplitude spectrum, and
A short-time frequency analysis is performed on each signal obtained by dividing the second sound signal representing the noise heard in the environment into a plurality of signals in the time axis direction, and the peak of the amplitude spectrum in the fixed time is specified for each frequency. The volume of the noise is estimated for each frequency band based on the peak of the specified amplitude spectrum, and the volume of the noise is estimated for each frequency band.
The volume estimated for the reproduced sound and the volume estimated for the noise are compared for each frequency band, and the acoustic effect coefficient given to the first sound signal is calculated based on the comparison result.
Sound effect coefficient calculation method.

Processor,
A short-time frequency analysis is performed on each signal obtained by dividing the first sound signal representing the reproduced sound radiated from the speaker to the environment into a plurality of signals in the time axis direction, and the peak of the amplitude spectrum in a certain time is specified for each frequency. The first estimation unit that estimates the volume of the reproduced sound for each frequency band based on the peak of the specified amplitude spectrum,
A short-time frequency analysis is performed on each signal obtained by dividing the second sound signal representing the noise heard in the environment into a plurality of signals in the time axis direction, and the peak of the amplitude spectrum in the fixed time is specified for each frequency. A second estimation unit that estimates the volume of the noise for each frequency band based on the peak of the specified amplitude spectrum, and
Calculation to compare the volume estimated by the first estimation unit and the volume estimated by the second estimation unit for each frequency band, and calculate the acoustic effect coefficient given to the first sound signal based on the comparison result. Department and
A program that makes it work.