JP2012100117A

JP2012100117A - Acoustic processing apparatus and method

Info

Publication number: JP2012100117A
Application number: JP2010246746A
Authority: JP
Inventors: Kyohei Kitazawa; 恭平北澤
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2010-11-02
Filing date: 2010-11-02
Publication date: 2012-05-24

Abstract

PROBLEM TO BE SOLVED: To provide an acoustic processing apparatus that can suppress grating caused by the reversal of fundamental sound and harmonic wave in a specific listening position, and in which grating sound does not newly occur in a position other than the specific listening position by applying filters.SOLUTION: The acoustic processing apparatus detects fundamental sound frequency f(n) and harmonic wave frequency f(n) from an input regenerative signal. In addition, the acoustic processing apparatus calculates fundamental sound level L(n) and harmonic wave level L(n) of the input regenerative signal and a convolution signal of an input audio characteristic. The acoustic processing apparatus generates a correction filter that satisfies L(n)>L(n) and applies the correction filter to a regenerative signal when the calculated fundamental sound amplitude level L(n) is compared with the harmonic wave amplitude level L(n) and there is L(n) greater than L(n).

Description

本発明は音響処理装置に関し、特に、聴取室の音響特性やスピーカの音圧周波数特性等の影響によって発生する音質の劣化を抑制する音響処理装置及び方法に関する。 The present invention relates to a sound processing apparatus, and more particularly to a sound processing apparatus and method for suppressing deterioration in sound quality caused by the influence of acoustic characteristics of a listening room, sound pressure frequency characteristics of a speaker, and the like.

Ｂｌｕ−ｒａｙＤｉｓｋなどの大容量記憶媒体の登場や高音質音源のネットワーク配信の拡大により、家庭内で高精度・高ダイナミックレンジのオーディオ信号が手軽に入手できるようになった。このようなオーディオ信号の高精度化に伴って、再生機器に対してもより原信号に近いすなわちより原音忠実な臨場感のある音響再生への要求が高まっている。原音忠実な再生環境を実現するには原信号がそのまま聴取者へ届く、すなわち再生装置から聴取者の耳までのオーディオ信号が辿る経路（以下「音響経路」という。）の音圧周波数特性（以下「ｆ特」という。）がフラットであることが望ましい。 With the advent of large-capacity storage media such as Blu-ray Disks and the expansion of network distribution of high-quality sound sources, high-accuracy and high-dynamic range audio signals can be easily obtained in the home. Along with such high accuracy of audio signals, there is an increasing demand for reproduction equipment for sound reproduction that is closer to the original signal, that is, more realistic and faithful to the original sound. In order to realize a reproduction environment that is faithful to the original sound, the original signal reaches the listener as it is, that is, the sound pressure frequency characteristic (hereinafter referred to as “acoustic path”) that the audio signal follows from the playback device to the listener's ear (hereinafter referred to as “acoustic path”) It is desirable that it is flat.

しかし、家庭でオーディオ信号を再生する場合、音響経路のｆ特をフラットにするのはきわめて困難である。音響経路のｆ特が変化する要因は様々あり、スピーカのｆ特や音響経路上の音響機器の持つｆ特、聴取室の音響特性などが挙げられるが、中でも影響力が大きいのが、聴取室の音響特性による影響である。 However, when reproducing an audio signal at home, it is very difficult to flatten the f characteristic of the acoustic path. There are various factors that change the f-characteristic of the acoustic path, such as the f-characteristic of the speaker, the f-characteristic of the acoustic device on the acoustic path, and the acoustic characteristics of the listening room. This is due to the influence of the acoustic characteristics.

聴取室内において聴取者にはスピーカから発音された直接音だけでなく、壁や床、天井などで反射した反射音も届く。それらの音波は干渉するため、ｆ特上に大きなディップやピークを生じる。さらに聴取位置が変化すると、直接音と反射音の経路差も変化するため、ピークやディップの位置が変化してしまう。つまり聴取室の音響特性による影響は聴取位置に依存し大きく変化してしまうという特性をもっている。 In the listening room, the listener receives not only the direct sound generated from the speaker but also the reflected sound reflected from the wall, floor, ceiling, etc. Since these sound waves interfere with each other, a particularly large dip or peak occurs. Further, when the listening position changes, the path difference between the direct sound and the reflected sound also changes, so that the position of the peak or dip changes. In other words, the influence of the acoustic characteristics of the listening room has a characteristic that it greatly changes depending on the listening position.

このように再生されたオーディオ信号は音質劣化、結果として聴感に様々な影響を及ぼす。そこで、原音忠実な信号再生を実現するためには上記のような音響経路のｆ特の乱れを補正し、聴感上の悪影響を抑制するような音響処理が非常に重要になってくる。 The audio signal reproduced in this way has various effects on sound quality degradation and consequently hearing. Therefore, in order to realize signal reproduction faithful to the original sound, acoustic processing that corrects the f-specific disturbance in the acoustic path as described above and suppresses adverse effects on hearing becomes very important.

従来、このようなｆ特の乱れに対しては、入力信号のｆ特を調整するグラフィックイコライザなどのｆ特調整装置を利用しユーザ自身が手作業で所望のｆ特に近づける方法があった。また、測定信号を用いて音響経路のｆ特を測定し、自動的に所望のｆ特に近づける補正を行う自動音場補正を用いる方法などもある（例えば、特許文献１参照）。 Conventionally, there has been a method in which the user himself / herself makes the desired f particularly close to the desired f by using an f special adjusting device such as a graphic equalizer that adjusts the f special of the input signal. In addition, there is a method using automatic sound field correction in which the f characteristic of an acoustic path is measured using a measurement signal and the correction is automatically performed so as to make the desired f particularly close (for example, see Patent Document 1).

特開昭６２−１６６６９８号公報Japanese Patent Laid-Open No. 62-166698

人の声や楽器の音は、図８（Ａ）に示すように、基音と高調波から構成される。基音とは、いわゆる音高（又はピッチ）にあたる周波数成分であり、高調波とは、基音のＮ倍(Ｎ＝２，３，４，・・・)の周波数成分である。通常、基音の振幅レベル（以下「基音レベルＬ_ｂ」という。）はＮ次高調波の振幅レベル（以下「高調波レベルＬ_Ｎ」という。）よりも大きく、基音レベルＬ_ｂと高調波レベルＬ_Ｎのバランスによって音色（ｔｏｎｅｃｏｌｏｒ）が決まる。 As shown in FIG. 8A, a human voice or instrument sound is composed of a fundamental tone and harmonics. The fundamental tone is a frequency component corresponding to a so-called pitch (or pitch), and the harmonic is a frequency component N times (N = 2, 3, 4,...) Of the fundamental tone. Usually, the amplitude level of the fundamental tone (hereinafter referred to as “fundamental tone level L _b ”) is larger than the amplitude level of the Nth harmonic (hereinafter referred to as “harmonic level L _N ”), and the fundamental tone level L _b and the harmonic level L _The tone color is determined by the balance of _N.

ここで、基音レベルＬ_ｂと高調波レベルＬ_Ｎのバランスについて、試聴実験から図８（Ｂ）に示すように、楽曲信号中の基音レベルＬ_ｂよりも高調波レベルＬ_Ｎが相対的に大きくなる現象（以下「逆転現象」という。）が発生すると聴感上耳障りに感じるとの結果が出ている。これはつまり、オーディオ信号が再生される際に音響経路のｆ特の影響を受け基音レベルＬ_ｂと高調波レベルＬ_Ｎのバランスが崩れてしまい逆転現象が発生すると、聴感上耳障りに感じてしまうことを意味する。 Here, the balance of the fundamental tone level L _b and harmonic level L _N, from listening experiments as shown in FIG. 8 (B), is relatively large harmonic level L _N than the fundamental tone level L _b in the music signal When this phenomenon occurs (hereinafter referred to as the “reversal phenomenon”), it has been found that the user feels harsh on hearing. This means that when the audio signal is the fundamental tone level L _b and reversal phenomenon collapses balanced harmonic level L _N affected f-characteristic of the acoustic path is generated when it is played, feels the sense of hearing jarring Means that.

このような影響を補正するため音響経路のｆ特をフラットにする必要性がある。ｆ特をフラットにする方法として逆フィルタによる補正が考えられる。しかしFIRフィルタで音響経路のｆ特をフラットにするほどの特性を実現するためには多大なタップ数が必要であり、現在の信号処理性能では非常に困難であった。 In order to correct such influence, it is necessary to flatten the f characteristic of the acoustic path. As a method of flattening the f characteristic, correction by an inverse filter can be considered. However, a large number of taps are required to realize characteristics that flatten the f-characteristics of the acoustic path with the FIR filter, which is very difficult with the current signal processing performance.

そのため、短いタップのフィルタを使用せざるを得なく、ｆ特上にピークやディップが残ってしまっていた。つまり従来の方法では、上述の逆転現象により発生する聴感上の耳障りを抑制することができなかった。 For this reason, a filter with a short tap has to be used, and peaks and dips remain in the f. That is, in the conventional method, it was not possible to suppress the audible harshness caused by the above-described reversal phenomenon.

また、聴取室のｆ特は聴取位置によって変化してしまう。そのため、ｆ特をフラットにする逆フィルタが実現できたとしても、聴取位置以外の点（例えば聴取位置から１ｍずれた点）における音響経路のｆ特のピークやディップを増大させてしまう可能性があった。このように、従来のような音響経路のｆ特に対する逆フィルタによる補正では、聴取位置以外の場所では耳障り音の発生を助長してしまう可能性があった。 In addition, the f characteristic of the listening room changes depending on the listening position. Therefore, even if an inverse filter that flattens the f characteristic can be realized, there is a possibility that the peak or dip of the f characteristic of the acoustic path at a point other than the listening position (for example, a point shifted by 1 m from the listening position) may be increased. there were. As described above, in the conventional correction by the inverse filter for the f-characteristic of the acoustic path, there is a possibility that the generation of the harsh sound is promoted at a place other than the listening position.

そこで、本発明は、特定の聴取位置において基音と高調波の逆転による耳障り音の発生を抑制することを可能にする。また、特定の聴取位置以外の位置においても、補正によって、耳障り音の発生を助長させないようにする。 Therefore, the present invention makes it possible to suppress the generation of an annoying sound due to the inversion of the fundamental tone and the harmonics at a specific listening position. In addition, even at positions other than the specific listening position, the generation of the harsh sound is not promoted by correction.

本発明の一側面に係る音響処理装置は、入力した音響信号の基音周波数及び高調波周波数を算出する算出手段と、音響経路の少なくとも一部における音響特性を入力する入力手段と、前記音響信号と前記音響特性との畳み込み演算を行う演算手段と、前記演算手段での演算により得られた信号について、前記基音周波数の振幅レベルである基音レベルと、前記高調波周波数の振幅レベルである高調波レベルとを検出する第１のレベル検出手段と、前記高調波レベルが前記基音レベルより高い場合に、該基音レベルが該高調波レベルよりも高いレベルとなるように該基音レベル又は該高調波レベルを調整する調整手段とを有することを特徴とする。 An acoustic processing apparatus according to one aspect of the present invention includes a calculation unit that calculates a fundamental frequency and a harmonic frequency of an input acoustic signal, an input unit that inputs acoustic characteristics in at least a part of an acoustic path, and the acoustic signal. A calculation means for performing a convolution calculation with the acoustic characteristics, and a fundamental level that is an amplitude level of the fundamental frequency and a harmonic level that is an amplitude level of the harmonic frequency for the signal obtained by the calculation in the calculation means First level detection means for detecting the fundamental level or the harmonic level so that the fundamental level is higher than the harmonic level when the harmonic level is higher than the fundamental level. And adjusting means for adjusting.

本発明によれば、音響経路の周波数特性の影響で発生する基音レベルと高調波レベルとの逆転現象を抑制し、聴感上の耳障りな音が発生するのを抑制することができる。また、特定の聴取位置以外の点においても聴感上の耳障りな音の発生を助長することなく補正を行うことができる。 ADVANTAGE OF THE INVENTION According to this invention, the reversal phenomenon of the fundamental tone level and harmonic level which generate | occur | produce by the influence of the frequency characteristic of an acoustic path can be suppressed, and generation | occurrence | production of the harsh sound on hearing can be suppressed. In addition, correction can be performed at points other than the specific listening position without facilitating the generation of harsh sounds.

（Ａ）及び（Ｂ）は実施形態１における音響処理装置のブロック図、（Ｃ）は実施形態１における音響信号処理部の詳細のブロック図。(A) And (B) is a block diagram of the acoustic processing apparatus in Embodiment 1, (C) is a block diagram of the detail of the acoustic signal processing part in Embodiment 1. FIG. （Ａ）は畳み込みＦＦＴ処理部の出力信号、（Ｂ）はフィルタ生成部で生成される帯域減衰フィルタの特性の例を示す図。(A) is an output signal of a convolution FFT processing part, (B) is a figure which shows the example of the characteristic of the band attenuation | damping filter produced | generated by a filter production | generation part. 実施形態１における音響処理のフローチャート。2 is a flowchart of acoustic processing in the first embodiment. 実施形態１における基音周波数ｆｂ及び高調波周波数ｆＮの検出を説明する図。FIG. 3 is a diagram for explaining detection of a fundamental frequency fb and a harmonic frequency fN in the first embodiment. 実施形態３における音響信号処理部の詳細のブロック図。FIG. 10 is a block diagram showing details of an acoustic signal processing unit in the third embodiment. 実施形態３における逆転判定を説明する図。FIG. 9 is a diagram for explaining reverse rotation determination in the third embodiment. 実施形態４における音響処理装置のブロック図。The block diagram of the sound processing apparatus in Embodiment 4. FIG. （Ａ）は基音及び高調波を説明する図、（Ｂ）は逆転現象を説明する図。(A) is a figure explaining a fundamental tone and a harmonic, (B) is a figure explaining a reverse phenomenon.

以下、添付の図面を参照して、本発明をその好適な実施形態に基づいて詳細に説明する。なお、以下の実施形態において示す構成は一例に過ぎず、本発明は図示された構成に限定されるものではない。 Hereinafter, the present invention will be described in detail based on preferred embodiments with reference to the accompanying drawings. The configurations shown in the following embodiments are merely examples, and the present invention is not limited to the illustrated configurations.

＜実施形態１＞
図１（Ａ）は、実施形態１に係る音響処理装置の構成を示すブロック図である。同図において、信号解析処理部１００は、再生信号入力部１０１、音響特性入力部１０２、音響信号処理部１０３、及び、信号出力部１０４を備える。 <Embodiment 1>
FIG. 1A is a block diagram illustrating a configuration of the sound processing apparatus according to the first embodiment. In FIG. 1, the signal analysis processing unit 100 includes a reproduction signal input unit 101, an acoustic characteristic input unit 102, an acoustic signal processing unit 103, and a signal output unit 104.

再生信号入力部１０１は、ＣＤプレーヤ１又はＤＶＤプレーヤなどの音響再生装置から再生信号としての音響信号を入力する。入力した音響信号がアナログ信号であった場合、後のデジタル信号処理のためにＡ／Ｄ変換を施す。 The reproduction signal input unit 101 inputs an audio signal as a reproduction signal from an audio reproduction device such as a CD player 1 or a DVD player. If the input acoustic signal is an analog signal, A / D conversion is performed for later digital signal processing.

音響特性入力部１０２には、パーソナルコンピュータ（ＰＣ）２等の機器から、あらかじめ測定された音響経路又はその一部における音響特性のデータであるインパルス応答が入力される。なお、音響経路とは、上述したように、音響処理装置から聴取者の耳までの、音響信号が辿る経路をいう。ここで、音響特性入力部１０２に入力されるインパルス応答は、音響経路の少なくとも一部におけるインパルス応答を算出できる形式であればよい。周波数スペクトルや伝達関数の形式で入力された場合は、入力された信号をインパルス応答に変換して出力するようにしてもよい。 The acoustic characteristic input unit 102 receives an impulse response, which is data of acoustic characteristics of a previously measured acoustic path or part thereof, from a device such as a personal computer (PC) 2. As described above, the acoustic path refers to a path followed by an acoustic signal from the acoustic processing device to the listener's ear. Here, the impulse response input to the acoustic characteristic input unit 102 may be in a format that can calculate the impulse response in at least a part of the acoustic path. When input in the form of a frequency spectrum or transfer function, the input signal may be converted into an impulse response and output.

なお、音響特性入力部１０２は、音響特性のデータを外部から入力するかわりに、音響処理装置自体が音響特性を測定できる機能を有していてもよい。そのような例を、図１（Ｂ）を用いて説明する。同図に示す信号解析処理部１００は、図１（Ａ）に示した構成に対して、音響特性測定部３０２及びマイクロホン３１３を更に有する構成である。 Note that the acoustic characteristic input unit 102 may have a function that allows the acoustic processing device itself to measure acoustic characteristics instead of inputting acoustic characteristic data from the outside. Such an example will be described with reference to FIG. The signal analysis processing unit 100 shown in the figure has a configuration further including an acoustic characteristic measurement unit 302 and a microphone 313 in addition to the configuration shown in FIG.

音響特性測定部３０２は、測定信号生成部３１１及び音響特性演算部３１２を含む。測定信号生成部３１１は、聴取室の室内インパルス応答を測定するための測定信号を発生する。一般的には、室内インパルス応答を測定するための信号としては、ＭＬＳ（Maximum Length Sequence）などが使用される。測定信号はその場で演算されてもよいし、あらかじめメモリなどに保持されていてもよい。測定信号生成部３１１で生成れたインパルス応答測定信号はスピーカ４から発音され、聴取位置にセッティングされたマイクロホン３１３で収音される。 The acoustic characteristic measurement unit 302 includes a measurement signal generation unit 311 and an acoustic characteristic calculation unit 312. The measurement signal generator 311 generates a measurement signal for measuring the indoor impulse response of the listening room. In general, MLS (Maximum Length Sequence) or the like is used as a signal for measuring the indoor impulse response. The measurement signal may be calculated on the spot or may be held in advance in a memory or the like. The impulse response measurement signal generated by the measurement signal generation unit 311 is generated by the speaker 4 and collected by the microphone 313 set at the listening position.

音響特性演算部３１２ではマイクロホン３１３で収音された信号と測定信号から相互相関を取ることでインパルス応答が演算される。ここで図１（Ｂ）において算出されるインパルス応答は測定信号が通過した音響経路、すなわちユーザが設置したイコライザ５及びスピーカ４のｆ特と聴取室の音響特性が統合されたインパルス応答が算出される。算出されたインパルス応答は音響特性入力部１０２へ入力される。
このようにして、音響処理装置自体が音響特性測定を行うようにしてもよい。 The acoustic characteristic calculation unit 312 calculates an impulse response by taking a cross-correlation from the signal collected by the microphone 313 and the measurement signal. Here, the impulse response calculated in FIG. 1B is the acoustic path through which the measurement signal has passed, that is, the impulse response in which the f characteristics of the equalizer 5 and the speaker 4 installed by the user and the acoustic characteristics of the listening room are integrated. The The calculated impulse response is input to the acoustic characteristic input unit 102.
In this way, the acoustic processing apparatus itself may perform acoustic characteristic measurement.

音響信号処理部１０３は、再生信号入力部１０１と音響特性入力部１０２からの入力信号を受け、再生信号入力部１０１からの入力信号に所要の音響処理を施す。信号出力部１０４は、音響信号処理部１０３からの信号を受け、アナログ信号を出力する場合にはＤ／Ａ変換を施し出力する。出力された信号はアンプ３などの音響機器に入力されスピーカ４から発音される。 The acoustic signal processing unit 103 receives input signals from the reproduction signal input unit 101 and the acoustic characteristic input unit 102, and performs necessary acoustic processing on the input signals from the reproduction signal input unit 101. The signal output unit 104 receives the signal from the acoustic signal processing unit 103, and performs D / A conversion when outputting an analog signal. The output signal is input to an audio device such as an amplifier 3 and is output from the speaker 4.

なお、図には１チャネル分の経路しか描かれていないが、入力信号がマルチチャネル信号の場合、チャネルごとに上記の処理系統が独立に設けられる。例えば入力信号がステレオ信号ならば、２チャネルの系統を有することになる。 Although only one channel path is shown in the figure, when the input signal is a multi-channel signal, the above processing system is provided independently for each channel. For example, if the input signal is a stereo signal, it has a two-channel system.

次に、上述した実施形態中の音響信号処理部１０３での信号の流れを図１（Ｃ）を参照して説明する。
再生信号入力部１０１より音響信号処理部１０３へ入力された音響信号は、フレーム分割部１１１へ入力される。フレーム分割部１１１は、入力した音響信号を所定の時間長のフレームに分割する。具体的には、後の処理のために信号を窓関数で切り出し、ＦＦＴ処理部１１２、畳み込みＦＦＴ処理部１１４及び遅延処理部１２０へ出力する。ＦＦＴ処理部１１２は、入力信号に対しＦＦＴ処理を施し、それにより得られた周波数スペクトルを基音・高調波周波数検出部１１３へ出力する。基音・高調波周波数検出部１１３は、入力された周波数スペクトルから基音周波数ｆ_ｂ及び高調波周波数ｆ_Ｎを検出し、レベル検出部１１５へ出力する。 Next, a signal flow in the acoustic signal processing unit 103 in the above-described embodiment will be described with reference to FIG.
The acoustic signal input to the acoustic signal processing unit 103 from the reproduction signal input unit 101 is input to the frame dividing unit 111. The frame dividing unit 111 divides the input acoustic signal into frames having a predetermined time length. Specifically, a signal is cut out by a window function for later processing, and is output to the FFT processing unit 112, the convolution FFT processing unit 114, and the delay processing unit 120. The FFT processing unit 112 performs FFT processing on the input signal, and outputs the frequency spectrum obtained thereby to the fundamental / harmonic frequency detection unit 113. The fundamental / harmonic frequency detection unit 113 detects the fundamental frequency f _b and the harmonic frequency f _N from the input frequency spectrum and outputs them to the level detection unit 115.

一方、畳み込みＦＦＴ処理部１１４は、フレーム分割部１１１から出力された１フレームの音響信号と音響特性入力部１０２から入力されたインパルス応答との畳み込み演算、及び畳み込み演算の結果に対するＦＦＴ処理を行う。算出された周波数スペクトルは、レベル検出部１１５へ出力される。 On the other hand, the convolution FFT processing unit 114 performs a convolution operation between the acoustic signal of one frame output from the frame division unit 111 and the impulse response input from the acoustic characteristic input unit 102, and an FFT process on the result of the convolution operation. The calculated frequency spectrum is output to the level detector 115.

レベル検出部１１５は、入力された周波数スペクトル、基音周波数ｆ_ｂ及び高調波周波数ｆ_Ｎから、入力された周波数スペクトルの基音レベルＬ_ｂと高調波レベルＬ_Ｎを検出する。検出した基音レベルＬ_ｂと高調波レベルＬ_Ｎは逆転判定部１１６へ出力される。 The level detector 115 detects the fundamental level L _b and the harmonic level L _N of the input frequency spectrum from the input frequency spectrum, the fundamental frequency f _b and the harmonic frequency f _N. The detected fundamental tone level L _b and harmonic level L _N are output to the reverse rotation determination unit 116.

逆転判定部１１６は、入力された基音レベルＬ_ｂと高調波レベルＬ_Ｎとを比較し、Ｌ_ｂより高い_Ｎが存在した場合、フィルタ生成処理実行を指示する制御信号をフィルタ生成部１１７へ出力する。一方、全てのＬ_ＮがＬ_ｂ以下であった場合、フィルタ生成部１１７へフィルタ生成及び適用処理停止を指示する制御信号を出力する。 Reversal determination unit 116 compares the fundamental tone level L _b input harmonics level L _N, if higher than L _b _N is present, outputs a control signal for instructing the filter generation process executed to the filter generation unit 117 To do. On the other hand, when all L _N are equal to or less than L _b , a control signal that instructs the filter generation unit 117 to stop the filter generation and application processing is output.

フィルタ生成部１１７は、制御信号を受けて、フィルタ生成の処理を行う。制御信号がフィルタ生成処理実行を示す場合は、次のような処理行う。例えば、畳み込みＦＦＴ処理部１１４の出力が図２（Ａ）のようであった場合、図２（Ｂ）のように全てのＬ_ＮについてＬｂ＞Ｌ_Ｎとなるよう高調波レベルを減衰させる帯域減衰フィルタを生成し、フィルタ適用部１１８へ出力する。ここで本実施形態では信号の増幅によるデータのクリッピングを避けるために高調波レベルを減衰させるが、Ｌ_ｂ＞Ｌ_Ｎとなるよう基音レベルを増幅させる帯域増幅フィルタを生成してもよい。さらにＬ_ｂ＞Ｌ_Ｎとなるよう基音レベルを増幅させる帯域増幅フィルタと高調波レベルを減衰させる帯域減衰フィルタの両方を生成してもよい。しかし、増幅フィルタによってデータがクリッピングしないように注意する必要がある。 The filter generation unit 117 receives the control signal and performs filter generation processing. When the control signal indicates execution of filter generation processing, the following processing is performed. For example, when the output of the convolution FFT processing unit 114 is as shown in FIG. 2A, the band attenuation that attenuates the harmonic level so that Lb> L _N for all L _N as shown in FIG. 2B. A filter is generated and output to the filter application unit 118. In this embodiment, the harmonic level is attenuated in order to avoid clipping of data due to signal amplification. However, a band amplification filter for amplifying the fundamental level may be generated so that L _b > L _N. Furthermore, both a band amplification filter that amplifies the fundamental sound level and a band attenuation filter that attenuates the harmonic level may be generated so that L _b > L _N. However, care must be taken so that the data is not clipped by the amplification filter.

遅延処理部１２０は、フィルタ生成までの処理で発生する時間遅延を考慮して遅延処理を施し、フィルタ適用部１１８へ出力する。フィルタ適用部１１８は、入力されたフィルタを、遅延処理部１２０の出力に対して適用し、フレーム結合部１１９へ出力する。 The delay processing unit 120 performs delay processing in consideration of the time delay that occurs in the processing up to filter generation, and outputs the result to the filter application unit 118. The filter applying unit 118 applies the input filter to the output of the delay processing unit 120 and outputs the applied filter to the frame combining unit 119.

逆転判定部１１６の出力がフィルタ生成及び適用停止を指示する制御信号であった場合、フィルタ生成部１１７及びフィルタ適用部１１８は処理を実行せず、フレーム分割部１１１の出力をそのままフレーム結合部１１９へ出力する。 When the output of the reverse rotation determination unit 116 is a control signal for instructing filter generation and application stop, the filter generation unit 117 and the filter application unit 118 do not execute the process, and the output of the frame division unit 111 is used as it is. Output to.

最後に、フレーム結合部１１９は、フレームごと処理されたフィルタ適用部１１８の出力を結合し、信号出力部１０４へ出力する。 Finally, the frame combination unit 119 combines the outputs of the filter application unit 118 processed for each frame and outputs the combined result to the signal output unit 104.

次に、音響信号処理部１０３での処理のフローを、図３を用いて詳細に説明する。
Ｓ２０１では、フレーム分割部１１１において再生信号の分割が行われる。すなわち、再生信号入力部１０１から入力された再生信号の時間波形信号を切り出す。ここで信号を切り出す窓関数は切り出した信号を再結合した際にデータ量が増減しないような窓関数を使用する。Ｓ２０２乃至Ｓ２１１の処理はこのフレーム単位で処理が行われ、処理が行われた信号はＳ２１２で結合される。 Next, the flow of processing in the acoustic signal processing unit 103 will be described in detail with reference to FIG.
In S201, the reproduction signal is divided in the frame dividing unit 111. That is, the time waveform signal of the reproduction signal input from the reproduction signal input unit 101 is cut out. Here, the window function that cuts out the signal uses a window function that does not increase or decrease the amount of data when the cut out signals are recombined. The processes in S202 to S211 are performed in units of frames, and the processed signals are combined in S212.

Ｓ２０２では、ＦＦＴ処理部１１２においてフレーム分割部１１１からの出力信号に対して周波数スペクトルを算出するためにＦＦＴ処理が行われる。Ｓ２０３乃至Ｓ２０５は、基音・高調波周波数検出部１１３で処理される。Ｓ２０３乃至Ｓ２０５の基音周波数検出、高調波周波数の算出、未検出ピークの有無の判断については、図４を用いて説明する。 In S <b> 202, the FFT processing unit 112 performs FFT processing to calculate a frequency spectrum for the output signal from the frame dividing unit 111. Steps S <b> 203 to S <b> 205 are processed by the fundamental / harmonic frequency detection unit 113. The fundamental frequency detection, harmonic frequency calculation, and determination of the presence or absence of undetected peaks in S203 to S205 will be described with reference to FIG.

はじめに、Ｓ２０３で、最大ピーク周波数検出が行われる。Ｓ２０２の処理によって算出された周波数スペクトル（図４（Ａ））から振幅レベルの最大値を検出し、その値が所定の振幅レベル以上である場合は、その最大値の周波数を算出する。ここで所定の振幅レベルとは信号とノイズを分離するために設定される値である。そのため、値は例えば図４（Ａ）の最大振幅レベルの半分の値(約−６ｄＢ)としてもよいし、ユーザが設定できるようにしてもよく、その他の方法を用いて値を決定してもよい。 First, in S203, maximum peak frequency detection is performed. The maximum value of the amplitude level is detected from the frequency spectrum (FIG. 4A) calculated by the process of S202, and when the value is equal to or higher than the predetermined amplitude level, the frequency of the maximum value is calculated. Here, the predetermined amplitude level is a value set for separating a signal and noise. Therefore, for example, the value may be a half value (about −6 dB) of the maximum amplitude level in FIG. 4A, may be set by the user, or may be determined using other methods. Good.

本実施形態では、図４（Ｂ）に示すように、算出した最大値の周波数を基音周波数ｆ_ｂ（ｎ）とする。ここでｎはＳ２０３及びＳ２０４の繰り返し回数を示すものであり、図４（Ｂ）では１回目の検出であるため、ｆ_ｂ(１)となっている。繰り返しの条件など詳細は後のＳ２０５の説明において述べる。 In the present embodiment, as shown in FIG. 4B, the calculated maximum frequency is set as the fundamental frequency f _b (n). Here, n indicates the number of repetitions of S203 and S204, and is f _b (1) in FIG. _4B because it is the first detection. Details such as the repetition condition will be described later in the description of S205.

Ｓ２０４では、基音周波数ｆ_ｂ（ｎ）に対してＮ倍(Ｎ＝２，３，４，・・・)の周波数をＮ次高調波周波数ｆ_Ｎ(ｎ)として算出する。 In S204, the frequency N times (N = 2, 3, 4,...) With respect to the fundamental frequency f _b (n) is calculated as the _Nth harmonic frequency f _N (n).

ｆ_Ｎ（ｎ） = ｆ_ｂ（ｎ）＊Ｎ（Ｎ＝２，３，４，・・・）
ここで、楽曲中の高調波周波数は必ずしも基音周波数の整数倍でなく、多少周波数がずれている可能性がある。そこで算出した高調波周波数ｆ_Ｎ(ｎ)に対応するＳ２０２の周波数スペクトルを参照し、所定の範囲内にスペクトルピークがあった場合、そのピークに対応する周波数を高調波周波数ｆ_Ｎ(ｎ)としてもよい。 f _N (n) = f _b (n) * N (N = 2, 3, 4,...)
Here, the harmonic frequency in the music is not necessarily an integral multiple of the fundamental frequency, and the frequency may be slightly shifted. Therefore, referring to the frequency spectrum of S202 corresponding to the calculated harmonic frequency f _N (n), when there is a spectrum peak within a predetermined range, the frequency corresponding to the peak is set as the harmonic frequency f _N (n). Also good.

Ｓ２０５では、Ｓ２０２の周波数スペクトルにおいて基音周波数ｆ_ｂ（ｎ）及び高調波周波数ｆ_Ｎ(ｎ)以外に所定の振幅レベルを超えるスペクトルピークがあるか否かを判定する。図４（Ａ）に示したＳ２０２のスペクトルに対し、すでに検出した基音周波数ｆ_ｂ(１)及び高調波周波数ｆ_Ｎ(１)の周波数成分を除いたスペクトル（図４（Ｃ））において所定の振幅レベルを超える信号があるかないかを判断する。所定の振幅レベルを超える信号があると判断された場合は、図４（Ｄ）の如く、Ｓ２０５において所定の振幅レベルを超える信号がなくなるまでＳ２０３及びＳ２０４の処理を行う。 In S205, it is determined whether or not there is a spectrum peak exceeding a predetermined amplitude level in addition to the fundamental frequency f _b (n) and the harmonic frequency f _N (n) in the frequency spectrum of S202. In the spectrum (FIG. 4C) obtained by removing the frequency components of the fundamental frequency f _b (1) and the harmonic frequency f _N (1) already detected from the spectrum of S202 shown in FIG. Determine if any signal exceeds the amplitude level. If it is determined that there is a signal exceeding the predetermined amplitude level, the processing of S203 and S204 is performed until there is no signal exceeding the predetermined amplitude level in S205 as shown in FIG.

Ｓ２０６乃至Ｓ２０７の処理は畳み込みＦＦＴ処理部１１４において行われる。Ｓ２０６では音響特性入力部１０２より入力されたインパルス応答と、Ｓ２０１でフレーム分割された信号の畳み込み演算が行われる。Ｓ２０７ではＳ２０６の畳み込み演算の結果に対してＦＦＴ処理を施し、Ｓ２０１でフレーム分割された信号に音響特性入力部１０２より入力されたインパルス応答を畳み込んだ信号の周波数スペクトルを算出する。この周波数スペクトルは再生信号の周波数スペクトルが入力された音響特性の影響を受けて変化した結果を示している。 The processing from S206 to S207 is performed by the convolution FFT processing unit 114. In S206, the convolution calculation of the impulse response input from the acoustic characteristic input unit 102 and the signal divided in S201 is performed. In S207, FFT processing is performed on the result of the convolution operation in S206, and the frequency spectrum of the signal obtained by convolving the impulse response input from the acoustic characteristic input unit 102 with the signal divided in S201 is calculated. This frequency spectrum shows the result of changing the frequency spectrum of the reproduction signal under the influence of the inputted acoustic characteristics.

Ｓ２０８の処理はレベル検出部１１５で行われる。Ｓ２０８ではＳ２０３乃至Ｓ２０５によって検出された全ての基音周波数ｆ_ｂ（ｎ）と全ての高調波周波数ｆ_Ｎ(ｎ)についてＳ２０７で算出した周波数スペクトルにおける基音レベルＬ_ｂ(ｎ)及び高調波レベルＬ_Ｎ(ｎ)を算出する。 The process of S208 is performed by the level detection unit 115. In S208, the fundamental level L _b (n) and the harmonic level L _N in the frequency spectrum calculated in S207 for all fundamental frequencies f _b (n) and all harmonic frequencies f _N (n) detected in S203 to S205. (n) is calculated.

Ｓ２０９の処理は逆転判定部１１６で行われる。Ｓ２０９ではそれぞれｎについて独立に基音レベルＬ_ｂ(ｎ)と高調波レベルＬ_Ｎ(ｎ)を比較する。全てのｎについて比較を行った結果、基音レベルＬ_ｂ(ｎ)よりも高調波レベルＬ_Ｎ(ｎ)が大きくなっている逆転現象が見つかった場合、フィルタ生成の制御信号をフィルタ生成部１１７へ出力する。一方、逆転現象が見つからなかった場合、フィルタ生成部１１７及びフィルタ適用部１１８へフィルタ生成及びフィルタ適用を止める制御信号を出力し、Ｓ２１０乃至Ｓ２１１の処理は行われない。 The process of S209 is performed by the reverse rotation determination unit 116. In S209, the fundamental tone level L _b (n) and the harmonic level L _N (n) are compared independently for each n. As a result of comparison for all n, if a reverse phenomenon is found in which the harmonic level L _N (n) is higher than the fundamental tone level L _b (n), a filter generation control signal is sent to the filter generation unit 117. Output. On the other hand, when the reverse phenomenon is not found, a control signal for stopping filter generation and filter application is output to the filter generation unit 117 and the filter application unit 118, and the processing of S210 to S211 is not performed.

Ｓ２１０の処理はフィルタ生成部１１７で行われる。Ｓ２１０ではＳ２０９で検出した逆転現象を起こしている高調波周波数ｆ_Ｎ(ｎ)に対して高調波レベルＬ_Ｎ(ｎ)が基音レベルＬ_ｂ(ｎ)より低くなるように帯域減衰フィルタ(ノッチフィルタ)を生成する。前述したとおり、基音周波数に対して基音レベルＬ_ｂ(ｎ)が高調波レベルＬ_Ｎ(ｎ)より高くなるように帯域増幅フィルタを生成してもよいし、その両方を生成してもよい。ここで生成させるフィルタは、フィルタ処理を施すピークに隣接するピークに影響を及ぼさないような帯域幅に設計されることが望ましい。 The process of S210 is performed by the filter generation unit 117. In S210, the band attenuation filter (notch filter) is set so that the harmonic level L _N (n) is lower than the fundamental level L _b (n) with respect to the harmonic frequency f _N (n) causing the reverse phenomenon detected in S209. ) Is generated. As described above, the band amplification filter may be generated such that the fundamental level L _b (n) is higher than the harmonic level L _N (n) with respect to the fundamental frequency, or both of them may be generated. The filter generated here is preferably designed to have a bandwidth that does not affect the peak adjacent to the peak to be filtered.

Ｓ２１１の処理はフィルタ適用部１１８で行われる。Ｓ２１１ではＳ２１０で生成したフィルタをＳ２０１でフレーム分割された信号に対して適用する。ここでＳ２０１の出力はＳ２０２乃至Ｓ２１０の処理時間を鑑み遅延処理部１２０で遅延処理されている。 The process of S211 is performed by the filter application unit 118. In S211, the filter generated in S210 is applied to the signal divided in S201. Here, the output of S201 is delayed by the delay processing unit 120 in consideration of the processing time of S202 to S210.

Ｓ２１２の処理はフレーム結合部１１９で行われる。Ｓ２１２ではフレームごとに処理されたＳ２１１の出力を結合していく。結合された信号は、信号出力部１０４へ出力される。 The process of S212 is performed by the frame combining unit 119. In S212, the outputs of S211 processed for each frame are combined. The combined signal is output to the signal output unit 104.

このような構成によれば、楽曲の高調波に対してのみ帯域減衰フィルタを適用するため、聴取位置以外の場所においても聴感上の耳障り音の発生を助長させることなく、聴取位置における耳障り音を抑制できるという利点がある。 According to such a configuration, since the band attenuation filter is applied only to the harmonics of the music, the jarring sound at the listening position can be generated without promoting the generation of the jarring sound on the audibility even at a place other than the listening position. There is an advantage that it can be suppressed.

＜実施形態２＞
以下では、聴取位置が複数ある場合の実施形態について説明する。
音響信号処理部の構成は図１（Ｃ）と同様である。本実施形態においては、説明のため、聴取点のｆ特（音圧周波数特性）を、Ｇ_LP(ｆ) (ＬＰ＝Ａ，Ｂ，Ｃ，・・・)で表す。ここでＡ，Ｂ，Ｃ，・・・はそれぞれ聴取位置を示し、ｆは周波数を示す。音響特性入力部１０２に複数聴取点のｆ特Ｇ_LP(ｆ)が入力された場合、畳み込みＦＦＴ処理部１１４ではそれぞれ聴取位置LPについて独立にＧ_LP(ｆ)とフレーム分割部１１１の出力との畳み込み演算及び演算結果に対するFFT処理が行われる。このようにして聴取位置ごと独立に周波数スペクトルが算出される。 <Embodiment 2>
Hereinafter, an embodiment when there are a plurality of listening positions will be described.
The configuration of the acoustic signal processing unit is the same as that in FIG. In the present embodiment, for the sake of explanation, the f characteristic (sound pressure frequency characteristic) of the listening point is represented by G _LP (f) (LP = A, B, C,...). Here, A, B, C,... Each indicate a listening position, and f indicates a frequency. When a plurality of listening points f-specific G _LP (f) are input to the acoustic characteristic input unit 102, the convolution FFT processing unit 114 independently calculates G _LP (f) and the output of the frame dividing unit 111 for each listening position LP. FFT processing is performed on the convolution operation and the operation result. In this way, the frequency spectrum is calculated independently for each listening position.

Ｓ２０８乃至Ｓ２０９の処理も聴取位置ＬＰごと独立に行われる。Ｓ２０９において逆転現象が検出された場合、逆転現象を抑制するフィルタを生成する。すなわち、全ての聴取位置ＬＰにおいて高調波レベルＬ_ＮLP(ｎ)が基音レベルＬｂ_LP(ｎ)より相対的に小さくなるようにフィルタを生成する。ここで、ユーザが各聴取位置ＬＰに優先度をつけ、その優先度に応じて優先度が高い場所において高調波レベルＬ_ＮLP(ｎ)が基音レベルＬ_ｂLP(ｎ)より相対的に小さくなるようにフィルタを生成してもよい。また、過半数の聴取位置ＬＰにおいて高調波レベルＬ_ＮLP(ｎ)が基音レベルＬ_ｂLP(ｎ)より相対的に小さくなるようにフィルタを生成してもよい。 The processing from S208 to S209 is also performed independently for each listening position LP. If a reverse phenomenon is detected in S209, a filter that suppresses the reverse phenomenon is generated. That is, the filter is generated so that the harmonic level L _NLP (n) is relatively smaller than the fundamental level Lb _LP (n) at all listening positions LP. Here, the user gives priority to each listening position LP, and the harmonic level L _NLP (n) is relatively smaller than the fundamental _sound level L _bLP (n) in a place where the priority is high according to the priority. A filter may be generated. Alternatively, the filter may be generated so that the harmonic level L _NLP (n) is relatively smaller than the fundamental level L _bLP (n) at the majority of the listening positions LP.

このように、各処理部はそれぞれ聴取位置LPについて独立に動作する。このようにすれば聴取者が複数人数の場合においてもそれぞれの聴取位置において耳障り音を抑制できる。 Thus, each processing unit operates independently for the listening position LP. In this way, even when there are a plurality of listeners, it is possible to suppress harsh sounds at each listening position.

＜実施形態３＞
図５は本発明の第３の実施形態に係る音響信号処理部１０３の構成をブロック図で示したものである。図１（Ｃ）に対して第２のレベル検出部２１５が追加されている。また、本実施形態において基音・高調波周波数検出部１１３では基音周波数検出方法として最大ピーク検出を用いた基音検出方法以外の方法を用いてもよい。例えば既存の音階検出(ピッチ検出)装置を用いて検出された音階を基音としてもよい。また逆転判定部１１６には第１のレベル検出部１１５の出力だけでなく第２のレベル検出部２１５の出力も入力される。なお他の構成は他の実施形態と同様である。 <Embodiment 3>
FIG. 5 is a block diagram showing the configuration of the acoustic signal processing unit 103 according to the third embodiment of the present invention. A second level detection unit 215 is added to FIG. In the present embodiment, the fundamental / harmonic frequency detection unit 113 may use a method other than the fundamental detection method using maximum peak detection as the fundamental frequency detection method. For example, a scale detected using an existing scale detection (pitch detection) device may be used as a fundamental tone. Further, not only the output of the first level detection unit 115 but also the output of the second level detection unit 215 is input to the reverse rotation determination unit 116. Other configurations are the same as those of the other embodiments.

レベル検出部１１５では畳み込みＦＦＴ処理部１１４の出力に対してＳ２０８の基音レベルＬ_ｂ(ｎ)検出及び高調波レベルＬ_Ｎ(ｎ)検出処理が行われる。レベル検出部２１５ではＦＦＴ処理部１１２の出力に対してＳ２０８の基音レベルＬ_sｂ(ｎ)及び高調波レベルＬ_sＮ(ｎ)の検出処理が行われる。 In the level detection unit 115, the fundamental level L _b (n) detection and the harmonic level L _N (n) detection processing of S208 are performed on the output of the convolution FFT processing unit 114. The level detection unit 215 performs the detection processing of the fundamental level L _sb (n) and the harmonic level L _sN (n) in S208 on the output of the FFT processing unit 112.

逆転判定部１１６では、レベル検出部１１５の出力に対して、それぞれｎについて独立に基音レベルＬ_ｂ(ｎ)と高調波レベルＬ_Ｎ(ｎ)の比較を行う。またレベル検出部２１５の出力に対しても、それぞれｎについて独立に基音レベルＬ_sｂ(ｎ)と高調波レベルＬ_sＮ(ｎ)の比較を行う。 The reverse rotation determination unit 116 compares the fundamental level L _b (n) and the harmonic level L _N (n) for n independently of the output of the level detection unit 115. Also, the fundamental level L _sb (n) and the harmonic level L _sN (n) are compared independently for each of the outputs of the level detection unit 215.

比較結果の一例を図６に示す。図６（Ａ）の如く上記比較の結果においてＬ_sｂ(ｎ)とＬ_sN(ｎ)の比較からは発見されなかった逆転現象がＬ_ｂ(ｎ)とＬ_Ｎ(ｎ)の比較において１つ以上発見された場合、フィルタ生成部１１７へフィルタ生成の制御信号を出力する。一方、図６（Ｂ）の如くＦＦＴ処理部１１２の出力において既に逆転が検出されている場合や逆転現象が起こっていなかった場合、フィルタ生成部１１７及びフィルタ適用部１１８へフィルタ生成及びフィルタ適用を止める制御信号を出力する。したがってＳ２１０乃至Ｓ２１１の処理は行われない。 An example of the comparison result is shown in FIG. As shown in FIG. _6A, there is one reversal phenomenon in the comparison of L _b (n) and L _N (n) that was not found from the comparison of L _sb (n) and L _sN (n). When it is found as described above, it outputs a filter generation control signal to the filter generation unit 117. On the other hand, when the reverse rotation is already detected in the output of the FFT processing unit 112 as shown in FIG. 6B or when the reverse rotation phenomenon has not occurred, the filter generation unit 117 and the filter application unit 118 are subjected to filter generation and filter application. Output a control signal to stop. Accordingly, the processing from S210 to S211 is not performed.

フィルタ生成部１１７では、逆転判定部１１６からの制御信号を受け逆転抑制フィルタの生成を行う。 The filter generation unit 117 receives the control signal from the reverse rotation determination unit 116 and generates a reverse rotation suppression filter.

上記比較の結果、Ｌ_sｂ(ｎ)＞Ｌ_sＮ(ｎ)、かつ、Ｌ_ｂ(ｎ)＜Ｌ_Ｎ(ｎ)である基音周波数ｆ_ｂ（ｎ）及び高調波周波数ｆ_Ｎ（ｎ）については次のようにする。すなわち、実施形態１の如く高調波レベルＬ_Ｎ(ｎ)が基音レベルＬ_ｂ(ｎ)よりも低くなるように帯域減衰フィルタを生成する。ここでフィルタは以下のように生成してもよい。 As a result of the comparison, the fundamental frequency f _b (n) and the harmonic frequency f _N (n) _satisfying L _sb (n)> L _sN (n) and L _b (n) <L _N (n) Do as follows. That is, the band attenuation filter is generated so that the harmonic level L _N (n) is lower than the fundamental sound level L _b (n) as in the first embodiment. Here, the filter may be generated as follows.

上記比較の結果が図６（Ｃ）の如くＬ_sｂ(ｎ)＞Ｌ_sN(ｎ)かつＬ_ｂ(ｎ)＜Ｌ_Ｎ(ｎ)かつＬ_Ｎ(ｎ)＞Ｌ_sＮ(ｎ)である場合、次のようにする。すなわち、高調波周波数ｆ_Ｎ(ｎ)に対してＬ_ｂ(ｎ) ＞Ｌ_Ｎ(ｎ)となるように帯域減衰フィルタ（ノッチフィルタ）を生成する。 When the result of the comparison is L _sb (n)> L _sN (n) and L _b (n) <L _N (n) and L _N (n)> L _sN (n) as shown in FIG. And do the following: That is, the band attenuation filter (notch filter) is generated so that L _b (n)> L _N (n) with respect to the harmonic frequency f _N (n).

また、上記比較の結果が図６（Ｄ）の如くＬ_sｂ(ｎ)＞Ｌ_sＮ(ｎ)かつＬ_ｂ(ｎ)＜Ｌ_Ｎ(ｎ)かつＬ_ｂ(ｎ)＜Ｌ_sｂ(ｎ)である場合は次のようにする。すなわち、基音周波数ｆ_ｂ（ｎ）に対して、Ｌ_ｂ(ｎ) ＞Ｌ_Ｎ(ｎ)となるように帯域増幅フィルタ(ピークフィルタ)を生成する。ここで、補正量すなわち補正フィルタのゲインは補正後の基音レベルと高調波レベルの比率が切り出し信号の比率と略等しくなるように設定されることが望ましい。 Further, as shown in FIG. 6D, the result of the comparison is L _sb (n)> L _sN (n), L _b (n) <L _N (n), and L _b (n) <L _sb (n). If there is, do the following: That is, a band amplification filter (peak filter) is generated so that L _b (n)> L _N (n) with respect to the fundamental frequency f _b (n). Here, it is desirable that the correction amount, that is, the gain of the correction filter, is set so that the ratio between the fundamental level and the harmonic level after correction is substantially equal to the ratio of the cut-out signal.

このような構成によれば、楽曲の基音に対して帯域増幅フィルタを適用し、高調波に対して帯域減衰フィルタを適用する。このため、聴取位置以外の場所においても聴感上の耳障り音の発生を助長させることなく、聴取位置における耳障り音を抑制できるという利点がある。さらに、実施形態１と比較して基音レベルＬ_ｂ(ｎ)と高調波レベルＬ_Ｎ(ｎ)のバランスを基音レベルＬ_sｂ(ｎ)と高調波レベルＬ_sＮ(ｎ)のバランスに近づけることができるため、より理想的な補正が可能である。 According to such a configuration, the band amplification filter is applied to the fundamental tone of the music, and the band attenuation filter is applied to the harmonics. For this reason, there is an advantage that it is possible to suppress the jarring sound at the listening position without promoting the generation of the jarring sound in the sense of hearing even at a place other than the listening position. Furthermore, compared with the first embodiment, the balance between the fundamental level L _b (n) and the harmonic level L _N (n) can be made closer to the balance between the fundamental level L _sb (n) and the harmonic level L _sN (n). Therefore, more ideal correction is possible.

＜実施形態４＞
試聴実験から、高域に関してはスピーカのｆ特と聴感の相関性が高く、低域は聴取位置で測定したｆ特と聴感の相関性が高いという結果が出ている。したがって、スピーカのｆ特と聴取位置で測定したｆ特が独立に入力されるように構成してもよい。また、グラフィックイコライザなどのユーザがｆ特を調整できる音響機器のｆ特は、逐次ユーザによって更新される可能性があるため、独立に入力されることが望ましい。その他にもスピーカが複数設置される場合、各スピーカごとに音響経路が異なるため、複数のインパルス応答の入力が必要となる。 <Embodiment 4>
From the trial listening experiment, it has been found that the high frequency has a high correlation between the f characteristic of the speaker and the audibility, and the low frequency has a high correlation between the f characteristic measured at the listening position and the audibility. Therefore, the f characteristic of the speaker and the f characteristic measured at the listening position may be input independently. In addition, the f-feature of an audio device such as a graphic equalizer that allows the user to adjust the f-feature may be sequentially updated by the user, so it is desirable that the f-feature be input independently. In addition, when a plurality of speakers are installed, an acoustic path is different for each speaker, and thus it is necessary to input a plurality of impulse responses.

このように、本発明において、音響特性入力部１０２に入力される音響特性は１つとは限らず、また、入力した音響特性を必ずしも全て使用する必要がないため、以下のような実施形態をとってもよい。 In this way, in the present invention, the acoustic characteristic input to the acoustic characteristic input unit 102 is not limited to one, and it is not always necessary to use all the input acoustic characteristics. Therefore, the following embodiment can be adopted. Good.

図７は、本発明の第４の実施形態に係る音響処理装置の構成を示すブロック図である。
図７では、図１（Ｃ）の音響特性入力部１０２が音響特性入力部４０２に置き換えられている。音響特性入力部４０２は、メモリ部４１２、選択部４１３、インパルス応答演算部４１１を含む。その他の構成は実施形態１と同様である。 FIG. 7 is a block diagram showing a configuration of a sound processing apparatus according to the fourth embodiment of the present invention.
In FIG. 7, the acoustic characteristic input unit 102 in FIG. 1C is replaced with an acoustic characteristic input unit 402. The acoustic characteristic input unit 402 includes a memory unit 412, a selection unit 413, and an impulse response calculation unit 411. Other configurations are the same as those of the first embodiment.

音響特性入力部４０２には例えばＰＣ２などが接続され、音響経路の少なくとも一部におけるインパルス応答が入力され、メモリ部４１２に記憶される。ここでメモリ部４１２には上述したような複数の音響特性が記憶されるため、あらかじめ記憶する音響特性に名前をつけるなど、それぞれの音響特性が何の音響特性か特定できるようにされていることが望ましい。 For example, a PC 2 or the like is connected to the acoustic characteristic input unit 402, and an impulse response in at least a part of the acoustic path is input and stored in the memory unit 412. Here, since the plurality of acoustic characteristics as described above are stored in the memory unit 412, it is possible to specify what acoustic characteristics each acoustic characteristic, such as naming the acoustic characteristics stored in advance. Is desirable.

選択部４１３では補正に使用される音響特性が選択される。ここで選択部４１３は補正に使用する音響特性をユーザ自身が選択できるようにメモリ部４１２に記憶されている音響特性を特定する情報を表示する表示装置やどの音響特性を選択するのかを切り替える切り替えスイッチなどを備えていてもよい。また、選択部４１３は再生信号入力部１０１からの入力信号によって自動的に使用する音響特性を選択するようにしてもよい。例えば、マルチチャネルの音響特性がチャネルごと独立に記憶されている場合、再生信号入力部１０１に入力された再生信号がステレオかマルチチャネルかによって使用する音響特性を自動で選択するようにしてもよい。 The selection unit 413 selects an acoustic characteristic used for correction. Here, the selection unit 413 switches the display device that displays information specifying the acoustic characteristics stored in the memory unit 412 and which acoustic characteristics are selected so that the user can select the acoustic characteristics to be used for correction. A switch or the like may be provided. Further, the selection unit 413 may select an acoustic characteristic to be automatically used according to an input signal from the reproduction signal input unit 101. For example, when multi-channel acoustic characteristics are stored independently for each channel, the acoustic characteristics to be used may be automatically selected depending on whether the reproduction signal input to the reproduction signal input unit 101 is stereo or multi-channel. .

インパルス応答演算部４１１では選択部４１３で選択された音響特性から補正に使用するインパルス応答を演算し出力する。ここで、インパルス応答演算部４１１において選択した音響特性の優先度や使用する帯域などを入力する装置を設け、ユーザがそれらのパラメータを任意に設定できるようにし、補正に使用するインパルス応答を演算するようにしてもよい。 The impulse response calculation unit 411 calculates and outputs an impulse response used for correction from the acoustic characteristics selected by the selection unit 413. Here, a device for inputting the priority of the acoustic characteristics selected by the impulse response calculation unit 411, the band to be used, and the like is provided so that the user can arbitrarily set those parameters and calculate the impulse response used for correction. You may do it.

このようにすれば、複数の音響特性が入力された場合においても補正に使用するインパルス応答をユーザが簡単に選択できる。 In this way, even when a plurality of acoustic characteristics are input, the user can easily select an impulse response used for correction.

＜他の実施形態＞
また、本発明は、以下の処理を実行することによっても実現される。即ち、上述した実施形態の機能を実現するソフトウェア（プログラム）を、ネットワーク又は各種記憶媒体を介してシステム或いは装置に供給し、そのシステム或いは装置のコンピュータ（又はＣＰＵやＭＰＵ等）がプログラムを読み出して実行する処理である。この場合、そのプログラム、及び該プログラムを記憶した記憶媒体は本発明を構成することになる。 <Other embodiments>
The present invention can also be realized by executing the following processing. That is, software (program) that realizes the functions of the above-described embodiments is supplied to a system or apparatus via a network or various storage media, and a computer (or CPU, MPU, etc.) of the system or apparatus reads the program. It is a process to be executed. In this case, the program and the storage medium storing the program constitute the present invention.

Claims

A calculating means for calculating a fundamental frequency and a harmonic frequency of the input acoustic signal;
Input means for inputting acoustic characteristics in at least part of the acoustic path;
Arithmetic means for performing a convolution operation between the acoustic signal and the acoustic characteristics;
First level detection means for detecting a fundamental level that is an amplitude level of the fundamental frequency and a harmonic level that is an amplitude level of the harmonic frequency for a signal obtained by computation in the computation means;
Adjusting means for adjusting the fundamental level or the harmonic level so that the fundamental level becomes higher than the harmonic level when the harmonic level is higher than the fundamental level;
A sound processing apparatus comprising:

The sound processing apparatus according to claim 1, wherein the adjustment unit is an amplification unit that amplifies the fundamental tone level to a level higher than the harmonic level.

The sound processing apparatus according to claim 1, wherein the adjustment unit is an attenuation unit that attenuates the harmonic level to a level lower than the fundamental level.

The acoustic characteristic includes a sound pressure frequency characteristic for each of a plurality of listening positions, and the computing means, the first level detecting means, and the adjusting means operate independently for each of the plurality of listening positions. The sound processing device according to claim 1, wherein the sound processing device is a sound processing device.

The acoustic signal further includes second level detection means for detecting the fundamental level and the harmonic level,
The adjustment means has a harmonic level detected by the second level detection means lower than the fundamental level detected by the second level detection means, and the harmonic level detected by the first level detection means. When the wave level is higher than the fundamental level detected by the first level detection means, the fundamental level detected by the second level detection means and the harmonic level detected by the second level detection means The first level detection unit is configured to attenuate the harmonic level detected by the first level detection unit or to amplify the fundamental level detected by the first level detection unit so as to approach the first level detection unit. The acoustic processing apparatus according to claim 1, wherein the detected harmonic level is adjusted to be lower than the fundamental level detected by the first level detecting means.

The sound processing apparatus according to claim 1, wherein the sound signal is a multi-channel sound signal, and the input unit selects and inputs sound characteristics for each channel.

The sound processing apparatus according to claim 1, wherein the acoustic characteristics include a sound pressure frequency characteristic of a speaker and a sound pressure frequency characteristic of a listening position.

The acoustic processing according to any one of claims 1 to 7, wherein the acoustic characteristic is a sound pressure frequency characteristic of an acoustic path followed by the acoustic signal from the acoustic processing device to a listening ear. apparatus.

A calculating step in which the calculating means calculates a fundamental frequency and a harmonic frequency of the input acoustic signal;
An input step in which the input means inputs an acoustic characteristic in at least a part of the acoustic path;
A computing step in which computing means performs a convolution computation between the acoustic signal and the acoustic characteristics;
A level detecting step for detecting a fundamental level, which is an amplitude level of the fundamental frequency, and a harmonic level, which is an amplitude level of the harmonic frequency, for a signal obtained by computation in the computation step; ,
An adjusting step in which, when the harmonic level is higher than the fundamental level, an adjustment means adjusts the fundamental level or the harmonic level or both so that the fundamental level becomes higher than the harmonic level; ,
A sound processing method comprising:

The program for functioning a computer as each means which the sound processing apparatus of any one of Claims 1 thru | or 8 has.