JP2022011890A

JP2022011890A - Noise update circuit

Info

Publication number: JP2022011890A
Application number: JP2020113298A
Authority: JP
Inventors: 康二郎今里; Kojiro Imazato
Original assignee: Japan Radio Co Ltd
Current assignee: Japan Radio Co Ltd
Priority date: 2020-06-30
Filing date: 2020-06-30
Publication date: 2022-01-17

Abstract

To provide a noise update circuit that estimates noise components accurately, and is used in a noise reduction circuit.SOLUTION: A noise update circuit has: a first update unit 111 that calculates a post-update noise spectrum when a frame to be processed is a frame including only noise components; and a second update unit 112 that calculates a post-update noise spectrum when the frame to be processed is a frame including voice components.EFFECT: Even when the frames including voice components are continued one after another as the frames to be processed, fluctuation of noise can be followed, and noise components can be accurately estimated.SELECTED DRAWING: Figure 2

Description

この発明は、ノイズ更新回路に関し、例えば、高周波信号を送受信する無線機に組み込まれるノイズリダクション回路に用いられ得るノイズ更新回路に関する。 The present invention relates to a noise update circuit, for example, a noise update circuit that can be used in a noise reduction circuit incorporated in a radio that transmits and receives high frequency signals.

音声信号に含まれる雑音成分を抑圧する手法としてスペクトル減算法（Ｓpectral Ｓubtraction）が知られている（例えば、特許文献１、非特許文献１参照）。 A spectral subtraction method is known as a method for suppressing a noise component contained in an audio signal (see, for example, Patent Document 1 and Non-Patent Document 1).

特開平４－２３８３９９号公報Japanese Unexamined Patent Publication No. 4-238399

Ｐ．Ｓcalart and Ｊ．Ｖieira Ｆilho「Ｓpeech Ｅnhancement Ｂased on a Ｐriori Ｓignal to Ｎoise Ｅstimation」，ＩＥＥＥＩnternational Ｃonference on．Ａcoustics，Ｓpeech，Ｓignal Ｐrocessing，Ａtlanta，ＧＡ，ＵＳＡ，ｖｏｌ．２，ｐｐ．６２９－６３２，１９９６年P. Scalart and J. Vieira Filho "Speech Enhancement Based on a Priori Signal to Noise Estimation", IEEE International Conference on. Acoustics, Speech, Signal Progressing, Atlanta, GA, USA, vol. 2, pp. 629-632, 1996

ところで、スペクトル減算法を適切に適用するためには、ノイズ成分を的確に推定して入力される音声信号から減算することが重要である。 By the way, in order to properly apply the spectral subtraction method, it is important to accurately estimate the noise component and subtract it from the input audio signal.

そこでこの発明は、ノイズ成分を的確に推定することが可能な、ノイズ更新回路を提供することを目的とする。 Therefore, an object of the present invention is to provide a noise update circuit capable of accurately estimating a noise component.

上記課題を解決するために、請求項１に記載の発明は、処理対象のフレームがノイズ成分のみのフレームである場合に、前記処理対象のフレームの振幅スペクトルに該当する信号を入力信号スペクトルＹi(ｆ)として、周波数ｆごとに更新後のノイズスペクトルＮi(ｆ)を平均化を含む処理によって算出する第１の更新部と（但し、ｉ：時系列の順序を表す順序数）、前記処理対象のフレームが音声成分を含むフレームである場合に、周波数ｆごとに更新後のノイズスペクトルＮi(ｆ)を平均化を含む処理によって算出する第２の更新部と、を有し、前記音声成分を含むフレームである場合に前記更新後のノイズスペクトルＮi(ｆ)を算出する際の平均化時間が、前記ノイズ成分のみのフレームである場合に前記更新後のノイズスペクトルＮi(ｆ)を算出する際の平均化時間よりも長い、ことを特徴とするノイズ更新回路である。 In order to solve the above problem, in the invention according to claim 1, when the frame to be processed is a frame having only a noise component, a signal corresponding to the amplitude spectrum of the frame to be processed is input to the input signal spectrum Yi ( As f), the first update unit that calculates the updated noise spectrum Ni (f) for each frequency f by a process including averaging (however, i: the number of orders representing the time series order), and the processing target. When the frame of is a frame including a voice component, the frame has a second update unit for calculating the updated noise spectrum Ni (f) for each frequency f by a process including averaging, and the voice component is used. When calculating the updated noise spectrum Ni (f) when the frame includes the frame, the averaging time when calculating the updated noise spectrum Ni (f) is when calculating the updated noise spectrum Ni (f) when the frame contains only the noise component. It is a noise update circuit characterized by having a longer time than the averaging time of.

請求項２に記載の発明は、処理対象のフレームがノイズ成分のみのフレームである場合に、前記処理対象のフレームの振幅スペクトルに該当する信号を入力信号スペクトルＹi(ｆ)として、周波数ｆごとに更新後のノイズスペクトルＮi(ｆ)を平均化を含む処理によって算出する第１の更新部と（但し、ｉ：時系列の順序を表す順序数）、前記処理対象のフレームが音声成分を含むフレームである場合に、周波数ｆごとに更新後のノイズスペクトルＮi(ｆ)を平均化を含む処理によって算出する第２の更新部と、を有し、前記音声成分を含むフレームである場合に前記更新後のノイズスペクトルＮi(ｆ)を算出する際の、前記入力信号スペクトルＹi(ｆ)と過去の入力信号スペクトルＹi-j(ｆ)と過去のノイズスペクトルＮi-k(ｆ)とを用いた平均の平均化時間が（但し、ｊ：時系列における順序数ｉとの隔たりの程度を表す１以上の整数、ｋ：時系列における順序数ｉとの隔たりの程度を表す１以上の整数）、前記ノイズ成分のみのフレームである場合に前記更新後のノイズスペクトルＮi(ｆ)を算出する際の、前記入力信号スペクトルＹi(ｆ)と前記過去の入力信号スペクトルＹi-j(ｆ)と前記過去のノイズスペクトルＮi-k(ｆ)とを用いた平均の平均化時間よりも長い、ことを特徴とするノイズ更新回路である。 According to the second aspect of the present invention, when the frame to be processed is a frame containing only noise components, the signal corresponding to the amplitude spectrum of the frame to be processed is set as the input signal spectrum Yi (f) for each frequency f. The first update unit that calculates the updated noise spectrum Ni (f) by processing including averaging (however, i: the number of orders representing the time series order), and the frame to be processed is a frame containing an audio component. In the case where the frame includes a second update unit for calculating the updated noise spectrum Ni (f) for each frequency f by a process including averaging, and the frame includes the voice component, the update is performed. An average of the input signal spectrum Yi (f), the past input signal spectrum Yi-j (f), and the past noise spectrum Ni-k (f) when calculating the later noise spectrum Ni (f). (However, j: an integer of 1 or more representing the degree of deviation from the order number i in the time series, k: an integer of 1 or more representing the degree of deviation from the order number i in the time series), said. The input signal spectrum Yi (f), the past input signal spectrum Yi-j (f), and the past when calculating the updated noise spectrum Ni (f) in the case of a frame containing only noise components. It is a noise update circuit characterized in that it is longer than the average averaging time using the noise spectrum Ni-k (f).

請求項３に記載の発明は、処理対象のフレームがノイズ成分のみのフレームである場合に、前記処理対象のフレームの振幅スペクトルに該当する信号を入力信号スペクトルＹi(ｆ)として、周波数ｆごとに更新後のノイズスペクトルＮi(ｆ)を平均化を含む処理によって算出する第１の更新部と（但し、ｉ：時系列の順序を表す順序数）、前記処理対象のフレームが音声成分を含むフレームである場合に、周波数ｆごとに更新後のノイズスペクトルＮi(ｆ)を平均化を含む処理によって算出する第２の更新部と、を有し、前記音声成分を含むフレームである場合に前記更新後のノイズスペクトルＮi(ｆ)を算出する際の、前記入力信号スペクトルＹi(ｆ)と過去のノイズスペクトルＮi-k(ｆ)とを用いた平均の平均化時間が（但し、ｋ：時系列における順序数ｉとの隔たりの程度を表す１以上の整数）、前記ノイズ成分のみのフレームである場合に前記更新後のノイズスペクトルＮi(ｆ)を算出する際の、前記入力信号スペクトルＹi(ｆ)と前記過去のノイズスペクトルＮi-k(ｆ)とを用いた平均の平均化時間よりも長い、ことを特徴とするノイズ更新回路である。 According to the third aspect of the present invention, when the frame to be processed is a frame containing only noise components, the signal corresponding to the amplitude spectrum of the frame to be processed is set as the input signal spectrum Yi (f) for each frequency f. The first update unit that calculates the updated noise spectrum Ni (f) by processing including averaging (however, i: the number of orders representing the time-series order), and the frame to be processed is a frame containing an audio component. In the case where the frame includes a second update unit for calculating the updated noise spectrum Ni (f) for each frequency f by a process including averaging, and the frame includes the voice component, the update is performed. The average averaging time using the input signal spectrum Yi (f) and the past noise spectrum Ni-k (f) when calculating the later noise spectrum Ni (f) (however, k: time series). The input signal spectrum Yi (f) when calculating the updated noise spectrum Ni (f) in the case of a frame containing only the noise component (an integer of 1 or more representing the degree of deviation from the order number i in ) And the past noise spectrum Ni-k (f), which is longer than the average averaging time.

請求項４に記載の発明は、処理対象のフレームがノイズ成分のみのフレームである場合に、前記処理対象のフレームの振幅スペクトルに該当する信号を入力信号スペクトルＹi(ｆ)として、周波数ｆごとに更新後のノイズスペクトルＮi(ｆ)を平均化を含む処理によって算出する第１の更新部と（但し、ｉ：時系列の順序を表す順序数）、前記処理対象のフレームが音声成分を含むフレームである場合に、周波数ｆごとに更新後のノイズスペクトルＮi(ｆ)を平均化を含む処理によって算出する第２の更新部と、を有し、前記音声成分を含むフレームである場合に前記更新後のノイズスペクトルＮi(ｆ)を算出する際の、前記入力信号スペクトルＹi(ｆ)と過去の入力信号スペクトルＹi-j(ｆ)とを用いた平均の平均化時間が（但し、ｊ：時系列における順序数ｉとの隔たりの程度を表す１以上の整数）、前記ノイズ成分のみのフレームである場合に前記更新後のノイズスペクトルＮi(ｆ)を算出する際の、前記入力信号スペクトルＹi(ｆ)と前記過去の入力信号スペクトルＹi-j(ｆ)とを用いた平均の平均化時間よりも長い、ことを特徴とするノイズ更新回路である。 According to the fourth aspect of the present invention, when the frame to be processed is a frame containing only noise components, the signal corresponding to the amplitude spectrum of the frame to be processed is set as the input signal spectrum Yi (f) for each frequency f. The first update unit that calculates the updated noise spectrum Ni (f) by processing including averaging (however, i: the number of orders representing the time-series order), and the frame to be processed is a frame containing an audio component. In the case where the frame includes a second update unit for calculating the updated noise spectrum Ni (f) for each frequency f by a process including averaging, and the frame includes the voice component, the update is performed. The average averaging time using the input signal spectrum Yi (f) and the past input signal spectrum Yi-j (f) when calculating the later noise spectrum Ni (f) (however, j: hour). An integer of 1 or more representing the degree of deviation from the order number i in the series), the input signal spectrum Yi (in the case of a frame containing only the noise component) when calculating the updated noise spectrum Ni (f). It is a noise update circuit characterized in that it is longer than the average averaging time using f) and the past input signal spectrum Yi-j (f).

請求項５に記載の発明は、請求項１から４のうちのいずれか１項に記載のノイズ更新回路おいて、前記音声成分を含むフレームである場合に前記更新後のノイズスペクトルＮi(ｆ)を算出する際の前記平均化時間が、１～１０秒の範囲のうちのいずれかの値である、ことを特徴とする。 The invention according to claim 5 is the noise spectrum Ni (f) after the update in the case of the frame containing the voice component in the noise update circuit according to any one of claims 1 to 4. Is characterized in that the averaging time is a value in the range of 1 to 10 seconds.

請求項６に記載の発明は、処理対象のフレームがノイズ成分のみのフレームである場合に、前記処理対象のフレームの振幅スペクトルに該当する信号を入力信号スペクトルＹi(ｆ)として、ＩＩＲフィルタである以下の数式１もしくはＦＩＲフィルタである以下の数式２に従って周波数ｆごとに更新後のノイズスペクトルＮi(ｆ)を算出する第１の更新部と（但し、Ｎi-1(ｆ)：更新の１フレーム前のノイズスペクトル、Ｙi-j(ｆ)：更新のｊフレーム前の入力信号スペクトル、Ｋn：処理対象のフレームがノイズ成分のみのフレームである場合のＹi(ｆ)に対するＮi-1(ｆ)の重みづけを決定づける定数、ｉ：時系列の順序を表す順序数、ｊ：時系列における順序数ｉとの隔たりの程度を表す０以上の整数）、

前記処理対象のフレームが音声成分を含むフレームである場合に、ＩＩＲフィルタである以下の数式３もしくはＦＩＲフィルタである以下の数式４に従って周波数ｆごとに更新後のノイズスペクトルＮi(ｆ)を算出する第２の更新部と（但し、Ｋs：処理対象のフレームが音声成分を含むフレームである場合のＹi(ｆ)に対するＮi-1(ｆ)の重みづけを決定づける定数、Ｋn＜Ｋs）、

を有する、ことを特徴とするノイズ更新回路である。 The invention according to claim 6 is an IIR filter in which a signal corresponding to the amplitude spectrum of the frame to be processed is used as an input signal spectrum Yi (f) when the frame to be processed is a frame having only a noise component. The first updater for calculating the updated noise spectrum Ni (f) for each frequency f according to the following formula 1 or the following formula 2 which is an FIR filter (however, Ni-1 (f): one frame of update). Previous noise spectrum, Yi-j (f): Input signal spectrum before j-frame update, Kn: Ni-1 (f) with respect to Yi (f) when the frame to be processed is a frame containing only noise components. Constants that determine the weighting, i: the number of orders that represent the order of the time series, j: an integer greater than or equal to 0 that represents the degree of deviation from the number i of the order in the time series),

When the frame to be processed is a frame containing an audio component, the updated noise spectrum Ni (f) is calculated for each frequency f according to the following formula 3 which is an IIR filter or the following formula 4 which is an FIR filter. The second updater and (however, Ks: a constant that determines the weighting of Ni-1 (f) with respect to Yi (f) when the frame to be processed is a frame containing an audio component, Kn <Ks),.

It is a noise update circuit characterized by having.

請求項７に記載の発明は、請求項６に記載のノイズ更新回路において、前記定数Ｋsが、ＩＩＲフィルタの時定数もしくはＦＩＲフィルタの平均区間の１～１０秒に相当する範囲のうちのいずれかの値に設定される、ことを特徴とする。 In the invention according to claim 7, in the noise update circuit according to claim 6, the constant Ks is either the time constant of the IIR filter or the range corresponding to 1 to 10 seconds of the average interval of the FIR filter. It is characterized in that it is set to the value of.

請求項８に記載の発明は、請求項１から７のうちのいずれか１項に記載のノイズ更新回路において、前記入力信号スペクトルＹi(ｆ)について、以下の数式５に従って平均スペクトルレベルＭを算出する平均値算出部と（但し、ｆ1：振幅スペクトルにおける最小の周波数、ｆ2：振幅スペクトルにおける最大の周波数、Ｆn：最小の周波数ｆ1から最大の周波数ｆ2までの範囲における周波数の個数）、

前記入力信号スペクトルＹi(ｆ)と前記平均スペクトルレベルＭとに関してＹi(ｆ)≧Ｔe×Ｍである周波数ｆについて、以下の数式６に従って更新後のノイズスペクトルＮi(ｆ)を決定する第３の更新部と（但し、Ｔe：係数、Ｎi-1(ｆ)：更新の１フレーム前のノイズスペクトル）、

をさらに有する、ことを特徴とする。 The invention according to claim 8 calculates an average spectrum level M for the input signal spectrum Yi (f) according to the following formula 5 in the noise update circuit according to any one of claims 1 to 7. (However, f1: minimum frequency in the amplitude spectrum, f2: maximum frequency in the amplitude spectrum, Fn: number of frequencies in the range from the minimum frequency f1 to the maximum frequency f2).

Regarding the frequency f in which Yi (f) ≧ Te × M with respect to the input signal spectrum Yi (f) and the average spectrum level M, the third noise spectrum Ni (f) after updating is determined according to the following formula 6. The update part and (however, Te: coefficient, Ni-1 (f): noise spectrum one frame before the update),

It is characterized by having further.

請求項９に記載の発明は、請求項８に記載のノイズ更新回路において、前記係数Ｔeが、１～１００の範囲のうちのいずれかの値に設定される、ことを特徴とする。 The invention according to claim 9 is characterized in that, in the noise update circuit according to claim 8, the coefficient Te is set to any value in the range of 1 to 100.

請求項１ないし請求項４に記載の発明によれば、処理対象のフレームとして音声成分を含むフレームが続いた場合であってもノイズの変動に追従することができ、ノイズ成分を的確に推定することが可能となる。具体的には、スペクトル減算法を実現する従来の回路では、処理対象のフレームがノイズ成分のみのフレームである場合にはノイズスペクトルを更新する一方で音声成分を含むフレームである場合にはノイズスペクトルを更新しないようにしているので、処理対象のフレームとして音声成分を含むフレームが続くとノイズスペクトルが更新されないためにノイズの変動に的確に追従することができず、結果的にノイズ成分を的確に推定することができない、という問題がある。これに対して、請求項１ないし請求項４に記載の発明では、処理対象のフレームが音声成分を含むフレームである場合もノイズスペクトルを更新するようにしているので、処理対象のフレームとして音声成分を含むフレームが続いた場合であってもノイズの変動に追従することができ、ノイズ成分を的確に推定することが可能となる。 According to the first to fourth aspects of the invention, it is possible to follow the fluctuation of noise even when a frame containing an audio component continues as a frame to be processed, and the noise component can be accurately estimated. It becomes possible. Specifically, in the conventional circuit that realizes the spectrum subtraction method, the noise spectrum is updated when the frame to be processed is a frame containing only a noise component, while the noise spectrum is updated when the frame contains a voice component. Since the noise spectrum is not updated when a frame containing an audio component continues as a frame to be processed, it is not possible to accurately follow the noise fluctuation, and as a result, the noise component is accurately updated. There is a problem that it cannot be estimated. On the other hand, in the invention according to claim 1 to 4, the noise spectrum is updated even when the frame to be processed is a frame containing an audio component, so that the audio component is used as the frame to be processed. Even when the frame containing the noise continues, it is possible to follow the fluctuation of the noise, and it is possible to accurately estimate the noise component.

請求項５に記載の発明によれば、処理対象のフレームが音声成分を含むフレームである場合に更新後のノイズスペクトルＮi(ｆ)を算出する際の平均化時間が適切な値に設定されるので、処理対象のフレームとして音声成分を含むフレームが続いた場合におけるノイズの変動に一層確実に追従することができ、ノイズ成分を一層確実に的確に推定することが可能となる。 According to the fifth aspect of the present invention, when the frame to be processed is a frame containing an audio component, the averaging time for calculating the updated noise spectrum Ni (f) is set to an appropriate value. Therefore, it is possible to more reliably follow the fluctuation of noise when a frame containing an audio component continues as the frame to be processed, and it is possible to estimate the noise component more reliably and accurately.

請求項６に記載の発明によれば、処理対象のフレームとして音声成分を含むフレームが続いた場合であってもノイズの変動に追従することができ、ノイズ成分を的確に推定することが可能となる。具体的には、スペクトル減算法を実現する従来の回路では、処理対象のフレームがノイズ成分のみのフレームである場合にはノイズスペクトルを更新する一方で音声成分を含むフレームである場合にはノイズスペクトルを更新しないようにしているので、処理対象のフレームとして音声成分を含むフレームが続くとノイズスペクトルが更新されないためにノイズの変動に的確に追従することができず、結果的にノイズ成分を的確に推定することができない、という問題がある。これに対して、請求項６に記載の発明では、処理対象のフレームが音声成分を含むフレームである場合もノイズスペクトルを更新するようにしているので、処理対象のフレームとして音声成分を含むフレームが続いた場合であってもノイズの変動に追従することができ、ノイズ成分を的確に推定することが可能となる。 According to the invention of claim 6, even when a frame containing an audio component continues as a frame to be processed, it is possible to follow the fluctuation of noise, and it is possible to accurately estimate the noise component. Become. Specifically, in the conventional circuit that realizes the spectrum subtraction method, the noise spectrum is updated when the frame to be processed is a frame containing only a noise component, while the noise spectrum is updated when the frame contains a voice component. Since the noise spectrum is not updated when a frame containing an audio component continues as a frame to be processed, it is not possible to accurately follow the noise fluctuation, and as a result, the noise component is accurately updated. There is a problem that it cannot be estimated. On the other hand, in the invention according to claim 6, since the noise spectrum is updated even when the frame to be processed is a frame containing an audio component, the frame including the audio component is used as the frame to be processed. Even if it continues, it is possible to follow the fluctuation of noise, and it is possible to accurately estimate the noise component.

請求項７に記載の発明によれば、処理対象のフレームが音声成分を含むフレームである場合の入力信号スペクトルＹi(ｆ)に対する更新の１フレーム前のノイズスペクトルＮi-1(ｆ)の重みづけを決定づける定数Ｋsが適切な値に設定されるので、処理対象のフレームとして音声成分を含むフレームが続いた場合におけるノイズの変動に一層確実に追従することができ、ノイズ成分を一層確実に的確に推定することが可能となる。 According to the invention of claim 7, when the frame to be processed is a frame containing an audio component, the noise spectrum Ni-1 (f) one frame before the update is weighted with respect to the input signal spectrum Yi (f). Since the constant Ks that determines It is possible to estimate.

請求項８に記載の発明によれば、トーン信号を抑圧しないようすることが可能となる。具体的には、スペクトル減算法を実現する従来の回路では、周波数スペクトルにおいて定常的に存在する成分をノイズと判断して抑圧するようにしているので、トーン信号も定常性があるためにノイズと判断されて抑圧の対象になり、ユーザにとって必要なトーン信号（例えば、モールス信号）も抑圧されてしまう、という問題がある。これに対して、請求項８に記載の発明では、平均スペクトルレベルよりも振幅スペクトルが著しく大きい周波数成分をノイズスペクトルの更新から除外するようにしているので、トーン信号を抑圧しないようにすることが可能となる。 According to the invention of claim 8, it is possible to prevent the tone signal from being suppressed. Specifically, in the conventional circuit that realizes the spectrum subtraction method, the component that is constantly present in the frequency spectrum is judged to be noise and suppressed, so that the tone signal is also noise because it is stationary. There is a problem that it is judged and becomes a target of suppression, and a tone signal (for example, a Morse signal) necessary for the user is also suppressed. On the other hand, in the invention according to claim 8, since the frequency component whose amplitude spectrum is significantly larger than the average spectrum level is excluded from the update of the noise spectrum, it is possible not to suppress the tone signal. It will be possible.

請求項９に記載の発明によれば、係数Ｔeが適切な値に設定されるので、平均スペクトルレベルよりも振幅スペクトルが著しく大きい周波数成分をノイズスペクトルの更新から一層確実に除外することができ、トーン信号を一層確実に抑圧しないようすることが可能となる。 According to the invention of claim 9, since the coefficient Te is set to an appropriate value, a frequency component whose amplitude spectrum is significantly larger than the average spectrum level can be more reliably excluded from the update of the noise spectrum. It is possible to prevent the tone signal from being suppressed more reliably.

この発明の実施の形態に係るノイズ更新回路を含むノイズリダクション回路の概略構成を示す機能ブロック図である。It is a functional block diagram which shows the schematic structure of the noise reduction circuit including the noise update circuit which concerns on embodiment of this invention. 実施の形態１に係るノイズ更新回路の概略構成を示す機能ブロック図である。It is a functional block diagram which shows the schematic structure of the noise update circuit which concerns on Embodiment 1. FIG. 実施の形態２に係るノイズ更新回路の概略構成を示す機能ブロック図である。It is a functional block diagram which shows the schematic structure of the noise update circuit which concerns on Embodiment 2. FIG.

以下、この発明を図示の実施の形態に基づいて説明する。 Hereinafter, the present invention will be described based on the illustrated embodiment.

（実施の形態１）
図１は、この発明の実施の形態に係るノイズ更新回路１１を含むノイズリダクション回路１の概略構成を示す機能ブロック図である。図２は、実施の形態１に係るノイズ更新回路１１の概略構成を示す機能ブロック図である。 (Embodiment 1)
FIG. 1 is a functional block diagram showing a schematic configuration of a noise reduction circuit 1 including a noise update circuit 11 according to an embodiment of the present invention. FIG. 2 is a functional block diagram showing a schematic configuration of the noise update circuit 11 according to the first embodiment.

ノイズリダクション回路１は、例えば、高周波信号を送受信する無線機に組み込まれて、音声信号に含まれる雑音成分を抑圧する手法であるスペクトル減算法（Ｓpectral Ｓubtraction）を実現する回路であり、主として、プリエンファシス回路２と、窓処理部３と、時間周波数変換部４と、変換結果出力部５と、減算部６と、合成部７と、周波数時間変換部８と、ディエンファシス回路９と、音声区間検出部１０と、ノイズ更新回路１１と、を有する。 The noise reduction circuit 1 is, for example, a circuit incorporated in a radio device that transmits and receives high-frequency signals to realize a spectrum subtraction method (Spectral Reduction), which is a method of suppressing noise components contained in an audio signal, and is mainly a pre. Emphasis circuit 2, window processing unit 3, time-frequency conversion unit 4, conversion result output unit 5, subtraction unit 6, synthesis unit 7, frequency-time conversion unit 8, de-emphasis circuit 9, and audio section. It has a detection unit 10 and a noise update circuit 11.

プリエンファシス（Ｐre-Ｅmphasis：ＰＥ）回路２は、アンテナから受信した高周波信号を復調した音声信号に対して高周波成分の相対強度を予め増幅する高域強調処理を施して、高域強調処理後の信号を出力する。 The pre-emphasis (PE) circuit 2 performs high-frequency enhancement processing for pre-amplifying the relative intensity of high-frequency components on an audio signal demodulated from a high-frequency signal received from an antenna, and after high-frequency enhancement processing. Output a signal.

窓処理部３は、プリエンファシス回路２から出力される高域強調処理後の信号の入力を受け、入力された前記信号から所定の時間長さのフレームを抽出する（例えば、１２．５ｍｓごとに２５ｍｓ分の時間波形を抽出する）とともに、各フレームに対して例えばハニング窓などの窓関数を乗じて窓処理を施す。窓処理部３は、各フレームに対して窓処理を施すたびに、窓処理後のフレームを出力する。 The window processing unit 3 receives the input of the signal after the high frequency enhancement processing output from the pre-emphasis circuit 2, and extracts a frame having a predetermined time length from the input signal (for example, every 12.5 ms). A time waveform for 25 ms is extracted), and each frame is subjected to window processing by multiplying it by a window function such as a Hanning window. The window processing unit 3 outputs the frame after the window processing each time the window processing is performed on each frame.

時間周波数変換部４は、窓処理部３から出力される窓処理後のフレームの入力を受け、前記フレームの入力を受けるたびに、前記フレームに対して時間領域の信号から周波数領域の信号への変換処理を施し、複数の周波数それぞれについての振幅成分と位相成分とを含む周波数スペクトルを計算して、実数と虚数との周波数スペクトルの信号を出力する。時間周波数変換部４は、例えば離散フーリエ変換（Ｄiscrete Ｆourier Ｔransform)や高速フーリエ変換（Ｆast Ｆourier Ｔransform）により、時間周波数変換を実行して周波数スペクトルを計算する。 The time-frequency conversion unit 4 receives the input of the window-processed frame output from the window processing unit 3, and each time the input of the frame is received, the time-frequency conversion unit 4 changes the signal in the time domain to the signal in the frequency domain with respect to the frame. The conversion process is performed, a frequency spectrum including an amplitude component and a phase component for each of a plurality of frequencies is calculated, and a signal of the frequency spectrum of a real number and an imaginary number is output. The time-frequency transform unit 4 executes a time-frequency transform and calculates a frequency spectrum by, for example, a discrete Fourier transform or a fast Fourier transform.

変換結果出力部５は、時間周波数変換部４から出力されるフレームごとの（例えば、１２．５ｍｓ程度の間隔で）周波数スペクトルの信号の入力を受け、フレームごとに、入力された前記周波数スペクトルのうちの各周波数の振幅成分を含む振幅スペクトルに該当する信号を減算部６に対して出力するとともに、入力された前記周波数スペクトルのうちの各周波数の位相成分を含む位相スペクトルに該当する信号を合成部７に対して出力する。 The conversion result output unit 5 receives the input of the signal of the frequency spectrum for each frame (for example, at intervals of about 12.5 ms) output from the time-frequency conversion unit 4, and for each frame, the input of the frequency spectrum. The signal corresponding to the amplitude spectrum including the amplitude component of each frequency is output to the subtraction unit 6, and the signal corresponding to the phase spectrum including the phase component of each frequency in the input frequency spectrum is synthesized. Output to unit 7.

減算部６は、変換結果出力部５から出力されるフレームごとの振幅スペクトルに該当する信号の入力を受けるとともに、ノイズ更新回路１１から出力されるフレームごとの更新後のノイズスペクトルに該当する信号の入力を受け、各フレームについて、入力された前記振幅スペクトルに該当する信号から、周波数ごとに（別言すると、スペクトルごとに）、入力された前記更新後のノイズスペクトルに該当する信号を減算する。これにより、音声信号に含まれる雑音成分が抑圧される。減算部６は、変換結果出力部５から出力されるフレームごとに、減算処理後の振幅スペクトルに該当する信号を出力する。 The subtraction unit 6 receives the input of the signal corresponding to the amplitude spectrum for each frame output from the conversion result output unit 5, and the signal corresponding to the updated noise spectrum for each frame output from the noise update circuit 11. Upon receiving the input, for each frame, the signal corresponding to the input updated noise spectrum is subtracted from the input signal corresponding to the amplitude spectrum for each frequency (in other words, for each spectrum). As a result, the noise component contained in the audio signal is suppressed. The subtraction unit 6 outputs a signal corresponding to the amplitude spectrum after the subtraction process for each frame output from the conversion result output unit 5.

合成部７は、変換結果出力部５から出力されるフレームごとの位相スペクトルに該当する信号の入力を受けるとともに、減算部６から出力されるフレームごとの減算処理後の振幅スペクトルに該当する信号の入力を受け、フレームごとに、入力された前記位相スペクトルに該当する信号と前記振幅スペクトルに該当する信号とを合成して周波数スペクトルを生成して、実数と虚数との周波数スペクトルの信号を出力する。 The synthesizing unit 7 receives the input of the signal corresponding to the phase spectrum for each frame output from the conversion result output unit 5, and the signal corresponding to the amplitude spectrum after the subtraction processing for each frame output from the subtraction unit 6. Upon receiving an input, for each frame, the input signal corresponding to the phase spectrum and the signal corresponding to the amplitude spectrum are combined to generate a frequency spectrum, and a signal having a frequency spectrum of a real number and an imaginary number is output. ..

周波数時間変換部８は、合成部７から出力されるフレームごとの周波数スペクトルの信号の入力を受け、フレームごとに、入力された前記周波数スペクトルの信号に対して周波数領域の信号から時間領域の信号への変換処理、すなわち時間周波数変換部４における変換処理の逆変換処理を施して、音声信号を出力する。周波数時間変換部８は、例えば逆離散フーリエ変換や逆高速フーリエ変換により、周波数時間変換を実行して音声信号を生成する。 The frequency-time conversion unit 8 receives the input of the frequency spectrum signal for each frame output from the synthesis unit 7, and for each frame, the signal in the frequency region to the signal in the time region with respect to the input signal of the frequency spectrum. The audio signal is output by performing the conversion process to, that is, the reverse conversion process of the conversion process in the time-frequency conversion unit 4. The frequency-time conversion unit 8 executes frequency-time conversion by, for example, an inverse discrete Fourier transform or an inverse fast Fourier transform to generate an audio signal.

ディエンファシス（Ｄe－Ｅmphasis：ＤＥ）回路９は、周波数時間変換部８から出力される音声信号の入力を受け、入力された前記音声信号に対して高周波成分の相対強度を減衰させる高域減衰処理、すなわちプリエンファシス回路２の逆フィルタによる減衰処理を施して、高域減衰処理後の音声信号を出力する。 The De-Emphasis (DE) circuit 9 receives the input of the audio signal output from the frequency time conversion unit 8 and attenuates the relative intensity of the high frequency component with respect to the input audio signal. That is, the attenuation processing by the inverse filter of the pre-emphasis circuit 2 is performed, and the audio signal after the high frequency attenuation processing is output.

音声区間検出部１０は、変換結果出力部５から出力されて分岐されるフレームごとの振幅スペクトルに該当する信号の入力を受け、フレームごとに、入力された前記振幅スペクトルに該当する信号について、ノイズ成分のみのフレームであるのか、音声成分を含むフレームであるのか、の判定を行う。 The voice section detection unit 10 receives the input of the signal corresponding to the amplitude spectrum for each frame output from the conversion result output unit 5 and branched, and the noise of the input signal corresponding to the amplitude spectrum for each frame. It is determined whether the frame contains only components or the frame contains audio components.

音声区間検出部１０における、処理対象のフレームがノイズ成分のみであるのか音声成分を含むのかの判定の仕法は、特定の手順や手法に限定されるものではなく、従来もしくは新規の手順や手法の中から適当な手順や手法が適宜選択され得る。 The method of determining whether the frame to be processed contains only a noise component or a voice component in the voice section detection unit 10 is not limited to a specific procedure or method, and is not limited to a specific procedure or method, but is a conventional or new procedure or method. An appropriate procedure or method can be appropriately selected from the above.

音声区間検出部１０における、処理対象のフレームがノイズ成分のみであるのか音声成分を含むのかの判定の仕法として、例えば、音声の非恒常性に着目して、振幅スペクトルの周波数別の振幅の大きさに関する平均や分散の値が直近のフレームにおいて複数回（例えば、３～５回程度）連続して所定の閾値未満であるときは処理対象のフレームはノイズ成分のみであると判定し、前記以外のときは処理対象のフレームには音声成分があると判定する手法、あるいは、振幅スペクトルの周波数別の振幅の大きさに関する平均や分散の値が所定の閾値未満であるときは処理対象のフレームはノイズ成分のみであると判定し、前記平均や分散の値が前記閾値以上であるときは処理対象のフレームには音声成分があると判定する手法などが用いられ得る。 As a method of determining whether the frame to be processed contains only a noise component or a voice component in the voice section detection unit 10, for example, focusing on the non-constancy of the voice, the magnitude of the amplitude of the amplitude spectrum for each frequency is large. When the average or dispersion value related to the amplitude is less than a predetermined threshold multiple times (for example, about 3 to 5 times) in a row in the latest frame, it is determined that the frame to be processed is only a noise component, and other than the above. In the case of, the method of determining that the frame to be processed has an audio component, or when the value of the average or dispersion regarding the magnitude of the amplitude of each frequency of the amplitude spectrum is less than a predetermined threshold value, the frame to be processed is A method may be used in which it is determined that only the noise component is present, and when the average or dispersion value is equal to or greater than the threshold value, it is determined that the frame to be processed has an audio component.

音声区間検出部１０は、処理対象のフレームはノイズ成分のみであると判定した場合にはノイズフレーム信号を出力し、また、処理対象のフレームには音声成分があると判定した場合には音声フレーム信号を出力する。音声区間検出部１０は、フレームごとに、音声区間検出結果としてノイズフレーム信号または音声フレーム信号を出力する。 The voice section detection unit 10 outputs a noise frame signal when it is determined that the frame to be processed has only a noise component, and when it is determined that the frame to be processed has a voice component, the voice frame is output. Output a signal. The voice section detection unit 10 outputs a noise frame signal or a voice frame signal as a voice section detection result for each frame.

そして、実施の形態に係るノイズ更新回路１１は、処理対象のフレームがノイズ成分のみのフレームである場合に、処理対象のフレームの振幅スペクトルに該当する信号を入力信号スペクトルＹi(ｆ)として、周波数ｆごとに更新後のノイズスペクトルＮi(ｆ)を平均化を含む処理によって算出する第１の更新部と（但し、ｉ：時系列の順序を表す順序数）、処理対象のフレームが音声成分を含むフレームである場合に、周波数ｆごとに更新後のノイズスペクトルＮi(ｆ)を平均化を含む処理によって算出する第２の更新部と、を有し、音声成分を含むフレームである場合に更新後のノイズスペクトルＮi(ｆ)を算出する際の、入力信号スペクトルＹi(ｆ)の平均化時間が、ノイズ成分のみのフレームである場合に更新後のノイズスペクトルＮi(ｆ)を算出する際の、入力信号スペクトルＹi(ｆ)の平均化時間よりも長い、ようにしている。 Then, in the noise update circuit 11 according to the embodiment, when the frame to be processed is a frame containing only noise components, the signal corresponding to the amplitude spectrum of the frame to be processed is set as the input signal spectrum Yi (f) and has a frequency. The first update part that calculates the noise spectrum Ni (f) after updating for each f by a process including averaging (however, i: the number of orders representing the order in the time series), and the frame to be processed contains the audio component. In the case of a frame including, it has a second update part for calculating the updated noise spectrum Ni (f) for each frequency f by a process including averaging, and is updated in the case of a frame containing an audio component. When calculating the updated noise spectrum Ni (f) when the averaging time of the input signal spectrum Yi (f) is a frame containing only noise components when calculating the later noise spectrum Ni (f). , It is longer than the averaging time of the input signal spectrum Yi (f).

ノイズ更新回路１１は、過去に計算された周波数ごとの雑音成分を表すノイズスペクトルに、現フレーム（別言すると、処理対象のフレーム、最新のフレーム）の振幅スペクトルを加味することにより、最新のノイズスペクトルに更新するものであり、第１の更新部１１１と、第２の更新部１１２と、を有する。 The noise update circuit 11 adds the amplitude spectrum of the current frame (in other words, the frame to be processed, the latest frame) to the noise spectrum representing the noise component for each frequency calculated in the past, so as to obtain the latest noise. It updates to the spectrum and has a first updating unit 111 and a second updating unit 112.

ノイズ更新回路１１は、変換結果出力部５から出力されて分岐されるフレームごとの振幅スペクトルに該当する信号の入力を受けるとともに、音声区間検出部１０から出力されるフレームごとの音声区間検出結果の入力を受け、入力された前記振幅スペクトルに該当する信号を用いて、周波数ｆごとに、更新の１フレーム前のノイズスペクトルＮi-1(ｆ)を更新するものとして、更新後のノイズスペクトルＮi(ｆ)を、入力された前記音声区間検出結果の内容に応じて下記の数式７もしくは数式８または数式９もしくは数式１０に従って算出する。なお、以降の数式における添字ｉは、時系列の順序を表す順序数であり、すべての数式に共通して適用される順序を表す。また、以降の数式における添字ｊは、時系列における順序数ｉとの隔たりの程度を表す変数であり、０以上の整数である。 The noise update circuit 11 receives the input of the signal corresponding to the amplitude spectrum for each frame output from the conversion result output unit 5 and branched, and the noise section detection result for each frame output from the voice section detection unit 10. The noise spectrum Ni (f) after the update is assumed to update the noise spectrum Ni-1 (f) one frame before the update for each frequency f by receiving the input and using the signal corresponding to the input amplitude spectrum. f) is calculated according to the following formula 7 or formula 8 or formula 9 or formula 10 according to the content of the input voice section detection result. The subscript i in the following mathematical formulas is an ordinal number representing the order of the time series, and represents the order that is commonly applied to all the mathematical formulas. Further, the subscript j in the following mathematical expressions is a variable representing the degree of deviation from the ordinal number i in the time series, and is an integer of 0 or more.

ノイズ更新回路１１へと入力された前記音声区間検出結果がノイズフレーム信号である場合には、第１の更新部１１１が、入力された前記振幅スペクトルに該当する信号を入力信号スペクトルＹi(ｆ)として、ＩＩＲ（Ｉnfinite Ｉmpulse Ｒesponse の略；無限インパルス応答）フィルタである以下の数式７もしくはＦＩＲ（Ｆinite Ｉmpulse Ｒesponse の略；有限インパルス応答）フィルタである以下の数式８に従って周波数ｆごとに更新後のノイズスペクトルＮi(ｆ)を算出する。なお、以降の数式における、Ｎi-1(ｆ)は更新の１フレーム前のノイズスペクトルを表し、Ｙi-j(ｆ)は更新のｊフレーム前の入力信号スペクトルを表す。

When the voice section detection result input to the noise update circuit 11 is a noise frame signal, the first update unit 111 inputs a signal corresponding to the input amplitude spectrum to the input signal spectrum Yi (f). The noise after updating for each frequency f according to the following formula 7 which is an IIR (abbreviation of Infinite Impulse Response; infinite impulse response) filter or the following formula 8 which is an FIR (abbreviation of Finite Impulse Response; finite impulse response) filter. The spectrum Ni (f) is calculated. In the following equations, Ni-1 (f) represents the noise spectrum one frame before the update, and Yi-j (f) represents the input signal spectrum one frame before the update.

数式７や数式８におけるＫnは、処理対象のフレーム（別言すると、現フレーム、最新のフレーム）がノイズ成分のみのフレームである場合の、前記処理対象のフレームの振幅スペクトルである入力信号スペクトルＹi(ｆ)に対する更新の１フレーム前のノイズスペクトルＮi-1(ｆ)の重みづけを決定づける定数であり、「ノイズ時更新前重み定数Ｋn」と呼ぶ。 Kn in Equation 7 and Equation 8 is an input signal spectrum Yi which is an amplitude spectrum of the frame to be processed when the frame to be processed (in other words, the current frame and the latest frame) is a frame containing only a noise component. It is a constant that determines the weighting of the noise spectrum Ni-1 (f) one frame before the update with respect to (f), and is called "the weight constant Kn before the update at the time of noise".

ノイズ時更新前重み定数Ｋnは、０以上の整数であれば特定の値には限定されない。ノイズ時更新前重み定数Ｋnは、具体的には例えば、ＩＩＲフィルタの時定数もしくはＦＩＲフィルタの平均区間の０．０６～０．２０秒程度に相当する範囲（例えば、フレーム間隔１２．５ｍｓにおいてＫn＝５～１６程度の範囲）のうちのいずれかの値に設定されることが考えられ、特にＩＩＲフィルタの時定数もしくはＦＩＲフィルタの平均区間の０．１秒程度に相当する値（例えば、フレーム間隔１２．５ｍｓにおいてＫn＝８程度）に設定されることが考えられる。なお、ＩＩＲフィルタの時定数もしくはＦＩＲフィルタの平均区間を例えば０．１秒としたとき、数式７における定数Ｋnの具体的な値と数式８における定数Ｋnの具体的な値とは異なる。 The weight constant Kn before update at the time of noise is not limited to a specific value as long as it is an integer of 0 or more. Specifically, the weight constant Kn before updating during noise is Kn in a range corresponding to, for example, the time constant of the IIR filter or the average interval of the FIR filter of about 0.06 to 0.20 seconds (for example, at a frame interval of 12.5 ms). It is conceivable that it is set to any value in the range of about 5 to 16), and in particular, a value corresponding to about 0.1 seconds of the time constant of the IIR filter or the average interval of the FIR filter (for example, frame). It is conceivable that Kn = 8) is set at an interval of 12.5 ms. When the time constant of the IIR filter or the average interval of the FIR filter is, for example, 0.1 seconds, the specific value of the constant Kn in Equation 7 and the specific value of the constant Kn in Equation 8 are different.

ノイズ更新回路１１へと入力された前記音声区間検出結果が音声フレーム信号である場合には、第２の更新部１１２が、入力された前記振幅スペクトルに該当する信号を入力信号スペクトルＹi(ｆ)として、ＩＩＲフィルタである以下の数式９もしくはＦＩＲフィルタである以下の数式１０に従って周波数ｆごとに更新後のノイズスペクトルＮi(ｆ)を算出する。

When the voice section detection result input to the noise update circuit 11 is a voice frame signal, the second update unit 112 inputs a signal corresponding to the input amplitude spectrum to the input signal spectrum Yi (f). The updated noise spectrum Ni (f) is calculated for each frequency f according to the following formula 9 which is an IIR filter or the following formula 10 which is an FIR filter.

数式９や数式１０におけるＫsは、処理対象のフレーム（別言すると、現フレーム、最新のフレーム）が音声成分を含むフレームである場合の、前記処理対象のフレームの振幅スペクトルである入力信号スペクトルＹi(ｆ)に対する更新の１フレーム前のノイズスペクトルＮi-1(ｆ)の重みづけを決定づける定数であり、「音声時更新前重み定数Ｋs」と呼ぶ。 Ks in Equation 9 and Equation 10 is an input signal spectrum Yi which is an amplitude spectrum of the frame to be processed when the frame to be processed (in other words, the current frame and the latest frame) is a frame containing an audio component. It is a constant that determines the weighting of the noise spectrum Ni-1 (f) one frame before the update with respect to (f), and is called "the weight constant Ks before the update during voice".

音声時更新前重み定数Ｋsは、ノイズ時更新前重み定数Ｋnよりも大きく、好ましくは十分に大きい整数であれば、特定の値には限定されない。音声時更新前重み定数Ｋsは、具体的には例えば、ＩＩＲフィルタの時定数もしくはＦＩＲフィルタの平均区間の１～１０秒程度に相当する範囲（例えば、フレーム間隔１２．５ｍｓにおいてＫs＝８０～８００程度の範囲）のうちのいずれかの値、さらに特定するとＩＩＲフィルタの時定数もしくはＦＩＲフィルタの平均区間の１～６秒程度に相当する範囲（例えば、フレーム間隔１２．５ｍｓにおいてＫs＝８０～５００程度の範囲）のうちのいずれかの値に設定されることが考えられ、特にＩＩＲフィルタの時定数もしくはＦＩＲフィルタの平均区間の３．７秒程度に相当する値（例えば、フレーム間隔１２．５ｍｓにおいてＫs＝２９５程度）に設定されることが考えられる。なお、ＩＩＲフィルタの時定数もしくはＦＩＲフィルタの平均区間を例えば３．７秒としたとき、数式９における定数Ｋsの具体的な値と数式１０における定数Ｋsの具体的な値とは異なる。 The weight constant Ks before update during voice is not limited to a specific value as long as it is an integer larger than the weight constant Kn before update during noise, preferably sufficiently large. Specifically, the weight constant Ks before update during voice is, for example, a range corresponding to about 1 to 10 seconds of the time constant of the IIR filter or the average interval of the FIR filter (for example, Ks = 80 to 800 at a frame interval of 12.5 ms). Any value in the range of degrees), more specifically, the time constant of the IIR filter or the range corresponding to about 1 to 6 seconds of the average interval of the FIR filter (for example, Ks = 80 to 500 at a frame interval of 12.5 ms). It is conceivable that it is set to any value within the range of degree), and in particular, a value corresponding to about 3.7 seconds of the time constant of the IIR filter or the average interval of the FIR filter (for example, the frame interval of 12.5 ms). It is conceivable that Ks = about 295). When the time constant of the IIR filter or the average interval of the FIR filter is, for example, 3.7 seconds, the specific value of the constant Ks in Equation 9 and the specific value of the constant Ks in Equation 10 are different.

音声時更新前重み定数Ｋsが十分に大きい値に設定されることにより、音声には非恒常性があるため、ノイズスペクトルの更新において音声成分が大きく反映されることが回避され、したがって、音声成分が雑音成分として抑圧されることがない。 By setting the pre-update weighting constant Ks to a sufficiently large value during audio, the audio has non-homeostasis, which prevents the audio component from being significantly reflected in the noise spectrum update, and therefore the audio component. Is not suppressed as a noise component.

なお、更新後のノイズスペクトルＮi(ｆ)を算出するための平均化処理として、ＩＩＲフィルタ（一般形はＮi(ｆ)＝ΣＡjＹi-j(ｆ)＋ΣＢkＮi-k(ｆ)；但し、Ａ，Ｂは係数、ｉは時系列の順序を表す順序数、ｊは時系列における順序数ｉとの隔たりの程度を表す０以上の整数、ｋは時系列における順序数ｉとの隔たりの程度を表す１以上の整数）を用いる場合は、処理対象のフレームがノイズ成分のみのフレームである場合に、処理対象のフレームの振幅スペクトルに該当する信号を入力信号スペクトルＹi(ｆ)として、周波数ｆごとに更新後のノイズスペクトルＮi(ｆ)を平均化を含む処理によって算出する第１の更新部１１１と、処理対象のフレームが音声成分を含むフレームである場合に、周波数ｆごとに更新後のノイズスペクトルＮi(ｆ)を平均化を含む処理によって算出する第２の更新部１１２と、を有し、音声成分を含むフレームである場合に更新後のノイズスペクトルＮi(ｆ)を算出する際の、入力信号スペクトルＹi(ｆ)と過去の入力信号スペクトルＹi-j(ｆ)と過去のノイズスペクトルＮi-k(ｆ)とを用いた平均の平均化時間が、ノイズ成分のみのフレームである場合に更新後のノイズスペクトルＮi(ｆ)を算出する際の、入力信号スペクトルＹi(ｆ)と過去の入力信号スペクトルＹi-j(ｆ)と過去のノイズスペクトルＮi-k(ｆ)とを用いた平均の平均化時間よりも長い、場合に該当する。 As an averaging process for calculating the updated noise spectrum Ni (f), an IIR filter (general form is Ni (f) = ΣAjYi-j (f) + ΣBkNi-k (f); however, A and B Is a coefficient, i is an order number representing the order of the time series, j is an integer of 0 or more representing the degree of deviation from the order number i in the time series, and k is 1 indicating the degree of deviation from the order number i in the time series. When the above integer) is used, when the frame to be processed is a frame containing only noise components, the signal corresponding to the amplitude spectrum of the frame to be processed is used as the input signal spectrum Yi (f) and updated for each frequency f. The first update unit 111 that calculates the subsequent noise spectrum Ni (f) by processing including averaging, and the noise spectrum Ni after updating for each frequency f when the frame to be processed is a frame containing an audio component. An input signal for calculating the updated noise spectrum Ni (f) in the case of a frame containing an audio component and having a second update unit 112 for calculating (f) by a process including averaging. After updating when the average averaging time using the spectrum Yi (f), the past input signal spectrum Yi-j (f), and the past noise spectrum Ni-k (f) is a frame containing only noise components. Average of the average of the input signal spectrum Yi (f), the past input signal spectrum Yi-j (f), and the past noise spectrum Ni-k (f) when calculating the noise spectrum Ni (f) of Applicable when it is longer than the conversion time.

また、上記のＩＩＲフィルタの一般形についてｊ≧１においてＡj＝０としたＩＩＲフィルタを用いる場合は、処理対象のフレームがノイズ成分のみのフレームである場合に、処理対象のフレームの振幅スペクトルに該当する信号を入力信号スペクトルＹi(ｆ)として、周波数ｆごとに更新後のノイズスペクトルＮi(ｆ)を平均化を含む処理によって算出する第１の更新部１１１と、処理対象のフレームが音声成分を含むフレームである場合に、周波数ｆごとに更新後のノイズスペクトルＮi(ｆ)を平均化を含む処理によって算出する第２の更新部１１２と、を有し、音声成分を含むフレームである場合に更新後のノイズスペクトルＮi(ｆ)を算出する際の、入力信号スペクトルＹi(ｆ)と過去のノイズスペクトルＮi-k(ｆ)とを用いた平均の平均化時間が、ノイズ成分のみのフレームである場合に更新後のノイズスペクトルＮi(ｆ)を算出する際の、入力信号スペクトルＹi(ｆ)と過去のノイズスペクトルＮi-k(ｆ)とを用いた平均の平均化時間よりも長い、場合に該当する。 Further, when using the IIR filter in which Aj = 0 in j ≧ 1 for the general form of the above IIR filter, it corresponds to the amplitude spectrum of the frame to be processed when the frame to be processed is a frame containing only noise components. The signal to be processed is the input signal spectrum Yi (f), and the first update unit 111, which calculates the updated noise spectrum Ni (f) for each frequency f by a process including averaging, and the frame to be processed have audio components. When the frame includes a second update unit 112 which calculates the updated noise spectrum Ni (f) for each frequency f by a process including averaging, and includes a voice component. The average averaging time using the input signal spectrum Yi (f) and the past noise spectrum Ni-k (f) when calculating the updated noise spectrum Ni (f) is in the frame of only the noise component. In some cases, it is longer than the average averaging time using the input signal spectrum Yi (f) and the past noise spectrum Ni-k (f) when calculating the updated noise spectrum Ni (f). Corresponds to.

なお、上記のＩＩＲフィルタの一般形についてｊ≧１においてＡj＝０としたうえで、Ａ0＝１／(１＋Ｋn)，Ｂ1＝Ｋn／(１＋Ｋn)，且つＢk＝０（ｋ≧２）である場合が上記の数式７に該当し、Ａ0＝１／(１＋Ｋs)，Ｂ1＝Ｋs／(１＋Ｋs)，且つＢk＝０（ｋ≧２）である場合が上記の数式９に該当する。 Regarding the general form of the above IIR filter, when Aj = 0 in j ≧ 1, A0 = 1 / (1 + Kn), B1 = Kn / (1 + Kn), and Bk = 0 (k ≧ 2). Corresponds to the above formula 7, and the case where A0 = 1 / (1 + Ks), B1 = Ks / (1 + Ks), and Bk = 0 (k ≧ 2) corresponds to the above formula 9.

さらに、更新後のノイズスペクトルＮi(ｆ)を算出するための平均化処理として、ＦＩＲフィルタ（一般形はＮi(ｆ)＝ΣＡjＹi-j(ｆ)；但し、Ａは係数、ｉは時系列の順序を表す順序数、ｊは時系列における順序数ｉとの隔たりの程度を表す０以上の整数；即ち、上記のＩＩＲフィルタの一般形についてＢk＝０としたもの）を用いる場合は、処理対象のフレームがノイズ成分のみのフレームである場合に、処理対象のフレームの振幅スペクトルに該当する信号を入力信号スペクトルＹi(ｆ)として、周波数ｆごとに更新後のノイズスペクトルＮi(ｆ)を平均化を含む処理によって算出する第１の更新部１１１と、処理対象のフレームが音声成分を含むフレームである場合に、周波数ｆごとに更新後のノイズスペクトルＮi(ｆ)を平均化を含む処理によって算出する第２の更新部１１２と、を有し、音声成分を含むフレームである場合に更新後のノイズスペクトルＮi(ｆ)を算出する際の、入力信号スペクトルＹi(ｆ)と過去の入力信号スペクトルＹi-j(ｆ)とを用いた平均の平均化時間が、ノイズ成分のみのフレームである場合に更新後のノイズスペクトルＮi(ｆ)を算出する際の、入力信号スペクトルＹi(ｆ)と過去の入力信号スペクトルＹi-j(ｆ)とを用いた平均の平均化時間よりも長い、場合に該当する。 Further, as an averaging process for calculating the updated noise spectrum Ni (f), an FIR filter (general form is Ni (f) = ΣAjYi-j (f); where A is a coefficient and i is a time series. When using an order number indicating the order, j is an integer of 0 or more indicating the degree of deviation from the order number i in the time series; that is, Bk = 0 for the general form of the above IIR filter), the processing target is used. When the frame of is a frame containing only noise components, the signal corresponding to the amplitude spectrum of the frame to be processed is set as the input signal spectrum Yi (f), and the updated noise spectrum Ni (f) is averaged for each frequency f. When the frame to be processed is a frame containing an audio component, the first update unit 111 calculated by the process including The input signal spectrum Yi (f) and the past input signal spectrum when calculating the updated noise spectrum Ni (f) in the case of a frame containing an audio component and having a second updating unit 112. The input signal spectrum Yi (f) and the past when calculating the updated noise spectrum Ni (f) when the average averaging time using Yi-j (f) is a frame containing only noise components. It corresponds to the case where it is longer than the average averaging time using the input signal spectrum Yi-j (f) of.

なお、上記のＦＩＲフィルタの一般形について、Ａj＝１／Ｋnである場合が上記の数式８に該当し、Ａj＝１／Ｋsである場合が上記の数式１０に該当する。 Regarding the general form of the above FIR filter, the case where Aj = 1 / Kn corresponds to the above formula 8 and the case where Aj = 1 / Ks corresponds to the above formula 10.

ノイズ更新回路１１は、フレームごとの振幅スペクトルに該当する信号および音声区間検出結果の入力を受けるたびに、更新後の、周波数ｆごとのノイズスペクトルＮi(ｆ)に該当する信号を減算部６に対して出力する。減算部６は、フレームごとに、ノイズ更新回路１１から出力される前記更新後のノイズスペクトルＮi(ｆ)に該当する信号を用いて、変換結果出力部５から出力される振幅スペクトルに該当する信号から前記更新後のノイズスペクトルＮi(ｆ)に該当する信号を減算する処理を行う。 Each time the noise update circuit 11 receives the signal corresponding to the amplitude spectrum for each frame and the input of the voice interval detection result, the noise update circuit 11 inputs the updated signal corresponding to the noise spectrum Ni (f) for each frequency f to the subtraction unit 6. Output to. The subtraction unit 6 uses the signal corresponding to the updated noise spectrum Ni (f) output from the noise update circuit 11 for each frame, and the signal corresponding to the amplitude spectrum output from the conversion result output unit 5. Is processed to subtract the signal corresponding to the updated noise spectrum Ni (f).

上記のようなノイズ更新回路１１によれば、処理対象のフレームとして音声成分を含むフレームが続いた場合であってもノイズの変動に追従することができ、ノイズ成分を的確に推定することが可能となる。具体的には、スペクトル減算法を実現する従来の回路では、処理対象のフレームがノイズ成分のみのフレームである場合にはノイズスペクトルを更新する一方で音声成分を含むフレームである場合にはノイズスペクトルを更新しないようにしているので、処理対象のフレームとして音声成分を含むフレームが続くとノイズスペクトルが更新されないためにノイズの変動に的確に追従することができず、結果的にノイズ成分を的確に推定することができない、という問題がある。これに対して、上記のようなノイズ更新回路１１では、処理対象のフレームが音声成分を含むフレームである場合もノイズスペクトルを更新するようにしているので、処理対象のフレームとして音声成分を含むフレームが続いた場合であってもノイズの変動に追従することができ、ノイズ成分を的確に推定することが可能となる。 According to the noise update circuit 11 as described above, it is possible to follow the fluctuation of noise even when a frame containing an audio component continues as a frame to be processed, and it is possible to accurately estimate the noise component. It becomes. Specifically, in the conventional circuit that realizes the spectrum subtraction method, the noise spectrum is updated when the frame to be processed is a frame containing only a noise component, while the noise spectrum is updated when the frame contains a voice component. Since the noise spectrum is not updated when a frame containing an audio component continues as a frame to be processed, it is not possible to accurately follow the noise fluctuation, and as a result, the noise component is accurately updated. There is a problem that it cannot be estimated. On the other hand, in the noise update circuit 11 as described above, since the noise spectrum is updated even when the frame to be processed is a frame containing an audio component, the frame including the audio component as the frame to be processed is used. It is possible to follow the fluctuation of noise even when the noise component continues, and it is possible to accurately estimate the noise component.

（実施の形態２）
図３は、この発明の実施の形態２に係るノイズ更新回路１１の概略構成を示す機能ブロック図である。 (Embodiment 2)
FIG. 3 is a functional block diagram showing a schematic configuration of the noise update circuit 11 according to the second embodiment of the present invention.

この実施の形態ではノイズ更新回路１１が上記の実施の形態１の構成と比べて追加の構成を有する点で実施の形態１と異なる一方で、共通する構成や処理の内容もあり、実施の形態１と同等の構成や処理の内容については同一符号を付することでその説明を省略する。 This embodiment is different from the first embodiment in that the noise update circuit 11 has an additional configuration as compared with the configuration of the first embodiment, but there are also common configurations and processing contents, and the embodiment is implemented. The same reference numerals are given to the configurations and processing contents equivalent to those in No. 1, and the description thereof will be omitted.

この実施の形態に係るノイズ更新回路１１は、入力信号スペクトルＹi(ｆ)について平均スペクトルレベルＭを算出する平均値算出部１１３と、入力信号スペクトルＹi(ｆ)と平均スペクトルレベルＭとに関してＹi(ｆ)≧Ｔe×Ｍである周波数ｆについて更新後のノイズスペクトルＮi(ｆ)を決定する第３の更新部１１４と、をさらに有する、ようにしている。 The noise update circuit 11 according to this embodiment has an average value calculation unit 113 that calculates an average spectrum level M for the input signal spectrum Yi (f), and Yi (for the input signal spectrum Yi (f) and the average spectrum level M. f) Further has a third update unit 114 for determining the updated noise spectrum Ni (f) for the frequency f for which ≧ Te × M.

ノイズ更新回路１１は、変換結果出力部５から出力されて分岐されるフレームごとの振幅スペクトルに該当する信号の入力を受けるとともに、音声区間検出部１０から出力されるフレームごとの音声区間検出結果の入力を受け、入力された前記振幅スペクトルに該当する信号を用いて、周波数ｆごとに、更新の１フレーム前のノイズスペクトルＮi-1(ｆ)を更新するものとして、更新後のノイズスペクトルＮi(ｆ)を、入力された前記音声区間検出結果の内容などに応じて上記の数式７もしくは数式８、数式９もしくは数式１０、ならびに数式１２のうちのいずれかに従って算出する。 The noise update circuit 11 receives the input of the signal corresponding to the amplitude spectrum for each frame output from the conversion result output unit 5 and branched, and the noise section detection result for each frame output from the voice section detection unit 10. The noise spectrum Ni (f) after the update is assumed to update the noise spectrum Ni-1 (f) one frame before the update for each frequency f by receiving the input and using the signal corresponding to the input amplitude spectrum. f) is calculated according to any one of the above-mentioned formula 7 or formula 8, formula 9 or formula 10, and formula 12 according to the content of the input voice section detection result and the like.

平均値算出部１１３は、変換結果出力部５から出力されて分岐されるフレームごとの振幅スペクトルに該当する信号を入力信号スペクトルＹi(ｆ)として、以下の数式１１に従って平均スペクトルレベルＭを算出する。平均スペクトルレベルＭは、すなわち、振幅スペクトルの周波数軸方向における平均値である。

The mean value calculation unit 113 calculates the mean spectrum level M according to the following equation 11 with the signal corresponding to the amplitude spectrum for each frame output from the conversion result output unit 5 and branched as the input signal spectrum Yi (f). .. The average spectrum level M is, that is, the average value of the amplitude spectrum in the frequency axis direction.

数式１１における、ｆは振幅スペクトルにおける周波数を表し、ｆ1は振幅スペクトルにおける最小の周波数であり、ｆ2は振幅スペクトルにおける最大の周波数であり、さらに、Ｆnは最小の周波数ｆ1から最大の周波数ｆ2までの範囲における周波数の個数を表す。周波数の個数Ｆnは、すなわち、時間周波数変換部４での時間周波数変換において１フレームに含まれるサンプル点数をｎとすると、ｎ／２＋１である。 In Equation 11, f represents the frequency in the amplitude spectrum, f1 is the minimum frequency in the amplitude spectrum, f2 is the maximum frequency in the amplitude spectrum, and Fn is from the minimum frequency f1 to the maximum frequency f2. Represents the number of frequencies in the range. The number of frequencies Fn is n / 2 + 1, that is, where n is the number of sample points included in one frame in the time-frequency conversion in the time-frequency conversion unit 4.

平均値算出部１１３は、フレームごとの振幅スペクトルに該当する信号の入力を受けるたびに、平均スペクトルレベルＭを算出する。 The mean value calculation unit 113 calculates the mean spectrum level M each time a signal corresponding to the amplitude spectrum for each frame is input.

そのうえで、ノイズ更新回路１１へと入力された音声区間検出結果がノイズフレーム信号である場合には、入力された振幅スペクトルに該当する信号を入力信号スペクトルＹi(ｆ)として、周波数ｆごとに、前記入力信号スペクトルＹi(ｆ)と平均スペクトルレベルＭとの間の関係に応じて下記の〈ア〉または〈イ〉の処理が行われる。 Then, when the voice section detection result input to the noise update circuit 11 is a noise frame signal, the signal corresponding to the input amplitude spectrum is set as the input signal spectrum Yi (f), and the above is described for each frequency f. The following processing of <a> or <b> is performed according to the relationship between the input signal spectrum Yi (f) and the average spectrum level M.

〈ア〉Ｙi(ｆ)＜Ｔe×Ｍである周波数ｆについて
第１の更新部１１１が、上記の数式７もしくは数式８に従って更新後のノイズスペクトルＮi(ｆ)を算出する。 <A> For the frequency f where Yi (f) <Te × M, the first updating unit 111 calculates the updated noise spectrum Ni (f) according to the above formula 7 or formula 8.

入力信号スペクトルＹi(ｆ)と平均スペクトルレベルＭとの間の関係におけるＴeは、平均スペクトルレベルＭよりも振幅スペクトルが著しく大きい周波数成分をノイズスペクトルの更新から除外するための係数である。係数Ｔeは、特定の値に限定されるものではなく、例えばトーン信号に相当する周波数成分がノイズスペクトルの更新から除外されるようにしてトーン信号が減算部６で減算されず雑音成分として抑圧されないようにすることが考慮されるなどしたうえで、適当な値に適宜設定される。係数Ｔeは、具体的には、１～１００程度の範囲のうちのいずれかの値に設定されることが考えられ、特に１０程度に設定されることが考えられる。 Te in the relationship between the input signal spectrum Yi (f) and the average spectrum level M is a coefficient for excluding frequency components whose amplitude spectrum is significantly larger than the average spectrum level M from updating the noise spectrum. The coefficient Te is not limited to a specific value. For example, the tone signal is not subtracted by the subtractor 6 and is not suppressed as a noise component so that the frequency component corresponding to the tone signal is excluded from the update of the noise spectrum. It is set to an appropriate value after considering the above. Specifically, the coefficient Te may be set to any value in the range of about 1 to 100, and in particular, it may be set to about 10.

〈イ〉Ｙi(ｆ)≧Ｔe×Ｍである周波数ｆについて
第３の更新部１１４が、以下の数式１２に従って更新後のノイズスペクトルＮi(ｆ)を決定する。

<A> With respect to the frequency f in which Yi (f) ≧ Te × M, the third update unit 114 determines the updated noise spectrum Ni (f) according to the following equation 12.

数式１２におけるＮi-1(ｆ)は、更新の１フレーム前のノイズスペクトルを表す。 Ni-1 (f) in Equation 12 represents the noise spectrum one frame before the update.

また、ノイズ更新回路１１へと入力された音声区間検出結果が音声フレーム信号である場合には、入力された振幅スペクトルに該当する信号を入力信号スペクトルＹi(ｆ)として、周波数ｆごとに、前記入力信号スペクトルＹi(ｆ)と平均スペクトルレベルＭとの間の関係に応じて下記の〈ウ〉または〈エ〉の処理が行われる。 When the voice section detection result input to the noise update circuit 11 is a voice frame signal, the signal corresponding to the input amplitude spectrum is set as the input signal spectrum Yi (f), and the above is described for each frequency f. The following processing <c> or <d> is performed according to the relationship between the input signal spectrum Yi (f) and the average spectrum level M.

〈ウ〉Ｙi(ｆ)＜Ｔe×Ｍである周波数ｆについて
第２の更新部１１２が、上記の数式９もしくは数式１０に従って更新後のノイズスペクトルＮi(ｆ)を算出する。 <C> For the frequency f where Yi (f) <Te × M, the second updating unit 112 calculates the updated noise spectrum Ni (f) according to the above formula 9 or formula 10.

〈エ〉Ｙi(ｆ)≧Ｔe×Ｍである周波数ｆについて
第３の更新部１１４が、上記の数式１２に従って更新後のノイズスペクトルＮi(ｆ)を決定する。 <D> With respect to the frequency f in which Yi (f) ≧ Te × M, the third updating unit 114 determines the updated noise spectrum Ni (f) according to the above equation 12.

上記の処理では、すなわち、Ｙi(ｆ)≧Ｔe×Ｍであって平均スペクトルレベルＭよりも振幅スペクトルが著しく大きい周波数成分については、当該の更新の後のノイズスペクトルＮi(ｆ)を更新の１フレーム前のノイズスペクトルＮi-1(ｆ)のままとして更新から除外する。 In the above processing, that is, for a frequency component in which Yi (f) ≧ Te × M and the amplitude spectrum is significantly larger than the average spectrum level M, the noise spectrum Ni (f) after the update is updated to 1. The noise spectrum before the frame is Ni-1 (f) and is excluded from the update.

上記のようなノイズ更新回路１１によれば、トーン信号を抑圧しないようすることが可能となる。具体的には、スペクトル減算法を実現する従来の回路では、周波数スペクトルにおいて定常的に存在する成分をノイズと判断して抑圧するようにしているので、トーン信号も定常性があるためにノイズと判断されて抑圧の対象になり、ユーザにとって必要なトーン信号（例えば、モールス信号）も抑圧されてしまう、という問題がある。これに対して、上記のようなノイズ更新回路１１では、平均スペクトルレベルよりも振幅スペクトルが著しく大きい周波数成分をノイズスペクトルの更新から除外するようにしているので、トーン信号を抑圧しないようにすることが可能となる。 According to the noise update circuit 11 as described above, it is possible not to suppress the tone signal. Specifically, in the conventional circuit that realizes the spectrum subtraction method, the component that is constantly present in the frequency spectrum is judged to be noise and suppressed, so that the tone signal is also noise because it is stationary. There is a problem that it is judged and becomes a target of suppression, and a tone signal (for example, a Morse signal) necessary for the user is also suppressed. On the other hand, in the noise update circuit 11 as described above, since the frequency component whose amplitude spectrum is significantly larger than the average spectrum level is excluded from the update of the noise spectrum, the tone signal should not be suppressed. Is possible.

なお、実施の形態２に係るノイズ更新回路１１については、その時々の運用に応じてユーザが平均値算出部１１３および第３の更新部１１４を機能させるか否かを選択するというモードの選択に合わせてノイズスペクトルの更新の仕方が調整されて雑音成分の抑圧の仕方が調整されるようにしてもよい。例えば、ユーザが、雑音成分を抑圧しながら主に音声信号を送受信する場合には平均値算出部１１３および第３の更新部１１４を機能させないモードを選択してトーン信号を抑圧し、一方、モールス信号を送受信する場合には平均値算出部１１３および第３の更新部１１４を機能させるモードを選択してトーン信号を抑圧しないようすることが考えられる。 Regarding the noise update circuit 11 according to the second embodiment, the mode is selected in which the user selects whether or not to operate the average value calculation unit 113 and the third update unit 114 according to the operation at that time. At the same time, the method of updating the noise spectrum may be adjusted so that the method of suppressing the noise component may be adjusted. For example, when the user mainly transmits and receives an audio signal while suppressing a noise component, the user selects a mode in which the mean value calculation unit 113 and the third update unit 114 do not function to suppress the tone signal, while suppressing the tone signal. When transmitting and receiving signals, it is conceivable to select a mode in which the mean value calculation unit 113 and the third update unit 114 function so as not to suppress the tone signal.

以上、この発明の実施の形態について説明したが、具体的な構成は、上記の実施の形態に限られるものではなく、この発明の要旨を逸脱しない範囲の設計の変更等があっても、この発明に含まれる。例えば、上記の実施の形態では図１に概略構成を示すノイズリダクション回路１に対してこの発明に係るノイズ更新回路１１が適用される場合を例に挙げて説明しているが、この発明が適用され得るノイズリダクション回路の構成は図１に示す例には限定されない。さらに言えば、この発明が適用され得る回路は、ノイズリダクション回路には限定されない。すなわち、この発明は、ノイズスペクトルを時系列で更新することが必要とされる種々の回路に対して適用され得る。 Although the embodiment of the present invention has been described above, the specific configuration is not limited to the above-described embodiment, and even if there is a design change or the like within a range that does not deviate from the gist of the present invention. Included in the invention. For example, in the above embodiment, the case where the noise update circuit 11 according to the present invention is applied to the noise reduction circuit 1 whose schematic configuration is shown in FIG. 1 is described as an example, but the present invention is applied. The possible configuration of the noise reduction circuit is not limited to the example shown in FIG. Furthermore, the circuits to which the present invention can be applied are not limited to noise reduction circuits. That is, the present invention can be applied to various circuits in which the noise spectrum needs to be updated in time series.

また、更新後のノイズスペクトルＮi(ｆ)を算出するための平均化を含む処理は、上記の実施の形態におけるノイズ時更新前重み定数Ｋnおよび音声時更新前重み定数Ｋsに相当する係数を用いる方法であればどのような方法であってもよい。すなわち、更新後のノイズスペクトルＮi(ｆ)の算出式は、上記の実施の形態における数式７ないし数式１０には限定されない。 Further, in the process including averaging for calculating the noise spectrum Ni (f) after the update, the coefficients corresponding to the noise constant before update Kn and the voice constant before update Ks in the above embodiment are used. Any method may be used as long as it is a method. That is, the updated noise spectrum Ni (f) calculation formula is not limited to the formula 7 to the formula 10 in the above embodiment.

１ノイズリダクション回路
２プリエンファシス回路
３窓処理部
４時間周波数変換部
５変換結果出力部
６減算部
７合成部
８周波数時間変換部
９ディエンファシス回路
１０音声区間検出部
１１ノイズ更新回路
１１１第１の更新部
１１２第２の更新部
１１３平均値算出部（実施の形態２）
１１４第３の更新部（実施の形態２） 1 Noise reduction circuit 2 Pre-emphasis circuit 3 Window processing unit 4 Time frequency conversion unit 5 Conversion result output unit 6 Subtraction unit 7 Synthesis unit 8 Frequency time conversion unit 9 De-emphasis circuit 10 Voice section detection unit 11 Noise update circuit 111 First Update unit 112 Second update unit 113 Average value calculation unit (Embodiment 2)
114 Third update section (Embodiment 2)

Claims

When the frame to be processed is a frame containing only noise components, the signal corresponding to the amplitude spectrum of the frame to be processed is set as the input signal spectrum Yi (f), and the noise spectrum Ni (f) updated for each frequency f. With the first update part calculated by processing including averaging (however, i: the number of orders representing the order of the time series),
When the frame to be processed is a frame containing an audio component, it has a second update unit for calculating the updated noise spectrum Ni (f) for each frequency f by a process including averaging.
When the average time for calculating the updated noise spectrum Ni (f) is a frame containing only the noise component, the updated noise spectrum Ni (f) is used. Longer than the averaging time when calculating,
A noise update circuit characterized by that.

When the frame to be processed is a frame containing only noise components, the signal corresponding to the amplitude spectrum of the frame to be processed is set as the input signal spectrum Yi (f), and the noise spectrum Ni (f) updated for each frequency f. With the first update part calculated by processing including averaging (however, i: the number of orders representing the order of the time series),
When the frame to be processed is a frame containing an audio component, it has a second update unit for calculating the updated noise spectrum Ni (f) for each frequency f by a process including averaging.
The input signal spectrum Yi (f), the past input signal spectrum Yi-j (f), and the past noise when calculating the updated noise spectrum Ni (f) in the case of a frame containing the voice component. The average averaging time using the spectrum Ni-k (f) (where j: an integer of 1 or more representing the degree of deviation from the order number i in the time series, k: the order number i in the time series). The input signal spectrum Yi (f) and the past input when calculating the updated noise spectrum Ni (f) in the case of a frame containing only the noise component (an integer of 1 or more representing the degree of separation). It is longer than the average averaging time using the signal spectrum Yi-j (f) and the past noise spectrum Ni-k (f).
A noise update circuit characterized by that.

When the frame to be processed is a frame containing only noise components, the signal corresponding to the amplitude spectrum of the frame to be processed is set as the input signal spectrum Yi (f), and the noise spectrum Ni (f) updated for each frequency f. With the first update part calculated by processing including averaging (however, i: the number of orders representing the order of the time series),
When the frame to be processed is a frame containing an audio component, it has a second update unit for calculating the updated noise spectrum Ni (f) for each frequency f by a process including averaging.
An average using the input signal spectrum Yi (f) and the past noise spectrum Ni-k (f) when calculating the updated noise spectrum Ni (f) in the case of a frame containing the voice component. (However, k: an integer of 1 or more representing the degree of deviation from the order number i in the time series), and in the case of a frame containing only the noise component, the updated noise spectrum Ni (f) is obtained. It is longer than the average averaging time using the input signal spectrum Yi (f) and the past noise spectrum Ni-k (f) in the calculation.
A noise update circuit characterized by that.

When the frame to be processed is a frame containing only noise components, the signal corresponding to the amplitude spectrum of the frame to be processed is set as the input signal spectrum Yi (f), and the noise spectrum Ni (f) updated for each frequency f. With the first update part calculated by processing including averaging (however, i: the number of orders representing the order of the time series),
When the frame to be processed is a frame containing an audio component, it has a second update unit for calculating the updated noise spectrum Ni (f) for each frequency f by a process including averaging.
The input signal spectrum Yi (f) and the past input signal spectrum Yi-j (f) used in calculating the updated noise spectrum Ni (f) in the case of a frame containing the voice component are used. The updated noise spectrum Ni (f) when the average averaging time (where j: an integer of 1 or more representing the degree of deviation from the order number i in the time series) and the frame containing only the noise component is used. Is longer than the average averaging time using the input signal spectrum Yi (f) and the past input signal spectrum Yi-j (f) in calculating.
A noise update circuit characterized by that.

In the case of a frame containing the voice component, the averaging time when calculating the updated noise spectrum Ni (f) is any value in the range of 1 to 10 seconds.
The noise update circuit according to any one of claims 1 to 4, wherein the noise update circuit is characterized in that.

When the frame to be processed is a frame containing only noise components, the signal corresponding to the amplitude spectrum of the frame to be processed is set as the input signal spectrum Yi (f), and the following equation 1 or FIR filter which is an IIR filter is used. The first update unit that calculates the updated noise spectrum Ni (f) for each frequency f according to the following formula 2 (however, Ni-1 (f): the noise spectrum one frame before the update, Yi-j ( f): Input signal spectrum before j-frame update, Kn: Constant that determines the weighting of Ni-1 (f) to Yi (f) when the frame to be processed is a frame containing only noise components, i: Hour Order number representing the order of the series, j: an integer of 0 or more representing the degree of deviation from the order number i in the time series),

A noise update circuit characterized by having.

The constant Ks is set to a value in either the time constant of the IIR filter or the range corresponding to 1 to 10 seconds of the average interval of the FIR filter.
The noise update circuit according to claim 6.

For the input signal spectrum Yi (f), an average value calculation unit for calculating the average spectrum level M according to the following formula 5 (however, f1: minimum frequency in the amplitude spectrum, f2: maximum frequency in the amplitude spectrum, Fn: Number of frequencies in the range from the minimum frequency f1 to the maximum frequency f2),

The noise update circuit according to any one of claims 1 to 7, further comprising.

The coefficient Te is set to any value in the range of 1 to 100.
The noise update circuit according to claim 8.